Query 035681
Match_columns 320
No_of_seqs 259 out of 1595
Neff 7.1
Searched_HMMs 46136
Date Fri Mar 29 05:18:36 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/035681.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/035681hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1565 Gelatinase A and relat 100.0 2E-53 4.2E-58 419.7 18.9 226 59-320 29-262 (469)
2 cd04278 ZnMc_MMP Zinc-dependen 100.0 4.3E-36 9.3E-41 258.1 15.3 150 165-319 1-157 (157)
3 PF00413 Peptidase_M10: Matrix 100.0 4.4E-34 9.6E-39 243.4 15.0 147 165-319 1-154 (154)
4 cd04279 ZnMc_MMP_like_1 Zinc-d 99.9 3.2E-27 6.8E-32 202.7 13.7 139 171-319 2-156 (156)
5 cd04268 ZnMc_MMP_like Zinc-dep 99.9 4.5E-24 9.7E-29 183.8 9.1 142 170-318 1-165 (165)
6 smart00235 ZnMc Zinc-dependent 99.9 1.1E-22 2.5E-27 171.1 10.3 133 163-320 2-140 (140)
7 cd04277 ZnMc_serralysin_like Z 99.9 6.3E-21 1.4E-25 168.1 13.8 122 184-319 34-186 (186)
8 cd04327 ZnMc_MMP_like_3 Zinc-d 99.7 6.4E-17 1.4E-21 144.3 13.2 108 165-287 2-109 (198)
9 cd00203 ZnMc Zinc-dependent me 99.7 7.2E-17 1.6E-21 138.9 12.5 119 182-318 20-167 (167)
10 PF12388 Peptidase_M57: Dual-a 99.6 1.5E-14 3.2E-19 128.8 12.6 137 165-318 33-211 (211)
11 cd04276 ZnMc_MMP_like_2 Zinc-d 99.5 1.1E-13 2.5E-18 123.0 12.4 144 170-320 7-197 (197)
12 cd04280 ZnMc_astacin_like Zinc 99.5 1.1E-13 2.5E-18 121.6 11.2 91 171-288 2-92 (180)
13 PF01400 Astacin: Astacin (Pep 99.5 7.5E-14 1.6E-18 123.9 9.2 94 165-288 2-97 (191)
14 cd04281 ZnMc_BMP1_TLD Zinc-dep 99.5 4.4E-13 9.6E-18 119.5 12.5 99 163-288 7-105 (200)
15 PF01471 PG_binding_1: Putativ 99.5 1.8E-13 3.8E-18 97.7 6.6 56 57-121 2-57 (57)
16 cd04282 ZnMc_meprin Zinc-depen 99.4 2E-12 4.3E-17 117.5 12.4 96 163-288 43-138 (230)
17 cd04283 ZnMc_hatching_enzyme Z 99.4 4.6E-12 9.9E-17 111.4 11.0 92 171-288 4-95 (182)
18 KOG3714 Meprin A metalloprotea 99.3 1.4E-11 3E-16 121.4 10.8 100 162-288 78-177 (411)
19 PF02031 Peptidase_M7: Strepto 99.0 1.3E-09 2.9E-14 89.0 7.0 112 184-319 13-130 (132)
20 TIGR02869 spore_SleB spore cor 98.9 2.8E-09 6E-14 95.4 7.1 72 49-130 5-76 (201)
21 COG5549 Predicted Zn-dependent 98.9 2.5E-09 5.3E-14 94.2 6.4 127 184-319 104-233 (236)
22 PRK10594 murein L,D-transpepti 98.3 1.1E-06 2.4E-11 89.5 7.0 84 41-124 225-343 (608)
23 COG3409 Putative peptidoglycan 97.8 2.5E-05 5.3E-10 67.9 5.4 69 49-125 37-106 (185)
24 cd04267 ZnMc_ADAM_like Zinc-de 97.7 0.00022 4.8E-09 62.8 9.6 48 269-316 132-189 (192)
25 PF11350 DUF3152: Protein of u 97.7 0.00017 3.8E-09 63.9 7.9 113 170-289 28-158 (203)
26 PF13583 Reprolysin_4: Metallo 97.6 6E-05 1.3E-09 67.7 3.5 46 271-317 138-197 (206)
27 COG2989 Uncharacterized protei 97.5 0.00014 3.1E-09 72.7 6.2 71 46-123 227-298 (561)
28 COG1913 Predicted Zn-dependent 97.4 6.7E-05 1.4E-09 64.9 2.0 30 270-299 124-153 (181)
29 PF05572 Peptidase_M43: Pregna 97.4 0.00011 2.4E-09 63.1 3.0 23 267-289 66-88 (154)
30 cd04273 ZnMc_ADAMTS_like Zinc- 97.4 0.00051 1.1E-08 61.5 6.9 116 188-316 71-196 (207)
31 PF13574 Reprolysin_2: Metallo 97.3 0.00012 2.5E-09 64.0 1.8 47 270-317 111-170 (173)
32 COG3023 ampD N-acetyl-anhydrom 97.2 0.00071 1.5E-08 62.0 6.2 58 54-122 194-252 (257)
33 PRK13267 archaemetzincin-like 97.2 0.00024 5.1E-09 62.5 2.9 29 270-298 125-153 (179)
34 PF13688 Reprolysin_5: Metallo 97.1 0.00025 5.4E-09 62.6 1.8 23 268-290 140-162 (196)
35 PF07998 Peptidase_M54: Peptid 97.1 0.00032 6.9E-09 62.4 2.5 30 269-299 144-173 (194)
36 PF13582 Reprolysin_3: Metallo 97.1 0.00022 4.9E-09 58.1 1.3 17 271-287 108-124 (124)
37 cd04271 ZnMc_ADAM_fungal Zinc- 96.8 0.0011 2.5E-08 60.4 3.7 45 272-317 147-216 (228)
38 cd04269 ZnMc_adamalysin_II_lik 96.8 0.0016 3.5E-08 57.4 4.4 31 269-299 130-166 (194)
39 cd04275 ZnMc_pappalysin_like Z 96.7 0.00035 7.5E-09 63.7 -0.6 24 268-291 135-158 (225)
40 PF08823 PG_binding_2: Putativ 96.6 0.0056 1.2E-07 46.1 5.3 55 57-120 15-73 (74)
41 COG3409 Putative peptidoglycan 96.5 0.0054 1.2E-07 53.1 5.8 62 53-122 123-184 (185)
42 PF01421 Reprolysin: Reprolysi 96.1 0.009 2E-07 52.9 5.3 48 268-315 129-184 (199)
43 cd04270 ZnMc_TACE_like Zinc-de 96.1 0.0075 1.6E-07 55.6 4.9 46 270-315 167-226 (244)
44 cd04272 ZnMc_salivary_gland_MP 95.8 0.011 2.4E-07 53.3 4.3 47 269-315 144-208 (220)
45 PF05548 Peptidase_M11: Gameto 94.1 0.031 6.7E-07 53.5 2.3 18 271-288 151-168 (314)
46 PF10462 Peptidase_M66: Peptid 93.9 0.027 5.9E-07 53.5 1.6 20 269-288 192-211 (305)
47 PF12044 Metallopep: Putative 93.2 0.052 1.1E-06 53.9 2.3 26 271-298 317-342 (423)
48 PF07172 GRP: Glycine rich pro 92.7 0.11 2.3E-06 41.1 2.9 28 1-29 1-28 (95)
49 PF01457 Peptidase_M8: Leishma 92.6 0.085 1.8E-06 53.9 2.9 60 212-286 159-226 (521)
50 TIGR03296 M6dom_TIGR03296 M6 f 92.1 0.04 8.7E-07 52.0 -0.2 19 271-289 166-184 (286)
51 PF11150 DUF2927: Protein of u 91.2 4.5 9.8E-05 36.6 12.0 148 164-319 28-194 (213)
52 PTZ00337 surface protease GP63 90.3 0.25 5.3E-06 51.0 3.4 36 246-287 211-246 (567)
53 COG3824 Predicted Zn-dependent 89.5 0.16 3.5E-06 41.6 1.1 18 269-286 108-125 (136)
54 PF09374 PG_binding_3: Predict 89.0 0.37 8.1E-06 35.9 2.6 26 96-121 1-30 (72)
55 KOG4525 Jacalin-like lectin do 88.8 0.24 5.2E-06 48.9 1.9 26 271-298 302-327 (614)
56 PF04298 Zn_peptidase_2: Putat 88.2 0.73 1.6E-05 41.9 4.5 18 269-288 88-105 (222)
57 PTZ00257 Glycoprotein GP63 (le 87.8 0.72 1.6E-05 47.9 4.7 38 246-287 236-273 (622)
58 KOG3658 Tumor necrosis factor- 87.3 0.41 8.9E-06 49.6 2.6 46 271-316 393-451 (764)
59 COG2738 Predicted Zn-dependent 85.8 1 2.2E-05 40.1 3.8 17 184-200 19-35 (226)
60 PF06262 DUF1025: Possibl zinc 85.7 0.48 1E-05 37.5 1.7 16 270-285 73-88 (97)
61 PF09471 Peptidase_M64: IgA Pe 85.7 0.44 9.6E-06 44.5 1.7 19 269-287 215-234 (264)
62 COG4783 Putative Zn-dependent 84.0 0.52 1.1E-05 47.2 1.5 19 267-286 127-145 (484)
63 PF05547 Peptidase_M6: Immune 83.3 0.22 4.8E-06 52.0 -1.6 17 271-287 222-238 (645)
64 KOG3607 Meltrins, fertilins an 83.3 0.6 1.3E-05 49.5 1.6 20 269-288 322-341 (716)
65 PF04228 Zn_peptidase: Putativ 77.3 3.4 7.3E-05 39.2 4.4 41 244-285 141-189 (292)
66 PF14247 DUF4344: Domain of un 76.7 1.4 3E-05 40.1 1.5 14 269-282 91-104 (220)
67 PF13398 Peptidase_M50B: Pepti 76.3 1.4 3.1E-05 39.3 1.5 15 269-283 21-35 (200)
68 PF06114 DUF955: Domain of unk 75.9 1.3 2.9E-05 34.6 1.1 27 248-282 28-54 (122)
69 COG5549 Predicted Zn-dependent 74.4 1.8 4E-05 38.9 1.6 30 270-299 153-182 (236)
70 COG2856 Predicted Zn peptidase 74.0 1.5 3.3E-05 39.7 1.0 26 270-297 72-98 (213)
71 PF01435 Peptidase_M48: Peptid 71.5 2.3 5E-05 37.6 1.7 20 268-288 87-106 (226)
72 COG0501 HtpX Zn-dependent prot 68.6 2.7 5.9E-05 39.1 1.5 15 268-282 155-169 (302)
73 PF12725 DUF3810: Protein of u 68.4 4.6 0.0001 38.7 3.1 37 245-289 179-215 (318)
74 PRK03982 heat shock protein Ht 68.3 2.8 6E-05 39.5 1.5 15 268-282 123-137 (288)
75 cd06258 Peptidase_M3_like The 68.2 8.3 0.00018 37.1 4.9 15 268-282 152-166 (365)
76 PRK03001 M48 family peptidase; 66.9 3.1 6.7E-05 39.1 1.5 15 268-282 122-136 (283)
77 PRK01345 heat shock protein Ht 66.5 3.1 6.8E-05 39.8 1.5 19 268-287 122-140 (317)
78 PRK03072 heat shock protein Ht 66.3 3.2 6.9E-05 39.2 1.5 15 268-282 125-139 (288)
79 cd06161 S2P-M50_SpoIVFB SpoIVF 64.5 3.7 8E-05 36.8 1.5 16 269-284 37-52 (208)
80 PRK04897 heat shock protein Ht 62.7 4.1 8.9E-05 38.6 1.5 15 268-282 135-149 (298)
81 PRK02870 heat shock protein Ht 61.5 4.4 9.6E-05 39.2 1.5 14 268-281 171-184 (336)
82 PRK05457 heat shock protein Ht 61.1 4.6 9.9E-05 38.1 1.5 14 268-281 132-145 (284)
83 PRK01265 heat shock protein Ht 61.0 4.6 9.9E-05 38.9 1.5 14 268-281 138-151 (324)
84 PRK02391 heat shock protein Ht 60.6 4.7 0.0001 38.2 1.5 14 268-281 131-144 (296)
85 PF13485 Peptidase_MA_2: Pepti 59.1 6 0.00013 31.1 1.7 14 269-282 24-37 (128)
86 cd06164 S2P-M50_SpoIVFB_CBS Sp 59.1 5.2 0.00011 36.4 1.5 15 270-284 53-67 (227)
87 KOG3538 Disintegrin metallopro 58.8 3 6.5E-05 45.3 -0.2 32 269-300 316-353 (845)
88 cd06459 M3B_Oligoendopeptidase 56.7 5 0.00011 39.3 1.0 14 269-282 221-234 (427)
89 cd06162 S2P-M50_PDZ_SREBP Ster 55.9 6.3 0.00014 37.1 1.5 13 270-282 135-147 (277)
90 PF13699 DUF4157: Domain of un 55.0 13 0.00028 28.1 2.8 30 248-282 44-73 (79)
91 cd06159 S2P-M50_PDZ_Arch Uncha 54.7 6.8 0.00015 36.6 1.5 16 270-285 118-133 (263)
92 TIGR00181 pepF oligoendopeptid 54.6 7.9 0.00017 40.1 2.1 15 268-282 376-390 (591)
93 KOG2921 Intramembrane metallop 54.5 6.4 0.00014 38.8 1.3 15 270-284 131-145 (484)
94 PF01432 Peptidase_M3: Peptida 54.5 6.2 0.00013 39.4 1.3 24 186-209 137-160 (458)
95 TIGR02289 M3_not_pepF oligoend 53.9 5.9 0.00013 40.8 1.0 15 268-282 335-349 (549)
96 cd06455 M3A_TOP Peptidase M3 T 52.3 7 0.00015 39.4 1.2 14 269-282 262-275 (472)
97 cd06160 S2P-M50_like_2 Unchara 51.6 8.4 0.00018 34.0 1.5 21 270-290 41-61 (183)
98 TIGR02290 M3_fam_3 oligoendope 50.2 7.3 0.00016 40.4 1.0 14 269-282 374-387 (587)
99 PF01447 Peptidase_M4: Thermol 49.8 8.2 0.00018 32.9 1.1 12 271-282 136-147 (150)
100 PLN02791 Nudix hydrolase homol 48.7 8.9 0.00019 41.2 1.4 21 269-289 584-604 (770)
101 PF01863 DUF45: Protein of unk 48.5 12 0.00026 32.8 2.0 38 248-290 147-184 (205)
102 cd06457 M3A_MIP Peptidase M3 m 48.4 8.7 0.00019 38.6 1.2 24 186-209 133-156 (458)
103 cd06456 M3A_DCP_Oligopeptidase 46.6 9.7 0.00021 37.9 1.2 15 268-282 206-220 (422)
104 PF11337 DUF3139: Protein of u 46.1 42 0.00091 25.5 4.5 23 52-74 29-51 (85)
105 PF01434 Peptidase_M41: Peptid 45.4 12 0.00026 33.5 1.6 15 268-282 26-40 (213)
106 TIGR00054 RIP metalloprotease 42.6 11 0.00025 37.4 1.0 12 271-282 15-26 (420)
107 PRK10779 zinc metallopeptidase 42.4 12 0.00025 37.6 1.1 12 271-282 16-27 (449)
108 cd06460 M32_Taq Peptidase fami 42.1 12 0.00026 37.1 1.0 12 271-282 160-171 (396)
109 KOG2719 Metalloprotease [Gener 42.0 13 0.00029 37.0 1.3 19 268-287 278-296 (428)
110 KOG3988 Protein-tyrosine sulfo 40.1 46 0.00099 31.7 4.4 61 42-104 297-357 (378)
111 COG1164 Oligoendopeptidase F [ 39.5 13 0.00029 38.7 1.0 14 269-282 379-392 (598)
112 COG4784 Putative Zn-dependent 39.3 21 0.00045 34.8 2.1 15 268-282 122-136 (479)
113 PF03272 Enhancin: Viral enhan 37.0 38 0.00082 36.6 3.9 42 165-208 129-191 (775)
114 COG4227 Antirestriction protei 36.8 22 0.00047 33.3 1.8 22 267-288 200-221 (316)
115 PRK10911 oligopeptidase A; Pro 35.0 18 0.0004 38.3 1.2 14 269-282 462-475 (680)
116 PF14891 Peptidase_M91: Effect 34.7 42 0.00091 29.0 3.2 18 269-286 102-119 (174)
117 PF03571 Peptidase_M49: Peptid 33.7 19 0.00042 37.1 1.0 16 269-284 292-308 (549)
118 COG2321 Predicted metalloprote 33.5 19 0.00042 33.8 0.9 39 244-282 138-179 (295)
119 CHL00132 psaF photosystem I su 31.8 1E+02 0.0023 27.1 5.1 38 59-106 45-82 (185)
120 COG3926 zliS Lysozyme family p 29.0 52 0.0011 30.1 2.8 30 94-123 93-126 (252)
121 KOG2661 Peptidase family M48 [ 28.6 31 0.00068 33.4 1.4 19 269-288 274-292 (424)
122 PF11576 DUF3236: Protein of u 27.5 42 0.00091 28.5 1.9 27 44-70 9-35 (154)
123 PF08434 CLCA_N: Calcium-activ 26.2 38 0.00082 31.7 1.5 14 271-284 150-164 (262)
124 PF06167 Peptidase_M90: Glucos 25.5 32 0.0007 31.9 0.9 19 268-286 152-170 (253)
125 PRK10280 dipeptidyl carboxypep 24.3 35 0.00077 36.2 1.1 31 187-217 364-397 (681)
126 PF13721 SecD-TM1: SecD export 23.7 1.5E+02 0.0032 23.5 4.3 29 48-76 36-64 (101)
127 cd02641 R3H_Smubp-2_like R3H d 23.6 55 0.0012 23.3 1.6 21 269-289 28-48 (60)
128 PF01431 Peptidase_M13: Peptid 22.8 47 0.001 29.1 1.4 16 269-284 35-50 (206)
129 cd06461 M2_ACE Peptidase famil 22.7 44 0.00095 34.0 1.4 16 269-284 246-261 (477)
130 PF12315 DUF3633: Protein of u 22.4 47 0.001 30.0 1.4 14 269-282 92-105 (212)
131 PRK10733 hflB ATP-dependent me 22.1 47 0.001 35.0 1.5 14 269-282 407-420 (644)
132 PF10026 DUF2268: Predicted Zn 20.6 3.3E+02 0.0072 23.8 6.4 18 269-286 64-81 (195)
133 smart00731 SprT SprT homologue 20.2 56 0.0012 27.3 1.3 16 268-283 57-72 (146)
134 cd02639 R3H_RRM R3H domain of 20.0 67 0.0015 23.0 1.5 23 269-291 28-50 (60)
No 1
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=100.00 E-value=2e-53 Score=419.74 Aligned_cols=226 Identities=43% Similarity=0.737 Sum_probs=203.8
Q ss_pred cHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcCCCcccccCHHHHHhhccCCccccccCCCcccc
Q 035681 59 GIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLNYHLKSTGVLDANTVAKMTIPRCGVADIVNGTTRM 138 (320)
Q Consensus 59 ~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~L~~TG~ld~~T~~~m~~PRCgv~D~~~~~~~~ 138 (320)
++..++.||++|||+...+..... .++..+++||+.||++++|++||++|.+|++.|.+|||||||.
T Consensus 29 ~~~~~~~yl~~~~y~~~~~~~~~~------~~~~~~~~al~~~q~~~~l~~tG~lD~~Tl~~m~~prCgvpd~------- 95 (469)
T KOG1565|consen 29 DKVALQDYLECYGYLPPTDLTATR------ASQNVLEDALKMMQDFFGLPVTGKLDNATLALMNKPRCGVPDG------- 95 (469)
T ss_pred chhHHHHHhhhcccCCCccccccc------cCchhhHHHHHhhhhhcCccccCCcchhhhhhccCCCcCCCCC-------
Confidence 789999999999999876422111 1578899999999999999999999999999999999999991
Q ss_pred cCCcccccCCCccccccceecCCCCCCCCCCcccceEeccCCCc----hhHHHHHHHHHHHhhhcCcceEEEecCCCCCc
Q 035681 139 RSGKKRQQHNHHFRTVSHYSFFPRSPKWPASKFSLTYAFRPGTR----SDAITPVARAFQTWAANTQFKFSKIEDFANAD 214 (320)
Q Consensus 139 ~~~~~~~~~~~~~~~~~~y~~~~~~~kW~~~k~~Lty~~~~~~~----~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aD 214 (320)
+|+++++++||+ +.+|||+|.+.++ .+++.++++||+.|+++++++|.|+.....||
T Consensus 96 -----------------~~~~~~~~~kW~--k~~lT~ri~n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aD 156 (469)
T KOG1565|consen 96 -----------------RYRYFPGKPKWN--KEHLTYRIKNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEAD 156 (469)
T ss_pred -----------------ccccCcccCccc--ccccceeccccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCc
Confidence 477888999999 9999999998753 37899999999999999999999999866999
Q ss_pred eEEEeeeCCCCCCCCCCCCCCCCcceeeeeCC---CCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCCCCC
Q 035681 215 IKISFESGDHGDGNPFDGRGGPEGIIAHAFAP---TNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSSVEG 291 (320)
Q Consensus 215 I~I~f~~~~hgd~~~fdG~g~~~g~laha~~P---~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~ 291 (320)
|+|.|..+.|||++||||++ |+|||||.| .+|++|||++|.|+++ ...++|+..||+|||||+|||.||.+++
T Consensus 157 i~i~F~~~~h~d~~PFDG~~---g~laHAf~Pg~~~~G~~hfD~dE~Wt~~-~~~g~~l~~Va~HEiGH~LGL~HS~~~~ 232 (469)
T KOG1565|consen 157 IRISFFPGDHGDGFPFDGPG---GVLAHAFFPGPGIGGDLHFDKDETWTYG-DSNGVDLFLVAAHEIGHALGLGHSSDPD 232 (469)
T ss_pred eeeeeeccCCCCCCcccCCC---CceecccCCCCCCCCccccCcccceecc-CCccchhHHHhhhhcccccccCCCCCcc
Confidence 99999999999999999999 999999999 6899999999999997 3457899999999999999999999999
Q ss_pred CcccccccC-CCCCCCCHHHHHHHHHHhCC
Q 035681 292 AIMYPSISA-GVTKGLHGDDIQGIRDLYNV 320 (320)
Q Consensus 292 siMyp~~~~-~~~~~L~~dDi~~iq~LYg~ 320 (320)
+||||+|.+ .....|++|||.|||.|||.
T Consensus 233 aiM~P~y~~~~~~~~L~~DDv~giq~lYG~ 262 (469)
T KOG1565|consen 233 AIMYPFYQPDSGNFDLSQDDVRGIQHLYGG 262 (469)
T ss_pred cccccccccCCCCcccChhhhhhhHHHhCC
Confidence 999999994 23458999999999999984
No 2
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=100.00 E-value=4.3e-36 Score=258.06 Aligned_cols=150 Identities=48% Similarity=0.827 Sum_probs=135.4
Q ss_pred CCCCCcccceEeccCCC----chhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcce
Q 035681 165 KWPASKFSLTYAFRPGT----RSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGII 240 (320)
Q Consensus 165 kW~~~k~~Lty~~~~~~----~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~l 240 (320)
||+ +++|||+|.+.. ..+++++|++||+.|+++++|+|+|+.+...|||+|.|....|+|+++|+|.+ |++
T Consensus 1 kW~--~~~itY~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~---g~l 75 (157)
T cd04278 1 KWS--KTNLTYRILNYPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPG---GTL 75 (157)
T ss_pred CCC--CCceeEEEECCCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCc---ccc
Confidence 799 999999998754 24788999999999999999999999876689999999999999999999999 999
Q ss_pred eeeeCC--CCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCCCCCCcccccccCC-CCCCCCHHHHHHHHHH
Q 035681 241 AHAFAP--TNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSSVEGAIMYPSISAG-VTKGLHGDDIQGIRDL 317 (320)
Q Consensus 241 aha~~P--~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~siMyp~~~~~-~~~~L~~dDi~~iq~L 317 (320)
+||++| ..|+|+||.++.|++.....+.+++.|++|||||+|||.|+.++.+||||.+... ....|+++||.+||+|
T Consensus 76 ~~a~~p~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~vM~~~~~~~~~~~~l~~~D~~~~~~l 155 (157)
T cd04278 76 AHAFFPGGIGGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHALGLGHSSDPDSIMYPYYQGPVPKFKLSQDDIRGIQAL 155 (157)
T ss_pred ccccCCCCcceeEEECCCcccccCCCCccchHHHHHHHHhccccccCCCCCCcCeecccccCCCcCCCCCHHHHHHHHHh
Confidence 999998 7899999999999886534567899999999999999999999999999999843 2459999999999999
Q ss_pred hC
Q 035681 318 YN 319 (320)
Q Consensus 318 Yg 319 (320)
||
T Consensus 156 Yg 157 (157)
T cd04278 156 YG 157 (157)
T ss_pred cC
Confidence 98
No 3
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=100.00 E-value=4.4e-34 Score=243.37 Aligned_cols=147 Identities=45% Similarity=0.843 Sum_probs=130.8
Q ss_pred CCCCCcccceEeccCCCc----hhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcce
Q 035681 165 KWPASKFSLTYAFRPGTR----SDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGII 240 (320)
Q Consensus 165 kW~~~k~~Lty~~~~~~~----~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~l 240 (320)
||+ +++|||+|.+..+ .+.++++++||+.|+++.+++|+++.+. .+||+|.|....++++..|++.+ +++
T Consensus 1 ~W~--~~~ity~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~~-~adi~i~~~~~~~~~~~~~~~~~---~~~ 74 (154)
T PF00413_consen 1 KWP--KKTITYSISNTTPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSDG-NADIRISFGSNNHGDGYSFDGSG---GTL 74 (154)
T ss_dssp SSS--SSEEEEEESSBCTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESSS-SCSEEEEEESSSSSSSS-CSSSS---SES
T ss_pred CCC--CCcEEEEEECCCCCCCHHHHHHHHHHHHHHHHhcCCceEEeccCC-CcceeeeeeccccCcccccccce---eee
Confidence 799 8999999998653 2688999999999999999999999963 69999999998899888899988 999
Q ss_pred eeeeCCC---CceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCCCCCCcccccccCCCCCCCCHHHHHHHHHH
Q 035681 241 AHAFAPT---NGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSSVEGAIMYPSISAGVTKGLHGDDIQGIRDL 317 (320)
Q Consensus 241 aha~~P~---~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~siMyp~~~~~~~~~L~~dDi~~iq~L 317 (320)
++++.|. .|+++|+.++.|... ....++..|++|||||+|||.|+.++++||||.+.......|+.+||.+||+|
T Consensus 75 ~~~~~~~~~~~~~i~~~~~~~~~~~--~~~~~~~~v~~HEiGHaLGL~H~~~~~svM~~~~~~~~~~~l~~~Di~~i~~l 152 (154)
T PF00413_consen 75 AHAYFPNNIVSGDIHFNDDESWTID--DSGNDLQSVAIHEIGHALGLDHSNDPNSVMYPYYRGPDNKTLSEDDIDGIQYL 152 (154)
T ss_dssp EEEEESSSTTTTEEEEETTSHEESS--SSSEEHHHHHHHHHHHHTTBESSSSTTSTTSSSCTSSSSTSTTHHHHHHHHHH
T ss_pred eccccccccccccccccccccchhh--hhhhhhhhhhhhccccccCcCcCCCcccceeeecccCCCCCCCHHHHHHHHHH
Confidence 9999985 799999999999875 23568999999999999999999999999999998544359999999999999
Q ss_pred hC
Q 035681 318 YN 319 (320)
Q Consensus 318 Yg 319 (320)
||
T Consensus 153 Yg 154 (154)
T PF00413_consen 153 YG 154 (154)
T ss_dssp HS
T ss_pred hC
Confidence 98
No 4
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.95 E-value=3.2e-27 Score=202.67 Aligned_cols=139 Identities=30% Similarity=0.427 Sum_probs=110.0
Q ss_pred ccceEeccCCC------chhHHHHHHHHHHHhhhcCcceEEEecCC-CCCceEEEeeeCCCCCCCCCCCCCCCCcceeee
Q 035681 171 FSLTYAFRPGT------RSDAITPVARAFQTWAANTQFKFSKIEDF-ANADIKISFESGDHGDGNPFDGRGGPEGIIAHA 243 (320)
Q Consensus 171 ~~Lty~~~~~~------~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~-~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha 243 (320)
..|+|++.+.. +.+++++|++||+.|+++++|+|+++.+. ..+||+|.|.... +|+|.| +++||+
T Consensus 2 ~~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~-----~~~~~g---~~~a~a 73 (156)
T cd04279 2 SPIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPP-----PVGGAG---GGLARA 73 (156)
T ss_pred CCeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCC-----CCCCCC---CceEEe
Confidence 46888888753 33788999999999999999999998753 2799999998632 788899 999999
Q ss_pred eCCCCceeEecCCCCCcc------CCccccchhHHHHHHhhhhhhCCCCCCCCC-CcccccccCCCC--CCCCHHHHHHH
Q 035681 244 FAPTNGRFHYDADEPWAV------GAVQGAFDLETVALHEIGHLLGLHHSSVEG-AIMYPSISAGVT--KGLHGDDIQGI 314 (320)
Q Consensus 244 ~~P~~G~i~~d~~e~w~~------~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~-siMyp~~~~~~~--~~L~~dDi~~i 314 (320)
++|..|. ++....|.. .....+.+++.|++|||||+|||.|+.++. +||||++..+.. ..|+.+||++|
T Consensus 74 ~~p~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~siM~p~~~~~~~~~~~l~~~D~~~i 151 (156)
T cd04279 74 GFPLISD--GNRKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSDRPEDAMYPSQGQGPDGNPTLSARDVATL 151 (156)
T ss_pred cccccCC--CcccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCCCccceeeeecccCCcCcCCCCHHHHHHH
Confidence 9997533 222222221 111124689999999999999999999998 999999985432 48999999999
Q ss_pred HHHhC
Q 035681 315 RDLYN 319 (320)
Q Consensus 315 q~LYg 319 (320)
++|||
T Consensus 152 ~~lY~ 156 (156)
T cd04279 152 KRLYG 156 (156)
T ss_pred HHHhC
Confidence 99997
No 5
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.90 E-value=4.5e-24 Score=183.85 Aligned_cols=142 Identities=23% Similarity=0.279 Sum_probs=106.8
Q ss_pred cccceEeccCCCchhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceeeeeCCCCc
Q 035681 170 KFSLTYAFRPGTRSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAHAFAPTNG 249 (320)
Q Consensus 170 k~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~~P~~G 249 (320)
++.|+|+|.+..+...+++|++||+.|+++++|+|+++...+.+||+|.+....+.. +|.. +.+++...|..|
T Consensus 1 ~~~I~y~i~~~~~~~~r~~i~~A~~~W~~~~~i~F~e~~~~~~~di~i~~~~~~~~~----~~~~---~~~~~~~~~~~g 73 (165)
T cd04268 1 KKPITYYIDDSVPDKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYN----DGTW---SYGPSQVDPLTG 73 (165)
T ss_pred CCCEEEEEcCCCCHHHHHHHHHHHHHHHHHhCcCceeccccCccCceEEEEEeecCC----CCcc---ccCCccCCCCCc
Confidence 468999999887778999999999999999999999998766799999988643211 0111 222222245678
Q ss_pred eeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCCC----------------CCCcccccccC-------CCCCCC
Q 035681 250 RFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSSV----------------EGAIMYPSISA-------GVTKGL 306 (320)
Q Consensus 250 ~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~----------------~~siMyp~~~~-------~~~~~L 306 (320)
+|+++....|.......+.++..|++|||||+|||.|+.+ ..|||++.... +....|
T Consensus 74 ~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SvM~y~~~~~~~~~~~~~~~~~ 153 (165)
T cd04268 74 EILLARVYLYSSFVEYSGARLRNTAEHELGHALGLRHNFAASDRDDNVDLLAEKGDTSSVMDYAPSNFSIQLGDGQKYTI 153 (165)
T ss_pred cEEeeEEEEchhHHHHHHHHHHHHHHHHHHHHhcccccCcCCccCCcchhhccCCCCcccCCCCccccccccccccCCCC
Confidence 9998876554321011234688999999999999999998 89999987652 124589
Q ss_pred CHHHHHHHHHHh
Q 035681 307 HGDDIQGIRDLY 318 (320)
Q Consensus 307 ~~dDi~~iq~LY 318 (320)
+.+||.+||.||
T Consensus 154 ~~~Di~ai~~lY 165 (165)
T cd04268 154 GPYDIAAIKKLY 165 (165)
T ss_pred CHHHHHHHHhcC
Confidence 999999999999
No 6
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.88 E-value=1.1e-22 Score=171.07 Aligned_cols=133 Identities=34% Similarity=0.528 Sum_probs=108.0
Q ss_pred CCCCCCCcccceEecc-CCCch-hHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcce
Q 035681 163 SPKWPASKFSLTYAFR-PGTRS-DAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGII 240 (320)
Q Consensus 163 ~~kW~~~k~~Lty~~~-~~~~~-~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~l 240 (320)
..+|+ +.+|+|+|. +..+. +++++|++||+.|+++++|+|+|+.+ .+||.|.|...++ .| ..+
T Consensus 2 ~~~W~--~~~v~Y~i~~~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~--~ad~~I~f~~~~~--------~g---~~~ 66 (140)
T smart00235 2 SKKWP--KGTVPYVIDSSSLSPEEVREAIARAFAEWSDVTCLRFVERTS--TADIYISFGKGDG--------SG---CTL 66 (140)
T ss_pred CCcCC--CCcEEEEEcCCCCCHHHHHHHHHHHHHHHhcCCeeEEEECCC--CCCeEEEEEECCC--------CC---cce
Confidence 36899 889999997 54443 45899999999999999999999875 7999999998765 12 236
Q ss_pred eeeeCCCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCCCCCC---cccccccCCC-CCCCCHHHHHHHHH
Q 035681 241 AHAFAPTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSSVEGA---IMYPSISAGV-TKGLHGDDIQGIRD 316 (320)
Q Consensus 241 aha~~P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~s---iMyp~~~~~~-~~~L~~dDi~~iq~ 316 (320)
++++.| .|+++++. +.|..+ ..|++|||||+|||.|+.++.+ +|++.+..-. ...|+.+|..++++
T Consensus 67 a~~g~~-~g~~~~~~-~~~~~~--------~~~~~HEigHaLGl~H~~~~~drd~~~~~~~~~~~~~~~~~~~~~~~~~~ 136 (140)
T smart00235 67 SHAGRP-GGDQHFSL-GNGCIN--------TGVAAHELGHALGLYHEQSRSDRDNYMYINYTNITRNFDLSNDDSLGIPY 136 (140)
T ss_pred eeeecC-CCceEEEc-cCCcCC--------cccHHHHHHHHhcCCcCCCCCcccCeEEEehhhhhhccccccccCCCchh
Confidence 888888 88999976 666543 3499999999999999998877 9999997321 34778889999999
Q ss_pred HhCC
Q 035681 317 LYNV 320 (320)
Q Consensus 317 LYg~ 320 (320)
+||.
T Consensus 137 ~yg~ 140 (140)
T smart00235 137 DYGS 140 (140)
T ss_pred ccCc
Confidence 9984
No 7
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.86 E-value=6.3e-21 Score=168.15 Aligned_cols=122 Identities=22% Similarity=0.310 Sum_probs=96.0
Q ss_pred hHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceeeeeCCC-------CceeEecCC
Q 035681 184 DAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAHAFAPT-------NGRFHYDAD 256 (320)
Q Consensus 184 ~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~~P~-------~G~i~~d~~ 256 (320)
+.++++++||+.|+++++|+|+|+.+...+||+|.+..... + +..+.+++|. .|+++++.+
T Consensus 34 ~~~~~i~~A~~~w~~~~~l~F~e~~~~~~adI~i~~~~~~~---------~---~~~g~a~~p~~~~~~~~~g~i~~~~~ 101 (186)
T cd04277 34 AQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPD---------G---NTAGYAYYPGSGSGTAYGGDIWFNSS 101 (186)
T ss_pred HHHHHHHHHHHHHHhhcCceeEECCCCCcceEEEEeccCCC---------C---CccEEEECCCCCccccccceeEEecC
Confidence 67899999999999999999999997768999998875321 2 5677888873 368999988
Q ss_pred CCCccCCccccchhHHHHHHhhhhhhCCCCCCCC----------------CCcccccccCC--------CCCCCCHHHHH
Q 035681 257 EPWAVGAVQGAFDLETVALHEIGHLLGLHHSSVE----------------GAIMYPSISAG--------VTKGLHGDDIQ 312 (320)
Q Consensus 257 e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~----------------~siMyp~~~~~--------~~~~L~~dDi~ 312 (320)
..+... ..+.....|++|||||+|||.|+.+. .+||.+....+ ....+...||+
T Consensus 102 ~~~~~~--~~g~~~~~t~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SVMSY~~~~~~~~~~~~~~~~~~~~~DI~ 179 (186)
T cd04277 102 YDTNSD--SPGSYGYQTIIHEIGHALGLEHPGDYNGGDPVPPTYALDSREYTVMSYNSGYGNGASAGGGYPQTPMLLDIA 179 (186)
T ss_pred cccccC--CCChhhHHHHHHHHHHHhcCCCCCcCCCCCCCCccccccCcceEEEeecCCCCCCccccCcccCCccHHHHH
Confidence 776543 22346789999999999999998752 46887554432 22378899999
Q ss_pred HHHHHhC
Q 035681 313 GIRDLYN 319 (320)
Q Consensus 313 ~iq~LYg 319 (320)
+||+|||
T Consensus 180 AlQ~lYG 186 (186)
T cd04277 180 ALQYLYG 186 (186)
T ss_pred HHHHhhC
Confidence 9999998
No 8
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.72 E-value=6.4e-17 Score=144.25 Aligned_cols=108 Identities=29% Similarity=0.495 Sum_probs=76.3
Q ss_pred CCCCCcccceEeccCCCchhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceeeee
Q 035681 165 KWPASKFSLTYAFRPGTRSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAHAF 244 (320)
Q Consensus 165 kW~~~k~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~ 244 (320)
+|++ .+.|+|.|..+.+.+.++.|++||+.|+++++|+|++++.. .+||+|.|..+. | |+++.|.. +.. .
T Consensus 2 ~W~~-~~~~~~~f~~~~~~~~r~~I~~A~~~W~~~t~i~F~~~~~~-~adi~I~f~~~~-G-c~S~vG~~---~~~---~ 71 (198)
T cd04327 2 LWRN-GTVLRIAFLGGPDAFLKDKVRAAAREWLPYANLKFKFVTDA-DADIRISFTPGD-G-YWSYVGTD---ALL---I 71 (198)
T ss_pred CCCC-CCeEEEEeCCCCcHHHHHHHHHHHHHHhhhcCeEEEECCCC-CCCEEEEEecCC-C-CCCCcCCc---ccc---c
Confidence 6985 56789999887666788999999999999999999999864 799999998742 4 45666653 221 1
Q ss_pred CCCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCC
Q 035681 245 APTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHS 287 (320)
Q Consensus 245 ~P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs 287 (320)
.+....+.+. |.... ..+.+...|++|||||||||.|.
T Consensus 72 ~~~~~t~~l~----~~~~~-~~~~~~~~~i~HElgHaLG~~HE 109 (198)
T cd04327 72 GADAPTMNLG----WFTDD-TPDPEFSRVVLHEFGHALGFIHE 109 (198)
T ss_pred CCCCceeeee----eecCC-CchhhHHHHHHHHHHHHhcCccc
Confidence 1112233332 21110 02346678999999999999996
No 9
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.72 E-value=7.2e-17 Score=138.89 Aligned_cols=119 Identities=24% Similarity=0.296 Sum_probs=90.3
Q ss_pred chhHHHHHHHHHHHhhhcCcceEEEecCC-CCCceEEEeeeCCCCCCCCCCCCCCCCcceeeeeCCC-----CceeEecC
Q 035681 182 RSDAITPVARAFQTWAANTQFKFSKIEDF-ANADIKISFESGDHGDGNPFDGRGGPEGIIAHAFAPT-----NGRFHYDA 255 (320)
Q Consensus 182 ~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~-~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~~P~-----~G~i~~d~ 255 (320)
+..++++|+.|++.|+++++++|+++... ..+||.+.+...++ .+ +.++.++.|. .+.+.+..
T Consensus 20 ~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~--------~~---~~~g~a~~~~~c~~~~~~~~~~~ 88 (167)
T cd00203 20 SAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDF--------DG---GTGGWAYLGRVCDSLRGVGVLQD 88 (167)
T ss_pred HHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccC--------CC---CceEEEecCCccCCCCCcEEEec
Confidence 33688999999999999999999999864 47999988875432 12 4556677653 35565544
Q ss_pred CCCCccCCccccchhHHHHHHhhhhhhCCCCCCC--------------------CCCcccccccC---CCCCCCCHHHHH
Q 035681 256 DEPWAVGAVQGAFDLETVALHEIGHLLGLHHSSV--------------------EGAIMYPSISA---GVTKGLHGDDIQ 312 (320)
Q Consensus 256 ~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~--------------------~~siMyp~~~~---~~~~~L~~dDi~ 312 (320)
...+ ......|++|||||+|||.|... ..+||+|.... .....++..||.
T Consensus 89 ~~~~-------~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~~~~~~~~~~~~siM~y~~~~~~~~~~~~fS~~d~~ 161 (167)
T cd00203 89 NQSG-------TKEGAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSVMSYTKGSFSDGQRKDFSQCDID 161 (167)
T ss_pred CCcc-------cccchhhHHHHHHHHhCCCccCcCCCCCCCccccccccCCCCCCCeEeccCccccCcccCCCcCHHHHH
Confidence 3322 23567899999999999999765 46899998774 245689999999
Q ss_pred HHHHHh
Q 035681 313 GIRDLY 318 (320)
Q Consensus 313 ~iq~LY 318 (320)
.||++|
T Consensus 162 ~i~~~Y 167 (167)
T cd00203 162 QINKLY 167 (167)
T ss_pred HHHhhC
Confidence 999998
No 10
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=99.59 E-value=1.5e-14 Score=128.83 Aligned_cols=137 Identities=25% Similarity=0.292 Sum_probs=95.8
Q ss_pred CCCCCcccceEeccC----CCchhHHHHHHHHHHHhhhc-CcceEEEe--cCCCCCceEEEeeeCCCCCCCCCCCCCCCC
Q 035681 165 KWPASKFSLTYAFRP----GTRSDAITPVARAFQTWAAN-TQFKFSKI--EDFANADIKISFESGDHGDGNPFDGRGGPE 237 (320)
Q Consensus 165 kW~~~k~~Lty~~~~----~~~~~~~~~i~~A~~~Ws~v-~~l~F~e~--~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~ 237 (320)
-|....++|+..+.+ ..+...+.++.+|++.|+.+ ..|+|+-+ .....+||.|.-... +.+.
T Consensus 33 lV~~~~~~I~i~~~~~~~~~l~s~~~~al~~AI~~yNalgl~i~F~~t~g~~~~~~di~v~~~~~--------~~~~--- 101 (211)
T PF12388_consen 33 LVGSSPRTITIIGYTGGSQSLNSAWRTALDEAINNYNALGLSITFTLTFGTNYQNADIIVYSNSS--------NNPS--- 101 (211)
T ss_pred eecCCCCEEEEEeCCCccccccHHHHHHHHHHHHHHHhhCCceEEEEecCCCcCCCceEEEeccC--------CCCC---
Confidence 453236788888887 33446889999999999996 35788522 233466776632211 1123
Q ss_pred cceeeeeCC-CCc----eeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCC------------------------
Q 035681 238 GIIAHAFAP-TNG----RFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSS------------------------ 288 (320)
Q Consensus 238 g~laha~~P-~~G----~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~------------------------ 288 (320)
|..+.|.+| .+| .|.+...+. ......++|++|||||+|||.|++
T Consensus 102 G~ggsAGFP~s~G~P~~~I~I~~~~~------~~~~~~~hvi~HEiGH~IGfRHTD~~~R~SCG~~~nEg~~~vGAi~Ip 175 (211)
T PF12388_consen 102 GAGGSAGFPTSNGNPYKFIQIYGLSN------YSVNVIEHVITHEIGHCIGFRHTDYFNRSSCGSGGNEGSAGVGAIHIP 175 (211)
T ss_pred CcceeccCCCCCCCCCceEEEEecCC------CchhHHHHHHHHHhhhhccccccCcCCcccccccCCcCccccceEECC
Confidence 567788889 554 455511111 112246789999999999999986
Q ss_pred ------CCCCcccccccCCCCCCCCHHHHHHHHHHh
Q 035681 289 ------VEGAIMYPSISAGVTKGLHGDDIQGIRDLY 318 (320)
Q Consensus 289 ------~~~siMyp~~~~~~~~~L~~dDi~~iq~LY 318 (320)
|++|||..++..+...+++..||.++++||
T Consensus 176 GTPt~~d~~SiM~ac~~~~~~~~f~~~Di~Al~~lY 211 (211)
T PF12388_consen 176 GTPTGADPNSIMNACFSSGEDGEFTSNDITALNYLY 211 (211)
T ss_pred CCCCCCCCchhhhccccCCCCCCcChhhHHHHHhhC
Confidence 467999999887777899999999999999
No 11
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.52 E-value=1.1e-13 Score=123.03 Aligned_cols=144 Identities=17% Similarity=0.121 Sum_probs=89.5
Q ss_pred cccceEeccCCCchhHHHHHHHHHHHhhhc------CcceEEEecCC--CCCceEEEeeeCCCCCCCCCCCCCCCCccee
Q 035681 170 KFSLTYAFRPGTRSDAITPVARAFQTWAAN------TQFKFSKIEDF--ANADIKISFESGDHGDGNPFDGRGGPEGIIA 241 (320)
Q Consensus 170 k~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v------~~l~F~e~~~~--~~aDI~I~f~~~~hgd~~~fdG~g~~~g~la 241 (320)
|+.|||+|.+..|.+++++|++||+.|+++ .++.+++.... ...||++..-+--+.+ ++.. +...
T Consensus 7 k~pItyyI~~~~p~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~~----~~~~---~~gp 79 (197)
T cd04276 7 KEPIVYYLDNTFPEKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSP----NGGW---AYGP 79 (197)
T ss_pred CCCEEEEecCCCcHHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEecC----CCcc---eecc
Confidence 889999999998888999999999999997 35555555422 2468984433322322 1111 2333
Q ss_pred eeeCCCCc-----eeEecCCCCCccCC---ccccchhHHHHHHhhhhhhCCCCCCC---------------------CCC
Q 035681 242 HAFAPTNG-----RFHYDADEPWAVGA---VQGAFDLETVALHEIGHLLGLHHSSV---------------------EGA 292 (320)
Q Consensus 242 ha~~P~~G-----~i~~d~~e~w~~~~---~~~~~~l~~Va~HEiGHaLGL~Hs~~---------------------~~s 292 (320)
+..-|..| +|.++......... ...+..+..+++||+||+|||.|... ..|
T Consensus 80 s~~dPrTGeIl~a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lGl~hn~~~s~~~~~~~l~~~~~~~~~~~~~S 159 (197)
T cd04276 80 SVVDPRTGEILKADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLGLRHNFKASSDGSNEELEDPLGTKEKGATSS 159 (197)
T ss_pred cccCCCCCCeEEEEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhcCcccccccccCchhhhcchhhhhhcCCcce
Confidence 44446444 55665533211110 01123577899999999999999631 245
Q ss_pred cccc---cccC-------CCCCCCCHHHHHHHHHHhCC
Q 035681 293 IMYP---SISA-------GVTKGLHGDDIQGIRDLYNV 320 (320)
Q Consensus 293 iMyp---~~~~-------~~~~~L~~dDi~~iq~LYg~ 320 (320)
||-. .+.. .....+...||.+||.+||.
T Consensus 160 VMdY~~~~~~~~~~~~~~~~~~~~g~yDi~Aiq~~Y~~ 197 (197)
T cd04276 160 VMDYPPPNVAAQGEDQGDYYPPTIGPYDKWAIEYGYTP 197 (197)
T ss_pred eecCccccccccCccccccccCCCchHHHHHHHHHcCC
Confidence 5532 1100 01236788999999999984
No 12
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=99.51 E-value=1.1e-13 Score=121.62 Aligned_cols=91 Identities=20% Similarity=0.230 Sum_probs=64.3
Q ss_pred ccceEeccCCCchhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceeeeeCCCCce
Q 035681 171 FSLTYAFRPGTRSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAHAFAPTNGR 250 (320)
Q Consensus 171 ~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~~P~~G~ 250 (320)
..|.|.|++..+...++.|++||+.|++++||+|++.+. ++..|.|..+ .| |++.-|.- + ...+
T Consensus 2 ~~VpY~i~~~~~~~~~~~I~~A~~~w~~~TcIrF~~~~~---~~~~I~f~~~-~G-c~S~vG~~---~--------~~q~ 65 (180)
T cd04280 2 GTVPYVIDGSFDESDRSLILRAMREIESNTCIRFVPRTT---EKDYIRIVKG-SG-CWSYVGRV---G--------GRQV 65 (180)
T ss_pred CEEEEEECCCCCHHHHHHHHHHHHHHHhCCcceEEECCC---CCcEEEEEcC-CC-cceecCcc---C--------Ccee
Confidence 478999998766688899999999999999999999874 5556677654 34 34433321 0 1123
Q ss_pred eEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCC
Q 035681 251 FHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSS 288 (320)
Q Consensus 251 i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~ 288 (320)
+.+.. ++....+++|||||||||.|..
T Consensus 66 i~l~~-----------~c~~~g~v~HE~~HalG~~HEh 92 (180)
T cd04280 66 VSLGS-----------GCFSLGTIVHELMHALGFYHEQ 92 (180)
T ss_pred EEeCC-----------CcCcCchhHHHHHHHhcCcchh
Confidence 33322 1233579999999999999974
No 13
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=99.50 E-value=7.5e-14 Score=123.90 Aligned_cols=94 Identities=30% Similarity=0.513 Sum_probs=62.1
Q ss_pred CCCCCcccceEeccCCCchhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceeeee
Q 035681 165 KWPASKFSLTYAFRPGTRSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAHAF 244 (320)
Q Consensus 165 kW~~~k~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~ 244 (320)
+|+ ..+|.|.|....+...++.|++|+..|++.++|+|+++. . .....|.|.. ..|| +++-|..
T Consensus 2 ~Wp--~~~IpY~~~~~~~~~~~~~I~~Am~~~e~~TcI~F~~~~-~-~~~~~i~~~~-~~gC-~S~vG~~---------- 65 (191)
T PF01400_consen 2 KWP--NGTIPYYIDPSFSSSQRQRIRKAMDEWEKNTCIRFVERT-E-NEDDYISFSN-GSGC-WSYVGRQ---------- 65 (191)
T ss_dssp S-G--GGEEEEEEETTS-HHHHHHHHHHHHHHHHHSSEEEEE-S-S-SSSSEEEEES-SSSE-EEESS------------
T ss_pred cCC--CCEEEEEECCCCCHHHHHHHHHHHHHHHhCCCEEEEECC-C-CCceEEEeec-Cccc-cchhhhc----------
Confidence 799 889999999877767889999999999999999999998 3 4444556654 2333 3333321
Q ss_pred CCCCc--eeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCC
Q 035681 245 APTNG--RFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSS 288 (320)
Q Consensus 245 ~P~~G--~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~ 288 (320)
.| .+.+. . ++....|++|||||||||.|..
T Consensus 66 ---~g~q~i~l~--~---------~c~~~~~i~HEl~HaLG~~HEh 97 (191)
T PF01400_consen 66 ---GGEQTINLG--D---------GCFSVGTILHELGHALGFWHEH 97 (191)
T ss_dssp ---SSEEEEEE---T---------TC-SHHHHHHHHHHHHTB--GG
T ss_pred ---CcceeEEec--c---------eeCCccchHHHHHHHHhhhhhh
Confidence 12 23332 1 1223579999999999999974
No 14
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=99.48 E-value=4.4e-13 Score=119.54 Aligned_cols=99 Identities=20% Similarity=0.317 Sum_probs=67.5
Q ss_pred CCCCCCCcccceEeccCCCchhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceee
Q 035681 163 SPKWPASKFSLTYAFRPGTRSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAH 242 (320)
Q Consensus 163 ~~kW~~~k~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~lah 242 (320)
..+|+ +..|.|.|........++.|++||+.|++.+||+|++..+. ...|.|..+..|| +++-|.- +.-.
T Consensus 7 ~~~Wp--~~~VpY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~r~~~---~~yi~f~~~~~GC-~S~vG~~---~~g~- 76 (200)
T cd04281 7 ERIWP--GGVIPYVIDGNFTGSQRAMFKQAMRHWENFTCVTFVERTPE---ENYIVFTYRPCGC-CSYVGRR---GNGP- 76 (200)
T ss_pred cCcCC--CCEEEEEECCCCCHHHHHHHHHHHHHHHhCCceEEEECCCC---CCEEEEEECCCCe-eEcCCCc---CCCc-
Confidence 46999 88999999877665678999999999999999999997642 3345555443443 4555542 1000
Q ss_pred eeCCCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCC
Q 035681 243 AFAPTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSS 288 (320)
Q Consensus 243 a~~P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~ 288 (320)
-.+.+.. ++....|++|||||||||.|..
T Consensus 77 ------q~isl~~-----------~C~~~Gti~HEl~HaLGf~HEh 105 (200)
T cd04281 77 ------QAISIGK-----------NCDKFGIVVHELGHVIGFWHEH 105 (200)
T ss_pred ------eeeecCC-----------CcCcCchHHHHHHHHhcCcchh
Confidence 0122211 1223469999999999999864
No 15
>PF01471 PG_binding_1: Putative peptidoglycan binding domain; InterPro: IPR002477 This entry represents peptidoglycan binding domain (PGBD), as well as related domains that share the same structure. PGBD may have a general peptidoglycan binding function, has a core structure consisting of a closed, three-helical bundle with a left-handed twist. It is found at the N or C terminus of a variety of enzymes involved in bacterial cell wall degradation [, , ]. Examples are: Muramoyl-pentapeptide carboxypeptidase (3.4.17.8 from EC) N-acetylmuramoyl-L-alanine amidase cwlA precursor (cell wall hydrolase, autolysin, 3.5.1.28 from EC) Autolytic lysozyme (1,4-beta-N-acetylmuramidase, autolysin, 3.2.1.17 from EC) Membrane-bound lytic murein transglycosylase B Zinc-containing D-alanyl-D-alanine-cleaving carboxypeptidase, VanX []. Many of the proteins having this domain are as yet uncharacterised. However, some are known to belong to MEROPS peptidase family M15 (clan MD), subfamily M15A metallopeptidases. A number of the proteins belonging to subfamily M15A are non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. Eukaryotic enzymes can contain structurally similar PGBD-like domains. Matrix metalloproteinases (MMP), which catalyse extracellular matrix degradation, have N-terminal domains that resemble PGBD. Examples are gelatinase A (MMP-2), which degrades type IV collagen [], stromelysin-1 (MMP-3), which plays a role in arthritis and tumour invasion [, ], and gelatinase B (MMP-9) secreted by neutrophils as part of the innate immune defence mechanism []. Several MMPs are implicated in cancer progression, since degradation of the extracellular matrix is an essential step in the cascade of metastasis [].; GO: 0008152 metabolic process; PDB: 1L6J_A 3BKH_A 3BKV_A 1GXD_A 1EAK_D 1CK7_A 1SLM_A 1LBU_A 1SU3_B.
Probab=99.45 E-value=1.8e-13 Score=97.72 Aligned_cols=56 Identities=29% Similarity=0.444 Sum_probs=50.2
Q ss_pred hhcHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcCCCcccccCHHHHHhh
Q 035681 57 IKGIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLNYHLKSTGVLDANTVAKM 121 (320)
Q Consensus 57 ~~~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~L~~TG~ld~~T~~~m 121 (320)
++.|..+|++|.++||++. ..++.|++.+++||++||+.+||++||++|.+|+++|
T Consensus 2 ~~~v~~lq~~L~~~gy~~~---------~~~g~~~~~t~~Av~~fQ~~~gL~~tG~~d~~T~~~L 57 (57)
T PF01471_consen 2 GPDVKALQQYLNRLGYYPG---------PVDGIFDPETREAVKAFQKANGLPVTGVVDPETWEAL 57 (57)
T ss_dssp SHHHHHHHHHHHHTTTT-S---------STTSBSHHHHHHHHHHHHHHTTS-SSSSBCHHHHHHH
T ss_pred cHHHHHHHHHHHHcCCCCC---------CCCCCcCHHHHHHHHHHHHHcCcCCCCccCHHHHhcC
Confidence 5789999999999999953 3789999999999999999999999999999999987
No 16
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=99.41 E-value=2e-12 Score=117.47 Aligned_cols=96 Identities=22% Similarity=0.337 Sum_probs=66.4
Q ss_pred CCCCCCCcccceEeccCCCchhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceee
Q 035681 163 SPKWPASKFSLTYAFRPGTRSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAH 242 (320)
Q Consensus 163 ~~kW~~~k~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~lah 242 (320)
..+|+ + .|.|.|....+...++.|++||+.|++.+||+|++.... .. .|.|..+ .|| +++-|.. |
T Consensus 43 ~~~Wp--~-~VPY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~rt~e-~~--yi~i~~~-~GC-~S~vG~~---g---- 107 (230)
T cd04282 43 TYRWP--F-PIPYILDDSLDLNAKGVILKAFEMYRLKSCVDFKPYEGE-SN--YIFFFKG-SGC-WSMVGDQ---Q---- 107 (230)
T ss_pred ccCCC--c-ceeEEECCCCCHHHHHHHHHHHHHHHhCCCeeEEECCCC-Cc--EEEEEcC-CCe-eeccCcc---C----
Confidence 46999 7 899999887766788999999999999999999987642 22 3455543 243 4444432 1
Q ss_pred eeCCCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCC
Q 035681 243 AFAPTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSS 288 (320)
Q Consensus 243 a~~P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~ 288 (320)
..-.+.+.. ++....+++|||||||||.|..
T Consensus 108 ----g~q~isl~~-----------~C~~~Gti~HEl~HalGf~HEq 138 (230)
T cd04282 108 ----GGQNLSIGA-----------GCDYKATVEHEFLHALGFYHEQ 138 (230)
T ss_pred ----CeEEEEECC-----------CcCCCchHHHHHHHHhCCcccc
Confidence 011233321 2334579999999999999974
No 17
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=99.36 E-value=4.6e-12 Score=111.43 Aligned_cols=92 Identities=20% Similarity=0.273 Sum_probs=60.5
Q ss_pred ccceEeccCCCchhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceeeeeCCCCce
Q 035681 171 FSLTYAFRPGTRSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAHAFAPTNGR 250 (320)
Q Consensus 171 ~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~~P~~G~ 250 (320)
-.|.|.|+...+...++.|++|++.|++.++|+|++.... ..-| .|..+ .| |+++-|.. |- .-.
T Consensus 4 ~~IpY~i~~~~~~~~~~~I~~A~~~~~~~TCirF~~~~~~-~~yi--~~~~~-~g-C~S~vG~~---gg--------~q~ 67 (182)
T cd04283 4 VYVPYVISPQYSENERAVIEKAMQEFETLTCVRFVPRTTE-RDYL--NIESR-SG-CWSYIGRQ---GG--------RQT 67 (182)
T ss_pred EEEEEEECCCCCHHHHHHHHHHHHHHHhCCceeeEECCCC-CcEE--EEEcC-CC-ceEecCcc---CC--------cee
Confidence 3689999877666778999999999999999999987532 3334 44432 23 34444432 10 011
Q ss_pred eEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCC
Q 035681 251 FHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSS 288 (320)
Q Consensus 251 i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~ 288 (320)
+.+.. .++....+++|||||||||.|..
T Consensus 68 i~l~~----------~~C~~~G~i~HEl~HaLG~~HEh 95 (182)
T cd04283 68 VSLQK----------QGCMYKGIIQHELLHALGFYHEQ 95 (182)
T ss_pred EecCC----------CCcCccchHHHHHHHHhCCcccc
Confidence 22211 12233579999999999999974
No 18
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=99.29 E-value=1.4e-11 Score=121.40 Aligned_cols=100 Identities=24% Similarity=0.412 Sum_probs=69.8
Q ss_pred CCCCCCCCcccceEeccCCCchhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCccee
Q 035681 162 RSPKWPASKFSLTYAFRPGTRSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIA 241 (320)
Q Consensus 162 ~~~kW~~~k~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~la 241 (320)
...+|+ ...|.|.|........+.+|+.||+.|++.++|+|++..+. ..+..+.+... || +++-|.. |
T Consensus 78 ~~~~Wp--~~~ipY~i~~~~~~~~r~~i~~A~~~~~~~TCi~F~~~~~~-~~~~~~~~~~~--gC-~S~VGr~---g--- 145 (411)
T KOG3714|consen 78 PERRWP--NGVIPYYIDGSFTSSQRALIRQAMREIENHTCIRFVERTTP-DKDYLIVFTGG--GC-YSYVGRR---G--- 145 (411)
T ss_pred hhhcCC--CCeeeeEECCCCCHHHHHHHHHHHHHHhhCcceEEEeCCCC-CcceEEEeCCC--cc-eeeeCcc---C---
Confidence 457999 78999999988666889999999999999999999998744 34444444332 43 4444443 1
Q ss_pred eeeCCCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCC
Q 035681 242 HAFAPTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSS 288 (320)
Q Consensus 242 ha~~P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~ 288 (320)
+++ +.-+++ .+.....|++|||+||||+.|.+
T Consensus 146 ------g~~------q~~sl~---~~C~~~G~i~HEl~HaLGf~Heh 177 (411)
T KOG3714|consen 146 ------GGQ------QLLSLG---DGCDRFGTIVHELMHALGFWHEH 177 (411)
T ss_pred ------CCc------cceecC---CCcCcCchhHHHHHHHhhhhhcc
Confidence 111 011111 13344779999999999999975
No 19
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=98.97 E-value=1.3e-09 Score=89.05 Aligned_cols=112 Identities=21% Similarity=0.312 Sum_probs=67.0
Q ss_pred hHHHHHHHHHHHhhh-cCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceeeeeCCCCceeEecCCCCCccC
Q 035681 184 DAITPVARAFQTWAA-NTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAHAFAPTNGRFHYDADEPWAVG 262 (320)
Q Consensus 184 ~~~~~i~~A~~~Ws~-v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~~P~~G~i~~d~~e~w~~~ 262 (320)
+.+..|.+|.+.|++ |.+++|++.. .++|+|.-... ++ +..+..-....|.|.+|.. +
T Consensus 13 ~f~~~i~~aa~iWN~sV~NV~L~~~s---~a~~~~~~~~~----------~~---~sya~~~g~G~G~I~l~~~--~--- 71 (132)
T PF02031_consen 13 EFRSAIAQAAQIWNSSVSNVRLVEGS---SADIRYYEGND----------PR---GSYASTDGLGSGYIFLDYQ--Q--- 71 (132)
T ss_dssp GGHHHHHHHHHHHHHH-SSEEEEE-S---S-SEEEEEE------------TT-----EEEE-SSS-EEEEEEHH--H---
T ss_pred hhHHHHHHHHHHHhcccCceEEeecC---CCcEEEEEecC----------CC---CcccccCCCCcEEEEechH--H---
Confidence 688999999999986 8999999976 38888754321 12 3333333334688888742 2
Q ss_pred CccccchhHHHHHHhhhhhhCCC-CCCC-CCCcccccccCCC---CCCCCHHHHHHHHHHhC
Q 035681 263 AVQGAFDLETVALHEIGHLLGLH-HSSV-EGAIMYPSISAGV---TKGLHGDDIQGIRDLYN 319 (320)
Q Consensus 263 ~~~~~~~l~~Va~HEiGHaLGL~-Hs~~-~~siMyp~~~~~~---~~~L~~dDi~~iq~LYg 319 (320)
..+++-..|++|||||+|||. |-+- ...+|.-... |. +......++..+..||.
T Consensus 72 --~qgy~~~RIaaHE~GHiLGLPD~y~GpCS~LMSG~~a-G~sCtN~~Pna~E~a~V~~~~a 130 (132)
T PF02031_consen 72 --NQGYNSTRIAAHELGHILGLPDHYPGPCSELMSGGSA-GTSCTNAYPNAAERARVNSLFA 130 (132)
T ss_dssp --HHHS-HHHHHHHHHHHHHT----TTS-TT-GGGTTTT--TT----S--HHHHHHHHHHCT
T ss_pred --hhCCccceeeeehhccccCCCCCCCCccHHhhcCCCC-CCCCCCCCCCHHHHHHHHHHhc
Confidence 124677889999999999995 5333 4567875433 42 22567889999999884
No 20
>TIGR02869 spore_SleB spore cortex-lytic enzyme. Members of this protein family are the spore cortex-lytic enzyme SleB from Bacillus subtilis and other Gram-positive, endospore-forming bacterial species. This protein is stored in an inactive form in the spore and activated during germination.
Probab=98.91 E-value=2.8e-09 Score=95.43 Aligned_cols=72 Identities=21% Similarity=0.314 Sum_probs=63.8
Q ss_pred ccCCCCCchhcHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcCCCcccccCHHHHHhhccCCccc
Q 035681 49 KGCHKGDKIKGIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLNYHLKSTGVLDANTVAKMTIPRCGV 128 (320)
Q Consensus 49 ~~~~~g~~~~~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~L~~TG~ld~~T~~~m~~PRCgv 128 (320)
..++.|+.++.|..||+.|...||... ..|+.|++.|+.||+.||+.+||+++|++|..|+..|..| |.+
T Consensus 5 ~~l~~G~~g~~V~~LQ~~L~~lG~~~g---------~idG~fg~~T~~AV~~FQ~~~GL~~dG~vg~~T~~~L~~~-~~~ 74 (201)
T TIGR02869 5 QTYQRGSTGSDVIEIQRRLKAWGYYNG---------KVDGVFGWLTYWAVRKFQSKNGLTVDGIVGPKTKAALGIA-TTQ 74 (201)
T ss_pred ccCCCCCCcHHHHHHHHHHHHcCCCCC---------CCCCccCHHHHHHHHHHHHHhCCCCCCCcCHHHHHHhCCc-ccc
Confidence 478899999999999999999999863 3689999999999999999999999999999999999554 444
Q ss_pred cc
Q 035681 129 AD 130 (320)
Q Consensus 129 ~D 130 (320)
+.
T Consensus 75 ~~ 76 (201)
T TIGR02869 75 QS 76 (201)
T ss_pred Cc
Confidence 43
No 21
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.90 E-value=2.5e-09 Score=94.21 Aligned_cols=127 Identities=23% Similarity=0.273 Sum_probs=76.0
Q ss_pred hHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCC-CCcceeeeeCC-CCceeEecCCCCCcc
Q 035681 184 DAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGG-PEGIIAHAFAP-TNGRFHYDADEPWAV 261 (320)
Q Consensus 184 ~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~-~~g~laha~~P-~~G~i~~d~~e~w~~ 261 (320)
..++++..|++.|++.-|+..+|-. ..|||+|...+. -| .+.++.|. ....++--|+- ..|--|++.-..-..
T Consensus 104 ~wq~a~~tava~wa~~fpl~ive~~--eeaDItie~~n~-pg--tg~~~~grArtal~~~efl~~~~~~~~lnhr~~i~~ 178 (236)
T COG5549 104 RWQGAYLTAVAGWAKTFPLIIVERF--EEADITIEVGNP-PG--TGWRQYGRARTALIAYEFLGHALGLGHLNHRGDIMY 178 (236)
T ss_pred hHHHHHHHHHHHHHHhCCceeeecc--eeeeEEEEecCC-CC--CcccccchHHHHHHHHHHhhhccCcccccccccccC
Confidence 4578999999999999777776654 489999987642 22 22222220 00111111110 111112221111111
Q ss_pred CCccccchhHHHHHHhhhhhhCCC-CCCCCCCcccccccCCCCCCCCHHHHHHHHHHhC
Q 035681 262 GAVQGAFDLETVALHEIGHLLGLH-HSSVEGAIMYPSISAGVTKGLHGDDIQGIRDLYN 319 (320)
Q Consensus 262 ~~~~~~~~l~~Va~HEiGHaLGL~-Hs~~~~siMyp~~~~~~~~~L~~dDi~~iq~LYg 319 (320)
.......++..+|.||+|||||+. ||+-+ +.|.+..+ ...++..|++.+..+|-
T Consensus 179 ~pg~~~e~L~~tarhElGhaLgi~ghsd~k-s~aly~~s---~~~is~rdv~tL~r~Ye 233 (236)
T COG5549 179 PPGELRENLNPTARHELGHALGIWGHSDLK-SDALYGIS---WQGISRRDVKTLERKYE 233 (236)
T ss_pred CcccchhhhhHHHHHhhcchheecccccch-hhhhcchh---hcccCHHHHHHHHHHhc
Confidence 112234578999999999999997 97754 55655444 34599999999999994
No 22
>PRK10594 murein L,D-transpeptidase; Provisional
Probab=98.31 E-value=1.1e-06 Score=89.54 Aligned_cols=84 Identities=19% Similarity=0.272 Sum_probs=65.8
Q ss_pred chhhhhhcccCCCCCchhcHHHHHHHHHhcCCCCCCCc---CCC--------------------------------CCCC
Q 035681 41 PFKFIEHLKGCHKGDKIKGIHELKEYMEKFGYLNYDNY---KNQ--------------------------------STHA 85 (320)
Q Consensus 41 ~~~~~~~~~~~~~g~~~~~v~~l~~yL~~fGYl~~~~~---~~~--------------------------------~~~~ 85 (320)
+|..+..-..++.||..+.|..|.+.|...|++..... ..+ ....
T Consensus 225 ~Wp~i~~~~~LrpG~~~~~V~~LR~RL~~~G~l~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 304 (608)
T PRK10594 225 PWPQLTGKATLRPGQWSNDVPALREILQRTGMLDGGPKITLPGDDTPTDAVVSPSAVTVETAETKPMDKQTTSRSKPAPA 304 (608)
T ss_pred CCcccCCCCCCCCCCCCchHHHHHHHHHHcCCcccccccccccccccccccccccccccccccccccccccccccccccc
Confidence 56666656689999999999999999999999853210 000 0001
Q ss_pred CCchhhHHHHHHHHHHHHhcCCCcccccCHHHHHhhccC
Q 035681 86 DDDDFDELLESAIKTYQLNYHLKSTGVLDANTVAKMTIP 124 (320)
Q Consensus 86 ~~~~f~~~~~~Aik~fQ~~~~L~~TG~ld~~T~~~m~~P 124 (320)
.+..||+++.+|||+||+.+||+.+|++++.|+..|+.|
T Consensus 305 ~~~~YD~~Lv~AVK~FQ~rhGL~~DGvIG~~T~~~LNvs 343 (608)
T PRK10594 305 VRAAYDNELVEAVKRFQAWQGLGADGVIGPRTRDWLNVT 343 (608)
T ss_pred cccccCHHHHHHHHHHHHHcCCCCCcCcCHHHHHHHcCC
Confidence 245699999999999999999999999999999998755
No 23
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=97.84 E-value=2.5e-05 Score=67.89 Aligned_cols=69 Identities=16% Similarity=0.205 Sum_probs=60.0
Q ss_pred ccCCCCCchhcHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcCC-CcccccCHHHHHhhccCC
Q 035681 49 KGCHKGDKIKGIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLNYHL-KSTGVLDANTVAKMTIPR 125 (320)
Q Consensus 49 ~~~~~g~~~~~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~L-~~TG~ld~~T~~~m~~PR 125 (320)
.....+..++.|..+|..|...||... ...++.|+..++.||++||+.++| +++|+++++|+..+...+
T Consensus 37 ~~~~~~~~~~~v~~lq~~L~~~g~~~~--------~~~dg~~g~~t~~av~~fQ~~~gl~~~dG~~g~~t~~al~~~~ 106 (185)
T COG3409 37 PVLTLGAEGPSVRILQAALNALGYYPD--------GVIDGVYGPETAAAVRAFQQKNGLSPVDGIVGPATRAALPSQL 106 (185)
T ss_pred cccccCCCCchHHHHHHHHHhcCCCCC--------CCccCccCcccHHHHHHHhhhcCcccccccccHHHHHHHHHHh
Confidence 345566789999999999999999972 127899999999999999999999 899999999999887753
No 24
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=97.73 E-value=0.00022 Score=62.85 Aligned_cols=48 Identities=29% Similarity=0.308 Sum_probs=34.4
Q ss_pred hhHHHHHHhhhhhhCCCCCCCC----------CCcccccccCCCCCCCCHHHHHHHHH
Q 035681 269 DLETVALHEIGHLLGLHHSSVE----------GAIMYPSISAGVTKGLHGDDIQGIRD 316 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL~Hs~~~----------~siMyp~~~~~~~~~L~~dDi~~iq~ 316 (320)
....|++|||||.||+.|.... .-||.|.........++.--+..|+.
T Consensus 132 ~~~~~~aHElGH~lG~~HD~~~~~~~~c~~~~~~IM~~~~~~~~~~~FS~CS~~~i~~ 189 (192)
T cd04267 132 LTALTMAHELGHNLGAEHDGGDELAFECDGGGNYIMAPVDSGLNSYRFSQCSIGSIRE 189 (192)
T ss_pred eehhhhhhhHHhhcCCcCCCCCCccccCCCCCCeEEcccccCCCCCccChhhHHHHHH
Confidence 4567999999999999997652 24899877631234677766666654
No 25
>PF11350 DUF3152: Protein of unknown function (DUF3152); InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=97.67 E-value=0.00017 Score=63.90 Aligned_cols=113 Identities=25% Similarity=0.371 Sum_probs=72.8
Q ss_pred cccceEeccC--CC---c----hhHHHHHHHHHH---HhhhcCcceEEEecCCCCCceEEEeeeCCCCCC--CCCCCCCC
Q 035681 170 KFSLTYAFRP--GT---R----SDAITPVARAFQ---TWAANTQFKFSKIEDFANADIKISFESGDHGDG--NPFDGRGG 235 (320)
Q Consensus 170 k~~Lty~~~~--~~---~----~~~~~~i~~A~~---~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~--~~fdG~g~ 235 (320)
.+.++|++.- +. + .+..+.|...+. -|.....+.|+.|.+. .+|++|...+..--+. .+.+-.
T Consensus 28 ~~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~~-~~Df~I~Lasp~T~~~lC~g~~~~-- 104 (203)
T PF11350_consen 28 GRLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDSG-APDFRISLASPGTTDRLCAGLDTS-- 104 (203)
T ss_pred CceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCCC-CCCEEEEECCcchhhhhccCcCcC--
Confidence 3678998873 21 1 233344444443 4999899999999976 4999999887654332 122221
Q ss_pred CCcceeeeeCCCCceeEecCCCCCccCCccc----cchhHHHHHHhhhhhhCCCCCCC
Q 035681 236 PEGIIAHAFAPTNGRFHYDADEPWAVGAVQG----AFDLETVALHEIGHLLGLHHSSV 289 (320)
Q Consensus 236 ~~g~laha~~P~~G~i~~d~~e~w~~~~~~~----~~~l~~Va~HEiGHaLGL~Hs~~ 289 (320)
+. ...+.|..+++.+|. ..|..+...- +...+-|+=||.||+||-+|..=
T Consensus 105 --~e-~SC~~~~g~~VvIN~-~RWv~Ga~~f~gdl~~YRqYvINHEVGH~LGh~H~~C 158 (203)
T PF11350_consen 105 --GE-TSCRNPAGGRVVINL-ARWVRGAPAFGGDLASYRQYVINHEVGHALGHGHEPC 158 (203)
T ss_pred --ce-eEeecCCCCeEEEeh-HHhhccCcccCCcHHHHHHHhhhhhhhhhcccCCCcC
Confidence 22 233444577888877 4687765432 23456799999999999999764
No 26
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=97.55 E-value=6e-05 Score=67.72 Aligned_cols=46 Identities=30% Similarity=0.290 Sum_probs=31.3
Q ss_pred HHHHHHhhhhhhCCCCCCCC--------------CCcccccccCCCCCCCCHHHHHHHHHH
Q 035681 271 ETVALHEIGHLLGLHHSSVE--------------GAIMYPSISAGVTKGLHGDDIQGIRDL 317 (320)
Q Consensus 271 ~~Va~HEiGHaLGL~Hs~~~--------------~siMyp~~~~~~~~~L~~dDi~~iq~L 317 (320)
..|++|||||+|||.|..+. .+||.+.-. ....-++.-+|.-|+..
T Consensus 138 ~~~~aHEiGH~lGl~H~~~~~~~~~~~~~~~~~~~tIMsY~~~-~~~~~FS~~~i~~i~~~ 197 (206)
T PF13583_consen 138 YQTFAHEIGHNLGLRHDFDYSGDPYSYAYECDSGQTIMSYASR-PSRPYFSPPSIEYIREV 197 (206)
T ss_pred chHHHHHHHHHhcCCCCcccCCCcccccccCCCCCEEeCCCCC-CCCCCCCchhHHHHhCC
Confidence 46799999999999997642 268863222 22234778888877643
No 27
>COG2989 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=97.54 E-value=0.00014 Score=72.66 Aligned_cols=71 Identities=20% Similarity=0.250 Sum_probs=60.7
Q ss_pred hhcccCCCCCchhcHHHHHHHHHhcCC-CCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcCCCcccccCHHHHHhhcc
Q 035681 46 EHLKGCHKGDKIKGIHELKEYMEKFGY-LNYDNYKNQSTHADDDDFDELLESAIKTYQLNYHLKSTGVLDANTVAKMTI 123 (320)
Q Consensus 46 ~~~~~~~~g~~~~~v~~l~~yL~~fGY-l~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~L~~TG~ld~~T~~~m~~ 123 (320)
..-..++.|+....|..|.+.|..-|= ++.. ......||.++.+|||+||+.+||+.+|++++.|++.|.-
T Consensus 227 ~~~~~LrpG~~~~~v~aL~~~L~~~~~d~~~a-------~~~s~~yd~el~~avKrfQ~~~GL~~DGviG~~T~~aLn~ 298 (561)
T COG2989 227 PAGALLRPGVTSPDVPALRARLARSGMDLPSA-------AGSSPAYDPELVEAVKRFQARHGLPADGVIGPATRAALNV 298 (561)
T ss_pred CCccccCCCCCchhHHHHHHHHHhcCccchhh-------ccCcccccHHHHHHHHHHHHHhCCCCCCcccHHHHHHhcc
Confidence 555678999999999999999999984 3322 2244589999999999999999999999999999999964
No 28
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=97.43 E-value=6.7e-05 Score=64.94 Aligned_cols=30 Identities=43% Similarity=0.586 Sum_probs=26.2
Q ss_pred hHHHHHHhhhhhhCCCCCCCCCCccccccc
Q 035681 270 LETVALHEIGHLLGLHHSSVEGAIMYPSIS 299 (320)
Q Consensus 270 l~~Va~HEiGHaLGL~Hs~~~~siMyp~~~ 299 (320)
+..=++||+||+|||.|=+++.-||++..+
T Consensus 124 v~KEv~HElGH~~GL~HC~N~~CVM~FSnS 153 (181)
T COG1913 124 VVKEVLHELGHLLGLSHCPNPRCVMNFSNS 153 (181)
T ss_pred HHHHHHHHhhhhcCcccCCCCCcEEeCCcc
Confidence 445689999999999999999999998754
No 29
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=97.40 E-value=0.00011 Score=63.14 Aligned_cols=23 Identities=35% Similarity=0.642 Sum_probs=15.6
Q ss_pred cchhHHHHHHhhhhhhCCCCCCC
Q 035681 267 AFDLETVALHEIGHLLGLHHSSV 289 (320)
Q Consensus 267 ~~~l~~Va~HEiGHaLGL~Hs~~ 289 (320)
..+...|++||+||-|||.|...
T Consensus 66 ~~~~g~TltHEvGH~LGL~HtF~ 88 (154)
T PF05572_consen 66 QYNFGKTLTHEVGHWLGLYHTFG 88 (154)
T ss_dssp TS-SSHHHHHHHHHHTT---TT-
T ss_pred ccccccchhhhhhhhhccccccc
Confidence 34667899999999999999864
No 30
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=97.35 E-value=0.00051 Score=61.53 Aligned_cols=116 Identities=18% Similarity=0.269 Sum_probs=63.2
Q ss_pred HHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceeeeeCCCCceeEecCCCCCccCCcccc
Q 035681 188 PVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAHAFAPTNGRFHYDADEPWAVGAVQGA 267 (320)
Q Consensus 188 ~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~~P~~G~i~~d~~e~w~~~~~~~~ 267 (320)
..-+.|..|.......-. .+....|+.+-+...+.... ++++ ++++.|+.- | + ......+.+.. ..+
T Consensus 71 ~~L~~F~~w~~~~~~~~~--~~~~~~D~a~llt~~d~~~~---~~~~---~~~G~A~~g--g-i-C~~~~s~~i~~-~~~ 137 (207)
T cd04273 71 KSLKSFCRWQKKLNPPND--SDPEHHDHAILLTRQDICRS---NGNC---DTLGLAPVG--G-M-CSPSRSCSINE-DTG 137 (207)
T ss_pred HHHHHHHHHHHHcCCccc--ccccccceEEEEeeeccccc---CCCC---CceEEeccc--c-C-CCCCcceEEEc-CCC
Confidence 346889999774332110 11235777766665433211 3555 777777641 1 1 00000111110 012
Q ss_pred chhHHHHHHhhhhhhCCCCCCC---------CCCcccccccCC-CCCCCCHHHHHHHHH
Q 035681 268 FDLETVALHEIGHLLGLHHSSV---------EGAIMYPSISAG-VTKGLHGDDIQGIRD 316 (320)
Q Consensus 268 ~~l~~Va~HEiGHaLGL~Hs~~---------~~siMyp~~~~~-~~~~L~~dDi~~iq~ 316 (320)
.....|++|||||.||+.|-.+ ..-||+|..... ....++.-=+..++.
T Consensus 138 ~~~a~~~aHElGH~LG~~HD~~~~~C~~~~~~~~IM~~~~~~~~~~~~fS~CS~~~~~~ 196 (207)
T cd04273 138 LSSAFTIAHELGHVLGMPHDGDGNSCGPEGKDGHIMSPTLGANTGPFTWSKCSRRYLTS 196 (207)
T ss_pred ceeEEeeeeechhhcCCCCCCCCCCCCCCCCCceeecccccCCCCCCCcCHHHHHHHHH
Confidence 3345799999999999999765 268999988742 233566544444443
No 31
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=97.28 E-value=0.00012 Score=64.04 Aligned_cols=47 Identities=28% Similarity=0.255 Sum_probs=19.3
Q ss_pred hHHHHHHhhhhhhCCCCCCCC-------------CCcccccccCCCCCCCCHHHHHHHHHH
Q 035681 270 LETVALHEIGHLLGLHHSSVE-------------GAIMYPSISAGVTKGLHGDDIQGIRDL 317 (320)
Q Consensus 270 l~~Va~HEiGHaLGL~Hs~~~-------------~siMyp~~~~~~~~~L~~dDi~~iq~L 317 (320)
...|++|||||.||+.|..+. ..||++.... ....++.--|..|+++
T Consensus 111 ~~~~~aHElGH~lGa~Hd~~~~~~~~~~~~~~sg~~IMsy~~~~-~~~~fS~cS~~~i~~~ 170 (173)
T PF13574_consen 111 GIDTFAHELGHQLGAPHDFDNGCNPGSPTCPGSGGYIMSYAGIC-NNDKFSPCSIRQIRAV 170 (173)
T ss_dssp HHHHHHHHHHHHHT---SSSS----SS--T-STTT-TT-SS-GG-GG--------------
T ss_pred eeeeehhhhHhhcCCCCCCCCCCCCCCCcCCCCCCEEEeeCCCC-CCcccccccccccccc
Confidence 457899999999999998654 2789865321 2236777777777654
No 32
>COG3023 ampD N-acetyl-anhydromuramyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]
Probab=97.21 E-value=0.00071 Score=61.97 Aligned_cols=58 Identities=21% Similarity=0.339 Sum_probs=49.4
Q ss_pred CCchhcHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcCC-CcccccCHHHHHhhc
Q 035681 54 GDKIKGIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLNYHL-KSTGVLDANTVAKMT 122 (320)
Q Consensus 54 g~~~~~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~L-~~TG~ld~~T~~~m~ 122 (320)
+-+..+|..||+.|.+|||..+ . +.||..++.+|++||.-|+= ..+|+.|.+|++.+.
T Consensus 194 ~~~~~~v~~lq~~L~~YGY~v~----------~-~~~d~~t~~vv~aFQ~hfrp~~~dg~~d~et~a~l~ 252 (257)
T COG3023 194 LLKGEDVAALQEMLARYGYGVE----------I-GVFDQETQQVVRAFQMHFRPGLYDGEADVETIAILQ 252 (257)
T ss_pred hhccCCHHHHHHHHHHhCcCCC----------c-chhhHHHHHHHHHHHHHhCCCCCCCCCChHHHHHHH
Confidence 3344889999999999999862 3 78999999999999998875 569999999998763
No 33
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=97.20 E-value=0.00024 Score=62.55 Aligned_cols=29 Identities=34% Similarity=0.494 Sum_probs=25.2
Q ss_pred hHHHHHHhhhhhhCCCCCCCCCCcccccc
Q 035681 270 LETVALHEIGHLLGLHHSSVEGAIMYPSI 298 (320)
Q Consensus 270 l~~Va~HEiGHaLGL~Hs~~~~siMyp~~ 298 (320)
+..+++||+||.|||.|-.++.-+|+..-
T Consensus 125 ~~k~~~HElGH~lGL~HC~~~~CvM~~s~ 153 (179)
T PRK13267 125 VRKEVTHELGHTLGLEHCDNPRCVMNFSN 153 (179)
T ss_pred HHHHHHHHHHHHcCCccCCCCCccCCCCC
Confidence 45679999999999999999999999643
No 34
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=97.08 E-value=0.00025 Score=62.62 Aligned_cols=23 Identities=35% Similarity=0.459 Sum_probs=14.9
Q ss_pred chhHHHHHHhhhhhhCCCCCCCC
Q 035681 268 FDLETVALHEIGHLLGLHHSSVE 290 (320)
Q Consensus 268 ~~l~~Va~HEiGHaLGL~Hs~~~ 290 (320)
.....|++|||||.||+.|..+.
T Consensus 140 ~~~~~~~AHEiGH~lGa~HD~~~ 162 (196)
T PF13688_consen 140 YNGAITFAHEIGHNLGAPHDGDY 162 (196)
T ss_dssp HHHHHHHHHHHHHHTT-----SS
T ss_pred CceehhhHHhHHHhcCCCCCCCC
Confidence 34567999999999999997764
No 35
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=97.07 E-value=0.00032 Score=62.36 Aligned_cols=30 Identities=40% Similarity=0.559 Sum_probs=20.8
Q ss_pred hhHHHHHHhhhhhhCCCCCCCCCCccccccc
Q 035681 269 DLETVALHEIGHLLGLHHSSVEGAIMYPSIS 299 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL~Hs~~~~siMyp~~~ 299 (320)
.+..+++||+||++||.|=.+ .-+|++..+
T Consensus 144 R~~Kea~HElGH~~GL~HC~~-~CvM~~s~s 173 (194)
T PF07998_consen 144 RVCKEAVHELGHLFGLDHCEN-RCVMNFSNS 173 (194)
T ss_dssp HHHHHHHHHHHHHTT----SS-TSTTS--SS
T ss_pred HHHHHHHHHHHHHcCCcCCCC-CCccCCCCC
Confidence 456799999999999999999 999998643
No 36
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=97.06 E-value=0.00022 Score=58.12 Aligned_cols=17 Identities=53% Similarity=0.786 Sum_probs=12.1
Q ss_pred HHHHHHhhhhhhCCCCC
Q 035681 271 ETVALHEIGHLLGLHHS 287 (320)
Q Consensus 271 ~~Va~HEiGHaLGL~Hs 287 (320)
..|++|||||.|||.|+
T Consensus 108 ~~~~~HEiGH~lGl~Hd 124 (124)
T PF13582_consen 108 VDTFAHEIGHNLGLNHD 124 (124)
T ss_dssp TTHHHHHHHHHTT----
T ss_pred ceEeeehhhHhcCCCCC
Confidence 47999999999999995
No 37
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=96.80 E-value=0.0011 Score=60.43 Aligned_cols=45 Identities=36% Similarity=0.402 Sum_probs=32.0
Q ss_pred HHHHHhhhhhhCCCCCCCC-------------------------CCcccccccCCCCCCCCHHHHHHHHHH
Q 035681 272 TVALHEIGHLLGLHHSSVE-------------------------GAIMYPSISAGVTKGLHGDDIQGIRDL 317 (320)
Q Consensus 272 ~Va~HEiGHaLGL~Hs~~~-------------------------~siMyp~~~~~~~~~L~~dDi~~iq~L 317 (320)
.|++|||||.||+.|-.+. .=||+|.... ..+.++.-=+.-|+.+
T Consensus 147 ~t~AHElGHnLGm~HD~~~~~C~~~~~~~~~~Cp~~~~~C~~~~~~IM~~~~~~-~~~~FS~CS~~~~~~~ 216 (228)
T cd04271 147 QVFAHEIGHTFGAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPSSSS-GITEFSPCTIGNICSL 216 (228)
T ss_pred eehhhhhhhhcCCCCCCCCCccccccccccccCCCcccccCCCCCEEeecCcCC-CCCccChhhHHHHHHH
Confidence 6999999999999996431 1199998764 3446776666666544
No 38
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=96.77 E-value=0.0016 Score=57.44 Aligned_cols=31 Identities=39% Similarity=0.546 Sum_probs=25.0
Q ss_pred hhHHHHHHhhhhhhCCCCCCC------CCCccccccc
Q 035681 269 DLETVALHEIGHLLGLHHSSV------EGAIMYPSIS 299 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL~Hs~~------~~siMyp~~~ 299 (320)
..-.|++|||||.||+.|-.+ ..-||+|...
T Consensus 130 ~~a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~ 166 (194)
T cd04269 130 LFAVTMAHELGHNLGMEHDDGGCTCGRSTCIMAPSPS 166 (194)
T ss_pred HHHHHHHHHHHhhcCCCcCCCCCCCCCCCeEecCCCC
Confidence 445799999999999999543 3679999866
No 39
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=96.68 E-value=0.00035 Score=63.70 Aligned_cols=24 Identities=46% Similarity=0.800 Sum_probs=20.8
Q ss_pred chhHHHHHHhhhhhhCCCCCCCCC
Q 035681 268 FDLETVALHEIGHLLGLHHSSVEG 291 (320)
Q Consensus 268 ~~l~~Va~HEiGHaLGL~Hs~~~~ 291 (320)
+++..|++||+||.|||.|+....
T Consensus 135 ~n~g~t~~HEvGH~lGL~HtF~~~ 158 (225)
T cd04275 135 YNLGDTATHEVGHWLGLYHTFQGG 158 (225)
T ss_pred ccccceeEEeccceeeeeeeecCC
Confidence 577789999999999999987554
No 40
>PF08823 PG_binding_2: Putative peptidoglycan binding domain; InterPro: IPR014927 This entry may be a peptidoglycan binding domain.
Probab=96.55 E-value=0.0056 Score=46.10 Aligned_cols=55 Identities=22% Similarity=0.375 Sum_probs=44.8
Q ss_pred hhcHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcCCC----cccccCHHHHHh
Q 035681 57 IKGIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLNYHLK----STGVLDANTVAK 120 (320)
Q Consensus 57 ~~~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~L~----~TG~ld~~T~~~ 120 (320)
.+-+.++|+-|++.||+..+ .++.+++.+++|+++|+..-||. ..|.+|...++-
T Consensus 15 ~~~~~evq~~L~~lGyy~g~---------~~g~~d~a~~~Al~~~~g~ENfE~R~~~~~~ID~~VL~~ 73 (74)
T PF08823_consen 15 GDVAREVQEALKRLGYYKGE---------ADGVWDEATEDALRAWAGTENFEERIRGDGKIDPVVLAY 73 (74)
T ss_pred HHHHHHHHHHHHHcCCccCC---------CCCcccHHHHHHHHHHHHHhhHHhhcCCCCccCHHHHhh
Confidence 34578999999999998843 77889999999999999988874 356688877654
No 41
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=96.50 E-value=0.0054 Score=53.06 Aligned_cols=62 Identities=24% Similarity=0.310 Sum_probs=53.5
Q ss_pred CCCchhcHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcCCCcccccCHHHHHhhc
Q 035681 53 KGDKIKGIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLNYHLKSTGVLDANTVAKMT 122 (320)
Q Consensus 53 ~g~~~~~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~L~~TG~ld~~T~~~m~ 122 (320)
.+.....+..++.-+...+|... ...++.|+..++.||+.||+.++|+++|++...|...+.
T Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~--------~~~dg~fg~~t~~~v~~~q~~~~l~~dgi~g~~t~~~l~ 184 (185)
T COG3409 123 LGLGGGDVATLQQPLPLLGYRSG--------IRVDGIFGPQTEAAVKAFQRQYGLTVDGIVGPQTWAALR 184 (185)
T ss_pred ccccccchHHHHHHHHhcccccC--------CCCCCcccHHHHHHHHHHHHHhCCCCCeeecHHHHHHhh
Confidence 36677778888888888888774 237899999999999999999999999999999988763
No 42
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=96.14 E-value=0.009 Score=52.93 Aligned_cols=48 Identities=27% Similarity=0.514 Sum_probs=28.9
Q ss_pred chhHHHHHHhhhhhhCCCCCCC--------CCCcccccccCCCCCCCCHHHHHHHH
Q 035681 268 FDLETVALHEIGHLLGLHHSSV--------EGAIMYPSISAGVTKGLHGDDIQGIR 315 (320)
Q Consensus 268 ~~l~~Va~HEiGHaLGL~Hs~~--------~~siMyp~~~~~~~~~L~~dDi~~iq 315 (320)
..+-.+++|||||.||+.|-.. ..-||.|.........++.-=+..++
T Consensus 129 ~~~a~~~AHelGH~lGm~HD~~~~C~C~~~~~cIM~~~~~~~~~~~fS~CS~~~~~ 184 (199)
T PF01421_consen 129 LSFAVIIAHELGHNLGMPHDGDNGCKCPRRDGCIMSPSIGPSSSDKFSNCSRRQFE 184 (199)
T ss_dssp HHHHHHHHHHHHHHTT---TTTCTHCSSSSSSSTTSSSSSSSSTSSS-HHHHHHHH
T ss_pred HHHHHHHHHHHHHhcCCCCCCCCcCCcCCCCCccccccccCCCCCCcCHHHHHHHH
Confidence 3556799999999999999655 35789996653333356554444433
No 43
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=96.14 E-value=0.0075 Score=55.59 Aligned_cols=46 Identities=28% Similarity=0.416 Sum_probs=31.0
Q ss_pred hHHHHHHhhhhhhCCCCCCC-----C------CCcccccccCC---CCCCCCHHHHHHHH
Q 035681 270 LETVALHEIGHLLGLHHSSV-----E------GAIMYPSISAG---VTKGLHGDDIQGIR 315 (320)
Q Consensus 270 l~~Va~HEiGHaLGL~Hs~~-----~------~siMyp~~~~~---~~~~L~~dDi~~iq 315 (320)
...|++|||||.||+.|-.. + .=||+|....+ ....++.-=++.++
T Consensus 167 ~a~t~AHElGHnlGm~HD~~~~~C~~~~~~~~~cIM~~~~~~~~~~~~~~FS~CS~~~~~ 226 (244)
T cd04270 167 SDLVTAHELGHNFGSPHDPDIAECAPGESQGGNYIMYARATSGDKENNKKFSPCSKKSIS 226 (244)
T ss_pred HHHHHHHHHHHhcCCCCCCCCCCCCCCCCCCCceEecCccccCCCCCCCCCCHhHHHHHH
Confidence 45799999999999999655 1 34899986432 12356654444443
No 44
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=95.79 E-value=0.011 Score=53.32 Aligned_cols=47 Identities=26% Similarity=0.268 Sum_probs=29.8
Q ss_pred hhHHHHHHhhhhhhCCCCCCC------------------CCCcccccccCCCCCCCCHHHHHHHH
Q 035681 269 DLETVALHEIGHLLGLHHSSV------------------EGAIMYPSISAGVTKGLHGDDIQGIR 315 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL~Hs~~------------------~~siMyp~~~~~~~~~L~~dDi~~iq 315 (320)
....+++|||||.||+.|-.+ ..-||.|.........++.--+.-++
T Consensus 144 ~~~~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~~C~~~~~~IM~~~~~~~~~~~fS~CS~~~~~ 208 (220)
T cd04272 144 YGVYTMTHELAHLLGAPHDGSPPPSWVKGHPGSLDCPWDDGYIMSYVVNGERQYRFSQCSQRQIR 208 (220)
T ss_pred ccHHHHHHHHHHHhCCCCCCCCCCccCCCCCccCcCCCCCCeEEeeccCCCCCCccCcchHHHHH
Confidence 346899999999999999532 24689986442122345554444443
No 45
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=94.07 E-value=0.031 Score=53.50 Aligned_cols=18 Identities=44% Similarity=0.771 Sum_probs=16.5
Q ss_pred HHHHHHhhhhhhCCCCCC
Q 035681 271 ETVALHEIGHLLGLHHSS 288 (320)
Q Consensus 271 ~~Va~HEiGHaLGL~Hs~ 288 (320)
..|+.||+||-|||.|+.
T Consensus 151 ~~~~~HElgHN~GL~Ha~ 168 (314)
T PF05548_consen 151 WATIMHELGHNLGLWHAG 168 (314)
T ss_pred HHHHHHHhhhhccccccC
Confidence 469999999999999995
No 46
>PF10462 Peptidase_M66: Peptidase M66; InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=93.93 E-value=0.027 Score=53.51 Aligned_cols=20 Identities=35% Similarity=0.527 Sum_probs=14.9
Q ss_pred hhHHHHHHhhhhhhCCCCCC
Q 035681 269 DLETVALHEIGHLLGLHHSS 288 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL~Hs~ 288 (320)
....+.+||+||++||+|.+
T Consensus 192 s~~~~f~HE~GH~~GL~H~p 211 (305)
T PF10462_consen 192 SYGNEFSHELGHNFGLGHYP 211 (305)
T ss_dssp -SHHHHHHHHHHTTT--SST
T ss_pred CccceeehhhhhhcCCCCCC
Confidence 45789999999999999944
No 47
>PF12044 Metallopep: Putative peptidase family; InterPro: IPR021917 This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length.
Probab=93.22 E-value=0.052 Score=53.86 Aligned_cols=26 Identities=42% Similarity=0.583 Sum_probs=20.7
Q ss_pred HHHHHHhhhhhhCCCCCCCCCCcccccc
Q 035681 271 ETVALHEIGHLLGLHHSSVEGAIMYPSI 298 (320)
Q Consensus 271 ~~Va~HEiGHaLGL~Hs~~~~siMyp~~ 298 (320)
....+|||||++|+.|..+ .||---|
T Consensus 317 iGA~lHEiGH~fg~pH~~~--GIM~Rdy 342 (423)
T PF12044_consen 317 IGAFLHEIGHLFGCPHQED--GIMLRDY 342 (423)
T ss_pred hHHHHHHHHHhcCCCCCCC--CeeeCCc
Confidence 4578999999999999875 7885333
No 48
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=92.72 E-value=0.11 Score=41.08 Aligned_cols=28 Identities=32% Similarity=0.435 Sum_probs=18.9
Q ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhh
Q 035681 1 MASKAFLSFSIFAVIQIILLSLLSHNYSV 29 (320)
Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 29 (320)
|++|.+++|.++|++ +||+++-++|+.+
T Consensus 1 MaSK~~llL~l~LA~-lLlisSevaa~~~ 28 (95)
T PF07172_consen 1 MASKAFLLLGLLLAA-LLLISSEVAAREL 28 (95)
T ss_pred CchhHHHHHHHHHHH-HHHHHhhhhhHHh
Confidence 998888777777776 5566555555443
No 49
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=92.61 E-value=0.085 Score=53.93 Aligned_cols=60 Identities=22% Similarity=0.315 Sum_probs=28.8
Q ss_pred CCceEEEeeeCCCCCCCCCCCCCCCCcceeeee--------CCCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhC
Q 035681 212 NADIKISFESGDHGDGNPFDGRGGPEGIIAHAF--------APTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLG 283 (320)
Q Consensus 212 ~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~--------~P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLG 283 (320)
++|+.|.......+. +++|.|. -|.-|.|.|+... +... ....+..|++|||.|+||
T Consensus 159 ~~D~vlyV~~~~~~~-----------~~~A~A~~C~~~~~~RP~~G~in~~p~~---i~~~-~~~~~~~~~~HEi~HaLG 223 (521)
T PF01457_consen 159 NADFVLYVTARPSSS-----------STLAWAAPCQQDSDGRPIAGVININPSY---IPSF-YFQEFFRTVIHEIAHALG 223 (521)
T ss_dssp S-SEEEEEE----ST-----------T---EEEEEEE-TTS-EEEEEEE--GGG------S---HHHHHHHHHHHHHHTT
T ss_pred CccEEEEEEEeeccC-----------CceEEEeehhhccCCCCeeEEEEEchhH---ccch-hhhcccceeeeeeeeeee
Confidence 577776655432211 4666554 2455777776532 1110 123567899999999999
Q ss_pred CCC
Q 035681 284 LHH 286 (320)
Q Consensus 284 L~H 286 (320)
+.-
T Consensus 224 Fs~ 226 (521)
T PF01457_consen 224 FSS 226 (521)
T ss_dssp -SH
T ss_pred ecc
Confidence 975
No 50
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=92.08 E-value=0.04 Score=51.97 Aligned_cols=19 Identities=42% Similarity=0.464 Sum_probs=16.2
Q ss_pred HHHHHHhhhhhhCCCCCCC
Q 035681 271 ETVALHEIGHLLGLHHSSV 289 (320)
Q Consensus 271 ~~Va~HEiGHaLGL~Hs~~ 289 (320)
..|++||+||.|||.+--+
T Consensus 166 igv~~HE~gH~lGLPDlY~ 184 (286)
T TIGR03296 166 VGVIAHELGHDLGLPDLYD 184 (286)
T ss_pred eeeeehhhhcccCCCcccc
Confidence 5799999999999987543
No 51
>PF11150 DUF2927: Protein of unknown function (DUF2927); InterPro: IPR021323 This family is conserved in Proteobacteria. Several members are described as being putative lipoproteins, but otherwise the function is not known.
Probab=91.19 E-value=4.5 Score=36.59 Aligned_cols=148 Identities=17% Similarity=0.104 Sum_probs=80.7
Q ss_pred CCCCCCcccceEeccCCCch---hHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCC---------CCCC
Q 035681 164 PKWPASKFSLTYAFRPGTRS---DAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDG---------NPFD 231 (320)
Q Consensus 164 ~kW~~~k~~Lty~~~~~~~~---~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~---------~~fd 231 (320)
.||. ..|+.++....+. ...+.+..-+++-..++.+.+..++.. +|+|.|.|...+.... ..+.
T Consensus 28 ~Rw~---~PVrv~v~~~~~~~~~~d~~~v~~~~~rL~~itg~~I~~~~~~-~aN~~v~~~~~~~~~~~ir~~~p~~~~~~ 103 (213)
T PF11150_consen 28 RRWE---GPVRVRVEGVPPADRARDLARVRAYLARLRRITGHPISQVSSP-NANFHVIFVSEDDWRPRIRRLLPGAACFV 103 (213)
T ss_pred ccCC---CCeEEEEeccChhhHHHHHHHHHHHHhhhccccCCceeeccCC-CCcEEEEEeccchhhHHHHHhcccccccc
Confidence 5897 4799998844333 234566666788888898888888876 8999999987655110 0000
Q ss_pred CC-----CCCCcceeeeeCCCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCC-CCCC-CCCcccccccCCCCC
Q 035681 232 GR-----GGPEGIIAHAFAPTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLH-HSSV-EGAIMYPSISAGVTK 304 (320)
Q Consensus 232 G~-----g~~~g~laha~~P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~-Hs~~-~~siMyp~~~~~~~~ 304 (320)
=+ .....-+..++....+.+.....-. .+.......-+...+..||-.+|||. .|.. +.||-+-- ....
T Consensus 104 ~~~~~~~~~~~~c~~~~~~~~~~~~~~~~~vi-~i~~~~~~~~~~~Ci~EElaQ~LGl~NDs~~~~pSIFNDd---~~~~ 179 (213)
T PF11150_consen 104 VPNFSSLPRYARCLVFARSNWSGEITRARAVI-FIRADHPPQERRDCIHEELAQGLGLPNDSPRARPSIFNDD---NEFA 179 (213)
T ss_pred chhhhcccccceEEEEEEecCCCceEEEEEEE-EEeccCCchHHHHHHHHHHHHHcCCccCCCcCCCceeeCC---Cccc
Confidence 00 0000111222221111111100000 00000112356778888999999995 3332 23332211 1233
Q ss_pred CCCHHHHHHHHHHhC
Q 035681 305 GLHGDDIQGIRDLYN 319 (320)
Q Consensus 305 ~L~~dDi~~iq~LYg 319 (320)
.|+.-|..=|+.||-
T Consensus 180 ~LT~~D~lLLr~LYd 194 (213)
T PF11150_consen 180 LLTGFDELLLRMLYD 194 (213)
T ss_pred cccHHHHHHHHHHcC
Confidence 789999999999994
No 52
>PTZ00337 surface protease GP63; Provisional
Probab=90.33 E-value=0.25 Score=50.99 Aligned_cols=36 Identities=28% Similarity=0.371 Sum_probs=24.8
Q ss_pred CCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCC
Q 035681 246 PTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHS 287 (320)
Q Consensus 246 P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs 287 (320)
|.-|.|.|+....= .......|++|||.||||+...
T Consensus 211 Pi~G~in~np~~i~------~~~~~~~v~~HEi~HALGFs~~ 246 (567)
T PTZ00337 211 PFAAAVNFDPRQIA------VTNGDVRVAAHELGHALGFVRD 246 (567)
T ss_pred ceEEEEEECHHHcc------chhHHHHHHHHHHHHHHccCHH
Confidence 56677877763321 1124678999999999999653
No 53
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=89.53 E-value=0.16 Score=41.56 Aligned_cols=18 Identities=44% Similarity=0.822 Sum_probs=15.4
Q ss_pred hhHHHHHHhhhhhhCCCC
Q 035681 269 DLETVALHEIGHLLGLHH 286 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL~H 286 (320)
.+..|++|||||-.||.-
T Consensus 108 ~vthvliHEIgHhFGLsD 125 (136)
T COG3824 108 QVTHVLIHEIGHHFGLSD 125 (136)
T ss_pred HhhhhhhhhhhhhcCCCh
Confidence 456899999999999964
No 54
>PF09374 PG_binding_3: Predicted Peptidoglycan domain; InterPro: IPR018537 This family contains a potential peptidoglycan binding domain. ; PDB: 2IKB_B 2IS5_A 2NR7_A.
Probab=88.97 E-value=0.37 Score=35.92 Aligned_cols=26 Identities=27% Similarity=0.376 Sum_probs=20.4
Q ss_pred HHHHHHHHhcC----CCcccccCHHHHHhh
Q 035681 96 SAIKTYQLNYH----LKSTGVLDANTVAKM 121 (320)
Q Consensus 96 ~Aik~fQ~~~~----L~~TG~ld~~T~~~m 121 (320)
.|+|-.|+..| +.++|++++.|++.+
T Consensus 1 rAik~LQ~alg~~~~v~~DG~iGp~Tl~Al 30 (72)
T PF09374_consen 1 RAIKFLQRALGLGADVAVDGIIGPKTLAAL 30 (72)
T ss_dssp HHHHHHHHHHT----S--SS--SHHHHHHH
T ss_pred CHHHHHHHHHccCCccCCCCCcCHHHHHHH
Confidence 48899999999 999999999999999
No 55
>KOG4525 consensus Jacalin-like lectin domain-containing protein [General function prediction only]
Probab=88.76 E-value=0.24 Score=48.87 Aligned_cols=26 Identities=35% Similarity=0.521 Sum_probs=20.4
Q ss_pred HHHHHHhhhhhhCCCCCCCCCCcccccc
Q 035681 271 ETVALHEIGHLLGLHHSSVEGAIMYPSI 298 (320)
Q Consensus 271 ~~Va~HEiGHaLGL~Hs~~~~siMyp~~ 298 (320)
...++|||||.||+.|.+ +.||--.|
T Consensus 302 lGA~~HElGH~lgcpHq~--~GvMlrgy 327 (614)
T KOG4525|consen 302 LGAVCHELGHCLGCPHQS--EGVMLRGY 327 (614)
T ss_pred HHHHHHHhhhccCCCCCC--CceEecCc
Confidence 457899999999999976 56884443
No 56
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=88.18 E-value=0.73 Score=41.85 Aligned_cols=18 Identities=39% Similarity=0.689 Sum_probs=13.3
Q ss_pred hhHHHHHHhhhhhhCCCCCC
Q 035681 269 DLETVALHEIGHLLGLHHSS 288 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL~Hs~ 288 (320)
.-..|++||.|||+ .|..
T Consensus 88 aAvaVAAHEvGHAi--Q~a~ 105 (222)
T PF04298_consen 88 AAVAVAAHEVGHAI--QHAE 105 (222)
T ss_pred HHHHHHHHHHhHHH--hccc
Confidence 34579999999994 5543
No 57
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=87.81 E-value=0.72 Score=47.88 Aligned_cols=38 Identities=21% Similarity=0.282 Sum_probs=24.8
Q ss_pred CCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCC
Q 035681 246 PTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHS 287 (320)
Q Consensus 246 P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs 287 (320)
|.-|.+.|+....- . ........|++|||.|+||+.-+
T Consensus 236 Pi~G~iNinp~~i~---s-~~~~~~~rv~~HEi~HALGFS~~ 273 (622)
T PTZ00257 236 PAVGVMNIPAANIV---S-RYDQGTTRTVTHEVAHALGFSSV 273 (622)
T ss_pred ceEEEEeeCHHHCC---C-ccchHHHHHHHHHHHHHhcCCHH
Confidence 56677877753211 0 01124678999999999999543
No 58
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=87.29 E-value=0.41 Score=49.61 Aligned_cols=46 Identities=28% Similarity=0.419 Sum_probs=34.4
Q ss_pred HHHHHHhhhhhhCCCCCCC----------CCCcccccccCCCC---CCCCHHHHHHHHH
Q 035681 271 ETVALHEIGHLLGLHHSSV----------EGAIMYPSISAGVT---KGLHGDDIQGIRD 316 (320)
Q Consensus 271 ~~Va~HEiGHaLGL~Hs~~----------~~siMyp~~~~~~~---~~L~~dDi~~iq~ 316 (320)
.-|++|||||..|=.|-++ -+=|||++...|++ .+++.--+..|..
T Consensus 393 ~lt~AHEiGHNfGSpHDp~~ECsPg~~~~GnyiM~a~AtSGd~~NN~kFSpCS~ksI~~ 451 (764)
T KOG3658|consen 393 DLTLAHEIGHNFGSPHDPDIECSPGESQGGNYIMYAYATSGDKPNNKKFSPCSLKSISK 451 (764)
T ss_pred heeehhhhccccCCCCCCCCccCCCCCCCCcEEEEEecccCcCccCcccCcchHHHHHH
Confidence 4589999999999999665 13489999887643 3777776666653
No 59
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=85.81 E-value=1 Score=40.12 Aligned_cols=17 Identities=12% Similarity=0.173 Sum_probs=13.6
Q ss_pred hHHHHHHHHHHHhhhcC
Q 035681 184 DAITPVARAFQTWAANT 200 (320)
Q Consensus 184 ~~~~~i~~A~~~Ws~v~ 200 (320)
..+.-+..++++|++|.
T Consensus 19 wAQ~kvks~y~KySkV~ 35 (226)
T COG2738 19 WAQSKVKSTYKKYSKVR 35 (226)
T ss_pred HHHHHHHHHHHHHhhcc
Confidence 45667889999999984
No 60
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=85.69 E-value=0.48 Score=37.52 Aligned_cols=16 Identities=31% Similarity=0.692 Sum_probs=12.8
Q ss_pred hHHHHHHhhhhhhCCC
Q 035681 270 LETVALHEIGHLLGLH 285 (320)
Q Consensus 270 l~~Va~HEiGHaLGL~ 285 (320)
+..|++|||||.+|+.
T Consensus 73 I~~tlvhEiah~fG~~ 88 (97)
T PF06262_consen 73 IRDTLVHEIAHHFGIS 88 (97)
T ss_dssp HHHHHHHHHHHHTT--
T ss_pred HHHHHHHHHHHHcCCC
Confidence 5679999999999985
No 61
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=85.67 E-value=0.44 Score=44.50 Aligned_cols=19 Identities=37% Similarity=0.746 Sum_probs=12.0
Q ss_pred hhHHHHHHhhhhhh-CCCCC
Q 035681 269 DLETVALHEIGHLL-GLHHS 287 (320)
Q Consensus 269 ~l~~Va~HEiGHaL-GL~Hs 287 (320)
....|++||+||++ ||+-.
T Consensus 215 ~~~~v~vHE~GHsf~~LaDE 234 (264)
T PF09471_consen 215 SFKQVVVHEFGHSFGGLADE 234 (264)
T ss_dssp THHHHHHHHHHHHTT-----
T ss_pred cccceeeeeccccccccccc
Confidence 56789999999965 45554
No 62
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=84.05 E-value=0.52 Score=47.16 Aligned_cols=19 Identities=42% Similarity=0.642 Sum_probs=15.5
Q ss_pred cchhHHHHHHhhhhhhCCCC
Q 035681 267 AFDLETVALHEIGHLLGLHH 286 (320)
Q Consensus 267 ~~~l~~Va~HEiGHaLGL~H 286 (320)
..++..|++|||||+.| +|
T Consensus 127 esElagViAHEigHv~q-rH 145 (484)
T COG4783 127 ESELAGVIAHEIGHVAQ-RH 145 (484)
T ss_pred HHHHHHHHHHHHHHHhh-hh
Confidence 34789999999999976 45
No 63
>PF05547 Peptidase_M6: Immune inhibitor A peptidase M6; InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=83.30 E-value=0.22 Score=52.05 Aligned_cols=17 Identities=47% Similarity=0.532 Sum_probs=14.7
Q ss_pred HHHHHHhhhhhhCCCCC
Q 035681 271 ETVALHEIGHLLGLHHS 287 (320)
Q Consensus 271 ~~Va~HEiGHaLGL~Hs 287 (320)
..|++||+||.|||.--
T Consensus 222 iGVfaHEfGH~LGLPDl 238 (645)
T PF05547_consen 222 IGVFAHEFGHDLGLPDL 238 (645)
T ss_pred eEEEEeeccccCCCCCc
Confidence 57999999999999643
No 64
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=83.29 E-value=0.6 Score=49.50 Aligned_cols=20 Identities=40% Similarity=0.710 Sum_probs=17.7
Q ss_pred hhHHHHHHhhhhhhCCCCCC
Q 035681 269 DLETVALHEIGHLLGLHHSS 288 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL~Hs~ 288 (320)
....|++|||||.||+.|-.
T Consensus 322 ~~a~v~AhelgH~lGm~hD~ 341 (716)
T KOG3607|consen 322 AFAVVLAHELGHNLGMIHDE 341 (716)
T ss_pred hHHHHHHHHHHhhcCccccc
Confidence 45679999999999999976
No 65
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=77.33 E-value=3.4 Score=39.22 Aligned_cols=41 Identities=27% Similarity=0.490 Sum_probs=24.5
Q ss_pred eCCCCceeEecCCCCCc-cC---CccccchhHHHHHHhhhhh----hCCC
Q 035681 244 FAPTNGRFHYDADEPWA-VG---AVQGAFDLETVALHEIGHL----LGLH 285 (320)
Q Consensus 244 ~~P~~G~i~~d~~e~w~-~~---~~~~~~~l~~Va~HEiGHa----LGL~ 285 (320)
|-|.+..|.+|... |. +. ...+..-...|++||.||. ||+-
T Consensus 141 YCp~D~tIYlD~~f-~~~L~~~~ga~G~~a~ayVlAHEyGHHVQ~l~Gil 189 (292)
T PF04228_consen 141 YCPADQTIYLDLSF-FDELQQRFGASGDFAQAYVLAHEYGHHVQNLLGIL 189 (292)
T ss_pred eCCCCCEEEechHH-HHHHHHHhCCccHHHHHHHHHHHHHHHHHHHhhhH
Confidence 45788899998742 21 10 1112233456999999994 5653
No 66
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=76.75 E-value=1.4 Score=40.13 Aligned_cols=14 Identities=36% Similarity=0.610 Sum_probs=12.2
Q ss_pred hhHHHHHHhhhhhh
Q 035681 269 DLETVALHEIGHLL 282 (320)
Q Consensus 269 ~l~~Va~HEiGHaL 282 (320)
.+..|+.||+||+|
T Consensus 91 ~~~~~l~HE~GHAl 104 (220)
T PF14247_consen 91 NVLFTLYHELGHAL 104 (220)
T ss_pred HHHHHHHHHHHHHH
Confidence 57789999999985
No 67
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=76.28 E-value=1.4 Score=39.28 Aligned_cols=15 Identities=40% Similarity=0.638 Sum_probs=12.4
Q ss_pred hhHHHHHHhhhhhhC
Q 035681 269 DLETVALHEIGHLLG 283 (320)
Q Consensus 269 ~l~~Va~HEiGHaLG 283 (320)
....|+.||+||++-
T Consensus 21 ~~l~t~~HE~gHal~ 35 (200)
T PF13398_consen 21 RLLVTFVHELGHALA 35 (200)
T ss_pred HHHHHHHHHHHHHHH
Confidence 456799999999874
No 68
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=75.93 E-value=1.3 Score=34.63 Aligned_cols=27 Identities=22% Similarity=0.313 Sum_probs=17.6
Q ss_pred CceeEecCCCCCccCCccccchhHHHHHHhhhhhh
Q 035681 248 NGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLL 282 (320)
Q Consensus 248 ~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaL 282 (320)
...|.++... .......+++|||||.+
T Consensus 28 ~~~I~in~~~--------~~~~~~f~laHELgH~~ 54 (122)
T PF06114_consen 28 NPIIFINSNL--------SPERQRFTLAHELGHIL 54 (122)
T ss_dssp TTEEEEESSS---------HHHHHHHHHHHHHHHH
T ss_pred CCEEEECCCC--------CHHHHHHHHHHHHHHHH
Confidence 4566666531 12345679999999974
No 69
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=74.37 E-value=1.8 Score=38.88 Aligned_cols=30 Identities=40% Similarity=0.624 Sum_probs=26.1
Q ss_pred hHHHHHHhhhhhhCCCCCCCCCCccccccc
Q 035681 270 LETVALHEIGHLLGLHHSSVEGAIMYPSIS 299 (320)
Q Consensus 270 l~~Va~HEiGHaLGL~Hs~~~~siMyp~~~ 299 (320)
...+.-|++||++||+|.+.+..||+|.+.
T Consensus 153 tal~~~efl~~~~~~~~lnhr~~i~~~pg~ 182 (236)
T COG5549 153 TALIAYEFLGHALGLGHLNHRGDIMYPPGE 182 (236)
T ss_pred HHHHHHHHhhhccCcccccccccccCCccc
Confidence 345777999999999999999999998765
No 70
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=73.98 E-value=1.5 Score=39.68 Aligned_cols=26 Identities=27% Similarity=0.372 Sum_probs=16.6
Q ss_pred hHHHHHHhhhhhhCCCCCCCC-CCccccc
Q 035681 270 LETVALHEIGHLLGLHHSSVE-GAIMYPS 297 (320)
Q Consensus 270 l~~Va~HEiGHaLGL~Hs~~~-~siMyp~ 297 (320)
-.-|++||+||+ |-|.... ...+++.
T Consensus 72 ~rFtlAHELGH~--llH~~~~~~~~~~~~ 98 (213)
T COG2856 72 KRFTLAHELGHA--LLHTDLNTRFDAEPT 98 (213)
T ss_pred HHHHHHHHHhHH--Hhccccchhhhcccc
Confidence 356999999999 4555433 3334444
No 71
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=71.55 E-value=2.3 Score=37.63 Aligned_cols=20 Identities=35% Similarity=0.506 Sum_probs=15.9
Q ss_pred chhHHHHHHhhhhhhCCCCCC
Q 035681 268 FDLETVALHEIGHLLGLHHSS 288 (320)
Q Consensus 268 ~~l~~Va~HEiGHaLGL~Hs~ 288 (320)
..+..|+.||+||.. .+|..
T Consensus 87 ~el~aVlaHElgH~~-~~h~~ 106 (226)
T PF01435_consen 87 DELAAVLAHELGHIK-HRHIL 106 (226)
T ss_dssp HHHHHHHHHHHHHHH-TTHCC
T ss_pred HHHHHHHHHHHHHHH-cCCcc
Confidence 378899999999997 45544
No 72
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=68.55 E-value=2.7 Score=39.14 Aligned_cols=15 Identities=47% Similarity=0.736 Sum_probs=13.1
Q ss_pred chhHHHHHHhhhhhh
Q 035681 268 FDLETVALHEIGHLL 282 (320)
Q Consensus 268 ~~l~~Va~HEiGHaL 282 (320)
.+++.|+.||+||..
T Consensus 155 dEl~aVlaHElgHi~ 169 (302)
T COG0501 155 DELEAVLAHELGHIK 169 (302)
T ss_pred HHHHHHHHHHHHHHh
Confidence 478999999999984
No 73
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=68.38 E-value=4.6 Score=38.74 Aligned_cols=37 Identities=27% Similarity=0.499 Sum_probs=26.6
Q ss_pred CCCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCCC
Q 035681 245 APTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSSV 289 (320)
Q Consensus 245 ~P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~ 289 (320)
.|=.|+.+.|.+-. ...+..|++||++|-+|....+.
T Consensus 179 ~PFT~EA~vN~~~p--------~~~~P~T~~HElAHq~G~a~E~E 215 (318)
T PF12725_consen 179 NPFTGEANVNTDLP--------PYSLPFTICHELAHQLGFASEDE 215 (318)
T ss_pred cCCcceeecCCCCC--------cccccHHHHHHHHHHhCCCCHHH
Confidence 45566666665322 24667899999999999998753
No 74
>PRK03982 heat shock protein HtpX; Provisional
Probab=68.32 E-value=2.8 Score=39.50 Aligned_cols=15 Identities=40% Similarity=0.536 Sum_probs=13.0
Q ss_pred chhHHHHHHhhhhhh
Q 035681 268 FDLETVALHEIGHLL 282 (320)
Q Consensus 268 ~~l~~Va~HEiGHaL 282 (320)
.++..|++|||||.-
T Consensus 123 ~El~AVlAHElgHi~ 137 (288)
T PRK03982 123 DELEGVIAHELTHIK 137 (288)
T ss_pred HHHHHHHHHHHHHHH
Confidence 368999999999974
No 75
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=68.25 E-value=8.3 Score=37.15 Aligned_cols=15 Identities=33% Similarity=0.512 Sum_probs=12.4
Q ss_pred chhHHHHHHhhhhhh
Q 035681 268 FDLETVALHEIGHLL 282 (320)
Q Consensus 268 ~~l~~Va~HEiGHaL 282 (320)
.+-..++.||+||++
T Consensus 152 ~~~v~tl~HE~GHa~ 166 (365)
T cd06258 152 HDDINTLFHEFGHAV 166 (365)
T ss_pred HHHHHHHHHHHhHHH
Confidence 455678999999998
No 76
>PRK03001 M48 family peptidase; Provisional
Probab=66.87 E-value=3.1 Score=39.07 Aligned_cols=15 Identities=27% Similarity=0.483 Sum_probs=13.1
Q ss_pred chhHHHHHHhhhhhh
Q 035681 268 FDLETVALHEIGHLL 282 (320)
Q Consensus 268 ~~l~~Va~HEiGHaL 282 (320)
.++..|++|||||.-
T Consensus 122 ~El~aVlAHElgHi~ 136 (283)
T PRK03001 122 REIRGVMAHELAHVK 136 (283)
T ss_pred HHHHHHHHHHHHHHh
Confidence 478999999999974
No 77
>PRK01345 heat shock protein HtpX; Provisional
Probab=66.49 E-value=3.1 Score=39.83 Aligned_cols=19 Identities=21% Similarity=0.336 Sum_probs=15.2
Q ss_pred chhHHHHHHhhhhhhCCCCC
Q 035681 268 FDLETVALHEIGHLLGLHHS 287 (320)
Q Consensus 268 ~~l~~Va~HEiGHaLGL~Hs 287 (320)
.++..|++|||||.- -+|.
T Consensus 122 dEL~aVlAHElgHi~-~~d~ 140 (317)
T PRK01345 122 EEVAGVMAHELAHVK-NRDT 140 (317)
T ss_pred HHHHHHHHHHHHHHH-cCCH
Confidence 378999999999986 3553
No 78
>PRK03072 heat shock protein HtpX; Provisional
Probab=66.31 E-value=3.2 Score=39.17 Aligned_cols=15 Identities=33% Similarity=0.472 Sum_probs=13.1
Q ss_pred chhHHHHHHhhhhhh
Q 035681 268 FDLETVALHEIGHLL 282 (320)
Q Consensus 268 ~~l~~Va~HEiGHaL 282 (320)
.++..|++||+||.-
T Consensus 125 ~El~aVlAHElgHi~ 139 (288)
T PRK03072 125 RELRGVLGHELSHVY 139 (288)
T ss_pred HHHHHHHHHHHHHHh
Confidence 478999999999974
No 79
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=64.51 E-value=3.7 Score=36.78 Aligned_cols=16 Identities=44% Similarity=0.546 Sum_probs=12.3
Q ss_pred hhHHHHHHhhhhhhCC
Q 035681 269 DLETVALHEIGHLLGL 284 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL 284 (320)
-+..|+.||+||++=-
T Consensus 37 l~~~v~iHElgH~~~A 52 (208)
T cd06161 37 LFLSVLLHELGHALVA 52 (208)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3568999999997533
No 80
>PRK04897 heat shock protein HtpX; Provisional
Probab=62.75 E-value=4.1 Score=38.63 Aligned_cols=15 Identities=47% Similarity=0.536 Sum_probs=13.1
Q ss_pred chhHHHHHHhhhhhh
Q 035681 268 FDLETVALHEIGHLL 282 (320)
Q Consensus 268 ~~l~~Va~HEiGHaL 282 (320)
.++..|++||+||.-
T Consensus 135 ~El~aVlAHElgHi~ 149 (298)
T PRK04897 135 EELEGVIGHEISHIR 149 (298)
T ss_pred HHHHHHHHHHHHHHh
Confidence 478999999999974
No 81
>PRK02870 heat shock protein HtpX; Provisional
Probab=61.49 E-value=4.4 Score=39.20 Aligned_cols=14 Identities=36% Similarity=0.657 Sum_probs=12.8
Q ss_pred chhHHHHHHhhhhh
Q 035681 268 FDLETVALHEIGHL 281 (320)
Q Consensus 268 ~~l~~Va~HEiGHa 281 (320)
.++..|++||+||.
T Consensus 171 dEL~aVlAHELgHi 184 (336)
T PRK02870 171 DELQAVMAHELSHI 184 (336)
T ss_pred HHHHHHHHHHHHHH
Confidence 47899999999998
No 82
>PRK05457 heat shock protein HtpX; Provisional
Probab=61.05 E-value=4.6 Score=38.07 Aligned_cols=14 Identities=43% Similarity=0.667 Sum_probs=12.6
Q ss_pred chhHHHHHHhhhhh
Q 035681 268 FDLETVALHEIGHL 281 (320)
Q Consensus 268 ~~l~~Va~HEiGHa 281 (320)
.++..|++||+||.
T Consensus 132 ~El~aVlAHElgHi 145 (284)
T PRK05457 132 DEVEAVLAHEISHI 145 (284)
T ss_pred HHHHHHHHHHHHHH
Confidence 47899999999996
No 83
>PRK01265 heat shock protein HtpX; Provisional
Probab=61.00 E-value=4.6 Score=38.90 Aligned_cols=14 Identities=50% Similarity=0.790 Sum_probs=12.4
Q ss_pred chhHHHHHHhhhhh
Q 035681 268 FDLETVALHEIGHL 281 (320)
Q Consensus 268 ~~l~~Va~HEiGHa 281 (320)
.+++.|++||+||.
T Consensus 138 ~El~aVlAHElgHi 151 (324)
T PRK01265 138 DEIKAVAGHELGHL 151 (324)
T ss_pred HHHHHHHHHHHHHH
Confidence 47899999999995
No 84
>PRK02391 heat shock protein HtpX; Provisional
Probab=60.59 E-value=4.7 Score=38.24 Aligned_cols=14 Identities=43% Similarity=0.667 Sum_probs=12.5
Q ss_pred chhHHHHHHhhhhh
Q 035681 268 FDLETVALHEIGHL 281 (320)
Q Consensus 268 ~~l~~Va~HEiGHa 281 (320)
.++..|++||+||.
T Consensus 131 ~El~aVlaHElgHi 144 (296)
T PRK02391 131 DELEAVLAHELSHV 144 (296)
T ss_pred HHHHHHHHHHHHHH
Confidence 47899999999996
No 85
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=59.11 E-value=6 Score=31.09 Aligned_cols=14 Identities=36% Similarity=0.451 Sum_probs=12.1
Q ss_pred hhHHHHHHhhhhhh
Q 035681 269 DLETVALHEIGHLL 282 (320)
Q Consensus 269 ~l~~Va~HEiGHaL 282 (320)
.+..|++||+.|.+
T Consensus 24 ~~~~~l~HE~~H~~ 37 (128)
T PF13485_consen 24 WLDRVLAHELAHQW 37 (128)
T ss_pred HHHHHHHHHHHHHH
Confidence 46799999999985
No 86
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=59.08 E-value=5.2 Score=36.41 Aligned_cols=15 Identities=47% Similarity=0.713 Sum_probs=11.9
Q ss_pred hHHHHHHhhhhhhCC
Q 035681 270 LETVALHEIGHLLGL 284 (320)
Q Consensus 270 l~~Va~HEiGHaLGL 284 (320)
+..|+.||+||++=-
T Consensus 53 ~~~v~iHElgH~~~A 67 (227)
T cd06164 53 FASVLLHELGHSLVA 67 (227)
T ss_pred HHHHHHHHHHHHHHH
Confidence 467999999998643
No 87
>KOG3538 consensus Disintegrin metalloproteinases with thrombospondin repeats [Posttranslational modification, protein turnover, chaperones]
Probab=58.79 E-value=3 Score=45.28 Aligned_cols=32 Identities=22% Similarity=0.386 Sum_probs=25.3
Q ss_pred hhHHHHHHhhhhhhCCCCCCCCC------CcccccccC
Q 035681 269 DLETVALHEIGHLLGLHHSSVEG------AIMYPSISA 300 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL~Hs~~~~------siMyp~~~~ 300 (320)
....+++||+||.+|+.|-...+ .||.|....
T Consensus 316 ~~aftiahE~GH~~gm~hd~~~~~C~~~~~im~~~~~~ 353 (845)
T KOG3538|consen 316 PSAFTIAHELGHNFGMEHDGRGNSCGPTFHVMSSDTCG 353 (845)
T ss_pred ccceeeeeccccccCcccCCcCCCCCCcccccCccccc
Confidence 35579999999999999976533 589888763
No 88
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=56.70 E-value=5 Score=39.34 Aligned_cols=14 Identities=36% Similarity=0.477 Sum_probs=11.5
Q ss_pred hhHHHHHHhhhhhh
Q 035681 269 DLETVALHEIGHLL 282 (320)
Q Consensus 269 ~l~~Va~HEiGHaL 282 (320)
+-..++.||+||++
T Consensus 221 ~~v~tl~HE~GHa~ 234 (427)
T cd06459 221 DDVFTLAHELGHAF 234 (427)
T ss_pred hhHHHHHHHhhHHH
Confidence 44678999999985
No 89
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=55.93 E-value=6.3 Score=37.11 Aligned_cols=13 Identities=38% Similarity=0.603 Sum_probs=11.0
Q ss_pred hHHHHHHhhhhhh
Q 035681 270 LETVALHEIGHLL 282 (320)
Q Consensus 270 l~~Va~HEiGHaL 282 (320)
+..|++||+||++
T Consensus 135 ~isvvvHElgHal 147 (277)
T cd06162 135 LISGVVHEMGHGV 147 (277)
T ss_pred HHHHHHHHHHHHH
Confidence 4579999999985
No 90
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=54.99 E-value=13 Score=28.12 Aligned_cols=30 Identities=10% Similarity=0.214 Sum_probs=19.3
Q ss_pred CceeEecCCCCCccCCccccchhHHHHHHhhhhhh
Q 035681 248 NGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLL 282 (320)
Q Consensus 248 ~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaL 282 (320)
+.+|+|...+. ... ...-..+++||+.|++
T Consensus 44 G~~I~f~~g~~-~~~----s~~~~~llaHEl~Hv~ 73 (79)
T PF13699_consen 44 GNDIYFAPGKY-NPD----SPEGRALLAHELAHVV 73 (79)
T ss_pred CCEEEEcCCCc-CCC----CCCcchhHhHHHHHHH
Confidence 67888865431 111 1233679999999985
No 91
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=54.67 E-value=6.8 Score=36.59 Aligned_cols=16 Identities=25% Similarity=0.341 Sum_probs=12.1
Q ss_pred hHHHHHHhhhhhhCCC
Q 035681 270 LETVALHEIGHLLGLH 285 (320)
Q Consensus 270 l~~Va~HEiGHaLGL~ 285 (320)
...|++||+||++=-.
T Consensus 118 ~isv~iHElgHa~~Ar 133 (263)
T cd06159 118 VVGVVVHELSHGILAR 133 (263)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3579999999986433
No 92
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=54.64 E-value=7.9 Score=40.06 Aligned_cols=15 Identities=27% Similarity=0.474 Sum_probs=12.2
Q ss_pred chhHHHHHHhhhhhh
Q 035681 268 FDLETVALHEIGHLL 282 (320)
Q Consensus 268 ~~l~~Va~HEiGHaL 282 (320)
.+-..|++||+||++
T Consensus 376 ~~dv~TLaHElGHa~ 390 (591)
T TIGR00181 376 LNSVFTLAHELGHSM 390 (591)
T ss_pred cchHHHHHHHhhhHH
Confidence 345679999999995
No 93
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=54.48 E-value=6.4 Score=38.76 Aligned_cols=15 Identities=47% Similarity=0.756 Sum_probs=12.1
Q ss_pred hHHHHHHhhhhhhCC
Q 035681 270 LETVALHEIGHLLGL 284 (320)
Q Consensus 270 l~~Va~HEiGHaLGL 284 (320)
+..++.||+||+|-=
T Consensus 131 vi~~vvHElGHalAA 145 (484)
T KOG2921|consen 131 VITVVVHELGHALAA 145 (484)
T ss_pred HHHHHHHHhhHHHHH
Confidence 457889999999754
No 94
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=54.45 E-value=6.2 Score=39.38 Aligned_cols=24 Identities=13% Similarity=0.160 Sum_probs=16.3
Q ss_pred HHHHHHHHHHhhhcCcceEEEecC
Q 035681 186 ITPVARAFQTWAANTQFKFSKIED 209 (320)
Q Consensus 186 ~~~i~~A~~~Ws~v~~l~F~e~~~ 209 (320)
.+++..-+...+.+=+|+|+++..
T Consensus 137 ~~~~~g~~~l~~~lfgi~~~~~~~ 160 (458)
T PF01432_consen 137 EEVLEGLFELAERLFGIRFEEVPD 160 (458)
T ss_dssp HHHHHHHHHHHHHHHTEEEEECTC
T ss_pred HHHHHHHHHHHHHHhcEEEEeccc
Confidence 355555566677777888888764
No 95
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=53.88 E-value=5.9 Score=40.76 Aligned_cols=15 Identities=33% Similarity=0.306 Sum_probs=12.3
Q ss_pred chhHHHHHHhhhhhh
Q 035681 268 FDLETVALHEIGHLL 282 (320)
Q Consensus 268 ~~l~~Va~HEiGHaL 282 (320)
.+-..|++||+||++
T Consensus 335 ~~dv~TL~HElGHa~ 349 (549)
T TIGR02289 335 SGDIDVLTHEAGHAF 349 (549)
T ss_pred hhHHHHHHHHhhHHH
Confidence 345679999999997
No 96
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=52.27 E-value=7 Score=39.44 Aligned_cols=14 Identities=36% Similarity=0.653 Sum_probs=11.6
Q ss_pred hhHHHHHHhhhhhh
Q 035681 269 DLETVALHEIGHLL 282 (320)
Q Consensus 269 ~l~~Va~HEiGHaL 282 (320)
+-..|+.||+||+|
T Consensus 262 ~~V~TLfHEfGHal 275 (472)
T cd06455 262 DEVETFFHEFGHVI 275 (472)
T ss_pred HHHHHHHHHHHHHH
Confidence 44578999999997
No 97
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=51.62 E-value=8.4 Score=33.95 Aligned_cols=21 Identities=33% Similarity=0.675 Sum_probs=14.9
Q ss_pred hHHHHHHhhhhhhCCCCCCCC
Q 035681 270 LETVALHEIGHLLGLHHSSVE 290 (320)
Q Consensus 270 l~~Va~HEiGHaLGL~Hs~~~ 290 (320)
+..+..||+||++--..-.++
T Consensus 41 ~~~l~iHElgH~~~A~~~G~~ 61 (183)
T cd06160 41 LAILGIHEMGHYLAARRHGVK 61 (183)
T ss_pred HHHHHHHHHHHHHHHHHCCCC
Confidence 456889999998766544443
No 98
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=50.22 E-value=7.3 Score=40.39 Aligned_cols=14 Identities=29% Similarity=0.399 Sum_probs=11.6
Q ss_pred hhHHHHHHhhhhhh
Q 035681 269 DLETVALHEIGHLL 282 (320)
Q Consensus 269 ~l~~Va~HEiGHaL 282 (320)
+-..|++||+||++
T Consensus 374 ~~v~TL~HE~GHa~ 387 (587)
T TIGR02290 374 RDVSTLAHELGHAY 387 (587)
T ss_pred hhHHHHHHHhhHHH
Confidence 34578999999998
No 99
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=49.81 E-value=8.2 Score=32.92 Aligned_cols=12 Identities=33% Similarity=0.396 Sum_probs=10.1
Q ss_pred HHHHHHhhhhhh
Q 035681 271 ETVALHEIGHLL 282 (320)
Q Consensus 271 ~~Va~HEiGHaL 282 (320)
.-|++|||+|.+
T Consensus 136 lDVvaHEltHGV 147 (150)
T PF01447_consen 136 LDVVAHELTHGV 147 (150)
T ss_dssp HHHHHHHHHHHH
T ss_pred cceeeecccccc
Confidence 459999999974
No 100
>PLN02791 Nudix hydrolase homolog
Probab=48.67 E-value=8.9 Score=41.16 Aligned_cols=21 Identities=24% Similarity=0.275 Sum_probs=17.4
Q ss_pred hhHHHHHHhhhhhhCCCCCCC
Q 035681 269 DLETVALHEIGHLLGLHHSSV 289 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL~Hs~~ 289 (320)
.+.+|++||+||-+|=.+.-.
T Consensus 584 f~~~v~lHElgHGsG~~~~~~ 604 (770)
T PLN02791 584 FFTHTICHECCHGIGPHTITL 604 (770)
T ss_pred HHHHHHHHHhhccccccceec
Confidence 457899999999999977544
No 101
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=48.50 E-value=12 Score=32.77 Aligned_cols=38 Identities=24% Similarity=0.373 Sum_probs=26.0
Q ss_pred CceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCCCC
Q 035681 248 NGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSSVE 290 (320)
Q Consensus 248 ~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~ 290 (320)
.|.|.|+ |.+-..+ ..-+.-|++||+=|..-..||..-
T Consensus 147 ~~~I~ln----~~L~~~P-~~~idYVvvHEL~Hl~~~nHs~~F 184 (205)
T PF01863_consen 147 KGNITLN----WRLVMAP-PEVIDYVVVHELCHLRHPNHSKRF 184 (205)
T ss_pred CCcEEee----cccccCC-ccHHHHHHHHHHHHhccCCCCHHH
Confidence 5667765 3332111 124678999999999999998753
No 102
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=48.45 E-value=8.7 Score=38.63 Aligned_cols=24 Identities=4% Similarity=-0.082 Sum_probs=17.3
Q ss_pred HHHHHHHHHHhhhcCcceEEEecC
Q 035681 186 ITPVARAFQTWAANTQFKFSKIED 209 (320)
Q Consensus 186 ~~~i~~A~~~Ws~v~~l~F~e~~~ 209 (320)
-.+++--|..-+.+=+|+|++++.
T Consensus 133 ~~vl~gl~~~~~~lfgi~~~~~~~ 156 (458)
T cd06457 133 GTVMEGLSRLFSRLYGIRLVPVPL 156 (458)
T ss_pred HHHHHHHHHHHHHHhCeEEEecCC
Confidence 355566666677788999999763
No 103
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=46.60 E-value=9.7 Score=37.93 Aligned_cols=15 Identities=47% Similarity=0.649 Sum_probs=12.1
Q ss_pred chhHHHHHHhhhhhh
Q 035681 268 FDLETVALHEIGHLL 282 (320)
Q Consensus 268 ~~l~~Va~HEiGHaL 282 (320)
.+-..|+.||+||+|
T Consensus 206 ~~~v~tLfHEfGHal 220 (422)
T cd06456 206 HDEVTTLFHEFGHAL 220 (422)
T ss_pred HHHHHHHHHHHHHHH
Confidence 344578999999998
No 104
>PF11337 DUF3139: Protein of unknown function (DUF3139); InterPro: IPR021486 This family of proteins with unknown function appears to be restricted to Firmicutes.
Probab=46.07 E-value=42 Score=25.49 Aligned_cols=23 Identities=22% Similarity=0.338 Sum_probs=18.1
Q ss_pred CCCCchhcHHHHHHHHHhcCCCC
Q 035681 52 HKGDKIKGIHELKEYMEKFGYLN 74 (320)
Q Consensus 52 ~~g~~~~~v~~l~~yL~~fGYl~ 74 (320)
++-++.....++.+||++-||-.
T Consensus 29 ~~~~k~~~~k~i~~yL~e~gy~e 51 (85)
T PF11337_consen 29 NPYQKHKAEKAIDWYLQEQGYKE 51 (85)
T ss_pred chhhHHHHHHHHHHHHHHcCCcH
Confidence 44466677889999999999864
No 105
>PF01434 Peptidase_M41: Peptidase family M41 This is family M41 in the peptidase classification. ; InterPro: IPR000642 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The peptidase M41 family belong to a larger family of zinc metalloproteases. This family includes the cell division protein FtsH, and the yeast mitochondrial respiratory chain complexes assembly protein, which is a putative ATP-dependent protease required for assembly of the mitochondrial respiratory chain and ATPase complexes. FtsH is an integral membrane protein, which seems to act as an ATP-dependent zinc metallopeptidase that binds one zinc ion.; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0006508 proteolysis; PDB: 4EIW_C 2DHR_E 1IY1_A 1IY2_A 1IY0_A 1IXZ_A 2CE7_F 2CEA_F 3KDS_E 2QZ4_A ....
Probab=45.39 E-value=12 Score=33.52 Aligned_cols=15 Identities=33% Similarity=0.543 Sum_probs=12.8
Q ss_pred chhHHHHHHhhhhhh
Q 035681 268 FDLETVALHEIGHLL 282 (320)
Q Consensus 268 ~~l~~Va~HEiGHaL 282 (320)
.+...+|.||-||+|
T Consensus 26 ~~~~~~A~HEAGhAv 40 (213)
T PF01434_consen 26 EEKRRIAYHEAGHAV 40 (213)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHH
Confidence 467889999999975
No 106
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=42.57 E-value=11 Score=37.37 Aligned_cols=12 Identities=42% Similarity=0.844 Sum_probs=10.1
Q ss_pred HHHHHHhhhhhh
Q 035681 271 ETVALHEIGHLL 282 (320)
Q Consensus 271 ~~Va~HEiGHaL 282 (320)
..|++||+||.|
T Consensus 15 ~~v~~HE~gH~~ 26 (420)
T TIGR00054 15 VLIFVHELGHFL 26 (420)
T ss_pred HHHHHHhHHHHH
Confidence 469999999964
No 107
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=42.37 E-value=12 Score=37.59 Aligned_cols=12 Identities=33% Similarity=0.744 Sum_probs=9.7
Q ss_pred HHHHHHhhhhhh
Q 035681 271 ETVALHEIGHLL 282 (320)
Q Consensus 271 ~~Va~HEiGHaL 282 (320)
.-|++||+||-|
T Consensus 16 ~li~vHElGHfl 27 (449)
T PRK10779 16 VLITVHEFGHFW 27 (449)
T ss_pred HHHHHHHHHHHH
Confidence 358899999964
No 108
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=42.14 E-value=12 Score=37.12 Aligned_cols=12 Identities=42% Similarity=0.606 Sum_probs=9.9
Q ss_pred HHHHHHhhhhhh
Q 035681 271 ETVALHEIGHLL 282 (320)
Q Consensus 271 ~~Va~HEiGHaL 282 (320)
.-+++||+||+|
T Consensus 160 l~t~iHE~GHal 171 (396)
T cd06460 160 LFSTIHETGHAL 171 (396)
T ss_pred HHHHHHHhhHHH
Confidence 457899999985
No 109
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=41.96 E-value=13 Score=36.98 Aligned_cols=19 Identities=42% Similarity=0.630 Sum_probs=14.5
Q ss_pred chhHHHHHHhhhhhhCCCCC
Q 035681 268 FDLETVALHEIGHLLGLHHS 287 (320)
Q Consensus 268 ~~l~~Va~HEiGHaLGL~Hs 287 (320)
.++..|+.||+||- =++|.
T Consensus 278 eel~AVl~HELGHW-~~~H~ 296 (428)
T KOG2719|consen 278 EELVAVLAHELGHW-KLNHV 296 (428)
T ss_pred HHHHHHHHHHhhHH-HHhhH
Confidence 46889999999996 24454
No 110
>KOG3988 consensus Protein-tyrosine sulfotransferase TPST1/TPST2 [Posttranslational modification, protein turnover, chaperones]
Probab=40.06 E-value=46 Score=31.69 Aligned_cols=61 Identities=20% Similarity=0.334 Sum_probs=43.6
Q ss_pred hhhhhhcccCCCCCchhcHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHh
Q 035681 42 FKFIEHLKGCHKGDKIKGIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLN 104 (320)
Q Consensus 42 ~~~~~~~~~~~~g~~~~~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~ 104 (320)
.+++..-.||=.||.+.+++.+--+|..+||.++.++.|= +..+..--..++...+.||--
T Consensus 297 l~AlskWvg~ip~dvvrdma~iAPmL~~LGYDp~anpPnY--gkpd~~v~~nt~~v~k~f~~~ 357 (378)
T KOG3988|consen 297 LEALSKWVGCIPEDVVRDMADIAPMLAILGYDPYANPPNY--GKPDPDVIYNTKRVYKIFNLP 357 (378)
T ss_pred HHHHHHHhccCCHHHHHHHHHHHHHHHHhCCCCCCCCCCC--CCCCHHHHHHHHHHHHHhccH
Confidence 6777778899999999999999999999999987643221 223332235566666655543
No 111
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=39.53 E-value=13 Score=38.73 Aligned_cols=14 Identities=29% Similarity=0.332 Sum_probs=11.6
Q ss_pred hhHHHHHHhhhhhh
Q 035681 269 DLETVALHEIGHLL 282 (320)
Q Consensus 269 ~l~~Va~HEiGHaL 282 (320)
+-..|++||+||++
T Consensus 379 ~dV~TLaHElGHs~ 392 (598)
T COG1164 379 RDVFTLAHELGHSV 392 (598)
T ss_pred hHHHHHHHHccHHH
Confidence 44679999999985
No 112
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=39.28 E-value=21 Score=34.78 Aligned_cols=15 Identities=33% Similarity=0.634 Sum_probs=12.9
Q ss_pred chhHHHHHHhhhhhh
Q 035681 268 FDLETVALHEIGHLL 282 (320)
Q Consensus 268 ~~l~~Va~HEiGHaL 282 (320)
.++-.|+.||+||+.
T Consensus 122 sEvAAVl~HEmgHVt 136 (479)
T COG4784 122 SEVAAVLAHEMGHVT 136 (479)
T ss_pred HHHHHHHHhhhhhee
Confidence 478899999999984
No 113
>PF03272 Enhancin: Viral enhancin protein; InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=37.01 E-value=38 Score=36.58 Aligned_cols=42 Identities=10% Similarity=0.105 Sum_probs=26.6
Q ss_pred CCCCCcccceEeccCCC------ch----hHH-----------HHHHHHHHHhhhcCcceEEEec
Q 035681 165 KWPASKFSLTYAFRPGT------RS----DAI-----------TPVARAFQTWAANTQFKFSKIE 208 (320)
Q Consensus 165 kW~~~k~~Lty~~~~~~------~~----~~~-----------~~i~~A~~~Ws~v~~l~F~e~~ 208 (320)
.|. +..-.|.+.... |. .++ +--.+.|+...+.+.|.|....
T Consensus 129 ~w~--~~~~~fa~ve~~~i~lLVP~~Dk~~l~~~~~~~l~~L~~~Y~~i~~~Yd~l~Gl~~~~~~ 191 (775)
T PF03272_consen 129 EWD--DSDSSFALVELDYIQLLVPPADKPNLNNKDFKSLDELIDFYNDIFKFYDDLTGLSDDPSD 191 (775)
T ss_pred hhh--hcCCceEEEECCEEEEEeCcchHHHHhhhcccCHHHHHHHHHHHHHHHHhhhCCCCCCCc
Confidence 688 778888876531 21 111 2226778888888888886543
No 114
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=36.84 E-value=22 Score=33.27 Aligned_cols=22 Identities=32% Similarity=0.475 Sum_probs=17.3
Q ss_pred cchhHHHHHHhhhhhhCCCCCC
Q 035681 267 AFDLETVALHEIGHLLGLHHSS 288 (320)
Q Consensus 267 ~~~l~~Va~HEiGHaLGL~Hs~ 288 (320)
..+...|+.||+||--|=.+--
T Consensus 200 a~~yyaTl~HElghwtgh~~rl 221 (316)
T COG4227 200 AINYYATLLHELGHWTGHEARL 221 (316)
T ss_pred hHhHHHHHHHHhccccCchhhh
Confidence 4578899999999998765543
No 115
>PRK10911 oligopeptidase A; Provisional
Probab=34.99 E-value=18 Score=38.32 Aligned_cols=14 Identities=43% Similarity=0.548 Sum_probs=11.6
Q ss_pred hhHHHHHHhhhhhh
Q 035681 269 DLETVALHEIGHLL 282 (320)
Q Consensus 269 ~l~~Va~HEiGHaL 282 (320)
+-..|+.||+||+|
T Consensus 462 ~~v~tlfHEfGHal 475 (680)
T PRK10911 462 DEVITLFHEFGHGL 475 (680)
T ss_pred HHHHHHHHHHhHHH
Confidence 34568999999998
No 116
>PF14891 Peptidase_M91: Effector protein
Probab=34.68 E-value=42 Score=29.03 Aligned_cols=18 Identities=22% Similarity=0.150 Sum_probs=13.7
Q ss_pred hhHHHHHHhhhhhhCCCC
Q 035681 269 DLETVALHEIGHLLGLHH 286 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL~H 286 (320)
+-..|+.|||.||.-..+
T Consensus 102 ~p~v~L~HEL~HA~~~~~ 119 (174)
T PF14891_consen 102 PPFVVLYHELIHAYDYMN 119 (174)
T ss_pred HHHHHHHHHHHHHHHHHC
Confidence 456799999999965543
No 117
>PF03571 Peptidase_M49: Peptidase family M49; InterPro: IPR005317 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M49 (dipeptidyl-peptidase III family, clan M-). The predicted active site residues occur in the motif HEXXXH which is unlike that in any other family. The dipeptidyl peptidase III aminopeptidases cleave dipeptides from the N-terminal of peptides consisting of four or more amino acids and have a broad specificity.; GO: 0008239 dipeptidyl-peptidase activity, 0006508 proteolysis, 0005737 cytoplasm; PDB: 3T6B_A 3FVY_A 3T6J_A 3CSK_A.
Probab=33.74 E-value=19 Score=37.15 Aligned_cols=16 Identities=44% Similarity=0.553 Sum_probs=11.7
Q ss_pred hhHHHHHHhh-hhhhCC
Q 035681 269 DLETVALHEI-GHLLGL 284 (320)
Q Consensus 269 ~l~~Va~HEi-GHaLGL 284 (320)
...+|++||+ ||..|=
T Consensus 292 f~~~v~lHEllGHGsGk 308 (549)
T PF03571_consen 292 FFVQVGLHELLGHGSGK 308 (549)
T ss_dssp HHHHHHHHHHCCCCS--
T ss_pred HhhhhhHHhhccCcCcc
Confidence 3467999998 998884
No 118
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=33.55 E-value=19 Score=33.77 Aligned_cols=39 Identities=23% Similarity=0.342 Sum_probs=22.3
Q ss_pred eCCCCceeEecCCCCCcc---CCccccchhHHHHHHhhhhhh
Q 035681 244 FAPTNGRFHYDADEPWAV---GAVQGAFDLETVALHEIGHLL 282 (320)
Q Consensus 244 ~~P~~G~i~~d~~e~w~~---~~~~~~~~l~~Va~HEiGHaL 282 (320)
|.|.+..+++|....-.. -...+.+-..-|++||.||.+
T Consensus 138 YCP~D~kvYlDlsFf~~m~~~fga~GdfAqaYViAHEVGHHV 179 (295)
T COG2321 138 YCPADTKVYLDLSFFDEMKTKFGASGDFAQAYVIAHEVGHHV 179 (295)
T ss_pred ecCCCceEEEehhHHHHHHHHhcCCccHHHHHHHHhhhhHHH
Confidence 456788888887321000 011222344569999999953
No 119
>CHL00132 psaF photosystem I subunit III; Validated
Probab=31.81 E-value=1e+02 Score=27.06 Aligned_cols=38 Identities=5% Similarity=0.195 Sum_probs=23.5
Q ss_pred cHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcC
Q 035681 59 GIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLNYH 106 (320)
Q Consensus 59 ~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~ 106 (320)
.|+.|++.|++| +.. .........+.+.+.++|.++..
T Consensus 45 ~~k~Le~rlk~Y---~~~-------s~p~lal~~qi~~tk~RFe~Y~~ 82 (185)
T CHL00132 45 SVKKLENRLAKY---EAN-------SPPALALQQQIDKTKARFDKYGR 82 (185)
T ss_pred HHHHHHhhhhcc---CCC-------CChHHHHHHHHHHHHHHHHHhcc
Confidence 366777888865 211 11223456777888888888743
No 120
>COG3926 zliS Lysozyme family protein [General function prediction only]
Probab=28.97 E-value=52 Score=30.10 Aligned_cols=30 Identities=27% Similarity=0.263 Sum_probs=26.5
Q ss_pred HHHHHHHHHHhcC----CCcccccCHHHHHhhcc
Q 035681 94 LESAIKTYQLNYH----LKSTGVLDANTVAKMTI 123 (320)
Q Consensus 94 ~~~Aik~fQ~~~~----L~~TG~ld~~T~~~m~~ 123 (320)
--.|++-.|+..+ .+++|++..+|+..+.+
T Consensus 93 p~rAa~~LQkal~~~~~v~~DGvIG~~TLaAl~~ 126 (252)
T COG3926 93 PGRAAKWLQKALGPAYTVRVDGVIGAQTLAALKK 126 (252)
T ss_pred cchHHHHHHHHhccCCcccccCcccHHHHHHHHh
Confidence 3578999999999 99999999999998854
No 121
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=28.61 E-value=31 Score=33.41 Aligned_cols=19 Identities=42% Similarity=0.574 Sum_probs=14.8
Q ss_pred hhHHHHHHhhhhhhCCCCCC
Q 035681 269 DLETVALHEIGHLLGLHHSS 288 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL~Hs~ 288 (320)
.+..|+.||++|++ .+|+.
T Consensus 274 glAtvLgHE~aHaV-arH~A 292 (424)
T KOG2661|consen 274 GLATVLGHEIAHAV-ARHAA 292 (424)
T ss_pred HHHHHHHHHHHHHH-HHHHH
Confidence 47789999999986 45653
No 122
>PF11576 DUF3236: Protein of unknown function (DUF3236); InterPro: IPR012019 This family of proteins with unknown function appears to be restricted to Methanobacteria. ; PDB: 3BRC_B.
Probab=27.53 E-value=42 Score=28.47 Aligned_cols=27 Identities=30% Similarity=0.700 Sum_probs=18.1
Q ss_pred hhhhcccCCCCCchhcHHHHHHHHHhc
Q 035681 44 FIEHLKGCHKGDKIKGIHELKEYMEKF 70 (320)
Q Consensus 44 ~~~~~~~~~~g~~~~~v~~l~~yL~~f 70 (320)
+.+.+.|++.||+-+.|..+|+|...-
T Consensus 9 y~eS~~~~R~GDk~EEv~~Ir~~I~na 35 (154)
T PF11576_consen 9 YLESLEGTRRGDKEEEVEAIREYILNA 35 (154)
T ss_dssp HHHHHTT---S--HHHHHHHHHHHHH-
T ss_pred HHHHHhCcccCCcHHHHHHHHHHHhcC
Confidence 455678999999999999999999865
No 123
>PF08434 CLCA_N: Calcium-activated chloride channel; InterPro: IPR013642 The CLCA family of calcium-activated chloride channels has been identified in many epithelial and endothelial cell types as well as in smooth muscle cells [] and has four or five putative transmembrane regions. Additionally to their role as chloride channels some CLCA proteins function as adhesion molecules and may also have roles as tumour suppressors []. The domain described here is found at the N terminus of CLCAs.
Probab=26.17 E-value=38 Score=31.67 Aligned_cols=14 Identities=43% Similarity=0.665 Sum_probs=10.9
Q ss_pred HHHHHHhhhhh-hCC
Q 035681 271 ETVALHEIGHL-LGL 284 (320)
Q Consensus 271 ~~Va~HEiGHa-LGL 284 (320)
..|++||-+|. =|+
T Consensus 150 grv~VhEWAhlRWGV 164 (262)
T PF08434_consen 150 GRVFVHEWAHLRWGV 164 (262)
T ss_pred ceeeeehhhhhcccc
Confidence 46999999995 455
No 124
>PF06167 Peptidase_M90: Glucose-regulated metallo-peptidase M90; InterPro: IPR010384 MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=25.51 E-value=32 Score=31.91 Aligned_cols=19 Identities=32% Similarity=0.476 Sum_probs=13.8
Q ss_pred chhHHHHHHhhhhhhCCCC
Q 035681 268 FDLETVALHEIGHLLGLHH 286 (320)
Q Consensus 268 ~~l~~Va~HEiGHaLGL~H 286 (320)
.|-.+|++||+-|.|=+..
T Consensus 152 ~dg~NVviHEfAH~LD~~~ 170 (253)
T PF06167_consen 152 NDGHNVVIHEFAHKLDMED 170 (253)
T ss_dssp SSS--HHHHHHHHHHHCTT
T ss_pred CCCcchHHHHHHHHHHhhc
Confidence 3567899999999996654
No 125
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=24.28 E-value=35 Score=36.24 Aligned_cols=31 Identities=23% Similarity=0.236 Sum_probs=19.2
Q ss_pred HHH-HHHHHHhhhcCcceEEEecCC--CCCceEE
Q 035681 187 TPV-ARAFQTWAANTQFKFSKIEDF--ANADIKI 217 (320)
Q Consensus 187 ~~i-~~A~~~Ws~v~~l~F~e~~~~--~~aDI~I 217 (320)
.++ .-.|..-+..=+|+|+++.+. =.+|++.
T Consensus 364 ~Vl~~Glf~l~~~LfGi~f~~~~~~~vWh~dV~~ 397 (681)
T PRK10280 364 TVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRV 397 (681)
T ss_pred HHHHHhHHHHHHHHcCeEEEECCCCCCCCCCeeE
Confidence 455 356666777888999986531 1356653
No 126
>PF13721 SecD-TM1: SecD export protein N-terminal TM region
Probab=23.67 E-value=1.5e+02 Score=23.46 Aligned_cols=29 Identities=7% Similarity=0.185 Sum_probs=22.3
Q ss_pred cccCCCCCchhcHHHHHHHHHhcCCCCCC
Q 035681 48 LKGCHKGDKIKGIHELKEYMEKFGYLNYD 76 (320)
Q Consensus 48 ~~~~~~g~~~~~v~~l~~yL~~fGYl~~~ 76 (320)
.+..+.|...+...++++.|++-|--...
T Consensus 36 Is~~~~~~~~~~~~~v~~~L~~~~I~~k~ 64 (101)
T PF13721_consen 36 ISASSAGVQLPDAFQVEQALKAAGIAVKS 64 (101)
T ss_pred EecCCCCccCChHHHHHHHHHHCCCCcce
Confidence 44556677777788999999999987653
No 127
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=23.61 E-value=55 Score=23.32 Aligned_cols=21 Identities=33% Similarity=0.597 Sum_probs=17.7
Q ss_pred hhHHHHHHhhhhhhCCCCCCC
Q 035681 269 DLETVALHEIGHLLGLHHSSV 289 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL~Hs~~ 289 (320)
..+.-++||+.+.+||.|.+.
T Consensus 28 ~~eR~~vH~lA~~~gL~s~S~ 48 (60)
T cd02641 28 SHDRLLVHELAEELGLRHEST 48 (60)
T ss_pred HHHHHHHHHHHHHcCCceEee
Confidence 457789999999999998653
No 128
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=22.76 E-value=47 Score=29.11 Aligned_cols=16 Identities=25% Similarity=0.320 Sum_probs=12.8
Q ss_pred hhHHHHHHhhhhhhCC
Q 035681 269 DLETVALHEIGHLLGL 284 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL 284 (320)
.+-.|++|||+|++.-
T Consensus 35 ~lG~ilahel~hafd~ 50 (206)
T PF01431_consen 35 GLGFILAHELMHAFDP 50 (206)
T ss_dssp THHHHHHHHHHHCTST
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3567999999998754
No 129
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=22.66 E-value=44 Score=33.97 Aligned_cols=16 Identities=25% Similarity=0.312 Sum_probs=12.7
Q ss_pred hhHHHHHHhhhhhhCC
Q 035681 269 DLETVALHEIGHLLGL 284 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL 284 (320)
+-..|+.||+||+...
T Consensus 246 ~D~~t~~HE~GH~~yy 261 (477)
T cd06461 246 EDFVTVHHEMGHIQYY 261 (477)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4467889999999854
No 130
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=22.45 E-value=47 Score=29.97 Aligned_cols=14 Identities=21% Similarity=0.218 Sum_probs=11.2
Q ss_pred hhHHHHHHhhhhhh
Q 035681 269 DLETVALHEIGHLL 282 (320)
Q Consensus 269 ~l~~Va~HEiGHaL 282 (320)
-+-.|++||++||-
T Consensus 92 l~gsiLAHE~mHa~ 105 (212)
T PF12315_consen 92 LTGSILAHELMHAW 105 (212)
T ss_pred HHhhHHHHHHHHHH
Confidence 34579999999973
No 131
>PRK10733 hflB ATP-dependent metalloprotease; Reviewed
Probab=22.08 E-value=47 Score=34.97 Aligned_cols=14 Identities=43% Similarity=0.698 Sum_probs=12.1
Q ss_pred hhHHHHHHhhhhhh
Q 035681 269 DLETVALHEIGHLL 282 (320)
Q Consensus 269 ~l~~Va~HEiGHaL 282 (320)
+...|+.||.||+|
T Consensus 407 ~~~~~a~he~gha~ 420 (644)
T PRK10733 407 QKESTAYHEAGHAI 420 (644)
T ss_pred HHHHHHHHHHHHHH
Confidence 45789999999986
No 132
>PF10026 DUF2268: Predicted Zn-dependent protease (DUF2268); InterPro: IPR018728 This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function.
Probab=20.58 E-value=3.3e+02 Score=23.83 Aligned_cols=18 Identities=22% Similarity=0.301 Sum_probs=15.1
Q ss_pred hhHHHHHHhhhhhhCCCC
Q 035681 269 DLETVALHEIGHLLGLHH 286 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL~H 286 (320)
.+..+++||+-|++=+..
T Consensus 64 ~l~~~iaHE~hH~~r~~~ 81 (195)
T PF10026_consen 64 ELPALIAHEYHHNCRYEQ 81 (195)
T ss_pred HHHHHHHHHHHHHHHHhc
Confidence 789999999999975543
No 133
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=20.21 E-value=56 Score=27.33 Aligned_cols=16 Identities=31% Similarity=0.250 Sum_probs=13.1
Q ss_pred chhHHHHHHhhhhhhC
Q 035681 268 FDLETVALHEIGHLLG 283 (320)
Q Consensus 268 ~~l~~Va~HEiGHaLG 283 (320)
..+..|++||+=|++=
T Consensus 57 ~~l~~~l~HEm~H~~~ 72 (146)
T smart00731 57 DRLRETLLHELCHAAL 72 (146)
T ss_pred HHHHhhHHHHHHHHHH
Confidence 3677899999999763
No 134
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=20.03 E-value=67 Score=23.01 Aligned_cols=23 Identities=30% Similarity=0.332 Sum_probs=19.0
Q ss_pred hhHHHHHHhhhhhhCCCCCCCCC
Q 035681 269 DLETVALHEIGHLLGLHHSSVEG 291 (320)
Q Consensus 269 ~l~~Va~HEiGHaLGL~Hs~~~~ 291 (320)
.-+.-++|.+-|-|||.|.++.+
T Consensus 28 ~~eRriih~la~~lGL~~~s~G~ 50 (60)
T cd02639 28 PAERRIVHLLASRLGLNHVSDGT 50 (60)
T ss_pred HHHHHHHHHHHHHcCCceEEeCC
Confidence 44678999999999999987644
Done!