Query 042321
Match_columns 169
No_of_seqs 154 out of 901
Neff 7.3
Searched_HMMs 46136
Date Fri Mar 29 05:05:50 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/042321.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/042321hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1565 Gelatinase A and relat 100.0 3.3E-39 7.2E-44 281.5 12.4 161 1-168 37-230 (469)
2 cd04278 ZnMc_MMP Zinc-dependen 99.9 7.3E-24 1.6E-28 161.0 9.9 99 68-168 1-126 (157)
3 PF00413 Peptidase_M10: Matrix 99.9 2E-23 4.2E-28 157.0 9.4 97 68-167 1-123 (154)
4 smart00235 ZnMc Zinc-dependent 99.8 1.6E-19 3.6E-24 134.3 9.1 98 66-167 2-104 (140)
5 cd04327 ZnMc_MMP_like_3 Zinc-d 99.7 1.3E-17 2.7E-22 131.5 9.7 95 68-166 2-109 (198)
6 cd04268 ZnMc_MMP_like Zinc-dep 99.7 7.1E-18 1.5E-22 128.1 8.0 95 73-168 1-113 (165)
7 cd04279 ZnMc_MMP_like_1 Zinc-d 99.7 6.6E-17 1.4E-21 122.5 8.7 95 74-168 2-123 (156)
8 cd04277 ZnMc_serralysin_like Z 99.6 1E-15 2.2E-20 119.1 8.1 82 87-168 34-132 (186)
9 cd04280 ZnMc_astacin_like Zinc 99.6 1.4E-14 3.1E-19 112.7 8.1 89 74-166 2-91 (180)
10 PF01400 Astacin: Astacin (Pep 99.4 1.2E-13 2.5E-18 108.5 5.8 93 68-166 2-96 (191)
11 cd04281 ZnMc_BMP1_TLD Zinc-dep 99.4 6.8E-13 1.5E-17 104.9 8.5 95 66-166 7-104 (200)
12 cd00203 ZnMc Zinc-dependent me 99.4 1.7E-12 3.6E-17 98.5 7.3 94 75-168 2-115 (167)
13 cd04282 ZnMc_meprin Zinc-depen 99.4 2.6E-12 5.6E-17 103.5 8.6 94 66-166 43-137 (230)
14 cd04276 ZnMc_MMP_like_2 Zinc-d 99.3 1.8E-12 3.8E-17 102.4 6.2 96 73-168 7-135 (197)
15 cd04283 ZnMc_hatching_enzyme Z 99.2 3.5E-11 7.7E-16 93.9 7.9 88 74-166 4-94 (182)
16 KOG3714 Meprin A metalloprotea 99.2 5.9E-11 1.3E-15 103.1 7.1 96 66-166 79-176 (411)
17 PF02031 Peptidase_M7: Strepto 99.1 1.1E-10 2.5E-15 85.2 5.7 87 75-168 3-96 (132)
18 PF01471 PG_binding_1: Putativ 99.1 9.2E-11 2E-15 74.4 3.5 46 1-54 12-57 (57)
19 PF12388 Peptidase_M57: Dual-a 98.7 1.1E-07 2.4E-12 75.6 8.9 94 73-166 38-150 (211)
20 COG5549 Predicted Zn-dependent 97.9 1.1E-05 2.5E-10 63.7 3.9 76 88-166 105-205 (236)
21 PF13583 Reprolysin_4: Metallo 97.8 1.3E-05 2.9E-10 63.6 2.4 20 149-168 137-156 (206)
22 PF13582 Reprolysin_3: Metallo 97.7 1.8E-05 3.8E-10 57.1 1.2 18 149-166 107-124 (124)
23 TIGR02869 spore_SleB spore cor 97.4 0.00018 3.8E-09 57.1 3.8 36 22-57 36-71 (201)
24 PF13688 Reprolysin_5: Metallo 97.3 0.00011 2.4E-09 57.1 1.7 20 149-168 142-161 (196)
25 PF13574 Reprolysin_2: Metallo 97.3 0.00012 2.6E-09 56.5 1.8 20 149-168 111-130 (173)
26 PF05572 Peptidase_M43: Pregna 97.2 0.00013 2.9E-09 55.4 1.6 19 149-167 69-87 (154)
27 cd04269 ZnMc_adamalysin_II_lik 97.1 0.00027 5.9E-09 55.0 1.8 19 149-167 131-149 (194)
28 cd04267 ZnMc_ADAM_like Zinc-de 97.0 0.00026 5.6E-09 55.0 1.3 21 147-167 131-151 (192)
29 PF11350 DUF3152: Protein of u 97.0 0.0036 7.7E-08 49.5 7.2 93 73-166 28-156 (203)
30 PRK13267 archaemetzincin-like 96.9 0.00054 1.2E-08 53.4 1.9 20 149-168 125-144 (179)
31 cd04275 ZnMc_pappalysin_like Z 96.8 0.00013 2.9E-09 58.7 -2.0 21 147-167 135-155 (225)
32 PF07998 Peptidase_M54: Peptid 96.8 0.00063 1.4E-08 53.7 1.6 18 149-166 145-162 (194)
33 cd04270 ZnMc_TACE_like Zinc-de 96.7 0.0008 1.7E-08 54.7 1.8 19 150-168 168-186 (244)
34 COG1913 Predicted Zn-dependent 96.7 0.00072 1.6E-08 52.3 1.4 20 149-168 124-143 (181)
35 cd04273 ZnMc_ADAMTS_like Zinc- 96.7 0.00033 7.2E-09 55.3 -0.5 75 92-168 72-159 (207)
36 cd04271 ZnMc_ADAM_fungal Zinc- 96.7 0.00062 1.3E-08 54.9 0.8 17 151-167 147-163 (228)
37 PF01421 Reprolysin: Reprolysi 96.6 0.00091 2E-08 52.3 1.5 20 149-168 131-150 (199)
38 cd04272 ZnMc_salivary_gland_MP 96.5 0.0014 3E-08 52.3 1.7 20 148-167 144-163 (220)
39 PRK10594 murein L,D-transpepti 96.5 0.0029 6.4E-08 57.6 3.9 37 21-57 307-343 (608)
40 TIGR03296 M6dom_TIGR03296 M6 f 96.2 0.0013 2.8E-08 54.7 -0.0 19 150-168 166-184 (286)
41 COG2989 Uncharacterized protei 95.6 0.012 2.7E-07 52.6 3.7 37 21-57 263-299 (561)
42 PF10462 Peptidase_M66: Peptid 95.3 0.0083 1.8E-07 50.5 1.4 17 149-165 193-209 (305)
43 PF05548 Peptidase_M11: Gameto 95.2 0.0094 2E-07 50.4 1.4 19 149-167 150-168 (314)
44 PF05547 Peptidase_M6: Immune 94.5 0.0067 1.5E-07 55.8 -1.2 19 150-168 222-240 (645)
45 PF04298 Zn_peptidase_2: Putat 94.4 0.048 1E-06 43.9 3.5 69 92-162 21-102 (222)
46 PF12044 Metallopep: Putative 93.2 0.049 1.1E-06 47.8 1.8 18 150-167 317-334 (423)
47 COG3409 Putative peptidoglycan 93.2 0.067 1.5E-06 40.6 2.3 35 23-57 70-105 (185)
48 PF09471 Peptidase_M64: IgA Pe 92.1 0.081 1.7E-06 43.7 1.6 20 148-167 215-235 (264)
49 KOG3607 Meltrins, fertilins an 92.1 0.081 1.8E-06 49.4 1.7 20 148-167 322-341 (716)
50 COG3824 Predicted Zn-dependent 92.1 0.062 1.3E-06 39.3 0.8 17 149-165 109-125 (136)
51 PF01457 Peptidase_M8: Leishma 91.7 0.094 2E-06 47.2 1.7 17 149-165 210-226 (521)
52 TIGR02290 M3_fam_3 oligoendope 91.3 0.11 2.4E-06 47.2 1.8 16 146-161 372-387 (587)
53 PF06262 DUF1025: Possibl zinc 90.4 0.17 3.6E-06 35.6 1.6 17 149-165 73-89 (97)
54 PTZ00337 surface protease GP63 90.0 0.16 3.4E-06 46.3 1.5 17 149-165 229-245 (567)
55 KOG3658 Tumor necrosis factor- 88.7 0.16 3.5E-06 46.8 0.6 21 149-169 392-412 (764)
56 KOG4525 Jacalin-like lectin do 88.4 0.24 5.1E-06 43.7 1.4 18 150-167 302-319 (614)
57 PF09374 PG_binding_3: Predict 88.0 0.53 1.1E-05 31.1 2.6 29 29-57 1-33 (72)
58 PF14247 DUF4344: Domain of un 87.6 0.29 6.3E-06 39.4 1.4 13 149-161 92-104 (220)
59 COG4783 Putative Zn-dependent 87.3 0.3 6.4E-06 43.4 1.4 15 148-162 129-143 (484)
60 PTZ00257 Glycoprotein GP63 (le 86.8 0.37 7.9E-06 44.3 1.7 16 149-164 256-271 (622)
61 COG2738 Predicted Zn-dependent 86.0 0.69 1.5E-05 36.7 2.6 78 89-168 21-111 (226)
62 PF06114 DUF955: Domain of unk 85.4 0.46 9.9E-06 32.9 1.3 13 149-161 42-54 (122)
63 COG2856 Predicted Zn peptidase 84.5 0.43 9.4E-06 38.2 0.9 13 149-161 72-84 (213)
64 TIGR02289 M3_not_pepF oligoend 83.5 1.2 2.6E-05 40.3 3.4 17 145-161 333-349 (549)
65 PF01435 Peptidase_M48: Peptid 82.7 0.71 1.5E-05 36.0 1.5 15 148-162 88-102 (226)
66 cd06161 S2P-M50_SpoIVFB SpoIVF 81.8 0.77 1.7E-05 36.3 1.4 13 149-161 38-50 (208)
67 PRK03982 heat shock protein Ht 81.0 0.82 1.8E-05 37.9 1.3 14 148-161 124-137 (288)
68 cd06459 M3B_Oligoendopeptidase 80.6 1.7 3.8E-05 37.3 3.3 16 146-161 219-234 (427)
69 PRK03001 M48 family peptidase; 79.8 0.95 2.1E-05 37.4 1.3 14 148-161 123-136 (283)
70 COG0501 HtpX Zn-dependent prot 78.9 1.1 2.3E-05 36.7 1.4 14 148-161 156-169 (302)
71 PRK03072 heat shock protein Ht 78.9 1 2.3E-05 37.4 1.3 14 148-161 126-139 (288)
72 cd06258 Peptidase_M3_like The 78.8 0.99 2.2E-05 38.3 1.2 16 146-161 151-166 (365)
73 PRK01345 heat shock protein Ht 78.8 1.1 2.3E-05 38.0 1.3 14 148-161 123-136 (317)
74 TIGR00181 pepF oligoendopeptid 78.8 2 4.4E-05 39.0 3.2 17 145-161 374-390 (591)
75 PLN02791 Nudix hydrolase homol 78.2 1.1 2.4E-05 42.4 1.3 17 149-165 585-601 (770)
76 cd06164 S2P-M50_SpoIVFB_CBS Sp 77.0 1.3 2.9E-05 35.6 1.4 13 149-161 53-65 (227)
77 PRK04897 heat shock protein Ht 76.9 1.3 2.8E-05 37.0 1.3 14 148-161 136-149 (298)
78 PF01432 Peptidase_M3: Peptida 76.6 1.3 2.8E-05 38.9 1.3 15 147-161 240-254 (458)
79 cd06162 S2P-M50_PDZ_SREBP Ster 76.0 1.5 3.2E-05 36.6 1.4 13 149-161 135-147 (277)
80 PRK01265 heat shock protein Ht 75.2 1.5 3.3E-05 37.2 1.3 13 148-160 139-151 (324)
81 PRK05457 heat shock protein Ht 75.0 1.6 3.4E-05 36.4 1.3 13 148-160 133-145 (284)
82 PF01447 Peptidase_M4: Thermol 75.0 1.6 3.5E-05 33.0 1.3 12 150-161 136-147 (150)
83 cd06455 M3A_TOP Peptidase M3 T 74.8 1.5 3.3E-05 38.9 1.2 14 148-161 262-275 (472)
84 PF01863 DUF45: Protein of unk 74.5 9.3 0.0002 29.4 5.5 71 88-167 110-182 (205)
85 PF08823 PG_binding_2: Putativ 74.4 3.4 7.4E-05 27.5 2.6 34 23-57 17-50 (74)
86 PRK02391 heat shock protein Ht 74.4 1.7 3.6E-05 36.5 1.3 13 148-160 132-144 (296)
87 cd06159 S2P-M50_PDZ_Arch Uncha 74.3 1.7 3.7E-05 35.8 1.4 13 149-161 118-130 (263)
88 PRK02870 heat shock protein Ht 73.8 1.8 3.8E-05 37.0 1.4 13 148-160 172-184 (336)
89 cd06457 M3A_MIP Peptidase M3 m 73.5 1.7 3.7E-05 38.5 1.2 24 88-111 132-155 (458)
90 KOG3538 Disintegrin metallopro 73.2 0.83 1.8E-05 43.5 -0.8 23 145-167 313-335 (845)
91 PF13485 Peptidase_MA_2: Pepti 73.2 2 4.4E-05 29.8 1.4 13 149-161 25-37 (128)
92 cd06160 S2P-M50_like_2 Unchara 71.7 2.2 4.8E-05 33.2 1.4 14 149-162 41-54 (183)
93 KOG2921 Intramembrane metallop 71.6 2 4.3E-05 37.6 1.2 13 149-161 131-143 (484)
94 cd06456 M3A_DCP_Oligopeptidase 70.6 2.2 4.7E-05 37.5 1.2 16 146-161 205-220 (422)
95 PF03571 Peptidase_M49: Peptid 68.2 2.4 5.2E-05 38.6 1.0 14 149-162 293-307 (549)
96 PF08823 PG_binding_2: Putativ 67.1 9.1 0.0002 25.4 3.4 46 1-54 25-74 (74)
97 PF04228 Zn_peptidase: Putativ 67.1 2.6 5.7E-05 35.3 1.0 13 149-161 170-182 (292)
98 PF03272 Enhancin: Viral enhan 64.4 3.3 7.2E-05 39.3 1.2 66 93-161 173-248 (775)
99 PF11150 DUF2927: Protein of u 64.3 34 0.00074 27.3 6.8 57 67-127 28-87 (213)
100 cd06460 M32_Taq Peptidase fami 64.1 3.5 7.6E-05 36.1 1.2 13 149-161 159-171 (396)
101 PF12725 DUF3810: Protein of u 63.3 3.8 8.2E-05 34.7 1.2 18 147-164 194-211 (318)
102 PRK10911 oligopeptidase A; Pro 61.8 4 8.6E-05 38.1 1.2 14 148-161 462-475 (680)
103 COG1164 Oligoendopeptidase F [ 60.2 4.2 9.1E-05 37.4 1.0 16 146-161 377-392 (598)
104 COG4412 Uncharacterized protei 60.1 2.4 5.2E-05 38.6 -0.5 17 152-168 349-365 (760)
105 COG3023 ampD N-acetyl-anhydrom 59.2 14 0.00031 30.4 3.8 34 21-54 217-251 (257)
106 KOG1047 Bifunctional leukotrie 59.1 6.8 0.00015 35.8 2.1 30 131-160 264-299 (613)
107 KOG2719 Metalloprotease [Gener 56.3 5.9 0.00013 35.0 1.2 13 148-160 279-291 (428)
108 PF08434 CLCA_N: Calcium-activ 52.8 9.9 0.00022 31.4 2.0 21 113-134 98-118 (262)
109 PRK10280 dipeptidyl carboxypep 48.7 9 0.00019 35.9 1.2 34 88-121 362-398 (681)
110 PRK10733 hflB ATP-dependent me 48.5 9.1 0.0002 35.5 1.2 13 149-161 408-420 (644)
111 KOG2661 Peptidase family M48 [ 48.4 9.9 0.00022 32.8 1.3 13 149-161 275-287 (424)
112 CHL00176 ftsH cell division pr 45.8 10 0.00022 35.2 1.1 13 149-161 438-450 (638)
113 PF06167 Peptidase_M90: Glucos 45.6 9.6 0.00021 31.3 0.8 18 147-164 152-169 (253)
114 COG4227 Antirestriction protei 45.2 11 0.00024 31.4 1.1 18 148-165 202-219 (316)
115 COG3227 LasB Zinc metalloprote 44.5 8.9 0.00019 34.4 0.5 12 150-161 338-349 (507)
116 cd02641 R3H_Smubp-2_like R3H d 42.0 15 0.00033 23.2 1.2 18 149-166 29-46 (60)
117 PF13699 DUF4157: Domain of un 41.8 13 0.00028 24.9 0.8 13 149-161 61-73 (79)
118 PF02074 Peptidase_M32: Carbox 41.2 12 0.00026 33.7 0.8 12 150-161 260-271 (494)
119 PF06540 GMAP: Galanin message 39.9 14 0.00031 23.7 0.8 25 24-48 22-46 (62)
120 COG2317 Zn-dependent carboxype 39.5 13 0.00028 33.4 0.7 9 153-161 263-271 (497)
121 COG3590 PepO Predicted metallo 39.0 8.7 0.00019 35.3 -0.4 13 149-161 487-499 (654)
122 cd06461 M2_ACE Peptidase famil 38.2 17 0.00037 32.6 1.3 17 145-161 243-259 (477)
123 PF04315 DUF462: Protein of un 36.1 21 0.00045 27.5 1.3 15 147-161 39-53 (164)
124 PF12315 DUF3633: Protein of u 36.0 21 0.00045 28.6 1.3 13 149-161 93-105 (212)
125 PF14891 Peptidase_M91: Effect 35.7 21 0.00046 27.3 1.3 13 149-161 103-115 (174)
126 TIGR01241 FtsH_fam ATP-depende 35.5 19 0.00041 32.1 1.1 13 149-161 311-323 (495)
127 TIGR02412 pepN_strep_liv amino 34.0 29 0.00062 33.2 2.2 12 149-160 287-298 (831)
128 smart00731 SprT SprT homologue 33.9 22 0.00049 26.2 1.2 13 149-161 59-71 (146)
129 PF13402 M60-like: Peptidase M 33.5 19 0.00042 29.5 0.9 12 150-161 220-231 (307)
130 COG0339 Dcp Zn-dependent oligo 33.0 20 0.00043 33.6 0.9 35 87-121 362-398 (683)
131 cd02639 R3H_RRM R3H domain of 32.8 25 0.00053 22.4 1.1 19 149-167 29-47 (60)
132 COG2321 Predicted metalloprote 30.8 23 0.00049 29.7 0.8 12 149-160 167-178 (295)
133 TIGR02411 leuko_A4_hydro leuko 30.5 31 0.00067 31.8 1.7 12 149-160 279-290 (601)
134 KOG2987 Fatty acid desaturase 29.3 26 0.00056 29.2 0.9 20 149-168 83-102 (324)
135 PRK15410 DgsA anti-repressor M 29.1 22 0.00047 29.4 0.5 19 147-165 140-158 (260)
136 PF15639 Tox-MPTase3: Metallop 27.7 1.3E+02 0.0029 22.3 4.2 69 90-159 23-110 (135)
137 COG4784 Putative Zn-dependent 27.5 31 0.00067 30.0 1.1 15 147-161 122-136 (479)
138 PF10263 SprT-like: SprT-like 27.4 37 0.0008 24.9 1.4 14 148-161 59-72 (157)
139 TIGR00868 hCaCC calcium-activa 27.2 67 0.0014 31.1 3.3 20 149-168 149-169 (863)
140 COG3926 zliS Lysozyme family p 25.2 73 0.0016 26.0 2.7 30 28-57 94-127 (252)
141 PF08571 Yos1: Yos1-like; Int 25.0 1.2E+02 0.0026 20.5 3.4 41 1-41 24-64 (80)
142 cd02640 R3H_NRF R3H domain of 23.1 51 0.0011 20.9 1.3 18 149-166 29-46 (60)
143 KOG3624 M13 family peptidase [ 22.4 40 0.00087 31.3 0.9 13 149-161 518-530 (687)
144 KOG2090 Metalloendopeptidase f 22.3 42 0.00091 31.2 1.0 13 149-161 479-491 (704)
145 PF08634 Pet127: Mitochondrial 21.9 89 0.0019 26.1 2.7 55 38-101 154-214 (274)
146 COG5192 BMS1 GTP-binding prote 21.3 53 0.0012 30.7 1.4 22 36-57 775-796 (1077)
147 PF07236 Phytoreo_S7: Phytoreo 21.2 32 0.00069 30.7 0.0 33 25-57 204-236 (506)
148 PF15127 DUF4565: Protein of u 21.1 63 0.0014 22.4 1.5 15 88-102 62-76 (91)
149 PF13590 DUF4136: Domain of un 20.9 1.8E+02 0.0038 20.8 4.0 41 77-125 19-69 (151)
150 cd06007 R3H_DEXH_helicase R3H 20.4 63 0.0014 20.4 1.3 18 149-166 28-45 (59)
No 1
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=100.00 E-value=3.3e-39 Score=281.55 Aligned_cols=161 Identities=29% Similarity=0.374 Sum_probs=142.3
Q ss_pred CCCccCCCCCCCCCCCCCCCcchhHHHHHHHHHHHHhcCCCccccCChHHhhhhhcC-----Ccccc-ccCCCCCCCCCc
Q 042321 1 MGKFGYLNYDNSNNQTHANDDDFDELLESAIKNYQLNYHLKSTEVLDAGTVGKMRTH-----DAELQ-ILGSPRWPASKF 74 (169)
Q Consensus 1 L~~fGYl~~~~~~~~~~~~~~~~~~~~~~ai~~~Q~~~gL~~TG~lD~~T~~~m~~p-----D~~~~-~~g~~kW~~~k~ 74 (169)
|++|||+.+.+...... .++.+++||+.||++++|++||++|.+|++.|++| |-+|+ +++.+||+ +.
T Consensus 37 l~~~~y~~~~~~~~~~~-----~~~~~~~al~~~q~~~~l~~tG~lD~~Tl~~m~~prCgvpd~~~~~~~~~~kW~--k~ 109 (469)
T KOG1565|consen 37 LECYGYLPPTDLTATRA-----SQNVLEDALKMMQDFFGLPVTGKLDNATLALMNKPRCGVPDGRYRYFPGKPKWN--KE 109 (469)
T ss_pred hhhcccCCCcccccccc-----CchhhHHHHHhhhhhcCccccCCcchhhhhhccCCCcCCCCCccccCcccCccc--cc
Confidence 67899999887543211 36889999999999999999999999999999999 65677 77889999 99
Q ss_pred ceEEEeccCCch----hhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCC--------------
Q 042321 75 SLTYAFLPGTRG----DAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAV-------------- 136 (169)
Q Consensus 75 ~LTy~~~~~~~~----~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~-------------- 136 (169)
+|||+|.+++++ +++.++++||++|++|++|+|.|+.....|||+|.|..+.|||++||||.
T Consensus 110 ~lT~ri~n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~~h~d~~PFDG~~g~laHAf~Pg~~~ 189 (469)
T KOG1565|consen 110 HLTYRIKNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADIRISFFPGDHGDGFPFDGPGGVLAHAFFPGPGI 189 (469)
T ss_pred ccceeccccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCceeeeeeccCCCCCCcccCCCCceecccCCCCCC
Confidence 999999999755 79999999999999999999999998569999999999999999999953
Q ss_pred --------CCceee-cccccchHHHHHHhhhhhcCCCCccC
Q 042321 137 --------VESWLM-LQLLLESQSVCMELIFKELGIFINYN 168 (169)
Q Consensus 137 --------~e~w~~-~~~~~~~~~va~HEiGHaLGL~h~~~ 168 (169)
+|.|+. ....+|++.||+|||||+|||+|+.+
T Consensus 190 ~G~~hfD~dE~Wt~~~~~g~~l~~Va~HEiGH~LGL~HS~~ 230 (469)
T KOG1565|consen 190 GGDLHFDKDETWTYGDSNGVDLFLVAAHEIGHALGLGHSSD 230 (469)
T ss_pred CCccccCcccceeccCCccchhHHHhhhhcccccccCCCCC
Confidence 577876 45677899999999999999999965
No 2
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=99.90 E-value=7.3e-24 Score=161.03 Aligned_cols=99 Identities=32% Similarity=0.460 Sum_probs=81.7
Q ss_pred CCCCCCcceEEEeccCCc----hhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCC------
Q 042321 68 RWPASKFSLTYAFLPGTR----GDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVV------ 137 (169)
Q Consensus 68 kW~~~k~~LTy~~~~~~~----~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~------ 137 (169)
||+ +++|||+|.+..+ .+++++|++||+.|+++++|+|+|+.....|||+|.|.+..|+|+++|++.+
T Consensus 1 kW~--~~~itY~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~g~l~~a 78 (157)
T cd04278 1 KWS--KTNLTYRILNYPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPGGTLAHA 78 (157)
T ss_pred CCC--CCceeEEEECCCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCccccccc
Confidence 799 9999999987653 3789999999999999999999999876689999999999999877887532
Q ss_pred ---------------Cceeec--ccccchHHHHHHhhhhhcCCCCccC
Q 042321 138 ---------------ESWLML--QLLLESQSVCMELIFKELGIFINYN 168 (169)
Q Consensus 138 ---------------e~w~~~--~~~~~~~~va~HEiGHaLGL~h~~~ 168 (169)
+.|+.. ....+++.|++|||||||||.|+.+
T Consensus 79 ~~p~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~ 126 (157)
T cd04278 79 FFPGGIGGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHALGLGHSSD 126 (157)
T ss_pred cCCCCcceeEEECCCcccccCCCCccchHHHHHHHHhccccccCCCCC
Confidence 123321 2345699999999999999999754
No 3
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=99.90 E-value=2e-23 Score=156.97 Aligned_cols=97 Identities=33% Similarity=0.496 Sum_probs=78.6
Q ss_pred CCCCCCcceEEEeccCCch----hhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCCC-----
Q 042321 68 RWPASKFSLTYAFLPGTRG----DAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVVE----- 138 (169)
Q Consensus 68 kW~~~k~~LTy~~~~~~~~----~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~e----- 138 (169)
||+ +++|||+|.+.++. ++++++++||+.|+++++|+|.++... .|||+|.|....+++..+|++...
T Consensus 1 ~W~--~~~ity~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~~-~adi~i~~~~~~~~~~~~~~~~~~~~~~~ 77 (154)
T PF00413_consen 1 KWP--KKTITYSISNTTPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSDG-NADIRISFGSNNHGDGYSFDGSGGTLAHA 77 (154)
T ss_dssp SSS--SSEEEEEESSBCTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESSS-SCSEEEEEESSSSSSSS-CSSSSSESEEE
T ss_pred CCC--CCcEEEEEECCCCCCCHHHHHHHHHHHHHHHHhcCCceEEeccCC-CcceeeeeeccccCcccccccceeeeecc
Confidence 799 99999999986643 689999999999999999999999864 699999999999988877774321
Q ss_pred -----------------ceeecccccchHHHHHHhhhhhcCCCCcc
Q 042321 139 -----------------SWLMLQLLLESQSVCMELIFKELGIFINY 167 (169)
Q Consensus 139 -----------------~w~~~~~~~~~~~va~HEiGHaLGL~h~~ 167 (169)
.|.......++..|++|||||||||.|+-
T Consensus 78 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~v~~HEiGHaLGL~H~~ 123 (154)
T PF00413_consen 78 YFPNNIVSGDIHFNDDESWTIDDSGNDLQSVAIHEIGHALGLDHSN 123 (154)
T ss_dssp EESSSTTTTEEEEETTSHEESSSSSEEHHHHHHHHHHHHTTBESSS
T ss_pred ccccccccccccccccccchhhhhhhhhhhhhhhccccccCcCcCC
Confidence 12111223458999999999999999974
No 4
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.80 E-value=1.6e-19 Score=134.28 Aligned_cols=98 Identities=27% Similarity=0.392 Sum_probs=72.9
Q ss_pred CCCCCCCCcceEEEec-c-CCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCC-CCCCCCCC--CCCce
Q 042321 66 SPRWPASKFSLTYAFL-P-GTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDH-GAGALSTA--VVESW 140 (169)
Q Consensus 66 ~~kW~~~k~~LTy~~~-~-~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~H-gd~~pfd~--~~e~w 140 (169)
..+|+ +.+|+|.|. + .++.+++++|++||+.|+++++|+|+|+.. .|||+|.|....| |+..++.+ .+..+
T Consensus 2 ~~~W~--~~~v~Y~i~~~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~--~ad~~I~f~~~~~~g~~~a~~g~~~g~~~ 77 (140)
T smart00235 2 SKKWP--KGTVPYVIDSSSLSPEEVREAIARAFAEWSDVTCLRFVERTS--TADIYISFGKGDGSGCTLSHAGRPGGDQH 77 (140)
T ss_pred CCcCC--CCcEEEEEcCCCCCHHHHHHHHHHHHHHHhcCCeeEEEECCC--CCCeEEEEEECCCCCcceeeeecCCCceE
Confidence 46999 899999996 3 333356999999999999999999999865 7999999999988 55212222 12222
Q ss_pred eecccccchHHHHHHhhhhhcCCCCcc
Q 042321 141 LMLQLLLESQSVCMELIFKELGIFINY 167 (169)
Q Consensus 141 ~~~~~~~~~~~va~HEiGHaLGL~h~~ 167 (169)
...+..+....|++|||||||||.|.-
T Consensus 78 ~~~~~~~~~~~~~~HEigHaLGl~H~~ 104 (140)
T smart00235 78 FSLGNGCINTGVAAHELGHALGLYHEQ 104 (140)
T ss_pred EEccCCcCCcccHHHHHHHHhcCCcCC
Confidence 112333334569999999999999974
No 5
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.74 E-value=1.3e-17 Score=131.46 Aligned_cols=95 Identities=28% Similarity=0.334 Sum_probs=73.2
Q ss_pred CCCCCCcceEEEeccCCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCCCc---e---e
Q 042321 68 RWPASKFSLTYAFLPGTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVVES---W---L 141 (169)
Q Consensus 68 kW~~~k~~LTy~~~~~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~e~---w---~ 141 (169)
+|++ .+.|+|.|.+..+.+.+++|++||+.|+.+++|+|+++... .+||+|.|..+. |+ +++.|.... . +
T Consensus 2 ~W~~-~~~~~~~f~~~~~~~~r~~I~~A~~~W~~~t~i~F~~~~~~-~adi~I~f~~~~-Gc-~S~vG~~~~~~~~~~~t 77 (198)
T cd04327 2 LWRN-GTVLRIAFLGGPDAFLKDKVRAAAREWLPYANLKFKFVTDA-DADIRISFTPGD-GY-WSYVGTDALLIGADAPT 77 (198)
T ss_pred CCCC-CCeEEEEeCCCCcHHHHHHHHHHHHHHhhhcCeEEEECCCC-CCCEEEEEecCC-CC-CCCcCCcccccCCCCce
Confidence 7986 67899999887666789999999999999999999999864 799999999854 55 445543210 1 1
Q ss_pred e-ccc------ccchHHHHHHhhhhhcCCCCc
Q 042321 142 M-LQL------LLESQSVCMELIFKELGIFIN 166 (169)
Q Consensus 142 ~-~~~------~~~~~~va~HEiGHaLGL~h~ 166 (169)
+ ++. .++++.|++|||||||||.|.
T Consensus 78 ~~l~~~~~~~~~~~~~~~i~HElgHaLG~~HE 109 (198)
T cd04327 78 MNLGWFTDDTPDPEFSRVVLHEFGHALGFIHE 109 (198)
T ss_pred eeeeeecCCCchhhHHHHHHHHHHHHhcCccc
Confidence 1 221 345678999999999999985
No 6
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.74 E-value=7.1e-18 Score=128.13 Aligned_cols=95 Identities=19% Similarity=0.173 Sum_probs=71.0
Q ss_pred CcceEEEeccCCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCC---CCCCCCCCC------CCCceee-
Q 042321 73 KFSLTYAFLPGTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGD---HGAGALSTA------VVESWLM- 142 (169)
Q Consensus 73 k~~LTy~~~~~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~---Hgd~~pfd~------~~e~w~~- 142 (169)
++.|+|.|.+..+...+++|++||+.|+++++++|+|+.....+||+|.|.+.. .|+ +.+.+ ..+-|..
T Consensus 1 ~~~I~y~i~~~~~~~~r~~i~~A~~~W~~~~~i~F~e~~~~~~~di~i~~~~~~~~~~~~-~~~~~~~~~~~~g~i~~~~ 79 (165)
T cd04268 1 KKPITYYIDDSVPDKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYNDGT-WSYGPSQVDPLTGEILLAR 79 (165)
T ss_pred CCCEEEEEcCCCCHHHHHHHHHHHHHHHHHhCcCceeccccCccCceEEEEEeecCCCCc-cccCCccCCCCCccEEeeE
Confidence 467999999887778899999999999999999999998755799999998743 232 22222 1222211
Q ss_pred ccc--------ccchHHHHHHhhhhhcCCCCccC
Q 042321 143 LQL--------LLESQSVCMELIFKELGIFINYN 168 (169)
Q Consensus 143 ~~~--------~~~~~~va~HEiGHaLGL~h~~~ 168 (169)
... +..++.|++|||||||||.|+.+
T Consensus 80 ~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~ 113 (165)
T cd04268 80 VYLYSSFVEYSGARLRNTAEHELGHALGLRHNFA 113 (165)
T ss_pred EEEchhHHHHHHHHHHHHHHHHHHHHhcccccCc
Confidence 111 12378999999999999999864
No 7
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.70 E-value=6.6e-17 Score=122.54 Aligned_cols=95 Identities=16% Similarity=0.169 Sum_probs=67.2
Q ss_pred cceEEEeccCC------chhhHHHHHHHHHhhccCCCcceEeccCC-CcccEEEEeecC----CCCCC-----CCCC---
Q 042321 74 FSLTYAFLPGT------RGDAINPVARAFQTWAPNTQFQFAESQDY-RNADIKISFESG----DHGAG-----ALST--- 134 (169)
Q Consensus 74 ~~LTy~~~~~~------~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~-~~ADI~I~F~~~----~Hgd~-----~pfd--- 134 (169)
..|+|++.+.+ ..+.+++|++||+.|+++++|+|+++... .+|||+|.|... .-|+. .|..
T Consensus 2 ~~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~~~~~~g~~~a~a~~p~~~~~ 81 (156)
T cd04279 2 SPIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPPPVGGAGGGLARAGFPLISDG 81 (156)
T ss_pred CCeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCCCCCCCCCceEEecccccCCC
Confidence 46888888754 22789999999999999999999998753 279999999963 11221 1111
Q ss_pred CCCCceee-c--c-----cccchHHHHHHhhhhhcCCCCccC
Q 042321 135 AVVESWLM-L--Q-----LLLESQSVCMELIFKELGIFINYN 168 (169)
Q Consensus 135 ~~~e~w~~-~--~-----~~~~~~~va~HEiGHaLGL~h~~~ 168 (169)
+....|.. . . ...+++.||+|||||+|||.|+.+
T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~ 123 (156)
T cd04279 82 NRKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSD 123 (156)
T ss_pred cccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCC
Confidence 11122221 1 1 145799999999999999999875
No 8
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.63 E-value=1e-15 Score=119.15 Aligned_cols=82 Identities=18% Similarity=0.219 Sum_probs=58.7
Q ss_pred hhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCC--CC---CCCC-----CCCceeec-------ccccch
Q 042321 87 DAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGA--GA---LSTA-----VVESWLML-------QLLLES 149 (169)
Q Consensus 87 ~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd--~~---pfd~-----~~e~w~~~-------~~~~~~ 149 (169)
+.++++++||+.|+.+++|+|+|+.....|||+|.+....++. ++ |-.. .+.-|.-. ..+...
T Consensus 34 ~~~~~i~~A~~~w~~~~~l~F~e~~~~~~adI~i~~~~~~~~~~~g~a~~p~~~~~~~~~g~i~~~~~~~~~~~~~g~~~ 113 (186)
T cd04277 34 AQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPDGNTAGYAYYPGSGSGTAYGGDIWFNSSYDTNSDSPGSYG 113 (186)
T ss_pred HHHHHHHHHHHHHHhhcCceeEECCCCCcceEEEEeccCCCCCccEEEECCCCCccccccceeEEecCcccccCCCChhh
Confidence 5789999999999999999999998866899999998864221 11 2111 11222110 013347
Q ss_pred HHHHHHhhhhhcCCCCccC
Q 042321 150 QSVCMELIFKELGIFINYN 168 (169)
Q Consensus 150 ~~va~HEiGHaLGL~h~~~ 168 (169)
..|++|||||||||.|..+
T Consensus 114 ~~t~~HEiGHaLGL~H~~~ 132 (186)
T cd04277 114 YQTIIHEIGHALGLEHPGD 132 (186)
T ss_pred HHHHHHHHHHHhcCCCCCc
Confidence 8899999999999999753
No 9
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=99.55 E-value=1.4e-14 Score=112.70 Aligned_cols=89 Identities=16% Similarity=0.131 Sum_probs=66.0
Q ss_pred cceEEEeccCCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCCCceee-cccccchHHH
Q 042321 74 FSLTYAFLPGTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVVESWLM-LQLLLESQSV 152 (169)
Q Consensus 74 ~~LTy~~~~~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~e~w~~-~~~~~~~~~v 152 (169)
..|.|.|.+.-+...++.|++|++.|+++|+|+|+|+.. ++..|.|..+ .||.+........-.+ ++.+|....+
T Consensus 2 ~~VpY~i~~~~~~~~~~~I~~A~~~w~~~TcIrF~~~~~---~~~~I~f~~~-~Gc~S~vG~~~~~q~i~l~~~c~~~g~ 77 (180)
T cd04280 2 GTVPYVIDGSFDESDRSLILRAMREIESNTCIRFVPRTT---EKDYIRIVKG-SGCWSYVGRVGGRQVVSLGSGCFSLGT 77 (180)
T ss_pred CEEEEEECCCCCHHHHHHHHHHHHHHHhCCcceEEECCC---CCcEEEEEcC-CCcceecCccCCceeEEeCCCcCcCch
Confidence 468999988545578999999999999999999999864 5567777776 5664433322211112 4445666789
Q ss_pred HHHhhhhhcCCCCc
Q 042321 153 CMELIFKELGIFIN 166 (169)
Q Consensus 153 a~HEiGHaLGL~h~ 166 (169)
++|||||||||.|.
T Consensus 78 v~HE~~HalG~~HE 91 (180)
T cd04280 78 IVHELMHALGFYHE 91 (180)
T ss_pred hHHHHHHHhcCcch
Confidence 99999999999985
No 10
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=99.45 E-value=1.2e-13 Score=108.50 Aligned_cols=93 Identities=23% Similarity=0.300 Sum_probs=64.6
Q ss_pred CCCCCCcceEEEeccCCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCCCceee--ccc
Q 042321 68 RWPASKFSLTYAFLPGTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVVESWLM--LQL 145 (169)
Q Consensus 68 kW~~~k~~LTy~~~~~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~e~w~~--~~~ 145 (169)
||+ +.+|.|.|.+..+...++.|++|+..|++.|+++|+++. . .....|.|..+ .|| +++-|..+.... ++.
T Consensus 2 ~Wp--~~~IpY~~~~~~~~~~~~~I~~Am~~~e~~TcI~F~~~~-~-~~~~~i~~~~~-~gC-~S~vG~~~g~q~i~l~~ 75 (191)
T PF01400_consen 2 KWP--NGTIPYYIDPSFSSSQRQRIRKAMDEWEKNTCIRFVERT-E-NEDDYISFSNG-SGC-WSYVGRQGGEQTINLGD 75 (191)
T ss_dssp S-G--GGEEEEEEETTS-HHHHHHHHHHHHHHHHHSSEEEEE-S-S-SSSSEEEEESS-SSE-EEESS--SSEEEEEE-T
T ss_pred cCC--CCEEEEEECCCCCHHHHHHHHHHHHHHHhCCCEEEEECC-C-CCceEEEeecC-ccc-cchhhhcCcceeEEecc
Confidence 899 889999999865557789999999999999999999998 3 44556677654 355 344454333322 445
Q ss_pred ccchHHHHHHhhhhhcCCCCc
Q 042321 146 LLESQSVCMELIFKELGIFIN 166 (169)
Q Consensus 146 ~~~~~~va~HEiGHaLGL~h~ 166 (169)
++....|++|||||||||.|.
T Consensus 76 ~c~~~~~i~HEl~HaLG~~HE 96 (191)
T PF01400_consen 76 GCFSVGTILHELGHALGFWHE 96 (191)
T ss_dssp TC-SHHHHHHHHHHHHTB--G
T ss_pred eeCCccchHHHHHHHHhhhhh
Confidence 555678999999999999985
No 11
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=99.42 E-value=6.8e-13 Score=104.92 Aligned_cols=95 Identities=14% Similarity=0.176 Sum_probs=69.1
Q ss_pred CCCCCCCCcceEEEeccCCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCC--Cceee-
Q 042321 66 SPRWPASKFSLTYAFLPGTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVV--ESWLM- 142 (169)
Q Consensus 66 ~~kW~~~k~~LTy~~~~~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~--e~w~~- 142 (169)
..+|+ +..|-|.|...-+...+++|++|+..|++.|+++|++... ..-.|.|..+..||. ++-|.. ..-.+
T Consensus 7 ~~~Wp--~~~VpY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~r~~---~~~yi~f~~~~~GC~-S~vG~~~~g~q~is 80 (200)
T cd04281 7 ERIWP--GGVIPYVIDGNFTGSQRAMFKQAMRHWENFTCVTFVERTP---EENYIVFTYRPCGCC-SYVGRRGNGPQAIS 80 (200)
T ss_pred cCcCC--CCEEEEEECCCCCHHHHHHHHHHHHHHHhCCceEEEECCC---CCCEEEEEECCCCee-EcCCCcCCCceeee
Confidence 36999 8999999976544456789999999999999999999764 234666766555664 333321 11122
Q ss_pred cccccchHHHHHHhhhhhcCCCCc
Q 042321 143 LQLLLESQSVCMELIFKELGIFIN 166 (169)
Q Consensus 143 ~~~~~~~~~va~HEiGHaLGL~h~ 166 (169)
++.+|....|++|||||||||.|.
T Consensus 81 l~~~C~~~Gti~HEl~HaLGf~HE 104 (200)
T cd04281 81 IGKNCDKFGIVVHELGHVIGFWHE 104 (200)
T ss_pred cCCCcCcCchHHHHHHHHhcCcch
Confidence 445565678999999999999984
No 12
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.37 E-value=1.7e-12 Score=98.51 Aligned_cols=94 Identities=17% Similarity=0.140 Sum_probs=64.1
Q ss_pred ceEEEeccC--------CchhhHHHHHHHHHhhccCCCcceEeccCC-CcccEEEEeecCCCC---CCCCCCCC------
Q 042321 75 SLTYAFLPG--------TRGDAINPVARAFQTWAPNTQFQFAESQDY-RNADIKISFESGDHG---AGALSTAV------ 136 (169)
Q Consensus 75 ~LTy~~~~~--------~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~-~~ADI~I~F~~~~Hg---d~~pfd~~------ 136 (169)
.|+|.|... .+..++++|+.|++.|+++++++|+++... ..+||.|.+....++ .++++-+.
T Consensus 2 ~i~~~i~~d~~~~~~~~~~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~~~~~~g~a~~~~~c~~~~ 81 (167)
T cd00203 2 VIPYVVVADDRDVEEENLSAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDFDGGTGGWAYLGRVCDSLR 81 (167)
T ss_pred EEEEEEEecccccchhhhHHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccCCCCceEEEecCCccCCCC
Confidence 355666544 223688999999999999999999999864 379999999875432 12233222
Q ss_pred CCceeec-c-cccchHHHHHHhhhhhcCCCCccC
Q 042321 137 VESWLML-Q-LLLESQSVCMELIFKELGIFINYN 168 (169)
Q Consensus 137 ~e~w~~~-~-~~~~~~~va~HEiGHaLGL~h~~~ 168 (169)
..-|... . ....+..|++|||||+|||.|+.+
T Consensus 82 ~~~~~~~~~~~~~~~~~~~~HElGH~LGl~H~~~ 115 (167)
T cd00203 82 GVGVLQDNQSGTKEGAQTIAHELGHALGFYHDHD 115 (167)
T ss_pred CcEEEecCCcccccchhhHHHHHHHHhCCCccCc
Confidence 1122111 1 112478899999999999999753
No 13
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=99.36 E-value=2.6e-12 Score=103.52 Aligned_cols=94 Identities=15% Similarity=0.236 Sum_probs=68.1
Q ss_pred CCCCCCCCcceEEEeccCCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCCCceee-cc
Q 042321 66 SPRWPASKFSLTYAFLPGTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVVESWLM-LQ 144 (169)
Q Consensus 66 ~~kW~~~k~~LTy~~~~~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~e~w~~-~~ 144 (169)
..+|+ + .+-|.|...-+...++.|++|++.|++.|+++|++.... .. .|.|..+. ||.+...-.+..-.+ ++
T Consensus 43 ~~~Wp--~-~VPY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~rt~e-~~--yi~i~~~~-GC~S~vG~~gg~q~isl~ 115 (230)
T cd04282 43 TYRWP--F-PIPYILDDSLDLNAKGVILKAFEMYRLKSCVDFKPYEGE-SN--YIFFFKGS-GCWSMVGDQQGGQNLSIG 115 (230)
T ss_pred ccCCC--c-ceeEEECCCCCHHHHHHHHHHHHHHHhCCCeeEEECCCC-Cc--EEEEEcCC-CeeeccCccCCeEEEEEC
Confidence 36999 7 899999775455678999999999999999999987642 22 56777653 664433221112122 44
Q ss_pred cccchHHHHHHhhhhhcCCCCc
Q 042321 145 LLLESQSVCMELIFKELGIFIN 166 (169)
Q Consensus 145 ~~~~~~~va~HEiGHaLGL~h~ 166 (169)
.+|....+++|||||||||.|.
T Consensus 116 ~~C~~~Gti~HEl~HalGf~HE 137 (230)
T cd04282 116 AGCDYKATVEHEFLHALGFYHE 137 (230)
T ss_pred CCcCCCchHHHHHHHHhCCccc
Confidence 5566678999999999999985
No 14
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.34 E-value=1.8e-12 Score=102.36 Aligned_cols=96 Identities=14% Similarity=0.095 Sum_probs=64.0
Q ss_pred CcceEEEeccCCchhhHHHHHHHHHhhccC------CCcceEeccCC--CcccEEEEeecCCCCC-C----CC--CCC-C
Q 042321 73 KFSLTYAFLPGTRGDAINPVARAFQTWAPN------TQFQFAESQDY--RNADIKISFESGDHGA-G----AL--STA-V 136 (169)
Q Consensus 73 k~~LTy~~~~~~~~~~~~~i~~Af~~Ws~v------t~L~F~ev~~~--~~ADI~I~F~~~~Hgd-~----~p--fd~-~ 136 (169)
|+.|||.|.++++.+++++|++||+.|+++ .+..+.++... ..+||++.+-+.-|.+ + .| .|. .
T Consensus 7 k~pItyyI~~~~p~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~~~~~~~~gps~~dPrT 86 (197)
T cd04276 7 KEPIVYYLDNTFPEKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSPNGGWAYGPSVVDPRT 86 (197)
T ss_pred CCCEEEEecCCCcHHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEecCCCcceecccccCCCC
Confidence 889999999998889999999999999988 35555555432 1369996655544443 1 11 221 1
Q ss_pred CCceee---cc--------c-c-----cchHHHHHHhhhhhcCCCCccC
Q 042321 137 VESWLM---LQ--------L-L-----LESQSVCMELIFKELGIFINYN 168 (169)
Q Consensus 137 ~e~w~~---~~--------~-~-----~~~~~va~HEiGHaLGL~h~~~ 168 (169)
+|-... +. . . -.+..+++||+||+|||.||++
T Consensus 87 GeIl~a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lGl~hn~~ 135 (197)
T cd04276 87 GEILKADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLGLRHNFK 135 (197)
T ss_pred CCeEEEEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhcCccccc
Confidence 222110 00 0 0 1167899999999999999974
No 15
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=99.23 E-value=3.5e-11 Score=93.86 Aligned_cols=88 Identities=15% Similarity=0.160 Sum_probs=60.2
Q ss_pred cceEEEeccCCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCC-CCceee-cc-cccchH
Q 042321 74 FSLTYAFLPGTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAV-VESWLM-LQ-LLLESQ 150 (169)
Q Consensus 74 ~~LTy~~~~~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~-~e~w~~-~~-~~~~~~ 150 (169)
-.|-|.|...-+...++.|++|+..|++.|+++|++.... .. .|.|..+. ||+ +.-|. +....+ ++ .++...
T Consensus 4 ~~IpY~i~~~~~~~~~~~I~~A~~~~~~~TCirF~~~~~~-~~--yi~~~~~~-gC~-S~vG~~gg~q~i~l~~~~C~~~ 78 (182)
T cd04283 4 VYVPYVISPQYSENERAVIEKAMQEFETLTCVRFVPRTTE-RD--YLNIESRS-GCW-SYIGRQGGRQTVSLQKQGCMYK 78 (182)
T ss_pred EEEEEEECCCCCHHHHHHHHHHHHHHHhCCceeeEECCCC-Cc--EEEEEcCC-Cce-EecCccCCceeEecCCCCcCcc
Confidence 3578888765444678999999999999999999987542 33 35555543 553 33332 122212 32 345567
Q ss_pred HHHHHhhhhhcCCCCc
Q 042321 151 SVCMELIFKELGIFIN 166 (169)
Q Consensus 151 ~va~HEiGHaLGL~h~ 166 (169)
.+++|||||||||.|.
T Consensus 79 G~i~HEl~HaLG~~HE 94 (182)
T cd04283 79 GIIQHELLHALGFYHE 94 (182)
T ss_pred chHHHHHHHHhCCccc
Confidence 8999999999999984
No 16
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=99.17 E-value=5.9e-11 Score=103.10 Aligned_cols=96 Identities=18% Similarity=0.226 Sum_probs=72.0
Q ss_pred CCCCCCCCcceEEEeccCCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCCCc--eeec
Q 042321 66 SPRWPASKFSLTYAFLPGTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVVES--WLML 143 (169)
Q Consensus 66 ~~kW~~~k~~LTy~~~~~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~e~--w~~~ 143 (169)
..+|+ ...|-|.|.+.-....+.+|++|++.|++.|+++|.|.... ..+..|-+... |+.+...-.+.. -..+
T Consensus 79 ~~~Wp--~~~ipY~i~~~~~~~~r~~i~~A~~~~~~~TCi~F~~~~~~-~~~~~~~~~~~--gC~S~VGr~gg~~q~~sl 153 (411)
T KOG3714|consen 79 ERRWP--NGVIPYYIDGSFTSSQRALIRQAMREIENHTCIRFVERTTP-DKDYLIVFTGG--GCYSYVGRRGGGQQLLSL 153 (411)
T ss_pred hhcCC--CCeeeeEECCCCCHHHHHHHHHHHHHHhhCcceEEEeCCCC-CcceEEEeCCC--cceeeeCccCCCccceec
Confidence 46999 88999999987555789999999999999999999998753 44555555444 775444322222 1225
Q ss_pred ccccchHHHHHHhhhhhcCCCCc
Q 042321 144 QLLLESQSVCMELIFKELGIFIN 166 (169)
Q Consensus 144 ~~~~~~~~va~HEiGHaLGL~h~ 166 (169)
+.++.-+.|++||++||||+.|.
T Consensus 154 ~~~C~~~G~i~HEl~HaLGf~He 176 (411)
T KOG3714|consen 154 GDGCDRFGTIVHELMHALGFWHE 176 (411)
T ss_pred CCCcCcCchhHHHHHHHhhhhhc
Confidence 66677788999999999999984
No 17
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=99.12 E-value=1.1e-10 Score=85.23 Aligned_cols=87 Identities=18% Similarity=0.235 Sum_probs=49.3
Q ss_pred ceEEEeccCCchhhHHHHHHHHHhh-ccCCCcceEeccCCCcccEEEEeecCCCCCCCC-CCCCCCceee-cc----ccc
Q 042321 75 SLTYAFLPGTRGDAINPVARAFQTW-APNTQFQFAESQDYRNADIKISFESGDHGAGAL-STAVVESWLM-LQ----LLL 147 (169)
Q Consensus 75 ~LTy~~~~~~~~~~~~~i~~Af~~W-s~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~p-fd~~~e~w~~-~~----~~~ 147 (169)
+|+|.-...+ +.+.+|.+|.+.| +.|++++|.+.. .|+|+|.-....-| .+. -++-+ ...+ ++ -..
T Consensus 3 tv~Ydas~A~--~f~~~i~~aa~iWN~sV~NV~L~~~s---~a~~~~~~~~~~~~-sya~~~g~G-~G~I~l~~~~~qgy 75 (132)
T PF02031_consen 3 TVYYDASRAP--EFRSAIAQAAQIWNSSVSNVRLVEGS---SADIRYYEGNDPRG-SYASTDGLG-SGYIFLDYQQNQGY 75 (132)
T ss_dssp EEEEEEEE-G--GGHHHHHHHHHHHHHH-SSEEEEE-S---S-SEEEEEE--TT---EEEE-SSS--EEEEEEHHHHHHS
T ss_pred EEEEeCCCCc--hhHHHHHHHHHHHhcccCceEEeecC---CCcEEEEEecCCCC-cccccCCCC-cEEEEechHHhhCC
Confidence 5666554432 6889999999999 889999999975 38888774331111 111 12211 1111 11 233
Q ss_pred chHHHHHHhhhhhcCCCCccC
Q 042321 148 ESQSVCMELIFKELGIFINYN 168 (169)
Q Consensus 148 ~~~~va~HEiGHaLGL~h~~~ 168 (169)
+-..|++|||||.|||.+.||
T Consensus 76 ~~~RIaaHE~GHiLGLPD~y~ 96 (132)
T PF02031_consen 76 NSTRIAAHELGHILGLPDHYP 96 (132)
T ss_dssp -HHHHHHHHHHHHHT----TT
T ss_pred ccceeeeehhccccCCCCCCC
Confidence 467799999999999999997
No 18
>PF01471 PG_binding_1: Putative peptidoglycan binding domain; InterPro: IPR002477 This entry represents peptidoglycan binding domain (PGBD), as well as related domains that share the same structure. PGBD may have a general peptidoglycan binding function, has a core structure consisting of a closed, three-helical bundle with a left-handed twist. It is found at the N or C terminus of a variety of enzymes involved in bacterial cell wall degradation [, , ]. Examples are: Muramoyl-pentapeptide carboxypeptidase (3.4.17.8 from EC) N-acetylmuramoyl-L-alanine amidase cwlA precursor (cell wall hydrolase, autolysin, 3.5.1.28 from EC) Autolytic lysozyme (1,4-beta-N-acetylmuramidase, autolysin, 3.2.1.17 from EC) Membrane-bound lytic murein transglycosylase B Zinc-containing D-alanyl-D-alanine-cleaving carboxypeptidase, VanX []. Many of the proteins having this domain are as yet uncharacterised. However, some are known to belong to MEROPS peptidase family M15 (clan MD), subfamily M15A metallopeptidases. A number of the proteins belonging to subfamily M15A are non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. Eukaryotic enzymes can contain structurally similar PGBD-like domains. Matrix metalloproteinases (MMP), which catalyse extracellular matrix degradation, have N-terminal domains that resemble PGBD. Examples are gelatinase A (MMP-2), which degrades type IV collagen [], stromelysin-1 (MMP-3), which plays a role in arthritis and tumour invasion [, ], and gelatinase B (MMP-9) secreted by neutrophils as part of the innate immune defence mechanism []. Several MMPs are implicated in cancer progression, since degradation of the extracellular matrix is an essential step in the cascade of metastasis [].; GO: 0008152 metabolic process; PDB: 1L6J_A 3BKH_A 3BKV_A 1GXD_A 1EAK_D 1CK7_A 1SLM_A 1LBU_A 1SU3_B.
Probab=99.09 E-value=9.2e-11 Score=74.39 Aligned_cols=46 Identities=28% Similarity=0.358 Sum_probs=37.5
Q ss_pred CCCccCCCCCCCCCCCCCCCcchhHHHHHHHHHHHHhcCCCccccCChHHhhhh
Q 042321 1 MGKFGYLNYDNSNNQTHANDDDFDELLESAIKNYQLNYHLKSTEVLDAGTVGKM 54 (169)
Q Consensus 1 L~~fGYl~~~~~~~~~~~~~~~~~~~~~~ai~~~Q~~~gL~~TG~lD~~T~~~m 54 (169)
|.++||++... +..+++.+.+||+.||+++|||+||++|.+|+++|
T Consensus 12 L~~~gy~~~~~--------~g~~~~~t~~Av~~fQ~~~gL~~tG~~d~~T~~~L 57 (57)
T PF01471_consen 12 LNRLGYYPGPV--------DGIFDPETREAVKAFQKANGLPVTGVVDPETWEAL 57 (57)
T ss_dssp HHHTTTT-SST--------TSBSHHHHHHHHHHHHHHTTS-SSSSBCHHHHHHH
T ss_pred HHHcCCCCCCC--------CCCcCHHHHHHHHHHHHHcCcCCCCccCHHHHhcC
Confidence 56789993321 24568999999999999999999999999999987
No 19
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=98.69 E-value=1.1e-07 Score=75.58 Aligned_cols=94 Identities=17% Similarity=0.187 Sum_probs=58.2
Q ss_pred CcceEEEecc----CCchhhHHHHHHHHHhhccC-CCcceEec--cCCCcccEEEEeecCCC----C--CCCC-CCCCCC
Q 042321 73 KFSLTYAFLP----GTRGDAINPVARAFQTWAPN-TQFQFAES--QDYRNADIKISFESGDH----G--AGAL-STAVVE 138 (169)
Q Consensus 73 k~~LTy~~~~----~~~~~~~~~i~~Af~~Ws~v-t~L~F~ev--~~~~~ADI~I~F~~~~H----g--d~~p-fd~~~e 138 (169)
.++|+-.+.. ......+.++.+|++.|..+ ..|+|+-. .....+||.|.-....- | .++| ..|.+-
T Consensus 38 ~~~I~i~~~~~~~~~l~s~~~~al~~AI~~yNalgl~i~F~~t~g~~~~~~di~v~~~~~~~~~G~ggsAGFP~s~G~P~ 117 (211)
T PF12388_consen 38 PRTITIIGYTGGSQSLNSAWRTALDEAINNYNALGLSITFTLTFGTNYQNADIIVYSNSSNNPSGAGGSAGFPTSNGNPY 117 (211)
T ss_pred CCEEEEEeCCCccccccHHHHHHHHHHHHHHHhhCCceEEEEecCCCcCCCceEEEeccCCCCCCcceeccCCCCCCCCC
Confidence 4667766655 12236789999999999886 35677422 22235678764222111 1 2456 666665
Q ss_pred ceeec----ccccc-hHHHHHHhhhhhcCCCCc
Q 042321 139 SWLML----QLLLE-SQSVCMELIFKELGIFIN 166 (169)
Q Consensus 139 ~w~~~----~~~~~-~~~va~HEiGHaLGL~h~ 166 (169)
.|-.+ ....+ ..+|++|||||++||.|+
T Consensus 118 ~~I~I~~~~~~~~~~~~hvi~HEiGH~IGfRHT 150 (211)
T PF12388_consen 118 KFIQIYGLSNYSVNVIEHVITHEIGHCIGFRHT 150 (211)
T ss_pred ceEEEEecCCCchhHHHHHHHHHhhhhcccccc
Confidence 55322 11222 678999999999999995
No 20
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=97.91 E-value=1.1e-05 Score=63.75 Aligned_cols=76 Identities=18% Similarity=0.183 Sum_probs=48.0
Q ss_pred hHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCC-CC------------CCC------Cce-ee----c
Q 042321 88 AINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGAL-ST------------AVV------ESW-LM----L 143 (169)
Q Consensus 88 ~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~p-fd------------~~~------e~w-~~----~ 143 (169)
..+++..|.+.|++.-||..+|- ..+|||+|.-.... |-|.. +. +.. .+- .+ .
T Consensus 105 wq~a~~tava~wa~~fpl~ive~--~eeaDItie~~n~p-gtg~~~~grArtal~~~efl~~~~~~~~lnhr~~i~~~pg 181 (236)
T COG5549 105 WQGAYLTAVAGWAKTFPLIIVER--FEEADITIEVGNPP-GTGWRQYGRARTALIAYEFLGHALGLGHLNHRGDIMYPPG 181 (236)
T ss_pred HHHHHHHHHHHHHHhCCceeeec--ceeeeEEEEecCCC-CCcccccchHHHHHHHHHHhhhccCcccccccccccCCcc
Confidence 46889999999999855555554 45899999866542 22211 11 000 000 00 0
Q ss_pred ccccchHHHHHHhhhhhcCCC-Cc
Q 042321 144 QLLLESQSVCMELIFKELGIF-IN 166 (169)
Q Consensus 144 ~~~~~~~~va~HEiGHaLGL~-h~ 166 (169)
.....+.++|-||+|||||+. |+
T Consensus 182 ~~~e~L~~tarhElGhaLgi~ghs 205 (236)
T COG5549 182 ELRENLNPTARHELGHALGIWGHS 205 (236)
T ss_pred cchhhhhHHHHHhhcchheecccc
Confidence 122238999999999999998 65
No 21
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=97.78 E-value=1.3e-05 Score=63.58 Aligned_cols=20 Identities=15% Similarity=0.215 Sum_probs=17.3
Q ss_pred hHHHHHHhhhhhcCCCCccC
Q 042321 149 SQSVCMELIFKELGIFINYN 168 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~~~ 168 (169)
...+++|||||+|||.|+..
T Consensus 137 ~~~~~aHEiGH~lGl~H~~~ 156 (206)
T PF13583_consen 137 GYQTFAHEIGHNLGLRHDFD 156 (206)
T ss_pred cchHHHHHHHHHhcCCCCcc
Confidence 45789999999999999864
No 22
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=97.65 E-value=1.8e-05 Score=57.15 Aligned_cols=18 Identities=17% Similarity=0.115 Sum_probs=12.6
Q ss_pred hHHHHHHhhhhhcCCCCc
Q 042321 149 SQSVCMELIFKELGIFIN 166 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~ 166 (169)
...|++|||||.|||.|+
T Consensus 107 ~~~~~~HEiGH~lGl~Hd 124 (124)
T PF13582_consen 107 GVDTFAHEIGHNLGLNHD 124 (124)
T ss_dssp STTHHHHHHHHHTT----
T ss_pred cceEeeehhhHhcCCCCC
Confidence 357999999999999995
No 23
>TIGR02869 spore_SleB spore cortex-lytic enzyme. Members of this protein family are the spore cortex-lytic enzyme SleB from Bacillus subtilis and other Gram-positive, endospore-forming bacterial species. This protein is stored in an inactive form in the spore and activated during germination.
Probab=97.39 E-value=0.00018 Score=57.06 Aligned_cols=36 Identities=17% Similarity=0.160 Sum_probs=33.4
Q ss_pred chhHHHHHHHHHHHHhcCCCccccCChHHhhhhhcC
Q 042321 22 DFDELLESAIKNYQLNYHLKSTEVLDAGTVGKMRTH 57 (169)
Q Consensus 22 ~~~~~~~~ai~~~Q~~~gL~~TG~lD~~T~~~m~~p 57 (169)
.|+..+.+||+.||+.+||+++|.+|..|...|..|
T Consensus 36 ~fg~~T~~AV~~FQ~~~GL~~dG~vg~~T~~~L~~~ 71 (201)
T TIGR02869 36 VFGWLTYWAVRKFQSKNGLTVDGIVGPKTKAALGIA 71 (201)
T ss_pred ccCHHHHHHHHHHHHHhCCCCCCCcCHHHHHHhCCc
Confidence 467999999999999999999999999999999666
No 24
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=97.30 E-value=0.00011 Score=57.14 Aligned_cols=20 Identities=15% Similarity=-0.014 Sum_probs=13.4
Q ss_pred hHHHHHHhhhhhcCCCCccC
Q 042321 149 SQSVCMELIFKELGIFINYN 168 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~~~ 168 (169)
...|++|||||.||+.|+.+
T Consensus 142 ~~~~~AHEiGH~lGa~HD~~ 161 (196)
T PF13688_consen 142 GAITFAHEIGHNLGAPHDGD 161 (196)
T ss_dssp HHHHHHHHHHHHTT-----S
T ss_pred eehhhHHhHHHhcCCCCCCC
Confidence 67799999999999999753
No 25
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=97.29 E-value=0.00012 Score=56.51 Aligned_cols=20 Identities=10% Similarity=0.140 Sum_probs=14.9
Q ss_pred hHHHHHHhhhhhcCCCCccC
Q 042321 149 SQSVCMELIFKELGIFINYN 168 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~~~ 168 (169)
...|++|||||.||+.|+.+
T Consensus 111 ~~~~~aHElGH~lGa~Hd~~ 130 (173)
T PF13574_consen 111 GIDTFAHELGHQLGAPHDFD 130 (173)
T ss_dssp HHHHHHHHHHHHHT---SSS
T ss_pred eeeeehhhhHhhcCCCCCCC
Confidence 56789999999999999875
No 26
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=97.24 E-value=0.00013 Score=55.40 Aligned_cols=19 Identities=11% Similarity=0.141 Sum_probs=14.3
Q ss_pred hHHHHHHhhhhhcCCCCcc
Q 042321 149 SQSVCMELIFKELGIFINY 167 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~~ 167 (169)
...|++||+||-|||.|..
T Consensus 69 ~g~TltHEvGH~LGL~HtF 87 (154)
T PF05572_consen 69 FGKTLTHEVGHWLGLYHTF 87 (154)
T ss_dssp SSHHHHHHHHHHTT---TT
T ss_pred cccchhhhhhhhhcccccc
Confidence 5679999999999999975
No 27
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=97.07 E-value=0.00027 Score=55.01 Aligned_cols=19 Identities=11% Similarity=-0.041 Sum_probs=16.9
Q ss_pred hHHHHHHhhhhhcCCCCcc
Q 042321 149 SQSVCMELIFKELGIFINY 167 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~~ 167 (169)
.-.|++|||||.||+.|+.
T Consensus 131 ~a~~~AHElGH~lG~~HD~ 149 (194)
T cd04269 131 FAVTMAHELGHNLGMEHDD 149 (194)
T ss_pred HHHHHHHHHHhhcCCCcCC
Confidence 5679999999999999975
No 28
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=97.03 E-value=0.00026 Score=54.98 Aligned_cols=21 Identities=14% Similarity=-0.039 Sum_probs=17.9
Q ss_pred cchHHHHHHhhhhhcCCCCcc
Q 042321 147 LESQSVCMELIFKELGIFINY 167 (169)
Q Consensus 147 ~~~~~va~HEiGHaLGL~h~~ 167 (169)
.....|++||+||.|||.|+.
T Consensus 131 ~~~~~~~aHElGH~lG~~HD~ 151 (192)
T cd04267 131 LLTALTMAHELGHNLGAEHDG 151 (192)
T ss_pred eeehhhhhhhHHhhcCCcCCC
Confidence 346679999999999999975
No 29
>PF11350 DUF3152: Protein of unknown function (DUF3152); InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=96.97 E-value=0.0036 Score=49.54 Aligned_cols=93 Identities=16% Similarity=0.137 Sum_probs=57.1
Q ss_pred CcceEEEecc--CC---c----hhhHHHHHHHH---HhhccCCCcceEeccCCCcccEEEEeecCCCCCC-C-CCCCC--
Q 042321 73 KFSLTYAFLP--GT---R----GDAINPVARAF---QTWAPNTQFQFAESQDYRNADIKISFESGDHGAG-A-LSTAV-- 136 (169)
Q Consensus 73 k~~LTy~~~~--~~---~----~~~~~~i~~Af---~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~-~-pfd~~-- 136 (169)
.+.++|.+.- .. . +++-.+|.+.+ +-|...-.+.|+.|... .+|++|...+..--+. + +.+-.
T Consensus 28 ~~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~~-~~Df~I~Lasp~T~~~lC~g~~~~~e 106 (203)
T PF11350_consen 28 GRLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDSG-APDFRISLASPGTTDRLCAGLDTSGE 106 (203)
T ss_pred CceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCCC-CCCEEEEECCcchhhhhccCcCcCce
Confidence 3678998763 11 1 23344444444 47999889999999986 5999999988753221 1 22211
Q ss_pred --------------CCceeec--c--cccc--hHHHHHHhhhhhcCCCCc
Q 042321 137 --------------VESWLML--Q--LLLE--SQSVCMELIFKELGIFIN 166 (169)
Q Consensus 137 --------------~e~w~~~--~--~~~~--~~~va~HEiGHaLGL~h~ 166 (169)
...|... . ..+. -..|+=||.||+||-.|.
T Consensus 107 ~SC~~~~g~~VvIN~~RWv~Ga~~f~gdl~~YRqYvINHEVGH~LGh~H~ 156 (203)
T PF11350_consen 107 TSCRNPAGGRVVINLARWVRGAPAFGGDLASYRQYVINHEVGHALGHGHE 156 (203)
T ss_pred eEeecCCCCeEEEehHHhhccCcccCCcHHHHHHHhhhhhhhhhcccCCC
Confidence 1233220 0 1111 355999999999998874
No 30
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=96.87 E-value=0.00054 Score=53.40 Aligned_cols=20 Identities=15% Similarity=0.008 Sum_probs=16.5
Q ss_pred hHHHHHHhhhhhcCCCCccC
Q 042321 149 SQSVCMELIFKELGIFINYN 168 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~~~ 168 (169)
...+++||+||.|||.|--|
T Consensus 125 ~~k~~~HElGH~lGL~HC~~ 144 (179)
T PRK13267 125 VRKEVTHELGHTLGLEHCDN 144 (179)
T ss_pred HHHHHHHHHHHHcCCccCCC
Confidence 45579999999999999643
No 31
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=96.81 E-value=0.00013 Score=58.74 Aligned_cols=21 Identities=10% Similarity=0.191 Sum_probs=18.4
Q ss_pred cchHHHHHHhhhhhcCCCCcc
Q 042321 147 LESQSVCMELIFKELGIFINY 167 (169)
Q Consensus 147 ~~~~~va~HEiGHaLGL~h~~ 167 (169)
.++..|++|||||.|||.|..
T Consensus 135 ~n~g~t~~HEvGH~lGL~HtF 155 (225)
T cd04275 135 YNLGDTATHEVGHWLGLYHTF 155 (225)
T ss_pred ccccceeEEeccceeeeeeee
Confidence 357789999999999999975
No 32
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=96.78 E-value=0.00063 Score=53.67 Aligned_cols=18 Identities=6% Similarity=-0.130 Sum_probs=12.8
Q ss_pred hHHHHHHhhhhhcCCCCc
Q 042321 149 SQSVCMELIFKELGIFIN 166 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~ 166 (169)
...+|+||+||.+||.|=
T Consensus 145 ~~Kea~HElGH~~GL~HC 162 (194)
T PF07998_consen 145 VCKEAVHELGHLFGLDHC 162 (194)
T ss_dssp HHHHHHHHHHHHTT----
T ss_pred HHHHHHHHHHHHcCCcCC
Confidence 567999999999999983
No 33
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=96.71 E-value=0.0008 Score=54.73 Aligned_cols=19 Identities=11% Similarity=-0.001 Sum_probs=16.7
Q ss_pred HHHHHHhhhhhcCCCCccC
Q 042321 150 QSVCMELIFKELGIFINYN 168 (169)
Q Consensus 150 ~~va~HEiGHaLGL~h~~~ 168 (169)
..|++||+||.||+.|+.+
T Consensus 168 a~t~AHElGHnlGm~HD~~ 186 (244)
T cd04270 168 DLVTAHELGHNFGSPHDPD 186 (244)
T ss_pred HHHHHHHHHHhcCCCCCCC
Confidence 4689999999999999864
No 34
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=96.71 E-value=0.00072 Score=52.27 Aligned_cols=20 Identities=15% Similarity=-0.017 Sum_probs=16.3
Q ss_pred hHHHHHHhhhhhcCCCCccC
Q 042321 149 SQSVCMELIFKELGIFINYN 168 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~~~ 168 (169)
.+.=++||+||.|||.|=-|
T Consensus 124 v~KEv~HElGH~~GL~HC~N 143 (181)
T COG1913 124 VVKEVLHELGHLLGLSHCPN 143 (181)
T ss_pred HHHHHHHHhhhhcCcccCCC
Confidence 34467999999999999655
No 35
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=96.70 E-value=0.00033 Score=55.31 Aligned_cols=75 Identities=16% Similarity=0.104 Sum_probs=40.6
Q ss_pred HHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCC------C--CCCCCCC--CCcee--ecc-cccchHHHHHHhhh
Q 042321 92 VARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHG------A--GALSTAV--VESWL--MLQ-LLLESQSVCMELIF 158 (169)
Q Consensus 92 i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hg------d--~~pfd~~--~e~w~--~~~-~~~~~~~va~HEiG 158 (169)
.-.+|..|....... .. .+....|+.+-|...+.. + |.++-|. ...++ +.+ ..+....|++||||
T Consensus 72 ~L~~F~~w~~~~~~~-~~-~~~~~~D~a~llt~~d~~~~~~~~~~~G~A~~ggiC~~~~s~~i~~~~~~~~a~~~aHElG 149 (207)
T cd04273 72 SLKSFCRWQKKLNPP-ND-SDPEHHDHAILLTRQDICRSNGNCDTLGLAPVGGMCSPSRSCSINEDTGLSSAFTIAHELG 149 (207)
T ss_pred HHHHHHHHHHHcCCc-cc-ccccccceEEEEeeecccccCCCCCceEEeccccCCCCCcceEEEcCCCceeEEeeeeech
Confidence 468899996633221 10 011357777777665432 1 1222211 11222 111 23335668999999
Q ss_pred hhcCCCCccC
Q 042321 159 KELGIFINYN 168 (169)
Q Consensus 159 HaLGL~h~~~ 168 (169)
|.||+.|+..
T Consensus 150 H~LG~~HD~~ 159 (207)
T cd04273 150 HVLGMPHDGD 159 (207)
T ss_pred hhcCCCCCCC
Confidence 9999999853
No 36
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=96.65 E-value=0.00062 Score=54.89 Aligned_cols=17 Identities=18% Similarity=0.128 Sum_probs=15.5
Q ss_pred HHHHHhhhhhcCCCCcc
Q 042321 151 SVCMELIFKELGIFINY 167 (169)
Q Consensus 151 ~va~HEiGHaLGL~h~~ 167 (169)
.|++||+||.||+.|+.
T Consensus 147 ~t~AHElGHnLGm~HD~ 163 (228)
T cd04271 147 QVFAHEIGHTFGAVHDC 163 (228)
T ss_pred eehhhhhhhhcCCCCCC
Confidence 58999999999999974
No 37
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=96.62 E-value=0.00091 Score=52.28 Aligned_cols=20 Identities=10% Similarity=0.027 Sum_probs=14.2
Q ss_pred hHHHHHHhhhhhcCCCCccC
Q 042321 149 SQSVCMELIFKELGIFINYN 168 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~~~ 168 (169)
+-.+++|||||.||+.|+.+
T Consensus 131 ~a~~~AHelGH~lGm~HD~~ 150 (199)
T PF01421_consen 131 FAVIIAHELGHNLGMPHDGD 150 (199)
T ss_dssp HHHHHHHHHHHHTT---TTT
T ss_pred HHHHHHHHHHHhcCCCCCCC
Confidence 55688999999999999864
No 38
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=96.49 E-value=0.0014 Score=52.27 Aligned_cols=20 Identities=10% Similarity=-0.152 Sum_probs=17.4
Q ss_pred chHHHHHHhhhhhcCCCCcc
Q 042321 148 ESQSVCMELIFKELGIFINY 167 (169)
Q Consensus 148 ~~~~va~HEiGHaLGL~h~~ 167 (169)
....+++||+||.||+.|+.
T Consensus 144 ~~~~~~AHElGH~lG~~HD~ 163 (220)
T cd04272 144 YGVYTMTHELAHLLGAPHDG 163 (220)
T ss_pred ccHHHHHHHHHHHhCCCCCC
Confidence 35789999999999999973
No 39
>PRK10594 murein L,D-transpeptidase; Provisional
Probab=96.46 E-value=0.0029 Score=57.57 Aligned_cols=37 Identities=22% Similarity=0.247 Sum_probs=34.9
Q ss_pred cchhHHHHHHHHHHHHhcCCCccccCChHHhhhhhcC
Q 042321 21 DDFDELLESAIKNYQLNYHLKSTEVLDAGTVGKMRTH 57 (169)
Q Consensus 21 ~~~~~~~~~ai~~~Q~~~gL~~TG~lD~~T~~~m~~p 57 (169)
..++.++.+|+|+||+..||+++|+++..|++.|+.|
T Consensus 307 ~~YD~~Lv~AVK~FQ~rhGL~~DGvIG~~T~~~LNvs 343 (608)
T PRK10594 307 AAYDNELVEAVKRFQAWQGLGADGVIGPRTRDWLNVT 343 (608)
T ss_pred cccCHHHHHHHHHHHHHcCCCCCcCcCHHHHHHHcCC
Confidence 3578999999999999999999999999999999988
No 40
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=96.18 E-value=0.0013 Score=54.74 Aligned_cols=19 Identities=21% Similarity=0.209 Sum_probs=17.3
Q ss_pred HHHHHHhhhhhcCCCCccC
Q 042321 150 QSVCMELIFKELGIFINYN 168 (169)
Q Consensus 150 ~~va~HEiGHaLGL~h~~~ 168 (169)
..|++||+||.|||.+-|+
T Consensus 166 igv~~HE~gH~lGLPDlY~ 184 (286)
T TIGR03296 166 VGVIAHELGHDLGLPDLYD 184 (286)
T ss_pred eeeeehhhhcccCCCcccc
Confidence 5799999999999999886
No 41
>COG2989 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=95.61 E-value=0.012 Score=52.57 Aligned_cols=37 Identities=22% Similarity=0.311 Sum_probs=34.0
Q ss_pred cchhHHHHHHHHHHHHhcCCCccccCChHHhhhhhcC
Q 042321 21 DDFDELLESAIKNYQLNYHLKSTEVLDAGTVGKMRTH 57 (169)
Q Consensus 21 ~~~~~~~~~ai~~~Q~~~gL~~TG~lD~~T~~~m~~p 57 (169)
..|+.++.+|+|+||+..|||..|++++.|++.|+--
T Consensus 263 ~~yd~el~~avKrfQ~~~GL~~DGviG~~T~~aLn~s 299 (561)
T COG2989 263 PAYDPELVEAVKRFQARHGLPADGVIGPATRAALNVS 299 (561)
T ss_pred ccccHHHHHHHHHHHHHhCCCCCCcccHHHHHHhccC
Confidence 4688999999999999999999999999999988764
No 42
>PF10462 Peptidase_M66: Peptidase M66; InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=95.26 E-value=0.0083 Score=50.46 Aligned_cols=17 Identities=6% Similarity=-0.103 Sum_probs=14.0
Q ss_pred hHHHHHHhhhhhcCCCC
Q 042321 149 SQSVCMELIFKELGIFI 165 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h 165 (169)
...+.+||+||++||.|
T Consensus 193 ~~~~f~HE~GH~~GL~H 209 (305)
T PF10462_consen 193 YGNEFSHELGHNFGLGH 209 (305)
T ss_dssp SHHHHHHHHHHTTT--S
T ss_pred ccceeehhhhhhcCCCC
Confidence 67799999999999999
No 43
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=95.15 E-value=0.0094 Score=50.37 Aligned_cols=19 Identities=16% Similarity=0.165 Sum_probs=16.8
Q ss_pred hHHHHHHhhhhhcCCCCcc
Q 042321 149 SQSVCMELIFKELGIFINY 167 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~~ 167 (169)
-..|++||+||-|||.|++
T Consensus 150 ~~~~~~HElgHN~GL~Ha~ 168 (314)
T PF05548_consen 150 DWATIMHELGHNLGLWHAG 168 (314)
T ss_pred cHHHHHHHhhhhccccccC
Confidence 4569999999999999985
No 44
>PF05547 Peptidase_M6: Immune inhibitor A peptidase M6; InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=94.51 E-value=0.0067 Score=55.77 Aligned_cols=19 Identities=21% Similarity=0.176 Sum_probs=17.0
Q ss_pred HHHHHHhhhhhcCCCCccC
Q 042321 150 QSVCMELIFKELGIFINYN 168 (169)
Q Consensus 150 ~~va~HEiGHaLGL~h~~~ 168 (169)
..|++||+||.|||.+-|+
T Consensus 222 iGVfaHEfGH~LGLPDlYD 240 (645)
T PF05547_consen 222 IGVFAHEFGHDLGLPDLYD 240 (645)
T ss_pred eEEEEeeccccCCCCCccc
Confidence 5589999999999998885
No 45
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=94.36 E-value=0.048 Score=43.86 Aligned_cols=69 Identities=10% Similarity=0.068 Sum_probs=38.5
Q ss_pred HHHHHHhhccCCC---cceEecc-----CCCcccEEEEeecCCCCCCCCCCCCCCceee-----cccccchHHHHHHhhh
Q 042321 92 VARAFQTWAPNTQ---FQFAESQ-----DYRNADIKISFESGDHGAGALSTAVVESWLM-----LQLLLESQSVCMELIF 158 (169)
Q Consensus 92 i~~Af~~Ws~vt~---L~F~ev~-----~~~~ADI~I~F~~~~Hgd~~pfd~~~e~w~~-----~~~~~~~~~va~HEiG 158 (169)
++..|+.++++.+ ++=.|+. ...-.|++|.-..|.-+|-| |.....-.. .+..+--..||+||.|
T Consensus 21 vk~~~~kYs~v~~~~g~TGae~Ar~iL~~~gl~~V~Ve~~~G~LtDHY--dP~~k~vrLS~~vy~~~SiaAvaVAAHEvG 98 (222)
T PF04298_consen 21 VKSTFKKYSQVRSSSGMTGAEVARHILDRNGLSDVRVERVPGELTDHY--DPRNKVVRLSEDVYNGRSIAAVAVAAHEVG 98 (222)
T ss_pred HHHHHHhhCcCCCCCCCCHHHHHHHHHHHCCCCCeeEEEeCCCCCCCc--CCCCCEEEeCCccCCCCCHHHHHHHHHHHh
Confidence 4677777777642 2222221 11246889998877545533 322211111 2234446779999999
Q ss_pred hhcC
Q 042321 159 KELG 162 (169)
Q Consensus 159 HaLG 162 (169)
||+=
T Consensus 99 HAiQ 102 (222)
T PF04298_consen 99 HAIQ 102 (222)
T ss_pred HHHh
Confidence 9973
No 46
>PF12044 Metallopep: Putative peptidase family; InterPro: IPR021917 This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length.
Probab=93.21 E-value=0.049 Score=47.82 Aligned_cols=18 Identities=11% Similarity=-0.103 Sum_probs=15.5
Q ss_pred HHHHHHhhhhhcCCCCcc
Q 042321 150 QSVCMELIFKELGIFINY 167 (169)
Q Consensus 150 ~~va~HEiGHaLGL~h~~ 167 (169)
....+|||||++|+.|.-
T Consensus 317 iGA~lHEiGH~fg~pH~~ 334 (423)
T PF12044_consen 317 IGAFLHEIGHLFGCPHQE 334 (423)
T ss_pred hHHHHHHHHHhcCCCCCC
Confidence 447899999999999964
No 47
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=93.16 E-value=0.067 Score=40.58 Aligned_cols=35 Identities=11% Similarity=0.253 Sum_probs=31.4
Q ss_pred hhHHHHHHHHHHHHhcCC-CccccCChHHhhhhhcC
Q 042321 23 FDELLESAIKNYQLNYHL-KSTEVLDAGTVGKMRTH 57 (169)
Q Consensus 23 ~~~~~~~ai~~~Q~~~gL-~~TG~lD~~T~~~m~~p 57 (169)
+......|++.||+.++| +++|.++++|...+...
T Consensus 70 ~g~~t~~av~~fQ~~~gl~~~dG~~g~~t~~al~~~ 105 (185)
T COG3409 70 YGPETAAAVRAFQQKNGLSPVDGIVGPATRAALPSQ 105 (185)
T ss_pred cCcccHHHHHHHhhhcCcccccccccHHHHHHHHHH
Confidence 457789999999999999 89999999999988774
No 48
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=92.13 E-value=0.081 Score=43.70 Aligned_cols=20 Identities=15% Similarity=0.207 Sum_probs=12.8
Q ss_pred chHHHHHHhhhhhcC-CCCcc
Q 042321 148 ESQSVCMELIFKELG-IFINY 167 (169)
Q Consensus 148 ~~~~va~HEiGHaLG-L~h~~ 167 (169)
.+..|++||+||++| |.+-|
T Consensus 215 ~~~~v~vHE~GHsf~~LaDEY 235 (264)
T PF09471_consen 215 SFKQVVVHEFGHSFGGLADEY 235 (264)
T ss_dssp THHHHHHHHHHHHTT------
T ss_pred cccceeeeecccccccccccc
Confidence 478999999999765 55544
No 49
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=92.05 E-value=0.081 Score=49.41 Aligned_cols=20 Identities=15% Similarity=0.091 Sum_probs=17.4
Q ss_pred chHHHHHHhhhhhcCCCCcc
Q 042321 148 ESQSVCMELIFKELGIFINY 167 (169)
Q Consensus 148 ~~~~va~HEiGHaLGL~h~~ 167 (169)
....|++||+||.|||.|+-
T Consensus 322 ~~a~v~AhelgH~lGm~hD~ 341 (716)
T KOG3607|consen 322 AFAVVLAHELGHNLGMIHDE 341 (716)
T ss_pred hHHHHHHHHHHhhcCccccc
Confidence 36779999999999999963
No 50
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=92.05 E-value=0.062 Score=39.27 Aligned_cols=17 Identities=18% Similarity=0.108 Sum_probs=15.0
Q ss_pred hHHHHHHhhhhhcCCCC
Q 042321 149 SQSVCMELIFKELGIFI 165 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h 165 (169)
..+|++|||||-.||.+
T Consensus 109 vthvliHEIgHhFGLsD 125 (136)
T COG3824 109 VTHVLIHEIGHHFGLSD 125 (136)
T ss_pred hhhhhhhhhhhhcCCCh
Confidence 56799999999999985
No 51
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=91.74 E-value=0.094 Score=47.20 Aligned_cols=17 Identities=18% Similarity=0.077 Sum_probs=14.3
Q ss_pred hHHHHHHhhhhhcCCCC
Q 042321 149 SQSVCMELIFKELGIFI 165 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h 165 (169)
+..|++|||.||||+.-
T Consensus 210 ~~~~~~HEi~HaLGFs~ 226 (521)
T PF01457_consen 210 FFRTVIHEIAHALGFSS 226 (521)
T ss_dssp HHHHHHHHHHHHTT-SH
T ss_pred ccceeeeeeeeeeeecc
Confidence 77899999999999863
No 52
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=91.26 E-value=0.11 Score=47.17 Aligned_cols=16 Identities=6% Similarity=-0.186 Sum_probs=13.1
Q ss_pred ccchHHHHHHhhhhhc
Q 042321 146 LLESQSVCMELIFKEL 161 (169)
Q Consensus 146 ~~~~~~va~HEiGHaL 161 (169)
..+-..|++||+|||+
T Consensus 372 ~~~~v~TL~HE~GHa~ 387 (587)
T TIGR02290 372 SRRDVSTLAHELGHAY 387 (587)
T ss_pred CchhHHHHHHHhhHHH
Confidence 3456779999999997
No 53
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=90.44 E-value=0.17 Score=35.64 Aligned_cols=17 Identities=18% Similarity=0.131 Sum_probs=13.4
Q ss_pred hHHHHHHhhhhhcCCCC
Q 042321 149 SQSVCMELIFKELGIFI 165 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h 165 (169)
...|++|||||-+|+..
T Consensus 73 I~~tlvhEiah~fG~~~ 89 (97)
T PF06262_consen 73 IRDTLVHEIAHHFGISD 89 (97)
T ss_dssp HHHHHHHHHHHHTT--H
T ss_pred HHHHHHHHHHHHcCCCH
Confidence 67799999999999863
No 54
>PTZ00337 surface protease GP63; Provisional
Probab=90.00 E-value=0.16 Score=46.29 Aligned_cols=17 Identities=18% Similarity=0.141 Sum_probs=15.3
Q ss_pred hHHHHHHhhhhhcCCCC
Q 042321 149 SQSVCMELIFKELGIFI 165 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h 165 (169)
++.|++|||.||||+..
T Consensus 229 ~~~v~~HEi~HALGFs~ 245 (567)
T PTZ00337 229 DVRVAAHELGHALGFVR 245 (567)
T ss_pred HHHHHHHHHHHHHccCH
Confidence 78899999999999853
No 55
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=88.68 E-value=0.16 Score=46.78 Aligned_cols=21 Identities=19% Similarity=0.047 Sum_probs=18.1
Q ss_pred hHHHHHHhhhhhcCCCCccCC
Q 042321 149 SQSVCMELIFKELGIFINYNI 169 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~~~~ 169 (169)
..-|++|||||.-|-.|++.|
T Consensus 392 s~lt~AHEiGHNfGSpHDp~~ 412 (764)
T KOG3658|consen 392 SDLTLAHEIGHNFGSPHDPDI 412 (764)
T ss_pred hheeehhhhccccCCCCCCCC
Confidence 455889999999999999864
No 56
>KOG4525 consensus Jacalin-like lectin domain-containing protein [General function prediction only]
Probab=88.40 E-value=0.24 Score=43.74 Aligned_cols=18 Identities=11% Similarity=-0.125 Sum_probs=15.2
Q ss_pred HHHHHHhhhhhcCCCCcc
Q 042321 150 QSVCMELIFKELGIFINY 167 (169)
Q Consensus 150 ~~va~HEiGHaLGL~h~~ 167 (169)
...++||+||.||+.|..
T Consensus 302 lGA~~HElGH~lgcpHq~ 319 (614)
T KOG4525|consen 302 LGAVCHELGHCLGCPHQS 319 (614)
T ss_pred HHHHHHHhhhccCCCCCC
Confidence 446799999999999964
No 57
>PF09374 PG_binding_3: Predicted Peptidoglycan domain; InterPro: IPR018537 This family contains a potential peptidoglycan binding domain. ; PDB: 2IKB_B 2IS5_A 2NR7_A.
Probab=87.95 E-value=0.53 Score=31.13 Aligned_cols=29 Identities=17% Similarity=0.200 Sum_probs=22.4
Q ss_pred HHHHHHHHhcC----CCccccCChHHhhhhhcC
Q 042321 29 SAIKNYQLNYH----LKSTEVLDAGTVGKMRTH 57 (169)
Q Consensus 29 ~ai~~~Q~~~g----L~~TG~lD~~T~~~m~~p 57 (169)
.|+|.+|+-.| +.+.|++.+.|+..+...
T Consensus 1 rAik~LQ~alg~~~~v~~DG~iGp~Tl~Al~~~ 33 (72)
T PF09374_consen 1 RAIKFLQRALGLGADVAVDGIIGPKTLAALNAA 33 (72)
T ss_dssp HHHHHHHHHHT----S--SS--SHHHHHHHHHS
T ss_pred CHHHHHHHHHccCCccCCCCCcCHHHHHHHhhC
Confidence 48999999999 999999999999998666
No 58
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=87.60 E-value=0.29 Score=39.40 Aligned_cols=13 Identities=8% Similarity=-0.184 Sum_probs=11.9
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
...|+.||+||||
T Consensus 92 ~~~~l~HE~GHAl 104 (220)
T PF14247_consen 92 VLFTLYHELGHAL 104 (220)
T ss_pred HHHHHHHHHHHHH
Confidence 7889999999996
No 59
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=87.31 E-value=0.3 Score=43.44 Aligned_cols=15 Identities=20% Similarity=-0.082 Sum_probs=13.4
Q ss_pred chHHHHHHhhhhhcC
Q 042321 148 ESQSVCMELIFKELG 162 (169)
Q Consensus 148 ~~~~va~HEiGHaLG 162 (169)
++-.|++|||||..|
T Consensus 129 ElagViAHEigHv~q 143 (484)
T COG4783 129 ELAGVIAHEIGHVAQ 143 (484)
T ss_pred HHHHHHHHHHHHHhh
Confidence 489999999999876
No 60
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=86.81 E-value=0.37 Score=44.31 Aligned_cols=16 Identities=13% Similarity=0.179 Sum_probs=14.7
Q ss_pred hHHHHHHhhhhhcCCC
Q 042321 149 SQSVCMELIFKELGIF 164 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~ 164 (169)
...|++|||.||||..
T Consensus 256 ~~rv~~HEi~HALGFS 271 (622)
T PTZ00257 256 TTRTVTHEVAHALGFS 271 (622)
T ss_pred HHHHHHHHHHHHhcCC
Confidence 6889999999999985
No 61
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=86.03 E-value=0.69 Score=36.70 Aligned_cols=78 Identities=9% Similarity=0.000 Sum_probs=44.7
Q ss_pred HHHHHHHHHhhccCC---CcceEeccC-----CCcccEEEEeecCCCCCCCCCCCCCCce--e---ecccccchHHHHHH
Q 042321 89 INPVARAFQTWAPNT---QFQFAESQD-----YRNADIKISFESGDHGAGALSTAVVESW--L---MLQLLLESQSVCME 155 (169)
Q Consensus 89 ~~~i~~Af~~Ws~vt---~L~F~ev~~-----~~~ADI~I~F~~~~Hgd~~pfd~~~e~w--~---~~~~~~~~~~va~H 155 (169)
+.-++.+++.||+|. +++=.||.. ..--|+++.-.+|.=-|- +|.....- + ..+..+.-..||+|
T Consensus 21 Q~kvks~y~KySkV~~s~g~TGaevAr~iLd~nGl~dV~Ve~v~G~LTDH--YDP~~kvvrLSe~~y~g~Sia~~aVAAH 98 (226)
T COG2738 21 QSKVKSTYKKYSKVRASSGLTGAEVARMILDENGLYDVPVEEVPGTLTDH--YDPRRKVVRLSEANYYGPSIAAIAVAAH 98 (226)
T ss_pred HHHHHHHHHHHhhccCcCCCcHHHHHHHHHhhcCCccceeeeecCCcccc--cChhhheeeccccccCCccHHHHHHHHH
Confidence 455788999999985 343333321 113477777666543321 22211111 1 12344457789999
Q ss_pred hhhhhcCCCCccC
Q 042321 156 LIFKELGIFINYN 168 (169)
Q Consensus 156 EiGHaLGL~h~~~ 168 (169)
|.|||+==.++|.
T Consensus 99 EVGHAiQd~~~Y~ 111 (226)
T COG2738 99 EVGHAIQDQEDYA 111 (226)
T ss_pred HhhHHHhhhcccH
Confidence 9999997766663
No 62
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=85.35 E-value=0.46 Score=32.86 Aligned_cols=13 Identities=8% Similarity=-0.156 Sum_probs=11.1
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
-..+++|||||.+
T Consensus 42 ~~f~laHELgH~~ 54 (122)
T PF06114_consen 42 QRFTLAHELGHIL 54 (122)
T ss_dssp HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHH
Confidence 5669999999975
No 63
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=84.55 E-value=0.43 Score=38.21 Aligned_cols=13 Identities=8% Similarity=-0.090 Sum_probs=10.9
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
-+-|++||+||+|
T Consensus 72 ~rFtlAHELGH~l 84 (213)
T COG2856 72 KRFTLAHELGHAL 84 (213)
T ss_pred HHHHHHHHHhHHH
Confidence 3459999999997
No 64
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=83.46 E-value=1.2 Score=40.27 Aligned_cols=17 Identities=6% Similarity=-0.271 Sum_probs=13.7
Q ss_pred cccchHHHHHHhhhhhc
Q 042321 145 LLLESQSVCMELIFKEL 161 (169)
Q Consensus 145 ~~~~~~~va~HEiGHaL 161 (169)
...+-..|++||+|||+
T Consensus 333 ~t~~dv~TL~HElGHa~ 349 (549)
T TIGR02289 333 GTSGDIDVLTHEAGHAF 349 (549)
T ss_pred CChhHHHHHHHHhhHHH
Confidence 34457789999999996
No 65
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=82.66 E-value=0.71 Score=35.98 Aligned_cols=15 Identities=13% Similarity=-0.148 Sum_probs=12.9
Q ss_pred chHHHHHHhhhhhcC
Q 042321 148 ESQSVCMELIFKELG 162 (169)
Q Consensus 148 ~~~~va~HEiGHaLG 162 (169)
++..|++||+||...
T Consensus 88 el~aVlaHElgH~~~ 102 (226)
T PF01435_consen 88 ELAAVLAHELGHIKH 102 (226)
T ss_dssp HHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHc
Confidence 488999999999863
No 66
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=81.81 E-value=0.77 Score=36.32 Aligned_cols=13 Identities=23% Similarity=0.061 Sum_probs=11.3
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
+..|++||+||++
T Consensus 38 ~~~v~iHElgH~~ 50 (208)
T cd06161 38 FLSVLLHELGHAL 50 (208)
T ss_pred HHHHHHHHHHHHH
Confidence 5679999999985
No 67
>PRK03982 heat shock protein HtpX; Provisional
Probab=81.03 E-value=0.82 Score=37.94 Aligned_cols=14 Identities=14% Similarity=-0.016 Sum_probs=12.2
Q ss_pred chHHHHHHhhhhhc
Q 042321 148 ESQSVCMELIFKEL 161 (169)
Q Consensus 148 ~~~~va~HEiGHaL 161 (169)
++..|++||+||.-
T Consensus 124 El~AVlAHElgHi~ 137 (288)
T PRK03982 124 ELEGVIAHELTHIK 137 (288)
T ss_pred HHHHHHHHHHHHHH
Confidence 48999999999963
No 68
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=80.61 E-value=1.7 Score=37.32 Aligned_cols=16 Identities=6% Similarity=-0.056 Sum_probs=12.9
Q ss_pred ccchHHHHHHhhhhhc
Q 042321 146 LLESQSVCMELIFKEL 161 (169)
Q Consensus 146 ~~~~~~va~HEiGHaL 161 (169)
..+-..+++||+|||+
T Consensus 219 ~~~~v~tl~HE~GHa~ 234 (427)
T cd06459 219 TLDDVFTLAHELGHAF 234 (427)
T ss_pred ChhhHHHHHHHhhHHH
Confidence 3456779999999985
No 69
>PRK03001 M48 family peptidase; Provisional
Probab=79.77 E-value=0.95 Score=37.44 Aligned_cols=14 Identities=14% Similarity=0.007 Sum_probs=12.2
Q ss_pred chHHHHHHhhhhhc
Q 042321 148 ESQSVCMELIFKEL 161 (169)
Q Consensus 148 ~~~~va~HEiGHaL 161 (169)
++..|++||+||.-
T Consensus 123 El~aVlAHElgHi~ 136 (283)
T PRK03001 123 EIRGVMAHELAHVK 136 (283)
T ss_pred HHHHHHHHHHHHHh
Confidence 48999999999964
No 70
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=78.94 E-value=1.1 Score=36.74 Aligned_cols=14 Identities=14% Similarity=-0.081 Sum_probs=12.3
Q ss_pred chHHHHHHhhhhhc
Q 042321 148 ESQSVCMELIFKEL 161 (169)
Q Consensus 148 ~~~~va~HEiGHaL 161 (169)
++..|++||+||..
T Consensus 156 El~aVlaHElgHi~ 169 (302)
T COG0501 156 ELEAVLAHELGHIK 169 (302)
T ss_pred HHHHHHHHHHHHHh
Confidence 48999999999974
No 71
>PRK03072 heat shock protein HtpX; Provisional
Probab=78.93 E-value=1 Score=37.43 Aligned_cols=14 Identities=14% Similarity=-0.043 Sum_probs=12.3
Q ss_pred chHHHHHHhhhhhc
Q 042321 148 ESQSVCMELIFKEL 161 (169)
Q Consensus 148 ~~~~va~HEiGHaL 161 (169)
++..|++||+||.-
T Consensus 126 El~aVlAHElgHi~ 139 (288)
T PRK03072 126 ELRGVLGHELSHVY 139 (288)
T ss_pred HHHHHHHHHHHHHh
Confidence 48999999999964
No 72
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=78.85 E-value=0.99 Score=38.27 Aligned_cols=16 Identities=0% Similarity=-0.168 Sum_probs=12.9
Q ss_pred ccchHHHHHHhhhhhc
Q 042321 146 LLESQSVCMELIFKEL 161 (169)
Q Consensus 146 ~~~~~~va~HEiGHaL 161 (169)
..+-..|++||+|||+
T Consensus 151 ~~~~v~tl~HE~GHa~ 166 (365)
T cd06258 151 GHDDINTLFHEFGHAV 166 (365)
T ss_pred CHHHHHHHHHHHhHHH
Confidence 3446779999999997
No 73
>PRK01345 heat shock protein HtpX; Provisional
Probab=78.76 E-value=1.1 Score=37.97 Aligned_cols=14 Identities=14% Similarity=-0.014 Sum_probs=12.4
Q ss_pred chHHHHHHhhhhhc
Q 042321 148 ESQSVCMELIFKEL 161 (169)
Q Consensus 148 ~~~~va~HEiGHaL 161 (169)
++..|++|||||.-
T Consensus 123 EL~aVlAHElgHi~ 136 (317)
T PRK01345 123 EVAGVMAHELAHVK 136 (317)
T ss_pred HHHHHHHHHHHHHH
Confidence 48999999999975
No 74
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=78.76 E-value=2 Score=39.02 Aligned_cols=17 Identities=12% Similarity=-0.121 Sum_probs=13.8
Q ss_pred cccchHHHHHHhhhhhc
Q 042321 145 LLLESQSVCMELIFKEL 161 (169)
Q Consensus 145 ~~~~~~~va~HEiGHaL 161 (169)
...+-..|++||+|||+
T Consensus 374 ~~~~dv~TLaHElGHa~ 390 (591)
T TIGR00181 374 GTLNSVFTLAHELGHSM 390 (591)
T ss_pred CCcchHHHHHHHhhhHH
Confidence 34567889999999995
No 75
>PLN02791 Nudix hydrolase homolog
Probab=78.19 E-value=1.1 Score=42.37 Aligned_cols=17 Identities=6% Similarity=-0.064 Sum_probs=14.8
Q ss_pred hHHHHHHhhhhhcCCCC
Q 042321 149 SQSVCMELIFKELGIFI 165 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h 165 (169)
+.+|++||+||.+|=..
T Consensus 585 ~~~v~lHElgHGsG~~~ 601 (770)
T PLN02791 585 FTHTICHECCHGIGPHT 601 (770)
T ss_pred HHHHHHHHhhccccccc
Confidence 78899999999999643
No 76
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=77.02 E-value=1.3 Score=35.57 Aligned_cols=13 Identities=23% Similarity=0.102 Sum_probs=11.4
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
+..|.+||+||++
T Consensus 53 ~~~v~iHElgH~~ 65 (227)
T cd06164 53 FASVLLHELGHSL 65 (227)
T ss_pred HHHHHHHHHHHHH
Confidence 6779999999986
No 77
>PRK04897 heat shock protein HtpX; Provisional
Probab=76.91 E-value=1.3 Score=37.03 Aligned_cols=14 Identities=21% Similarity=-0.071 Sum_probs=12.2
Q ss_pred chHHHHHHhhhhhc
Q 042321 148 ESQSVCMELIFKEL 161 (169)
Q Consensus 148 ~~~~va~HEiGHaL 161 (169)
++..|++||+||.-
T Consensus 136 El~aVlAHElgHi~ 149 (298)
T PRK04897 136 ELEGVIGHEISHIR 149 (298)
T ss_pred HHHHHHHHHHHHHh
Confidence 48999999999964
No 78
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=76.62 E-value=1.3 Score=38.86 Aligned_cols=15 Identities=0% Similarity=-0.066 Sum_probs=12.2
Q ss_pred cchHHHHHHhhhhhc
Q 042321 147 LESQSVCMELIFKEL 161 (169)
Q Consensus 147 ~~~~~va~HEiGHaL 161 (169)
.+-..|+.||+|||+
T Consensus 240 ~~~v~tLfHE~GHa~ 254 (458)
T PF01432_consen 240 HDDVETLFHEFGHAM 254 (458)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred hhhHHHHHHHHhHHH
Confidence 345779999999986
No 79
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=76.03 E-value=1.5 Score=36.59 Aligned_cols=13 Identities=8% Similarity=-0.176 Sum_probs=11.3
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
+..|++||+||++
T Consensus 135 ~isvvvHElgHal 147 (277)
T cd06162 135 LISGVVHEMGHGV 147 (277)
T ss_pred HHHHHHHHHHHHH
Confidence 5679999999986
No 80
>PRK01265 heat shock protein HtpX; Provisional
Probab=75.22 E-value=1.5 Score=37.25 Aligned_cols=13 Identities=15% Similarity=-0.054 Sum_probs=11.6
Q ss_pred chHHHHHHhhhhh
Q 042321 148 ESQSVCMELIFKE 160 (169)
Q Consensus 148 ~~~~va~HEiGHa 160 (169)
++..|++||+||.
T Consensus 139 El~aVlAHElgHi 151 (324)
T PRK01265 139 EIKAVAGHELGHL 151 (324)
T ss_pred HHHHHHHHHHHHH
Confidence 4899999999995
No 81
>PRK05457 heat shock protein HtpX; Provisional
Probab=75.03 E-value=1.6 Score=36.36 Aligned_cols=13 Identities=23% Similarity=0.087 Sum_probs=11.6
Q ss_pred chHHHHHHhhhhh
Q 042321 148 ESQSVCMELIFKE 160 (169)
Q Consensus 148 ~~~~va~HEiGHa 160 (169)
++..|++||+||.
T Consensus 133 El~aVlAHElgHi 145 (284)
T PRK05457 133 EVEAVLAHEISHI 145 (284)
T ss_pred HHHHHHHHHHHHH
Confidence 4899999999995
No 82
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=75.02 E-value=1.6 Score=32.97 Aligned_cols=12 Identities=8% Similarity=0.041 Sum_probs=10.3
Q ss_pred HHHHHHhhhhhc
Q 042321 150 QSVCMELIFKEL 161 (169)
Q Consensus 150 ~~va~HEiGHaL 161 (169)
..|++||++|++
T Consensus 136 lDVvaHEltHGV 147 (150)
T PF01447_consen 136 LDVVAHELTHGV 147 (150)
T ss_dssp HHHHHHHHHHHH
T ss_pred cceeeecccccc
Confidence 559999999974
No 83
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=74.77 E-value=1.5 Score=38.88 Aligned_cols=14 Identities=0% Similarity=-0.061 Sum_probs=12.0
Q ss_pred chHHHHHHhhhhhc
Q 042321 148 ESQSVCMELIFKEL 161 (169)
Q Consensus 148 ~~~~va~HEiGHaL 161 (169)
+-..|+.||+|||+
T Consensus 262 ~~V~TLfHEfGHal 275 (472)
T cd06455 262 DEVETFFHEFGHVI 275 (472)
T ss_pred HHHHHHHHHHHHHH
Confidence 46678999999998
No 84
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=74.46 E-value=9.3 Score=29.42 Aligned_cols=71 Identities=14% Similarity=0.093 Sum_probs=41.2
Q ss_pred hHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCC-Cceeecccccc-hHHHHHHhhhhhcCCCC
Q 042321 88 AINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVV-ESWLMLQLLLE-SQSVCMELIFKELGIFI 165 (169)
Q Consensus 88 ~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~-e~w~~~~~~~~-~~~va~HEiGHaLGL~h 165 (169)
.+..+..-+..|+...++. ...|+|.-.....|.+ +-++.. =+|.+...-.+ +..|++||+-|..-..|
T Consensus 110 ~~~~l~~~~~~~~~~~~~~--------~~~i~ir~~ksrWGsc-~~~~~I~ln~~L~~~P~~~idYVvvHEL~Hl~~~nH 180 (205)
T PF01863_consen 110 AKEYLPERLKKYAKKLGLP--------PPKIKIRDMKSRWGSC-SSKGNITLNWRLVMAPPEVIDYVVVHELCHLRHPNH 180 (205)
T ss_pred HHHHHHHHHHHHHHHcCCC--------cceEEEeehhhccccC-CCCCcEEeecccccCCccHHHHHHHHHHHHhccCCC
Confidence 3455566666777665553 3456666656555543 433321 12222111112 67899999999999999
Q ss_pred cc
Q 042321 166 NY 167 (169)
Q Consensus 166 ~~ 167 (169)
+-
T Consensus 181 s~ 182 (205)
T PF01863_consen 181 SK 182 (205)
T ss_pred CH
Confidence 74
No 85
>PF08823 PG_binding_2: Putative peptidoglycan binding domain; InterPro: IPR014927 This entry may be a peptidoglycan binding domain.
Probab=74.44 E-value=3.4 Score=27.48 Aligned_cols=34 Identities=21% Similarity=0.247 Sum_probs=20.1
Q ss_pred hhHHHHHHHHHHHHhcCCCccccCChHHhhhhhcC
Q 042321 23 FDELLESAIKNYQLNYHLKSTEVLDAGTVGKMRTH 57 (169)
Q Consensus 23 ~~~~~~~ai~~~Q~~~gL~~TG~lD~~T~~~m~~p 57 (169)
...++.++|++. -++.=+++|..|++|.+.|..-
T Consensus 17 ~~~evq~~L~~l-Gyy~g~~~g~~d~a~~~Al~~~ 50 (74)
T PF08823_consen 17 VAREVQEALKRL-GYYKGEADGVWDEATEDALRAW 50 (74)
T ss_pred HHHHHHHHHHHc-CCccCCCCCcccHHHHHHHHHH
Confidence 344555555544 2233377788888887776554
No 86
>PRK02391 heat shock protein HtpX; Provisional
Probab=74.42 E-value=1.7 Score=36.45 Aligned_cols=13 Identities=15% Similarity=0.049 Sum_probs=11.7
Q ss_pred chHHHHHHhhhhh
Q 042321 148 ESQSVCMELIFKE 160 (169)
Q Consensus 148 ~~~~va~HEiGHa 160 (169)
++..|++||+||.
T Consensus 132 El~aVlaHElgHi 144 (296)
T PRK02391 132 ELEAVLAHELSHV 144 (296)
T ss_pred HHHHHHHHHHHHH
Confidence 4899999999995
No 87
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=74.35 E-value=1.7 Score=35.85 Aligned_cols=13 Identities=8% Similarity=0.082 Sum_probs=11.2
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
+..|++||+||++
T Consensus 118 ~isv~iHElgHa~ 130 (263)
T cd06159 118 VVGVVVHELSHGI 130 (263)
T ss_pred HHHHHHHHHHHHH
Confidence 5678999999985
No 88
>PRK02870 heat shock protein HtpX; Provisional
Probab=73.82 E-value=1.8 Score=37.04 Aligned_cols=13 Identities=23% Similarity=0.069 Sum_probs=11.9
Q ss_pred chHHHHHHhhhhh
Q 042321 148 ESQSVCMELIFKE 160 (169)
Q Consensus 148 ~~~~va~HEiGHa 160 (169)
++..|++||+||.
T Consensus 172 EL~aVlAHELgHi 184 (336)
T PRK02870 172 ELQAVMAHELSHI 184 (336)
T ss_pred HHHHHHHHHHHHH
Confidence 4899999999997
No 89
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=73.52 E-value=1.7 Score=38.47 Aligned_cols=24 Identities=0% Similarity=-0.128 Sum_probs=16.3
Q ss_pred hHHHHHHHHHhhccCCCcceEecc
Q 042321 88 AINPVARAFQTWAPNTQFQFAESQ 111 (169)
Q Consensus 88 ~~~~i~~Af~~Ws~vt~L~F~ev~ 111 (169)
+..+++.-|..-+.+=+|+|+++.
T Consensus 132 l~~vl~gl~~~~~~lfgi~~~~~~ 155 (458)
T cd06457 132 LGTVMEGLSRLFSRLYGIRLVPVP 155 (458)
T ss_pred HHHHHHHHHHHHHHHhCeEEEecC
Confidence 445556666666677788888875
No 90
>KOG3538 consensus Disintegrin metalloproteinases with thrombospondin repeats [Posttranslational modification, protein turnover, chaperones]
Probab=73.18 E-value=0.83 Score=43.53 Aligned_cols=23 Identities=13% Similarity=-0.068 Sum_probs=18.8
Q ss_pred cccchHHHHHHhhhhhcCCCCcc
Q 042321 145 LLLESQSVCMELIFKELGIFINY 167 (169)
Q Consensus 145 ~~~~~~~va~HEiGHaLGL~h~~ 167 (169)
.++....|++||+||.+|+.|+.
T Consensus 313 ~gl~~aftiahE~GH~~gm~hd~ 335 (845)
T KOG3538|consen 313 TGLPSAFTIAHELGHNFGMEHDG 335 (845)
T ss_pred CCCccceeeeeccccccCcccCC
Confidence 34446679999999999999984
No 91
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=73.16 E-value=2 Score=29.77 Aligned_cols=13 Identities=8% Similarity=-0.010 Sum_probs=11.3
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
+..|++||+.|++
T Consensus 25 ~~~~l~HE~~H~~ 37 (128)
T PF13485_consen 25 LDRVLAHELAHQW 37 (128)
T ss_pred HHHHHHHHHHHHH
Confidence 5689999999985
No 92
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=71.74 E-value=2.2 Score=33.24 Aligned_cols=14 Identities=7% Similarity=-0.104 Sum_probs=11.5
Q ss_pred hHHHHHHhhhhhcC
Q 042321 149 SQSVCMELIFKELG 162 (169)
Q Consensus 149 ~~~va~HEiGHaLG 162 (169)
+..+.+||+||++-
T Consensus 41 ~~~l~iHElgH~~~ 54 (183)
T cd06160 41 LAILGIHEMGHYLA 54 (183)
T ss_pred HHHHHHHHHHHHHH
Confidence 56688999999863
No 93
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=71.64 E-value=2 Score=37.62 Aligned_cols=13 Identities=15% Similarity=0.089 Sum_probs=11.0
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
...++.||+||||
T Consensus 131 vi~~vvHElGHal 143 (484)
T KOG2921|consen 131 VITVVVHELGHAL 143 (484)
T ss_pred HHHHHHHHhhHHH
Confidence 4567899999998
No 94
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=70.60 E-value=2.2 Score=37.46 Aligned_cols=16 Identities=6% Similarity=-0.043 Sum_probs=12.7
Q ss_pred ccchHHHHHHhhhhhc
Q 042321 146 LLESQSVCMELIFKEL 161 (169)
Q Consensus 146 ~~~~~~va~HEiGHaL 161 (169)
..+-..|+.||+||||
T Consensus 205 ~~~~v~tLfHEfGHal 220 (422)
T cd06456 205 THDEVTTLFHEFGHAL 220 (422)
T ss_pred CHHHHHHHHHHHHHHH
Confidence 3446678899999998
No 95
>PF03571 Peptidase_M49: Peptidase family M49; InterPro: IPR005317 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M49 (dipeptidyl-peptidase III family, clan M-). The predicted active site residues occur in the motif HEXXXH which is unlike that in any other family. The dipeptidyl peptidase III aminopeptidases cleave dipeptides from the N-terminal of peptides consisting of four or more amino acids and have a broad specificity.; GO: 0008239 dipeptidyl-peptidase activity, 0006508 proteolysis, 0005737 cytoplasm; PDB: 3T6B_A 3FVY_A 3T6J_A 3CSK_A.
Probab=68.20 E-value=2.4 Score=38.62 Aligned_cols=14 Identities=14% Similarity=-0.007 Sum_probs=11.9
Q ss_pred hHHHHHHhh-hhhcC
Q 042321 149 SQSVCMELI-FKELG 162 (169)
Q Consensus 149 ~~~va~HEi-GHaLG 162 (169)
+.+|++||+ ||..|
T Consensus 293 ~~~v~lHEllGHGsG 307 (549)
T PF03571_consen 293 FVQVGLHELLGHGSG 307 (549)
T ss_dssp HHHHHHHHHCCCCS-
T ss_pred hhhhhHHhhccCcCc
Confidence 678999999 99988
No 96
>PF08823 PG_binding_2: Putative peptidoglycan binding domain; InterPro: IPR014927 This entry may be a peptidoglycan binding domain.
Probab=67.10 E-value=9.1 Score=25.42 Aligned_cols=46 Identities=15% Similarity=0.334 Sum_probs=31.9
Q ss_pred CCCccCCCCCCCCCCCCCCCcchhHHHHHHHHHHHHhcCCC----ccccCChHHhhhh
Q 042321 1 MGKFGYLNYDNSNNQTHANDDDFDELLESAIKNYQLNYHLK----STEVLDAGTVGKM 54 (169)
Q Consensus 1 L~~fGYl~~~~~~~~~~~~~~~~~~~~~~ai~~~Q~~~gL~----~TG~lD~~T~~~m 54 (169)
|.+.||+..... ..+++.+.+|++.|+-.-+|. .-|.+|...++.|
T Consensus 25 L~~lGyy~g~~~--------g~~d~a~~~Al~~~~g~ENfE~R~~~~~~ID~~VL~~L 74 (74)
T PF08823_consen 25 LKRLGYYKGEAD--------GVWDEATEDALRAWAGTENFEERIRGDGKIDPVVLAYL 74 (74)
T ss_pred HHHcCCccCCCC--------CcccHHHHHHHHHHHHHhhHHhhcCCCCccCHHHHhhC
Confidence 567899865531 124689999999999886664 4455787776543
No 97
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=67.09 E-value=2.6 Score=35.33 Aligned_cols=13 Identities=8% Similarity=-0.156 Sum_probs=10.3
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
...|++||.||.+
T Consensus 170 ~ayVlAHEyGHHV 182 (292)
T PF04228_consen 170 QAYVLAHEYGHHV 182 (292)
T ss_pred HHHHHHHHHHHHH
Confidence 4558999999963
No 98
>PF03272 Enhancin: Viral enhancin protein; InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=64.44 E-value=3.3 Score=39.27 Aligned_cols=66 Identities=18% Similarity=0.235 Sum_probs=33.9
Q ss_pred HHHHHhhccCCCcceEeccCCCcccEEE-EeecCC-CCCCCCCCCCCCceee-----ccccc---chHHHHHHhhhhhc
Q 042321 93 ARAFQTWAPNTQFQFAESQDYRNADIKI-SFESGD-HGAGALSTAVVESWLM-----LQLLL---ESQSVCMELIFKEL 161 (169)
Q Consensus 93 ~~Af~~Ws~vt~L~F~ev~~~~~ADI~I-~F~~~~-Hgd~~pfd~~~e~w~~-----~~~~~---~~~~va~HEiGHaL 161 (169)
..-|...-+.+.|.|...... +.++.= .|.+.+ ||-|.++-+. .|.. ++..+ -.-..++|||||+-
T Consensus 173 ~~i~~~Yd~l~Gl~~~~~~~~-~~n~~~kYF~KAD~~G~G~AYY~~--~w~a~ss~s~~~~L~~~~~nW~~LHEIgHgY 248 (775)
T PF03272_consen 173 NDIFKFYDDLTGLSDDPSDPV-DKNFNNKYFAKADKSGPGAAYYGS--NWTAQSSSSLSFYLNPSPTNWGALHEIGHGY 248 (775)
T ss_pred HHHHHHHHhhhCCCCCCCccc-ccccccceEEEecCCCCCCccccc--cceecCchhHHHHhCCCCCCchhhhhhhhhc
Confidence 455666666667777654321 111111 254443 6655555443 4532 11222 13458999999984
No 99
>PF11150 DUF2927: Protein of unknown function (DUF2927); InterPro: IPR021323 This family is conserved in Proteobacteria. Several members are described as being putative lipoproteins, but otherwise the function is not known.
Probab=64.25 E-value=34 Score=27.32 Aligned_cols=57 Identities=16% Similarity=0.183 Sum_probs=42.2
Q ss_pred CCCCCCCcceEEEeccCCch---hhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCC
Q 042321 67 PRWPASKFSLTYAFLPGTRG---DAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDH 127 (169)
Q Consensus 67 ~kW~~~k~~LTy~~~~~~~~---~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~H 127 (169)
.||. + .++..+.+.++. .-.+.+..-++.-+.++.+.+..+... +|+|.|.|...+.
T Consensus 28 ~Rw~--~-PVrv~v~~~~~~~~~~d~~~v~~~~~rL~~itg~~I~~~~~~-~aN~~v~~~~~~~ 87 (213)
T PF11150_consen 28 RRWE--G-PVRVRVEGVPPADRARDLARVRAYLARLRRITGHPISQVSSP-NANFHVIFVSEDD 87 (213)
T ss_pred ccCC--C-CeEEEEeccChhhHHHHHHHHHHHHhhhccccCCceeeccCC-CCcEEEEEeccch
Confidence 5998 3 478888754443 233555555666688999999998876 8999999999876
No 100
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=64.07 E-value=3.5 Score=36.06 Aligned_cols=13 Identities=8% Similarity=-0.148 Sum_probs=10.8
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
-..+++||.||||
T Consensus 159 ~l~t~iHE~GHal 171 (396)
T cd06460 159 ALFSTIHETGHAL 171 (396)
T ss_pred HHHHHHHHhhHHH
Confidence 4568899999996
No 101
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=63.29 E-value=3.8 Score=34.71 Aligned_cols=18 Identities=11% Similarity=0.054 Sum_probs=15.4
Q ss_pred cchHHHHHHhhhhhcCCC
Q 042321 147 LESQSVCMELIFKELGIF 164 (169)
Q Consensus 147 ~~~~~va~HEiGHaLGL~ 164 (169)
..+..|++||++|-+|..
T Consensus 194 ~~~P~T~~HElAHq~G~a 211 (318)
T PF12725_consen 194 YSLPFTICHELAHQLGFA 211 (318)
T ss_pred ccccHHHHHHHHHHhCCC
Confidence 347889999999999975
No 102
>PRK10911 oligopeptidase A; Provisional
Probab=61.81 E-value=4 Score=38.11 Aligned_cols=14 Identities=7% Similarity=-0.071 Sum_probs=11.8
Q ss_pred chHHHHHHhhhhhc
Q 042321 148 ESQSVCMELIFKEL 161 (169)
Q Consensus 148 ~~~~va~HEiGHaL 161 (169)
+-..|+.||+||+|
T Consensus 462 ~~v~tlfHEfGHal 475 (680)
T PRK10911 462 DEVITLFHEFGHGL 475 (680)
T ss_pred HHHHHHHHHHhHHH
Confidence 35668999999998
No 103
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=60.19 E-value=4.2 Score=37.41 Aligned_cols=16 Identities=6% Similarity=-0.103 Sum_probs=13.0
Q ss_pred ccchHHHHHHhhhhhc
Q 042321 146 LLESQSVCMELIFKEL 161 (169)
Q Consensus 146 ~~~~~~va~HEiGHaL 161 (169)
.++-..|++||+||+.
T Consensus 377 t~~dV~TLaHElGHs~ 392 (598)
T COG1164 377 TLRDVFTLAHELGHSV 392 (598)
T ss_pred chhHHHHHHHHccHHH
Confidence 4456889999999985
No 104
>COG4412 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=60.06 E-value=2.4 Score=38.56 Aligned_cols=17 Identities=24% Similarity=0.225 Sum_probs=15.6
Q ss_pred HHHHhhhhhcCCCCccC
Q 042321 152 VCMELIFKELGIFINYN 168 (169)
Q Consensus 152 va~HEiGHaLGL~h~~~ 168 (169)
|.+||.||-|||.+.|.
T Consensus 349 v~ahEyghdlglPDEyd 365 (760)
T COG4412 349 VEAHEYGHDLGLPDEYD 365 (760)
T ss_pred eeeeecccccCCCcccc
Confidence 78999999999998875
No 105
>COG3023 ampD N-acetyl-anhydromuramyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]
Probab=59.23 E-value=14 Score=30.39 Aligned_cols=34 Identities=15% Similarity=0.326 Sum_probs=23.2
Q ss_pred cchhHHHHHHHHHHHHhcCCC-ccccCChHHhhhh
Q 042321 21 DDFDELLESAIKNYQLNYHLK-STEVLDAGTVGKM 54 (169)
Q Consensus 21 ~~~~~~~~~ai~~~Q~~~gL~-~TG~lD~~T~~~m 54 (169)
..+++..+..|+.||.-|+=. .+|+.|.+|++.+
T Consensus 217 ~~~d~~t~~vv~aFQ~hfrp~~~dg~~d~et~a~l 251 (257)
T COG3023 217 GVFDQETQQVVRAFQMHFRPGLYDGEADVETIAIL 251 (257)
T ss_pred chhhHHHHHHHHHHHHHhCCCCCCCCCChHHHHHH
Confidence 446777777777777776522 5777777777654
No 106
>KOG1047 consensus Bifunctional leukotriene A4 hydrolase/aminopeptidase LTA4H [Lipid transport and metabolism; Posttranslational modification, protein turnover, chaperones; Defense mechanisms; Amino acid transport and metabolism]
Probab=59.06 E-value=6.8 Score=35.79 Aligned_cols=30 Identities=13% Similarity=-0.099 Sum_probs=19.9
Q ss_pred CCCCCCCCcee------ecccccchHHHHHHhhhhh
Q 042321 131 ALSTAVVESWL------MLQLLLESQSVCMELIFKE 160 (169)
Q Consensus 131 ~pfd~~~e~w~------~~~~~~~~~~va~HEiGHa 160 (169)
+||.|...+.- ++..+-.+..|++|||-|+
T Consensus 264 FP~gGMENPcltF~TpTllaGDrsl~~vIaHEIAHS 299 (613)
T KOG1047|consen 264 FPFGGMENPCLTFVTPTLLAGDRSLVDVIAHEIAHS 299 (613)
T ss_pred CCcccccCcceeeecchhhcCCcchhhHHHHHhhhh
Confidence 67777654441 1233334789999999997
No 107
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=56.27 E-value=5.9 Score=34.97 Aligned_cols=13 Identities=15% Similarity=-0.156 Sum_probs=11.5
Q ss_pred chHHHHHHhhhhh
Q 042321 148 ESQSVCMELIFKE 160 (169)
Q Consensus 148 ~~~~va~HEiGHa 160 (169)
++..|++||+||-
T Consensus 279 el~AVl~HELGHW 291 (428)
T KOG2719|consen 279 ELVAVLAHELGHW 291 (428)
T ss_pred HHHHHHHHHhhHH
Confidence 4889999999995
No 108
>PF08434 CLCA_N: Calcium-activated chloride channel; InterPro: IPR013642 The CLCA family of calcium-activated chloride channels has been identified in many epithelial and endothelial cell types as well as in smooth muscle cells [] and has four or five putative transmembrane regions. Additionally to their role as chloride channels some CLCA proteins function as adhesion molecules and may also have roles as tumour suppressors []. The domain described here is found at the N terminus of CLCAs.
Probab=52.77 E-value=9.9 Score=31.45 Aligned_cols=21 Identities=24% Similarity=0.393 Sum_probs=13.3
Q ss_pred CCcccEEEEeecCCCCCCCCCC
Q 042321 113 YRNADIKISFESGDHGAGALST 134 (169)
Q Consensus 113 ~~~ADI~I~F~~~~Hgd~~pfd 134 (169)
...|||+|.-....+|| .|+.
T Consensus 98 ~~~Adv~V~~~~~~~~d-~PyT 118 (262)
T PF08434_consen 98 YETADVRVASPNPKYGD-EPYT 118 (262)
T ss_pred hcCCCEEEeCCCCCCCC-CCce
Confidence 45899999854444455 3543
No 109
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=48.75 E-value=9 Score=35.86 Aligned_cols=34 Identities=18% Similarity=0.160 Sum_probs=20.6
Q ss_pred hHHHH-HHHHHhhccCCCcceEeccC--CCcccEEEE
Q 042321 88 AINPV-ARAFQTWAPNTQFQFAESQD--YRNADIKIS 121 (169)
Q Consensus 88 ~~~~i-~~Af~~Ws~vt~L~F~ev~~--~~~ADI~I~ 121 (169)
...++ .--|..-+..=.|+|+++.+ ....|++..
T Consensus 362 l~~Vl~~Glf~l~~~LfGi~f~~~~~~~vWh~dV~~~ 398 (681)
T PRK10280 362 LNTVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRVW 398 (681)
T ss_pred HHHHHHHhHHHHHHHHcCeEEEECCCCCCCCCCeeEE
Confidence 34555 35666666777888988653 235566543
No 110
>PRK10733 hflB ATP-dependent metalloprotease; Reviewed
Probab=48.49 E-value=9.1 Score=35.47 Aligned_cols=13 Identities=8% Similarity=-0.015 Sum_probs=11.6
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
...||.||+||||
T Consensus 408 ~~~~a~he~gha~ 420 (644)
T PRK10733 408 KESTAYHEAGHAI 420 (644)
T ss_pred HHHHHHHHHHHHH
Confidence 5779999999996
No 111
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=48.39 E-value=9.9 Score=32.77 Aligned_cols=13 Identities=15% Similarity=-0.015 Sum_probs=11.6
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
+-.|+.|||+|++
T Consensus 275 lAtvLgHE~aHaV 287 (424)
T KOG2661|consen 275 LATVLGHEIAHAV 287 (424)
T ss_pred HHHHHHHHHHHHH
Confidence 7779999999986
No 112
>CHL00176 ftsH cell division protein; Validated
Probab=45.76 E-value=10 Score=35.25 Aligned_cols=13 Identities=0% Similarity=0.048 Sum_probs=11.5
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
...||.||.||||
T Consensus 438 ~~~vA~hEaGhA~ 450 (638)
T CHL00176 438 KRLIAYHEVGHAI 450 (638)
T ss_pred HHHHHHHhhhhHH
Confidence 5679999999998
No 113
>PF06167 Peptidase_M90: Glucose-regulated metallo-peptidase M90; InterPro: IPR010384 MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=45.59 E-value=9.6 Score=31.28 Aligned_cols=18 Identities=11% Similarity=0.167 Sum_probs=12.8
Q ss_pred cchHHHHHHhhhhhcCCC
Q 042321 147 LESQSVCMELIFKELGIF 164 (169)
Q Consensus 147 ~~~~~va~HEiGHaLGL~ 164 (169)
-|-..|++||+-|+|=+.
T Consensus 152 ~dg~NVviHEfAH~LD~~ 169 (253)
T PF06167_consen 152 NDGHNVVIHEFAHKLDME 169 (253)
T ss_dssp SSS--HHHHHHHHHHHCT
T ss_pred CCCcchHHHHHHHHHHhh
Confidence 346789999999999543
No 114
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=45.19 E-value=11 Score=31.38 Aligned_cols=18 Identities=6% Similarity=-0.189 Sum_probs=14.9
Q ss_pred chHHHHHHhhhhhcCCCC
Q 042321 148 ESQSVCMELIFKELGIFI 165 (169)
Q Consensus 148 ~~~~va~HEiGHaLGL~h 165 (169)
++..|++||+||.-|=++
T Consensus 202 ~yyaTl~HElghwtgh~~ 219 (316)
T COG4227 202 NYYATLLHELGHWTGHEA 219 (316)
T ss_pred hHHHHHHHHhccccCchh
Confidence 488899999999988543
No 115
>COG3227 LasB Zinc metalloprotease (elastase) [Amino acid transport and metabolism]
Probab=44.47 E-value=8.9 Score=34.37 Aligned_cols=12 Identities=8% Similarity=0.041 Sum_probs=10.0
Q ss_pred HHHHHHhhhhhc
Q 042321 150 QSVCMELIFKEL 161 (169)
Q Consensus 150 ~~va~HEiGHaL 161 (169)
..|++||+.|..
T Consensus 338 LDVvAHElTHGv 349 (507)
T COG3227 338 LDVVAHELTHGV 349 (507)
T ss_pred cceehhhhcchh
Confidence 459999999974
No 116
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=41.99 E-value=15 Score=23.20 Aligned_cols=18 Identities=17% Similarity=0.265 Sum_probs=15.4
Q ss_pred hHHHHHHhhhhhcCCCCc
Q 042321 149 SQSVCMELIFKELGIFIN 166 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~ 166 (169)
...-++|++.+.+||.|.
T Consensus 29 ~eR~~vH~lA~~~gL~s~ 46 (60)
T cd02641 29 HDRLLVHELAEELGLRHE 46 (60)
T ss_pred HHHHHHHHHHHHcCCceE
Confidence 566789999999999875
No 117
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=41.81 E-value=13 Score=24.92 Aligned_cols=13 Identities=0% Similarity=0.033 Sum_probs=11.0
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
-..+++||+.|.+
T Consensus 61 ~~~llaHEl~Hv~ 73 (79)
T PF13699_consen 61 GRALLAHELAHVV 73 (79)
T ss_pred cchhHhHHHHHHH
Confidence 4679999999985
No 118
>PF02074 Peptidase_M32: Carboxypeptidase Taq (M32) metallopeptidase; InterPro: IPR001333 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M32 (carboxypeptidase Taq family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Carboxypeptidase Taq is a zinc-containing thermostable metallopeptidase. It was originally discovered and purified from Thermus aquaticus; optimal enzymatic activity occurs at 80 celcius. Although very little is known about this enzyme, it is thought either to be associated with a membrane or to be particle bound.; GO: 0004181 metallocarboxypeptidase activity, 0006508 proteolysis; PDB: 1K9X_A 1KA4_A 1KA2_A 3DWC_A 1WGZ_A 3HQ2_A 3HOA_B.
Probab=41.19 E-value=12 Score=33.72 Aligned_cols=12 Identities=8% Similarity=-0.191 Sum_probs=9.5
Q ss_pred HHHHHHhhhhhc
Q 042321 150 QSVCMELIFKEL 161 (169)
Q Consensus 150 ~~va~HEiGHaL 161 (169)
..-++||.||||
T Consensus 260 l~s~iHE~GHal 271 (494)
T PF02074_consen 260 LFSTIHETGHAL 271 (494)
T ss_dssp HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHH
Confidence 345689999997
No 119
>PF06540 GMAP: Galanin message associated peptide (GMAP); InterPro: IPR013068 Galanin is a peptide hormone that controls various biological activities []. Galanin-like immuno-reactivity has been found in the central and peripheral nervous systems of mammals, with high concentrations demonstrated in discrete regions of the central nervous system, including the median eminence, hypothalamus, arcuate nucleus, septum, neuro-intermediate lobe of the pituitary, and the spinal cord. Its localisation within neurosecretory granules suggests that galanin may function as a neurotransmitter, and it has been shown to coexist with a variety of other peptide and amine neurotransmitters within individual neurons []. Although the precise physiological role of galanin is uncertain, it has a number of pharmacological properties: it stimulates food intake, when injected into the third ventricle of rats; it increases levels of plasma growth hormone and prolactin, and decreases dopamine levels in the median eminence []; and infusion into humans results in hyperglycemia and glucose intolerance, and inhibits pancreatic release of insulin, somatostatin and pancreatic peptide. Galanin also modulates smooth muscle contractility within the gastro-intestinal and genito-urinary tracts, all such activities suggesting that the hormone may play an important role in the nervous modulation of endocrine and smooth muscle function []. This domain represents the galanin message-associated peptide (GMAP) domain which is found C-terminal to the galanin domain in the preprogalanin precursor protein. GMAP sequences in different species show a high degree of homology, but the biological function of the GMAP peptide is not known [].
Probab=39.87 E-value=14 Score=23.66 Aligned_cols=25 Identities=28% Similarity=0.392 Sum_probs=22.6
Q ss_pred hHHHHHHHHHHHHhcCCCccccCCh
Q 042321 24 DELLESAIKNYQLNYHLKSTEVLDA 48 (169)
Q Consensus 24 ~~~~~~ai~~~Q~~~gL~~TG~lD~ 48 (169)
++++...|..|=.|.+|+..|-||.
T Consensus 22 d~nivrTiiEFLtfLhLKEaGALd~ 46 (62)
T PF06540_consen 22 DDNIVRTIIEFLTFLHLKEAGALDN 46 (62)
T ss_pred hhHHHHHHHHHHHHHHHHHhcchhc
Confidence 6788999999999999999999984
No 120
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=39.50 E-value=13 Score=33.37 Aligned_cols=9 Identities=11% Similarity=-0.057 Sum_probs=7.9
Q ss_pred HHHhhhhhc
Q 042321 153 CMELIFKEL 161 (169)
Q Consensus 153 a~HEiGHaL 161 (169)
++||.||||
T Consensus 263 ~iHE~GHAl 271 (497)
T COG2317 263 TIHETGHAL 271 (497)
T ss_pred HHHhhhhHH
Confidence 489999997
No 121
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=39.02 E-value=8.7 Score=35.27 Aligned_cols=13 Identities=15% Similarity=-0.118 Sum_probs=10.9
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
+-.|+.|||||+.
T Consensus 487 IGaVIgHEI~HgF 499 (654)
T COG3590 487 IGAVIGHEIGHGF 499 (654)
T ss_pred ccceehhhhcccc
Confidence 5679999999974
No 122
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=38.15 E-value=17 Score=32.56 Aligned_cols=17 Identities=6% Similarity=-0.154 Sum_probs=13.2
Q ss_pred cccchHHHHHHhhhhhc
Q 042321 145 LLLESQSVCMELIFKEL 161 (169)
Q Consensus 145 ~~~~~~~va~HEiGHaL 161 (169)
...+-+.|+.||+||+-
T Consensus 243 ~t~~D~~t~~HE~GH~~ 259 (477)
T cd06461 243 VNMEDFVTVHHEMGHIQ 259 (477)
T ss_pred CCHHHHHHHHHHHHHHH
Confidence 34446779999999996
No 123
>PF04315 DUF462: Protein of unknown function, DUF462; InterPro: IPR007411 This family consists of bacterial proteins of uncharacterised function.
Probab=36.06 E-value=21 Score=27.49 Aligned_cols=15 Identities=7% Similarity=-0.232 Sum_probs=12.3
Q ss_pred cchHHHHHHhhhhhc
Q 042321 147 LESQSVCMELIFKEL 161 (169)
Q Consensus 147 ~~~~~va~HEiGHaL 161 (169)
-+++.-|+|||.|=.
T Consensus 39 hgf~aSALHEIaHWc 53 (164)
T PF04315_consen 39 HGFFASALHEIAHWC 53 (164)
T ss_pred cchHHHHHHHHHHHH
Confidence 358999999999954
No 124
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=36.01 E-value=21 Score=28.62 Aligned_cols=13 Identities=8% Similarity=0.143 Sum_probs=11.0
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
+-++++||.+||-
T Consensus 93 ~gsiLAHE~mHa~ 105 (212)
T PF12315_consen 93 TGSILAHELMHAW 105 (212)
T ss_pred HhhHHHHHHHHHH
Confidence 5778999999983
No 125
>PF14891 Peptidase_M91: Effector protein
Probab=35.70 E-value=21 Score=27.25 Aligned_cols=13 Identities=8% Similarity=0.015 Sum_probs=11.0
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
-..|+.||+.||.
T Consensus 103 p~v~L~HEL~HA~ 115 (174)
T PF14891_consen 103 PFVVLYHELIHAY 115 (174)
T ss_pred HHHHHHHHHHHHH
Confidence 5678999999985
No 126
>TIGR01241 FtsH_fam ATP-dependent metalloprotease FtsH. HflB(FtsH) is a pleiotropic protein required for correct cell division in bacteria. It has ATP-dependent zinc metalloprotease activity. It was formerly designated cell division protein FtsH.
Probab=35.54 E-value=19 Score=32.10 Aligned_cols=13 Identities=15% Similarity=-0.013 Sum_probs=11.2
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
...+|.||.||||
T Consensus 311 ~~~~A~hEaGhAl 323 (495)
T TIGR01241 311 KKLVAYHEAGHAL 323 (495)
T ss_pred HHHHHHHHHhHHH
Confidence 4679999999996
No 127
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=33.98 E-value=29 Score=33.20 Aligned_cols=12 Identities=8% Similarity=0.044 Sum_probs=9.8
Q ss_pred hHHHHHHhhhhh
Q 042321 149 SQSVCMELIFKE 160 (169)
Q Consensus 149 ~~~va~HEiGHa 160 (169)
...|++|||.|-
T Consensus 287 ~~~viaHElAHq 298 (831)
T TIGR02412 287 RAGVILHEMAHM 298 (831)
T ss_pred HHHHHHHHHHHH
Confidence 457999999993
No 128
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=33.87 E-value=22 Score=26.25 Aligned_cols=13 Identities=0% Similarity=-0.000 Sum_probs=11.6
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
+..|++||+=|++
T Consensus 59 l~~~l~HEm~H~~ 71 (146)
T smart00731 59 LRETLLHELCHAA 71 (146)
T ss_pred HHhhHHHHHHHHH
Confidence 7779999999986
No 129
>PF13402 M60-like: Peptidase M60-like family; PDB: 4FCA_A.
Probab=33.48 E-value=19 Score=29.54 Aligned_cols=12 Identities=0% Similarity=-0.466 Sum_probs=9.0
Q ss_pred HHHHHHhhhhhc
Q 042321 150 QSVCMELIFKEL 161 (169)
Q Consensus 150 ~~va~HEiGHaL 161 (169)
..-..|||||-+
T Consensus 220 ~WG~~HE~GH~~ 231 (307)
T PF13402_consen 220 GWGPWHELGHNH 231 (307)
T ss_dssp -HHHHHHHHHHH
T ss_pred Ceeehhhhhhhc
Confidence 446799999964
No 130
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=33.04 E-value=20 Score=33.56 Aligned_cols=35 Identities=14% Similarity=0.185 Sum_probs=24.2
Q ss_pred hhHHHHHHHHHhhccCCCcceEeccCC--CcccEEEE
Q 042321 87 DAINPVARAFQTWAPNTQFQFAESQDY--RNADIKIS 121 (169)
Q Consensus 87 ~~~~~i~~Af~~Ws~vt~L~F~ev~~~--~~ADI~I~ 121 (169)
...+++.-.|.+-...=+|+|+|..+. ...|+++-
T Consensus 362 ~l~~Vl~GlF~~~~rLfGI~~~e~~~~~vwHpDVr~~ 398 (683)
T COG0339 362 PLNKVLEGLFEVAKRLFGITFVERKDIPVWHPDVRVF 398 (683)
T ss_pred ChhHHHHHHHHHHHHHcCeEEEECCCCCccCCCceEE
Confidence 345677777777778888999997432 35677754
No 131
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=32.83 E-value=25 Score=22.42 Aligned_cols=19 Identities=16% Similarity=0.186 Sum_probs=15.8
Q ss_pred hHHHHHHhhhhhcCCCCcc
Q 042321 149 SQSVCMELIFKELGIFINY 167 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~~ 167 (169)
-..-++|.+-|-|||.|.+
T Consensus 29 ~eRriih~la~~lGL~~~s 47 (60)
T cd02639 29 AERRIVHLLASRLGLNHVS 47 (60)
T ss_pred HHHHHHHHHHHHcCCceEE
Confidence 4557899999999999875
No 132
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=30.75 E-value=23 Score=29.68 Aligned_cols=12 Identities=8% Similarity=-0.103 Sum_probs=9.8
Q ss_pred hHHHHHHhhhhh
Q 042321 149 SQSVCMELIFKE 160 (169)
Q Consensus 149 ~~~va~HEiGHa 160 (169)
...|++||.||-
T Consensus 167 qaYViAHEVGHH 178 (295)
T COG2321 167 QAYVIAHEVGHH 178 (295)
T ss_pred HHHHHHhhhhHH
Confidence 456999999984
No 133
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=30.46 E-value=31 Score=31.75 Aligned_cols=12 Identities=8% Similarity=0.014 Sum_probs=10.3
Q ss_pred hHHHHHHhhhhh
Q 042321 149 SQSVCMELIFKE 160 (169)
Q Consensus 149 ~~~va~HEiGHa 160 (169)
...|++|||.|.
T Consensus 279 ~~~viaHElAHq 290 (601)
T TIGR02411 279 NVDVIAHELAHS 290 (601)
T ss_pred hhhhHHHHHHhh
Confidence 567999999994
No 134
>KOG2987 consensus Fatty acid desaturase [Lipid transport and metabolism]
Probab=29.35 E-value=26 Score=29.22 Aligned_cols=20 Identities=20% Similarity=0.135 Sum_probs=16.9
Q ss_pred hHHHHHHhhhhhcCCCCccC
Q 042321 149 SQSVCMELIFKELGIFINYN 168 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~~~ 168 (169)
...-|+|||-|-|+.++++.
T Consensus 83 sl~LAIHeiSHN~aFg~~rp 102 (324)
T KOG2987|consen 83 SLTLAIHEISHNLAFGTNRP 102 (324)
T ss_pred hHHHHHHHhhhhhhcccCch
Confidence 45689999999999998764
No 135
>PRK15410 DgsA anti-repressor MtfA; Provisional
Probab=29.15 E-value=22 Score=29.41 Aligned_cols=19 Identities=5% Similarity=0.032 Sum_probs=15.5
Q ss_pred cchHHHHHHhhhhhcCCCC
Q 042321 147 LESQSVCMELIFKELGIFI 165 (169)
Q Consensus 147 ~~~~~va~HEiGHaLGL~h 165 (169)
.|-..|++||+-|.|=+.+
T Consensus 140 ~dg~NVvIHEFAH~LDm~~ 158 (260)
T PRK15410 140 ASGFNLIIHEVAHKLDMRN 158 (260)
T ss_pred CCCcchhHhHHHhHhhhhc
Confidence 3567899999999997665
No 136
>PF15639 Tox-MPTase3: Metallopeptidase toxin 3
Probab=27.75 E-value=1.3e+02 Score=22.28 Aligned_cols=69 Identities=13% Similarity=0.162 Sum_probs=41.1
Q ss_pred HHHHHHHHhhccCCCcceEeccC-CCcccEEEEeecCCC--CCCCCCCCCCCceeec-------c---------cccchH
Q 042321 90 NPVARAFQTWAPNTQFQFAESQD-YRNADIKISFESGDH--GAGALSTAVVESWLML-------Q---------LLLESQ 150 (169)
Q Consensus 90 ~~i~~Af~~Ws~vt~L~F~ev~~-~~~ADI~I~F~~~~H--gd~~pfd~~~e~w~~~-------~---------~~~~~~ 150 (169)
..+-+|++.|...+-..-++.-. +..+-|+|.-..+.| |-..|+++..+-. +. + ..+-..
T Consensus 23 p~iv~A~~K~~~l~~~~l~~~l~WGkGP~IkI~~~~~~~A~G~t~p~~~~~~i~-I~~~lV~~~Ek~~~~~r~~~~~~v~ 101 (135)
T PF15639_consen 23 PAIVKAYKKWGGLNKAKLQEALTWGKGPYIKITPLVGGTAYGYTTPDDGSQVIR-IDGDLVNMFEKGPGERRAGNVYLVG 101 (135)
T ss_pred hHHHHHHHHHcccchhhhhhccccCCCCEEEEeecccceeeeeecCCCCCcEEE-eeHHHhhhhhcCCcccccceEEEee
Confidence 55678888887766444444322 456889999888854 4445666443221 10 0 011156
Q ss_pred HHHHHhhhh
Q 042321 151 SVCMELIFK 159 (169)
Q Consensus 151 ~va~HEiGH 159 (169)
.+++||.=|
T Consensus 102 ~TlLHEliH 110 (135)
T PF15639_consen 102 STLLHELIH 110 (135)
T ss_pred hHHHHHHHH
Confidence 799999877
No 137
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=27.52 E-value=31 Score=30.04 Aligned_cols=15 Identities=13% Similarity=-0.110 Sum_probs=12.4
Q ss_pred cchHHHHHHhhhhhc
Q 042321 147 LESQSVCMELIFKEL 161 (169)
Q Consensus 147 ~~~~~va~HEiGHaL 161 (169)
-++-.|+.||+||..
T Consensus 122 sEvAAVl~HEmgHVt 136 (479)
T COG4784 122 SEVAAVLAHEMGHVT 136 (479)
T ss_pred HHHHHHHHhhhhhee
Confidence 358889999999964
No 138
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=27.39 E-value=37 Score=24.94 Aligned_cols=14 Identities=7% Similarity=-0.031 Sum_probs=12.2
Q ss_pred chHHHHHHhhhhhc
Q 042321 148 ESQSVCMELIFKEL 161 (169)
Q Consensus 148 ~~~~va~HEiGHaL 161 (169)
.+..|++||+=|+.
T Consensus 59 ~~~~tL~HEm~H~~ 72 (157)
T PF10263_consen 59 ELIDTLLHEMAHAA 72 (157)
T ss_pred HHHHHHHHHHHHHH
Confidence 38889999999986
No 139
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=27.18 E-value=67 Score=31.13 Aligned_cols=20 Identities=25% Similarity=0.315 Sum_probs=14.9
Q ss_pred hHHHHHHhhhhh-cCCCCccC
Q 042321 149 SQSVCMELIFKE-LGIFINYN 168 (169)
Q Consensus 149 ~~~va~HEiGHa-LGL~h~~~ 168 (169)
.-.+.+||-||. -|+.+.||
T Consensus 149 ~~r~~VheWah~R~GvfdEy~ 169 (863)
T TIGR00868 149 RGRVFVHEWAHLRWGVFDEYN 169 (863)
T ss_pred ccchhhhhhhhhhcccccccc
Confidence 345899999996 46666665
No 140
>COG3926 zliS Lysozyme family protein [General function prediction only]
Probab=25.15 E-value=73 Score=25.97 Aligned_cols=30 Identities=20% Similarity=0.223 Sum_probs=26.4
Q ss_pred HHHHHHHHHhcC----CCccccCChHHhhhhhcC
Q 042321 28 ESAIKNYQLNYH----LKSTEVLDAGTVGKMRTH 57 (169)
Q Consensus 28 ~~ai~~~Q~~~g----L~~TG~lD~~T~~~m~~p 57 (169)
-.|++-+|+-.+ .++.|++-..|+..+++-
T Consensus 94 ~rAa~~LQkal~~~~~v~~DGvIG~~TLaAl~~~ 127 (252)
T COG3926 94 GRAAKWLQKALGPAYTVRVDGVIGAQTLAALKKD 127 (252)
T ss_pred chHHHHHHHHhccCCcccccCcccHHHHHHHHhc
Confidence 578999999999 999999999999877654
No 141
>PF08571 Yos1: Yos1-like; InterPro: IPR013880 In yeast, Yos1 is a subunit of the Yip1p-Yif1p complex and is required for transport between the endoplasmic reticulum and the Golgi complex. Yos1 appears to be conserved in eukaryotes [].
Probab=24.97 E-value=1.2e+02 Score=20.48 Aligned_cols=41 Identities=12% Similarity=0.199 Sum_probs=22.6
Q ss_pred CCCccCCCCCCCCCCCCCCCcchhHHHHHHHHHHHHhcCCC
Q 042321 1 MGKFGYLNYDNSNNQTHANDDDFDELLESAIKNYQLNYHLK 41 (169)
Q Consensus 1 L~~fGYl~~~~~~~~~~~~~~~~~~~~~~ai~~~Q~~~gL~ 41 (169)
|.|+|+-.++....|....+......+..-|..-|.++..|
T Consensus 24 L~kiGw~~~~~~~~g~~~~~~siK~qii~LI~aVrT~mRiP 64 (80)
T PF08571_consen 24 LAKIGWSSSQNSGFGFGGDPNSIKAQIINLIRAVRTLMRIP 64 (80)
T ss_pred hccCCCCCccccccccCCCCcCHHHHHHHHHHHHHHHHHhH
Confidence 67889977665432211222334556666666666665555
No 142
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=23.08 E-value=51 Score=20.87 Aligned_cols=18 Identities=11% Similarity=0.198 Sum_probs=15.3
Q ss_pred hHHHHHHhhhhhcCCCCc
Q 042321 149 SQSVCMELIFKELGIFIN 166 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~ 166 (169)
...-++|++-+.+||.+.
T Consensus 29 ~eR~~vH~~a~~~gL~s~ 46 (60)
T cd02640 29 EERALIHQIAQKYGLKSR 46 (60)
T ss_pred HHHHHHHHHHHHcCCcee
Confidence 566889999999999864
No 143
>KOG3624 consensus M13 family peptidase [Amino acid transport and metabolism]
Probab=22.42 E-value=40 Score=31.29 Aligned_cols=13 Identities=8% Similarity=-0.171 Sum_probs=10.9
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
+-.|++|||||+.
T Consensus 518 iG~vigHEl~H~F 530 (687)
T KOG3624|consen 518 IGFVIGHELTHGF 530 (687)
T ss_pred HHHHHHHHHhhcc
Confidence 5569999999965
No 144
>KOG2090 consensus Metalloendopeptidase family - mitochondrial intermediate peptidase [Posttranslational modification, protein turnover, chaperones]
Probab=22.30 E-value=42 Score=31.23 Aligned_cols=13 Identities=0% Similarity=-0.066 Sum_probs=10.6
Q ss_pred hHHHHHHhhhhhc
Q 042321 149 SQSVCMELIFKEL 161 (169)
Q Consensus 149 ~~~va~HEiGHaL 161 (169)
-.-++-||+|||+
T Consensus 479 ~vetLFHEmGHAM 491 (704)
T KOG2090|consen 479 EVETLFHEMGHAM 491 (704)
T ss_pred HHHHHHHHHHHHH
Confidence 4558899999995
No 145
>PF08634 Pet127: Mitochondrial protein Pet127; InterPro: IPR013943 Pet127 has been implicated in mitochondrial RNA stability and/or processing and is localised to the mitochondrial membrane [].
Probab=21.93 E-value=89 Score=26.06 Aligned_cols=55 Identities=15% Similarity=0.134 Sum_probs=33.8
Q ss_pred cCCCccccCChHHhhhhhcCCccccccCCCCCCCCCcceEEEeccCCch------hhHHHHHHHHHhhcc
Q 042321 38 YHLKSTEVLDAGTVGKMRTHDAELQILGSPRWPASKFSLTYAFLPGTRG------DAINPVARAFQTWAP 101 (169)
Q Consensus 38 ~gL~~TG~lD~~T~~~m~~pD~~~~~~g~~kW~~~k~~LTy~~~~~~~~------~~~~~i~~Af~~Ws~ 101 (169)
=.||.||+.|-.|++...- ||... .. .....|.|....-. |..+.|+.||-.++=
T Consensus 154 prLPg~G~FDLKTRAv~~I---R~d~~---~~---e~~~gY~I~~~~G~~ESfEREy~DliRsa~LKYsl 214 (274)
T PF08634_consen 154 PRLPGTGVFDLKTRAVVAI---RYDSS---NY---ENGTGYQIRKRFGQFESFEREYYDLIRSAFLKYSL 214 (274)
T ss_pred CCCCCCCceeccchheeeh---hccCc---ch---hccCCeEeeeeeCchhHHHHHHHHHHHHHHHHHHH
Confidence 4689999999999976422 33211 11 22256666542221 677888888887764
No 146
>COG5192 BMS1 GTP-binding protein required for 40S ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=21.33 E-value=53 Score=30.74 Aligned_cols=22 Identities=14% Similarity=0.006 Sum_probs=17.4
Q ss_pred HhcCCCccccCChHHhhhhhcC
Q 042321 36 LNYHLKSTEVLDAGTVGKMRTH 57 (169)
Q Consensus 36 ~~~gL~~TG~lD~~T~~~m~~p 57 (169)
+|-.||+--.-|..|+..|-+=
T Consensus 775 RFQsiPvys~~DsrTRnRMlKY 796 (1077)
T COG5192 775 RFQSIPVYSMKDSRTRNRMLKY 796 (1077)
T ss_pred hhcccceeeecchhhhhhhhhc
Confidence 3445899999999999988754
No 147
>PF07236 Phytoreo_S7: Phytoreovirus S7 protein; InterPro: IPR009873 This family consists of several Phytoreovirus S7 proteins which are thought to be viral core proteins [].; GO: 0019012 virion; PDB: 1UF2_K.
Probab=21.21 E-value=32 Score=30.68 Aligned_cols=33 Identities=24% Similarity=0.204 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHhcCCCccccCChHHhhhhhcC
Q 042321 25 ELLESAIKNYQLNYHLKSTEVLDAGTVGKMRTH 57 (169)
Q Consensus 25 ~~~~~ai~~~Q~~~gL~~TG~lD~~T~~~m~~p 57 (169)
--+..||+-|-.+.....+|.+|++|+-+++.|
T Consensus 204 ~li~~a~kyF~~yv~~~~~g~~d~at~i~LN~p 236 (506)
T PF07236_consen 204 MLIMKAIKYFRNYVNNQSSGSMDQATRILLNTP 236 (506)
T ss_dssp ---------------------------------
T ss_pred HHHHHHHHHHHHhhhhhhcCCCChHHHHHhcCc
Confidence 357889999999999999999999999999999
No 148
>PF15127 DUF4565: Protein of unknown function (DUF4565)
Probab=21.13 E-value=63 Score=22.37 Aligned_cols=15 Identities=33% Similarity=0.603 Sum_probs=12.6
Q ss_pred hHHHHHHHHHhhccC
Q 042321 88 AINPVARAFQTWAPN 102 (169)
Q Consensus 88 ~~~~i~~Af~~Ws~v 102 (169)
.++++.+|++.|+.+
T Consensus 62 SqEIl~dAlqQWA~~ 76 (91)
T PF15127_consen 62 SQEILSDALQQWAEN 76 (91)
T ss_pred HHHHHHHHHHHHHHh
Confidence 468889999999874
No 149
>PF13590 DUF4136: Domain of unknown function (DUF4136)
Probab=20.91 E-value=1.8e+02 Score=20.81 Aligned_cols=41 Identities=24% Similarity=0.443 Sum_probs=25.1
Q ss_pred EEEeccCC--ch--------hhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecC
Q 042321 77 TYAFLPGT--RG--------DAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESG 125 (169)
Q Consensus 77 Ty~~~~~~--~~--------~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~ 125 (169)
||+|.... +. .++++|+++|. ..-|+++ ...||+.|.|.-.
T Consensus 19 Ty~~~~~~~~~~~~~~~~~~~i~~~v~~~L~------~~G~~~~--~~~aDl~V~~~~~ 69 (151)
T PF13590_consen 19 TYAFVPSSSDPAQSNPLDDQRIQDAVEQELA------AKGYRRV--PENADLLVSYHYS 69 (151)
T ss_pred eEEEecCCcCccccCcHHHHHHHHHHHHHHH------HCCCeec--ccCCCEEEEEEEE
Confidence 78777652 11 24556666664 3556666 2478888887654
No 150
>cd06007 R3H_DEXH_helicase R3H domain of a group of proteins which also contain a DEXH-box helicase domain, and may function as ATP-dependent DNA or RNA helicases. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=20.39 E-value=63 Score=20.40 Aligned_cols=18 Identities=11% Similarity=0.318 Sum_probs=15.4
Q ss_pred hHHHHHHhhhhhcCCCCc
Q 042321 149 SQSVCMELIFKELGIFIN 166 (169)
Q Consensus 149 ~~~va~HEiGHaLGL~h~ 166 (169)
...-++|++-+.+||.+.
T Consensus 28 ~eR~~vH~~a~~~gL~s~ 45 (59)
T cd06007 28 HERAVIHRLCRKLGLKSK 45 (59)
T ss_pred HHHHHHHHHHHHcCCCce
Confidence 566889999999999875
Done!