Query psy1890
Match_columns 242
No_of_seqs 189 out of 1300
Neff 7.7
Searched_HMMs 46136
Date Fri Aug 16 18:53:08 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1890.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1890hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd04282 ZnMc_meprin Zinc-depen 100.0 7.8E-68 1.7E-72 449.4 19.5 204 5-217 24-230 (230)
2 cd04281 ZnMc_BMP1_TLD Zinc-dep 100.0 4E-67 8.8E-72 437.9 19.8 199 17-218 2-200 (200)
3 cd04283 ZnMc_hatching_enzyme Z 100.0 1E-61 2.2E-66 399.5 17.2 177 29-217 5-182 (182)
4 PF01400 Astacin: Astacin (Pep 100.0 6.4E-62 1.4E-66 406.0 15.4 191 23-219 1-191 (191)
5 KOG3714|consensus 100.0 2.2E-60 4.7E-65 437.3 17.2 204 12-221 69-273 (411)
6 cd04280 ZnMc_astacin_like Zinc 100.0 1.5E-59 3.3E-64 388.3 17.3 180 27-215 1-180 (180)
7 cd04327 ZnMc_MMP_like_3 Zinc-d 100.0 7.9E-45 1.7E-49 304.7 13.3 170 23-216 1-198 (198)
8 smart00235 ZnMc Zinc-dependent 100.0 8E-33 1.7E-37 219.5 13.5 136 22-166 2-140 (140)
9 cd04268 ZnMc_MMP_like Zinc-dep 100.0 1E-32 2.2E-37 224.0 10.8 148 28-215 2-165 (165)
10 cd00203 ZnMc Zinc-dependent me 99.9 2.5E-22 5.5E-27 163.0 12.8 148 29-215 2-167 (167)
11 cd04278 ZnMc_MMP Zinc-dependen 99.8 1.2E-19 2.6E-24 146.7 11.3 98 24-123 1-125 (157)
12 cd04277 ZnMc_serralysin_like Z 99.8 1.8E-17 4E-22 137.4 13.7 142 37-216 30-186 (186)
13 PF00413 Peptidase_M10: Matrix 99.7 2.2E-15 4.9E-20 120.6 14.1 98 24-123 1-123 (154)
14 cd04279 ZnMc_MMP_like_1 Zinc-d 99.7 1.2E-15 2.6E-20 123.1 11.3 98 29-126 3-125 (156)
15 cd04276 ZnMc_MMP_like_2 Zinc-d 99.1 3.9E-09 8.4E-14 88.4 14.0 158 26-216 6-196 (197)
16 KOG1565|consensus 99.0 1.3E-09 2.8E-14 102.1 10.0 100 21-122 102-228 (469)
17 PF12388 Peptidase_M57: Dual-a 99.0 1.1E-08 2.5E-13 85.7 13.5 160 22-215 31-211 (211)
18 PF02031 Peptidase_M7: Strepto 98.4 4.9E-07 1.1E-11 69.6 4.6 85 29-120 3-92 (132)
19 PF13583 Reprolysin_4: Metallo 97.4 0.00021 4.5E-09 60.3 4.8 20 105-124 137-156 (206)
20 COG5549 Predicted Zn-dependent 97.4 0.00037 8.1E-09 58.2 5.9 77 41-120 104-202 (236)
21 PF13574 Reprolysin_2: Metallo 96.6 0.0018 3.9E-08 53.1 2.9 21 105-125 111-131 (173)
22 PF13582 Reprolysin_3: Metallo 96.6 0.001 2.2E-08 51.0 1.4 19 104-122 106-124 (124)
23 PF05548 Peptidase_M11: Gameto 96.5 0.0055 1.2E-07 55.0 5.7 78 41-124 88-169 (314)
24 PF05572 Peptidase_M43: Pregna 96.3 0.0016 3.5E-08 52.5 1.4 21 106-126 70-90 (154)
25 cd04267 ZnMc_ADAM_like Zinc-de 96.1 0.0032 7E-08 52.1 2.2 19 106-124 134-152 (192)
26 PF13688 Reprolysin_5: Metallo 96.0 0.0037 7.9E-08 51.8 1.8 23 105-127 142-164 (196)
27 cd04275 ZnMc_pappalysin_like Z 95.4 0.002 4.3E-08 55.2 -1.8 22 105-126 137-158 (225)
28 cd04272 ZnMc_salivary_gland_MP 95.2 0.014 2.9E-07 49.6 2.6 21 105-125 145-165 (220)
29 cd04269 ZnMc_adamalysin_II_lik 95.2 0.0087 1.9E-07 49.6 1.3 19 106-124 132-150 (194)
30 COG1913 Predicted Zn-dependent 95.0 0.0099 2.1E-07 48.5 1.1 19 107-125 126-144 (181)
31 cd04270 ZnMc_TACE_like Zinc-de 94.7 0.015 3.2E-07 50.4 1.4 17 107-123 169-185 (244)
32 cd04271 ZnMc_ADAM_fungal Zinc- 94.7 0.011 2.5E-07 50.6 0.7 18 107-124 147-164 (228)
33 PRK13267 archaemetzincin-like 94.5 0.018 3.9E-07 47.6 1.4 15 107-121 127-141 (179)
34 cd04273 ZnMc_ADAMTS_like Zinc- 94.3 0.0087 1.9E-07 50.3 -0.9 20 105-124 140-159 (207)
35 PF07998 Peptidase_M54: Peptid 94.3 0.021 4.5E-07 47.7 1.4 18 107-124 147-164 (194)
36 PF01421 Reprolysin: Reprolysi 93.7 0.03 6.6E-07 46.5 1.4 19 106-124 132-150 (199)
37 PF11350 DUF3152: Protein of u 93.0 0.038 8.3E-07 46.2 0.8 98 28-126 29-160 (203)
38 TIGR03296 M6dom_TIGR03296 M6 f 93.0 0.028 6E-07 49.8 -0.1 20 105-124 165-184 (286)
39 PF10462 Peptidase_M66: Peptid 92.8 0.046 9.9E-07 48.8 1.1 16 106-121 194-209 (305)
40 cd02641 R3H_Smubp-2_like R3H d 92.5 0.16 3.5E-06 34.2 3.2 40 95-134 18-58 (60)
41 PF12044 Metallopep: Putative 92.3 0.059 1.3E-06 50.2 1.1 20 106-125 317-336 (423)
42 PF05547 Peptidase_M6: Immune 89.6 0.14 3.1E-06 50.2 1.0 16 105-120 221-236 (645)
43 TIGR02290 M3_fam_3 oligoendope 89.6 0.16 3.5E-06 49.4 1.4 77 41-120 311-390 (587)
44 PF01457 Peptidase_M8: Leishma 89.4 0.17 3.7E-06 48.6 1.3 17 106-122 211-227 (521)
45 cd06459 M3B_Oligoendopeptidase 89.1 0.73 1.6E-05 42.5 5.3 25 96-120 213-237 (427)
46 TIGR00181 pepF oligoendopeptid 89.0 0.74 1.6E-05 44.8 5.5 75 41-119 315-392 (591)
47 PTZ00337 surface protease GP63 88.1 0.23 4.9E-06 48.2 1.2 17 105-121 229-245 (567)
48 KOG4525|consensus 87.2 0.24 5.1E-06 46.1 0.7 20 106-125 302-321 (614)
49 PTZ00257 Glycoprotein GP63 (le 86.1 0.35 7.7E-06 47.1 1.3 17 105-121 256-272 (622)
50 cd02640 R3H_NRF R3H domain of 86.0 1 2.2E-05 30.4 3.1 40 95-134 18-58 (60)
51 cd06007 R3H_DEXH_helicase R3H 85.0 1.5 3.3E-05 29.4 3.6 40 95-134 17-57 (59)
52 KOG3607|consensus 82.7 0.6 1.3E-05 46.5 1.3 18 106-123 324-341 (716)
53 cd02642 R3H_encore_like R3H do 81.8 2.4 5.3E-05 28.6 3.7 40 95-135 22-61 (63)
54 cd02646 R3H_G-patch R3H domain 80.6 2.5 5.4E-05 28.0 3.4 40 95-134 17-56 (58)
55 PF12725 DUF3810: Protein of u 80.6 0.87 1.9E-05 41.0 1.4 23 102-124 193-215 (318)
56 TIGR02289 M3_not_pepF oligoend 80.3 1.3 2.9E-05 42.7 2.7 24 97-120 329-352 (549)
57 PF09471 Peptidase_M64: IgA Pe 79.5 0.98 2.1E-05 39.6 1.4 15 106-120 217-231 (264)
58 PF04298 Zn_peptidase_2: Putat 77.6 2 4.4E-05 36.6 2.7 24 95-118 73-102 (222)
59 PF03272 Enhancin: Viral enhan 77.5 1.7 3.8E-05 43.8 2.6 134 11-151 114-277 (775)
60 COG2856 Predicted Zn peptidase 77.4 1.2 2.6E-05 37.9 1.2 16 107-124 74-89 (213)
61 PF13398 Peptidase_M50B: Pepti 76.2 1.4 2.9E-05 37.0 1.3 22 106-127 23-44 (200)
62 PF06262 DUF1025: Possibl zinc 74.9 1.6 3.4E-05 32.4 1.1 19 106-124 74-92 (97)
63 PF06114 DUF955: Domain of unk 73.9 1.8 4E-05 31.8 1.4 23 95-117 29-54 (122)
64 PF01863 DUF45: Protein of unk 73.7 9.4 0.0002 31.4 5.8 19 105-123 164-182 (205)
65 PF14247 DUF4344: Domain of un 71.6 2.1 4.6E-05 36.5 1.3 19 106-126 93-111 (220)
66 COG3824 Predicted Zn-dependent 68.3 2.2 4.7E-05 32.9 0.6 17 106-122 110-126 (136)
67 PF01432 Peptidase_M3: Peptida 68.3 2.5 5.4E-05 39.7 1.2 21 105-125 242-262 (458)
68 PF11667 DUF3267: Protein of u 67.4 5 0.00011 30.1 2.5 20 107-126 6-25 (111)
69 COG1164 Oligoendopeptidase F [ 66.0 3.1 6.8E-05 40.7 1.4 78 41-120 316-395 (598)
70 PF02163 Peptidase_M50: Peptid 65.9 3.2 7E-05 33.9 1.3 23 106-128 8-30 (192)
71 cd06258 Peptidase_M3_like The 64.5 3.3 7.1E-05 37.5 1.2 14 105-118 154-167 (365)
72 TIGR00043 metalloprotein, YbeY 62.9 4.5 9.7E-05 30.6 1.5 18 107-124 75-92 (110)
73 COG5549 Predicted Zn-dependent 62.4 5.6 0.00012 33.6 2.1 76 107-218 155-235 (236)
74 cd06455 M3A_TOP Peptidase M3 T 61.8 4.1 8.8E-05 38.6 1.3 14 105-118 263-276 (472)
75 PF15639 Tox-MPTase3: Metallop 61.2 19 0.00041 28.1 4.6 69 43-115 22-110 (135)
76 cd05709 S2P-M50 Site-2 proteas 61.0 4.5 9.7E-05 32.8 1.3 22 106-127 9-30 (180)
77 PHA02456 zinc metallopeptidase 61.0 3.9 8.4E-05 31.1 0.8 37 80-120 58-94 (141)
78 COG4783 Putative Zn-dependent 59.9 4.5 9.8E-05 38.2 1.2 15 105-119 130-144 (484)
79 PF01447 Peptidase_M4: Thermol 59.6 5.2 0.00011 32.0 1.4 12 106-117 136-147 (150)
80 cd06457 M3A_MIP Peptidase M3 m 59.4 4.7 0.0001 38.1 1.3 18 106-123 249-266 (458)
81 COG4412 Uncharacterized protei 58.2 4.1 8.9E-05 39.1 0.6 24 39-62 234-257 (760)
82 PF02130 UPF0054: Uncharacteri 58.1 6 0.00013 31.4 1.5 16 107-122 110-125 (145)
83 cd06163 S2P-M50_PDZ_RseP-like 55.2 6.6 0.00014 32.4 1.3 22 106-127 10-31 (182)
84 COG0319 Predicted metal-depend 54.8 7.3 0.00016 31.4 1.5 18 107-124 110-127 (153)
85 cd06456 M3A_DCP_Oligopeptidase 54.6 6.4 0.00014 36.8 1.3 14 105-118 208-221 (422)
86 KOG3658|consensus 54.5 4 8.8E-05 40.1 -0.1 18 106-123 393-410 (764)
87 PF13485 Peptidase_MA_2: Pepti 54.1 7.3 0.00016 28.7 1.3 12 106-117 26-37 (128)
88 COG2738 Predicted Zn-dependent 53.7 6.5 0.00014 32.9 1.0 24 95-118 76-105 (226)
89 PRK00016 metal-binding heat sh 52.6 7.6 0.00017 31.3 1.3 17 107-123 115-131 (159)
90 cd06162 S2P-M50_PDZ_SREBP Ster 52.4 7.5 0.00016 34.4 1.3 22 106-127 136-157 (277)
91 cd06460 M32_Taq Peptidase fami 52.3 7.9 0.00017 36.0 1.5 28 96-126 148-177 (396)
92 cd06159 S2P-M50_PDZ_Arch Uncha 52.0 7.6 0.00017 34.0 1.3 23 106-128 119-141 (263)
93 cd06006 R3H_unknown_2 R3H doma 51.6 19 0.00041 24.1 2.9 38 97-134 20-57 (59)
94 PF01435 Peptidase_M48: Peptid 49.8 9 0.00019 31.8 1.3 12 106-117 90-101 (226)
95 PRK13963 unkown domain/putativ 49.0 9.9 0.00021 33.1 1.5 16 107-122 217-232 (258)
96 cd06158 S2P-M50_like_1 Unchara 48.4 9.6 0.00021 31.3 1.3 13 106-118 10-22 (181)
97 cd06164 S2P-M50_SpoIVFB_CBS Sp 47.6 9.9 0.00021 32.5 1.3 23 106-128 54-76 (227)
98 smart00731 SprT SprT homologue 46.8 11 0.00024 29.7 1.3 15 106-120 60-74 (146)
99 PF13699 DUF4157: Domain of un 46.7 12 0.00025 26.6 1.3 14 105-118 61-74 (79)
100 KOG2921|consensus 46.3 10 0.00022 35.2 1.1 34 106-142 132-165 (484)
101 cd02325 R3H R3H domain. The na 46.2 32 0.00068 21.6 3.3 40 95-134 18-57 (59)
102 PF10263 SprT-like: SprT-like 46.0 11 0.00025 29.6 1.3 17 105-121 60-76 (157)
103 cd02639 R3H_RRM R3H domain of 45.8 21 0.00045 24.0 2.4 29 95-123 18-47 (60)
104 PRK10911 oligopeptidase A; Pro 45.4 10 0.00022 37.8 1.2 12 106-117 464-475 (680)
105 PRK02870 heat shock protein Ht 45.2 8.8 0.00019 34.9 0.6 12 105-116 173-184 (336)
106 PF12315 DUF3633: Protein of u 43.1 13 0.00029 31.4 1.3 17 105-121 93-109 (212)
107 PF14521 Aspzincin_M35: Lysine 41.7 12 0.00026 29.7 0.8 14 105-118 96-111 (148)
108 KOG3538|consensus 40.9 8.4 0.00018 39.3 -0.2 23 105-127 317-339 (845)
109 cd06160 S2P-M50_like_2 Unchara 40.9 15 0.00033 30.3 1.3 24 106-129 42-65 (183)
110 PRK03982 heat shock protein Ht 40.8 14 0.0003 32.6 1.2 13 105-117 125-137 (288)
111 PRK10280 dipeptidyl carboxypep 40.0 15 0.00032 36.7 1.3 13 106-118 465-477 (681)
112 COG0501 HtpX Zn-dependent prot 40.0 15 0.00032 32.1 1.2 12 106-117 158-169 (302)
113 PRK03001 M48 family peptidase; 39.0 16 0.00034 32.2 1.2 13 105-117 124-136 (283)
114 cd06161 S2P-M50_SpoIVFB SpoIVF 38.7 16 0.00035 30.6 1.2 24 105-128 38-61 (208)
115 PRK03072 heat shock protein Ht 38.2 16 0.00035 32.3 1.2 13 105-117 127-139 (288)
116 smart00393 R3H Putative single 38.1 44 0.00096 23.3 3.2 40 96-135 38-77 (79)
117 PRK01345 heat shock protein Ht 37.5 17 0.00037 32.7 1.2 13 105-117 124-136 (317)
118 PF04228 Zn_peptidase: Putativ 37.1 18 0.00039 32.2 1.3 10 107-116 172-181 (292)
119 PRK05457 heat shock protein Ht 36.9 17 0.00036 32.2 1.1 12 105-116 134-145 (284)
120 PF06167 Peptidase_M90: Glucos 35.1 22 0.00047 31.0 1.4 17 105-121 154-170 (253)
121 PRK04897 heat shock protein Ht 34.7 19 0.00041 32.0 1.1 12 106-117 138-149 (298)
122 PF14891 Peptidase_M91: Effect 33.3 21 0.00045 29.0 1.0 19 106-124 104-122 (174)
123 PRK02391 heat shock protein Ht 33.0 22 0.00048 31.6 1.2 12 105-116 133-144 (296)
124 PRK01265 heat shock protein Ht 32.4 22 0.00047 32.2 1.1 11 105-115 140-150 (324)
125 KOG2556|consensus 31.6 23 0.00049 33.7 1.0 21 106-126 256-276 (666)
126 PLN02887 hydrolase family prot 30.5 27 0.00059 34.1 1.5 19 107-125 232-250 (580)
127 TIGR00054 RIP metalloprotease 30.5 25 0.00055 32.8 1.2 12 106-117 15-26 (420)
128 PLN02791 Nudix hydrolase homol 30.4 20 0.00044 36.2 0.6 21 106-126 586-606 (770)
129 PF01424 R3H: R3H domain; Int 30.2 70 0.0015 21.0 3.1 40 96-135 22-61 (63)
130 PF01431 Peptidase_M13: Peptid 29.9 25 0.00054 29.0 0.9 14 107-120 38-51 (206)
131 PRK04351 hypothetical protein; 29.5 30 0.00065 27.6 1.3 16 105-120 61-76 (149)
132 PF02102 Peptidase_M35: Deuter 29.4 29 0.00062 31.9 1.3 20 101-120 290-312 (359)
133 PF13058 DUF3920: Protein of u 29.1 31 0.00066 26.3 1.2 31 35-65 3-34 (126)
134 COG0339 Dcp Zn-dependent oligo 28.0 28 0.00062 34.5 1.1 24 43-66 364-387 (683)
135 PF13203 DUF2201_N: Putative m 27.8 34 0.00074 30.0 1.5 21 106-127 61-81 (292)
136 PRK10779 zinc metallopeptidase 27.1 32 0.0007 32.4 1.3 10 106-115 16-25 (449)
137 PRK04860 hypothetical protein; 26.6 36 0.00079 27.5 1.3 17 105-121 63-79 (160)
138 PRK15410 DgsA anti-repressor M 25.2 35 0.00077 29.9 1.1 17 105-121 142-158 (260)
139 cd06461 M2_ACE Peptidase famil 24.8 34 0.00074 32.7 1.0 17 105-121 247-263 (477)
140 KOG2090|consensus 24.5 39 0.00084 33.2 1.3 12 106-117 480-491 (704)
141 COG2317 Zn-dependent carboxype 24.0 35 0.00077 32.4 0.9 10 108-117 262-271 (497)
142 PF09768 Peptidase_M76: Peptid 23.6 44 0.00094 27.4 1.3 25 106-140 72-96 (173)
143 KOG3624|consensus 23.3 43 0.00093 33.3 1.4 29 107-137 520-548 (687)
144 PF10026 DUF2268: Predicted Zn 23.2 3E+02 0.0066 22.6 6.3 75 41-123 4-83 (195)
145 PF10023 DUF2265: Predicted am 23.1 37 0.00081 30.9 0.8 15 106-120 166-180 (337)
146 PF01434 Peptidase_M41: Peptid 23.0 46 0.001 27.9 1.4 13 106-118 29-41 (213)
147 KOG3314|consensus 22.7 49 0.0011 27.1 1.3 25 106-140 92-116 (194)
148 KOG0201|consensus 22.2 39 0.00085 31.9 0.8 30 106-139 113-142 (467)
149 PF04450 BSP: Peptidase of pla 21.7 49 0.0011 27.8 1.2 16 106-121 97-112 (205)
150 COG3590 PepO Predicted metallo 20.7 29 0.00062 33.8 -0.4 21 107-129 489-509 (654)
151 PF10460 Peptidase_M30: Peptid 20.6 53 0.0011 30.3 1.3 16 106-121 140-155 (366)
No 1
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=100.00 E-value=7.8e-68 Score=449.45 Aligned_cols=204 Identities=37% Similarity=0.621 Sum_probs=191.6
Q ss_pred cCCCccCC--CCcccccCCCCCCCCCeEEEEECCCCCHHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCCCCC
Q psy1890 5 VDGFRDHD--GVKNLAKTPAKLWKDGLIYYEIDKNFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPNGKS 82 (242)
Q Consensus 5 ~~g~~~~~--~~R~~~~~~~~~Wp~~~I~Y~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~~~g 82 (242)
++||+.++ +.|+|+.....+||+ +|||.|+++|+...++.|++||++|++.|||+|+|++. +.+||.|.. +.|
T Consensus 24 ~eGDI~l~~~~~R~a~~~~~~~Wp~-~VPY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~rt~-e~~yi~i~~---~~G 98 (230)
T cd04282 24 FEGDILLDEGQSRNGLIGDTYRWPF-PIPYILDDSLDLNAKGVILKAFEMYRLKSCVDFKPYEG-ESNYIFFFK---GSG 98 (230)
T ss_pred ccccccCCcccccccccCcccCCCc-ceeEEECCCCCHHHHHHHHHHHHHHHhCCCeeEEECCC-CCcEEEEEc---CCC
Confidence 78999987 689999999999999 89999999999999999999999999999999999995 889999985 589
Q ss_pred CCCccCCCCCccceEEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcC
Q psy1890 83 CSSNVGFTGKKVSYVNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPY 162 (242)
Q Consensus 83 c~S~vG~~~g~~q~i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~y 162 (242)
|||+||+.+|. |.|+|+.+|...|+|+|||||||||+|||+|||||+||.|+|+||.++..+||+|++...+.++|+||
T Consensus 99 C~S~vG~~gg~-q~isl~~~C~~~Gti~HEl~HalGf~HEqsRpDRD~yV~I~~~nI~~~~~~nF~k~~~~~~~~~g~pY 177 (230)
T cd04282 99 CWSMVGDQQGG-QNLSIGAGCDYKATVEHEFLHALGFYHEQSRSDRDDYVKIWWDQILSGREHNFNKYDDSFSTDLNTPY 177 (230)
T ss_pred eeeccCccCCe-EEEEECCCcCCCchHHHHHHHHhCCcccccccccccceEEeecccCchHHHHhhhcCccccccCCCCC
Confidence 99999999888 99999999999999999999999999999999999999999999999999999999888888899999
Q ss_pred CCCcccccCCCccccCCCCCeeeeccCCCcccc-ccCCcCcccHHHHHHHHhHCCC
Q psy1890 163 DFGSIMHYSRFAFSRDSKSPTITPKKTLNVQAD-KMGQRINISKMDIAKLNTLYKC 217 (242)
Q Consensus 163 D~~SIMhY~~~~f~~~g~~~ti~~~~~~~~~~~-~ig~r~~lS~~Di~~in~lY~C 217 (242)
||+|||||++.+|++++..|||+|+++ ... .||||.+||+.|+++||+||+|
T Consensus 178 Dy~SIMHY~~~aFs~~~~~pTi~~~~~---~~~~~iGqr~~lS~~Di~~iN~~Y~C 230 (230)
T cd04282 178 DYESVMHYSPFSFNKGASEPTITTKIP---EFNDIIGQRLDFSDIDLERLNRMYNC 230 (230)
T ss_pred CcccccccCCCccccCCCCceeeecCC---cccccccccCCCCHHHHHHHHHhcCC
Confidence 999999999999999844499999987 443 6999999999999999999999
No 2
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=100.00 E-value=4e-67 Score=437.85 Aligned_cols=199 Identities=42% Similarity=0.700 Sum_probs=181.4
Q ss_pred cccCCCCCCCCCeEEEEECCCCCHHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCCCCCCCCccCCCCCccce
Q psy1890 17 LAKTPAKLWKDGLIYYEIDKNFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPNGKSCSSNVGFTGKKVSY 96 (242)
Q Consensus 17 ~~~~~~~~Wp~~~I~Y~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~~~gc~S~vG~~~g~~q~ 96 (242)
|......+||+++|||.|+++|+...++.|++||++|++.|||+|+|++. +.+||.|... +.||||+||+.++..|.
T Consensus 2 ~~~~~~~~Wp~~~VpY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~r~~-~~~yi~f~~~--~~GC~S~vG~~~~g~q~ 78 (200)
T cd04281 2 ATARKERIWPGGVIPYVIDGNFTGSQRAMFKQAMRHWENFTCVTFVERTP-EENYIVFTYR--PCGCCSYVGRRGNGPQA 78 (200)
T ss_pred CccCccCcCCCCEEEEEECCCCCHHHHHHHHHHHHHHHhCCceEEEECCC-CCCEEEEEEC--CCCeeEcCCCcCCCcee
Confidence 45567789999999999999999999999999999999999999999995 8899999865 57999999999733399
Q ss_pred EEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCcccccCCCccc
Q psy1890 97 VNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGSIMHYSRFAFS 176 (242)
Q Consensus 97 i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~SIMhY~~~~f~ 176 (242)
|+|+.+|...|+|+|||||||||+|||+|||||+||.|+|+||.++..+||.|++...+.++|+||||+|||||++.+|+
T Consensus 79 isl~~~C~~~Gti~HEl~HaLGf~HEhsRpDRD~yV~I~~~nI~~~~~~nF~k~~~~~~~~~g~pYDy~SiMHY~~~aFs 158 (200)
T cd04281 79 ISIGKNCDKFGIVVHELGHVIGFWHEHTRPDRDDHVTIIRENIQPGQEYNFLKMEPEEVDSLGEPYDFDSIMHYARNTFS 158 (200)
T ss_pred eecCCCcCcCchHHHHHHHHhcCcchhccccccceEEEeecccCcchhhhhhhcCccccccCCCccCCCccccCCCCccc
Confidence 99999999999999999999999999999999999999999999999999999988888889999999999999999999
Q ss_pred cCCCCCeeeeccCCCccccccCCcCcccHHHHHHHHhHCCCC
Q psy1890 177 RDSKSPTITPKKTLNVQADKMGQRINISKMDIAKLNTLYKCP 218 (242)
Q Consensus 177 ~~g~~~ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY~C~ 218 (242)
+++..+||+|+++.+.-...||||.+||+.|+++||+||+||
T Consensus 159 ~~~~~~Ti~p~~~~~~~~~~iGqr~~lS~~Di~~iN~~Y~C~ 200 (200)
T cd04281 159 RGMFLDTILPKRDPNGVRPEIGQRTRLSEGDIIQANKLYKCP 200 (200)
T ss_pred cCCCCCceEECCCcccccccccccCCCCHHHHHHHHHhcCCC
Confidence 988779999987621101369999999999999999999997
No 3
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=100.00 E-value=1e-61 Score=399.55 Aligned_cols=177 Identities=41% Similarity=0.750 Sum_probs=165.0
Q ss_pred eEEEEECCCCCHHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCCCCCCCCccCCCCCccceEEeC-CCCcccc
Q psy1890 29 LIYYEIDKNFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPNGKSCSSNVGFTGKKVSYVNLG-KNCFVMG 107 (242)
Q Consensus 29 ~I~Y~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~~~gc~S~vG~~~g~~q~i~l~-~~c~~~~ 107 (242)
+|||.|+++|++.+++.|++||++|++.|||+|++++. +.+||.|.. +.||||+||+.++. |.|+|+ .+|...|
T Consensus 5 ~IpY~i~~~~~~~~~~~I~~A~~~~~~~TCirF~~~~~-~~~yi~~~~---~~gC~S~vG~~gg~-q~i~l~~~~C~~~G 79 (182)
T cd04283 5 YVPYVISPQYSENERAVIEKAMQEFETLTCVRFVPRTT-ERDYLNIES---RSGCWSYIGRQGGR-QTVSLQKQGCMYKG 79 (182)
T ss_pred EEEEEECCCCCHHHHHHHHHHHHHHHhCCceeeEECCC-CCcEEEEEc---CCCceEecCccCCc-eeEecCCCCcCccc
Confidence 69999999999999999999999999999999999984 889999975 58999999999888 999997 6899999
Q ss_pred hhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCcccccCCCccccCCCCCeeeec
Q psy1890 108 TVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGSIMHYSRFAFSRDSKSPTITPK 187 (242)
Q Consensus 108 ~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~SIMhY~~~~f~~~g~~~ti~~~ 187 (242)
+|+|||||||||+|||+|||||+||.|+|+||.++..+||++++. .++++||||+|||||++.+|++++. +||+|+
T Consensus 80 ~i~HEl~HaLG~~HEhsRpDRD~yV~I~~~nI~~~~~~nF~~~~~---~~~~~pYDy~SiMHY~~~afs~~g~-~Ti~~~ 155 (182)
T cd04283 80 IIQHELLHALGFYHEQTRSDRDKYVRINWENIIPDQLYNFDKQDT---NNLGTPYDYSSVMHYGRYAFSINGK-PTIVPI 155 (182)
T ss_pred hHHHHHHHHhCCcccccccccCceEEEehhhcCchHHhhhhhccc---cccCCcccceeeeccCCcccccCCC-CeEEEC
Confidence 999999999999999999999999999999999999999999853 4578999999999999999999887 999999
Q ss_pred cCCCccccccCCcCcccHHHHHHHHhHCCC
Q psy1890 188 KTLNVQADKMGQRINISKMDIAKLNTLYKC 217 (242)
Q Consensus 188 ~~~~~~~~~ig~r~~lS~~Di~~in~lY~C 217 (242)
++ ....||||.+||+.|+++||+||+|
T Consensus 156 ~~---~~~~iGqr~~lS~~Di~~iN~~Y~C 182 (182)
T cd04283 156 PD---PNVPIGQRQGMSNLDILRINKLYNC 182 (182)
T ss_pred Cc---ccccccCCCCCCHHHHHHHHHhcCC
Confidence 76 3347999999999999999999998
No 4
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=100.00 E-value=6.4e-62 Score=405.98 Aligned_cols=191 Identities=45% Similarity=0.827 Sum_probs=148.2
Q ss_pred CCCCCCeEEEEECCCCCHHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCCCCCCCCccCCCCCccceEEeCCC
Q psy1890 23 KLWKDGLIYYEIDKNFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPNGKSCSSNVGFTGKKVSYVNLGKN 102 (242)
Q Consensus 23 ~~Wp~~~I~Y~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~~~gc~S~vG~~~g~~q~i~l~~~ 102 (242)
++||+++|||.|++++++.+++.|++||++|++.|||+|++++..+..+|.|.. +.||||+||+.++. |.|+|+.+
T Consensus 1 ~~Wp~~~IpY~~~~~~~~~~~~~I~~Am~~~e~~TcI~F~~~~~~~~~~i~~~~---~~gC~S~vG~~~g~-q~i~l~~~ 76 (191)
T PF01400_consen 1 KKWPNGTIPYYIDPSFSSSQRQRIRKAMDEWEKNTCIRFVERTENEDDYISFSN---GSGCWSYVGRQGGE-QTINLGDG 76 (191)
T ss_dssp -S-GGGEEEEEEETTS-HHHHHHHHHHHHHHHHHSSEEEEE-SSSSSSEEEEES---SSSEEEESS--SSE-EEEEE-TT
T ss_pred CcCCCCEEEEEECCCCCHHHHHHHHHHHHHHHhCCCEEEEECCCCCceEEEeec---CccccchhhhcCcc-eeEEecce
Confidence 589999999999999999999999999999999999999999944678899875 58999999999777 99999999
Q ss_pred CcccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCcccccCCCccccCCCCC
Q psy1890 103 CFVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGSIMHYSRFAFSRDSKSP 182 (242)
Q Consensus 103 c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~SIMhY~~~~f~~~g~~~ 182 (242)
|...++|+|||||||||.|||+|||||+||.|+|+||.++...||.+.+...+.++++||||+|||||++.+|++++..+
T Consensus 77 c~~~~~i~HEl~HaLG~~HEh~RpDRd~yi~i~~~~i~~~~~~~f~~~~~~~~~~~~~pYD~~SIMHY~~~~~s~~~~~~ 156 (191)
T PF01400_consen 77 CFSVGTILHELGHALGFWHEHQRPDRDNYITINWDNIQPGYKHNFDKYPPSEWSTYGSPYDYGSIMHYSSYAFSKNGSQP 156 (191)
T ss_dssp C-SHHHHHHHHHHHHTB--GGGSTTGGGTEEE-GGGB-TTSGGGG---SGGGBE-TTS---TT-TT---TTTTBSSTTS-
T ss_pred eCCccchHHHHHHHHhhhhhhhccccccEEEEehhcchhhhhhhhhccccccccccCCCcCccCeecccCcccccCCCCC
Confidence 99999999999999999999999999999999999999999999999988788889999999999999999999998679
Q ss_pred eeeeccCCCccccccCCcCcccHHHHHHHHhHCCCCC
Q psy1890 183 TITPKKTLNVQADKMGQRINISKMDIAKLNTLYKCPN 219 (242)
Q Consensus 183 ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY~C~~ 219 (242)
||.|+++. ....||++.+||+.|+++||++|+|+.
T Consensus 157 ti~~~~~~--~~~~iG~~~~lS~~D~~~in~~Y~C~~ 191 (191)
T PF01400_consen 157 TITPKDPK--YQETIGQRNRLSFTDIKQINKMYNCPE 191 (191)
T ss_dssp SEEESSTT--CTS-GGG-SS--HHHHHHHHHHTT---
T ss_pred eEEecCCC--CcccccccCCCCHHHHHHHHHhcCCCC
Confidence 99999862 123799999999999999999999973
No 5
>KOG3714|consensus
Probab=100.00 E-value=2.2e-60 Score=437.26 Aligned_cols=204 Identities=41% Similarity=0.768 Sum_probs=190.2
Q ss_pred CCCcccccCCCCCCCCCeEEEEECCCCCHHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCCCCCCCCccCCCC
Q psy1890 12 DGVKNLAKTPAKLWKDGLIYYEIDKNFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPNGKSCSSNVGFTG 91 (242)
Q Consensus 12 ~~~R~~~~~~~~~Wp~~~I~Y~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~~~gc~S~vG~~~ 91 (242)
.+.|+++....++||+++|||.|+++|+...|+.|+.||++|++.|||+|+|.++.+.+++.|... .||||+||+.+
T Consensus 69 ~~~r~~~~~~~~~Wp~~~ipY~i~~~~~~~~r~~i~~A~~~~~~~TCi~F~~~~~~~~~~~~~~~~---~gC~S~VGr~g 145 (411)
T KOG3714|consen 69 RSRRNGTSNPERRWPNGVIPYYIDGSFTSSQRALIRQAMREIENHTCIRFVERTTPDKDYLIVFTG---GGCYSYVGRRG 145 (411)
T ss_pred hhhhhcccChhhcCCCCeeeeEECCCCCHHHHHHHHHHHHHHhhCcceEEEeCCCCCcceEEEeCC---CcceeeeCccC
Confidence 466888889999999999999999999999999999999999999999999998656788888765 49999999999
Q ss_pred CccceEEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCcccccC
Q psy1890 92 KKVSYVNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGSIMHYS 171 (242)
Q Consensus 92 g~~q~i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~SIMhY~ 171 (242)
+..|.++|+.+|...|+|+|||||||||+|||+|||||+||.|+|+||.++..+||.|++...+.++++||||+|||||.
T Consensus 146 g~~q~~sl~~~C~~~G~i~HEl~HaLGf~HehsR~DRD~yV~I~~~ni~~~~~~nF~k~~~~~~~~~~~pYDygSvMHY~ 225 (411)
T KOG3714|consen 146 GGQQLLSLGDGCDRFGTIVHELMHALGFWHEHSRPDRDNYVSINWDNIDPGQEYNFEKYSPDEVTTYGVPYDYGSVMHYA 225 (411)
T ss_pred CCccceecCCCcCcCchhHHHHHHHhhhhhccCcccccCceEEeeccCChhhhhhhhhcChhhhhccCCcccCCcccccC
Confidence 86699999999999999999999999999999999999999999999999999999999988888999999999999999
Q ss_pred CCccccCCCCCeeeeccCCCcccc-ccCCcCcccHHHHHHHHhHCCCCCCC
Q psy1890 172 RFAFSRDSKSPTITPKKTLNVQAD-KMGQRINISKMDIAKLNTLYKCPNKY 221 (242)
Q Consensus 172 ~~~f~~~g~~~ti~~~~~~~~~~~-~ig~r~~lS~~Di~~in~lY~C~~~~ 221 (242)
+.+|++++..+||+|++. ..+ .||||..||+.|+.+||+||.|+..+
T Consensus 226 ~~afs~~~~~~ti~~~~~---~~~~~mGqr~~~S~~Di~~iN~~Y~C~~~~ 273 (411)
T KOG3714|consen 226 PYAFSKNGSLPTIVPKDN---GFQNTMGQRERLSFYDIRKINKLYCCPECC 273 (411)
T ss_pred CcccCcCCCCCceecccc---cccccccccCcCCHHHHHHHHhhcCCCccC
Confidence 999999999889999987 444 59999999999999999999997654
No 6
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=100.00 E-value=1.5e-59 Score=388.28 Aligned_cols=180 Identities=44% Similarity=0.792 Sum_probs=169.3
Q ss_pred CCeEEEEECCCCCHHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCCCCCCCCccCCCCCccceEEeCCCCccc
Q psy1890 27 DGLIYYEIDKNFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPNGKSCSSNVGFTGKKVSYVNLGKNCFVM 106 (242)
Q Consensus 27 ~~~I~Y~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~~~gc~S~vG~~~g~~q~i~l~~~c~~~ 106 (242)
+++|||.|+++|++.+++.|++||++|++.|||+|+|++. +.++|.|... .||||+||+.++. |.|+|+.+|...
T Consensus 1 ~~~VpY~i~~~~~~~~~~~I~~A~~~w~~~TcIrF~~~~~-~~~~I~f~~~---~Gc~S~vG~~~~~-q~i~l~~~c~~~ 75 (180)
T cd04280 1 NGTVPYVIDGSFDESDRSLILRAMREIESNTCIRFVPRTT-EKDYIRIVKG---SGCWSYVGRVGGR-QVVSLGSGCFSL 75 (180)
T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHHHhCCcceEEECCC-CCcEEEEEcC---CCcceecCccCCc-eeEEeCCCcCcC
Confidence 4789999999999999999999999999999999999996 7899999864 8999999999877 999999999999
Q ss_pred chhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCcccccCCCccccCCCCCeeee
Q psy1890 107 GTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGSIMHYSRFAFSRDSKSPTITP 186 (242)
Q Consensus 107 ~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~SIMhY~~~~f~~~g~~~ti~~ 186 (242)
++++|||||||||.|||+|||||+||.|+|+||.++...||++++...+..+++||||+|||||++.+|++++. +||+|
T Consensus 76 g~v~HE~~HalG~~HEh~R~DRD~yv~i~~~ni~~~~~~nf~~~~~~~~~~~~~~YDy~SiMhY~~~~fs~~~~-~ti~~ 154 (180)
T cd04280 76 GTIVHELMHALGFYHEQSRPDRDDYVTINWENIQPGYEHNFDKYSPDTVTTYGVPYDYGSVMHYGPTAFSKNGK-PTIVP 154 (180)
T ss_pred chhHHHHHHHhcCcchhcccccCCeEEEeecccChhHHHHhhhcCcccccccCCcCCcceEeccCCcccccCCC-ceEEE
Confidence 99999999999999999999999999999999999999999999888877889999999999999999999987 99999
Q ss_pred ccCCCccccccCCcCcccHHHHHHHHhHC
Q psy1890 187 KKTLNVQADKMGQRINISKMDIAKLNTLY 215 (242)
Q Consensus 187 ~~~~~~~~~~ig~r~~lS~~Di~~in~lY 215 (242)
+++ ....+||+.+||+.|+++||+||
T Consensus 155 ~~~---~~~~~g~~~~~S~~D~~~in~~Y 180 (180)
T cd04280 155 KDP---GYQIIGQREGLSFLDIKKINKMY 180 (180)
T ss_pred CCc---hhhcccCCCCCCHHHHHHHHHhC
Confidence 987 44458999999999999999999
No 7
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=100.00 E-value=7.9e-45 Score=304.73 Aligned_cols=170 Identities=26% Similarity=0.321 Sum_probs=135.8
Q ss_pred CCCCCCe-EEEEECCCCCHHHHHHHHHHHHHHhccccEEEEecCCCCC-CeEEEEccCCCCCCCCccCCCCC----ccce
Q psy1890 23 KLWKDGL-IYYEIDKNFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQT-PYIVYKESPNGKSCSSNVGFTGK----KVSY 96 (242)
Q Consensus 23 ~~Wp~~~-I~Y~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~-~~I~i~~~~~~~gc~S~vG~~~g----~~q~ 96 (242)
++||+++ ++|.|.+++++.+|+.|++||++|++.|||+|++++..+. .+|.|.. +.||||+||+.+. ..++
T Consensus 1 ~~W~~~~~~~~~f~~~~~~~~r~~I~~A~~~W~~~t~i~F~~~~~~~adi~I~f~~---~~Gc~S~vG~~~~~~~~~~~t 77 (198)
T cd04327 1 KLWRNGTVLRIAFLGGPDAFLKDKVRAAAREWLPYANLKFKFVTDADADIRISFTP---GDGYWSYVGTDALLIGADAPT 77 (198)
T ss_pred CCCCCCCeEEEEeCCCCcHHHHHHHHHHHHHHhhhcCeEEEECCCCCCCEEEEEec---CCCCCCCcCCcccccCCCCce
Confidence 5899986 4577888788899999999999999999999999985222 2355543 4799999999642 2378
Q ss_pred EEeCCC------CcccchhHHHHHHHhcccccccCCCCCceEEEeee--cccc-----------c-cc-ccccccCcccc
Q psy1890 97 VNLGKN------CFVMGTVQHETLHALGFWHEQARPDRDRFVDIHYE--LIKK-----------G-SE-INFKKRNFKEA 155 (242)
Q Consensus 97 i~l~~~------c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~--ni~~-----------~-~~-~nf~k~~~~~~ 155 (242)
|+|+.. |...++|+|||||||||.|||+||||| |.|+ +|.+ + .. .+|.++.. .
T Consensus 78 ~~l~~~~~~~~~~~~~~~i~HElgHaLG~~HEh~rpdrd----i~w~~~~i~~~~~~~~~~~~~~~~~~~~f~~~~~--~ 151 (198)
T cd04327 78 MNLGWFTDDTPDPEFSRVVLHEFGHALGFIHEHQSPAAN----IPWDKEAVYAYFSGPPNWDRETVINHNVFAKLDD--G 151 (198)
T ss_pred eeeeeecCCCchhhHHHHHHHHHHHHhcCcccccCCCCC----CCcCHHHHHHHHhcCCCCcHHHHHHhhhhhcccc--c
Confidence 888653 455799999999999999999999999 7887 3432 1 22 67888764 3
Q ss_pred ccCCCcCCCCcccccC-CCccccCCCCCeeeeccCCCccccccCCcCcccHHHHHHHHhHCC
Q psy1890 156 TTLDLPYDFGSIMHYS-RFAFSRDSKSPTITPKKTLNVQADKMGQRINISKMDIAKLNTLYK 216 (242)
Q Consensus 156 ~~~g~~yD~~SIMhY~-~~~f~~~g~~~ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY~ 216 (242)
..+++||||+|||||+ +.+|+++|. + .|++.+||+.|+++||+||+
T Consensus 152 ~~~~~~yD~~SIMHY~~~~~~t~~g~-~--------------t~~~~~lS~~D~~~i~~~Yp 198 (198)
T cd04327 152 DVAYSPYDPDSIMHYPFPGSLTLDGE-E--------------VPPNRTLSDKDKAFMRLLYP 198 (198)
T ss_pred cccCCCCCcHHHHcCCCcHHhhcCCc-c--------------cCCCCCCCHHHHHHHHHhCc
Confidence 4578999999999999 789988876 3 25678899999999999995
No 8
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=100.00 E-value=8e-33 Score=219.46 Aligned_cols=136 Identities=37% Similarity=0.710 Sum_probs=116.8
Q ss_pred CCCCCCCeEEEEEC-CCCCHHH-HHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCCCCCCC-CccCCCCCccceEE
Q psy1890 22 AKLWKDGLIYYEID-KNFSVDR-KRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPNGKSCS-SNVGFTGKKVSYVN 98 (242)
Q Consensus 22 ~~~Wp~~~I~Y~~~-~~~~~~~-~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~~~gc~-S~vG~~~g~~q~i~ 98 (242)
.++||+++|+|+|+ +.++..+ ++.|++||+.|++.+||+|++++.+...+|.|.... ..||| |++|+.++. +.++
T Consensus 2 ~~~W~~~~v~Y~i~~~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~~ad~~I~f~~~~-~~g~~~a~~g~~~g~-~~~~ 79 (140)
T smart00235 2 SKKWPKGTVPYVIDSSSLSPEEVREAIARAFAEWSDVTCLRFVERTSTADIYISFGKGD-GSGCTLSHAGRPGGD-QHFS 79 (140)
T ss_pred CCcCCCCcEEEEEcCCCCCHHHHHHHHHHHHHHHhcCCeeEEEECCCCCCeEEEEEECC-CCCcceeeeecCCCc-eEEE
Confidence 46899999999997 8888876 999999999999999999999885455678887651 13998 999999777 8999
Q ss_pred eCCCCcccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCc
Q psy1890 99 LGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGS 166 (242)
Q Consensus 99 l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~S 166 (242)
++.+|...++++|||||||||.|||+|||||+|+.|+|+++ ..+|.+.. ....+.+|||+|
T Consensus 80 ~~~~~~~~~~~~HEigHaLGl~H~~~~~drd~~~~~~~~~~----~~~~~~~~---~~~~~~~~~yg~ 140 (140)
T smart00235 80 LGNGCINTGVAAHELGHALGLYHEQSRSDRDNYMYINYTNI----TRNFDLSN---DDSLGIPYDYGS 140 (140)
T ss_pred ccCCcCCcccHHHHHHHHhcCCcCCCCCcccCeEEEehhhh----hhcccccc---ccCCCchhccCc
Confidence 98899999999999999999999999999999999999987 35665542 245778999986
No 9
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.98 E-value=1e-32 Score=224.02 Aligned_cols=148 Identities=27% Similarity=0.297 Sum_probs=121.4
Q ss_pred CeEEEEECCCCCHHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEcc---CCCCCCCCccCCC--CCccceEEeCCC
Q psy1890 28 GLIYYEIDKNFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKES---PNGKSCSSNVGFT--GKKVSYVNLGKN 102 (242)
Q Consensus 28 ~~I~Y~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~---~~~~gc~S~vG~~--~g~~q~i~l~~~ 102 (242)
..|+|.|++.++...|+.|++||+.|++.+||+|++++..+.+.|+|... +.+.+||+++|.. ++. +.|.++..
T Consensus 2 ~~I~y~i~~~~~~~~r~~i~~A~~~W~~~~~i~F~e~~~~~~~di~i~~~~~~~~~~~~~~~~~~~~~~~~-g~i~~~~~ 80 (165)
T cd04268 2 KPITYYIDDSVPDKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYNDGTWSYGPSQVDPLT-GEILLARV 80 (165)
T ss_pred CCEEEEEcCCCCHHHHHHHHHHHHHHHHHhCcCceeccccCccCceEEEEEeecCCCCccccCCccCCCCC-ccEEeeEE
Confidence 47999999999999999999999999999999999998633455665432 1258899999986 334 77888765
Q ss_pred Cc-----------ccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCcccccC
Q psy1890 103 CF-----------VMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGSIMHYS 171 (242)
Q Consensus 103 c~-----------~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~SIMhY~ 171 (242)
|. ..++++|||||||||.|+|+|+|||.|+ ..++.+||+.|||||.
T Consensus 81 ~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~~~~~~-----------------------~~~~~~~~~~SvM~y~ 137 (165)
T cd04268 81 YLYSSFVEYSGARLRNTAEHELGHALGLRHNFAASDRDDNV-----------------------DLLAEKGDTSSVMDYA 137 (165)
T ss_pred EEchhHHHHHHHHHHHHHHHHHHHHhcccccCcCCccCCcc-----------------------hhhccCCCCcccCCCC
Confidence 54 4699999999999999999999999776 2356799999999999
Q ss_pred CCccccCCCCCeeeeccCCCccccccCCcCcccHHHHHHHHhHC
Q psy1890 172 RFAFSRDSKSPTITPKKTLNVQADKMGQRINISKMDIAKLNTLY 215 (242)
Q Consensus 172 ~~~f~~~g~~~ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY 215 (242)
...|+.+ . .+|++..||..|+++|+.||
T Consensus 138 ~~~~~~~----------~------~~~~~~~~~~~Di~ai~~lY 165 (165)
T cd04268 138 PSNFSIQ----------L------GDGQKYTIGPYDIAAIKKLY 165 (165)
T ss_pred ccccccc----------c------ccccCCCCCHHHHHHHHhcC
Confidence 8777543 1 35677789999999999999
No 10
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.88 E-value=2.5e-22 Score=162.95 Aligned_cols=148 Identities=28% Similarity=0.359 Sum_probs=109.5
Q ss_pred eEEEEECCC--------CCHHHHHHHHHHHHHHhccccEEEEecCCC-CCCeEEEEcc----CCCCCCCCccCCCC-Ccc
Q psy1890 29 LIYYEIDKN--------FSVDRKRFIVKALNNIMTFSCVRFRRRTHN-QTPYIVYKES----PNGKSCSSNVGFTG-KKV 94 (242)
Q Consensus 29 ~I~Y~~~~~--------~~~~~~~~I~~A~~~w~~~tci~F~~~~~~-~~~~I~i~~~----~~~~gc~S~vG~~~-g~~ 94 (242)
.|+|.|..+ ++...++.|+.|++.|++.+||+|+++... ....|.|... ..+.+|+|++|... ...
T Consensus 2 ~i~~~i~~d~~~~~~~~~~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~~~~~~g~a~~~~~c~~~~ 81 (167)
T cd00203 2 VIPYVVVADDRDVEEENLSAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDFDGGTGGWAYLGRVCDSLR 81 (167)
T ss_pred EEEEEEEecccccchhhhHHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccCCCCceEEEecCCccCCCC
Confidence 467777543 567889999999999999999999999852 1444544332 12467889998863 222
Q ss_pred ceEEeCCCC----cccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCccccc
Q psy1890 95 SYVNLGKNC----FVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGSIMHY 170 (242)
Q Consensus 95 q~i~l~~~c----~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~SIMhY 170 (242)
+.+.+...+ ....+++|||||||||.|++.+++|++++.+. ......++|..|||||
T Consensus 82 ~~~~~~~~~~~~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~-------------------~~~~~~~~~~~siM~y 142 (167)
T cd00203 82 GVGVLQDNQSGTKEGAQTIAHELGHALGFYHDHDRKDRDDYPTID-------------------DTLNAEDDDYYSVMSY 142 (167)
T ss_pred CcEEEecCCcccccchhhHHHHHHHHhCCCccCcCCCCCCCcccc-------------------ccccCCCCCCCeEecc
Confidence 445554333 45689999999999999999999999988662 1124568999999999
Q ss_pred CCCccccCCCCCeeeeccCCCccccccCCcCcccHHHHHHHHhHC
Q psy1890 171 SRFAFSRDSKSPTITPKKTLNVQADKMGQRINISKMDIAKLNTLY 215 (242)
Q Consensus 171 ~~~~f~~~g~~~ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY 215 (242)
....++ ++++..+|..|+++|+++|
T Consensus 143 ~~~~~~--------------------~~~~~~fS~~d~~~i~~~Y 167 (167)
T cd00203 143 TKGSFS--------------------DGQRKDFSQCDIDQINKLY 167 (167)
T ss_pred CccccC--------------------cccCCCcCHHHHHHHHhhC
Confidence 865332 3345679999999999998
No 11
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=99.82 E-value=1.2e-19 Score=146.70 Aligned_cols=98 Identities=21% Similarity=0.237 Sum_probs=67.0
Q ss_pred CCCCCeEEEEECCCC----CHHHHHHHHHHHHHHhccccEEEEecCCCCCCeE--EEEccCCCCCC-CCccCCCCC----
Q psy1890 24 LWKDGLIYYEIDKNF----SVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYI--VYKESPNGKSC-SSNVGFTGK---- 92 (242)
Q Consensus 24 ~Wp~~~I~Y~~~~~~----~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I--~i~~~~~~~gc-~S~vG~~~g---- 92 (242)
+||+++|+|+|++.. ....+++|++||+.|++.+||+|++++......| .|... ..+| +++-|..+.
T Consensus 1 kW~~~~itY~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~--~~~~~~~~~~~~g~l~~a 78 (157)
T cd04278 1 KWSKTNLTYRILNYPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARG--NHGDGYPFDGPGGTLAHA 78 (157)
T ss_pred CCCCCceeEEEECCCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeec--ccCCCCCCCCCccccccc
Confidence 799999999997754 3678999999999999999999999985323334 44433 3444 333332110
Q ss_pred -----ccceEEeC-----------CCCcccchhHHHHHHHhcccccc
Q psy1890 93 -----KVSYVNLG-----------KNCFVMGTVQHETLHALGFWHEQ 123 (242)
Q Consensus 93 -----~~q~i~l~-----------~~c~~~~~i~HEigHALGl~HEh 123 (242)
..-.+.|. .++....+++|||||||||.|++
T Consensus 79 ~~p~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~ 125 (157)
T cd04278 79 FFPGGIGGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHALGLGHSS 125 (157)
T ss_pred cCCCCcceeEEECCCcccccCCCCccchHHHHHHHHhccccccCCCC
Confidence 00112221 23445689999999999999974
No 12
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.75 E-value=1.8e-17 Score=137.43 Aligned_cols=142 Identities=20% Similarity=0.152 Sum_probs=92.8
Q ss_pred CCCHHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCC---CCCCCCccCCCCC---ccceEEeC---------C
Q psy1890 37 NFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPN---GKSCSSNVGFTGK---KVSYVNLG---------K 101 (242)
Q Consensus 37 ~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~---~~gc~S~vG~~~g---~~q~i~l~---------~ 101 (242)
.+++.+++.|++||+.|++.+||+|++++..+...|+|..... +.+++++.+...+ ....+.|. .
T Consensus 30 ~~~~~~~~~i~~A~~~w~~~~~l~F~e~~~~~~adI~i~~~~~~~~~~~g~a~~p~~~~~~~~~g~i~~~~~~~~~~~~~ 109 (186)
T cd04277 30 ALSAAQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPDGNTAGYAYYPGSGSGTAYGGDIWFNSSYDTNSDSP 109 (186)
T ss_pred cCCHHHHHHHHHHHHHHHhhcCceeEECCCCCcceEEEEeccCCCCCccEEEECCCCCccccccceeEEecCcccccCCC
Confidence 3567899999999999999999999999865456788865521 2345555554321 11223332 2
Q ss_pred CCcccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCcccccCCCccccCCCC
Q psy1890 102 NCFVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGSIMHYSRFAFSRDSKS 181 (242)
Q Consensus 102 ~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~SIMhY~~~~f~~~g~~ 181 (242)
++....+++|||||||||.|+|..++.+.+ +. .....-.-.|||.|.......
T Consensus 110 g~~~~~t~~HEiGHaLGL~H~~~~~~~~~~---------~~--------------~~~~~~~~~SVMSY~~~~~~~---- 162 (186)
T cd04277 110 GSYGYQTIIHEIGHALGLEHPGDYNGGDPV---------PP--------------TYALDSREYTVMSYNSGYGNG---- 162 (186)
T ss_pred ChhhHHHHHHHHHHHhcCCCCCcCCCCCCC---------Cc--------------cccccCcceEEEeecCCCCCC----
Confidence 566679999999999999999998875421 00 000111345999997542110
Q ss_pred CeeeeccCCCccccccCCcCcccHHHHHHHHhHCC
Q psy1890 182 PTITPKKTLNVQADKMGQRINISKMDIAKLNTLYK 216 (242)
Q Consensus 182 ~ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY~ 216 (242)
.. . .-+....|...||++|+.||+
T Consensus 163 ~~-----~------~~~~~~~~~~~DI~AlQ~lYG 186 (186)
T cd04277 163 AS-----A------GGGYPQTPMLLDIAALQYLYG 186 (186)
T ss_pred cc-----c------cCcccCCccHHHHHHHHHhhC
Confidence 00 0 112234688999999999995
No 13
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=99.66 E-value=2.2e-15 Score=120.60 Aligned_cols=98 Identities=19% Similarity=0.235 Sum_probs=63.2
Q ss_pred CCCCCeEEEEECCC---CC-HHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEE--EccCCCCCCCCccCCCC------
Q psy1890 24 LWKDGLIYYEIDKN---FS-VDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVY--KESPNGKSCSSNVGFTG------ 91 (242)
Q Consensus 24 ~Wp~~~I~Y~~~~~---~~-~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i--~~~~~~~gc~S~vG~~~------ 91 (242)
||+..+|+|+|... ++ +..+++|++||+.|+..+.|+|+++...+ .-|.| .... ...+..+-|..+
T Consensus 1 ~W~~~~ity~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~~~-adi~i~~~~~~-~~~~~~~~~~~~~~~~~~ 78 (154)
T PF00413_consen 1 KWPKKTITYSISNTTPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSDGN-ADIRISFGSNN-HGDGYSFDGSGGTLAHAY 78 (154)
T ss_dssp SSSSSEEEEEESSBCTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESSSS-CSEEEEEESSS-SSSSS-CSSSSSESEEEE
T ss_pred CCCCCcEEEEEECCCCCCCHHHHHHHHHHHHHHHHhcCCceEEeccCCC-cceeeeeeccc-cCcccccccceeeeeccc
Confidence 79999999999763 23 46899999999999999999999998523 43444 3221 122222112111
Q ss_pred --C--ccceEEeCCC--------C-cccchhHHHHHHHhcccccc
Q psy1890 92 --K--KVSYVNLGKN--------C-FVMGTVQHETLHALGFWHEQ 123 (242)
Q Consensus 92 --g--~~q~i~l~~~--------c-~~~~~i~HEigHALGl~HEh 123 (242)
. ..-.+.+... . ....+++|||||||||.|++
T Consensus 79 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~v~~HEiGHaLGL~H~~ 123 (154)
T PF00413_consen 79 FPNNIVSGDIHFNDDESWTIDDSGNDLQSVAIHEIGHALGLDHSN 123 (154)
T ss_dssp ESSSTTTTEEEEETTSHEESSSSSEEHHHHHHHHHHHHTTBESSS
T ss_pred cccccccccccccccccchhhhhhhhhhhhhhhccccccCcCcCC
Confidence 0 0123444211 1 23489999999999999975
No 14
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.65 E-value=1.2e-15 Score=123.07 Aligned_cols=98 Identities=20% Similarity=0.216 Sum_probs=63.2
Q ss_pred eEEEEECCC------CCHHHHHHHHHHHHHHhccccEEEEecCCC--CCCe-EEEEc----cCCCCCCCCccCCC-----
Q psy1890 29 LIYYEIDKN------FSVDRKRFIVKALNNIMTFSCVRFRRRTHN--QTPY-IVYKE----SPNGKSCSSNVGFT----- 90 (242)
Q Consensus 29 ~I~Y~~~~~------~~~~~~~~I~~A~~~w~~~tci~F~~~~~~--~~~~-I~i~~----~~~~~gc~S~vG~~----- 90 (242)
.|+|++++. .....++.|++|++.|++.+||+|++.... +.+. |.|.. ...++.|.+..+..
T Consensus 3 ~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~~~~~~g~~~a~a~~p~~~~~~ 82 (156)
T cd04279 3 PIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPPPVGGAGGGLARAGFPLISDGN 82 (156)
T ss_pred CeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCCCCCCCCCceEEecccccCCCc
Confidence 578888764 245789999999999999999999998742 3332 44432 10123343333321
Q ss_pred C-Cc-cceEEeCC-----CCcccchhHHHHHHHhcccccccCC
Q psy1890 91 G-KK-VSYVNLGK-----NCFVMGTVQHETLHALGFWHEQARP 126 (242)
Q Consensus 91 ~-g~-~q~i~l~~-----~c~~~~~i~HEigHALGl~HEh~rp 126 (242)
. .. ...++++. .....++++|||||||||.||++++
T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~ 125 (156)
T cd04279 83 RKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSDRP 125 (156)
T ss_pred ccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCCCc
Confidence 1 11 12333332 1234689999999999999999864
No 15
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.08 E-value=3.9e-09 Score=88.37 Aligned_cols=158 Identities=15% Similarity=0.064 Sum_probs=85.2
Q ss_pred CCCeEEEEECCCCCHHHHHHHHHHHHHHhcc------ccEEEEecCCCC--CCe-----EEEEccCCCCCCCCc----cC
Q psy1890 26 KDGLIYYEIDKNFSVDRKRFIVKALNNIMTF------SCVRFRRRTHNQ--TPY-----IVYKESPNGKSCSSN----VG 88 (242)
Q Consensus 26 p~~~I~Y~~~~~~~~~~~~~I~~A~~~w~~~------tci~F~~~~~~~--~~~-----I~i~~~~~~~gc~S~----vG 88 (242)
|+..|.|.|++.++...++.|++|++.|++. ..+.++...... ..- |+.... ..+-+++ ..
T Consensus 6 pk~pItyyI~~~~p~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~--~~~~~~~gps~~d 83 (197)
T cd04276 6 PKEPIVYYLDNTFPEKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHS--PNGGWAYGPSVVD 83 (197)
T ss_pred CCCCEEEEecCCCcHHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEec--CCCcceecccccC
Confidence 5678999999887788999999999999975 344444443211 112 333333 2122222 11
Q ss_pred CCCCc--cceEEeCCCCc--------------ccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCc
Q psy1890 89 FTGKK--VSYVNLGKNCF--------------VMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNF 152 (242)
Q Consensus 89 ~~~g~--~q~i~l~~~c~--------------~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~ 152 (242)
-..|+ .-.+.|..+-. ...+++||+||+|||.|.....-. .+ .....++
T Consensus 84 PrTGeIl~a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lGl~hn~~~s~~-----------~~----~~~l~~~ 148 (197)
T cd04276 84 PRTGEILKADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLGLRHNFKASSD-----------GS----NEELEDP 148 (197)
T ss_pred CCCCCeEEEEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhcCccccccccc-----------Cc----hhhhcch
Confidence 11122 13344432111 236899999999999997654311 00 0000000
Q ss_pred cccccCCCcCCCCcccccCCCccccCCCCCeeeeccCCCccccccCCcCcccHHHHHHHHhHCC
Q psy1890 153 KEATTLDLPYDFGSIMHYSRFAFSRDSKSPTITPKKTLNVQADKMGQRINISKMDIAKLNTLYK 216 (242)
Q Consensus 153 ~~~~~~g~~yD~~SIMhY~~~~f~~~g~~~ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY~ 216 (242)
......+.. .|||-|.+......+. +. .-.-...+-..|+.+|+.+|+
T Consensus 149 ~~~~~~~~~---~SVMdY~~~~~~~~~~-------~~------~~~~~~~~g~yDi~Aiq~~Y~ 196 (197)
T cd04276 149 LGTKEKGAT---SSVMDYPPPNVAAQGE-------DQ------GDYYPPTIGPYDKWAIEYGYT 196 (197)
T ss_pred hhhhhcCCc---ceeecCccccccccCc-------cc------cccccCCCchHHHHHHHHHcC
Confidence 111122332 3999998654322211 10 001123477789999999995
No 16
>KOG1565|consensus
Probab=99.03 E-value=1.3e-09 Score=102.10 Aligned_cols=100 Identities=16% Similarity=0.177 Sum_probs=65.0
Q ss_pred CCCCCCCCeEEEEECCC---CC-HHHHHHHHHHHHHHhccccEEEEecCC-CCCC-eEEEEccCCCCCC-CCccCCC---
Q psy1890 21 PAKLWKDGLIYYEIDKN---FS-VDRKRFIVKALNNIMTFSCVRFRRRTH-NQTP-YIVYKESPNGKSC-SSNVGFT--- 90 (242)
Q Consensus 21 ~~~~Wp~~~I~Y~~~~~---~~-~~~~~~I~~A~~~w~~~tci~F~~~~~-~~~~-~I~i~~~~~~~gc-~S~vG~~--- 90 (242)
...+|++..|+|.|.+. ++ ...+.++++|+..|+++|-++|.|+.. ...+ .|.|... ..|+ +.+-|..
T Consensus 102 ~~~kW~k~~lT~ri~n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~--~h~d~~PFDG~~g~l 179 (469)
T KOG1565|consen 102 GKPKWNKEHLTYRIKNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADIRISFFPG--DHGDGFPFDGPGGVL 179 (469)
T ss_pred ccCcccccccceeccccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCceeeeeecc--CCCCCCcccCCCCce
Confidence 45699999999999653 33 578999999999999999999999986 3333 2444322 1221 1111111
Q ss_pred ----------CCc-----cceEEeC--CCCcccchhHHHHHHHhccccc
Q psy1890 91 ----------GKK-----VSYVNLG--KNCFVMGTVQHETLHALGFWHE 122 (242)
Q Consensus 91 ----------~g~-----~q~i~l~--~~c~~~~~i~HEigHALGl~HE 122 (242)
+|. .....++ .+-....|++|||||||||.|.
T Consensus 180 aHAf~Pg~~~~G~~hfD~dE~Wt~~~~~g~~l~~Va~HEiGH~LGL~HS 228 (469)
T KOG1565|consen 180 AHAFFPGPGIGGDLHFDKDETWTYGDSNGVDLFLVAAHEIGHALGLGHS 228 (469)
T ss_pred ecccCCCCCCCCccccCcccceeccCCccchhHHHhhhhcccccccCCC
Confidence 111 1222221 1334568999999999999997
No 17
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=99.00 E-value=1.1e-08 Score=85.74 Aligned_cols=160 Identities=23% Similarity=0.257 Sum_probs=85.8
Q ss_pred CCCCC--CCeEEEEECC----CCCHHHHHHHHHHHHHHhc-cccEEEEe--cCC-CCCCeEEEEccC---CCCCCCCccC
Q psy1890 22 AKLWK--DGLIYYEIDK----NFSVDRKRFIVKALNNIMT-FSCVRFRR--RTH-NQTPYIVYKESP---NGKSCSSNVG 88 (242)
Q Consensus 22 ~~~Wp--~~~I~Y~~~~----~~~~~~~~~I~~A~~~w~~-~tci~F~~--~~~-~~~~~I~i~~~~---~~~gc~S~vG 88 (242)
..+|. ..+|...+.. .++...+..+.+|++.|+. ...|+|+- .+. ...+.+.+.... .+.+-.+...
T Consensus 31 tnlV~~~~~~I~i~~~~~~~~~l~s~~~~al~~AI~~yNalgl~i~F~~t~g~~~~~~di~v~~~~~~~~~G~ggsAGFP 110 (211)
T PF12388_consen 31 TNLVGSSPRTITIIGYTGGSQSLNSAWRTALDEAINNYNALGLSITFTLTFGTNYQNADIIVYSNSSNNPSGAGGSAGFP 110 (211)
T ss_pred eeeecCCCCEEEEEeCCCccccccHHHHHHHHHHHHHHHhhCCceEEEEecCCCcCCCceEEEeccCCCCCCcceeccCC
Confidence 34563 2467777766 5678899999999999998 45888841 121 123334443221 1222223222
Q ss_pred CCCCc-cceEEeC---CCC--cccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCc-
Q psy1890 89 FTGKK-VSYVNLG---KNC--FVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLP- 161 (242)
Q Consensus 89 ~~~g~-~q~i~l~---~~c--~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~- 161 (242)
..+|. ...|.+. ..- ....+++|||||++||.|+.- -+|.. - -.+..++. ..+ ......|+|
T Consensus 111 ~s~G~P~~~I~I~~~~~~~~~~~~hvi~HEiGH~IGfRHTD~-~~R~S---C-G~~~nEg~-~~v-----GAi~IpGTPt 179 (211)
T PF12388_consen 111 TSNGNPYKFIQIYGLSNYSVNVIEHVITHEIGHCIGFRHTDY-FNRSS---C-GSGGNEGS-AGV-----GAIHIPGTPT 179 (211)
T ss_pred CCCCCCCceEEEEecCCCchhHHHHHHHHHhhhhccccccCc-CCccc---c-cccCCcCc-ccc-----ceEECCCCCC
Confidence 11232 1233331 111 124699999999999999752 22210 0 00000000 000 011122333
Q ss_pred -CCCCcccccCCCccccCCCCCeeeeccCCCccccccCCcCcccHHHHHHHHhHC
Q psy1890 162 -YDFGSIMHYSRFAFSRDSKSPTITPKKTLNVQADKMGQRINISKMDIAKLNTLY 215 (242)
Q Consensus 162 -yD~~SIMhY~~~~f~~~g~~~ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY 215 (242)
+|..|||... |+. +....+|..|+.+|+.||
T Consensus 180 ~~d~~SiM~ac---~~~--------------------~~~~~f~~~Di~Al~~lY 211 (211)
T PF12388_consen 180 GADPNSIMNAC---FSS--------------------GEDGEFTSNDITALNYLY 211 (211)
T ss_pred CCCCchhhhcc---ccC--------------------CCCCCcChhhHHHHHhhC
Confidence 6999999975 111 112359999999999999
No 18
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=98.37 E-value=4.9e-07 Score=69.64 Aligned_cols=85 Identities=22% Similarity=0.246 Sum_probs=50.3
Q ss_pred eEEEEECCCCCHHHHHHHHHHHHHHhc-cccEEEEecCCCCCCeEEEEccCCCCCCCCccCCCCCccceEEeCC----CC
Q psy1890 29 LIYYEIDKNFSVDRKRFIVKALNNIMT-FSCVRFRRRTHNQTPYIVYKESPNGKSCSSNVGFTGKKVSYVNLGK----NC 103 (242)
Q Consensus 29 ~I~Y~~~~~~~~~~~~~I~~A~~~w~~-~tci~F~~~~~~~~~~I~i~~~~~~~gc~S~vG~~~g~~q~i~l~~----~c 103 (242)
+|.|.-+.. ...+..|.+|.+.|+. +++|+|++.+. ..+.|... ...-+|+-...+...-+|-|+. +-
T Consensus 3 tv~Ydas~A--~~f~~~i~~aa~iWN~sV~NV~L~~~s~---a~~~~~~~--~~~~~sya~~~g~G~G~I~l~~~~~qgy 75 (132)
T PF02031_consen 3 TVYYDASRA--PEFRSAIAQAAQIWNSSVSNVRLVEGSS---ADIRYYEG--NDPRGSYASTDGLGSGYIFLDYQQNQGY 75 (132)
T ss_dssp EEEEEEEE---GGGHHHHHHHHHHHHHH-SSEEEEE-SS----SEEEEEE----TT--EEEE-SSS-EEEEEEHHHHHHS
T ss_pred EEEEeCCCC--chhHHHHHHHHHHHhcccCceEEeecCC---CcEEEEEe--cCCCCcccccCCCCcEEEEechHHhhCC
Confidence 456655443 4578899999999986 78999999885 33444432 1222454444433225666642 34
Q ss_pred cccchhHHHHHHHhccc
Q psy1890 104 FVMGTVQHETLHALGFW 120 (242)
Q Consensus 104 ~~~~~i~HEigHALGl~ 120 (242)
....+++|||||.|||.
T Consensus 76 ~~~RIaaHE~GHiLGLP 92 (132)
T PF02031_consen 76 NSTRIAAHELGHILGLP 92 (132)
T ss_dssp -HHHHHHHHHHHHHT--
T ss_pred ccceeeeehhccccCCC
Confidence 45689999999999994
No 19
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=97.41 E-value=0.00021 Score=60.33 Aligned_cols=20 Identities=35% Similarity=0.418 Sum_probs=17.9
Q ss_pred ccchhHHHHHHHhccccccc
Q psy1890 105 VMGTVQHETLHALGFWHEQA 124 (242)
Q Consensus 105 ~~~~i~HEigHALGl~HEh~ 124 (242)
...+++|||||+||+.|+..
T Consensus 137 ~~~~~aHEiGH~lGl~H~~~ 156 (206)
T PF13583_consen 137 GYQTFAHEIGHNLGLRHDFD 156 (206)
T ss_pred cchHHHHHHHHHhcCCCCcc
Confidence 34789999999999999997
No 20
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=97.40 E-value=0.00037 Score=58.16 Aligned_cols=77 Identities=17% Similarity=0.125 Sum_probs=45.8
Q ss_pred HHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEE-ccCCCCCCCCccCCCC-------------Cc-----cceEEeCC
Q psy1890 41 DRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYK-ESPNGKSCSSNVGFTG-------------KK-----VSYVNLGK 101 (242)
Q Consensus 41 ~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~-~~~~~~gc~S~vG~~~-------------g~-----~q~i~l~~ 101 (242)
..+.++..|++.|.+.-.+..++..+ +.+ |.|. .++ +.-.|.-.|+.. +. ...+.+.+
T Consensus 104 ~wq~a~~tava~wa~~fpl~ive~~e-eaD-Itie~~n~-pgtg~~~~grArtal~~~efl~~~~~~~~lnhr~~i~~~p 180 (236)
T COG5549 104 RWQGAYLTAVAGWAKTFPLIIVERFE-EAD-ITIEVGNP-PGTGWRQYGRARTALIAYEFLGHALGLGHLNHRGDIMYPP 180 (236)
T ss_pred hHHHHHHHHHHHHHHhCCceeeecce-eee-EEEEecCC-CCCcccccchHHHHHHHHHHhhhccCcccccccccccCCc
Confidence 45789999999999987777777774 444 5553 332 222233333321 10 01223322
Q ss_pred CC---cccchhHHHHHHHhccc
Q psy1890 102 NC---FVMGTVQHETLHALGFW 120 (242)
Q Consensus 102 ~c---~~~~~i~HEigHALGl~ 120 (242)
+- ...+++.||+|||||++
T Consensus 181 g~~~e~L~~tarhElGhaLgi~ 202 (236)
T COG5549 181 GELRENLNPTARHELGHALGIW 202 (236)
T ss_pred ccchhhhhHHHHHhhcchheec
Confidence 22 23489999999999987
No 21
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=96.56 E-value=0.0018 Score=53.12 Aligned_cols=21 Identities=33% Similarity=0.332 Sum_probs=15.4
Q ss_pred ccchhHHHHHHHhcccccccC
Q psy1890 105 VMGTVQHETLHALGFWHEQAR 125 (242)
Q Consensus 105 ~~~~i~HEigHALGl~HEh~r 125 (242)
...+++|||||.||..|...-
T Consensus 111 ~~~~~aHElGH~lGa~Hd~~~ 131 (173)
T PF13574_consen 111 GIDTFAHELGHQLGAPHDFDN 131 (173)
T ss_dssp HHHHHHHHHHHHHT---SSSS
T ss_pred eeeeehhhhHhhcCCCCCCCC
Confidence 346899999999999999883
No 22
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=96.56 E-value=0.001 Score=50.98 Aligned_cols=19 Identities=37% Similarity=0.457 Sum_probs=13.3
Q ss_pred cccchhHHHHHHHhccccc
Q psy1890 104 FVMGTVQHETLHALGFWHE 122 (242)
Q Consensus 104 ~~~~~i~HEigHALGl~HE 122 (242)
....+++|||||.||+.|.
T Consensus 106 ~~~~~~~HEiGH~lGl~Hd 124 (124)
T PF13582_consen 106 SGVDTFAHEIGHNLGLNHD 124 (124)
T ss_dssp TSTTHHHHHHHHHTT----
T ss_pred ccceEeeehhhHhcCCCCC
Confidence 3468999999999999993
No 23
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=96.47 E-value=0.0055 Score=55.02 Aligned_cols=78 Identities=22% Similarity=0.248 Sum_probs=45.5
Q ss_pred HHHHHHHHHHHHHhccc-cE-EEEecCCCCCCeEEEEccCCCCCC-CCccCCCCCccceEEe-CCCCcccchhHHHHHHH
Q psy1890 41 DRKRFIVKALNNIMTFS-CV-RFRRRTHNQTPYIVYKESPNGKSC-SSNVGFTGKKVSYVNL-GKNCFVMGTVQHETLHA 116 (242)
Q Consensus 41 ~~~~~I~~A~~~w~~~t-ci-~F~~~~~~~~~~I~i~~~~~~~gc-~S~vG~~~g~~q~i~l-~~~c~~~~~i~HEigHA 116 (242)
...-.+..+.+.+.+.. .+ .|.. -...|.|... +..| |+-+|..+|....+.. +.++...++++||+||-
T Consensus 88 ~d~~~~~~~Ad~~a~~~lG~~~~s~----y~h~vyvlP~--~~~C~w~Gla~v~G~~~~~~~~~~~~~~~~~~~HElgHN 161 (314)
T PF05548_consen 88 DDWYGWADAADAAARAQLGVNAFSS----YTHRVYVLPP--GFACGWAGLATVPGSQSWLWISGYGVQDWATIMHELGHN 161 (314)
T ss_pred HHHHHHHHHHHHHHHhhcCcccccc----cceEEEEcCC--CCCCCceEEeecCCcceeeeecCcccccHHHHHHHhhhh
Confidence 33445666666666544 33 2311 2244655543 3347 5544444444122333 33566778999999999
Q ss_pred hccccccc
Q psy1890 117 LGFWHEQA 124 (242)
Q Consensus 117 LGl~HEh~ 124 (242)
|||.|...
T Consensus 162 ~GL~Ha~~ 169 (314)
T PF05548_consen 162 LGLWHAGR 169 (314)
T ss_pred ccccccCC
Confidence 99999974
No 24
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=96.35 E-value=0.0016 Score=52.52 Aligned_cols=21 Identities=33% Similarity=0.401 Sum_probs=14.1
Q ss_pred cchhHHHHHHHhcccccccCC
Q psy1890 106 MGTVQHETLHALGFWHEQARP 126 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rp 126 (242)
..|++||+||-|||.|..+..
T Consensus 70 g~TltHEvGH~LGL~HtF~~~ 90 (154)
T PF05572_consen 70 GKTLTHEVGHWLGLYHTFGGG 90 (154)
T ss_dssp SHHHHHHHHHHTT---TT---
T ss_pred ccchhhhhhhhhcccccccCC
Confidence 479999999999999999764
No 25
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=96.14 E-value=0.0032 Score=52.06 Aligned_cols=19 Identities=37% Similarity=0.363 Sum_probs=17.0
Q ss_pred cchhHHHHHHHhccccccc
Q psy1890 106 MGTVQHETLHALGFWHEQA 124 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~ 124 (242)
..+++|||||.||+.|...
T Consensus 134 ~~~~aHElGH~lG~~HD~~ 152 (192)
T cd04267 134 ALTMAHELGHNLGAEHDGG 152 (192)
T ss_pred hhhhhhhHHhhcCCcCCCC
Confidence 4699999999999999865
No 26
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=95.97 E-value=0.0037 Score=51.81 Aligned_cols=23 Identities=30% Similarity=0.205 Sum_probs=14.7
Q ss_pred ccchhHHHHHHHhcccccccCCC
Q psy1890 105 VMGTVQHETLHALGFWHEQARPD 127 (242)
Q Consensus 105 ~~~~i~HEigHALGl~HEh~rpD 127 (242)
...+++|||||.||+.|.+..+.
T Consensus 142 ~~~~~AHEiGH~lGa~HD~~~~~ 164 (196)
T PF13688_consen 142 GAITFAHEIGHNLGAPHDGDYSS 164 (196)
T ss_dssp HHHHHHHHHHHHTT-----SSS-
T ss_pred eehhhHHhHHHhcCCCCCCCCCC
Confidence 45799999999999999887653
No 27
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=95.45 E-value=0.002 Score=55.17 Aligned_cols=22 Identities=32% Similarity=0.392 Sum_probs=19.2
Q ss_pred ccchhHHHHHHHhcccccccCC
Q psy1890 105 VMGTVQHETLHALGFWHEQARP 126 (242)
Q Consensus 105 ~~~~i~HEigHALGl~HEh~rp 126 (242)
...|++||+||.|||.|.++.-
T Consensus 137 ~g~t~~HEvGH~lGL~HtF~~~ 158 (225)
T cd04275 137 LGDTATHEVGHWLGLYHTFQGG 158 (225)
T ss_pred ccceeEEeccceeeeeeeecCC
Confidence 3469999999999999999774
No 28
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=95.22 E-value=0.014 Score=49.60 Aligned_cols=21 Identities=33% Similarity=0.343 Sum_probs=18.1
Q ss_pred ccchhHHHHHHHhcccccccC
Q psy1890 105 VMGTVQHETLHALGFWHEQAR 125 (242)
Q Consensus 105 ~~~~i~HEigHALGl~HEh~r 125 (242)
...+++|||||.||+.|....
T Consensus 145 ~~~~~AHElGH~lG~~HD~~~ 165 (220)
T cd04272 145 GVYTMTHELAHLLGAPHDGSP 165 (220)
T ss_pred cHHHHHHHHHHHhCCCCCCCC
Confidence 357999999999999998654
No 29
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=95.19 E-value=0.0087 Score=49.59 Aligned_cols=19 Identities=37% Similarity=0.484 Sum_probs=16.9
Q ss_pred cchhHHHHHHHhccccccc
Q psy1890 106 MGTVQHETLHALGFWHEQA 124 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~ 124 (242)
..+++|||||.||+.|...
T Consensus 132 a~~~AHElGH~lG~~HD~~ 150 (194)
T cd04269 132 AVTMAHELGHNLGMEHDDG 150 (194)
T ss_pred HHHHHHHHHhhcCCCcCCC
Confidence 4799999999999999864
No 30
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=95.00 E-value=0.0099 Score=48.51 Aligned_cols=19 Identities=37% Similarity=0.356 Sum_probs=16.0
Q ss_pred chhHHHHHHHhcccccccC
Q psy1890 107 GTVQHETLHALGFWHEQAR 125 (242)
Q Consensus 107 ~~i~HEigHALGl~HEh~r 125 (242)
--++||+||.|||.|=-++
T Consensus 126 KEv~HElGH~~GL~HC~N~ 144 (181)
T COG1913 126 KEVLHELGHLLGLSHCPNP 144 (181)
T ss_pred HHHHHHhhhhcCcccCCCC
Confidence 4689999999999986554
No 31
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=94.68 E-value=0.015 Score=50.40 Aligned_cols=17 Identities=29% Similarity=0.282 Sum_probs=15.2
Q ss_pred chhHHHHHHHhcccccc
Q psy1890 107 GTVQHETLHALGFWHEQ 123 (242)
Q Consensus 107 ~~i~HEigHALGl~HEh 123 (242)
.+++|||||.||+.|--
T Consensus 169 ~t~AHElGHnlGm~HD~ 185 (244)
T cd04270 169 LVTAHELGHNFGSPHDP 185 (244)
T ss_pred HHHHHHHHHhcCCCCCC
Confidence 68999999999999754
No 32
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=94.67 E-value=0.011 Score=50.58 Aligned_cols=18 Identities=28% Similarity=0.385 Sum_probs=15.7
Q ss_pred chhHHHHHHHhccccccc
Q psy1890 107 GTVQHETLHALGFWHEQA 124 (242)
Q Consensus 107 ~~i~HEigHALGl~HEh~ 124 (242)
.+++|||||.||+.|--.
T Consensus 147 ~t~AHElGHnLGm~HD~~ 164 (228)
T cd04271 147 QVFAHEIGHTFGAVHDCT 164 (228)
T ss_pred eehhhhhhhhcCCCCCCC
Confidence 589999999999998653
No 33
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=94.47 E-value=0.018 Score=47.57 Aligned_cols=15 Identities=47% Similarity=0.601 Sum_probs=13.6
Q ss_pred chhHHHHHHHhcccc
Q psy1890 107 GTVQHETLHALGFWH 121 (242)
Q Consensus 107 ~~i~HEigHALGl~H 121 (242)
..++||+||.|||.|
T Consensus 127 k~~~HElGH~lGL~H 141 (179)
T PRK13267 127 KEVTHELGHTLGLEH 141 (179)
T ss_pred HHHHHHHHHHcCCcc
Confidence 458999999999998
No 34
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=94.31 E-value=0.0087 Score=50.30 Aligned_cols=20 Identities=35% Similarity=0.386 Sum_probs=17.6
Q ss_pred ccchhHHHHHHHhccccccc
Q psy1890 105 VMGTVQHETLHALGFWHEQA 124 (242)
Q Consensus 105 ~~~~i~HEigHALGl~HEh~ 124 (242)
...+++|||||.||+.|...
T Consensus 140 ~a~~~aHElGH~LG~~HD~~ 159 (207)
T cd04273 140 SAFTIAHELGHVLGMPHDGD 159 (207)
T ss_pred eEEeeeeechhhcCCCCCCC
Confidence 45799999999999999865
No 35
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=94.28 E-value=0.021 Score=47.74 Aligned_cols=18 Identities=28% Similarity=0.296 Sum_probs=12.3
Q ss_pred chhHHHHHHHhccccccc
Q psy1890 107 GTVQHETLHALGFWHEQA 124 (242)
Q Consensus 107 ~~i~HEigHALGl~HEh~ 124 (242)
..++||+||.|||.|=..
T Consensus 147 Kea~HElGH~~GL~HC~~ 164 (194)
T PF07998_consen 147 KEAVHELGHLFGLDHCEN 164 (194)
T ss_dssp HHHHHHHHHHTT----SS
T ss_pred HHHHHHHHHHcCCcCCCC
Confidence 679999999999998654
No 36
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=93.75 E-value=0.03 Score=46.55 Aligned_cols=19 Identities=32% Similarity=0.396 Sum_probs=14.1
Q ss_pred cchhHHHHHHHhccccccc
Q psy1890 106 MGTVQHETLHALGFWHEQA 124 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~ 124 (242)
..+++|||||.||+.|...
T Consensus 132 a~~~AHelGH~lGm~HD~~ 150 (199)
T PF01421_consen 132 AVIIAHELGHNLGMPHDGD 150 (199)
T ss_dssp HHHHHHHHHHHTT---TTT
T ss_pred HHHHHHHHHHhcCCCCCCC
Confidence 3688999999999999887
No 37
>PF11350 DUF3152: Protein of unknown function (DUF3152); InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=93.02 E-value=0.038 Score=46.19 Aligned_cols=98 Identities=21% Similarity=0.334 Sum_probs=55.6
Q ss_pred CeEEEEEC--CCCC-------HHHHHHHHHHHHH---HhccccEEEEecCCCCCCe-EEEEccC-CCCCCC--CccCC--
Q psy1890 28 GLIYYEID--KNFS-------VDRKRFIVKALNN---IMTFSCVRFRRRTHNQTPY-IVYKESP-NGKSCS--SNVGF-- 89 (242)
Q Consensus 28 ~~I~Y~~~--~~~~-------~~~~~~I~~A~~~---w~~~tci~F~~~~~~~~~~-I~i~~~~-~~~gc~--S~vG~-- 89 (242)
..+.|.++ +.+. ...-..|...+.. |....-++|+.+...+.++ |.+.... -..-|. ..-|.
T Consensus 29 ~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~~~~Df~I~Lasp~T~~~lC~g~~~~~e~S 108 (203)
T PF11350_consen 29 RLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDSGAPDFRISLASPGTTDRLCAGLDTSGETS 108 (203)
T ss_pred ceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCCCCCCEEEEECCcchhhhhccCcCcCceeE
Confidence 45788873 3331 3455666666654 8888889999999755453 4443220 023351 11111
Q ss_pred --C--CCccceEEeCCC------------CcccchhHHHHHHHhcccccccCC
Q psy1890 90 --T--GKKVSYVNLGKN------------CFVMGTVQHETLHALGFWHEQARP 126 (242)
Q Consensus 90 --~--~g~~q~i~l~~~------------c~~~~~i~HEigHALGl~HEh~rp 126 (242)
. +++ -.||+..+ -...-+|-||.||+||..||-.--
T Consensus 109 C~~~~g~~-VvIN~~RWv~Ga~~f~gdl~~YRqYvINHEVGH~LGh~H~~Cpg 160 (203)
T PF11350_consen 109 CRNPAGGR-VVINLARWVRGAPAFGGDLASYRQYVINHEVGHALGHGHEPCPG 160 (203)
T ss_pred eecCCCCe-EEEehHHhhccCcccCCcHHHHHHHhhhhhhhhhcccCCCcCCC
Confidence 1 122 34555211 113468999999999998887753
No 38
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=92.96 E-value=0.028 Score=49.80 Aligned_cols=20 Identities=30% Similarity=0.345 Sum_probs=16.9
Q ss_pred ccchhHHHHHHHhccccccc
Q psy1890 105 VMGTVQHETLHALGFWHEQA 124 (242)
Q Consensus 105 ~~~~i~HEigHALGl~HEh~ 124 (242)
..+|++||+||+|||.+.-.
T Consensus 165 ~igv~~HE~gH~lGLPDlY~ 184 (286)
T TIGR03296 165 GVGVIAHELGHDLGLPDLYD 184 (286)
T ss_pred ceeeeehhhhcccCCCcccc
Confidence 46999999999999976654
No 39
>PF10462 Peptidase_M66: Peptidase M66; InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=92.84 E-value=0.046 Score=48.82 Aligned_cols=16 Identities=31% Similarity=0.403 Sum_probs=13.0
Q ss_pred cchhHHHHHHHhcccc
Q psy1890 106 MGTVQHETLHALGFWH 121 (242)
Q Consensus 106 ~~~i~HEigHALGl~H 121 (242)
.++++||+||++||.|
T Consensus 194 ~~~f~HE~GH~~GL~H 209 (305)
T PF10462_consen 194 GNEFSHELGHNFGLGH 209 (305)
T ss_dssp HHHHHHHHHHTTT--S
T ss_pred cceeehhhhhhcCCCC
Confidence 4789999999999999
No 40
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=92.48 E-value=0.16 Score=34.21 Aligned_cols=40 Identities=20% Similarity=0.265 Sum_probs=31.4
Q ss_pred ceEEeCC-CCcccchhHHHHHHHhcccccccCCCCCceEEE
Q psy1890 95 SYVNLGK-NCFVMGTVQHETLHALGFWHEQARPDRDRFVDI 134 (242)
Q Consensus 95 q~i~l~~-~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i 134 (242)
..+.|.+ -+.....++|++.+.+||.|+..-...+.||.|
T Consensus 18 ~~l~F~p~ls~~eR~~vH~lA~~~gL~s~S~G~g~~R~v~v 58 (60)
T cd02641 18 TELEFPPTLSSHDRLLVHELAEELGLRHESTGEGSDRVITV 58 (60)
T ss_pred CcEECCCCCCHHHHHHHHHHHHHcCCceEeeCCCCceEEEe
Confidence 4456654 333458999999999999999998888888766
No 41
>PF12044 Metallopep: Putative peptidase family; InterPro: IPR021917 This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length.
Probab=92.26 E-value=0.059 Score=50.23 Aligned_cols=20 Identities=30% Similarity=0.476 Sum_probs=17.5
Q ss_pred cchhHHHHHHHhcccccccC
Q psy1890 106 MGTVQHETLHALGFWHEQAR 125 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~r 125 (242)
.|.++|||||+||+.|.-+-
T Consensus 317 iGA~lHEiGH~fg~pH~~~G 336 (423)
T PF12044_consen 317 IGAFLHEIGHLFGCPHQEDG 336 (423)
T ss_pred hHHHHHHHHHhcCCCCCCCC
Confidence 47899999999999998753
No 42
>PF05547 Peptidase_M6: Immune inhibitor A peptidase M6; InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=89.62 E-value=0.14 Score=50.16 Aligned_cols=16 Identities=38% Similarity=0.459 Sum_probs=14.7
Q ss_pred ccchhHHHHHHHhccc
Q psy1890 105 VMGTVQHETLHALGFW 120 (242)
Q Consensus 105 ~~~~i~HEigHALGl~ 120 (242)
..|+++|||||.|||.
T Consensus 221 giGVfaHEfGH~LGLP 236 (645)
T PF05547_consen 221 GIGVFAHEFGHDLGLP 236 (645)
T ss_pred ceEEEEeeccccCCCC
Confidence 5799999999999996
No 43
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=89.60 E-value=0.16 Score=49.36 Aligned_cols=77 Identities=17% Similarity=0.199 Sum_probs=38.6
Q ss_pred HHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCC---CCCCCCccCCCCCccceEEeCCCCcccchhHHHHHHHh
Q psy1890 41 DRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPN---GKSCSSNVGFTGKKVSYVNLGKNCFVMGTVQHETLHAL 117 (242)
Q Consensus 41 ~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~---~~gc~S~vG~~~g~~q~i~l~~~c~~~~~i~HEigHAL 117 (242)
..++.|.+|++.|...-. .|....- +..+|-+...++ +.-|++..+... ..-.+|+......+.|++||+|||+
T Consensus 311 ~a~~~v~~~~~~l~~e~~-~~~~~~~-~~~~iD~~~r~gK~~Ga~~~~~~~~~~-p~i~~N~~~~~~~v~TL~HE~GHa~ 387 (587)
T TIGR02290 311 EAKELVLEAFGKFSPEMA-DFAEKAF-EEGWIDAEPRPGKRGGAFCTGFPPSKE-PRVLMNYDGSRRDVSTLAHELGHAY 387 (587)
T ss_pred HHHHHHHHHHHhcCHHHH-HHHHHHH-HcCCcccCCCCCCCCCcccCCCCCCCC-CEEEEecCCCchhHHHHHHHhhHHH
Confidence 567888888877654311 1111111 223444433221 223444332211 1133455544555689999999999
Q ss_pred ccc
Q psy1890 118 GFW 120 (242)
Q Consensus 118 Gl~ 120 (242)
=..
T Consensus 388 H~~ 390 (587)
T TIGR02290 388 HSE 390 (587)
T ss_pred HHH
Confidence 443
No 44
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=89.38 E-value=0.17 Score=48.62 Aligned_cols=17 Identities=53% Similarity=0.688 Sum_probs=14.0
Q ss_pred cchhHHHHHHHhccccc
Q psy1890 106 MGTVQHETLHALGFWHE 122 (242)
Q Consensus 106 ~~~i~HEigHALGl~HE 122 (242)
..+++|||+|||||.-.
T Consensus 211 ~~~~~HEi~HaLGFs~~ 227 (521)
T PF01457_consen 211 FRTVIHEIAHALGFSSS 227 (521)
T ss_dssp HHHHHHHHHHHTT-SHH
T ss_pred cceeeeeeeeeeeeccc
Confidence 47999999999999763
No 45
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=89.09 E-value=0.73 Score=42.50 Aligned_cols=25 Identities=24% Similarity=0.175 Sum_probs=17.7
Q ss_pred eEEeCCCCcccchhHHHHHHHhccc
Q psy1890 96 YVNLGKNCFVMGTVQHETLHALGFW 120 (242)
Q Consensus 96 ~i~l~~~c~~~~~i~HEigHALGl~ 120 (242)
.+|+........|+.||+|||+=..
T Consensus 213 ~~n~~~~~~~v~tl~HE~GHa~h~~ 237 (427)
T cd06459 213 LMNFNGTLDDVFTLAHELGHAFHSY 237 (427)
T ss_pred EecCCCChhhHHHHHHHhhHHHHHH
Confidence 3444555556789999999997444
No 46
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=89.02 E-value=0.74 Score=44.77 Aligned_cols=75 Identities=19% Similarity=0.262 Sum_probs=36.3
Q ss_pred HHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCC---CCCCCCccCCCCCccceEEeCCCCcccchhHHHHHHHh
Q psy1890 41 DRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPN---GKSCSSNVGFTGKKVSYVNLGKNCFVMGTVQHETLHAL 117 (242)
Q Consensus 41 ~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~---~~gc~S~vG~~~g~~q~i~l~~~c~~~~~i~HEigHAL 117 (242)
+.++.|.+|++.+.. ....++...- +..||-...+++ |.-|++..+ . ...--+|+...-..+.|++||+|||+
T Consensus 315 ~a~~~v~~~~~~~g~-~~~~~~~~~~-~~~wiD~~~r~gK~~Ga~~~~~~~-~-~p~il~N~~~~~~dv~TLaHElGHa~ 390 (591)
T TIGR00181 315 EAKELILKSLEPLGE-EYIKILKRAF-NERWVDYAENKGKRSGAYSIGGYK-V-KPYILMNWDGTLNSVFTLAHELGHSM 390 (591)
T ss_pred HHHHHHHHHHhccCH-HHHHHHHHHh-hCCCeeecCCCCCCCCcccCCCCC-C-CCeEEEecCCCcchHHHHHHHhhhHH
Confidence 456667777754321 1111111111 124665543321 233555433 1 11123455444455789999999998
Q ss_pred cc
Q psy1890 118 GF 119 (242)
Q Consensus 118 Gl 119 (242)
=.
T Consensus 391 H~ 392 (591)
T TIGR00181 391 HS 392 (591)
T ss_pred HH
Confidence 44
No 47
>PTZ00337 surface protease GP63; Provisional
Probab=88.07 E-value=0.23 Score=48.17 Aligned_cols=17 Identities=41% Similarity=0.501 Sum_probs=14.7
Q ss_pred ccchhHHHHHHHhcccc
Q psy1890 105 VMGTVQHETLHALGFWH 121 (242)
Q Consensus 105 ~~~~i~HEigHALGl~H 121 (242)
...+++|||.|||||..
T Consensus 229 ~~~v~~HEi~HALGFs~ 245 (567)
T PTZ00337 229 DVRVAAHELGHALGFVR 245 (567)
T ss_pred HHHHHHHHHHHHHccCH
Confidence 46899999999999944
No 48
>KOG4525|consensus
Probab=87.16 E-value=0.24 Score=46.07 Aligned_cols=20 Identities=40% Similarity=0.552 Sum_probs=17.4
Q ss_pred cchhHHHHHHHhcccccccC
Q psy1890 106 MGTVQHETLHALGFWHEQAR 125 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~r 125 (242)
.|.++|||||.||+.|.-.-
T Consensus 302 lGA~~HElGH~lgcpHq~~G 321 (614)
T KOG4525|consen 302 LGAVCHELGHCLGCPHQSEG 321 (614)
T ss_pred HHHHHHHhhhccCCCCCCCc
Confidence 48899999999999997653
No 49
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=86.14 E-value=0.35 Score=47.08 Aligned_cols=17 Identities=53% Similarity=0.690 Sum_probs=14.6
Q ss_pred ccchhHHHHHHHhcccc
Q psy1890 105 VMGTVQHETLHALGFWH 121 (242)
Q Consensus 105 ~~~~i~HEigHALGl~H 121 (242)
...+++|||.|||||.-
T Consensus 256 ~~rv~~HEi~HALGFS~ 272 (622)
T PTZ00257 256 TTRTVTHEVAHALGFSS 272 (622)
T ss_pred HHHHHHHHHHHHhcCCH
Confidence 46899999999999943
No 50
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=85.98 E-value=1 Score=30.37 Aligned_cols=40 Identities=13% Similarity=0.150 Sum_probs=31.6
Q ss_pred ceEEeCCCC-cccchhHHHHHHHhcccccccCCCCCceEEE
Q psy1890 95 SYVNLGKNC-FVMGTVQHETLHALGFWHEQARPDRDRFVDI 134 (242)
Q Consensus 95 q~i~l~~~c-~~~~~i~HEigHALGl~HEh~rpDRD~yv~i 134 (242)
+.+.|.+.. .....++|++.+.+||.|+-.-...+.||.|
T Consensus 18 ~~l~f~p~lt~~eR~~vH~~a~~~gL~s~S~G~g~~R~v~v 58 (60)
T cd02640 18 RDMVFSPEFSKEERALIHQIAQKYGLKSRSYGSGNDRYLVI 58 (60)
T ss_pred ceEEcCCCCCHHHHHHHHHHHHHcCCceeeEeCCCCeEEEE
Confidence 567776533 3358999999999999999888888888765
No 51
>cd06007 R3H_DEXH_helicase R3H domain of a group of proteins which also contain a DEXH-box helicase domain, and may function as ATP-dependent DNA or RNA helicases. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=84.97 E-value=1.5 Score=29.39 Aligned_cols=40 Identities=13% Similarity=0.113 Sum_probs=30.7
Q ss_pred ceEEeCCC-CcccchhHHHHHHHhcccccccCCCCCceEEE
Q psy1890 95 SYVNLGKN-CFVMGTVQHETLHALGFWHEQARPDRDRFVDI 134 (242)
Q Consensus 95 q~i~l~~~-c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i 134 (242)
+.+.+.++ ......++|++.+.+||.|+-.-...+.||.|
T Consensus 17 ~~l~Fpp~ls~~eR~~vH~~a~~~gL~s~S~G~g~~R~v~v 57 (59)
T cd06007 17 EEYEFPSSLTNHERAVIHRLCRKLGLKSKSKGKGSNRRLSV 57 (59)
T ss_pred cEEEcCCCCCHHHHHHHHHHHHHcCCCceeecCCCCeEEEE
Confidence 55666653 23458999999999999999877777777765
No 52
>KOG3607|consensus
Probab=82.67 E-value=0.6 Score=46.51 Aligned_cols=18 Identities=33% Similarity=0.519 Sum_probs=16.7
Q ss_pred cchhHHHHHHHhcccccc
Q psy1890 106 MGTVQHETLHALGFWHEQ 123 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh 123 (242)
..+++|||||.||+.|.-
T Consensus 324 a~v~AhelgH~lGm~hD~ 341 (716)
T KOG3607|consen 324 AVVLAHELGHNLGMIHDE 341 (716)
T ss_pred HHHHHHHHHhhcCccccc
Confidence 479999999999999987
No 53
>cd02642 R3H_encore_like R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=81.75 E-value=2.4 Score=28.63 Aligned_cols=40 Identities=13% Similarity=0.114 Sum_probs=31.4
Q ss_pred ceEEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEEe
Q psy1890 95 SYVNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDIH 135 (242)
Q Consensus 95 q~i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~ 135 (242)
+.+.|.+-......++|++....||.||.+... +.+|.|.
T Consensus 22 ~~~~f~pm~sy~RllvH~la~~~gL~s~s~~~~-~r~vvv~ 61 (63)
T cd02642 22 QSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSG-GKCVIVN 61 (63)
T ss_pred CeeEcCCCCcHHHHHHHHHHHHhCCeeEeecCC-ceEEEEE
Confidence 445555534456899999999999999999987 7788764
No 54
>cd02646 R3H_G-patch R3H domain of a group of fungal and plant proteins with unknown function, who also contain a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the R3H domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=80.59 E-value=2.5 Score=28.04 Aligned_cols=40 Identities=10% Similarity=0.085 Sum_probs=31.1
Q ss_pred ceEEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEE
Q psy1890 95 SYVNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDI 134 (242)
Q Consensus 95 q~i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i 134 (242)
+.+.+.+=......++|+|.+++||.++..-...+.||.|
T Consensus 17 ~~~~fppm~~~~R~~vH~lA~~~~L~S~S~G~g~~R~v~v 56 (58)
T cd02646 17 DSLSFPPMDKHGRKTIHKLANCYNLKSKSRGKGKKRFVTV 56 (58)
T ss_pred ceEecCCCCHHHHHHHHHHHHHcCCcccccccCCceEEEE
Confidence 4566655333347899999999999999988888888766
No 55
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=80.55 E-value=0.87 Score=41.02 Aligned_cols=23 Identities=35% Similarity=0.355 Sum_probs=19.2
Q ss_pred CCcccchhHHHHHHHhccccccc
Q psy1890 102 NCFVMGTVQHETLHALGFWHEQA 124 (242)
Q Consensus 102 ~c~~~~~i~HEigHALGl~HEh~ 124 (242)
.....-|++||++|-+|+..|.-
T Consensus 193 ~~~~P~T~~HElAHq~G~a~E~E 215 (318)
T PF12725_consen 193 PYSLPFTICHELAHQLGFASEDE 215 (318)
T ss_pred cccccHHHHHHHHHHhCCCCHHH
Confidence 34456899999999999999864
No 56
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=80.32 E-value=1.3 Score=42.69 Aligned_cols=24 Identities=21% Similarity=0.279 Sum_probs=16.8
Q ss_pred EEeCCCCcccchhHHHHHHHhccc
Q psy1890 97 VNLGKNCFVMGTVQHETLHALGFW 120 (242)
Q Consensus 97 i~l~~~c~~~~~i~HEigHALGl~ 120 (242)
+|+...-..+.|++||+|||+=..
T Consensus 329 ~Nf~~t~~dv~TL~HElGHa~H~~ 352 (549)
T TIGR02289 329 SNFNGTSGDIDVLTHEAGHAFHVY 352 (549)
T ss_pred EeCCCChhHHHHHHHHhhHHHHHH
Confidence 455444445689999999998443
No 57
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=79.52 E-value=0.98 Score=39.63 Aligned_cols=15 Identities=33% Similarity=0.275 Sum_probs=11.2
Q ss_pred cchhHHHHHHHhccc
Q psy1890 106 MGTVQHETLHALGFW 120 (242)
Q Consensus 106 ~~~i~HEigHALGl~ 120 (242)
..+++|||||++|-.
T Consensus 217 ~~v~vHE~GHsf~~L 231 (264)
T PF09471_consen 217 KQVVVHEFGHSFGGL 231 (264)
T ss_dssp HHHHHHHHHHHTT--
T ss_pred cceeeeecccccccc
Confidence 478999999977643
No 58
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=77.58 E-value=2 Score=36.63 Aligned_cols=24 Identities=25% Similarity=0.461 Sum_probs=17.5
Q ss_pred ceEEeCCCCc------ccchhHHHHHHHhc
Q psy1890 95 SYVNLGKNCF------VMGTVQHETLHALG 118 (242)
Q Consensus 95 q~i~l~~~c~------~~~~i~HEigHALG 118 (242)
..++|++... ..++++||.|||+.
T Consensus 73 k~vrLS~~vy~~~SiaAvaVAAHEvGHAiQ 102 (222)
T PF04298_consen 73 KVVRLSEDVYNGRSIAAVAVAAHEVGHAIQ 102 (222)
T ss_pred CEEEeCCccCCCCCHHHHHHHHHHHhHHHh
Confidence 6777754322 35899999999985
No 59
>PF03272 Enhancin: Viral enhancin protein; InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=77.47 E-value=1.7 Score=43.76 Aligned_cols=134 Identities=10% Similarity=0.022 Sum_probs=69.8
Q ss_pred CCCCcccccC--CCCCCCCCeEEEEECC-C-----CCHHHHHHHH---------------HHHHHHhccccEEEEecCCC
Q psy1890 11 HDGVKNLAKT--PAKLWKDGLIYYEIDK-N-----FSVDRKRFIV---------------KALNNIMTFSCVRFRRRTHN 67 (242)
Q Consensus 11 ~~~~R~~~~~--~~~~Wp~~~I~Y~~~~-~-----~~~~~~~~I~---------------~A~~~w~~~tci~F~~~~~~ 67 (242)
+|+-|.+... -...|.+..-+|.+-+ + .+...+..++ +.++.+..-+.|.|.+....
T Consensus 114 LP~y~~g~~~~~F~~~w~~~~~~fa~ve~~~i~lLVP~~Dk~~l~~~~~~~l~~L~~~Y~~i~~~Yd~l~Gl~~~~~~~~ 193 (775)
T PF03272_consen 114 LPVYRCGQNEEEFFSEWDDSDSSFALVELDYIQLLVPPADKPNLNNKDFKSLDELIDFYNDIFKFYDDLTGLSDDPSDPV 193 (775)
T ss_pred CCEEecCCCHHHHhhhhhhcCCceEEEECCEEEEEeCcchHHHHhhhcccCHHHHHHHHHHHHHHHHhhhCCCCCCCccc
Confidence 4444554443 2347988777888743 2 1333444444 45566666788888876531
Q ss_pred C---CCeEEEEccCCCCCCCCccCCCC-CccceEEeCCC---CcccchhHHHHHHHhcccccccCCCCCceEEEeeeccc
Q psy1890 68 Q---TPYIVYKESPNGKSCSSNVGFTG-KKVSYVNLGKN---CFVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIK 140 (242)
Q Consensus 68 ~---~~~I~i~~~~~~~gc~S~vG~~~-g~~q~i~l~~~---c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~ 140 (242)
. ...-.++.+.+|.| -+|.|..= +. ..-+++.. -..-..++|||||+--+.-.++.+ ++.=.|+||.
T Consensus 194 ~~n~~~kYF~KAD~~G~G-~AYY~~~w~a~-ss~s~~~~L~~~~~nW~~LHEIgHgYd~~F~~n~~----~~~EVWnNI~ 267 (775)
T PF03272_consen 194 DKNFNNKYFAKADKSGPG-AAYYGSNWTAQ-SSSSLSFYLNPSPTNWGALHEIGHGYDFGFTRNGT----YLNEVWNNIL 267 (775)
T ss_pred ccccccceEEEecCCCCC-Cccccccceec-CchhHHHHhCCCCCCchhhhhhhhhcceeEeeCCc----chhhhhhhhh
Confidence 1 11223333222322 34554421 11 11122100 112357899999998887776644 2336799998
Q ss_pred ccccccccccC
Q psy1890 141 KGSEINFKKRN 151 (242)
Q Consensus 141 ~~~~~nf~k~~ 151 (242)
+.. ++|..+.
T Consensus 268 ~d~-yQ~~~~~ 277 (775)
T PF03272_consen 268 ADR-YQYTYMT 277 (775)
T ss_pred hhh-hhhhhcC
Confidence 643 4444443
No 60
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=77.43 E-value=1.2 Score=37.88 Aligned_cols=16 Identities=44% Similarity=0.420 Sum_probs=12.4
Q ss_pred chhHHHHHHHhccccccc
Q psy1890 107 GTVQHETLHALGFWHEQA 124 (242)
Q Consensus 107 ~~i~HEigHALGl~HEh~ 124 (242)
=|++|||||+| .|...
T Consensus 74 FtlAHELGH~l--lH~~~ 89 (213)
T COG2856 74 FTLAHELGHAL--LHTDL 89 (213)
T ss_pred HHHHHHHhHHH--hcccc
Confidence 58999999997 44444
No 61
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=76.19 E-value=1.4 Score=36.96 Aligned_cols=22 Identities=32% Similarity=0.234 Sum_probs=16.7
Q ss_pred cchhHHHHHHHhcccccccCCC
Q psy1890 106 MGTVQHETLHALGFWHEQARPD 127 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rpD 127 (242)
..++.||+|||+-=.-.-.|..
T Consensus 23 l~t~~HE~gHal~a~l~G~~v~ 44 (200)
T PF13398_consen 23 LVTFVHELGHALAALLTGGRVK 44 (200)
T ss_pred HHHHHHHHHHHHHHHHhCCCcc
Confidence 3799999999987666555544
No 62
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=74.95 E-value=1.6 Score=32.45 Aligned_cols=19 Identities=32% Similarity=0.426 Sum_probs=14.0
Q ss_pred cchhHHHHHHHhccccccc
Q psy1890 106 MGTVQHETLHALGFWHEQA 124 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~ 124 (242)
..|++||++|-+|+.-|.-
T Consensus 74 ~~tlvhEiah~fG~~~e~l 92 (97)
T PF06262_consen 74 RDTLVHEIAHHFGISDEDL 92 (97)
T ss_dssp HHHHHHHHHHHTT--HHHH
T ss_pred HHHHHHHHHHHcCCCHHHh
Confidence 3689999999999876643
No 63
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=73.86 E-value=1.8 Score=31.77 Aligned_cols=23 Identities=26% Similarity=0.160 Sum_probs=15.2
Q ss_pred ceEEeCCCC---cccchhHHHHHHHh
Q psy1890 95 SYVNLGKNC---FVMGTVQHETLHAL 117 (242)
Q Consensus 95 q~i~l~~~c---~~~~~i~HEigHAL 117 (242)
..|-+.... ...-+++|||||.+
T Consensus 29 ~~I~in~~~~~~~~~f~laHELgH~~ 54 (122)
T PF06114_consen 29 PIIFINSNLSPERQRFTLAHELGHIL 54 (122)
T ss_dssp TEEEEESSS-HHHHHHHHHHHHHHHH
T ss_pred CEEEECCCCCHHHHHHHHHHHHHHHH
Confidence 455554432 22469999999987
No 64
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=73.71 E-value=9.4 Score=31.41 Aligned_cols=19 Identities=26% Similarity=0.113 Sum_probs=15.5
Q ss_pred ccchhHHHHHHHhcccccc
Q psy1890 105 VMGTVQHETLHALGFWHEQ 123 (242)
Q Consensus 105 ~~~~i~HEigHALGl~HEh 123 (242)
...+|+|||.|..-..|.-
T Consensus 164 idYVvvHEL~Hl~~~nHs~ 182 (205)
T PF01863_consen 164 IDYVVVHELCHLRHPNHSK 182 (205)
T ss_pred HHHHHHHHHHHhccCCCCH
Confidence 4589999999999877654
No 65
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=71.58 E-value=2.1 Score=36.52 Aligned_cols=19 Identities=37% Similarity=0.375 Sum_probs=13.4
Q ss_pred cchhHHHHHHHhcccccccCC
Q psy1890 106 MGTVQHETLHALGFWHEQARP 126 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rp 126 (242)
..|+.||+|||| .++.+=|
T Consensus 93 ~~~l~HE~GHAl--I~~~~lP 111 (220)
T PF14247_consen 93 LFTLYHELGHAL--IDDLDLP 111 (220)
T ss_pred HHHHHHHHHHHH--HHHhcCC
Confidence 478999999997 4444433
No 66
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=68.32 E-value=2.2 Score=32.91 Aligned_cols=17 Identities=24% Similarity=0.356 Sum_probs=14.3
Q ss_pred cchhHHHHHHHhccccc
Q psy1890 106 MGTVQHETLHALGFWHE 122 (242)
Q Consensus 106 ~~~i~HEigHALGl~HE 122 (242)
.-+++|||||-+|+.-+
T Consensus 110 thvliHEIgHhFGLsDd 126 (136)
T COG3824 110 THVLIHEIGHHFGLSDD 126 (136)
T ss_pred hhhhhhhhhhhcCCChh
Confidence 36899999999999754
No 67
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=68.30 E-value=2.5 Score=39.68 Aligned_cols=21 Identities=24% Similarity=0.192 Sum_probs=15.4
Q ss_pred ccchhHHHHHHHhcccccccC
Q psy1890 105 VMGTVQHETLHALGFWHEQAR 125 (242)
Q Consensus 105 ~~~~i~HEigHALGl~HEh~r 125 (242)
...|+.||||||+=..=-.++
T Consensus 242 ~v~tLfHE~GHa~H~~ls~~~ 262 (458)
T PF01432_consen 242 DVETLFHEFGHAMHSLLSRTK 262 (458)
T ss_dssp HHHHHHHHHHHHHHHHHCCCS
T ss_pred hHHHHHHHHhHHHHHHHhccc
Confidence 357999999999866544443
No 68
>PF11667 DUF3267: Protein of unknown function (DUF3267); InterPro: IPR021683 This family of proteins has no known function.
Probab=67.40 E-value=5 Score=30.06 Aligned_cols=20 Identities=30% Similarity=0.245 Sum_probs=18.1
Q ss_pred chhHHHHHHHhcccccccCC
Q psy1890 107 GTVQHETLHALGFWHEQARP 126 (242)
Q Consensus 107 ~~i~HEigHALGl~HEh~rp 126 (242)
..++||+.|+|.+...-.++
T Consensus 6 ~~~~HEliH~l~~~~~~~~~ 25 (111)
T PF11667_consen 6 LIPLHELIHGLFFKLFGKKP 25 (111)
T ss_pred eHHHHHHHHHHHHHHhCCCC
Confidence 46899999999999998877
No 69
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=65.98 E-value=3.1 Score=40.74 Aligned_cols=78 Identities=18% Similarity=0.117 Sum_probs=37.0
Q ss_pred HHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCC-CCCCCCccCCCCCc-cceEEeCCCCcccchhHHHHHHHhc
Q psy1890 41 DRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPN-GKSCSSNVGFTGKK-VSYVNLGKNCFVMGTVQHETLHALG 118 (242)
Q Consensus 41 ~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~-~~gc~S~vG~~~g~-~q~i~l~~~c~~~~~i~HEigHALG 118 (242)
..++.+.++++......+=.|++.- +..||-+..+++ .+|-+|.-...+.. ---||+...-..+.|++||+||++=
T Consensus 316 ea~~~v~~~l~~lg~ey~~~~~~a~--~~~WiD~~~~~gKrsGaYs~~~~~~~~p~IlmN~~gt~~dV~TLaHElGHs~H 393 (598)
T COG1164 316 EAKELVLKALAPLGPEYAKIARRAF--DERWIDVYPRKGKRSGAYSIGFYKGDHPFILMNYDGTLRDVFTLAHELGHSVH 393 (598)
T ss_pred HHHHHHHHHHHhhCHHHHHHHHHHH--hcCCeeccCCCCCCCCcccCCCCCCCCCeEEEeCCCchhHHHHHHHHccHHHH
Confidence 4566666666655543222122221 225555533321 13323211111122 1234544334457899999999986
Q ss_pred cc
Q psy1890 119 FW 120 (242)
Q Consensus 119 l~ 120 (242)
..
T Consensus 394 s~ 395 (598)
T COG1164 394 SY 395 (598)
T ss_pred HH
Confidence 53
No 70
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=65.90 E-value=3.2 Score=33.94 Aligned_cols=23 Identities=30% Similarity=0.234 Sum_probs=16.3
Q ss_pred cchhHHHHHHHhcccccccCCCC
Q psy1890 106 MGTVQHETLHALGFWHEQARPDR 128 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rpDR 128 (242)
..++.||+||++--.....+..+
T Consensus 8 i~i~~HE~gH~~~a~~~G~~~~~ 30 (192)
T PF02163_consen 8 ISIVLHELGHALAARLYGDKVPR 30 (192)
T ss_dssp HHHHHHHHHHHHHHHTTT--B--
T ss_pred ccccccccccccccccccccccc
Confidence 36899999999998887776543
No 71
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=64.49 E-value=3.3 Score=37.54 Aligned_cols=14 Identities=36% Similarity=0.323 Sum_probs=11.7
Q ss_pred ccchhHHHHHHHhc
Q psy1890 105 VMGTVQHETLHALG 118 (242)
Q Consensus 105 ~~~~i~HEigHALG 118 (242)
...|+.||+|||+=
T Consensus 154 ~v~tl~HE~GHa~h 167 (365)
T cd06258 154 DINTLFHEFGHAVH 167 (365)
T ss_pred HHHHHHHHHhHHHH
Confidence 35789999999983
No 72
>TIGR00043 metalloprotein, YbeY/UPF0054 family. This minimally protein family is represented by a single member sequence only in nearly every bacterium. Crystallography demonstrates metal-binding activity, possibly to nickel. It is a predicted to be a metallohydrolase, and somewhat more weakly to be a metalloproteinase.
Probab=62.86 E-value=4.5 Score=30.60 Aligned_cols=18 Identities=39% Similarity=0.530 Sum_probs=15.9
Q ss_pred chhHHHHHHHhccccccc
Q psy1890 107 GTVQHETLHALGFWHEQA 124 (242)
Q Consensus 107 ~~i~HEigHALGl~HEh~ 124 (242)
..++|=+.|.|||-|+..
T Consensus 75 ~l~iHG~LHLlGyDH~~~ 92 (110)
T TIGR00043 75 HLTVHGLLHLLGYDHETE 92 (110)
T ss_pred HHHHHHHHHHcCCCCCCh
Confidence 679999999999999853
No 73
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=62.44 E-value=5.6 Score=33.65 Aligned_cols=76 Identities=26% Similarity=0.209 Sum_probs=45.1
Q ss_pred chhHHHHHHHhcccccccCCCCCceEEEeeecccccccc-cccccCccc----cccCCCcCCCCcccccCCCccccCCCC
Q psy1890 107 GTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEI-NFKKRNFKE----ATTLDLPYDFGSIMHYSRFAFSRDSKS 181 (242)
Q Consensus 107 ~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~-nf~k~~~~~----~~~~g~~yD~~SIMhY~~~~f~~~g~~ 181 (242)
-+.-|+++|++|+.|--.|-| |+ +.++... +....-.++ ....| --|..|.|.|+..
T Consensus 155 l~~~efl~~~~~~~~lnhr~~------i~---~~pg~~~e~L~~tarhElGhaLgi~g-hsd~ks~aly~~s-------- 216 (236)
T COG5549 155 LIAYEFLGHALGLGHLNHRGD------IM---YPPGELRENLNPTARHELGHALGIWG-HSDLKSDALYGIS-------- 216 (236)
T ss_pred HHHHHHhhhccCccccccccc------cc---CCcccchhhhhHHHHHhhcchheecc-cccchhhhhcchh--------
Confidence 356789999999999888866 22 2222211 111000000 11122 2478888888742
Q ss_pred CeeeeccCCCccccccCCcCcccHHHHHHHHhHCCCC
Q psy1890 182 PTITPKKTLNVQADKMGQRINISKMDIAKLNTLYKCP 218 (242)
Q Consensus 182 ~ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY~C~ 218 (242)
..+.|..|++.|.++|.-|
T Consensus 217 ------------------~~~is~rdv~tL~r~Yeqp 235 (236)
T COG5549 217 ------------------WQGISRRDVKTLERKYEQP 235 (236)
T ss_pred ------------------hcccCHHHHHHHHHHhcCC
Confidence 1138899999999999654
No 74
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=61.78 E-value=4.1 Score=38.63 Aligned_cols=14 Identities=29% Similarity=0.226 Sum_probs=11.6
Q ss_pred ccchhHHHHHHHhc
Q psy1890 105 VMGTVQHETLHALG 118 (242)
Q Consensus 105 ~~~~i~HEigHALG 118 (242)
...|+.|||||||=
T Consensus 263 ~V~TLfHEfGHalH 276 (472)
T cd06455 263 EVETFFHEFGHVIH 276 (472)
T ss_pred HHHHHHHHHHHHHH
Confidence 35789999999993
No 75
>PF15639 Tox-MPTase3: Metallopeptidase toxin 3
Probab=61.23 E-value=19 Score=28.07 Aligned_cols=69 Identities=20% Similarity=0.265 Sum_probs=40.4
Q ss_pred HHHHHHHHHHHhccccEEEEecCC-CCCCeEEEEccCCCCCCCCccCCCCC--ccceEEeC----------CC-------
Q psy1890 43 KRFIVKALNNIMTFSCVRFRRRTH-NQTPYIVYKESPNGKSCSSNVGFTGK--KVSYVNLG----------KN------- 102 (242)
Q Consensus 43 ~~~I~~A~~~w~~~tci~F~~~~~-~~~~~I~i~~~~~~~gc~S~vG~~~g--~~q~i~l~----------~~------- 102 (242)
...|-+|+..|-..+-....+.-. +....|.|... ..|| ..|.... ..|.|.+. ++
T Consensus 22 ~p~iv~A~~K~~~l~~~~l~~~l~WGkGP~IkI~~~--~~~~--A~G~t~p~~~~~~i~I~~~lV~~~Ek~~~~~r~~~~ 97 (135)
T PF15639_consen 22 VPAIVKAYKKWGGLNKAKLQEALTWGKGPYIKITPL--VGGT--AYGYTTPDDGSQVIRIDGDLVNMFEKGPGERRAGNV 97 (135)
T ss_pred ChHHHHHHHHHcccchhhhhhccccCCCCEEEEeec--ccce--eeeeecCCCCCcEEEeeHHHhhhhhcCCcccccceE
Confidence 457778888877655444443322 34578999876 5677 3444321 13666652 11
Q ss_pred CcccchhHHHHHH
Q psy1890 103 CFVMGTVQHETLH 115 (242)
Q Consensus 103 c~~~~~i~HEigH 115 (242)
-....|++||+-|
T Consensus 98 ~~v~~TlLHEliH 110 (135)
T PF15639_consen 98 YLVGSTLLHELIH 110 (135)
T ss_pred EEeehHHHHHHHH
Confidence 0123799999988
No 76
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=60.97 E-value=4.5 Score=32.83 Aligned_cols=22 Identities=23% Similarity=0.123 Sum_probs=16.8
Q ss_pred cchhHHHHHHHhcccccccCCC
Q psy1890 106 MGTVQHETLHALGFWHEQARPD 127 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rpD 127 (242)
..++.||+||++=-.+.-.+..
T Consensus 9 i~i~iHE~gH~~~A~~~G~~~~ 30 (180)
T cd05709 9 ISVTVHELGHALVARRLGVKVA 30 (180)
T ss_pred HHHHHHHHHHHHHHHHcCCCch
Confidence 3678999999997776666544
No 77
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=60.95 E-value=3.9 Score=31.11 Aligned_cols=37 Identities=30% Similarity=0.438 Sum_probs=22.2
Q ss_pred CCCCCCccCCCCCccceEEeCCCCcccchhHHHHHHHhccc
Q psy1890 80 GKSCSSNVGFTGKKVSYVNLGKNCFVMGTVQHETLHALGFW 120 (242)
Q Consensus 80 ~~gc~S~vG~~~g~~q~i~l~~~c~~~~~i~HEigHALGl~ 120 (242)
++-|.|++|..--.+. +-..|| ..|+.||+.|+..|.
T Consensus 58 ~~LCG~~~~~i~IDP~--~~~KGC--~~TL~HEL~H~WQ~R 94 (141)
T PHA02456 58 QDLCGQFVGWIEIDPD--YANKGC--RDTLAHELNHAWQFR 94 (141)
T ss_pred cchhhcceeEEEECCc--ccccch--HHHHHHHHHHHHhhh
Confidence 4557676665321100 012467 469999999998764
No 78
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=59.92 E-value=4.5 Score=38.25 Aligned_cols=15 Identities=27% Similarity=0.355 Sum_probs=12.9
Q ss_pred ccchhHHHHHHHhcc
Q psy1890 105 VMGTVQHETLHALGF 119 (242)
Q Consensus 105 ~~~~i~HEigHALGl 119 (242)
..++++|||||..|=
T Consensus 130 lagViAHEigHv~qr 144 (484)
T COG4783 130 LAGVIAHEIGHVAQR 144 (484)
T ss_pred HHHHHHHHHHHHhhh
Confidence 459999999998874
No 79
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=59.60 E-value=5.2 Score=32.02 Aligned_cols=12 Identities=33% Similarity=0.595 Sum_probs=10.4
Q ss_pred cchhHHHHHHHh
Q psy1890 106 MGTVQHETLHAL 117 (242)
Q Consensus 106 ~~~i~HEigHAL 117 (242)
.-+++|||.|++
T Consensus 136 lDVvaHEltHGV 147 (150)
T PF01447_consen 136 LDVVAHELTHGV 147 (150)
T ss_dssp HHHHHHHHHHHH
T ss_pred cceeeecccccc
Confidence 469999999985
No 80
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=59.45 E-value=4.7 Score=38.08 Aligned_cols=18 Identities=28% Similarity=0.222 Sum_probs=13.8
Q ss_pred cchhHHHHHHHhcccccc
Q psy1890 106 MGTVQHETLHALGFWHEQ 123 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh 123 (242)
..|+.|||||||=..-.+
T Consensus 249 v~TLfHEfGHalH~~ls~ 266 (458)
T cd06457 249 VETLFHEMGHAMHSMLGR 266 (458)
T ss_pred HHHHHHHHhHHHHHHHcC
Confidence 478999999998654443
No 81
>COG4412 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=58.20 E-value=4.1 Score=39.13 Aligned_cols=24 Identities=13% Similarity=0.375 Sum_probs=18.3
Q ss_pred CHHHHHHHHHHHHHHhccccEEEE
Q psy1890 39 SVDRKRFIVKALNNIMTFSCVRFR 62 (242)
Q Consensus 39 ~~~~~~~I~~A~~~w~~~tci~F~ 62 (242)
+...|+.+++|+.+-.++..|...
T Consensus 234 ~~~ar~lvkeal~~~ak~~~I~l~ 257 (760)
T COG4412 234 PSGARKLVKEALNTAAKHQAIQLQ 257 (760)
T ss_pred ChhHHHHHHHHHHHHhhcccCChh
Confidence 456899999999998886655543
No 82
>PF02130 UPF0054: Uncharacterized protein family UPF0054; InterPro: IPR002036 These, as yet, uncharacterised proteins are of 17 to 21 kDa. They contain a conserved region with three histidines at the C terminus. The protein family is represented by a single member sequence only in nearly every bacterium. The crystal structure of the protein from the hyperthermophilic bacteria Aquifex aeolicus has been determined. The overall fold consists of one central alpha-helix surrounded by a four-stranded beta-sheet and four other alpha-helices. Structure-based homology analysis reveals a good resemblance to the metal-dependent proteinases such as collagenases and gelatinases. However, experimental tests for collagenase and gelatinase-type function show no detectable activity under standard assay conditions [].; GO: 0046872 metal ion binding; PDB: 1TVI_A 1OZ9_A 1XM5_A 1XAX_A.
Probab=58.15 E-value=6 Score=31.37 Aligned_cols=16 Identities=44% Similarity=0.642 Sum_probs=14.1
Q ss_pred chhHHHHHHHhccccc
Q psy1890 107 GTVQHETLHALGFWHE 122 (242)
Q Consensus 107 ~~i~HEigHALGl~HE 122 (242)
..++|=+.|.||+-|+
T Consensus 110 ~l~vHG~LHLlGyDH~ 125 (145)
T PF02130_consen 110 RLLVHGLLHLLGYDHE 125 (145)
T ss_dssp HHHHHHHHHHTT-SST
T ss_pred HHHHHHHHHHcCCCCC
Confidence 6799999999999999
No 83
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=55.17 E-value=6.6 Score=32.42 Aligned_cols=22 Identities=18% Similarity=-0.065 Sum_probs=15.8
Q ss_pred cchhHHHHHHHhcccccccCCC
Q psy1890 106 MGTVQHETLHALGFWHEQARPD 127 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rpD 127 (242)
..++.||+||.+=-.-...+.+
T Consensus 10 ~~v~iHElGH~~~Ar~~Gv~v~ 31 (182)
T cd06163 10 ILIFVHELGHFLVAKLFGVKVE 31 (182)
T ss_pred HHHHHHHHHHHHHHHHcCCeee
Confidence 3579999999876665555543
No 84
>COG0319 Predicted metal-dependent hydrolase [General function prediction only]
Probab=54.83 E-value=7.3 Score=31.35 Aligned_cols=18 Identities=39% Similarity=0.562 Sum_probs=16.5
Q ss_pred chhHHHHHHHhccccccc
Q psy1890 107 GTVQHETLHALGFWHEQA 124 (242)
Q Consensus 107 ~~i~HEigHALGl~HEh~ 124 (242)
..++|=+.|.||+-|+..
T Consensus 110 ~l~vHG~LHLlGYDH~~~ 127 (153)
T COG0319 110 HLTIHGILHLLGYDHEED 127 (153)
T ss_pred HHHHHHHHHHccCCCCCc
Confidence 679999999999999986
No 85
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=54.62 E-value=6.4 Score=36.82 Aligned_cols=14 Identities=43% Similarity=0.342 Sum_probs=11.7
Q ss_pred ccchhHHHHHHHhc
Q psy1890 105 VMGTVQHETLHALG 118 (242)
Q Consensus 105 ~~~~i~HEigHALG 118 (242)
...|+.|||||||=
T Consensus 208 ~v~tLfHEfGHalH 221 (422)
T cd06456 208 EVTTLFHEFGHALH 221 (422)
T ss_pred HHHHHHHHHHHHHH
Confidence 35789999999994
No 86
>KOG3658|consensus
Probab=54.48 E-value=4 Score=40.08 Aligned_cols=18 Identities=33% Similarity=0.303 Sum_probs=15.5
Q ss_pred cchhHHHHHHHhcccccc
Q psy1890 106 MGTVQHETLHALGFWHEQ 123 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh 123 (242)
.-|++|||||.+|-.|.-
T Consensus 393 ~lt~AHEiGHNfGSpHDp 410 (764)
T KOG3658|consen 393 DLTLAHEIGHNFGSPHDP 410 (764)
T ss_pred heeehhhhccccCCCCCC
Confidence 368999999999999963
No 87
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=54.11 E-value=7.3 Score=28.72 Aligned_cols=12 Identities=25% Similarity=0.274 Sum_probs=10.7
Q ss_pred cchhHHHHHHHh
Q psy1890 106 MGTVQHETLHAL 117 (242)
Q Consensus 106 ~~~i~HEigHAL 117 (242)
..+++||+.|++
T Consensus 26 ~~~l~HE~~H~~ 37 (128)
T PF13485_consen 26 DRVLAHELAHQW 37 (128)
T ss_pred HHHHHHHHHHHH
Confidence 389999999996
No 88
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=53.69 E-value=6.5 Score=32.94 Aligned_cols=24 Identities=25% Similarity=0.338 Sum_probs=16.5
Q ss_pred ceEEeCC------CCcccchhHHHHHHHhc
Q psy1890 95 SYVNLGK------NCFVMGTVQHETLHALG 118 (242)
Q Consensus 95 q~i~l~~------~c~~~~~i~HEigHALG 118 (242)
+.+.|.+ .....++++||.|||+.
T Consensus 76 kvvrLSe~~y~g~Sia~~aVAAHEVGHAiQ 105 (226)
T COG2738 76 KVVRLSEANYYGPSIAAIAVAAHEVGHAIQ 105 (226)
T ss_pred heeeccccccCCccHHHHHHHHHHhhHHHh
Confidence 5566643 22345899999999984
No 89
>PRK00016 metal-binding heat shock protein; Provisional
Probab=52.59 E-value=7.6 Score=31.35 Aligned_cols=17 Identities=35% Similarity=0.485 Sum_probs=15.4
Q ss_pred chhHHHHHHHhcccccc
Q psy1890 107 GTVQHETLHALGFWHEQ 123 (242)
Q Consensus 107 ~~i~HEigHALGl~HEh 123 (242)
..++|=++|.||+-|+.
T Consensus 115 ~l~iHG~LHLlGYDH~~ 131 (159)
T PRK00016 115 HLTVHGILHLLGYDHIE 131 (159)
T ss_pred HHHHHhhHHhcCCCCCC
Confidence 57999999999999975
No 90
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=52.38 E-value=7.5 Score=34.38 Aligned_cols=22 Identities=32% Similarity=0.370 Sum_probs=17.4
Q ss_pred cchhHHHHHHHhcccccccCCC
Q psy1890 106 MGTVQHETLHALGFWHEQARPD 127 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rpD 127 (242)
..++.||++||+=-..|..+-+
T Consensus 136 isvvvHElgHal~A~~~gi~V~ 157 (277)
T cd06162 136 ISGVVHEMGHGVAAVREQVRVN 157 (277)
T ss_pred HHHHHHHHHHHHHHHHcCCeec
Confidence 4789999999998887765544
No 91
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=52.26 E-value=7.9 Score=36.01 Aligned_cols=28 Identities=36% Similarity=0.310 Sum_probs=18.2
Q ss_pred eEEeCCC--CcccchhHHHHHHHhcccccccCC
Q psy1890 96 YVNLGKN--CFVMGTVQHETLHALGFWHEQARP 126 (242)
Q Consensus 96 ~i~l~~~--c~~~~~i~HEigHALGl~HEh~rp 126 (242)
+++.... ....-+++||+|||| |||+-|
T Consensus 148 tt~y~~~d~~~~l~t~iHE~GHal---ye~~l~ 177 (396)
T cd06460 148 TTRYDENDFRSALFSTIHETGHAL---YEQGLP 177 (396)
T ss_pred EeeeCCcchHHHHHHHHHHhhHHH---HHhcCC
Confidence 4444443 233468999999997 677544
No 92
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=51.96 E-value=7.6 Score=34.02 Aligned_cols=23 Identities=26% Similarity=0.267 Sum_probs=19.0
Q ss_pred cchhHHHHHHHhcccccccCCCC
Q psy1890 106 MGTVQHETLHALGFWHEQARPDR 128 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rpDR 128 (242)
..++.||+|||+=-.++..+.+.
T Consensus 119 isv~iHElgHa~~Ar~~G~~V~~ 141 (263)
T cd06159 119 VGVVVHELSHGILARVEGIKVKS 141 (263)
T ss_pred HHHHHHHHHHHHHHHHcCCEECc
Confidence 46899999999988888877653
No 93
>cd06006 R3H_unknown_2 R3H domain of a group of fungal proteins with unknown function. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=51.57 E-value=19 Score=24.12 Aligned_cols=38 Identities=18% Similarity=0.236 Sum_probs=29.8
Q ss_pred EEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEE
Q psy1890 97 VNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDI 134 (242)
Q Consensus 97 i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i 134 (242)
+.|.+=......++||+..+.||.=|..-+.-+.||-|
T Consensus 20 ~~f~pM~~~~R~~vHdla~~~gl~SeS~d~Ep~R~V~v 57 (59)
T cd06006 20 LRFPPMRSPQRAFIHELAKDYGLYSESQDPEPKRSVFV 57 (59)
T ss_pred eeCCCCCHHHHHHHHHHHHHcCCeeEecCCCCCcEEEE
Confidence 44444333458899999999999999999888888765
No 94
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=49.84 E-value=9 Score=31.75 Aligned_cols=12 Identities=25% Similarity=0.235 Sum_probs=10.9
Q ss_pred cchhHHHHHHHh
Q psy1890 106 MGTVQHETLHAL 117 (242)
Q Consensus 106 ~~~i~HEigHAL 117 (242)
.++++||+||..
T Consensus 90 ~aVlaHElgH~~ 101 (226)
T PF01435_consen 90 AAVLAHELGHIK 101 (226)
T ss_dssp HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHH
Confidence 489999999987
No 95
>PRK13963 unkown domain/putative metalloprotease fusion protein; Provisional
Probab=49.01 E-value=9.9 Score=33.15 Aligned_cols=16 Identities=44% Similarity=0.632 Sum_probs=15.1
Q ss_pred chhHHHHHHHhccccc
Q psy1890 107 GTVQHETLHALGFWHE 122 (242)
Q Consensus 107 ~~i~HEigHALGl~HE 122 (242)
..++|=++|.|||-|+
T Consensus 217 ~LlIHGlLHLLGYDHe 232 (258)
T PRK13963 217 HLLVHGALHAQGYDHE 232 (258)
T ss_pred HHHHHHHHHHcCCCCC
Confidence 6799999999999999
No 96
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=48.38 E-value=9.6 Score=31.33 Aligned_cols=13 Identities=31% Similarity=0.319 Sum_probs=11.0
Q ss_pred cchhHHHHHHHhc
Q psy1890 106 MGTVQHETLHALG 118 (242)
Q Consensus 106 ~~~i~HEigHALG 118 (242)
.+.++||++||+=
T Consensus 10 ~~i~~HE~aHa~~ 22 (181)
T cd06158 10 LAITLHEFAHAYV 22 (181)
T ss_pred HHHHHHHHHHHHH
Confidence 4789999999983
No 97
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=47.59 E-value=9.9 Score=32.46 Aligned_cols=23 Identities=17% Similarity=0.011 Sum_probs=19.0
Q ss_pred cchhHHHHHHHhcccccccCCCC
Q psy1890 106 MGTVQHETLHALGFWHEQARPDR 128 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rpDR 128 (242)
..++.||+||++=..+...+..+
T Consensus 54 ~~v~iHElgH~~~A~~~G~~v~~ 76 (227)
T cd06164 54 ASVLLHELGHSLVARRYGIPVRS 76 (227)
T ss_pred HHHHHHHHHHHHHHHHcCCeECe
Confidence 47899999999988888877754
No 98
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=46.80 E-value=11 Score=29.73 Aligned_cols=15 Identities=33% Similarity=0.317 Sum_probs=12.7
Q ss_pred cchhHHHHHHHhccc
Q psy1890 106 MGTVQHETLHALGFW 120 (242)
Q Consensus 106 ~~~i~HEigHALGl~ 120 (242)
..|++||+.|++=+.
T Consensus 60 ~~~l~HEm~H~~~~~ 74 (146)
T smart00731 60 RETLLHELCHAALYL 74 (146)
T ss_pred HhhHHHHHHHHHHHH
Confidence 479999999998664
No 99
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=46.73 E-value=12 Score=26.56 Aligned_cols=14 Identities=21% Similarity=0.235 Sum_probs=11.8
Q ss_pred ccchhHHHHHHHhc
Q psy1890 105 VMGTVQHETLHALG 118 (242)
Q Consensus 105 ~~~~i~HEigHALG 118 (242)
....+.|||.|++-
T Consensus 61 ~~~llaHEl~Hv~Q 74 (79)
T PF13699_consen 61 GRALLAHELAHVVQ 74 (79)
T ss_pred cchhHhHHHHHHHh
Confidence 45799999999974
No 100
>KOG2921|consensus
Probab=46.32 E-value=10 Score=35.23 Aligned_cols=34 Identities=29% Similarity=0.214 Sum_probs=24.0
Q ss_pred cchhHHHHHHHhcccccccCCCCCceEEEeeeccccc
Q psy1890 106 MGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKG 142 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~ 142 (242)
...+.||+||||--.-|+-+- +++-|.---|.|+
T Consensus 132 i~~vvHElGHalAA~segV~v---ngfgIfi~aiyPg 165 (484)
T KOG2921|consen 132 ITVVVHELGHALAAASEGVQV---NGFGIFIAAIYPG 165 (484)
T ss_pred HHHHHHHhhHHHHHHhcCcee---eeeEEEEEEEcCc
Confidence 367899999999998888764 3445544445554
No 101
>cd02325 R3H R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. R3H domains are found in proteins together with ATPase domains, SF1 helicase domains, SF2 DEAH helicase domains, Cys-rich repeats, ring-type zinc fingers, and KH domains. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=46.16 E-value=32 Score=21.61 Aligned_cols=40 Identities=18% Similarity=0.092 Sum_probs=30.5
Q ss_pred ceEEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEE
Q psy1890 95 SYVNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDI 134 (242)
Q Consensus 95 q~i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i 134 (242)
..+.|.+-.....-++|++.-..||.++....+-+.+|.|
T Consensus 18 ~~~~~~p~~~~~R~~vH~la~~~~L~s~s~g~~~~r~v~i 57 (59)
T cd02325 18 KSLELPPMNSYERKLIHDLAEYYGLKSESEGEGPNRRVVI 57 (59)
T ss_pred CeEEcCCCCHHHHHHHHHHHHHCCCEEEEecCCCCcEEEE
Confidence 4455554333458899999999999999998887777765
No 102
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=45.96 E-value=11 Score=29.62 Aligned_cols=17 Identities=29% Similarity=0.491 Sum_probs=14.3
Q ss_pred ccchhHHHHHHHhcccc
Q psy1890 105 VMGTVQHETLHALGFWH 121 (242)
Q Consensus 105 ~~~~i~HEigHALGl~H 121 (242)
...|++||+.||+=+..
T Consensus 60 ~~~tL~HEm~H~~~~~~ 76 (157)
T PF10263_consen 60 LIDTLLHEMAHAAAYVF 76 (157)
T ss_pred HHHHHHHHHHHHHhhhc
Confidence 35899999999988765
No 103
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=45.79 E-value=21 Score=24.00 Aligned_cols=29 Identities=14% Similarity=0.095 Sum_probs=20.8
Q ss_pred ceEEeCCCCc-ccchhHHHHHHHhcccccc
Q psy1890 95 SYVNLGKNCF-VMGTVQHETLHALGFWHEQ 123 (242)
Q Consensus 95 q~i~l~~~c~-~~~~i~HEigHALGl~HEh 123 (242)
..+.++++-. ....++|.+.|-|||.|.-
T Consensus 18 ~eL~Fp~~ls~~eRriih~la~~lGL~~~s 47 (60)
T cd02639 18 DELAFPSSLSPAERRIVHLLASRLGLNHVS 47 (60)
T ss_pred eEEEcCCCCCHHHHHHHHHHHHHcCCceEE
Confidence 4455544322 3478999999999999983
No 104
>PRK10911 oligopeptidase A; Provisional
Probab=45.42 E-value=10 Score=37.75 Aligned_cols=12 Identities=42% Similarity=0.343 Sum_probs=10.8
Q ss_pred cchhHHHHHHHh
Q psy1890 106 MGTVQHETLHAL 117 (242)
Q Consensus 106 ~~~i~HEigHAL 117 (242)
+.|+.|||||||
T Consensus 464 v~tlfHEfGHal 475 (680)
T PRK10911 464 VITLFHEFGHGL 475 (680)
T ss_pred HHHHHHHHhHHH
Confidence 478999999998
No 105
>PRK02870 heat shock protein HtpX; Provisional
Probab=45.22 E-value=8.8 Score=34.88 Aligned_cols=12 Identities=25% Similarity=0.393 Sum_probs=10.5
Q ss_pred ccchhHHHHHHH
Q psy1890 105 VMGTVQHETLHA 116 (242)
Q Consensus 105 ~~~~i~HEigHA 116 (242)
..++++||+||.
T Consensus 173 L~aVlAHELgHi 184 (336)
T PRK02870 173 LQAVMAHELSHI 184 (336)
T ss_pred HHHHHHHHHHHH
Confidence 359999999997
No 106
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=43.09 E-value=13 Score=31.39 Aligned_cols=17 Identities=24% Similarity=0.190 Sum_probs=13.3
Q ss_pred ccchhHHHHHHHhcccc
Q psy1890 105 VMGTVQHETLHALGFWH 121 (242)
Q Consensus 105 ~~~~i~HEigHALGl~H 121 (242)
...+++||+|||.=...
T Consensus 93 ~gsiLAHE~mHa~Lrl~ 109 (212)
T PF12315_consen 93 TGSILAHELMHAWLRLN 109 (212)
T ss_pred HhhHHHHHHHHHHhccc
Confidence 34789999999986553
No 107
>PF14521 Aspzincin_M35: Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=41.70 E-value=12 Score=29.74 Aligned_cols=14 Identities=50% Similarity=0.622 Sum_probs=11.4
Q ss_pred ccchhHHHHHH--Hhc
Q psy1890 105 VMGTVQHETLH--ALG 118 (242)
Q Consensus 105 ~~~~i~HEigH--ALG 118 (242)
..+|++||+-| +.|
T Consensus 96 k~~TLiHE~SHf~~~~ 111 (148)
T PF14521_consen 96 KEGTLIHEWSHFTAVG 111 (148)
T ss_dssp HHHHHHHHHHHSCCCT
T ss_pred hHHHHHHhhhhhhhhc
Confidence 45899999999 644
No 108
>KOG3538|consensus
Probab=40.90 E-value=8.4 Score=39.34 Aligned_cols=23 Identities=26% Similarity=0.287 Sum_probs=18.6
Q ss_pred ccchhHHHHHHHhcccccccCCC
Q psy1890 105 VMGTVQHETLHALGFWHEQARPD 127 (242)
Q Consensus 105 ~~~~i~HEigHALGl~HEh~rpD 127 (242)
..-+++||+||.+|+.|--...+
T Consensus 317 ~aftiahE~GH~~gm~hd~~~~~ 339 (845)
T KOG3538|consen 317 SAFTIAHELGHNFGMEHDGRGNS 339 (845)
T ss_pred cceeeeeccccccCcccCCcCCC
Confidence 34689999999999999855543
No 109
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=40.88 E-value=15 Score=30.31 Aligned_cols=24 Identities=17% Similarity=-0.025 Sum_probs=19.9
Q ss_pred cchhHHHHHHHhcccccccCCCCC
Q psy1890 106 MGTVQHETLHALGFWHEQARPDRD 129 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rpDRD 129 (242)
...+.||+||++--.....+..+-
T Consensus 42 ~~l~iHElgH~~~A~~~G~~~~~~ 65 (183)
T cd06160 42 AILGIHEMGHYLAARRHGVKASLP 65 (183)
T ss_pred HHHHHHHHHHHHHHHHCCCCccce
Confidence 467899999999999888887653
No 110
>PRK03982 heat shock protein HtpX; Provisional
Probab=40.76 E-value=14 Score=32.58 Aligned_cols=13 Identities=31% Similarity=0.498 Sum_probs=10.8
Q ss_pred ccchhHHHHHHHh
Q psy1890 105 VMGTVQHETLHAL 117 (242)
Q Consensus 105 ~~~~i~HEigHAL 117 (242)
-.++++||+||.-
T Consensus 125 l~AVlAHElgHi~ 137 (288)
T PRK03982 125 LEGVIAHELTHIK 137 (288)
T ss_pred HHHHHHHHHHHHH
Confidence 3589999999964
No 111
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=40.03 E-value=15 Score=36.74 Aligned_cols=13 Identities=38% Similarity=0.284 Sum_probs=11.2
Q ss_pred cchhHHHHHHHhc
Q psy1890 106 MGTVQHETLHALG 118 (242)
Q Consensus 106 ~~~i~HEigHALG 118 (242)
+.|+-|||||||=
T Consensus 465 V~TlFHEfGHalH 477 (681)
T PRK10280 465 VITLFHEFGHTLH 477 (681)
T ss_pred HHHHHHHHHHHHH
Confidence 4689999999985
No 112
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=40.01 E-value=15 Score=32.08 Aligned_cols=12 Identities=25% Similarity=0.199 Sum_probs=10.7
Q ss_pred cchhHHHHHHHh
Q psy1890 106 MGTVQHETLHAL 117 (242)
Q Consensus 106 ~~~i~HEigHAL 117 (242)
.+++.||+||..
T Consensus 158 ~aVlaHElgHi~ 169 (302)
T COG0501 158 EAVLAHELGHIK 169 (302)
T ss_pred HHHHHHHHHHHh
Confidence 589999999975
No 113
>PRK03001 M48 family peptidase; Provisional
Probab=38.95 E-value=16 Score=32.21 Aligned_cols=13 Identities=31% Similarity=0.526 Sum_probs=10.8
Q ss_pred ccchhHHHHHHHh
Q psy1890 105 VMGTVQHETLHAL 117 (242)
Q Consensus 105 ~~~~i~HEigHAL 117 (242)
-.++++||+||.-
T Consensus 124 l~aVlAHElgHi~ 136 (283)
T PRK03001 124 IRGVMAHELAHVK 136 (283)
T ss_pred HHHHHHHHHHHHh
Confidence 3589999999964
No 114
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=38.69 E-value=16 Score=30.58 Aligned_cols=24 Identities=25% Similarity=0.156 Sum_probs=19.6
Q ss_pred ccchhHHHHHHHhcccccccCCCC
Q psy1890 105 VMGTVQHETLHALGFWHEQARPDR 128 (242)
Q Consensus 105 ~~~~i~HEigHALGl~HEh~rpDR 128 (242)
...++.||+||++=..+...+..+
T Consensus 38 ~~~v~iHElgH~~~A~~~G~~v~~ 61 (208)
T cd06161 38 FLSVLLHELGHALVARRYGIRVRS 61 (208)
T ss_pred HHHHHHHHHHHHHHHHHcCCCccc
Confidence 347899999999988888887654
No 115
>PRK03072 heat shock protein HtpX; Provisional
Probab=38.20 E-value=16 Score=32.29 Aligned_cols=13 Identities=31% Similarity=0.509 Sum_probs=10.6
Q ss_pred ccchhHHHHHHHh
Q psy1890 105 VMGTVQHETLHAL 117 (242)
Q Consensus 105 ~~~~i~HEigHAL 117 (242)
-.++++||+||.-
T Consensus 127 l~aVlAHElgHi~ 139 (288)
T PRK03072 127 LRGVLGHELSHVY 139 (288)
T ss_pred HHHHHHHHHHHHh
Confidence 3599999999943
No 116
>smart00393 R3H Putative single-stranded nucleic acids-binding domain.
Probab=38.09 E-value=44 Score=23.25 Aligned_cols=40 Identities=23% Similarity=0.100 Sum_probs=28.8
Q ss_pred eEEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEEe
Q psy1890 96 YVNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDIH 135 (242)
Q Consensus 96 ~i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~ 135 (242)
.+.|.+=.....-++|++...+||.|+.....-+.+|.|.
T Consensus 38 ~~~~~pm~~~~R~~iH~~a~~~~l~s~S~g~g~~R~vvv~ 77 (79)
T smart00393 38 SVELPPMNSYERKIVHELAEKYGLESESFGEGPKRRVVIS 77 (79)
T ss_pred eEEcCCCCHHHHHHHHHHHHHcCCEEEEEcCCCCcEEEEE
Confidence 4455442234578999999999999998776666676653
No 117
>PRK01345 heat shock protein HtpX; Provisional
Probab=37.52 E-value=17 Score=32.70 Aligned_cols=13 Identities=38% Similarity=0.560 Sum_probs=10.6
Q ss_pred ccchhHHHHHHHh
Q psy1890 105 VMGTVQHETLHAL 117 (242)
Q Consensus 105 ~~~~i~HEigHAL 117 (242)
..++++|||||..
T Consensus 124 L~aVlAHElgHi~ 136 (317)
T PRK01345 124 VAGVMAHELAHVK 136 (317)
T ss_pred HHHHHHHHHHHHH
Confidence 3589999999943
No 118
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=37.13 E-value=18 Score=32.24 Aligned_cols=10 Identities=30% Similarity=0.205 Sum_probs=8.7
Q ss_pred chhHHHHHHH
Q psy1890 107 GTVQHETLHA 116 (242)
Q Consensus 107 ~~i~HEigHA 116 (242)
.+++||+||.
T Consensus 172 yVlAHEyGHH 181 (292)
T PF04228_consen 172 YVLAHEYGHH 181 (292)
T ss_pred HHHHHHHHHH
Confidence 5899999995
No 119
>PRK05457 heat shock protein HtpX; Provisional
Probab=36.91 E-value=17 Score=32.17 Aligned_cols=12 Identities=33% Similarity=0.434 Sum_probs=10.2
Q ss_pred ccchhHHHHHHH
Q psy1890 105 VMGTVQHETLHA 116 (242)
Q Consensus 105 ~~~~i~HEigHA 116 (242)
-.++++||+||.
T Consensus 134 l~aVlAHElgHi 145 (284)
T PRK05457 134 VEAVLAHEISHI 145 (284)
T ss_pred HHHHHHHHHHHH
Confidence 359999999995
No 120
>PF06167 Peptidase_M90: Glucose-regulated metallo-peptidase M90; InterPro: IPR010384 MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=35.08 E-value=22 Score=31.02 Aligned_cols=17 Identities=29% Similarity=0.216 Sum_probs=12.6
Q ss_pred ccchhHHHHHHHhcccc
Q psy1890 105 VMGTVQHETLHALGFWH 121 (242)
Q Consensus 105 ~~~~i~HEigHALGl~H 121 (242)
...+++|||.|+|-+..
T Consensus 154 g~NVviHEfAH~LD~~~ 170 (253)
T PF06167_consen 154 GHNVVIHEFAHKLDMED 170 (253)
T ss_dssp S--HHHHHHHHHHHCTT
T ss_pred CcchHHHHHHHHHHhhc
Confidence 45899999999997653
No 121
>PRK04897 heat shock protein HtpX; Provisional
Probab=34.71 E-value=19 Score=32.01 Aligned_cols=12 Identities=33% Similarity=0.462 Sum_probs=10.2
Q ss_pred cchhHHHHHHHh
Q psy1890 106 MGTVQHETLHAL 117 (242)
Q Consensus 106 ~~~i~HEigHAL 117 (242)
.++++||+||.-
T Consensus 138 ~aVlAHElgHi~ 149 (298)
T PRK04897 138 EGVIGHEISHIR 149 (298)
T ss_pred HHHHHHHHHHHh
Confidence 589999999953
No 122
>PF14891 Peptidase_M91: Effector protein
Probab=33.33 E-value=21 Score=29.01 Aligned_cols=19 Identities=21% Similarity=0.384 Sum_probs=15.7
Q ss_pred cchhHHHHHHHhccccccc
Q psy1890 106 MGTVQHETLHALGFWHEQA 124 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~ 124 (242)
.-++.|||.||.-+.+=-.
T Consensus 104 ~v~L~HEL~HA~~~~~Gt~ 122 (174)
T PF14891_consen 104 FVVLYHELIHAYDYMNGTM 122 (174)
T ss_pred HHHHHHHHHHHHHHHCCCC
Confidence 3689999999998877666
No 123
>PRK02391 heat shock protein HtpX; Provisional
Probab=32.95 E-value=22 Score=31.61 Aligned_cols=12 Identities=25% Similarity=0.396 Sum_probs=10.1
Q ss_pred ccchhHHHHHHH
Q psy1890 105 VMGTVQHETLHA 116 (242)
Q Consensus 105 ~~~~i~HEigHA 116 (242)
-.++++||+||.
T Consensus 133 l~aVlaHElgHi 144 (296)
T PRK02391 133 LEAVLAHELSHV 144 (296)
T ss_pred HHHHHHHHHHHH
Confidence 358999999993
No 124
>PRK01265 heat shock protein HtpX; Provisional
Probab=32.37 E-value=22 Score=32.17 Aligned_cols=11 Identities=27% Similarity=0.341 Sum_probs=9.7
Q ss_pred ccchhHHHHHH
Q psy1890 105 VMGTVQHETLH 115 (242)
Q Consensus 105 ~~~~i~HEigH 115 (242)
-.++++||+||
T Consensus 140 l~aVlAHElgH 150 (324)
T PRK01265 140 IKAVAGHELGH 150 (324)
T ss_pred HHHHHHHHHHH
Confidence 35899999999
No 125
>KOG2556|consensus
Probab=31.63 E-value=23 Score=33.71 Aligned_cols=21 Identities=43% Similarity=0.441 Sum_probs=16.8
Q ss_pred cchhHHHHHHHhcccccccCC
Q psy1890 106 MGTVQHETLHALGFWHEQARP 126 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rp 126 (242)
..++.||+.|||||.--+-..
T Consensus 256 ~stVkhE~~halgfsaGl~af 276 (666)
T KOG2556|consen 256 ASTVKHERDHALGFSAGLVAF 276 (666)
T ss_pred HHHHHHHHHhhhcccceeeEE
Confidence 368999999999998665543
No 126
>PLN02887 hydrolase family protein
Probab=30.52 E-value=27 Score=34.15 Aligned_cols=19 Identities=47% Similarity=0.532 Sum_probs=16.7
Q ss_pred chhHHHHHHHhcccccccC
Q psy1890 107 GTVQHETLHALGFWHEQAR 125 (242)
Q Consensus 107 ~~i~HEigHALGl~HEh~r 125 (242)
..++|=++|.|||.||...
T Consensus 232 ~l~vHG~LHLlGyDH~~~~ 250 (580)
T PLN02887 232 ILVVHGLLHLLGFDHEISN 250 (580)
T ss_pred HHHHHHHHHhcCCCCCCCH
Confidence 5799999999999999753
No 127
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=30.52 E-value=25 Score=32.79 Aligned_cols=12 Identities=33% Similarity=0.191 Sum_probs=9.4
Q ss_pred cchhHHHHHHHh
Q psy1890 106 MGTVQHETLHAL 117 (242)
Q Consensus 106 ~~~i~HEigHAL 117 (242)
.-++.||+||.|
T Consensus 15 ~~v~~HE~gH~~ 26 (420)
T TIGR00054 15 VLIFVHELGHFL 26 (420)
T ss_pred HHHHHHhHHHHH
Confidence 357899999943
No 128
>PLN02791 Nudix hydrolase homolog
Probab=30.38 E-value=20 Score=36.21 Aligned_cols=21 Identities=24% Similarity=0.229 Sum_probs=17.3
Q ss_pred cchhHHHHHHHhcccccccCC
Q psy1890 106 MGTVQHETLHALGFWHEQARP 126 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rp 126 (242)
..|++||+||.+|=.+....+
T Consensus 586 ~~v~lHElgHGsG~~~~~~~~ 606 (770)
T PLN02791 586 THTICHECCHGIGPHTITLPD 606 (770)
T ss_pred HHHHHHHhhccccccceecCC
Confidence 478999999999998866543
No 129
>PF01424 R3H: R3H domain; InterPro: IPR001374 The R3H motif: a domain that binds single-stranded nucleic acids. The most prominent feature of the R3H motif is the presence of an invariant arginine residue and a highly conserved histidine residue that are separated by three residues. The motif also displays a conserved pattern of hydrophobic residues, prolines and glycines. The R3H motif is present in proteins from a diverse range of organisms that includes Eubacteria, green plants, fungi and various groups of metazoans. Intriguingly, it has not yet been identified in Archaea and Escherichia coli. The sequences that contain the R3H domain, many of which are hypothetical proteins predicted from genome sequencing projects, can be grouped into eight families on the basis of similarities outside the R3H region. Three of the families contain ATPase domains either upstream (families II and VII) or downstream of the R3H domain (family VIII). The N-terminal part of members of family VII contains an SF1 helicase domain5. The C-terminal part of family VIII contains an SF2 DEAH helicase domain5. The ATPase domain in the members of family II is similar to the stage-III sporulation protein AA (S3AA_BACSU), the proteasome ATPase, bacterial transcription-termination factor r and the mitochondrial F1-ATPase b subunit (the F5 helicase family5). Family VI contains Cys-rich repeats6, as well as a ring-type zinc finger upstream of the R3H domain. JAG bacterial proteins (family I) contain a KH domain N-terminal to the R3H domain. The functions of other domains in R3H proteins support the notion that the R3H domain might be involved in interactions with single-stranded nucleic acids [].; GO: 0003676 nucleic acid binding; PDB: 1WHR_A 1MSZ_A 1UG8_A 3GKU_B 2CPM_A.
Probab=30.21 E-value=70 Score=20.97 Aligned_cols=40 Identities=13% Similarity=0.063 Sum_probs=28.6
Q ss_pred eEEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEEe
Q psy1890 96 YVNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDIH 135 (242)
Q Consensus 96 ~i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~ 135 (242)
.+.|.+-.....-++|++..-.||.|+-...+-+.+|.|.
T Consensus 22 ~~~f~pm~~~~R~~iH~~a~~~gL~s~S~g~~~~R~vvv~ 61 (63)
T PF01424_consen 22 SLEFPPMNSFERKLIHELAEYYGLKSKSEGEGPNRRVVVS 61 (63)
T ss_dssp EEEEEC--SHHHHHHHHHHHHCTEEEEEESSSSSSEEEEE
T ss_pred EEEECCCCHHHHHHHHHHHHHCCCEEEEecCCCCeEEEEE
Confidence 3444432223578999999999999999877767777664
No 130
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=29.95 E-value=25 Score=29.01 Aligned_cols=14 Identities=29% Similarity=0.223 Sum_probs=10.7
Q ss_pred chhHHHHHHHhccc
Q psy1890 107 GTVQHETLHALGFW 120 (242)
Q Consensus 107 ~~i~HEigHALGl~ 120 (242)
.+++|||+||+.-.
T Consensus 38 ~ilahel~hafd~~ 51 (206)
T PF01431_consen 38 FILAHELMHAFDPE 51 (206)
T ss_dssp HHHHHHHHHCTSTT
T ss_pred HHHHHHHHHHHHHh
Confidence 57999999966443
No 131
>PRK04351 hypothetical protein; Provisional
Probab=29.47 E-value=30 Score=27.63 Aligned_cols=16 Identities=25% Similarity=0.384 Sum_probs=12.3
Q ss_pred ccchhHHHHHHHhccc
Q psy1890 105 VMGTVQHETLHALGFW 120 (242)
Q Consensus 105 ~~~~i~HEigHALGl~ 120 (242)
..+||.||+.|.+=+.
T Consensus 61 l~~vv~HElcH~~~~~ 76 (149)
T PRK04351 61 LIGIIKHELCHYHLHL 76 (149)
T ss_pred HHhhHHHHHHHHHHHH
Confidence 3589999999986443
No 132
>PF02102 Peptidase_M35: Deuterolysin metalloprotease (M35) family; InterPro: IPR001384 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M35 (deuterolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Deuterolysin is a microbial zinc-containing metalloprotease that shows some similarity to thermolysin []. The protein is expressed with a possible 19-residue signal sequence, a 155-residue propeptide, and an active peptide of 177 residues []. The latter contains an HEXXH motif towards the C terminus, but the other zinc ligands are as yet undetermined [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 1EB6_A.
Probab=29.36 E-value=29 Score=31.91 Aligned_cols=20 Identities=30% Similarity=0.504 Sum_probs=13.8
Q ss_pred CCCcc---cchhHHHHHHHhccc
Q psy1890 101 KNCFV---MGTVQHETLHALGFW 120 (242)
Q Consensus 101 ~~c~~---~~~i~HEigHALGl~ 120 (242)
..|.. .++++||+-|+.+..
T Consensus 290 ~~C~~qDqatt~LHE~TH~~~V~ 312 (359)
T PF02102_consen 290 NRCHAQDQATTTLHEMTHAPAVY 312 (359)
T ss_dssp -STT---HHHHHHHHHHT-TTTS
T ss_pred ccccCCCccchhhhhhhcccccc
Confidence 45754 379999999999864
No 133
>PF13058 DUF3920: Protein of unknown function (DUF3920)
Probab=29.06 E-value=31 Score=26.25 Aligned_cols=31 Identities=23% Similarity=0.208 Sum_probs=20.0
Q ss_pred CCCCCHHHHHHHHHHHHHHhc-cccEEEEecC
Q psy1890 35 DKNFSVDRKRFIVKALNNIMT-FSCVRFRRRT 65 (242)
Q Consensus 35 ~~~~~~~~~~~I~~A~~~w~~-~tci~F~~~~ 65 (242)
++.|+-..+.+-.+-++.++. .+.|.|-..-
T Consensus 3 d~E~~wd~~~l~~DvFs~i~k~~~pviFcdTc 34 (126)
T PF13058_consen 3 DSEFPWDVQRLRNDVFSLIEKCHIPVIFCDTC 34 (126)
T ss_pred cccccchHHHHHHHHHHHHHhhcCCeEEeccc
Confidence 344555566666777777776 5778786554
No 134
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=28.04 E-value=28 Score=34.50 Aligned_cols=24 Identities=8% Similarity=0.158 Sum_probs=16.6
Q ss_pred HHHHHHHHHHHhccccEEEEecCC
Q psy1890 43 KRFIVKALNNIMTFSCVRFRRRTH 66 (242)
Q Consensus 43 ~~~I~~A~~~w~~~tci~F~~~~~ 66 (242)
-++|.--++..++-=.|+|+|+..
T Consensus 364 ~~Vl~GlF~~~~rLfGI~~~e~~~ 387 (683)
T COG0339 364 NKVLEGLFEVAKRLFGITFVERKD 387 (683)
T ss_pred hHHHHHHHHHHHHHcCeEEEECCC
Confidence 346655556666777999999853
No 135
>PF13203 DUF2201_N: Putative metallopeptidase domain
Probab=27.84 E-value=34 Score=29.98 Aligned_cols=21 Identities=38% Similarity=0.588 Sum_probs=16.5
Q ss_pred cchhHHHHHHHhcccccccCCC
Q psy1890 106 MGTVQHETLHALGFWHEQARPD 127 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rpD 127 (242)
.++++||++|.+ |.|..-+..
T Consensus 61 ~~~l~HevlH~~-~~H~~r~~~ 81 (292)
T PF13203_consen 61 VGLLLHEVLHCL-LRHPWRRGG 81 (292)
T ss_pred HHHHHHHHHHHH-ccchhhhcc
Confidence 478999999987 666666666
No 136
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=27.14 E-value=32 Score=32.36 Aligned_cols=10 Identities=30% Similarity=0.139 Sum_probs=8.3
Q ss_pred cchhHHHHHH
Q psy1890 106 MGTVQHETLH 115 (242)
Q Consensus 106 ~~~i~HEigH 115 (242)
.-++.||+||
T Consensus 16 ~li~vHElGH 25 (449)
T PRK10779 16 VLITVHEFGH 25 (449)
T ss_pred HHHHHHHHHH
Confidence 3578999999
No 137
>PRK04860 hypothetical protein; Provisional
Probab=26.60 E-value=36 Score=27.51 Aligned_cols=17 Identities=29% Similarity=0.307 Sum_probs=13.2
Q ss_pred ccchhHHHHHHHhcccc
Q psy1890 105 VMGTVQHETLHALGFWH 121 (242)
Q Consensus 105 ~~~~i~HEigHALGl~H 121 (242)
..+||.||+.|.+=+.+
T Consensus 63 l~~~v~HEl~H~~~~~~ 79 (160)
T PRK04860 63 IDEVVPHELAHLLVYQL 79 (160)
T ss_pred HHhHHHHHHHHHHHHHH
Confidence 35899999999875543
No 138
>PRK15410 DgsA anti-repressor MtfA; Provisional
Probab=25.17 E-value=35 Score=29.87 Aligned_cols=17 Identities=24% Similarity=0.353 Sum_probs=15.0
Q ss_pred ccchhHHHHHHHhcccc
Q psy1890 105 VMGTVQHETLHALGFWH 121 (242)
Q Consensus 105 ~~~~i~HEigHALGl~H 121 (242)
...+++|||.|.|-+.+
T Consensus 142 g~NVvIHEFAH~LDm~~ 158 (260)
T PRK15410 142 GFNLIIHEVAHKLDMRN 158 (260)
T ss_pred CcchhHhHHHhHhhhhc
Confidence 46899999999999876
No 139
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=24.77 E-value=34 Score=32.66 Aligned_cols=17 Identities=29% Similarity=0.243 Sum_probs=13.7
Q ss_pred ccchhHHHHHHHhcccc
Q psy1890 105 VMGTVQHETLHALGFWH 121 (242)
Q Consensus 105 ~~~~i~HEigHALGl~H 121 (242)
...|+.||+|||..+..
T Consensus 247 D~~t~~HE~GH~~yy~~ 263 (477)
T cd06461 247 DFVTVHHEMGHIQYYLQ 263 (477)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 45799999999996544
No 140
>KOG2090|consensus
Probab=24.55 E-value=39 Score=33.18 Aligned_cols=12 Identities=42% Similarity=0.509 Sum_probs=10.4
Q ss_pred cchhHHHHHHHh
Q psy1890 106 MGTVQHETLHAL 117 (242)
Q Consensus 106 ~~~i~HEigHAL 117 (242)
.-++-||+|||+
T Consensus 480 vetLFHEmGHAM 491 (704)
T KOG2090|consen 480 VETLFHEMGHAM 491 (704)
T ss_pred HHHHHHHHHHHH
Confidence 468999999996
No 141
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=24.05 E-value=35 Score=32.40 Aligned_cols=10 Identities=60% Similarity=0.571 Sum_probs=8.7
Q ss_pred hhHHHHHHHh
Q psy1890 108 TVQHETLHAL 117 (242)
Q Consensus 108 ~i~HEigHAL 117 (242)
-++||.||||
T Consensus 262 g~iHE~GHAl 271 (497)
T COG2317 262 GTIHETGHAL 271 (497)
T ss_pred HHHHhhhhHH
Confidence 4789999986
No 142
>PF09768 Peptidase_M76: Peptidase M76 family; InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=23.63 E-value=44 Score=27.44 Aligned_cols=25 Identities=24% Similarity=0.454 Sum_probs=17.7
Q ss_pred cchhHHHHHHHhcccccccCCCCCceEEEeeeccc
Q psy1890 106 MGTVQHETLHALGFWHEQARPDRDRFVDIHYELIK 140 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~ 140 (242)
..+++|||-||.-. .|- .|+|.|..
T Consensus 72 ~~~l~HELIHayD~----cr~------kvD~~n~~ 96 (173)
T PF09768_consen 72 EDTLTHELIHAYDH----CRA------KVDWTNCR 96 (173)
T ss_pred HHHHHHHHHHHHHH----Hhc------cCCcccch
Confidence 47999999999753 332 36777654
No 143
>KOG3624|consensus
Probab=23.29 E-value=43 Score=33.26 Aligned_cols=29 Identities=21% Similarity=0.259 Sum_probs=21.1
Q ss_pred chhHHHHHHHhcccccccCCCCCceEEEeee
Q psy1890 107 GTVQHETLHALGFWHEQARPDRDRFVDIHYE 137 (242)
Q Consensus 107 ~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ 137 (242)
.+|.|||+| ||.......|..-.+.--|+
T Consensus 520 ~vigHEl~H--~FD~~G~~~D~~Gn~~~ww~ 548 (687)
T KOG3624|consen 520 FVIGHELTH--GFDDQGRQYDESGNLRDWWD 548 (687)
T ss_pred HHHHHHHhh--cccccccccCCCcccccCCC
Confidence 578999999 77777777776655555444
No 144
>PF10026 DUF2268: Predicted Zn-dependent protease (DUF2268); InterPro: IPR018728 This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function.
Probab=23.23 E-value=3e+02 Score=22.57 Aligned_cols=75 Identities=11% Similarity=0.077 Sum_probs=38.1
Q ss_pred HHHHHHHHHHHHHhcc-ccEEEEecCCCCCCeEEEEccCCCCCCCCccCCCCCccceEE-eCCCCc---ccchhHHHHHH
Q psy1890 41 DRKRFIVKALNNIMTF-SCVRFRRRTHNQTPYIVYKESPNGKSCSSNVGFTGKKVSYVN-LGKNCF---VMGTVQHETLH 115 (242)
Q Consensus 41 ~~~~~I~~A~~~w~~~-tci~F~~~~~~~~~~I~i~~~~~~~gc~S~vG~~~g~~q~i~-l~~~c~---~~~~i~HEigH 115 (242)
...+.+.++.+.|... ..|...+... ....+... .|..+..+..+. -.+. +.+... ...+++||+-|
T Consensus 4 ~i~~~~~~~~~~~~~~~i~v~i~p~~~-~~~~~~~~-----~g~~g~~~~~~~--i~l~~~~~~~~~~~l~~~iaHE~hH 75 (195)
T PF10026_consen 4 IIEEALEKSIELLPGPDIPVFIFPADP-ENPFLIPE-----LGGKGGGAIPGY--IFLFLLPNDYSLEELPALIAHEYHH 75 (195)
T ss_pred HHHHHHHHHHHHcCCCCCCEEEEeccC-CCcccccc-----cCcccccCCCCE--EEEEecCCcccHHHHHHHHHHHHHH
Confidence 3456777777888753 3444444442 22222211 222233333222 2333 333222 24899999999
Q ss_pred Hhcccccc
Q psy1890 116 ALGFWHEQ 123 (242)
Q Consensus 116 ALGl~HEh 123 (242)
++=+..-.
T Consensus 76 ~~r~~~~~ 83 (195)
T PF10026_consen 76 NCRYEQIG 83 (195)
T ss_pred HHHHhccC
Confidence 98776443
No 145
>PF10023 DUF2265: Predicted aminopeptidase (DUF2265); InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=23.10 E-value=37 Score=30.88 Aligned_cols=15 Identities=20% Similarity=0.162 Sum_probs=11.5
Q ss_pred cchhHHHHHHHhccc
Q psy1890 106 MGTVQHETLHALGFW 120 (242)
Q Consensus 106 ~~~i~HEigHALGl~ 120 (242)
...|.||+.|..=+.
T Consensus 166 A~LIfHELaHq~~Yv 180 (337)
T PF10023_consen 166 ARLIFHELAHQTLYV 180 (337)
T ss_pred HHHHHHHHhhceeec
Confidence 478999999975443
No 146
>PF01434 Peptidase_M41: Peptidase family M41 This is family M41 in the peptidase classification. ; InterPro: IPR000642 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The peptidase M41 family belong to a larger family of zinc metalloproteases. This family includes the cell division protein FtsH, and the yeast mitochondrial respiratory chain complexes assembly protein, which is a putative ATP-dependent protease required for assembly of the mitochondrial respiratory chain and ATPase complexes. FtsH is an integral membrane protein, which seems to act as an ATP-dependent zinc metallopeptidase that binds one zinc ion.; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0006508 proteolysis; PDB: 4EIW_C 2DHR_E 1IY1_A 1IY2_A 1IY0_A 1IXZ_A 2CE7_F 2CEA_F 3KDS_E 2QZ4_A ....
Probab=23.01 E-value=46 Score=27.90 Aligned_cols=13 Identities=31% Similarity=0.202 Sum_probs=11.2
Q ss_pred cchhHHHHHHHhc
Q psy1890 106 MGTVQHETLHALG 118 (242)
Q Consensus 106 ~~~i~HEigHALG 118 (242)
..++.||-||||=
T Consensus 29 ~~~A~HEAGhAvv 41 (213)
T PF01434_consen 29 RRIAYHEAGHAVV 41 (213)
T ss_dssp HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHH
Confidence 5789999999974
No 147
>KOG3314|consensus
Probab=22.71 E-value=49 Score=27.06 Aligned_cols=25 Identities=24% Similarity=0.489 Sum_probs=18.3
Q ss_pred cchhHHHHHHHhcccccccCCCCCceEEEeeeccc
Q psy1890 106 MGTVQHETLHALGFWHEQARPDRDRFVDIHYELIK 140 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~ 140 (242)
..++.|||.||+- +.| ..|+|.|+.
T Consensus 92 n~vv~HElIH~fD----d~r------~~vDW~Nlr 116 (194)
T KOG3314|consen 92 NQVVIHELIHAFD----DCR------AKVDWTNLR 116 (194)
T ss_pred HHHHHHHHHHHHH----hhh------hhcccchhh
Confidence 3799999999973 233 248898876
No 148
>KOG0201|consensus
Probab=22.20 E-value=39 Score=31.86 Aligned_cols=30 Identities=27% Similarity=0.480 Sum_probs=25.0
Q ss_pred cchhHHHHHHHhcccccccCCCCCceEEEeeecc
Q psy1890 106 MGTVQHETLHALGFWHEQARPDRDRFVDIHYELI 139 (242)
Q Consensus 106 ~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni 139 (242)
.++|++|+++||+|.|.+.---|| |.-.||
T Consensus 113 i~~ilre~l~~l~ylH~~~kiHrD----IKaani 142 (467)
T KOG0201|consen 113 IAVILREVLKGLDYLHSEKKIHRD----IKAANI 142 (467)
T ss_pred eeeehHHHHHHhhhhhhcceeccc----ccccce
Confidence 478999999999999999988777 555554
No 149
>PF04450 BSP: Peptidase of plants and bacteria; InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=21.66 E-value=49 Score=27.85 Aligned_cols=16 Identities=25% Similarity=0.455 Sum_probs=13.5
Q ss_pred cchhHHHHHHHhcccc
Q psy1890 106 MGTVQHETLHALGFWH 121 (242)
Q Consensus 106 ~~~i~HEigHALGl~H 121 (242)
.|++.||+.|+.-..-
T Consensus 97 ~Gvl~HE~~H~~Q~~~ 112 (205)
T PF04450_consen 97 IGVLYHEMVHCWQWDG 112 (205)
T ss_pred HHHHHHHHHHHhhcCC
Confidence 6999999999976554
No 150
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=20.73 E-value=29 Score=33.84 Aligned_cols=21 Identities=33% Similarity=0.419 Sum_probs=14.4
Q ss_pred chhHHHHHHHhcccccccCCCCC
Q psy1890 107 GTVQHETLHALGFWHEQARPDRD 129 (242)
Q Consensus 107 ~~i~HEigHALGl~HEh~rpDRD 129 (242)
++|.|||+|+ |--.-.+.|.|
T Consensus 489 aVIgHEI~Hg--FDdqGakfD~~ 509 (654)
T COG3590 489 AVIGHEIGHG--FDDQGAKFDGD 509 (654)
T ss_pred ceehhhhccc--ccCCccccCCC
Confidence 6899999996 44444555544
No 151
>PF10460 Peptidase_M30: Peptidase M30; InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue.
Probab=20.56 E-value=53 Score=30.30 Aligned_cols=16 Identities=31% Similarity=0.644 Sum_probs=13.7
Q ss_pred cchhHHHHHHHhcccc
Q psy1890 106 MGTVQHETLHALGFWH 121 (242)
Q Consensus 106 ~~~i~HEigHALGl~H 121 (242)
.+|++|||-|.+-|..
T Consensus 140 ~sTlAHEfQHmInfy~ 155 (366)
T PF10460_consen 140 YSTLAHEFQHMINFYQ 155 (366)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 5899999999997764
Done!