Query psy2949
Match_columns 367
No_of_seqs 291 out of 1661
Neff 7.4
Searched_HMMs 46136
Date Fri Aug 16 21:41:07 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2949.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2949hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd04282 ZnMc_meprin Zinc-depen 100.0 1.6E-64 3.5E-69 463.7 17.9 206 1-214 24-230 (230)
2 cd04281 ZnMc_BMP1_TLD Zinc-dep 100.0 5.4E-62 1.2E-66 439.8 18.5 196 16-215 2-200 (200)
3 cd04283 ZnMc_hatching_enzyme Z 100.0 3.9E-56 8.4E-61 395.8 15.7 177 28-214 5-182 (182)
4 PF01400 Astacin: Astacin (Pep 100.0 1.8E-56 4E-61 403.7 13.8 189 22-215 1-190 (191)
5 KOG3714|consensus 100.0 2.6E-55 5.5E-60 437.6 18.4 202 11-217 69-272 (411)
6 cd04280 ZnMc_astacin_like Zinc 100.0 3.6E-54 7.8E-59 385.3 15.4 180 26-212 1-180 (180)
7 cd04327 ZnMc_MMP_like_3 Zinc-d 100.0 9.2E-41 2E-45 303.2 11.1 168 22-212 1-197 (198)
8 smart00235 ZnMc Zinc-dependent 100.0 5.8E-30 1.3E-34 219.7 12.3 135 21-165 2-140 (140)
9 cd04268 ZnMc_MMP_like Zinc-dep 100.0 5.8E-29 1.3E-33 218.7 10.9 145 27-212 2-165 (165)
10 KOG3714|consensus 99.9 1.4E-23 3.1E-28 209.7 3.7 120 191-313 168-287 (411)
11 cd04281 ZnMc_BMP1_TLD Zinc-dep 99.9 1.3E-23 2.8E-28 190.0 3.0 102 192-295 97-200 (200)
12 cd04282 ZnMc_meprin Zinc-depen 99.9 2.6E-23 5.7E-28 191.3 2.8 101 192-294 130-230 (230)
13 cd04283 ZnMc_hatching_enzyme Z 99.8 1.1E-21 2.4E-26 174.9 1.8 96 192-294 87-182 (182)
14 cd00203 ZnMc Zinc-dependent me 99.8 1.3E-19 2.7E-24 159.4 11.6 145 28-212 2-167 (167)
15 PF01400 Astacin: Astacin (Pep 99.8 3.5E-20 7.6E-25 167.3 2.0 103 192-296 89-191 (191)
16 cd04280 ZnMc_astacin_like Zinc 99.8 1.5E-19 3.3E-24 161.7 1.4 97 192-292 84-180 (180)
17 cd04278 ZnMc_MMP Zinc-dependen 99.7 9.6E-17 2.1E-21 140.4 11.7 99 23-122 1-125 (157)
18 cd04277 ZnMc_serralysin_like Z 99.5 4.4E-14 9.5E-19 126.9 12.0 141 37-213 30-186 (186)
19 cd04279 ZnMc_MMP_like_1 Zinc-d 99.5 1.7E-13 3.6E-18 119.7 11.3 98 28-125 3-125 (156)
20 PF00413 Peptidase_M10: Matrix 99.5 7.9E-13 1.7E-17 114.6 13.1 100 23-122 1-123 (154)
21 cd04327 ZnMc_MMP_like_3 Zinc-d 99.2 2.1E-12 4.5E-17 117.3 1.9 87 192-293 102-198 (198)
22 PF12388 Peptidase_M57: Dual-a 99.0 7.1E-09 1.5E-13 93.9 12.2 178 1-212 3-211 (211)
23 cd04276 ZnMc_MMP_like_2 Zinc-d 98.5 1.2E-06 2.7E-11 79.4 12.7 31 25-56 6-36 (197)
24 KOG1565|consensus 98.5 7.6E-07 1.6E-11 90.6 11.1 104 17-121 99-228 (469)
25 cd04268 ZnMc_MMP_like Zinc-dep 97.7 1.4E-05 3.1E-10 69.8 2.0 44 235-292 122-165 (165)
26 PF02031 Peptidase_M7: Strepto 97.6 3E-05 6.5E-10 64.5 2.6 86 28-118 3-91 (132)
27 COG5549 Predicted Zn-dependent 97.2 0.00076 1.6E-08 60.7 6.1 80 40-119 103-202 (236)
28 PF13583 Reprolysin_4: Metallo 95.7 0.0045 9.7E-08 56.6 1.5 19 105-123 138-156 (206)
29 PF13582 Reprolysin_3: Metallo 95.6 0.0053 1.1E-07 50.9 1.3 19 103-121 106-124 (124)
30 PF13574 Reprolysin_2: Metallo 95.4 0.01 2.2E-07 52.7 2.7 20 104-123 111-130 (173)
31 PF05548 Peptidase_M11: Gameto 95.3 0.031 6.8E-07 54.4 5.8 81 39-123 86-169 (314)
32 PF05572 Peptidase_M43: Pregna 95.2 0.0082 1.8E-07 52.4 1.3 20 106-125 71-90 (154)
33 cd04267 ZnMc_ADAM_like Zinc-de 94.9 0.015 3.2E-07 52.2 2.2 18 106-123 135-152 (192)
34 cd00203 ZnMc Zinc-dependent me 94.8 0.023 5E-07 49.4 3.0 40 235-292 128-167 (167)
35 smart00042 CUB Domain first fo 94.8 0.024 5.3E-07 44.7 2.8 29 308-340 4-32 (102)
36 PF13688 Reprolysin_5: Metallo 94.5 0.018 3.9E-07 51.7 1.5 23 104-126 142-164 (196)
37 cd04275 ZnMc_pappalysin_like Z 93.9 0.0096 2.1E-07 55.2 -1.5 20 106-125 139-158 (225)
38 cd00041 CUB CUB domain; extrac 93.6 0.074 1.6E-06 42.4 3.4 33 305-341 10-43 (113)
39 COG1913 Predicted Zn-dependent 93.3 0.035 7.5E-07 49.0 1.0 18 106-123 126-143 (181)
40 cd04269 ZnMc_adamalysin_II_lik 93.2 0.039 8.5E-07 49.6 1.3 19 105-123 132-150 (194)
41 cd04272 ZnMc_salivary_gland_MP 93.0 0.064 1.4E-06 49.3 2.5 20 105-124 146-165 (220)
42 TIGR03296 M6dom_TIGR03296 M6 f 93.0 0.19 4.1E-06 48.3 5.6 19 104-122 165-183 (286)
43 PF00431 CUB: CUB domain CUB d 92.4 0.058 1.3E-06 42.8 1.1 34 304-341 8-42 (110)
44 cd04271 ZnMc_ADAM_fungal Zinc- 92.2 0.053 1.2E-06 50.4 0.7 17 106-122 147-163 (228)
45 PRK13267 archaemetzincin-like 92.0 0.073 1.6E-06 47.6 1.3 15 106-120 127-141 (179)
46 cd04270 ZnMc_TACE_like Zinc-de 91.9 0.072 1.6E-06 50.0 1.3 17 106-122 169-185 (244)
47 PF07998 Peptidase_M54: Peptid 91.8 0.077 1.7E-06 48.0 1.3 17 106-122 147-163 (194)
48 cd04273 ZnMc_ADAMTS_like Zinc- 91.7 0.041 8.8E-07 50.2 -0.7 19 105-123 141-159 (207)
49 PF01421 Reprolysin: Reprolysi 90.2 0.14 2.9E-06 46.3 1.3 18 106-123 133-150 (199)
50 PF11350 DUF3152: Protein of u 89.2 0.15 3.3E-06 46.0 0.9 98 27-124 29-159 (203)
51 PF10462 Peptidase_M66: Peptid 89.0 0.17 3.8E-06 48.9 1.1 16 106-121 195-210 (305)
52 cd02641 R3H_Smubp-2_like R3H d 86.8 0.58 1.3E-05 34.1 2.5 29 105-133 30-58 (60)
53 PTZ00337 surface protease GP63 86.1 0.25 5.5E-06 51.8 0.3 28 93-120 218-245 (567)
54 PF12044 Metallopep: Putative 85.6 0.34 7.4E-06 48.9 1.0 18 106-123 318-335 (423)
55 PF01457 Peptidase_M8: Leishma 84.2 0.46 1E-05 49.6 1.3 17 105-121 211-227 (521)
56 cd06459 M3B_Oligoendopeptidase 82.5 1.8 4E-05 43.3 4.8 20 98-117 216-235 (427)
57 KOG4525|consensus 81.9 0.57 1.2E-05 46.9 0.8 18 106-123 303-320 (614)
58 TIGR00181 pepF oligoendopeptid 81.8 1.9 4E-05 45.7 4.7 74 40-118 314-392 (591)
59 PF12725 DUF3810: Protein of u 81.0 1.1 2.5E-05 43.7 2.5 75 40-122 133-214 (318)
60 PF05547 Peptidase_M6: Immune 80.1 0.37 8E-06 51.3 -1.3 16 104-119 221-236 (645)
61 TIGR02290 M3_fam_3 oligoendope 79.8 2.6 5.7E-05 44.6 5.0 75 40-118 310-389 (587)
62 PTZ00257 Glycoprotein GP63 (le 79.1 0.91 2E-05 47.9 1.2 17 104-120 256-272 (622)
63 PF02408 CUB_2: CUB-like domai 72.8 6.6 0.00014 32.4 4.6 29 313-341 31-61 (120)
64 cd02640 R3H_NRF R3H domain of 72.4 3.3 7.2E-05 30.2 2.3 28 106-133 31-58 (60)
65 PHA02456 zinc metallopeptidase 72.3 2.3 4.9E-05 34.9 1.6 37 77-119 58-94 (141)
66 cd06007 R3H_DEXH_helicase R3H 71.8 3.7 8E-05 29.8 2.4 29 105-133 29-57 (59)
67 KOG3607|consensus 71.4 1.9 4.1E-05 46.6 1.2 17 106-122 325-341 (716)
68 PF09471 Peptidase_M64: IgA Pe 70.5 2.1 4.6E-05 40.6 1.3 26 26-51 80-105 (264)
69 PF13398 Peptidase_M50B: Pepti 66.4 2.8 6.1E-05 38.0 1.1 14 106-119 24-37 (200)
70 PF04298 Zn_peptidase_2: Putat 65.2 3.7 8E-05 37.9 1.6 13 105-117 90-102 (222)
71 COG2856 Predicted Zn peptidase 65.2 3 6.6E-05 38.3 1.1 16 106-123 74-89 (213)
72 TIGR02289 M3_not_pepF oligoend 62.9 4.7 0.0001 42.3 2.1 21 99-119 332-352 (549)
73 PF06114 DUF955: Domain of unk 61.6 4.2 9.1E-05 32.4 1.2 12 105-116 43-54 (122)
74 cd02646 R3H_G-patch R3H domain 61.0 8.2 0.00018 27.7 2.5 28 106-133 29-56 (58)
75 PF06262 DUF1025: Possibl zinc 60.9 4.1 8.8E-05 32.7 1.0 17 106-122 75-91 (97)
76 PF03272 Enhancin: Viral enhan 59.3 7.5 0.00016 42.6 2.9 124 22-153 128-280 (775)
77 cd02642 R3H_encore_like R3H do 59.1 9.5 0.0002 27.9 2.6 29 105-134 33-61 (63)
78 PF01863 DUF45: Protein of unk 58.9 27 0.00058 31.2 6.1 17 105-121 165-181 (205)
79 PF11667 DUF3267: Protein of u 57.7 6.6 0.00014 31.9 1.7 20 106-125 6-25 (111)
80 PF12388 Peptidase_M57: Dual-a 57.4 3.4 7.3E-05 37.9 -0.1 40 232-292 170-211 (211)
81 PF14247 DUF4344: Domain of un 56.5 5.5 0.00012 36.8 1.2 11 106-116 94-104 (220)
82 COG3824 Predicted Zn-dependent 55.9 4.9 0.00011 33.4 0.7 16 106-121 111-126 (136)
83 PF02163 Peptidase_M50: Peptid 55.5 5.8 0.00012 35.2 1.1 21 105-125 8-28 (192)
84 smart00235 ZnMc Zinc-dependent 55.2 2.3 4.9E-05 35.8 -1.5 45 192-245 96-140 (140)
85 COG4783 Putative Zn-dependent 51.3 7 0.00015 39.9 1.1 15 104-118 130-144 (484)
86 PF01432 Peptidase_M3: Peptida 51.0 7.4 0.00016 39.7 1.2 15 105-119 243-257 (458)
87 cd06258 Peptidase_M3_like The 50.9 7.1 0.00015 38.4 1.0 12 105-116 155-166 (365)
88 cd05709 S2P-M50 Site-2 proteas 49.3 8.3 0.00018 33.9 1.1 19 106-124 10-28 (180)
89 TIGR00043 metalloprotein, YbeY 49.3 9.7 0.00021 31.2 1.4 17 106-122 75-91 (110)
90 PF15639 Tox-MPTase3: Metallop 48.0 25 0.00055 29.5 3.6 70 43-114 22-110 (135)
91 PF01447 Peptidase_M4: Thermol 47.7 9.6 0.00021 33.1 1.2 11 106-116 137-147 (150)
92 COG1164 Oligoendopeptidase F [ 47.0 8.2 0.00018 41.1 0.8 75 40-118 315-394 (598)
93 cd06455 M3A_TOP Peptidase M3 T 46.5 9.4 0.0002 39.3 1.1 12 106-117 265-276 (472)
94 cd06457 M3A_MIP Peptidase M3 m 44.4 11 0.00023 38.8 1.1 15 106-120 250-264 (458)
95 cd06163 S2P-M50_PDZ_RseP-like 42.9 12 0.00027 33.4 1.2 19 106-124 11-29 (182)
96 PF02130 UPF0054: Uncharacteri 42.8 13 0.00029 31.9 1.4 16 106-121 110-125 (145)
97 COG2738 Predicted Zn-dependent 42.2 12 0.00026 33.9 0.9 17 101-117 89-105 (226)
98 cd06162 S2P-M50_PDZ_SREBP Ster 41.0 13 0.00028 35.6 1.1 17 106-122 137-153 (277)
99 PF13485 Peptidase_MA_2: Pepti 40.6 14 0.00031 29.5 1.2 11 106-116 27-37 (128)
100 cd06456 M3A_DCP_Oligopeptidase 39.8 14 0.0003 37.5 1.2 13 105-117 209-221 (422)
101 COG0319 Predicted metal-depend 39.4 16 0.00035 31.8 1.3 18 106-123 110-127 (153)
102 cd06159 S2P-M50_PDZ_Arch Uncha 39.4 14 0.00031 35.0 1.1 21 106-126 120-140 (263)
103 KOG3658|consensus 39.3 10 0.00022 40.3 0.1 16 106-121 394-409 (764)
104 PF01435 Peptidase_M48: Peptid 39.1 15 0.00033 33.0 1.2 13 105-117 90-102 (226)
105 cd06158 S2P-M50_like_1 Unchara 38.9 15 0.00033 32.7 1.1 12 105-116 10-21 (181)
106 PRK02870 heat shock protein Ht 38.8 12 0.00026 36.9 0.5 15 105-122 174-188 (336)
107 cd06161 S2P-M50_SpoIVFB SpoIVF 38.1 16 0.00035 33.2 1.2 21 106-126 40-60 (208)
108 PRK00016 metal-binding heat sh 37.6 17 0.00037 31.8 1.2 17 106-122 115-131 (159)
109 PRK13963 unkown domain/putativ 35.6 20 0.00044 33.8 1.4 16 106-121 217-232 (258)
110 cd06460 M32_Taq Peptidase fami 35.4 18 0.00038 36.6 1.0 17 106-125 161-177 (396)
111 smart00731 SprT SprT homologue 35.0 20 0.00042 30.7 1.2 15 105-119 60-74 (146)
112 KOG2921|consensus 34.9 18 0.00039 36.2 1.0 33 106-141 133-165 (484)
113 cd06006 R3H_unknown_2 R3H doma 34.7 41 0.0009 24.4 2.6 29 105-133 29-57 (59)
114 PF10263 SprT-like: SprT-like 34.7 20 0.00043 30.7 1.2 16 105-120 61-76 (157)
115 PF12315 DUF3633: Protein of u 34.4 20 0.00043 32.8 1.2 14 106-119 95-108 (212)
116 cd06164 S2P-M50_SpoIVFB_CBS Sp 33.8 21 0.00044 33.1 1.2 21 106-126 55-75 (227)
117 PF13699 DUF4157: Domain of un 32.3 24 0.00052 27.0 1.2 14 104-117 61-74 (79)
118 PRK10911 oligopeptidase A; Pro 32.3 21 0.00045 38.7 1.1 11 106-116 465-475 (680)
119 PF14521 Aspzincin_M35: Lysine 32.0 20 0.00044 30.8 0.8 14 104-117 96-111 (148)
120 PRK03982 heat shock protein Ht 31.7 22 0.00048 34.0 1.1 12 105-116 126-137 (288)
121 PF02102 Peptidase_M35: Deuter 30.3 29 0.00062 34.6 1.6 21 98-118 288-311 (359)
122 COG0501 HtpX Zn-dependent prot 30.1 24 0.00053 33.4 1.1 12 105-116 158-169 (302)
123 KOG3538|consensus 30.0 13 0.00029 41.1 -0.9 21 105-125 318-338 (845)
124 PRK03001 M48 family peptidase; 29.8 25 0.00054 33.6 1.0 12 105-116 125-136 (283)
125 PRK03072 heat shock protein Ht 29.4 25 0.00055 33.7 1.0 21 39-59 65-85 (288)
126 PRK01345 heat shock protein Ht 29.4 26 0.00055 34.2 1.1 12 105-116 125-136 (317)
127 PRK05457 heat shock protein Ht 28.3 28 0.0006 33.4 1.1 11 105-115 135-145 (284)
128 PRK10280 dipeptidyl carboxypep 28.0 28 0.00061 37.7 1.1 18 48-65 369-386 (681)
129 cd06160 S2P-M50_like_2 Unchara 27.9 29 0.00063 31.1 1.1 22 106-127 43-64 (183)
130 PF06167 Peptidase_M90: Glucos 27.8 30 0.00065 32.7 1.2 21 36-56 39-59 (253)
131 PF04228 Zn_peptidase: Putativ 26.9 31 0.00067 33.3 1.1 11 106-116 172-182 (292)
132 cd02639 R3H_RRM R3H domain of 26.6 37 0.00081 24.7 1.3 17 105-121 30-46 (60)
133 PRK04897 heat shock protein Ht 26.0 31 0.00067 33.3 0.9 11 105-115 138-148 (298)
134 KOG2556|consensus 25.2 34 0.00074 35.1 1.1 19 106-124 257-275 (666)
135 PRK02391 heat shock protein Ht 25.1 34 0.00074 33.0 1.1 11 105-115 134-144 (296)
136 PRK01265 heat shock protein Ht 24.4 34 0.00075 33.5 1.0 11 105-115 141-151 (324)
137 PF05506 DUF756: Domain of unk 23.8 77 0.0017 24.4 2.7 41 326-366 47-87 (89)
138 PF14891 Peptidase_M91: Effect 23.5 36 0.00078 30.0 0.8 18 106-123 105-122 (174)
139 cd02325 R3H R3H domain. The na 22.8 82 0.0018 21.3 2.5 30 105-134 29-58 (59)
140 PF08810 KapB: Kinase associat 22.4 21 0.00046 29.3 -0.8 53 2-54 31-90 (112)
141 PRK04351 hypothetical protein; 21.7 47 0.001 28.8 1.1 12 105-116 62-73 (149)
142 PF01431 Peptidase_M13: Peptid 21.2 44 0.00095 29.9 0.9 13 106-118 38-50 (206)
143 TIGR00054 RIP metalloprotease 21.1 45 0.00097 33.8 1.1 11 106-116 16-26 (420)
144 PF07108 PipA: PipA protein; 21.0 94 0.002 27.2 2.8 52 68-128 122-177 (200)
145 PF13058 DUF3920: Protein of u 21.0 49 0.0011 27.2 1.0 15 106-122 78-92 (126)
146 PRK15410 DgsA anti-repressor M 20.7 44 0.00095 31.7 0.8 17 104-120 142-158 (260)
147 PF13203 DUF2201_N: Putative m 20.5 53 0.0011 31.3 1.4 20 106-126 62-81 (292)
No 1
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=100.00 E-value=1.6e-64 Score=463.73 Aligned_cols=206 Identities=36% Similarity=0.658 Sum_probs=194.6
Q ss_pred CCCccC-CccccccccccccCCCCcCCCeeEEEEcCCCCCHHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEecCCCC
Q psy2949 1 MEGDIL-IPRERDSRNLVLYQAQLWPDKTVYYNFEDSEFTIYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFRNTGFG 79 (367)
Q Consensus 1 ~~GDi~-~~~q~~~r~a~~~~~~~Wp~~~VpY~~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~~~~~g 79 (367)
|||||+ +++ ++|+|+.....+||+ +|||.| +.+|+..+++.|++||++|++.|||||+|++++.+||.|.. +.|
T Consensus 24 ~eGDI~l~~~--~~R~a~~~~~~~Wp~-~VPY~i-~~~~~~~~~~~I~~Am~~~~~~TCirF~~rt~e~~yi~i~~-~~G 98 (230)
T cd04282 24 FEGDILLDEG--QSRNGLIGDTYRWPF-PIPYIL-DDSLDLNAKGVILKAFEMYRLKSCVDFKPYEGESNYIFFFK-GSG 98 (230)
T ss_pred ccccccCCcc--cccccccCcccCCCc-ceeEEE-CCCCCHHHHHHHHHHHHHHHhCCCeeEEECCCCCcEEEEEc-CCC
Confidence 799999 775 468999999999999 999999 88899999999999999999999999999999999999974 689
Q ss_pred ccCCCCCCCCCcccccccCCCcccccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCC
Q psy2949 80 CASPVGYFPIGTGIDIFLGGRVCFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGM 159 (367)
Q Consensus 80 c~S~vG~~~g~q~i~~~~l~~~c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~ 159 (367)
|||+||+.+|.|.|+ |+.+|...|+|+|||||||||+|||+|||||+||+|+|+||.++..+||.|++...+.++|+
T Consensus 99 C~S~vG~~gg~q~is---l~~~C~~~Gti~HEl~HalGf~HEqsRpDRD~yV~I~~~nI~~~~~~nF~k~~~~~~~~~g~ 175 (230)
T cd04282 99 CWSMVGDQQGGQNLS---IGAGCDYKATVEHEFLHALGFYHEQSRSDRDDYVKIWWDQILSGREHNFNKYDDSFSTDLNT 175 (230)
T ss_pred eeeccCccCCeEEEE---ECCCcCCCchHHHHHHHHhCCcccccccccccceEEeecccCchHHHHhhhcCccccccCCC
Confidence 999999999999998 78899999999999999999999999999999999999999999999999999888888999
Q ss_pred cCCCCCCcCcccceeccCCCcceeeeCCCCCCCccCccccccHHHHHHHHHHhcC
Q psy2949 160 PYDYGSIMHYSGIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLYKC 214 (367)
Q Consensus 160 ~YD~~SIMhY~~~~fs~~~~~~ti~~~~~~~~~~IG~r~~lS~~D~~~in~lY~C 214 (367)
||||+|||||++.+||+++..+||+|+++.+...|||+.+||..|+.+||+||+|
T Consensus 176 pYDy~SIMHY~~~aFs~~~~~pTi~~~~~~~~~~iGqr~~lS~~Di~~iN~~Y~C 230 (230)
T cd04282 176 PYDYESVMHYSPFSFNKGASEPTITTKIPEFNDIIGQRLDFSDIDLERLNRMYNC 230 (230)
T ss_pred CCCcccccccCCCccccCCCCceeeecCCcccccccccCCCCHHHHHHHHHhcCC
Confidence 9999999999999999994459999999877788999999999999999999998
No 2
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=100.00 E-value=5.4e-62 Score=439.83 Aligned_cols=196 Identities=38% Similarity=0.702 Sum_probs=183.5
Q ss_pred ccccCCCCcCCCeeEEEEcCCCCCHHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEecCCCCccCCCCCCC-CCcccc
Q psy2949 16 LVLYQAQLWPDKTVYYNFEDSEFTIYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFRNTGFGCASPVGYFP-IGTGID 94 (367)
Q Consensus 16 a~~~~~~~Wp~~~VpY~~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~~~~~gc~S~vG~~~-g~q~i~ 94 (367)
|.....++||+++|||.| +.+|+..+++.|++||++|++.|||||+|++++.+||.|...+.||||+||+.+ |.|.|+
T Consensus 2 ~~~~~~~~Wp~~~VpY~i-~~~~~~~~~~~I~~Am~~~~~~TCirF~~r~~~~~yi~f~~~~~GC~S~vG~~~~g~q~is 80 (200)
T cd04281 2 ATARKERIWPGGVIPYVI-DGNFTGSQRAMFKQAMRHWENFTCVTFVERTPEENYIVFTYRPCGCCSYVGRRGNGPQAIS 80 (200)
T ss_pred CccCccCcCCCCEEEEEE-CCCCCHHHHHHHHHHHHHHHhCCceEEEECCCCCCEEEEEECCCCeeEcCCCcCCCceeee
Confidence 556678999999999999 888999999999999999999999999999999999999866689999999987 789887
Q ss_pred cccCCCcccccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCCCcCccccee
Q psy2949 95 IFLGGRVCFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGSIMHYSGIAF 174 (367)
Q Consensus 95 ~~~l~~~c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~SIMhY~~~~f 174 (367)
|+.+|...|+|+|||||||||+|||+|||||+||.|+|+||.++..+||.|++...+..+|+||||+|||||++.+|
T Consensus 81 ---l~~~C~~~Gti~HEl~HaLGf~HEhsRpDRD~yV~I~~~nI~~~~~~nF~k~~~~~~~~~g~pYDy~SiMHY~~~aF 157 (200)
T cd04281 81 ---IGKNCDKFGIVVHELGHVIGFWHEHTRPDRDDHVTIIRENIQPGQEYNFLKMEPEEVDSLGEPYDFDSIMHYARNTF 157 (200)
T ss_pred ---cCCCcCcCchHHHHHHHHhcCcchhccccccceEEEeecccCcchhhhhhhcCccccccCCCccCCCccccCCCCcc
Confidence 78999999999999999999999999999999999999999999999999999888888999999999999999999
Q ss_pred ccCCCcceeeeCCCC--CCCccCccccccHHHHHHHHHHhcCC
Q psy2949 175 SKDGVSKTIVPLYPG--AEDTMGQRDAMSRVDLAKLNRLYKCP 215 (367)
Q Consensus 175 s~~~~~~ti~~~~~~--~~~~IG~r~~lS~~D~~~in~lY~C~ 215 (367)
++++..+||+|+++. +...|||+.+||+.|+.+||+||+|+
T Consensus 158 s~~~~~~Ti~p~~~~~~~~~~iGqr~~lS~~Di~~iN~~Y~C~ 200 (200)
T cd04281 158 SRGMFLDTILPKRDPNGVRPEIGQRTRLSEGDIIQANKLYKCP 200 (200)
T ss_pred ccCCCCCceEECCCcccccccccccCCCCHHHHHHHHHhcCCC
Confidence 999877999998774 45779999999999999999999995
No 3
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=100.00 E-value=3.9e-56 Score=395.81 Aligned_cols=177 Identities=42% Similarity=0.773 Sum_probs=165.3
Q ss_pred eeEEEEcCCCCCHHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEecCCCCccCCCCCCCCCcccccccCC-Ccccccc
Q psy2949 28 TVYYNFEDSEFTIYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFRNTGFGCASPVGYFPIGTGIDIFLGG-RVCFLKG 106 (367)
Q Consensus 28 ~VpY~~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~~~~~gc~S~vG~~~g~q~i~~~~l~-~~c~~~g 106 (367)
+|||.| +.+|++.+++.|++||++|++.|||||++++++.+||.|.. +.||||+||+.++.|.|+ ++ .+|...|
T Consensus 5 ~IpY~i-~~~~~~~~~~~I~~A~~~~~~~TCirF~~~~~~~~yi~~~~-~~gC~S~vG~~gg~q~i~---l~~~~C~~~G 79 (182)
T cd04283 5 YVPYVI-SPQYSENERAVIEKAMQEFETLTCVRFVPRTTERDYLNIES-RSGCWSYIGRQGGRQTVS---LQKQGCMYKG 79 (182)
T ss_pred EEEEEE-CCCCCHHHHHHHHHHHHHHHhCCceeeEECCCCCcEEEEEc-CCCceEecCccCCceeEe---cCCCCcCccc
Confidence 799999 88899999999999999999999999999998899999974 689999999999999998 55 6899999
Q ss_pred hhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCCCcCcccceeccCCCcceeeeC
Q psy2949 107 KIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGSIMHYSGIAFSKDGVSKTIVPL 186 (367)
Q Consensus 107 ~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~SIMhY~~~~fs~~~~~~ti~~~ 186 (367)
+|+|||||||||+|||+|||||+||.|+|+||.++..+||.|++. .++++||||+|||||++.+||+++. +||+|+
T Consensus 80 ~i~HEl~HaLG~~HEhsRpDRD~yV~I~~~nI~~~~~~nF~~~~~---~~~~~pYDy~SiMHY~~~afs~~g~-~Ti~~~ 155 (182)
T cd04283 80 IIQHELLHALGFYHEQTRSDRDKYVRINWENIIPDQLYNFDKQDT---NNLGTPYDYSSVMHYGRYAFSINGK-PTIVPI 155 (182)
T ss_pred hHHHHHHHHhCCcccccccccCceEEEehhhcCchHHhhhhhccc---cccCCcccceeeeccCCcccccCCC-CeEEEC
Confidence 999999999999999999999999999999999999999999864 3578999999999999999999887 999999
Q ss_pred CCCCCCccCccccccHHHHHHHHHHhcC
Q psy2949 187 YPGAEDTMGQRDAMSRVDLAKLNRLYKC 214 (367)
Q Consensus 187 ~~~~~~~IG~r~~lS~~D~~~in~lY~C 214 (367)
++.. ..||||.+||+.|+.+||+||.|
T Consensus 156 ~~~~-~~iGqr~~lS~~Di~~iN~~Y~C 182 (182)
T cd04283 156 PDPN-VPIGQRQGMSNLDILRINKLYNC 182 (182)
T ss_pred Cccc-ccccCCCCCCHHHHHHHHHhcCC
Confidence 7643 57999999999999999999988
No 4
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=100.00 E-value=1.8e-56 Score=403.66 Aligned_cols=189 Identities=44% Similarity=0.855 Sum_probs=151.0
Q ss_pred CCcCCCeeEEEEcCCCCCHHHHHHHHHHHHHHHHcCCcEEEEcC-CCCceEEEecCCCCccCCCCCCCCCcccccccCCC
Q psy2949 22 QLWPDKTVYYNFEDSEFTIYEKTLVENAIQDLRMHTCVRFVPRT-NQDTYLRFRNTGFGCASPVGYFPIGTGIDIFLGGR 100 (367)
Q Consensus 22 ~~Wp~~~VpY~~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~-~~~~~I~i~~~~~gc~S~vG~~~g~q~i~~~~l~~ 100 (367)
++||+++|||.| ++++++.+++.|++||++|++.|||+|++++ ...++|.|. .+.||||+||+.++.|.|+ |+.
T Consensus 1 ~~Wp~~~IpY~~-~~~~~~~~~~~I~~Am~~~e~~TcI~F~~~~~~~~~~i~~~-~~~gC~S~vG~~~g~q~i~---l~~ 75 (191)
T PF01400_consen 1 KKWPNGTIPYYI-DPSFSSSQRQRIRKAMDEWEKNTCIRFVERTENEDDYISFS-NGSGCWSYVGRQGGEQTIN---LGD 75 (191)
T ss_dssp -S-GGGEEEEEE-ETTS-HHHHHHHHHHHHHHHHHSSEEEEE-SSSSSSEEEEE-SSSSEEEESS--SSEEEEE---E-T
T ss_pred CcCCCCEEEEEE-CCCCCHHHHHHHHHHHHHHHhCCCEEEEECCCCCceEEEee-cCccccchhhhcCcceeEE---ecc
Confidence 589999999999 7889999999999999999999999999999 888899886 5699999999999999998 788
Q ss_pred cccccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCCCcCcccceeccCCCc
Q psy2949 101 VCFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGSIMHYSGIAFSKDGVS 180 (367)
Q Consensus 101 ~c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~SIMhY~~~~fs~~~~~ 180 (367)
+|...|+|+|||||||||.|||+|||||+||.|+|+||.++...||.+.+...+..+++|||++|||||++.+|++++..
T Consensus 76 ~c~~~~~i~HEl~HaLG~~HEh~RpDRd~yi~i~~~~i~~~~~~~f~~~~~~~~~~~~~pYD~~SIMHY~~~~~s~~~~~ 155 (191)
T PF01400_consen 76 GCFSVGTILHELGHALGFWHEHQRPDRDNYITINWDNIQPGYKHNFDKYPPSEWSTYGSPYDYGSIMHYSSYAFSKNGSQ 155 (191)
T ss_dssp TC-SHHHHHHHHHHHHTB--GGGSTTGGGTEEE-GGGB-TTSGGGG---SGGGBE-TTS---TT-TT---TTTTBSSTTS
T ss_pred eeCCccchHHHHHHHHhhhhhhhccccccEEEEehhcchhhhhhhhhccccccccccCCCcCccCeecccCcccccCCCC
Confidence 99999999999999999999999999999999999999999999999998888888999999999999999999999977
Q ss_pred ceeeeCCCCCCCccCccccccHHHHHHHHHHhcCC
Q psy2949 181 KTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLYKCP 215 (367)
Q Consensus 181 ~ti~~~~~~~~~~IG~r~~lS~~D~~~in~lY~C~ 215 (367)
+||+|++..+...||++.+||+.|+++||.+|+|+
T Consensus 156 ~ti~~~~~~~~~~iG~~~~lS~~D~~~in~~Y~C~ 190 (191)
T PF01400_consen 156 PTITPKDPKYQETIGQRNRLSFTDIKQINKMYNCP 190 (191)
T ss_dssp -SEEESSTTCTS-GGG-SS--HHHHHHHHHHTT--
T ss_pred CeEEecCCCCcccccccCCCCHHHHHHHHHhcCCC
Confidence 99999998777899999999999999999999996
No 5
>KOG3714|consensus
Probab=100.00 E-value=2.6e-55 Score=437.59 Aligned_cols=202 Identities=45% Similarity=0.813 Sum_probs=188.5
Q ss_pred cccccccccCCCCcCCCeeEEEEcCCCCCHHHHHHHHHHHHHHHHcCCcEEEEcC-CCCceEEEecCCCCccCCCCCCCC
Q psy2949 11 RDSRNLVLYQAQLWPDKTVYYNFEDSEFTIYEKTLVENAIQDLRMHTCVRFVPRT-NQDTYLRFRNTGFGCASPVGYFPI 89 (367)
Q Consensus 11 ~~~r~a~~~~~~~Wp~~~VpY~~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~-~~~~~I~i~~~~~gc~S~vG~~~g 89 (367)
+..|+++.+..++||+++|||.| +++|+..+|+.|+.||++|+++|||+|+|++ ...+++.|.... ||||+||+.++
T Consensus 69 ~~~r~~~~~~~~~Wp~~~ipY~i-~~~~~~~~r~~i~~A~~~~~~~TCi~F~~~~~~~~~~~~~~~~~-gC~S~VGr~gg 146 (411)
T KOG3714|consen 69 RSRRNGTSNPERRWPNGVIPYYI-DGSFTSSQRALIRQAMREIENHTCIRFVERTTPDKDYLIVFTGG-GCYSYVGRRGG 146 (411)
T ss_pred hhhhhcccChhhcCCCCeeeeEE-CCCCCHHHHHHHHHHHHHHhhCcceEEEeCCCCCcceEEEeCCC-cceeeeCccCC
Confidence 46788888999999999999999 7779999999999999999999999999998 567888886554 99999999998
Q ss_pred C-cccccccCCCcccccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCCCcC
Q psy2949 90 G-TGIDIFLGGRVCFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGSIMH 168 (367)
Q Consensus 90 ~-q~i~~~~l~~~c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~SIMh 168 (367)
. |.++ ++.+|...|+|+|||||||||+|||+|||||+||+|+|+||.++..+||.|++......+++||||+||||
T Consensus 147 ~~q~~s---l~~~C~~~G~i~HEl~HaLGf~HehsR~DRD~yV~I~~~ni~~~~~~nF~k~~~~~~~~~~~pYDygSvMH 223 (411)
T KOG3714|consen 147 GQQLLS---LGDGCDRFGTIVHELMHALGFWHEHSRPDRDNYVSINWDNIDPGQEYNFEKYSPDEVTTYGVPYDYGSVMH 223 (411)
T ss_pred Ccccee---cCCCcCcCchhHHHHHHHhhhhhccCcccccCceEEeeccCChhhhhhhhhcChhhhhccCCcccCCcccc
Confidence 6 7666 89999999999999999999999999999999999999999999999999999988999999999999999
Q ss_pred cccceeccCCCcceeeeCCCCCCCccCccccccHHHHHHHHHHhcCCCC
Q psy2949 169 YSGIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLYKCPKN 217 (367)
Q Consensus 169 Y~~~~fs~~~~~~ti~~~~~~~~~~IG~r~~lS~~D~~~in~lY~C~~n 217 (367)
|++.+|++++..+||+|++...+.+||||..+|+.|+.+||+||.|+..
T Consensus 224 Y~~~afs~~~~~~ti~~~~~~~~~~mGqr~~~S~~Di~~iN~~Y~C~~~ 272 (411)
T KOG3714|consen 224 YAPYAFSKNGSLPTIVPKDNGFQNTMGQRERLSFYDIRKINKLYCCPEC 272 (411)
T ss_pred cCCcccCcCCCCCceecccccccccccccCcCCHHHHHHHHhhcCCCcc
Confidence 9999999999988999999987778999999999999999999999643
No 6
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=100.00 E-value=3.6e-54 Score=385.26 Aligned_cols=180 Identities=44% Similarity=0.815 Sum_probs=170.0
Q ss_pred CCeeEEEEcCCCCCHHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEecCCCCccCCCCCCCCCcccccccCCCccccc
Q psy2949 26 DKTVYYNFEDSEFTIYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFRNTGFGCASPVGYFPIGTGIDIFLGGRVCFLK 105 (367)
Q Consensus 26 ~~~VpY~~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~~~~~gc~S~vG~~~g~q~i~~~~l~~~c~~~ 105 (367)
+++|||.| +.++++.+++.|++||++|++.|||+|+|++.+.++|.|... .||||.||+.++.|.|+ |+.+|...
T Consensus 1 ~~~VpY~i-~~~~~~~~~~~I~~A~~~w~~~TcIrF~~~~~~~~~I~f~~~-~Gc~S~vG~~~~~q~i~---l~~~c~~~ 75 (180)
T cd04280 1 NGTVPYVI-DGSFDESDRSLILRAMREIESNTCIRFVPRTTEKDYIRIVKG-SGCWSYVGRVGGRQVVS---LGSGCFSL 75 (180)
T ss_pred CCEEEEEE-CCCCCHHHHHHHHHHHHHHHhCCcceEEECCCCCcEEEEEcC-CCcceecCccCCceeEE---eCCCcCcC
Confidence 57999999 777999999999999999999999999999988999999765 89999999999999998 78899999
Q ss_pred chhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCCCcCcccceeccCCCcceeee
Q psy2949 106 GKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGSIMHYSGIAFSKDGVSKTIVP 185 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~SIMhY~~~~fs~~~~~~ti~~ 185 (367)
|+|+|||||||||+|||+|||||+||.|+|+||.++...||.|++...+..+++||||+|||||++.+|++++. +||+|
T Consensus 76 g~v~HE~~HalG~~HEh~R~DRD~yv~i~~~ni~~~~~~nf~~~~~~~~~~~~~~YDy~SiMhY~~~~fs~~~~-~ti~~ 154 (180)
T cd04280 76 GTIVHELMHALGFYHEQSRPDRDDYVTINWENIQPGYEHNFDKYSPDTVTTYGVPYDYGSVMHYGPTAFSKNGK-PTIVP 154 (180)
T ss_pred chhHHHHHHHhcCcchhcccccCCeEEEeecccChhHHHHhhhcCcccccccCCcCCcceEeccCCcccccCCC-ceEEE
Confidence 99999999999999999999999999999999999999999999988888889999999999999999999977 99999
Q ss_pred CCCCCCCccCccccccHHHHHHHHHHh
Q psy2949 186 LYPGAEDTMGQRDAMSRVDLAKLNRLY 212 (367)
Q Consensus 186 ~~~~~~~~IG~r~~lS~~D~~~in~lY 212 (367)
+++.+.. +|++.+||+.|+++||+||
T Consensus 155 ~~~~~~~-~g~~~~~S~~D~~~in~~Y 180 (180)
T cd04280 155 KDPGYQI-IGQREGLSFLDIKKINKMY 180 (180)
T ss_pred CCchhhc-ccCCCCCCHHHHHHHHHhC
Confidence 9987544 8999999999999999998
No 7
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=100.00 E-value=9.2e-41 Score=303.23 Aligned_cols=168 Identities=26% Similarity=0.292 Sum_probs=132.9
Q ss_pred CCcCCCeeE-EEEcCCCCCHHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEe-cCCCCccCCCCCCC-----CCcccc
Q psy2949 22 QLWPDKTVY-YNFEDSEFTIYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFR-NTGFGCASPVGYFP-----IGTGID 94 (367)
Q Consensus 22 ~~Wp~~~Vp-Y~~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~-~~~~gc~S~vG~~~-----g~q~i~ 94 (367)
++||+++|+ |.| ..++++.+|+.|++||++|++.|||+|++++.....|+|. ..+.||||+||+.+ +.+.|+
T Consensus 1 ~~W~~~~~~~~~f-~~~~~~~~r~~I~~A~~~W~~~t~i~F~~~~~~~adi~I~f~~~~Gc~S~vG~~~~~~~~~~~t~~ 79 (198)
T cd04327 1 KLWRNGTVLRIAF-LGGPDAFLKDKVRAAAREWLPYANLKFKFVTDADADIRISFTPGDGYWSYVGTDALLIGADAPTMN 79 (198)
T ss_pred CCCCCCCeEEEEe-CCCCcHHHHHHHHHHHHHHhhhcCeEEEECCCCCCCEEEEEecCCCCCCCcCCcccccCCCCceee
Confidence 589999655 777 6667899999999999999999999999998434444443 34589999999953 345565
Q ss_pred cccCCC------cccccchhHHHHHhhhcccccccCCCCCcceEEeeCC--cCC-----------CCC--cccccCCCCc
Q psy2949 95 IFLGGR------VCFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLREN--IGP-----------GHE--FNLERRPTGS 153 (367)
Q Consensus 95 ~~~l~~------~c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~n--i~~-----------~~~--~nf~k~~~~~ 153 (367)
++. +|...++|+|||||||||.|||+||||| |.|++ |.+ +.. .+|.|+..
T Consensus 80 ---l~~~~~~~~~~~~~~~i~HElgHaLG~~HEh~rpdrd----i~w~~~~i~~~~~~~~~~~~~~~~~~~~f~~~~~-- 150 (198)
T cd04327 80 ---LGWFTDDTPDPEFSRVVLHEFGHALGFIHEHQSPAAN----IPWDKEAVYAYFSGPPNWDRETVINHNVFAKLDD-- 150 (198)
T ss_pred ---eeeecCCCchhhHHHHHHHHHHHHhcCcccccCCCCC----CCcCHHHHHHHHhcCCCCcHHHHHHhhhhhcccc--
Confidence 333 4556699999999999999999999999 67773 321 112 56888764
Q ss_pred ccccCCcCCCCCCcCcc-cceeccCCCcceeeeCCCCCCCccCccccccHHHHHHHHHHh
Q psy2949 154 VRTFGMPYDYGSIMHYS-GIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLY 212 (367)
Q Consensus 154 ~~~~g~~YD~~SIMhY~-~~~fs~~~~~~ti~~~~~~~~~~IG~r~~lS~~D~~~in~lY 212 (367)
...+++||||+|||||+ +.+|++++. +| |++.+||..|+++|+.||
T Consensus 151 ~~~~~~~yD~~SIMHY~~~~~~t~~g~-~t------------~~~~~lS~~D~~~i~~~Y 197 (198)
T cd04327 151 GDVAYSPYDPDSIMHYPFPGSLTLDGE-EV------------PPNRTLSDKDKAFMRLLY 197 (198)
T ss_pred ccccCCCCCcHHHHcCCCcHHhhcCCc-cc------------CCCCCCCHHHHHHHHHhC
Confidence 33578999999999999 789998877 33 677889999999999999
No 8
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.96 E-value=5.8e-30 Score=219.67 Aligned_cols=135 Identities=34% Similarity=0.610 Sum_probs=115.9
Q ss_pred CCCcCCCeeEEEEcCCCCCHHH-HHHHHHHHHHHHHcCCcEEEEcC-CCCceEEEecCCC-Ccc-CCCCCCCCCcccccc
Q psy2949 21 AQLWPDKTVYYNFEDSEFTIYE-KTLVENAIQDLRMHTCVRFVPRT-NQDTYLRFRNTGF-GCA-SPVGYFPIGTGIDIF 96 (367)
Q Consensus 21 ~~~Wp~~~VpY~~~~~~~~~~~-~~~I~~A~~~w~~~tci~F~~~~-~~~~~I~i~~~~~-gc~-S~vG~~~g~q~i~~~ 96 (367)
.++||+++|+|+|+...+++.+ ++.|++||+.|++.+||+|++++ +...+|.|..... ||+ |++|+.++.+.++
T Consensus 2 ~~~W~~~~v~Y~i~~~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~~ad~~I~f~~~~~~g~~~a~~g~~~g~~~~~-- 79 (140)
T smart00235 2 SKKWPKGTVPYVIDSSSLSPEEVREAIARAFAEWSDVTCLRFVERTSTADIYISFGKGDGSGCTLSHAGRPGGDQHFS-- 79 (140)
T ss_pred CCcCCCCcEEEEEcCCCCCHHHHHHHHHHHHHHHhcCCeeEEEECCCCCCeEEEEEECCCCCcceeeeecCCCceEEE--
Confidence 4789999999999326778887 99999999999999999999987 5677888876543 998 9999988888887
Q ss_pred cCCCcccccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCC
Q psy2949 97 LGGRVCFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGS 165 (367)
Q Consensus 97 ~l~~~c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~S 165 (367)
++.+|...++++|||||||||.|||+|||||+|+.|+|+++ ..+|.+.. .+.++.+|||+|
T Consensus 80 -~~~~~~~~~~~~HEigHaLGl~H~~~~~drd~~~~~~~~~~----~~~~~~~~---~~~~~~~~~yg~ 140 (140)
T smart00235 80 -LGNGCINTGVAAHELGHALGLYHEQSRSDRDNYMYINYTNI----TRNFDLSN---DDSLGIPYDYGS 140 (140)
T ss_pred -ccCCcCCcccHHHHHHHHhcCCcCCCCCcccCeEEEehhhh----hhcccccc---ccCCCchhccCc
Confidence 57789999999999999999999999999999999999998 35666543 245778999986
No 9
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.96 E-value=5.8e-29 Score=218.70 Aligned_cols=145 Identities=25% Similarity=0.322 Sum_probs=119.6
Q ss_pred CeeEEEEcCCCCCHHHHHHHHHHHHHHHHcCCcEEEEcCCC-CceEEEecC-----CCCccCCCCCC--CCCcccccccC
Q psy2949 27 KTVYYNFEDSEFTIYEKTLVENAIQDLRMHTCVRFVPRTNQ-DTYLRFRNT-----GFGCASPVGYF--PIGTGIDIFLG 98 (367)
Q Consensus 27 ~~VpY~~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~-~~~I~i~~~-----~~gc~S~vG~~--~g~q~i~~~~l 98 (367)
.+|+|.| +..++...|+.|++||+.|++.+||+|++++.. ...|+|... +.||+|++|+. ++.+.|. +
T Consensus 2 ~~I~y~i-~~~~~~~~r~~i~~A~~~W~~~~~i~F~e~~~~~~~di~i~~~~~~~~~~~~~~~~~~~~~~~~g~i~---~ 77 (165)
T cd04268 2 KPITYYI-DDSVPDKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYNDGTWSYGPSQVDPLTGEIL---L 77 (165)
T ss_pred CCEEEEE-cCCCCHHHHHHHHHHHHHHHHHhCcCceeccccCccCceEEEEEeecCCCCccccCCccCCCCCccEE---e
Confidence 4799999 778899999999999999999999999999853 566666432 58999999985 4455555 3
Q ss_pred CCccc-----------ccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCCCc
Q psy2949 99 GRVCF-----------LKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGSIM 167 (367)
Q Consensus 99 ~~~c~-----------~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~SIM 167 (367)
+..|. ..++++|||||||||.|||+|||||.++ ..++.+||+.|||
T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~~~~~~-----------------------~~~~~~~~~~SvM 134 (165)
T cd04268 78 ARVYLYSSFVEYSGARLRNTAEHELGHALGLRHNFAASDRDDNV-----------------------DLLAEKGDTSSVM 134 (165)
T ss_pred eEEEEchhHHHHHHHHHHHHHHHHHHHHhcccccCcCCccCCcc-----------------------hhhccCCCCcccC
Confidence 44443 4599999999999999999999999876 3467899999999
Q ss_pred CcccceeccCCCcceeeeCCCCCCCccCccccccHHHHHHHHHHh
Q psy2949 168 HYSGIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLY 212 (367)
Q Consensus 168 hY~~~~fs~~~~~~ti~~~~~~~~~~IG~r~~lS~~D~~~in~lY 212 (367)
||....|+.+ ..+|++..++..|+..|+.||
T Consensus 135 ~y~~~~~~~~--------------~~~~~~~~~~~~Di~ai~~lY 165 (165)
T cd04268 135 DYAPSNFSIQ--------------LGDGQKYTIGPYDIAAIKKLY 165 (165)
T ss_pred CCCccccccc--------------cccccCCCCCHHHHHHHHhcC
Confidence 9999887655 125778889999999999998
No 10
>KOG3714|consensus
Probab=99.88 E-value=1.4e-23 Score=209.66 Aligned_cols=120 Identities=38% Similarity=0.560 Sum_probs=105.1
Q ss_pred CCccCccccccHHHHHHHHHHhcCCCCcccCcccccccCCCCcccccccCCCCCCcccccccccccCCCceeeeeCCCCC
Q psy2949 191 EDTMGQRDAMSRVDLAKLNRLYKCPKNYYQGHEFNLERRPAGSVRTFSMPYDYGSIMHYSGIAFSKDGVSKTIVPLYPGA 270 (367)
Q Consensus 191 ~~~IG~r~~lS~~D~~~in~lY~C~~nI~~g~~~Nf~~~~~~~~~~~~~pYDy~SiMhY~~~~fs~~~~~~ti~~~~~~~ 270 (367)
.|++|..|+.|+.|++....+- .+||.++...||++++...++++++||||+|||||++++||+++..+||+|+++..
T Consensus 168 ~HaLGf~HehsR~DRD~yV~I~--~~ni~~~~~~nF~k~~~~~~~~~~~pYDygSvMHY~~~afs~~~~~~ti~~~~~~~ 245 (411)
T KOG3714|consen 168 MHALGFWHEHSRPDRDNYVSIN--WDNIDPGQEYNFEKYSPDEVTTYGVPYDYGSVMHYAPYAFSKNGSLPTIVPKDNGF 245 (411)
T ss_pred HHHhhhhhccCcccccCceEEe--eccCChhhhhhhhhcChhhhhccCCcccCCcccccCCcccCcCCCCCceecccccc
Confidence 4889999999999999855544 48999999999999999999999999999999999999999999999999999888
Q ss_pred ccccccccCCCHHHHHHHhhhccCCCCcccCccccccccCCCC
Q psy2949 271 EDTMGQRDAMSRVDLAKLNRLYKCPKNYYQGFDIQGFYSTSGP 313 (367)
Q Consensus 271 ~~~~gqr~~~s~~D~~~in~~Y~C~~~~~~~~~~~g~~~~~~~ 313 (367)
+.+||||.++|+.|+.+||+||+|+.++...+ .+|..+++++
T Consensus 246 ~~~mGqr~~~S~~Di~~iN~~Y~C~~~~~~~~-~~~g~~~~~~ 287 (411)
T KOG3714|consen 246 QNTMGQRERLSFYDIRKINKLYCCPECCELLC-NNGGTNPSNC 287 (411)
T ss_pred cccccccCcCCHHHHHHHHhhcCCCccCcccc-cCCCCCCccc
Confidence 77899999999999999999999999887444 3434444333
No 11
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=99.88 E-value=1.3e-23 Score=190.02 Aligned_cols=102 Identities=31% Similarity=0.540 Sum_probs=92.5
Q ss_pred CccCccccccHHHHHHHHHHhcCCCCcccCcccccccCCCCcccccccCCCCCCcccccccccccCCCceeeeeCCCC--
Q psy2949 192 DTMGQRDAMSRVDLAKLNRLYKCPKNYYQGHEFNLERRPAGSVRTFSMPYDYGSIMHYSGIAFSKDGVSKTIVPLYPG-- 269 (367)
Q Consensus 192 ~~IG~r~~lS~~D~~~in~lY~C~~nI~~g~~~Nf~~~~~~~~~~~~~pYDy~SiMhY~~~~fs~~~~~~ti~~~~~~-- 269 (367)
|++|..|+.|+.|++....+- .+||.++...||.+.+...++++++||||+|||||++.+||+++..+||+|+.+.
T Consensus 97 HaLGf~HEhsRpDRD~yV~I~--~~nI~~~~~~nF~k~~~~~~~~~g~pYDy~SiMHY~~~aFs~~~~~~Ti~p~~~~~~ 174 (200)
T cd04281 97 HVIGFWHEHTRPDRDDHVTII--RENIQPGQEYNFLKMEPEEVDSLGEPYDFDSIMHYARNTFSRGMFLDTILPKRDPNG 174 (200)
T ss_pred HHhcCcchhccccccceEEEe--ecccCcchhhhhhhcCccccccCCCccCCCccccCCCCccccCCCCCceEECCCccc
Confidence 789999999999999855543 5789999999999999888899999999999999999999999878999998763
Q ss_pred CccccccccCCCHHHHHHHhhhccCC
Q psy2949 270 AEDTMGQRDAMSRVDLAKLNRLYKCP 295 (367)
Q Consensus 270 ~~~~~gqr~~~s~~D~~~in~~Y~C~ 295 (367)
+..+||||.+||+.|+++||+||+||
T Consensus 175 ~~~~iGqr~~lS~~Di~~iN~~Y~C~ 200 (200)
T cd04281 175 VRPEIGQRTRLSEGDIIQANKLYKCP 200 (200)
T ss_pred ccccccccCCCCHHHHHHHHHhcCCC
Confidence 45689999999999999999999997
No 12
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=99.87 E-value=2.6e-23 Score=191.26 Aligned_cols=101 Identities=33% Similarity=0.580 Sum_probs=93.7
Q ss_pred CccCccccccHHHHHHHHHHhcCCCCcccCcccccccCCCCcccccccCCCCCCcccccccccccCCCceeeeeCCCCCc
Q psy2949 192 DTMGQRDAMSRVDLAKLNRLYKCPKNYYQGHEFNLERRPAGSVRTFSMPYDYGSIMHYSGIAFSKDGVSKTIVPLYPGAE 271 (367)
Q Consensus 192 ~~IG~r~~lS~~D~~~in~lY~C~~nI~~g~~~Nf~~~~~~~~~~~~~pYDy~SiMhY~~~~fs~~~~~~ti~~~~~~~~ 271 (367)
|++|..|+.|+.|++....+. .+||.++...||+|++...++++++||||+|||||++++||+++..+||+|+.+.+.
T Consensus 130 HalGf~HEqsRpDRD~yV~I~--~~nI~~~~~~nF~k~~~~~~~~~g~pYDy~SIMHY~~~aFs~~~~~pTi~~~~~~~~ 207 (230)
T cd04282 130 HALGFYHEQSRSDRDDYVKIW--WDQILSGREHNFNKYDDSFSTDLNTPYDYESVMHYSPFSFNKGASEPTITTKIPEFN 207 (230)
T ss_pred HHhCCcccccccccccceEEe--ecccCchHHHHhhhcCccccccCCCCCCcccccccCCCccccCCCCceeeecCCccc
Confidence 789999999999999966665 578999999999999998889999999999999999999999866799999988877
Q ss_pred cccccccCCCHHHHHHHhhhccC
Q psy2949 272 DTMGQRDAMSRVDLAKLNRLYKC 294 (367)
Q Consensus 272 ~~~gqr~~~s~~D~~~in~~Y~C 294 (367)
.+||||.+||+.|+++||+||+|
T Consensus 208 ~~iGqr~~lS~~Di~~iN~~Y~C 230 (230)
T cd04282 208 DIIGQRLDFSDIDLERLNRMYNC 230 (230)
T ss_pred ccccccCCCCHHHHHHHHHhcCC
Confidence 89999999999999999999999
No 13
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=99.83 E-value=1.1e-21 Score=174.86 Aligned_cols=96 Identities=34% Similarity=0.644 Sum_probs=85.7
Q ss_pred CccCccccccHHHHHHHHHHhcCCCCcccCcccccccCCCCcccccccCCCCCCcccccccccccCCCceeeeeCCCCCc
Q psy2949 192 DTMGQRDAMSRVDLAKLNRLYKCPKNYYQGHEFNLERRPAGSVRTFSMPYDYGSIMHYSGIAFSKDGVSKTIVPLYPGAE 271 (367)
Q Consensus 192 ~~IG~r~~lS~~D~~~in~lY~C~~nI~~g~~~Nf~~~~~~~~~~~~~pYDy~SiMhY~~~~fs~~~~~~ti~~~~~~~~ 271 (367)
|++|..++.|+.|++....+. .+||.++...||++.+. .+++.||||+|||||++.+||+++. +||+|+.+. .
T Consensus 87 HaLG~~HEhsRpDRD~yV~I~--~~nI~~~~~~nF~~~~~---~~~~~pYDy~SiMHY~~~afs~~g~-~Ti~~~~~~-~ 159 (182)
T cd04283 87 HALGFYHEQTRSDRDKYVRIN--WENIIPDQLYNFDKQDT---NNLGTPYDYSSVMHYGRYAFSINGK-PTIVPIPDP-N 159 (182)
T ss_pred HHhCCcccccccccCceEEEe--hhhcCchHHhhhhhccc---cccCCcccceeeeccCCcccccCCC-CeEEECCcc-c
Confidence 789999999999999966655 58899999999999865 4678999999999999999999875 899998654 3
Q ss_pred cccccccCCCHHHHHHHhhhccC
Q psy2949 272 DTMGQRDAMSRVDLAKLNRLYKC 294 (367)
Q Consensus 272 ~~~gqr~~~s~~D~~~in~~Y~C 294 (367)
.+||||.+||+.|+++||+||+|
T Consensus 160 ~~iGqr~~lS~~Di~~iN~~Y~C 182 (182)
T cd04283 160 VPIGQRQGMSNLDILRINKLYNC 182 (182)
T ss_pred ccccCCCCCCHHHHHHHHHhcCC
Confidence 58999999999999999999999
No 14
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.81 E-value=1.3e-19 Score=159.36 Aligned_cols=145 Identities=28% Similarity=0.327 Sum_probs=107.6
Q ss_pred eeEEEEcCC-------CCCHHHHHHHHHHHHHHHHcCCcEEEEcCCC--CceEEEecC------CCCccCCCCCC--CCC
Q psy2949 28 TVYYNFEDS-------EFTIYEKTLVENAIQDLRMHTCVRFVPRTNQ--DTYLRFRNT------GFGCASPVGYF--PIG 90 (367)
Q Consensus 28 ~VpY~~~~~-------~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~--~~~I~i~~~------~~gc~S~vG~~--~g~ 90 (367)
.|+|.|... .++...++.|+.|++.|+..+||+|+++..+ ...|.|... +.+|+|++|.. +..
T Consensus 2 ~i~~~i~~d~~~~~~~~~~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~~~~~~g~a~~~~~c~~~~ 81 (167)
T cd00203 2 VIPYVVVADDRDVEEENLSAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDFDGGTGGWAYLGRVCDSLR 81 (167)
T ss_pred EEEEEEEecccccchhhhHHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccCCCCceEEEecCCccCCCC
Confidence 466766322 2677899999999999999999999999854 555555321 46789999885 322
Q ss_pred cccccccCCCc----ccccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCCC
Q psy2949 91 TGIDIFLGGRV----CFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGSI 166 (367)
Q Consensus 91 q~i~~~~l~~~----c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~SI 166 (367)
..+. +... .....+++|||||||||.|++.+++||.+|++. ......++|..||
T Consensus 82 ~~~~---~~~~~~~~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~-------------------~~~~~~~~~~~si 139 (167)
T cd00203 82 GVGV---LQDNQSGTKEGAQTIAHELGHALGFYHDHDRKDRDDYPTID-------------------DTLNAEDDDYYSV 139 (167)
T ss_pred CcEE---EecCCcccccchhhHHHHHHHHhCCCccCcCCCCCCCcccc-------------------ccccCCCCCCCeE
Confidence 2222 1112 245589999999999999999999999988772 1124578999999
Q ss_pred cCcccceeccCCCcceeeeCCCCCCCccCccccccHHHHHHHHHHh
Q psy2949 167 MHYSGIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLY 212 (367)
Q Consensus 167 MhY~~~~fs~~~~~~ti~~~~~~~~~~IG~r~~lS~~D~~~in~lY 212 (367)
|||....++ ++++..+|..|+.+|+.+|
T Consensus 140 M~y~~~~~~------------------~~~~~~fS~~d~~~i~~~Y 167 (167)
T cd00203 140 MSYTKGSFS------------------DGQRKDFSQCDIDQINKLY 167 (167)
T ss_pred eccCccccC------------------cccCCCcCHHHHHHHHhhC
Confidence 999875432 4566779999999999987
No 15
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=99.78 E-value=3.5e-20 Score=167.33 Aligned_cols=103 Identities=37% Similarity=0.669 Sum_probs=73.0
Q ss_pred CccCccccccHHHHHHHHHHhcCCCCcccCcccccccCCCCcccccccCCCCCCcccccccccccCCCceeeeeCCCCCc
Q psy2949 192 DTMGQRDAMSRVDLAKLNRLYKCPKNYYQGHEFNLERRPAGSVRTFSMPYDYGSIMHYSGIAFSKDGVSKTIVPLYPGAE 271 (367)
Q Consensus 192 ~~IG~r~~lS~~D~~~in~lY~C~~nI~~g~~~Nf~~~~~~~~~~~~~pYDy~SiMhY~~~~fs~~~~~~ti~~~~~~~~ 271 (367)
|++|..++.++.|++....+. .+||.++...||.+.+......++.||||+|||||++.+|++++..+||+|+++...
T Consensus 89 HaLG~~HEh~RpDRd~yi~i~--~~~i~~~~~~~f~~~~~~~~~~~~~pYD~~SIMHY~~~~~s~~~~~~ti~~~~~~~~ 166 (191)
T PF01400_consen 89 HALGFWHEHQRPDRDNYITIN--WDNIQPGYKHNFDKYPPSEWSTYGSPYDYGSIMHYSSYAFSKNGSQPTITPKDPKYQ 166 (191)
T ss_dssp HHHTB--GGGSTTGGGTEEE---GGGB-TTSGGGG---SGGGBE-TTS---TT-TT---TTTTBSSTTS-SEEESSTTCT
T ss_pred HHHhhhhhhhccccccEEEEe--hhcchhhhhhhhhccccccccccCCCcCccCeecccCcccccCCCCCeEEecCCCCc
Confidence 789999999999999866665 478999999999999877788899999999999999999999998899999988778
Q ss_pred cccccccCCCHHHHHHHhhhccCCC
Q psy2949 272 DTMGQRDAMSRVDLAKLNRLYKCPK 296 (367)
Q Consensus 272 ~~~gqr~~~s~~D~~~in~~Y~C~~ 296 (367)
.+|||+.+||+.|+++||.||+|+.
T Consensus 167 ~~iG~~~~lS~~D~~~in~~Y~C~~ 191 (191)
T PF01400_consen 167 ETIGQRNRLSFTDIKQINKMYNCPE 191 (191)
T ss_dssp S-GGG-SS--HHHHHHHHHHTT---
T ss_pred ccccccCCCCHHHHHHHHHhcCCCC
Confidence 8999999999999999999999973
No 16
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=99.75 E-value=1.5e-19 Score=161.75 Aligned_cols=97 Identities=39% Similarity=0.672 Sum_probs=87.2
Q ss_pred CccCccccccHHHHHHHHHHhcCCCCcccCcccccccCCCCcccccccCCCCCCcccccccccccCCCceeeeeCCCCCc
Q psy2949 192 DTMGQRDAMSRVDLAKLNRLYKCPKNYYQGHEFNLERRPAGSVRTFSMPYDYGSIMHYSGIAFSKDGVSKTIVPLYPGAE 271 (367)
Q Consensus 192 ~~IG~r~~lS~~D~~~in~lY~C~~nI~~g~~~Nf~~~~~~~~~~~~~pYDy~SiMhY~~~~fs~~~~~~ti~~~~~~~~ 271 (367)
|++|..++.++.|++....+. .+||.++...||.+.+.....+++.||||+|||||++.+|++++. +||+|+++...
T Consensus 84 HalG~~HEh~R~DRD~yv~i~--~~ni~~~~~~nf~~~~~~~~~~~~~~YDy~SiMhY~~~~fs~~~~-~ti~~~~~~~~ 160 (180)
T cd04280 84 HALGFYHEQSRPDRDDYVTIN--WENIQPGYEHNFDKYSPDTVTTYGVPYDYGSVMHYGPTAFSKNGK-PTIVPKDPGYQ 160 (180)
T ss_pred HHhcCcchhcccccCCeEEEe--ecccChhHHHHhhhcCcccccccCCcCCcceEeccCCcccccCCC-ceEEECCchhh
Confidence 789999999999999855554 478999999999999998888899999999999999999999864 89999987764
Q ss_pred cccccccCCCHHHHHHHhhhc
Q psy2949 272 DTMGQRDAMSRVDLAKLNRLY 292 (367)
Q Consensus 272 ~~~gqr~~~s~~D~~~in~~Y 292 (367)
. ||||.+||+.|+++||+||
T Consensus 161 ~-~g~~~~~S~~D~~~in~~Y 180 (180)
T cd04280 161 I-IGQREGLSFLDIKKINKMY 180 (180)
T ss_pred c-ccCCCCCCHHHHHHHHHhC
Confidence 3 9999999999999999998
No 17
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=99.70 E-value=9.6e-17 Score=140.42 Aligned_cols=99 Identities=18% Similarity=0.094 Sum_probs=66.8
Q ss_pred CcCCCeeEEEEcCCCC----CHHHHHHHHHHHHHHHHcCCcEEEEcCCC-Cce--EEEecCCCCc-cCCCCCC-------
Q psy2949 23 LWPDKTVYYNFEDSEF----TIYEKTLVENAIQDLRMHTCVRFVPRTNQ-DTY--LRFRNTGFGC-ASPVGYF------- 87 (367)
Q Consensus 23 ~Wp~~~VpY~~~~~~~----~~~~~~~I~~A~~~w~~~tci~F~~~~~~-~~~--I~i~~~~~gc-~S~vG~~------- 87 (367)
+||+++|+|+| +... ....++.|++||+.|+..+||+|++++.. ..- |.+.....+| +++-|..
T Consensus 1 kW~~~~itY~i-~~~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~g~l~~a~ 79 (157)
T cd04278 1 KWSKTNLTYRI-LNYPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPGGTLAHAF 79 (157)
T ss_pred CCCCCceeEEE-ECCCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCcccccccc
Confidence 69999999999 5443 37889999999999999999999998743 333 4444444454 3333321
Q ss_pred -----CCCcccc---cccC---CCcccccchhHHHHHhhhcccccc
Q psy2949 88 -----PIGTGID---IFLG---GRVCFLKGKIQHEILHSLGFWHEH 122 (367)
Q Consensus 88 -----~g~q~i~---~~~l---~~~c~~~g~i~HEigHALG~~HEh 122 (367)
.+...++ ..++ ..+.....+++|||||||||.|++
T Consensus 80 ~p~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~ 125 (157)
T cd04278 80 FPGGIGGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHALGLGHSS 125 (157)
T ss_pred CCCCcceeEEECCCcccccCCCCccchHHHHHHHHhccccccCCCC
Confidence 1111222 1111 124445689999999999999974
No 18
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.55 E-value=4.4e-14 Score=126.95 Aligned_cols=141 Identities=17% Similarity=0.141 Sum_probs=91.8
Q ss_pred CCCHHHHHHHHHHHHHHHHcCCcEEEEcCCC-CceEEEecCC-----CCccCCCCCC------CCCcccccccC----CC
Q psy2949 37 EFTIYEKTLVENAIQDLRMHTCVRFVPRTNQ-DTYLRFRNTG-----FGCASPVGYF------PIGTGIDIFLG----GR 100 (367)
Q Consensus 37 ~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~-~~~I~i~~~~-----~gc~S~vG~~------~g~q~i~~~~l----~~ 100 (367)
.+++.+++.|++||+.|++.+||+|++++.. ...|+|.... .+++++.+.. .+.-.++.... ..
T Consensus 30 ~~~~~~~~~i~~A~~~w~~~~~l~F~e~~~~~~adI~i~~~~~~~~~~~g~a~~p~~~~~~~~~g~i~~~~~~~~~~~~~ 109 (186)
T cd04277 30 ALSAAQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPDGNTAGYAYYPGSGSGTAYGGDIWFNSSYDTNSDSP 109 (186)
T ss_pred cCCHHHHHHHHHHHHHHHhhcCceeEECCCCCcceEEEEeccCCCCCccEEEECCCCCccccccceeEEecCcccccCCC
Confidence 4578999999999999999999999999843 5677775432 3445544432 22333442111 23
Q ss_pred cccccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCCCcCcccceeccCCCc
Q psy2949 101 VCFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGSIMHYSGIAFSKDGVS 180 (367)
Q Consensus 101 ~c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~SIMhY~~~~fs~~~~~ 180 (367)
++....+++|||||||||.|+|..++.+.+-. .....-.-.|||-|....... .
T Consensus 110 g~~~~~t~~HEiGHaLGL~H~~~~~~~~~~~~-----------------------~~~~~~~~~SVMSY~~~~~~~--~- 163 (186)
T cd04277 110 GSYGYQTIIHEIGHALGLEHPGDYNGGDPVPP-----------------------TYALDSREYTVMSYNSGYGNG--A- 163 (186)
T ss_pred ChhhHHHHHHHHHHHhcCCCCCcCCCCCCCCc-----------------------cccccCcceEEEeecCCCCCC--c-
Confidence 66677999999999999999999887653210 001112345899996542110 0
Q ss_pred ceeeeCCCCCCCccCccccccHHHHHHHHHHhc
Q psy2949 181 KTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLYK 213 (367)
Q Consensus 181 ~ti~~~~~~~~~~IG~r~~lS~~D~~~in~lY~ 213 (367)
. ..-+....+...|++.|..||+
T Consensus 164 -~---------~~~~~~~~~~~~DI~AlQ~lYG 186 (186)
T cd04277 164 -S---------AGGGYPQTPMLLDIAALQYLYG 186 (186)
T ss_pred -c---------ccCcccCCccHHHHHHHHHhhC
Confidence 0 0113335678899999999984
No 19
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.50 E-value=1.7e-13 Score=119.74 Aligned_cols=98 Identities=21% Similarity=0.244 Sum_probs=60.9
Q ss_pred eeEEEEcCCC-----CCHHHHHHHHHHHHHHHHcCCcEEEEcCC---CCceEEEecC-------CCCccCCCCCCC----
Q psy2949 28 TVYYNFEDSE-----FTIYEKTLVENAIQDLRMHTCVRFVPRTN---QDTYLRFRNT-------GFGCASPVGYFP---- 88 (367)
Q Consensus 28 ~VpY~~~~~~-----~~~~~~~~I~~A~~~w~~~tci~F~~~~~---~~~~I~i~~~-------~~gc~S~vG~~~---- 88 (367)
.|+|+++... .....++.|++||+.|++.+||+|++... ..+.+.++.. +..|.++.+..-
T Consensus 3 ~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~~~~~~g~~~a~a~~p~~~~~~ 82 (156)
T cd04279 3 PIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPPPVGGAGGGLARAGFPLISDGN 82 (156)
T ss_pred CeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCCCCCCCCCceEEecccccCCCc
Confidence 5788884432 25678999999999999999999999874 3333222221 233433333210
Q ss_pred --C--CcccccccCCC--cccccchhHHHHHhhhcccccccCC
Q psy2949 89 --I--GTGIDIFLGGR--VCFLKGKIQHEILHSLGFWHEHTRP 125 (367)
Q Consensus 89 --g--~q~i~~~~l~~--~c~~~g~i~HEigHALG~~HEh~rp 125 (367)
+ ...++...... .....++++|||||||||.||++++
T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~ 125 (156)
T cd04279 83 RKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSDRP 125 (156)
T ss_pred ccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCCCc
Confidence 1 11122100000 1223589999999999999999865
No 20
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=99.47 E-value=7.9e-13 Score=114.59 Aligned_cols=100 Identities=20% Similarity=0.246 Sum_probs=61.8
Q ss_pred CcCCCeeEEEEcCCC--C-CHHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEe--cC-CCCccCCCCCC--------C
Q psy2949 23 LWPDKTVYYNFEDSE--F-TIYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFR--NT-GFGCASPVGYF--------P 88 (367)
Q Consensus 23 ~Wp~~~VpY~~~~~~--~-~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~--~~-~~gc~S~vG~~--------~ 88 (367)
+||..+|.|+|.... + .+..+++|++||+.|+..+.|+|++......-|.|. .. ...+..+-|.. +
T Consensus 1 ~W~~~~ity~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~~~adi~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (154)
T PF00413_consen 1 KWPKKTITYSISNTTPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSDGNADIRISFGSNNHGDGYSFDGSGGTLAHAYFP 80 (154)
T ss_dssp SSSSSEEEEEESSBCTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESSSSCSEEEEEESSSSSSSS-CSSSSSESEEEEES
T ss_pred CCCCCcEEEEEECCCCCCCHHHHHHHHHHHHHHHHhcCCceEEeccCCCcceeeeeeccccCcccccccceeeeeccccc
Confidence 699999999993322 1 256889999999999999999999998433444442 22 12222221211 1
Q ss_pred -----CCcccc---cccCCCccc-ccchhHHHHHhhhcccccc
Q psy2949 89 -----IGTGID---IFLGGRVCF-LKGKIQHEILHSLGFWHEH 122 (367)
Q Consensus 89 -----g~q~i~---~~~l~~~c~-~~g~i~HEigHALG~~HEh 122 (367)
+...++ ......... ...+++|||||||||.|++
T Consensus 81 ~~~~~~~i~~~~~~~~~~~~~~~~~~~v~~HEiGHaLGL~H~~ 123 (154)
T PF00413_consen 81 NNIVSGDIHFNDDESWTIDDSGNDLQSVAIHEIGHALGLDHSN 123 (154)
T ss_dssp SSTTTTEEEEETTSHEESSSSSEEHHHHHHHHHHHHTTBESSS
T ss_pred cccccccccccccccchhhhhhhhhhhhhhhccccccCcCcCC
Confidence 111111 000111222 2379999999999999975
No 21
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.24 E-value=2.1e-12 Score=117.35 Aligned_cols=87 Identities=21% Similarity=0.220 Sum_probs=64.2
Q ss_pred CccCccccccHHHHH------HHHHHhcCCCC-cccCcc--cccccCCCCcccccccCCCCCCccccc-ccccccCCCce
Q psy2949 192 DTMGQRDAMSRVDLA------KLNRLYKCPKN-YYQGHE--FNLERRPAGSVRTFSMPYDYGSIMHYS-GIAFSKDGVSK 261 (367)
Q Consensus 192 ~~IG~r~~lS~~D~~------~in~lY~C~~n-I~~g~~--~Nf~~~~~~~~~~~~~pYDy~SiMhY~-~~~fs~~~~~~ 261 (367)
|++|..|+.|+.|++ .|...|.-..+ +..+.. .+|.+.+. ...++.||||+|||||+ +.+|++++. +
T Consensus 102 HaLG~~HEh~rpdrdi~w~~~~i~~~~~~~~~~~~~~~~~~~~f~~~~~--~~~~~~~yD~~SIMHY~~~~~~t~~g~-~ 178 (198)
T cd04327 102 HALGFIHEHQSPAANIPWDKEAVYAYFSGPPNWDRETVINHNVFAKLDD--GDVAYSPYDPDSIMHYPFPGSLTLDGE-E 178 (198)
T ss_pred HHhcCcccccCCCCCCCcCHHHHHHHHhcCCCCcHHHHHHhhhhhcccc--ccccCCCCCcHHHHcCCCcHHhhcCCc-c
Confidence 788888999999886 22233322212 222222 67888764 34689999999999999 899998875 2
Q ss_pred eeeeCCCCCccccccccCCCHHHHHHHhhhcc
Q psy2949 262 TIVPLYPGAEDTMGQRDAMSRVDLAKLNRLYK 293 (367)
Q Consensus 262 ti~~~~~~~~~~~gqr~~~s~~D~~~in~~Y~ 293 (367)
| |++.+||..|+++||+||.
T Consensus 179 t------------~~~~~lS~~D~~~i~~~Yp 198 (198)
T cd04327 179 V------------PPNRTLSDKDKAFMRLLYP 198 (198)
T ss_pred c------------CCCCCCCHHHHHHHHHhCc
Confidence 2 6778999999999999994
No 22
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=98.97 E-value=7.1e-09 Score=93.93 Aligned_cols=178 Identities=22% Similarity=0.264 Sum_probs=95.2
Q ss_pred CCCccC-Ccccccccc----cc---ccCCCCcC--CCeeEEEEcCC----CCCHHHHHHHHHHHHHHHH-cCCcEEEEc-
Q psy2949 1 MEGDIL-IPRERDSRN----LV---LYQAQLWP--DKTVYYNFEDS----EFTIYEKTLVENAIQDLRM-HTCVRFVPR- 64 (367)
Q Consensus 1 ~~GDi~-~~~q~~~r~----a~---~~~~~~Wp--~~~VpY~~~~~----~~~~~~~~~I~~A~~~w~~-~tci~F~~~- 64 (367)
.||||+ +++|+++.. .. -++..+|. ..+|...+ .. +++...+..+.+||+.|+. ...|+|.-.
T Consensus 3 vegDi~~s~~~l~~~~~~~~~~~eqYrTtnlV~~~~~~I~i~~-~~~~~~~l~s~~~~al~~AI~~yNalgl~i~F~~t~ 81 (211)
T PF12388_consen 3 VEGDIFVSKKQLREMLQSDDGTSEQYRTTNLVGSSPRTITIIG-YTGGSQSLNSAWRTALDEAINNYNALGLSITFTLTF 81 (211)
T ss_pred eccceEEcHHHHhhhhccCCcchhheeeeeeecCCCCEEEEEe-CCCccccccHHHHHHHHHHHHHHHhhCCceEEEEec
Confidence 489999 887743322 11 12345673 34788888 55 5678899999999999999 557888411
Q ss_pred -C--CCCceEEEecCC---CCccCCCCCC--CCC--cccccccCCCccc--ccchhHHHHHhhhcccccccCCCCCcceE
Q psy2949 65 -T--NQDTYLRFRNTG---FGCASPVGYF--PIG--TGIDIFLGGRVCF--LKGKIQHEILHSLGFWHEHTRPDRDQFVR 132 (367)
Q Consensus 65 -~--~~~~~I~i~~~~---~gc~S~vG~~--~g~--q~i~~~~l~~~c~--~~g~i~HEigHALG~~HEh~rpDRD~yv~ 132 (367)
. ...+.+.+.... .|--..-|.. +|. ..+....+...-. ...+|+|||||+|||.|+.
T Consensus 82 g~~~~~~di~v~~~~~~~~~G~ggsAGFP~s~G~P~~~I~I~~~~~~~~~~~~hvi~HEiGH~IGfRHTD---------- 151 (211)
T PF12388_consen 82 GTNYQNADIIVYSNSSNNPSGAGGSAGFPTSNGNPYKFIQIYGLSNYSVNVIEHVITHEIGHCIGFRHTD---------- 151 (211)
T ss_pred CCCcCCCceEEEeccCCCCCCcceeccCCCCCCCCCceEEEEecCCCchhHHHHHHHHHhhhhccccccC----------
Confidence 1 233444442211 1111111222 121 1111111111111 2369999999999999975
Q ss_pred EeeCCcCC-CCCcccccCCCCcccccCCc--CCCCCCcCcccceeccCCCcceeeeCCCCCCCccCccccccHHHHHHHH
Q psy2949 133 VLRENIGP-GHEFNLERRPTGSVRTFGMP--YDYGSIMHYSGIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLN 209 (367)
Q Consensus 133 I~~~ni~~-~~~~nf~k~~~~~~~~~g~~--YD~~SIMhY~~~~fs~~~~~~ti~~~~~~~~~~IG~r~~lS~~D~~~in 209 (367)
|.+-.. +...|=.--....+..-|+| +|..|||.-.-. -+...+++..|+..|+
T Consensus 152 --~~~R~SCG~~~nEg~~~vGAi~IpGTPt~~d~~SiM~ac~~---------------------~~~~~~f~~~Di~Al~ 208 (211)
T PF12388_consen 152 --YFNRSSCGSGGNEGSAGVGAIHIPGTPTGADPNSIMNACFS---------------------SGEDGEFTSNDITALN 208 (211)
T ss_pred --cCCcccccccCCcCccccceEECCCCCCCCCCchhhhcccc---------------------CCCCCCcChhhHHHHH
Confidence 222110 00000000000011222443 599999987421 1234568999999999
Q ss_pred HHh
Q psy2949 210 RLY 212 (367)
Q Consensus 210 ~lY 212 (367)
.||
T Consensus 209 ~lY 211 (211)
T PF12388_consen 209 YLY 211 (211)
T ss_pred hhC
Confidence 998
No 23
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=98.55 E-value=1.2e-06 Score=79.36 Aligned_cols=31 Identities=13% Similarity=0.296 Sum_probs=27.1
Q ss_pred CCCeeEEEEcCCCCCHHHHHHHHHHHHHHHHc
Q psy2949 25 PDKTVYYNFEDSEFTIYEKTLVENAIQDLRMH 56 (367)
Q Consensus 25 p~~~VpY~~~~~~~~~~~~~~I~~A~~~w~~~ 56 (367)
|+..|.|.| +..++...++.|++|++.|++.
T Consensus 6 pk~pItyyI-~~~~p~~~r~aI~~A~~~Wn~~ 36 (197)
T cd04276 6 PKEPIVYYL-DNTFPEKYRDAIREGVLYWNKA 36 (197)
T ss_pred CCCCEEEEe-cCCCcHHHHHHHHHHHHHHHHH
Confidence 567899999 6667789999999999999885
No 24
>KOG1565|consensus
Probab=98.50 E-value=7.6e-07 Score=90.57 Aligned_cols=104 Identities=16% Similarity=0.111 Sum_probs=66.4
Q ss_pred cccCCCCcCCCeeEEEEcCCCC----CHHHHHHHHHHHHHHHHcCCcEEEEcCC-CCceEEEe--cCCCCccC-------
Q psy2949 17 VLYQAQLWPDKTVYYNFEDSEF----TIYEKTLVENAIQDLRMHTCVRFVPRTN-QDTYLRFR--NTGFGCAS------- 82 (367)
Q Consensus 17 ~~~~~~~Wp~~~VpY~~~~~~~----~~~~~~~I~~A~~~w~~~tci~F~~~~~-~~~~I~i~--~~~~gc~S------- 82 (367)
......+|+...|+|+| .... ....+.++++|+..|++.|-++|.|+.. ...-|.|. ....|+..
T Consensus 99 ~~~~~~kW~k~~lT~ri-~n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~~h~d~~PFDG~~g 177 (469)
T KOG1565|consen 99 YFPGKPKWNKEHLTYRI-KNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADIRISFFPGDHGDGFPFDGPGG 177 (469)
T ss_pred cCcccCcccccccceec-cccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCceeeeeeccCCCCCCcccCCCC
Confidence 33456799999999999 3322 4678999999999999999999999984 44444443 22223211
Q ss_pred CCCC-------CCCCcccc---cccCC--CcccccchhHHHHHhhhccccc
Q psy2949 83 PVGY-------FPIGTGID---IFLGG--RVCFLKGKIQHEILHSLGFWHE 121 (367)
Q Consensus 83 ~vG~-------~~g~q~i~---~~~l~--~~c~~~g~i~HEigHALG~~HE 121 (367)
.+.+ .+|.-... .+.++ .+-....|++|||||||||.|.
T Consensus 178 ~laHAf~Pg~~~~G~~hfD~dE~Wt~~~~~g~~l~~Va~HEiGH~LGL~HS 228 (469)
T KOG1565|consen 178 VLAHAFFPGPGIGGDLHFDKDETWTYGDSNGVDLFLVAAHEIGHALGLGHS 228 (469)
T ss_pred ceecccCCCCCCCCccccCcccceeccCCccchhHHHhhhhcccccccCCC
Confidence 1111 11111111 22221 1334458999999999999998
No 25
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=97.72 E-value=1.4e-05 Score=69.80 Aligned_cols=44 Identities=27% Similarity=0.391 Sum_probs=38.6
Q ss_pred cccccCCCCCCcccccccccccCCCceeeeeCCCCCccccccccCCCHHHHHHHhhhc
Q psy2949 235 RTFSMPYDYGSIMHYSGIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLY 292 (367)
Q Consensus 235 ~~~~~pYDy~SiMhY~~~~fs~~~~~~ti~~~~~~~~~~~gqr~~~s~~D~~~in~~Y 292 (367)
..++.+||+.|||||....|+.+ ..+|++..++..||+.|+.||
T Consensus 122 ~~~~~~~~~~SvM~y~~~~~~~~--------------~~~~~~~~~~~~Di~ai~~lY 165 (165)
T cd04268 122 DLLAEKGDTSSVMDYAPSNFSIQ--------------LGDGQKYTIGPYDIAAIKKLY 165 (165)
T ss_pred hhhccCCCCcccCCCCccccccc--------------cccccCCCCCHHHHHHHHhcC
Confidence 57888999999999999998765 245788999999999999998
No 26
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=97.65 E-value=3e-05 Score=64.48 Aligned_cols=86 Identities=20% Similarity=0.091 Sum_probs=45.5
Q ss_pred eeEEEEcCCCCCHHHHHHHHHHHHHHHH-cCCcEEEEcCCCCceEEEecCCCCccCCCCCCC-CCcccccc-cCCCcccc
Q psy2949 28 TVYYNFEDSEFTIYEKTLVENAIQDLRM-HTCVRFVPRTNQDTYLRFRNTGFGCASPVGYFP-IGTGIDIF-LGGRVCFL 104 (367)
Q Consensus 28 ~VpY~~~~~~~~~~~~~~I~~A~~~w~~-~tci~F~~~~~~~~~I~i~~~~~gc~S~vG~~~-g~q~i~~~-~l~~~c~~ 104 (367)
+|.|.- +. .++.+..|.+|.+.|+. .++|+|++... ..+++......-+|+-...+ |.-.|-.. ....+-..
T Consensus 3 tv~Yda-s~--A~~f~~~i~~aa~iWN~sV~NV~L~~~s~--a~~~~~~~~~~~~sya~~~g~G~G~I~l~~~~~qgy~~ 77 (132)
T PF02031_consen 3 TVYYDA-SR--APEFRSAIAQAAQIWNSSVSNVRLVEGSS--ADIRYYEGNDPRGSYASTDGLGSGYIFLDYQQNQGYNS 77 (132)
T ss_dssp EEEEEE-EE---GGGHHHHHHHHHHHHHH-SSEEEEE-SS---SEEEEEE--TT--EEEE-SSS-EEEEEEHHHHHHS-H
T ss_pred EEEEeC-CC--CchhHHHHHHHHHHHhcccCceEEeecCC--CcEEEEEecCCCCcccccCCCCcEEEEechHHhhCCcc
Confidence 466665 32 45788999999999987 78999999875 33334322222233322221 11112100 00123333
Q ss_pred cchhHHHHHhhhcc
Q psy2949 105 KGKIQHEILHSLGF 118 (367)
Q Consensus 105 ~g~i~HEigHALG~ 118 (367)
..+++|||||.|||
T Consensus 78 ~RIaaHE~GHiLGL 91 (132)
T PF02031_consen 78 TRIAAHELGHILGL 91 (132)
T ss_dssp HHHHHHHHHHHHT-
T ss_pred ceeeeehhccccCC
Confidence 47999999999997
No 27
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=97.20 E-value=0.00076 Score=60.71 Aligned_cols=80 Identities=15% Similarity=0.023 Sum_probs=44.2
Q ss_pred HHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEecCCCC-ccCCCCC-------------CCCCcccc---cccCCCcc
Q psy2949 40 IYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFRNTGFG-CASPVGY-------------FPIGTGID---IFLGGRVC 102 (367)
Q Consensus 40 ~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~~~~~g-c~S~vG~-------------~~g~q~i~---~~~l~~~c 102 (367)
+...+.+..|++.|.+.-.+..+++.++.|......+..| -|.-.|+ .-+.-.+| .+....+-
T Consensus 103 p~wq~a~~tava~wa~~fpl~ive~~eeaDItie~~n~pgtg~~~~grArtal~~~efl~~~~~~~~lnhr~~i~~~pg~ 182 (236)
T COG5549 103 PRWQGAYLTAVAGWAKTFPLIIVERFEEADITIEVGNPPGTGWRQYGRARTALIAYEFLGHALGLGHLNHRGDIMYPPGE 182 (236)
T ss_pred hhHHHHHHHHHHHHHHhCCceeeecceeeeEEEEecCCCCCcccccchHHHHHHHHHHhhhccCcccccccccccCCccc
Confidence 3568999999999999866666776554444333333222 1222222 11111122 00011111
Q ss_pred c---ccchhHHHHHhhhccc
Q psy2949 103 F---LKGKIQHEILHSLGFW 119 (367)
Q Consensus 103 ~---~~g~i~HEigHALG~~ 119 (367)
. ..+++.||+|||||++
T Consensus 183 ~~e~L~~tarhElGhaLgi~ 202 (236)
T COG5549 183 LRENLNPTARHELGHALGIW 202 (236)
T ss_pred chhhhhHHHHHhhcchheec
Confidence 1 2389999999999976
No 28
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=95.73 E-value=0.0045 Score=56.62 Aligned_cols=19 Identities=37% Similarity=0.588 Sum_probs=17.4
Q ss_pred cchhHHHHHhhhccccccc
Q psy2949 105 KGKIQHEILHSLGFWHEHT 123 (367)
Q Consensus 105 ~g~i~HEigHALG~~HEh~ 123 (367)
..+++|||||+||+.|+..
T Consensus 138 ~~~~aHEiGH~lGl~H~~~ 156 (206)
T PF13583_consen 138 YQTFAHEIGHNLGLRHDFD 156 (206)
T ss_pred chHHHHHHHHHhcCCCCcc
Confidence 3679999999999999987
No 29
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=95.58 E-value=0.0053 Score=50.92 Aligned_cols=19 Identities=37% Similarity=0.455 Sum_probs=13.2
Q ss_pred cccchhHHHHHhhhccccc
Q psy2949 103 FLKGKIQHEILHSLGFWHE 121 (367)
Q Consensus 103 ~~~g~i~HEigHALG~~HE 121 (367)
....+++|||||.||+.|-
T Consensus 106 ~~~~~~~HEiGH~lGl~Hd 124 (124)
T PF13582_consen 106 SGVDTFAHEIGHNLGLNHD 124 (124)
T ss_dssp TSTTHHHHHHHHHTT----
T ss_pred ccceEeeehhhHhcCCCCC
Confidence 3458999999999999993
No 30
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=95.45 E-value=0.01 Score=52.72 Aligned_cols=20 Identities=30% Similarity=0.345 Sum_probs=14.9
Q ss_pred ccchhHHHHHhhhccccccc
Q psy2949 104 LKGKIQHEILHSLGFWHEHT 123 (367)
Q Consensus 104 ~~g~i~HEigHALG~~HEh~ 123 (367)
...+++|||||.||..|...
T Consensus 111 ~~~~~aHElGH~lGa~Hd~~ 130 (173)
T PF13574_consen 111 GIDTFAHELGHQLGAPHDFD 130 (173)
T ss_dssp HHHHHHHHHHHHHT---SSS
T ss_pred eeeeehhhhHhhcCCCCCCC
Confidence 34689999999999999987
No 31
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=95.34 E-value=0.031 Score=54.40 Aligned_cols=81 Identities=20% Similarity=0.249 Sum_probs=48.1
Q ss_pred CHHHHHHHHHHHHHHHHcC-Cc-EEEEcCCCCceEEEecCCCCc-cCCCCCCCCCcccccccCCCcccccchhHHHHHhh
Q psy2949 39 TIYEKTLVENAIQDLRMHT-CV-RFVPRTNQDTYLRFRNTGFGC-ASPVGYFPIGTGIDIFLGGRVCFLKGKIQHEILHS 115 (367)
Q Consensus 39 ~~~~~~~I~~A~~~w~~~t-ci-~F~~~~~~~~~I~i~~~~~gc-~S~vG~~~g~q~i~~~~l~~~c~~~g~i~HEigHA 115 (367)
+..+.-.+..+.+.+-+.. .+ .| ..-..+|.|...+..| |+-+|..+|.+..- ..-+.++...++++||+||-
T Consensus 86 ~~~d~~~~~~~Ad~~a~~~lG~~~~---s~y~h~vyvlP~~~~C~w~Gla~v~G~~~~~-~~~~~~~~~~~~~~HElgHN 161 (314)
T PF05548_consen 86 STDDWYGWADAADAAARAQLGVNAF---SSYTHRVYVLPPGFACGWAGLATVPGSQSWL-WISGYGVQDWATIMHELGHN 161 (314)
T ss_pred chHHHHHHHHHHHHHHHhhcCcccc---cccceEEEEcCCCCCCCceEEeecCCcceee-eecCcccccHHHHHHHhhhh
Confidence 3344556777777776654 33 23 2234456665555456 44444454544221 11234566778999999999
Q ss_pred hccccccc
Q psy2949 116 LGFWHEHT 123 (367)
Q Consensus 116 LG~~HEh~ 123 (367)
|||.|...
T Consensus 162 ~GL~Ha~~ 169 (314)
T PF05548_consen 162 LGLWHAGR 169 (314)
T ss_pred ccccccCC
Confidence 99999963
No 32
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=95.24 E-value=0.0082 Score=52.39 Aligned_cols=20 Identities=30% Similarity=0.527 Sum_probs=13.5
Q ss_pred chhHHHHHhhhcccccccCC
Q psy2949 106 GKIQHEILHSLGFWHEHTRP 125 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~rp 125 (367)
.|++||+||-|||.|-.+..
T Consensus 71 ~TltHEvGH~LGL~HtF~~~ 90 (154)
T PF05572_consen 71 KTLTHEVGHWLGLYHTFGGG 90 (154)
T ss_dssp HHHHHHHHHHTT---TT---
T ss_pred cchhhhhhhhhcccccccCC
Confidence 79999999999999998754
No 33
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=94.94 E-value=0.015 Score=52.19 Aligned_cols=18 Identities=33% Similarity=0.429 Sum_probs=16.5
Q ss_pred chhHHHHHhhhccccccc
Q psy2949 106 GKIQHEILHSLGFWHEHT 123 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~ 123 (367)
.+++||+||.||+.|...
T Consensus 135 ~~~aHElGH~lG~~HD~~ 152 (192)
T cd04267 135 LTMAHELGHNLGAEHDGG 152 (192)
T ss_pred hhhhhhHHhhcCCcCCCC
Confidence 689999999999999864
No 34
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=94.80 E-value=0.023 Score=49.38 Aligned_cols=40 Identities=38% Similarity=0.577 Sum_probs=33.3
Q ss_pred cccccCCCCCCcccccccccccCCCceeeeeCCCCCccccccccCCCHHHHHHHhhhc
Q psy2949 235 RTFSMPYDYGSIMHYSGIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLY 292 (367)
Q Consensus 235 ~~~~~pYDy~SiMhY~~~~fs~~~~~~ti~~~~~~~~~~~gqr~~~s~~D~~~in~~Y 292 (367)
.....++++.|||||....|+ ++++..||.-|+.+|+++|
T Consensus 128 ~~~~~~~~~~siM~y~~~~~~------------------~~~~~~fS~~d~~~i~~~Y 167 (167)
T cd00203 128 TLNAEDDDYYSVMSYTKGSFS------------------DGQRKDFSQCDIDQINKLY 167 (167)
T ss_pred cccCCCCCCCeEeccCccccC------------------cccCCCcCHHHHHHHHhhC
Confidence 356778999999999866553 4677889999999999998
No 35
>smart00042 CUB Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
Probab=94.76 E-value=0.024 Score=44.69 Aligned_cols=29 Identities=21% Similarity=0.285 Sum_probs=24.8
Q ss_pred ccCCCCCCCCCCCCCCCCcEEEEcccCCCcccc
Q psy2949 308 YSTSGPIPDLGYLPTGSGWFYKIGGPSDDRKIM 340 (367)
Q Consensus 308 ~~~~~~~~~~~~~~~~~~c~~~~~~~~~~r~~~ 340 (367)
.+|++|.+ ++.+.+|.|.|.+|++.++.+
T Consensus 4 ~Sp~yP~~----y~~~~~C~w~i~~~~g~~i~l 32 (102)
T smart00042 4 TSPNYPQS----YPNNLDCVWTIRAPPGYRIEL 32 (102)
T ss_pred eCCCCCcC----CCCCCcEEEEEECCCCeEEEE
Confidence 37888876 789999999999999987765
No 36
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=94.49 E-value=0.018 Score=51.74 Aligned_cols=23 Identities=30% Similarity=0.254 Sum_probs=14.4
Q ss_pred ccchhHHHHHhhhcccccccCCC
Q psy2949 104 LKGKIQHEILHSLGFWHEHTRPD 126 (367)
Q Consensus 104 ~~g~i~HEigHALG~~HEh~rpD 126 (367)
..-+++|||||.||..|.+..+.
T Consensus 142 ~~~~~AHEiGH~lGa~HD~~~~~ 164 (196)
T PF13688_consen 142 GAITFAHEIGHNLGAPHDGDYSS 164 (196)
T ss_dssp HHHHHHHHHHHHTT-----SSS-
T ss_pred eehhhHHhHHHhcCCCCCCCCCC
Confidence 34799999999999999887654
No 37
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=93.92 E-value=0.0096 Score=55.22 Aligned_cols=20 Identities=30% Similarity=0.506 Sum_probs=18.0
Q ss_pred chhHHHHHhhhcccccccCC
Q psy2949 106 GKIQHEILHSLGFWHEHTRP 125 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~rp 125 (367)
.|++||+||.|||+|.++--
T Consensus 139 ~t~~HEvGH~lGL~HtF~~~ 158 (225)
T cd04275 139 DTATHEVGHWLGLYHTFQGG 158 (225)
T ss_pred ceeEEeccceeeeeeeecCC
Confidence 69999999999999999753
No 38
>cd00041 CUB CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
Probab=93.61 E-value=0.074 Score=42.37 Aligned_cols=33 Identities=21% Similarity=0.295 Sum_probs=27.0
Q ss_pred cccc-cCCCCCCCCCCCCCCCCcEEEEcccCCCccccc
Q psy2949 305 QGFY-STSGPIPDLGYLPTGSGWFYKIGGPSDDRKIMD 341 (367)
Q Consensus 305 ~g~~-~~~~~~~~~~~~~~~~~c~~~~~~~~~~r~~~~ 341 (367)
.|.+ +|++|.+ ++.+.+|.|.|.+|.++|+.+-
T Consensus 10 ~g~i~Sp~~p~~----~~~~~~C~w~i~~~~g~~i~l~ 43 (113)
T cd00041 10 SGTISSPNYPNN----YPNNLNCVWTIEAPPGYRIRLT 43 (113)
T ss_pred CeEEECCCCCCC----CCCCCcEEEEEEcCCCCEEEEE
Confidence 4555 8888876 7788999999999999988764
No 39
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=93.31 E-value=0.035 Score=48.97 Aligned_cols=18 Identities=33% Similarity=0.468 Sum_probs=15.2
Q ss_pred chhHHHHHhhhccccccc
Q psy2949 106 GKIQHEILHSLGFWHEHT 123 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~ 123 (367)
--++||+||++||.|=-+
T Consensus 126 KEv~HElGH~~GL~HC~N 143 (181)
T COG1913 126 KEVLHELGHLLGLSHCPN 143 (181)
T ss_pred HHHHHHhhhhcCcccCCC
Confidence 358999999999999654
No 40
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=93.20 E-value=0.039 Score=49.56 Aligned_cols=19 Identities=32% Similarity=0.519 Sum_probs=16.7
Q ss_pred cchhHHHHHhhhccccccc
Q psy2949 105 KGKIQHEILHSLGFWHEHT 123 (367)
Q Consensus 105 ~g~i~HEigHALG~~HEh~ 123 (367)
..+++|||||.||+.|...
T Consensus 132 a~~~AHElGH~lG~~HD~~ 150 (194)
T cd04269 132 AVTMAHELGHNLGMEHDDG 150 (194)
T ss_pred HHHHHHHHHhhcCCCcCCC
Confidence 3799999999999999864
No 41
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=93.04 E-value=0.064 Score=49.34 Aligned_cols=20 Identities=30% Similarity=0.388 Sum_probs=17.4
Q ss_pred cchhHHHHHhhhcccccccC
Q psy2949 105 KGKIQHEILHSLGFWHEHTR 124 (367)
Q Consensus 105 ~g~i~HEigHALG~~HEh~r 124 (367)
..+++||+||.||+.|....
T Consensus 146 ~~~~AHElGH~lG~~HD~~~ 165 (220)
T cd04272 146 VYTMTHELAHLLGAPHDGSP 165 (220)
T ss_pred HHHHHHHHHHHhCCCCCCCC
Confidence 47999999999999998654
No 42
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=92.95 E-value=0.19 Score=48.30 Aligned_cols=19 Identities=37% Similarity=0.354 Sum_probs=16.2
Q ss_pred ccchhHHHHHhhhcccccc
Q psy2949 104 LKGKIQHEILHSLGFWHEH 122 (367)
Q Consensus 104 ~~g~i~HEigHALG~~HEh 122 (367)
..|+++||+||+|||.+.-
T Consensus 165 ~igv~~HE~gH~lGLPDlY 183 (286)
T TIGR03296 165 GVGVIAHELGHDLGLPDLY 183 (286)
T ss_pred ceeeeehhhhcccCCCccc
Confidence 3599999999999987655
No 43
>PF00431 CUB: CUB domain CUB domain entry Spermadhesins family entry Link to schematic domain picture by Peer Bork. ; InterPro: IPR000859 The CUB domain (for complement C1r/C1s, Uegf, Bmp1) is a structural motif of approximately 110 residues found almost exclusively in extracellular and plasma membrane-associated proteins, many of which are developmentally regulated [, ]. These proteins are involved in a diverse range of functions, including complement activation, developmental patterning, tissue repair, axon guidance and angiogenesis, cell signalling, fertilisation, haemostasis, inflammation, neurotransmission, receptor-mediated endocytosis, and tumour suppression [, ]. Many CUB-containing proteins are peptidases belonging to MEROPS peptidase families M12A (astacin) and S1A (chymotrypsin). Proteins containing a CUB domain include: Mammalian complement subcomponents C1s/C1r, which form the calcium-dependent complex C1, the first component of the classical pathway of the complement system. Cricetidae sp. (Hamster) serine protease Casp, which degrades type I and IV collagen and fibronectin in the presence of calcium. Mammalian complement-activating component of Ra-reactive factor (RARF), a protease that cleaves the C4 component of complement. Vertebrate enteropeptidase (3.4.21.9 from EC), a type II membrane protein of the intestinal brush border, which activates trypsinogen. Vertebrate bone morphogenic protein 1 (BMP-1), a protein which induces cartilage and bone formation and expresses metalloendopeptidase activity. Sea urchin blastula proteins BP10 and SpAN. Caenorhabditis elegans hypothetical proteins F42A10.8 and R151.5. Neuropilin (A5 antigen), a calcium-independent cell adhesion molecule that functions during the formation of certain neuronal circuits. Fibropellins I and III from Strongylocentrotus purpuratus (Purple sea urchin). Mammalian hyaluronate-binding protein TSG-6 (or PS4), a serum and growth factor induced protein. Mammalian spermadhesins. Xenopus laevis embryonic protein UVS.2, which is expressed during dorsoanterior development. Several of the above proteins consist of a catalytic domain together with several CUB domains interspersed by calcium-binding EGF domains. Some CUB domains appear to be involved in oligomerisation and/or recognition of substrates and binding partners. For example, in the complement proteases, the CUB domains mediate dimerisation and binding to collagen-like regions of target proteins (e.g. C1q for C1r/C1s). The structure of CUB domains consists of a beta-sandwich with a jelly-roll fold. Almost all CUB domains contain four conserved cysteines that probably form two disulphide bridges (C1-C2, C3-C4). The CUB1 domains of C1s and Map19 have calcium-binding sites [].; PDB: 1SFP_A 3KQ4_B 2WNO_A 2QQK_A 2QQL_A 2QQO_B 2QQM_A 3POJ_A 3POB_A 3POG_B ....
Probab=92.40 E-value=0.058 Score=42.83 Aligned_cols=34 Identities=24% Similarity=0.355 Sum_probs=26.0
Q ss_pred ccccc-cCCCCCCCCCCCCCCCCcEEEEcccCCCccccc
Q psy2949 304 IQGFY-STSGPIPDLGYLPTGSGWFYKIGGPSDDRKIMD 341 (367)
Q Consensus 304 ~~g~~-~~~~~~~~~~~~~~~~~c~~~~~~~~~~r~~~~ 341 (367)
..|.+ +|++|.+ ++...+|.|.|.+|.+.+|.+-
T Consensus 8 ~~g~i~Sp~yp~~----y~~~~~C~w~i~~~~~~~I~l~ 42 (110)
T PF00431_consen 8 NSGIISSPNYPSN----YPSNSDCTWTITAPPGHRIRLT 42 (110)
T ss_dssp SEEEEESTTTTS-----SSSSEEEEEEEE-STTEEEEEE
T ss_pred CeEEEECCCCCCC----CCCCCcEeEEEEecccceeeec
Confidence 45555 8888876 7888999999999999987654
No 44
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=92.16 E-value=0.053 Score=50.37 Aligned_cols=17 Identities=35% Similarity=0.509 Sum_probs=15.3
Q ss_pred chhHHHHHhhhcccccc
Q psy2949 106 GKIQHEILHSLGFWHEH 122 (367)
Q Consensus 106 g~i~HEigHALG~~HEh 122 (367)
.+++|||||.||+.|--
T Consensus 147 ~t~AHElGHnLGm~HD~ 163 (228)
T cd04271 147 QVFAHEIGHTFGAVHDC 163 (228)
T ss_pred eehhhhhhhhcCCCCCC
Confidence 48999999999999864
No 45
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=91.96 E-value=0.073 Score=47.61 Aligned_cols=15 Identities=40% Similarity=0.734 Sum_probs=13.5
Q ss_pred chhHHHHHhhhcccc
Q psy2949 106 GKIQHEILHSLGFWH 120 (367)
Q Consensus 106 g~i~HEigHALG~~H 120 (367)
-.++||+||.||+.|
T Consensus 127 k~~~HElGH~lGL~H 141 (179)
T PRK13267 127 KEVTHELGHTLGLEH 141 (179)
T ss_pred HHHHHHHHHHcCCcc
Confidence 358999999999999
No 46
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=91.95 E-value=0.072 Score=49.99 Aligned_cols=17 Identities=29% Similarity=0.323 Sum_probs=15.0
Q ss_pred chhHHHHHhhhcccccc
Q psy2949 106 GKIQHEILHSLGFWHEH 122 (367)
Q Consensus 106 g~i~HEigHALG~~HEh 122 (367)
.+++|||||.||..|--
T Consensus 169 ~t~AHElGHnlGm~HD~ 185 (244)
T cd04270 169 LVTAHELGHNFGSPHDP 185 (244)
T ss_pred HHHHHHHHHhcCCCCCC
Confidence 68999999999998854
No 47
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=91.79 E-value=0.077 Score=48.01 Aligned_cols=17 Identities=29% Similarity=0.364 Sum_probs=11.6
Q ss_pred chhHHHHHhhhcccccc
Q psy2949 106 GKIQHEILHSLGFWHEH 122 (367)
Q Consensus 106 g~i~HEigHALG~~HEh 122 (367)
-.++||+||++||.|=.
T Consensus 147 Kea~HElGH~~GL~HC~ 163 (194)
T PF07998_consen 147 KEAVHELGHLFGLDHCE 163 (194)
T ss_dssp HHHHHHHHHHTT----S
T ss_pred HHHHHHHHHHcCCcCCC
Confidence 57999999999999854
No 48
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=91.68 E-value=0.041 Score=50.16 Aligned_cols=19 Identities=37% Similarity=0.454 Sum_probs=17.0
Q ss_pred cchhHHHHHhhhccccccc
Q psy2949 105 KGKIQHEILHSLGFWHEHT 123 (367)
Q Consensus 105 ~g~i~HEigHALG~~HEh~ 123 (367)
..+++|||||.||+.|...
T Consensus 141 a~~~aHElGH~LG~~HD~~ 159 (207)
T cd04273 141 AFTIAHELGHVLGMPHDGD 159 (207)
T ss_pred EEeeeeechhhcCCCCCCC
Confidence 4789999999999999865
No 49
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=90.18 E-value=0.14 Score=46.26 Aligned_cols=18 Identities=39% Similarity=0.516 Sum_probs=13.5
Q ss_pred chhHHHHHhhhccccccc
Q psy2949 106 GKIQHEILHSLGFWHEHT 123 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~ 123 (367)
.+++|||||.||+.|...
T Consensus 133 ~~~AHelGH~lGm~HD~~ 150 (199)
T PF01421_consen 133 VIIAHELGHNLGMPHDGD 150 (199)
T ss_dssp HHHHHHHHHHTT---TTT
T ss_pred HHHHHHHHHhcCCCCCCC
Confidence 688999999999999876
No 50
>PF11350 DUF3152: Protein of unknown function (DUF3152); InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=89.25 E-value=0.15 Score=46.05 Aligned_cols=98 Identities=20% Similarity=0.255 Sum_probs=53.3
Q ss_pred CeeEEEEcCC-CC--C-----HHHHHHHHHHHHH---HHHcCCcEEEEcCC-CCce-EEEecC---CCCcc-CC-CCC--
Q psy2949 27 KTVYYNFEDS-EF--T-----IYEKTLVENAIQD---LRMHTCVRFVPRTN-QDTY-LRFRNT---GFGCA-SP-VGY-- 86 (367)
Q Consensus 27 ~~VpY~~~~~-~~--~-----~~~~~~I~~A~~~---w~~~tci~F~~~~~-~~~~-I~i~~~---~~gc~-S~-vG~-- 86 (367)
.++.|.++-+ ++ + ...-..|...+.. |...--++|+.+.. ..++ |.+... ..-|. .. -|.
T Consensus 29 ~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~~~~Df~I~Lasp~T~~~lC~g~~~~~e~S 108 (203)
T PF11350_consen 29 RLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDSGAPDFRISLASPGTTDRLCAGLDTSGETS 108 (203)
T ss_pred ceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCCCCCCEEEEECCcchhhhhccCcCcCceeE
Confidence 4688888522 22 1 3455556555544 88888899999983 3344 333322 34461 11 111
Q ss_pred --C--CCCccccc--ccCCCcc-------cccchhHHHHHhhhcccccccC
Q psy2949 87 --F--PIGTGIDI--FLGGRVC-------FLKGKIQHEILHSLGFWHEHTR 124 (367)
Q Consensus 87 --~--~g~q~i~~--~~l~~~c-------~~~g~i~HEigHALG~~HEh~r 124 (367)
. ++.-.||. +..+..- ...-+|-||.||+||..||-.-
T Consensus 109 C~~~~g~~VvIN~~RWv~Ga~~f~gdl~~YRqYvINHEVGH~LGh~H~~Cp 159 (203)
T PF11350_consen 109 CRNPAGGRVVINLARWVRGAPAFGGDLASYRQYVINHEVGHALGHGHEPCP 159 (203)
T ss_pred eecCCCCeEEEehHHhhccCcccCCcHHHHHHHhhhhhhhhhcccCCCcCC
Confidence 1 23444551 1111111 1125899999999999887653
No 51
>PF10462 Peptidase_M66: Peptidase M66; InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=89.03 E-value=0.17 Score=48.87 Aligned_cols=16 Identities=31% Similarity=0.575 Sum_probs=12.8
Q ss_pred chhHHHHHhhhccccc
Q psy2949 106 GKIQHEILHSLGFWHE 121 (367)
Q Consensus 106 g~i~HEigHALG~~HE 121 (367)
++.+||+||++||.|-
T Consensus 195 ~~f~HE~GH~~GL~H~ 210 (305)
T PF10462_consen 195 NEFSHELGHNFGLGHY 210 (305)
T ss_dssp HHHHHHHHHTTT--SS
T ss_pred ceeehhhhhhcCCCCC
Confidence 7899999999999993
No 52
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=86.81 E-value=0.58 Score=34.09 Aligned_cols=29 Identities=31% Similarity=0.511 Sum_probs=25.7
Q ss_pred cchhHHHHHhhhcccccccCCCCCcceEE
Q psy2949 105 KGKIQHEILHSLGFWHEHTRPDRDQFVRV 133 (367)
Q Consensus 105 ~g~i~HEigHALG~~HEh~rpDRD~yv~I 133 (367)
...++|++.+.+||.|+..-...+.+|.|
T Consensus 30 eR~~vH~lA~~~gL~s~S~G~g~~R~v~v 58 (60)
T cd02641 30 DRLLVHELAEELGLRHESTGEGSDRVITV 58 (60)
T ss_pred HHHHHHHHHHHcCCceEeeCCCCceEEEe
Confidence 37899999999999999988888888776
No 53
>PTZ00337 surface protease GP63; Provisional
Probab=86.06 E-value=0.25 Score=51.78 Aligned_cols=28 Identities=25% Similarity=0.081 Sum_probs=19.1
Q ss_pred cccccCCCcccccchhHHHHHhhhcccc
Q psy2949 93 IDIFLGGRVCFLKGKIQHEILHSLGFWH 120 (367)
Q Consensus 93 i~~~~l~~~c~~~g~i~HEigHALG~~H 120 (367)
++|..+...-....+++|||.|||||-.
T Consensus 218 ~np~~i~~~~~~~~v~~HEi~HALGFs~ 245 (567)
T PTZ00337 218 FDPRQIAVTNGDVRVAAHELGHALGFVR 245 (567)
T ss_pred ECHHHccchhHHHHHHHHHHHHHHccCH
Confidence 3343343333345899999999999944
No 54
>PF12044 Metallopep: Putative peptidase family; InterPro: IPR021917 This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length.
Probab=85.62 E-value=0.34 Score=48.95 Aligned_cols=18 Identities=39% Similarity=0.549 Sum_probs=16.3
Q ss_pred chhHHHHHhhhccccccc
Q psy2949 106 GKIQHEILHSLGFWHEHT 123 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~ 123 (367)
|.++||+||+||+.|.-+
T Consensus 318 GA~lHEiGH~fg~pH~~~ 335 (423)
T PF12044_consen 318 GAFLHEIGHLFGCPHQED 335 (423)
T ss_pred HHHHHHHHHhcCCCCCCC
Confidence 789999999999999764
No 55
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=84.22 E-value=0.46 Score=49.55 Aligned_cols=17 Identities=41% Similarity=0.604 Sum_probs=14.0
Q ss_pred cchhHHHHHhhhccccc
Q psy2949 105 KGKIQHEILHSLGFWHE 121 (367)
Q Consensus 105 ~g~i~HEigHALG~~HE 121 (367)
..+++||++|||||--.
T Consensus 211 ~~~~~HEi~HaLGFs~~ 227 (521)
T PF01457_consen 211 FRTVIHEIAHALGFSSS 227 (521)
T ss_dssp HHHHHHHHHHHTT-SHH
T ss_pred cceeeeeeeeeeeeccc
Confidence 37999999999999764
No 56
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=82.47 E-value=1.8 Score=43.35 Aligned_cols=20 Identities=15% Similarity=-0.069 Sum_probs=14.6
Q ss_pred CCCcccccchhHHHHHhhhc
Q psy2949 98 GGRVCFLKGKIQHEILHSLG 117 (367)
Q Consensus 98 l~~~c~~~g~i~HEigHALG 117 (367)
+.......-|++||+|||+=
T Consensus 216 ~~~~~~~v~tl~HE~GHa~h 235 (427)
T cd06459 216 FNGTLDDVFTLAHELGHAFH 235 (427)
T ss_pred CCCChhhHHHHHHHhhHHHH
Confidence 34445556899999999974
No 57
>KOG4525|consensus
Probab=81.89 E-value=0.57 Score=46.93 Aligned_cols=18 Identities=39% Similarity=0.615 Sum_probs=16.2
Q ss_pred chhHHHHHhhhccccccc
Q psy2949 106 GKIQHEILHSLGFWHEHT 123 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~ 123 (367)
|.++|||||.||+.|.-.
T Consensus 303 GA~~HElGH~lgcpHq~~ 320 (614)
T KOG4525|consen 303 GAVCHELGHCLGCPHQSE 320 (614)
T ss_pred HHHHHHhhhccCCCCCCC
Confidence 789999999999999753
No 58
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=81.84 E-value=1.9 Score=45.66 Aligned_cols=74 Identities=11% Similarity=0.077 Sum_probs=35.7
Q ss_pred HHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEecC----CCC-ccCCCCCCCCCcccccccCCCcccccchhHHHHHh
Q psy2949 40 IYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFRNT----GFG-CASPVGYFPIGTGIDIFLGGRVCFLKGKIQHEILH 114 (367)
Q Consensus 40 ~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~~~----~~g-c~S~vG~~~g~q~i~~~~l~~~c~~~g~i~HEigH 114 (367)
++.++.|.+|++.+... ...++...-...||-+..+ +++ |++..+ ..+--.+| +...-...-|++|||||
T Consensus 314 ~~a~~~v~~~~~~~g~~-~~~~~~~~~~~~wiD~~~r~gK~~Ga~~~~~~~-~~p~il~N---~~~~~~dv~TLaHElGH 388 (591)
T TIGR00181 314 EEAKELILKSLEPLGEE-YIKILKRAFNERWVDYAENKGKRSGAYSIGGYK-VKPYILMN---WDGTLNSVFTLAHELGH 388 (591)
T ss_pred HHHHHHHHHHHhccCHH-HHHHHHHHhhCCCeeecCCCCCCCCcccCCCCC-CCCeEEEe---cCCCcchHHHHHHHhhh
Confidence 34566677787543221 1111111112246665432 222 444433 22222233 44444456899999999
Q ss_pred hhcc
Q psy2949 115 SLGF 118 (367)
Q Consensus 115 ALG~ 118 (367)
|+=.
T Consensus 389 a~H~ 392 (591)
T TIGR00181 389 SMHS 392 (591)
T ss_pred HHHH
Confidence 9743
No 59
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=80.96 E-value=1.1 Score=43.65 Aligned_cols=75 Identities=17% Similarity=0.020 Sum_probs=41.2
Q ss_pred HHHHHHHHHHHHHHHHc-CCcEEEEcCCCCceEEEecCCCCccCCCCCCC------CCcccccccCCCcccccchhHHHH
Q psy2949 40 IYEKTLVENAIQDLRMH-TCVRFVPRTNQDTYLRFRNTGFGCASPVGYFP------IGTGIDIFLGGRVCFLKGKIQHEI 112 (367)
Q Consensus 40 ~~~~~~I~~A~~~w~~~-tci~F~~~~~~~~~I~i~~~~~gc~S~vG~~~------g~q~i~~~~l~~~c~~~g~i~HEi 112 (367)
.+..+.+++|++.+.+. ..+..... ...-+.+ +.-.|++|..| ++-.+|. .-.......|++||+
T Consensus 133 ~~i~~~~~~~y~~l~~~~p~l~~~~~--~~K~~l~----S~~~s~~g~~Gyy~PFT~EA~vN~--~~p~~~~P~T~~HEl 204 (318)
T PF12725_consen 133 EEIFEEAREGYENLAERYPFLSGYYP--SPKPSLF----SRLMSYMGISGYYNPFTGEANVNT--DLPPYSLPFTICHEL 204 (318)
T ss_pred HHHHHHHHHHHHHHHHhCCccCCCCC--CCcHhhc----cHHHHHccCceEEcCCcceeecCC--CCCcccccHHHHHHH
Confidence 56677788888888763 33322211 1111111 22234444432 3333441 112334458999999
Q ss_pred Hhhhcccccc
Q psy2949 113 LHSLGFWHEH 122 (367)
Q Consensus 113 gHALG~~HEh 122 (367)
+|-+||..|.
T Consensus 205 AHq~G~a~E~ 214 (318)
T PF12725_consen 205 AHQLGFASED 214 (318)
T ss_pred HHHhCCCCHH
Confidence 9999999996
No 60
>PF05547 Peptidase_M6: Immune inhibitor A peptidase M6; InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=80.07 E-value=0.37 Score=51.29 Aligned_cols=16 Identities=38% Similarity=0.407 Sum_probs=14.2
Q ss_pred ccchhHHHHHhhhccc
Q psy2949 104 LKGKIQHEILHSLGFW 119 (367)
Q Consensus 104 ~~g~i~HEigHALG~~ 119 (367)
..|+++|||||.|||.
T Consensus 221 giGVfaHEfGH~LGLP 236 (645)
T PF05547_consen 221 GIGVFAHEFGHDLGLP 236 (645)
T ss_pred ceEEEEeeccccCCCC
Confidence 4699999999999984
No 61
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=79.85 E-value=2.6 Score=44.56 Aligned_cols=75 Identities=13% Similarity=0.037 Sum_probs=37.6
Q ss_pred HHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEec----C-CCCccCCCCCCCCCcccccccCCCcccccchhHHHHHh
Q psy2949 40 IYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFRN----T-GFGCASPVGYFPIGTGIDIFLGGRVCFLKGKIQHEILH 114 (367)
Q Consensus 40 ~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~~----~-~~gc~S~vG~~~g~q~i~~~~l~~~c~~~g~i~HEigH 114 (367)
++.++.|.+|++.|...-.- |..+.-...+|-+.. . +.-|++..+.....-.+| +.......-|++||+||
T Consensus 310 e~a~~~v~~~~~~l~~e~~~-~~~~~~~~~~iD~~~r~gK~~Ga~~~~~~~~~~p~i~~N---~~~~~~~v~TL~HE~GH 385 (587)
T TIGR02290 310 DEAKELVLEAFGKFSPEMAD-FAEKAFEEGWIDAEPRPGKRGGAFCTGFPPSKEPRVLMN---YDGSRRDVSTLAHELGH 385 (587)
T ss_pred HHHHHHHHHHHHhcCHHHHH-HHHHHHHcCCcccCCCCCCCCCcccCCCCCCCCCEEEEe---cCCCchhHHHHHHHhhH
Confidence 35678888888776543111 111111233444422 1 222444343222222233 34444456899999999
Q ss_pred hhcc
Q psy2949 115 SLGF 118 (367)
Q Consensus 115 ALG~ 118 (367)
|+=.
T Consensus 386 a~H~ 389 (587)
T TIGR02290 386 AYHS 389 (587)
T ss_pred HHHH
Confidence 9943
No 62
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=79.07 E-value=0.91 Score=47.91 Aligned_cols=17 Identities=35% Similarity=0.661 Sum_probs=14.5
Q ss_pred ccchhHHHHHhhhcccc
Q psy2949 104 LKGKIQHEILHSLGFWH 120 (367)
Q Consensus 104 ~~g~i~HEigHALG~~H 120 (367)
...+++|||.|||||--
T Consensus 256 ~~rv~~HEi~HALGFS~ 272 (622)
T PTZ00257 256 TTRTVTHEVAHALGFSS 272 (622)
T ss_pred HHHHHHHHHHHHhcCCH
Confidence 35899999999999944
No 63
>PF02408 CUB_2: CUB-like domain; InterPro: IPR003366 This domain is found in a family of hypothetical Caenorhabditis elegans proteins. The aligned region has no known function nor do any of the proteins which possess it. However, this domain is related to the CUB domain (IPR000859 from INTERPRO). The aligned region is approximately 130 amino acids long and contains two conserved cysteine residues.
Probab=72.82 E-value=6.6 Score=32.43 Aligned_cols=29 Identities=14% Similarity=0.232 Sum_probs=22.3
Q ss_pred CCCCCCCCCCCCCcEEEEcccCC--Cccccc
Q psy2949 313 PIPDLGYLPTGSGWFYKIGGPSD--DRKIMD 341 (367)
Q Consensus 313 ~~~~~~~~~~~~~c~~~~~~~~~--~r~~~~ 341 (367)
...++-.+|++.+|.|.|.+|.| -++.|-
T Consensus 31 ~s~~~~~~p~n~~C~y~i~iP~G~~a~v~~~ 61 (120)
T PF02408_consen 31 GSTSPPQFPANQNCTYQINIPKGYYAKVTLS 61 (120)
T ss_pred CCCCccccCCCCceEEEEEcCCceEEEEEEE
Confidence 34466789999999999999998 444443
No 64
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=72.43 E-value=3.3 Score=30.16 Aligned_cols=28 Identities=14% Similarity=0.341 Sum_probs=25.0
Q ss_pred chhHHHHHhhhcccccccCCCCCcceEE
Q psy2949 106 GKIQHEILHSLGFWHEHTRPDRDQFVRV 133 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~rpDRD~yv~I 133 (367)
..++|++.+.+||.|+-.-...+.||.|
T Consensus 31 R~~vH~~a~~~gL~s~S~G~g~~R~v~v 58 (60)
T cd02640 31 RALIHQIAQKYGLKSRSYGSGNDRYLVI 58 (60)
T ss_pred HHHHHHHHHHcCCceeeEeCCCCeEEEE
Confidence 7899999999999999888888888776
No 65
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=72.31 E-value=2.3 Score=34.85 Aligned_cols=37 Identities=27% Similarity=0.471 Sum_probs=26.3
Q ss_pred CCCccCCCCCCCCCcccccccCCCcccccchhHHHHHhhhccc
Q psy2949 77 GFGCASPVGYFPIGTGIDIFLGGRVCFLKGKIQHEILHSLGFW 119 (367)
Q Consensus 77 ~~gc~S~vG~~~g~q~i~~~~l~~~c~~~g~i~HEigHALG~~ 119 (367)
++=|.|++|..- |.|..-.+|| ..|+.||+.|++.|.
T Consensus 58 ~~LCG~~~~~i~----IDP~~~~KGC--~~TL~HEL~H~WQ~R 94 (141)
T PHA02456 58 QDLCGQFVGWIE----IDPDYANKGC--RDTLAHELNHAWQFR 94 (141)
T ss_pred cchhhcceeEEE----ECCcccccch--HHHHHHHHHHHHhhh
Confidence 566888888752 3344446788 569999999998653
No 66
>cd06007 R3H_DEXH_helicase R3H domain of a group of proteins which also contain a DEXH-box helicase domain, and may function as ATP-dependent DNA or RNA helicases. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=71.82 E-value=3.7 Score=29.83 Aligned_cols=29 Identities=14% Similarity=0.268 Sum_probs=24.8
Q ss_pred cchhHHHHHhhhcccccccCCCCCcceEE
Q psy2949 105 KGKIQHEILHSLGFWHEHTRPDRDQFVRV 133 (367)
Q Consensus 105 ~g~i~HEigHALG~~HEh~rpDRD~yv~I 133 (367)
...++|++.+.+||.|+-.-...+.||.|
T Consensus 29 eR~~vH~~a~~~gL~s~S~G~g~~R~v~v 57 (59)
T cd06007 29 ERAVIHRLCRKLGLKSKSKGKGSNRRLSV 57 (59)
T ss_pred HHHHHHHHHHHcCCCceeecCCCCeEEEE
Confidence 37899999999999999877777777766
No 67
>KOG3607|consensus
Probab=71.37 E-value=1.9 Score=46.64 Aligned_cols=17 Identities=35% Similarity=0.628 Sum_probs=16.2
Q ss_pred chhHHHHHhhhcccccc
Q psy2949 106 GKIQHEILHSLGFWHEH 122 (367)
Q Consensus 106 g~i~HEigHALG~~HEh 122 (367)
.+++|||||.||+.|+-
T Consensus 325 ~v~AhelgH~lGm~hD~ 341 (716)
T KOG3607|consen 325 VVLAHELGHNLGMIHDE 341 (716)
T ss_pred HHHHHHHHhhcCccccc
Confidence 79999999999999987
No 68
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=70.51 E-value=2.1 Score=40.65 Aligned_cols=26 Identities=12% Similarity=0.077 Sum_probs=14.1
Q ss_pred CCeeEEEEcCCCCCHHHHHHHHHHHH
Q psy2949 26 DKTVYYNFEDSEFTIYEKTLVENAIQ 51 (367)
Q Consensus 26 ~~~VpY~~~~~~~~~~~~~~I~~A~~ 51 (367)
...|...|-..+|++.++..+++.++
T Consensus 80 ~~~i~ivilgDGYT~~E~~~F~~da~ 105 (264)
T PF09471_consen 80 GNRIDIVILGDGYTAAEMDKFEEDAR 105 (264)
T ss_dssp TTSEEEEEEEES--GGGHHHHHHHHH
T ss_pred cCCeEEEEEcCCCCHHHHHHHHHHHH
Confidence 34577777566787766655544333
No 69
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=66.42 E-value=2.8 Score=38.04 Aligned_cols=14 Identities=29% Similarity=0.309 Sum_probs=11.7
Q ss_pred chhHHHHHhhhccc
Q psy2949 106 GKIQHEILHSLGFW 119 (367)
Q Consensus 106 g~i~HEigHALG~~ 119 (367)
+++.||+|||+--.
T Consensus 24 ~t~~HE~gHal~a~ 37 (200)
T PF13398_consen 24 VTFVHELGHALAAL 37 (200)
T ss_pred HHHHHHHHHHHHHH
Confidence 79999999998533
No 70
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=65.22 E-value=3.7 Score=37.91 Aligned_cols=13 Identities=23% Similarity=0.342 Sum_probs=11.6
Q ss_pred cchhHHHHHhhhc
Q psy2949 105 KGKIQHEILHSLG 117 (367)
Q Consensus 105 ~g~i~HEigHALG 117 (367)
.++++||.|||+-
T Consensus 90 vaVAAHEvGHAiQ 102 (222)
T PF04298_consen 90 VAVAAHEVGHAIQ 102 (222)
T ss_pred HHHHHHHHhHHHh
Confidence 4899999999985
No 71
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=65.21 E-value=3 Score=38.33 Aligned_cols=16 Identities=31% Similarity=0.436 Sum_probs=12.5
Q ss_pred chhHHHHHhhhccccccc
Q psy2949 106 GKIQHEILHSLGFWHEHT 123 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~ 123 (367)
-|++|||||+| .|...
T Consensus 74 FtlAHELGH~l--lH~~~ 89 (213)
T COG2856 74 FTLAHELGHAL--LHTDL 89 (213)
T ss_pred HHHHHHHhHHH--hcccc
Confidence 69999999997 45443
No 72
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=62.92 E-value=4.7 Score=42.35 Aligned_cols=21 Identities=14% Similarity=0.093 Sum_probs=14.7
Q ss_pred CCcccccchhHHHHHhhhccc
Q psy2949 99 GRVCFLKGKIQHEILHSLGFW 119 (367)
Q Consensus 99 ~~~c~~~g~i~HEigHALG~~ 119 (367)
...-...-|++||+|||+=.+
T Consensus 332 ~~t~~dv~TL~HElGHa~H~~ 352 (549)
T TIGR02289 332 NGTSGDIDVLTHEAGHAFHVY 352 (549)
T ss_pred CCChhHHHHHHHHhhHHHHHH
Confidence 333344589999999998443
No 73
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=61.61 E-value=4.2 Score=32.39 Aligned_cols=12 Identities=33% Similarity=0.456 Sum_probs=10.3
Q ss_pred cchhHHHHHhhh
Q psy2949 105 KGKIQHEILHSL 116 (367)
Q Consensus 105 ~g~i~HEigHAL 116 (367)
.-+++|||||.+
T Consensus 43 ~f~laHELgH~~ 54 (122)
T PF06114_consen 43 RFTLAHELGHIL 54 (122)
T ss_dssp HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHH
Confidence 369999999976
No 74
>cd02646 R3H_G-patch R3H domain of a group of fungal and plant proteins with unknown function, who also contain a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the R3H domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=60.99 E-value=8.2 Score=27.67 Aligned_cols=28 Identities=18% Similarity=0.312 Sum_probs=24.9
Q ss_pred chhHHHHHhhhcccccccCCCCCcceEE
Q psy2949 106 GKIQHEILHSLGFWHEHTRPDRDQFVRV 133 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~rpDRD~yv~I 133 (367)
..++|+|.+++||.++.+-...+.||.|
T Consensus 29 R~~vH~lA~~~~L~S~S~G~g~~R~v~v 56 (58)
T cd02646 29 RKTIHKLANCYNLKSKSRGKGKKRFVTV 56 (58)
T ss_pred HHHHHHHHHHcCCcccccccCCceEEEE
Confidence 6899999999999999888888888776
No 75
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=60.88 E-value=4.1 Score=32.71 Aligned_cols=17 Identities=35% Similarity=0.597 Sum_probs=13.0
Q ss_pred chhHHHHHhhhcccccc
Q psy2949 106 GKIQHEILHSLGFWHEH 122 (367)
Q Consensus 106 g~i~HEigHALG~~HEh 122 (367)
-|++||++|-+|+--|.
T Consensus 75 ~tlvhEiah~fG~~~e~ 91 (97)
T PF06262_consen 75 DTLVHEIAHHFGISDED 91 (97)
T ss_dssp HHHHHHHHHHTT--HHH
T ss_pred HHHHHHHHHHcCCCHHH
Confidence 58999999999987654
No 76
>PF03272 Enhancin: Viral enhancin protein; InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=59.30 E-value=7.5 Score=42.60 Aligned_cols=124 Identities=14% Similarity=0.145 Sum_probs=65.3
Q ss_pred CCcCCCeeEEEEcCCC-----CCHHHHHHHH---------------HHHHHHHHcCCcEEEEcC------CCCceEEEec
Q psy2949 22 QLWPDKTVYYNFEDSE-----FTIYEKTLVE---------------NAIQDLRMHTCVRFVPRT------NQDTYLRFRN 75 (367)
Q Consensus 22 ~~Wp~~~VpY~~~~~~-----~~~~~~~~I~---------------~A~~~w~~~tci~F~~~~------~~~~~I~i~~ 75 (367)
..|.+..-+|++-+.+ .+...|..++ +.++.+...+.|.|.+.. ...-+++--.
T Consensus 128 ~~w~~~~~~fa~ve~~~i~lLVP~~Dk~~l~~~~~~~l~~L~~~Y~~i~~~Yd~l~Gl~~~~~~~~~~n~~~kYF~KAD~ 207 (775)
T PF03272_consen 128 SEWDDSDSSFALVELDYIQLLVPPADKPNLNNKDFKSLDELIDFYNDIFKFYDDLTGLSDDPSDPVDKNFNNKYFAKADK 207 (775)
T ss_pred hhhhhcCCceEEEECCEEEEEeCcchHHHHhhhcccCHHHHHHHHHHHHHHHHhhhCCCCCCCcccccccccceEEEecC
Confidence 4688777888874333 1445555555 556667788898888754 1222232211
Q ss_pred CCCCccCCCCCCCC---CcccccccCCCcccccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCC
Q psy2949 76 TGFGCASPVGYFPI---GTGIDIFLGGRVCFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTG 152 (367)
Q Consensus 76 ~~~gc~S~vG~~~g---~q~i~~~~l~~~c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~ 152 (367)
.|.| .+|.|..-- ...|.. -|... ..-..++|||||+--+.-.++.+ ++.=.|+||.+. ..++..++..
T Consensus 208 ~G~G-~AYY~~~w~a~ss~s~~~-~L~~~-~~nW~~LHEIgHgYd~~F~~n~~----~~~EVWnNI~~d-~yQ~~~~~~~ 279 (775)
T PF03272_consen 208 SGPG-AAYYGSNWTAQSSSSLSF-YLNPS-PTNWGALHEIGHGYDFGFTRNGT----YLNEVWNNILAD-RYQYTYMTQD 279 (775)
T ss_pred CCCC-CccccccceecCchhHHH-HhCCC-CCCchhhhhhhhhcceeEeeCCc----chhhhhhhhhhh-hhhhhhcChh
Confidence 2222 334444211 112220 01000 12256899999987666665544 244579999864 3455555433
Q ss_pred c
Q psy2949 153 S 153 (367)
Q Consensus 153 ~ 153 (367)
+
T Consensus 280 e 280 (775)
T PF03272_consen 280 E 280 (775)
T ss_pred h
Confidence 3
No 77
>cd02642 R3H_encore_like R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=59.06 E-value=9.5 Score=27.85 Aligned_cols=29 Identities=17% Similarity=0.189 Sum_probs=25.4
Q ss_pred cchhHHHHHhhhcccccccCCCCCcceEEe
Q psy2949 105 KGKIQHEILHSLGFWHEHTRPDRDQFVRVL 134 (367)
Q Consensus 105 ~g~i~HEigHALG~~HEh~rpDRD~yv~I~ 134 (367)
..-++|++....||.||.+... +.+|.|.
T Consensus 33 ~RllvH~la~~~gL~s~s~~~~-~r~vvv~ 61 (63)
T cd02642 33 YRLLAHRVAQYYGLDHNVDNSG-GKCVIVN 61 (63)
T ss_pred HHHHHHHHHHHhCCeeEeecCC-ceEEEEE
Confidence 3789999999999999999987 7777764
No 78
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=58.87 E-value=27 Score=31.18 Aligned_cols=17 Identities=24% Similarity=0.128 Sum_probs=14.3
Q ss_pred cchhHHHHHhhhccccc
Q psy2949 105 KGKIQHEILHSLGFWHE 121 (367)
Q Consensus 105 ~g~i~HEigHALG~~HE 121 (367)
.-+|+||+.|.+-..|.
T Consensus 165 dYVvvHEL~Hl~~~nHs 181 (205)
T PF01863_consen 165 DYVVVHELCHLRHPNHS 181 (205)
T ss_pred HHHHHHHHHHhccCCCC
Confidence 37999999999977765
No 79
>PF11667 DUF3267: Protein of unknown function (DUF3267); InterPro: IPR021683 This family of proteins has no known function.
Probab=57.67 E-value=6.6 Score=31.90 Aligned_cols=20 Identities=30% Similarity=0.275 Sum_probs=18.1
Q ss_pred chhHHHHHhhhcccccccCC
Q psy2949 106 GKIQHEILHSLGFWHEHTRP 125 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~rp 125 (367)
..++||+.|+|.+...-.++
T Consensus 6 ~~~~HEliH~l~~~~~~~~~ 25 (111)
T PF11667_consen 6 LIPLHELIHGLFFKLFGKKP 25 (111)
T ss_pred eHHHHHHHHHHHHHHhCCCC
Confidence 57899999999999998877
No 80
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=57.40 E-value=3.4 Score=37.90 Aligned_cols=40 Identities=35% Similarity=0.566 Sum_probs=26.9
Q ss_pred CcccccccC--CCCCCcccccccccccCCCceeeeeCCCCCccccccccCCCHHHHHHHhhhc
Q psy2949 232 GSVRTFSMP--YDYGSIMHYSGIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLY 292 (367)
Q Consensus 232 ~~~~~~~~p--YDy~SiMhY~~~~fs~~~~~~ti~~~~~~~~~~~gqr~~~s~~D~~~in~~Y 292 (367)
+.+.-.|+| +|-+|||.- .|+. +....|+..|+..++.||
T Consensus 170 GAi~IpGTPt~~d~~SiM~a---c~~~------------------~~~~~f~~~Di~Al~~lY 211 (211)
T PF12388_consen 170 GAIHIPGTPTGADPNSIMNA---CFSS------------------GEDGEFTSNDITALNYLY 211 (211)
T ss_pred ceEECCCCCCCCCCchhhhc---cccC------------------CCCCCcChhhHHHHHhhC
Confidence 334455555 599999953 2221 223469999999999998
No 81
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=56.45 E-value=5.5 Score=36.81 Aligned_cols=11 Identities=36% Similarity=0.458 Sum_probs=10.0
Q ss_pred chhHHHHHhhh
Q psy2949 106 GKIQHEILHSL 116 (367)
Q Consensus 106 g~i~HEigHAL 116 (367)
.|+.||+||||
T Consensus 94 ~~l~HE~GHAl 104 (220)
T PF14247_consen 94 FTLYHELGHAL 104 (220)
T ss_pred HHHHHHHHHHH
Confidence 68999999997
No 82
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=55.89 E-value=4.9 Score=33.41 Aligned_cols=16 Identities=31% Similarity=0.484 Sum_probs=13.6
Q ss_pred chhHHHHHhhhccccc
Q psy2949 106 GKIQHEILHSLGFWHE 121 (367)
Q Consensus 106 g~i~HEigHALG~~HE 121 (367)
-+++|||||-+|+--+
T Consensus 111 hvliHEIgHhFGLsDd 126 (136)
T COG3824 111 HVLIHEIGHHFGLSDD 126 (136)
T ss_pred hhhhhhhhhhcCCChh
Confidence 5899999999998654
No 83
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=55.55 E-value=5.8 Score=35.24 Aligned_cols=21 Identities=19% Similarity=0.134 Sum_probs=15.3
Q ss_pred cchhHHHHHhhhcccccccCC
Q psy2949 105 KGKIQHEILHSLGFWHEHTRP 125 (367)
Q Consensus 105 ~g~i~HEigHALG~~HEh~rp 125 (367)
..+++||+||++-....-.+.
T Consensus 8 i~i~~HE~gH~~~a~~~G~~~ 28 (192)
T PF02163_consen 8 ISIVLHELGHALAARLYGDKV 28 (192)
T ss_dssp HHHHHHHHHHHHHHHTTT--B
T ss_pred ccccccccccccccccccccc
Confidence 368999999999877765544
No 84
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=55.15 E-value=2.3 Score=35.83 Aligned_cols=45 Identities=27% Similarity=0.425 Sum_probs=29.9
Q ss_pred CccCccccccHHHHHHHHHHhcCCCCcccCcccccccCCCCcccccccCCCCCC
Q psy2949 192 DTMGQRDAMSRVDLAKLNRLYKCPKNYYQGHEFNLERRPAGSVRTFSMPYDYGS 245 (367)
Q Consensus 192 ~~IG~r~~lS~~D~~~in~lY~C~~nI~~g~~~Nf~~~~~~~~~~~~~pYDy~S 245 (367)
|++|..+..++.|++.+..+. ..++ ..+|...+. +.+++||||+|
T Consensus 96 HaLGl~H~~~~~drd~~~~~~--~~~~----~~~~~~~~~---~~~~~~~~yg~ 140 (140)
T smart00235 96 HALGLYHEQSRSDRDNYMYIN--YTNI----TRNFDLSND---DSLGIPYDYGS 140 (140)
T ss_pred HHhcCCcCCCCCcccCeEEEe--hhhh----hhccccccc---cCCCchhccCc
Confidence 678888888999987544332 2333 345555432 57889999987
No 85
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=51.32 E-value=7 Score=39.95 Aligned_cols=15 Identities=47% Similarity=0.470 Sum_probs=12.7
Q ss_pred ccchhHHHHHhhhcc
Q psy2949 104 LKGKIQHEILHSLGF 118 (367)
Q Consensus 104 ~~g~i~HEigHALG~ 118 (367)
..||++|||||+.|-
T Consensus 130 lagViAHEigHv~qr 144 (484)
T COG4783 130 LAGVIAHEIGHVAQR 144 (484)
T ss_pred HHHHHHHHHHHHhhh
Confidence 349999999999873
No 86
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=51.04 E-value=7.4 Score=39.67 Aligned_cols=15 Identities=20% Similarity=0.202 Sum_probs=12.2
Q ss_pred cchhHHHHHhhhccc
Q psy2949 105 KGKIQHEILHSLGFW 119 (367)
Q Consensus 105 ~g~i~HEigHALG~~ 119 (367)
.-|+.|||||||=..
T Consensus 243 v~tLfHE~GHa~H~~ 257 (458)
T PF01432_consen 243 VETLFHEFGHAMHSL 257 (458)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHhHHHHHH
Confidence 479999999998544
No 87
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=50.92 E-value=7.1 Score=38.43 Aligned_cols=12 Identities=25% Similarity=0.382 Sum_probs=10.7
Q ss_pred cchhHHHHHhhh
Q psy2949 105 KGKIQHEILHSL 116 (367)
Q Consensus 105 ~g~i~HEigHAL 116 (367)
..|+.||+|||+
T Consensus 155 v~tl~HE~GHa~ 166 (365)
T cd06258 155 INTLFHEFGHAV 166 (365)
T ss_pred HHHHHHHHhHHH
Confidence 478999999998
No 88
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=49.31 E-value=8.3 Score=33.95 Aligned_cols=19 Identities=21% Similarity=0.136 Sum_probs=13.9
Q ss_pred chhHHHHHhhhcccccccC
Q psy2949 106 GKIQHEILHSLGFWHEHTR 124 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~r 124 (367)
.+++||+||++=-.+.-.+
T Consensus 10 ~i~iHE~gH~~~A~~~G~~ 28 (180)
T cd05709 10 SVTVHELGHALVARRLGVK 28 (180)
T ss_pred HHHHHHHHHHHHHHHcCCC
Confidence 6899999999955554433
No 89
>TIGR00043 metalloprotein, YbeY/UPF0054 family. This minimally protein family is represented by a single member sequence only in nearly every bacterium. Crystallography demonstrates metal-binding activity, possibly to nickel. It is a predicted to be a metallohydrolase, and somewhat more weakly to be a metalloproteinase.
Probab=49.29 E-value=9.7 Score=31.20 Aligned_cols=17 Identities=41% Similarity=0.591 Sum_probs=15.4
Q ss_pred chhHHHHHhhhcccccc
Q psy2949 106 GKIQHEILHSLGFWHEH 122 (367)
Q Consensus 106 g~i~HEigHALG~~HEh 122 (367)
-.++|=++|.||+-|+.
T Consensus 75 ~l~iHG~LHLlGyDH~~ 91 (110)
T TIGR00043 75 HLTVHGLLHLLGYDHET 91 (110)
T ss_pred HHHHHHHHHHcCCCCCC
Confidence 57999999999999985
No 90
>PF15639 Tox-MPTase3: Metallopeptidase toxin 3
Probab=47.98 E-value=25 Score=29.53 Aligned_cols=70 Identities=16% Similarity=0.170 Sum_probs=40.3
Q ss_pred HHHHHHHHHHHHHcCCcEEEEcC--CCCceEEEecCCCCccCCCCCCC---CCcccccc-----cC--CCc-----cc--
Q psy2949 43 KTLVENAIQDLRMHTCVRFVPRT--NQDTYLRFRNTGFGCASPVGYFP---IGTGIDIF-----LG--GRV-----CF-- 103 (367)
Q Consensus 43 ~~~I~~A~~~w~~~tci~F~~~~--~~~~~I~i~~~~~gc~S~vG~~~---g~q~i~~~-----~l--~~~-----c~-- 103 (367)
...|-+|+..|-..+-+...+.= .....|.|.....|| ..|+.. +.|.|.+. .+ +.+ -.
T Consensus 22 ~p~iv~A~~K~~~l~~~~l~~~l~WGkGP~IkI~~~~~~~--A~G~t~p~~~~~~i~I~~~lV~~~Ek~~~~~r~~~~~~ 99 (135)
T PF15639_consen 22 VPAIVKAYKKWGGLNKAKLQEALTWGKGPYIKITPLVGGT--AYGYTTPDDGSQVIRIDGDLVNMFEKGPGERRAGNVYL 99 (135)
T ss_pred ChHHHHHHHHHcccchhhhhhccccCCCCEEEEeecccce--eeeeecCCCCCcEEEeeHHHhhhhhcCCcccccceEEE
Confidence 46778888888665444333322 567789998776777 556532 24444310 01 111 11
Q ss_pred ccchhHHHHHh
Q psy2949 104 LKGKIQHEILH 114 (367)
Q Consensus 104 ~~g~i~HEigH 114 (367)
-..|++||+-|
T Consensus 100 v~~TlLHEliH 110 (135)
T PF15639_consen 100 VGSTLLHELIH 110 (135)
T ss_pred eehHHHHHHHH
Confidence 12699999998
No 91
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=47.68 E-value=9.6 Score=33.05 Aligned_cols=11 Identities=27% Similarity=0.625 Sum_probs=9.8
Q ss_pred chhHHHHHhhh
Q psy2949 106 GKIQHEILHSL 116 (367)
Q Consensus 106 g~i~HEigHAL 116 (367)
-+|+|||.|++
T Consensus 137 DVvaHEltHGV 147 (150)
T PF01447_consen 137 DVVAHELTHGV 147 (150)
T ss_dssp HHHHHHHHHHH
T ss_pred ceeeecccccc
Confidence 59999999985
No 92
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=47.03 E-value=8.2 Score=41.05 Aligned_cols=75 Identities=13% Similarity=-0.104 Sum_probs=37.2
Q ss_pred HHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEecC---CCC--ccCCCCCCCCCcccccccCCCcccccchhHHHHHh
Q psy2949 40 IYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFRNT---GFG--CASPVGYFPIGTGIDIFLGGRVCFLKGKIQHEILH 114 (367)
Q Consensus 40 ~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~~~---~~g--c~S~vG~~~g~q~i~~~~l~~~c~~~g~i~HEigH 114 (367)
++.++.|.++++.......=.|++. -...+|-+..+ ..| |.+.......---|| +...-..+-|++||+||
T Consensus 315 ~ea~~~v~~~l~~lg~ey~~~~~~a-~~~~WiD~~~~~gKrsGaYs~~~~~~~~p~IlmN---~~gt~~dV~TLaHElGH 390 (598)
T COG1164 315 EEAKELVLKALAPLGPEYAKIARRA-FDERWIDVYPRKGKRSGAYSIGFYKGDHPFILMN---YDGTLRDVFTLAHELGH 390 (598)
T ss_pred HHHHHHHHHHHHhhCHHHHHHHHHH-HhcCCeeccCCCCCCCCcccCCCCCCCCCeEEEe---CCCchhHHHHHHHHccH
Confidence 3566677777766644322212221 22356655322 233 333322222222333 33344456899999999
Q ss_pred hhcc
Q psy2949 115 SLGF 118 (367)
Q Consensus 115 ALG~ 118 (367)
++=.
T Consensus 391 s~Hs 394 (598)
T COG1164 391 SVHS 394 (598)
T ss_pred HHHH
Confidence 9853
No 93
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=46.47 E-value=9.4 Score=39.28 Aligned_cols=12 Identities=25% Similarity=0.288 Sum_probs=10.6
Q ss_pred chhHHHHHhhhc
Q psy2949 106 GKIQHEILHSLG 117 (367)
Q Consensus 106 g~i~HEigHALG 117 (367)
-|+.|||||||=
T Consensus 265 ~TLfHEfGHalH 276 (472)
T cd06455 265 ETFFHEFGHVIH 276 (472)
T ss_pred HHHHHHHHHHHH
Confidence 689999999983
No 94
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=44.45 E-value=11 Score=38.79 Aligned_cols=15 Identities=20% Similarity=0.224 Sum_probs=12.0
Q ss_pred chhHHHHHhhhcccc
Q psy2949 106 GKIQHEILHSLGFWH 120 (367)
Q Consensus 106 g~i~HEigHALG~~H 120 (367)
-|+.|||||||=..-
T Consensus 250 ~TLfHEfGHalH~~l 264 (458)
T cd06457 250 ETLFHEMGHAMHSML 264 (458)
T ss_pred HHHHHHHhHHHHHHH
Confidence 689999999985333
No 95
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=42.95 E-value=12 Score=33.44 Aligned_cols=19 Identities=21% Similarity=-0.092 Sum_probs=13.1
Q ss_pred chhHHHHHhhhcccccccC
Q psy2949 106 GKIQHEILHSLGFWHEHTR 124 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~r 124 (367)
.+++||+||.+=-.-..-+
T Consensus 11 ~v~iHElGH~~~Ar~~Gv~ 29 (182)
T cd06163 11 LIFVHELGHFLVAKLFGVK 29 (182)
T ss_pred HHHHHHHHHHHHHHHcCCe
Confidence 5799999998744444333
No 96
>PF02130 UPF0054: Uncharacterized protein family UPF0054; InterPro: IPR002036 These, as yet, uncharacterised proteins are of 17 to 21 kDa. They contain a conserved region with three histidines at the C terminus. The protein family is represented by a single member sequence only in nearly every bacterium. The crystal structure of the protein from the hyperthermophilic bacteria Aquifex aeolicus has been determined. The overall fold consists of one central alpha-helix surrounded by a four-stranded beta-sheet and four other alpha-helices. Structure-based homology analysis reveals a good resemblance to the metal-dependent proteinases such as collagenases and gelatinases. However, experimental tests for collagenase and gelatinase-type function show no detectable activity under standard assay conditions [].; GO: 0046872 metal ion binding; PDB: 1TVI_A 1OZ9_A 1XM5_A 1XAX_A.
Probab=42.80 E-value=13 Score=31.85 Aligned_cols=16 Identities=44% Similarity=0.713 Sum_probs=14.0
Q ss_pred chhHHHHHhhhccccc
Q psy2949 106 GKIQHEILHSLGFWHE 121 (367)
Q Consensus 106 g~i~HEigHALG~~HE 121 (367)
-.++|=++|.||+-|+
T Consensus 110 ~l~vHG~LHLlGyDH~ 125 (145)
T PF02130_consen 110 RLLVHGLLHLLGYDHE 125 (145)
T ss_dssp HHHHHHHHHHTT-SST
T ss_pred HHHHHHHHHHcCCCCC
Confidence 5799999999999999
No 97
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=42.19 E-value=12 Score=33.90 Aligned_cols=17 Identities=18% Similarity=0.145 Sum_probs=13.1
Q ss_pred cccccchhHHHHHhhhc
Q psy2949 101 VCFLKGKIQHEILHSLG 117 (367)
Q Consensus 101 ~c~~~g~i~HEigHALG 117 (367)
.....++++||.|||+.
T Consensus 89 Sia~~aVAAHEVGHAiQ 105 (226)
T COG2738 89 SIAAIAVAAHEVGHAIQ 105 (226)
T ss_pred cHHHHHHHHHHhhHHHh
Confidence 33345899999999985
No 98
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=41.02 E-value=13 Score=35.58 Aligned_cols=17 Identities=24% Similarity=0.350 Sum_probs=14.1
Q ss_pred chhHHHHHhhhcccccc
Q psy2949 106 GKIQHEILHSLGFWHEH 122 (367)
Q Consensus 106 g~i~HEigHALG~~HEh 122 (367)
.+++||++||+=-..|.
T Consensus 137 svvvHElgHal~A~~~g 153 (277)
T cd06162 137 SGVVHEMGHGVAAVREQ 153 (277)
T ss_pred HHHHHHHHHHHHHHHcC
Confidence 78999999998766664
No 99
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=40.64 E-value=14 Score=29.47 Aligned_cols=11 Identities=27% Similarity=0.504 Sum_probs=10.2
Q ss_pred chhHHHHHhhh
Q psy2949 106 GKIQHEILHSL 116 (367)
Q Consensus 106 g~i~HEigHAL 116 (367)
.+++||+.|++
T Consensus 27 ~~l~HE~~H~~ 37 (128)
T PF13485_consen 27 RVLAHELAHQW 37 (128)
T ss_pred HHHHHHHHHHH
Confidence 89999999986
No 100
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=39.75 E-value=14 Score=37.54 Aligned_cols=13 Identities=31% Similarity=0.345 Sum_probs=11.0
Q ss_pred cchhHHHHHhhhc
Q psy2949 105 KGKIQHEILHSLG 117 (367)
Q Consensus 105 ~g~i~HEigHALG 117 (367)
.-|+.|||||||=
T Consensus 209 v~tLfHEfGHalH 221 (422)
T cd06456 209 VTTLFHEFGHALH 221 (422)
T ss_pred HHHHHHHHHHHHH
Confidence 3689999999984
No 101
>COG0319 Predicted metal-dependent hydrolase [General function prediction only]
Probab=39.39 E-value=16 Score=31.80 Aligned_cols=18 Identities=44% Similarity=0.586 Sum_probs=16.2
Q ss_pred chhHHHHHhhhccccccc
Q psy2949 106 GKIQHEILHSLGFWHEHT 123 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~ 123 (367)
-.++|=++|.||+-|+..
T Consensus 110 ~l~vHG~LHLlGYDH~~~ 127 (153)
T COG0319 110 HLTIHGILHLLGYDHEED 127 (153)
T ss_pred HHHHHHHHHHccCCCCCc
Confidence 579999999999999975
No 102
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=39.36 E-value=14 Score=35.02 Aligned_cols=21 Identities=24% Similarity=0.298 Sum_probs=16.8
Q ss_pred chhHHHHHhhhcccccccCCC
Q psy2949 106 GKIQHEILHSLGFWHEHTRPD 126 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~rpD 126 (367)
.+++||+||++=-.++..+.+
T Consensus 120 sv~iHElgHa~~Ar~~G~~V~ 140 (263)
T cd06159 120 GVVVHELSHGILARVEGIKVK 140 (263)
T ss_pred HHHHHHHHHHHHHHHcCCEEC
Confidence 689999999998777765543
No 103
>KOG3658|consensus
Probab=39.27 E-value=10 Score=40.29 Aligned_cols=16 Identities=38% Similarity=0.499 Sum_probs=14.5
Q ss_pred chhHHHHHhhhccccc
Q psy2949 106 GKIQHEILHSLGFWHE 121 (367)
Q Consensus 106 g~i~HEigHALG~~HE 121 (367)
-|++|||||.+|=.|.
T Consensus 394 lt~AHEiGHNfGSpHD 409 (764)
T KOG3658|consen 394 LTLAHEIGHNFGSPHD 409 (764)
T ss_pred eeehhhhccccCCCCC
Confidence 4899999999999886
No 104
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=39.08 E-value=15 Score=33.03 Aligned_cols=13 Identities=23% Similarity=0.250 Sum_probs=11.0
Q ss_pred cchhHHHHHhhhc
Q psy2949 105 KGKIQHEILHSLG 117 (367)
Q Consensus 105 ~g~i~HEigHALG 117 (367)
.++++||+||...
T Consensus 90 ~aVlaHElgH~~~ 102 (226)
T PF01435_consen 90 AAVLAHELGHIKH 102 (226)
T ss_dssp HHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHc
Confidence 3899999999763
No 105
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=38.88 E-value=15 Score=32.74 Aligned_cols=12 Identities=25% Similarity=0.296 Sum_probs=10.6
Q ss_pred cchhHHHHHhhh
Q psy2949 105 KGKIQHEILHSL 116 (367)
Q Consensus 105 ~g~i~HEigHAL 116 (367)
.+.++||++||+
T Consensus 10 ~~i~~HE~aHa~ 21 (181)
T cd06158 10 LAITLHEFAHAY 21 (181)
T ss_pred HHHHHHHHHHHH
Confidence 378999999998
No 106
>PRK02870 heat shock protein HtpX; Provisional
Probab=38.76 E-value=12 Score=36.89 Aligned_cols=15 Identities=27% Similarity=0.470 Sum_probs=12.1
Q ss_pred cchhHHHHHhhhcccccc
Q psy2949 105 KGKIQHEILHSLGFWHEH 122 (367)
Q Consensus 105 ~g~i~HEigHALG~~HEh 122 (367)
.|+++||+||. .|.|
T Consensus 174 ~aVlAHELgHi---k~~d 188 (336)
T PRK02870 174 QAVMAHELSHI---RHGD 188 (336)
T ss_pred HHHHHHHHHHH---Hccc
Confidence 49999999997 5544
No 107
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=38.05 E-value=16 Score=33.25 Aligned_cols=21 Identities=24% Similarity=0.166 Sum_probs=16.8
Q ss_pred chhHHHHHhhhcccccccCCC
Q psy2949 106 GKIQHEILHSLGFWHEHTRPD 126 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~rpD 126 (367)
.+++||+||++=..+...+..
T Consensus 40 ~v~iHElgH~~~A~~~G~~v~ 60 (208)
T cd06161 40 SVLLHELGHALVARRYGIRVR 60 (208)
T ss_pred HHHHHHHHHHHHHHHcCCCcc
Confidence 689999999997777766654
No 108
>PRK00016 metal-binding heat shock protein; Provisional
Probab=37.64 E-value=17 Score=31.75 Aligned_cols=17 Identities=41% Similarity=0.505 Sum_probs=15.3
Q ss_pred chhHHHHHhhhcccccc
Q psy2949 106 GKIQHEILHSLGFWHEH 122 (367)
Q Consensus 106 g~i~HEigHALG~~HEh 122 (367)
-.++|=++|.||+-|+.
T Consensus 115 ~l~iHG~LHLlGYDH~~ 131 (159)
T PRK00016 115 HLTVHGILHLLGYDHIE 131 (159)
T ss_pred HHHHHhhHHhcCCCCCC
Confidence 57999999999999974
No 109
>PRK13963 unkown domain/putative metalloprotease fusion protein; Provisional
Probab=35.64 E-value=20 Score=33.81 Aligned_cols=16 Identities=38% Similarity=0.580 Sum_probs=15.0
Q ss_pred chhHHHHHhhhccccc
Q psy2949 106 GKIQHEILHSLGFWHE 121 (367)
Q Consensus 106 g~i~HEigHALG~~HE 121 (367)
-.++|=++|.|||-||
T Consensus 217 ~LlIHGlLHLLGYDHe 232 (258)
T PRK13963 217 HLLVHGALHAQGYDHE 232 (258)
T ss_pred HHHHHHHHHHcCCCCC
Confidence 5799999999999999
No 110
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=35.35 E-value=18 Score=36.56 Aligned_cols=17 Identities=35% Similarity=0.296 Sum_probs=13.8
Q ss_pred chhHHHHHhhhcccccccCC
Q psy2949 106 GKIQHEILHSLGFWHEHTRP 125 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~rp 125 (367)
-+++||+|||| |||+-|
T Consensus 161 ~t~iHE~GHal---ye~~l~ 177 (396)
T cd06460 161 FSTIHETGHAL---YEQGLP 177 (396)
T ss_pred HHHHHHhhHHH---HHhcCC
Confidence 58999999997 777554
No 111
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=35.01 E-value=20 Score=30.70 Aligned_cols=15 Identities=20% Similarity=0.445 Sum_probs=12.2
Q ss_pred cchhHHHHHhhhccc
Q psy2949 105 KGKIQHEILHSLGFW 119 (367)
Q Consensus 105 ~g~i~HEigHALG~~ 119 (367)
..||+||+.|++=+.
T Consensus 60 ~~~l~HEm~H~~~~~ 74 (146)
T smart00731 60 RETLLHELCHAALYL 74 (146)
T ss_pred HhhHHHHHHHHHHHH
Confidence 379999999988553
No 112
>KOG2921|consensus
Probab=34.92 E-value=18 Score=36.22 Aligned_cols=33 Identities=24% Similarity=0.258 Sum_probs=21.4
Q ss_pred chhHHHHHhhhcccccccCCCCCcceEEeeCCcCCC
Q psy2949 106 GKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPG 141 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~ 141 (367)
..++||+||||--.-|+-+- +++-|..-.|.|+
T Consensus 133 ~~vvHElGHalAA~segV~v---ngfgIfi~aiyPg 165 (484)
T KOG2921|consen 133 TVVVHELGHALAAASEGVQV---NGFGIFIAAIYPG 165 (484)
T ss_pred HHHHHHhhHHHHHHhcCcee---eeeEEEEEEEcCc
Confidence 57899999999877777543 3344444444443
No 113
>cd06006 R3H_unknown_2 R3H domain of a group of fungal proteins with unknown function. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=34.72 E-value=41 Score=24.37 Aligned_cols=29 Identities=24% Similarity=0.453 Sum_probs=25.7
Q ss_pred cchhHHHHHhhhcccccccCCCCCcceEE
Q psy2949 105 KGKIQHEILHSLGFWHEHTRPDRDQFVRV 133 (367)
Q Consensus 105 ~g~i~HEigHALG~~HEh~rpDRD~yv~I 133 (367)
...++||+..+.||.=|..-+.-+.+|-|
T Consensus 29 ~R~~vHdla~~~gl~SeS~d~Ep~R~V~v 57 (59)
T cd06006 29 QRAFIHELAKDYGLYSESQDPEPKRSVFV 57 (59)
T ss_pred HHHHHHHHHHHcCCeeEecCCCCCcEEEE
Confidence 37899999999999999999888888766
No 114
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=34.71 E-value=20 Score=30.66 Aligned_cols=16 Identities=19% Similarity=0.397 Sum_probs=13.3
Q ss_pred cchhHHHHHhhhcccc
Q psy2949 105 KGKIQHEILHSLGFWH 120 (367)
Q Consensus 105 ~g~i~HEigHALG~~H 120 (367)
..|++||+.||+=+..
T Consensus 61 ~~tL~HEm~H~~~~~~ 76 (157)
T PF10263_consen 61 IDTLLHEMAHAAAYVF 76 (157)
T ss_pred HHHHHHHHHHHHhhhc
Confidence 4899999999986555
No 115
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=34.45 E-value=20 Score=32.80 Aligned_cols=14 Identities=21% Similarity=0.340 Sum_probs=11.4
Q ss_pred chhHHHHHhhhccc
Q psy2949 106 GKIQHEILHSLGFW 119 (367)
Q Consensus 106 g~i~HEigHALG~~ 119 (367)
.+++||+|||+=..
T Consensus 95 siLAHE~mHa~Lrl 108 (212)
T PF12315_consen 95 SILAHELMHAWLRL 108 (212)
T ss_pred hHHHHHHHHHHhcc
Confidence 58899999998544
No 116
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=33.81 E-value=21 Score=33.12 Aligned_cols=21 Identities=24% Similarity=0.094 Sum_probs=16.3
Q ss_pred chhHHHHHhhhcccccccCCC
Q psy2949 106 GKIQHEILHSLGFWHEHTRPD 126 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~rpD 126 (367)
.+++||++|++=..+...+.+
T Consensus 55 ~v~iHElgH~~~A~~~G~~v~ 75 (227)
T cd06164 55 SVLLHELGHSLVARRYGIPVR 75 (227)
T ss_pred HHHHHHHHHHHHHHHcCCeEC
Confidence 688999999987666666554
No 117
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=32.30 E-value=24 Score=27.03 Aligned_cols=14 Identities=21% Similarity=0.389 Sum_probs=11.6
Q ss_pred ccchhHHHHHhhhc
Q psy2949 104 LKGKIQHEILHSLG 117 (367)
Q Consensus 104 ~~g~i~HEigHALG 117 (367)
....++||+.|++-
T Consensus 61 ~~~llaHEl~Hv~Q 74 (79)
T PF13699_consen 61 GRALLAHELAHVVQ 74 (79)
T ss_pred cchhHhHHHHHHHh
Confidence 35789999999973
No 118
>PRK10911 oligopeptidase A; Provisional
Probab=32.26 E-value=21 Score=38.67 Aligned_cols=11 Identities=36% Similarity=0.377 Sum_probs=10.2
Q ss_pred chhHHHHHhhh
Q psy2949 106 GKIQHEILHSL 116 (367)
Q Consensus 106 g~i~HEigHAL 116 (367)
-|+.|||||||
T Consensus 465 ~tlfHEfGHal 475 (680)
T PRK10911 465 ITLFHEFGHGL 475 (680)
T ss_pred HHHHHHHhHHH
Confidence 68999999997
No 119
>PF14521 Aspzincin_M35: Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=31.97 E-value=20 Score=30.77 Aligned_cols=14 Identities=36% Similarity=0.665 Sum_probs=11.3
Q ss_pred ccchhHHHHHh--hhc
Q psy2949 104 LKGKIQHEILH--SLG 117 (367)
Q Consensus 104 ~~g~i~HEigH--ALG 117 (367)
..+|++||+-| ++|
T Consensus 96 k~~TLiHE~SHf~~~~ 111 (148)
T PF14521_consen 96 KEGTLIHEWSHFTAVG 111 (148)
T ss_dssp HHHHHHHHHHHSCCCT
T ss_pred hHHHHHHhhhhhhhhc
Confidence 35999999999 644
No 120
>PRK03982 heat shock protein HtpX; Provisional
Probab=31.74 E-value=22 Score=34.03 Aligned_cols=12 Identities=42% Similarity=0.642 Sum_probs=10.2
Q ss_pred cchhHHHHHhhh
Q psy2949 105 KGKIQHEILHSL 116 (367)
Q Consensus 105 ~g~i~HEigHAL 116 (367)
.++++||+||.-
T Consensus 126 ~AVlAHElgHi~ 137 (288)
T PRK03982 126 EGVIAHELTHIK 137 (288)
T ss_pred HHHHHHHHHHHH
Confidence 499999999953
No 121
>PF02102 Peptidase_M35: Deuterolysin metalloprotease (M35) family; InterPro: IPR001384 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M35 (deuterolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Deuterolysin is a microbial zinc-containing metalloprotease that shows some similarity to thermolysin []. The protein is expressed with a possible 19-residue signal sequence, a 155-residue propeptide, and an active peptide of 177 residues []. The latter contains an HEXXH motif towards the C terminus, but the other zinc ligands are as yet undetermined [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 1EB6_A.
Probab=30.28 E-value=29 Score=34.56 Aligned_cols=21 Identities=19% Similarity=0.303 Sum_probs=13.6
Q ss_pred CCCccccc---chhHHHHHhhhcc
Q psy2949 98 GGRVCFLK---GKIQHEILHSLGF 118 (367)
Q Consensus 98 l~~~c~~~---g~i~HEigHALG~ 118 (367)
+...|... ++++|||-|+.+.
T Consensus 288 ~~~~C~~qDqatt~LHE~TH~~~V 311 (359)
T PF02102_consen 288 LSNRCHAQDQATTTLHEMTHAPAV 311 (359)
T ss_dssp S--STT---HHHHHHHHHHT-TTT
T ss_pred ccccccCCCccchhhhhhhccccc
Confidence 45677654 8999999999875
No 122
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=30.10 E-value=24 Score=33.37 Aligned_cols=12 Identities=25% Similarity=0.354 Sum_probs=10.4
Q ss_pred cchhHHHHHhhh
Q psy2949 105 KGKIQHEILHSL 116 (367)
Q Consensus 105 ~g~i~HEigHAL 116 (367)
.+++.||+||..
T Consensus 158 ~aVlaHElgHi~ 169 (302)
T COG0501 158 EAVLAHELGHIK 169 (302)
T ss_pred HHHHHHHHHHHh
Confidence 399999999964
No 123
>KOG3538|consensus
Probab=29.98 E-value=13 Score=41.15 Aligned_cols=21 Identities=29% Similarity=0.387 Sum_probs=17.2
Q ss_pred cchhHHHHHhhhcccccccCC
Q psy2949 105 KGKIQHEILHSLGFWHEHTRP 125 (367)
Q Consensus 105 ~g~i~HEigHALG~~HEh~rp 125 (367)
.-+|+||+||.+|+.|--.-.
T Consensus 318 aftiahE~GH~~gm~hd~~~~ 338 (845)
T KOG3538|consen 318 AFTIAHELGHNFGMEHDGRGN 338 (845)
T ss_pred ceeeeeccccccCcccCCcCC
Confidence 368999999999999984433
No 124
>PRK03001 M48 family peptidase; Provisional
Probab=29.76 E-value=25 Score=33.57 Aligned_cols=12 Identities=33% Similarity=0.667 Sum_probs=10.3
Q ss_pred cchhHHHHHhhh
Q psy2949 105 KGKIQHEILHSL 116 (367)
Q Consensus 105 ~g~i~HEigHAL 116 (367)
.++++||+||.-
T Consensus 125 ~aVlAHElgHi~ 136 (283)
T PRK03001 125 RGVMAHELAHVK 136 (283)
T ss_pred HHHHHHHHHHHh
Confidence 499999999964
No 125
>PRK03072 heat shock protein HtpX; Provisional
Probab=29.44 E-value=25 Score=33.70 Aligned_cols=21 Identities=5% Similarity=0.048 Sum_probs=13.4
Q ss_pred CHHHHHHHHHHHHHHHHcCCc
Q psy2949 39 TIYEKTLVENAIQDLRMHTCV 59 (367)
Q Consensus 39 ~~~~~~~I~~A~~~w~~~tci 59 (367)
++.+...+.+.+++..+...+
T Consensus 65 ~~~~~p~L~~~v~~la~~~g~ 85 (288)
T PRK03072 65 SEVQAPAMYRIVRELSTAARQ 85 (288)
T ss_pred ChhhhHHHHHHHHHHHHHcCC
Confidence 344555677777777776665
No 126
>PRK01345 heat shock protein HtpX; Provisional
Probab=29.38 E-value=26 Score=34.23 Aligned_cols=12 Identities=33% Similarity=0.581 Sum_probs=10.5
Q ss_pred cchhHHHHHhhh
Q psy2949 105 KGKIQHEILHSL 116 (367)
Q Consensus 105 ~g~i~HEigHAL 116 (367)
.++++|||||..
T Consensus 125 ~aVlAHElgHi~ 136 (317)
T PRK01345 125 AGVMAHELAHVK 136 (317)
T ss_pred HHHHHHHHHHHH
Confidence 499999999965
No 127
>PRK05457 heat shock protein HtpX; Provisional
Probab=28.29 E-value=28 Score=33.43 Aligned_cols=11 Identities=36% Similarity=0.555 Sum_probs=9.7
Q ss_pred cchhHHHHHhh
Q psy2949 105 KGKIQHEILHS 115 (367)
Q Consensus 105 ~g~i~HEigHA 115 (367)
.|+++||+||.
T Consensus 135 ~aVlAHElgHi 145 (284)
T PRK05457 135 EAVLAHEISHI 145 (284)
T ss_pred HHHHHHHHHHH
Confidence 49999999994
No 128
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=27.96 E-value=28 Score=37.70 Aligned_cols=18 Identities=17% Similarity=0.169 Sum_probs=13.0
Q ss_pred HHHHHHHHcCCcEEEEcC
Q psy2949 48 NAIQDLRMHTCVRFVPRT 65 (367)
Q Consensus 48 ~A~~~w~~~tci~F~~~~ 65 (367)
-.+...++-=.|+|++++
T Consensus 369 Glf~l~~~LfGi~f~~~~ 386 (681)
T PRK10280 369 GVFWTANQLFGIKFVERF 386 (681)
T ss_pred hHHHHHHHHcCeEEEECC
Confidence 344556777899999865
No 129
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=27.94 E-value=29 Score=31.05 Aligned_cols=22 Identities=18% Similarity=0.007 Sum_probs=18.0
Q ss_pred chhHHHHHhhhcccccccCCCC
Q psy2949 106 GKIQHEILHSLGFWHEHTRPDR 127 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~rpDR 127 (367)
..+.||+||++-......+..+
T Consensus 43 ~l~iHElgH~~~A~~~G~~~~~ 64 (183)
T cd06160 43 ILGIHEMGHYLAARRHGVKASL 64 (183)
T ss_pred HHHHHHHHHHHHHHHCCCCccc
Confidence 5789999999988887777664
No 130
>PF06167 Peptidase_M90: Glucose-regulated metallo-peptidase M90; InterPro: IPR010384 MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=27.78 E-value=30 Score=32.67 Aligned_cols=21 Identities=10% Similarity=0.077 Sum_probs=14.1
Q ss_pred CCCCHHHHHHHHHHHHHHHHc
Q psy2949 36 SEFTIYEKTLVENAIQDLRMH 56 (367)
Q Consensus 36 ~~~~~~~~~~I~~A~~~w~~~ 56 (367)
..+++++++.++.-+..+-..
T Consensus 39 ~~L~~~~~~~l~~~~~~Fl~~ 59 (253)
T PF06167_consen 39 RRLSPEEQRRLRQRVQLFLAE 59 (253)
T ss_dssp TT--HHHHHHHHHHHHHHHHH
T ss_pred hcCCHHHHHHHHHHHHHHhcc
Confidence 466888888888877776554
No 131
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=26.92 E-value=31 Score=33.32 Aligned_cols=11 Identities=27% Similarity=0.274 Sum_probs=9.3
Q ss_pred chhHHHHHhhh
Q psy2949 106 GKIQHEILHSL 116 (367)
Q Consensus 106 g~i~HEigHAL 116 (367)
.+|+||+||.+
T Consensus 172 yVlAHEyGHHV 182 (292)
T PF04228_consen 172 YVLAHEYGHHV 182 (292)
T ss_pred HHHHHHHHHHH
Confidence 48999999954
No 132
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=26.63 E-value=37 Score=24.68 Aligned_cols=17 Identities=29% Similarity=0.450 Sum_probs=15.6
Q ss_pred cchhHHHHHhhhccccc
Q psy2949 105 KGKIQHEILHSLGFWHE 121 (367)
Q Consensus 105 ~g~i~HEigHALG~~HE 121 (367)
...++|.+.|-||+.|.
T Consensus 30 eRriih~la~~lGL~~~ 46 (60)
T cd02639 30 ERRIVHLLASRLGLNHV 46 (60)
T ss_pred HHHHHHHHHHHcCCceE
Confidence 37899999999999998
No 133
>PRK04897 heat shock protein HtpX; Provisional
Probab=26.00 E-value=31 Score=33.29 Aligned_cols=11 Identities=55% Similarity=0.752 Sum_probs=9.7
Q ss_pred cchhHHHHHhh
Q psy2949 105 KGKIQHEILHS 115 (367)
Q Consensus 105 ~g~i~HEigHA 115 (367)
.++++||+||.
T Consensus 138 ~aVlAHElgHi 148 (298)
T PRK04897 138 EGVIGHEISHI 148 (298)
T ss_pred HHHHHHHHHHH
Confidence 49999999994
No 134
>KOG2556|consensus
Probab=25.17 E-value=34 Score=35.06 Aligned_cols=19 Identities=32% Similarity=0.450 Sum_probs=15.6
Q ss_pred chhHHHHHhhhcccccccC
Q psy2949 106 GKIQHEILHSLGFWHEHTR 124 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~r 124 (367)
.+|.||+.|||||.--+-.
T Consensus 257 stVkhE~~halgfsaGl~a 275 (666)
T KOG2556|consen 257 STVKHERDHALGFSAGLVA 275 (666)
T ss_pred HHHHHHHHhhhcccceeeE
Confidence 5899999999999766543
No 135
>PRK02391 heat shock protein HtpX; Provisional
Probab=25.05 E-value=34 Score=33.01 Aligned_cols=11 Identities=27% Similarity=0.543 Sum_probs=9.7
Q ss_pred cchhHHHHHhh
Q psy2949 105 KGKIQHEILHS 115 (367)
Q Consensus 105 ~g~i~HEigHA 115 (367)
.++++||+||.
T Consensus 134 ~aVlaHElgHi 144 (296)
T PRK02391 134 EAVLAHELSHV 144 (296)
T ss_pred HHHHHHHHHHH
Confidence 49999999994
No 136
>PRK01265 heat shock protein HtpX; Provisional
Probab=24.40 E-value=34 Score=33.51 Aligned_cols=11 Identities=36% Similarity=0.368 Sum_probs=9.7
Q ss_pred cchhHHHHHhh
Q psy2949 105 KGKIQHEILHS 115 (367)
Q Consensus 105 ~g~i~HEigHA 115 (367)
.++++||+||.
T Consensus 141 ~aVlAHElgHi 151 (324)
T PRK01265 141 KAVAGHELGHL 151 (324)
T ss_pred HHHHHHHHHHH
Confidence 49999999994
No 137
>PF05506 DUF756: Domain of unknown function (DUF756); InterPro: IPR008475 This domain is found, normally as a tandem repeat, at the C terminus of bacterial phospholipase C proteins.; GO: 0004629 phospholipase C activity, 0016042 lipid catabolic process
Probab=23.77 E-value=77 Score=24.42 Aligned_cols=41 Identities=10% Similarity=0.056 Sum_probs=35.7
Q ss_pred cEEEEcccCCCccccchhhccccccccceeeecCCceeeec
Q psy2949 326 WFYKIGGPSDDRKIMDKFFNNTIHFSKMKVGRDNSYTYRYA 366 (367)
Q Consensus 326 c~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 366 (367)
=.|.+.++.+++...-=-+...-+|=..+|..+++|..|||
T Consensus 47 ~~~~~~v~ag~~~~~~w~l~~s~gwYDl~v~~~~~F~rr~a 87 (89)
T PF05506_consen 47 GPWTYTVAAGQTVSLTWPLAASGGWYDLTVTGPNGFLRRFA 87 (89)
T ss_pred CCEEEEECCCCEEEEEEeecCCCCcEEEEEEcCCCEEEEec
Confidence 34899999998877776678899999999999999999987
No 138
>PF14891 Peptidase_M91: Effector protein
Probab=23.45 E-value=36 Score=30.00 Aligned_cols=18 Identities=17% Similarity=0.410 Sum_probs=14.8
Q ss_pred chhHHHHHhhhccccccc
Q psy2949 106 GKIQHEILHSLGFWHEHT 123 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~ 123 (367)
-++.|||.||.-+.+=-.
T Consensus 105 v~L~HEL~HA~~~~~Gt~ 122 (174)
T PF14891_consen 105 VVLYHELIHAYDYMNGTM 122 (174)
T ss_pred HHHHHHHHHHHHHHCCCC
Confidence 579999999997777655
No 139
>cd02325 R3H R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. R3H domains are found in proteins together with ATPase domains, SF1 helicase domains, SF2 DEAH helicase domains, Cys-rich repeats, ring-type zinc fingers, and KH domains. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=22.75 E-value=82 Score=21.31 Aligned_cols=30 Identities=17% Similarity=0.254 Sum_probs=25.3
Q ss_pred cchhHHHHHhhhcccccccCCCCCcceEEe
Q psy2949 105 KGKIQHEILHSLGFWHEHTRPDRDQFVRVL 134 (367)
Q Consensus 105 ~g~i~HEigHALG~~HEh~rpDRD~yv~I~ 134 (367)
..-++|++....|+.|+..-.+-+.+|.|.
T Consensus 29 ~R~~vH~la~~~~L~s~s~g~~~~r~v~i~ 58 (59)
T cd02325 29 ERKLIHDLAEYYGLKSESEGEGPNRRVVIT 58 (59)
T ss_pred HHHHHHHHHHHCCCEEEEecCCCCcEEEEe
Confidence 378999999999999999888877777663
No 140
>PF08810 KapB: Kinase associated protein B; InterPro: IPR014916 This bacterial protein forms an anti-parallel beta sheet with an extending alpha helical region. ; PDB: 1Y71_B.
Probab=22.38 E-value=21 Score=29.27 Aligned_cols=53 Identities=19% Similarity=0.334 Sum_probs=18.6
Q ss_pred CCccCCcccc-----ccccccccCCCCc-CCC-eeEEEEcCCCCCHHHHHHHHHHHHHHH
Q psy2949 2 EGDILIPRER-----DSRNLVLYQAQLW-PDK-TVYYNFEDSEFTIYEKTLVENAIQDLR 54 (367)
Q Consensus 2 ~GDi~~~~q~-----~~r~a~~~~~~~W-p~~-~VpY~~~~~~~~~~~~~~I~~A~~~w~ 54 (367)
|||...|.|. ..|+|++..++.| |.. +-||.-+-+++.++.+++|.+.++..+
T Consensus 31 QGDLHnP~q~dvpfFHERkALs~~Ek~~ip~~~vk~y~geip~Y~~SLq~A~~~~~~~L~ 90 (112)
T PF08810_consen 31 QGDLHNPKQTDVPFFHERKALSYREKTNIPKSMVKPYEGEIPDYEESLQQALEKLKEKLK 90 (112)
T ss_dssp ----------------------TT-EEEEEGGGEEE--S----HHHHHHHHHHHHHHTTT
T ss_pred ccCCCCCCcCCCcceehhhhcchHHHhcCCHHhcccCCCCCCChHHHHHHHHHHHHHHHh
Confidence 7998865553 3688999888888 444 557766545566666766666655543
No 141
>PRK04351 hypothetical protein; Provisional
Probab=21.70 E-value=47 Score=28.77 Aligned_cols=12 Identities=42% Similarity=0.584 Sum_probs=10.5
Q ss_pred cchhHHHHHhhh
Q psy2949 105 KGKIQHEILHSL 116 (367)
Q Consensus 105 ~g~i~HEigHAL 116 (367)
.+||.||+.|++
T Consensus 62 ~~vv~HElcH~~ 73 (149)
T PRK04351 62 IGIIKHELCHYH 73 (149)
T ss_pred HhhHHHHHHHHH
Confidence 489999999975
No 142
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=21.22 E-value=44 Score=29.91 Aligned_cols=13 Identities=23% Similarity=0.409 Sum_probs=10.4
Q ss_pred chhHHHHHhhhcc
Q psy2949 106 GKIQHEILHSLGF 118 (367)
Q Consensus 106 g~i~HEigHALG~ 118 (367)
.+++|||+||+.-
T Consensus 38 ~ilahel~hafd~ 50 (206)
T PF01431_consen 38 FILAHELMHAFDP 50 (206)
T ss_dssp HHHHHHHHHCTST
T ss_pred HHHHHHHHHHHHH
Confidence 4799999997754
No 143
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=21.08 E-value=45 Score=33.80 Aligned_cols=11 Identities=36% Similarity=0.250 Sum_probs=9.4
Q ss_pred chhHHHHHhhh
Q psy2949 106 GKIQHEILHSL 116 (367)
Q Consensus 106 g~i~HEigHAL 116 (367)
-+++||+||.|
T Consensus 16 ~v~~HE~gH~~ 26 (420)
T TIGR00054 16 LIFVHELGHFL 26 (420)
T ss_pred HHHHHhHHHHH
Confidence 57899999965
No 144
>PF07108 PipA: PipA protein; InterPro: IPR010777 This family consists of several Salmonella PipA (pathogenicity island-encoded protein A) and related phage sequences. PipA is thought to contribute to enteric but not to systemic salmonellosis [].
Probab=21.02 E-value=94 Score=27.24 Aligned_cols=52 Identities=17% Similarity=0.056 Sum_probs=27.6
Q ss_pred CceEEEecCCCCccCCCCC---CCCCcccccccCCCcccccchhHHHHHhhhccc-ccccCCCCC
Q psy2949 68 DTYLRFRNTGFGCASPVGY---FPIGTGIDIFLGGRVCFLKGKIQHEILHSLGFW-HEHTRPDRD 128 (367)
Q Consensus 68 ~~~I~i~~~~~gc~S~vG~---~~g~q~i~~~~l~~~c~~~g~i~HEigHALG~~-HEh~rpDRD 128 (367)
...|-..-++..|.++--. ..|+|.+. .....+||+.|||-.. -+-...-|+
T Consensus 122 rkvIcLnlDd~dd~~~~E~Yes~~G~qlFd---------~~RsFiHEiVHALt~L~Dke~nhprg 177 (200)
T PF07108_consen 122 RKVICLNLDDSDDDSYPEHYESNEGPQLFD---------TKRSFIHEIVHALTHLQDKEDNHPRG 177 (200)
T ss_pred ceEEEeecCCcccccchhhhhccCCccchh---------HHHHHHHHHHHHHhhcccccccCCCC
Confidence 3444444455566555433 22344332 2467899999999633 333333344
No 145
>PF13058 DUF3920: Protein of unknown function (DUF3920)
Probab=21.01 E-value=49 Score=27.20 Aligned_cols=15 Identities=33% Similarity=0.616 Sum_probs=12.8
Q ss_pred chhHHHHHhhhcccccc
Q psy2949 106 GKIQHEILHSLGFWHEH 122 (367)
Q Consensus 106 g~i~HEigHALG~~HEh 122 (367)
-|++|||-|++. ||+
T Consensus 78 ktllhefrh~mQ--h~~ 92 (126)
T PF13058_consen 78 KTLLHEFRHAMQ--HEK 92 (126)
T ss_pred HHHHHHHHHHHH--hhh
Confidence 699999999987 655
No 146
>PRK15410 DgsA anti-repressor MtfA; Provisional
Probab=20.70 E-value=44 Score=31.74 Aligned_cols=17 Identities=29% Similarity=0.307 Sum_probs=14.3
Q ss_pred ccchhHHHHHhhhcccc
Q psy2949 104 LKGKIQHEILHSLGFWH 120 (367)
Q Consensus 104 ~~g~i~HEigHALG~~H 120 (367)
..-+++|||.|.|-+.+
T Consensus 142 g~NVvIHEFAH~LDm~~ 158 (260)
T PRK15410 142 GFNLIIHEVAHKLDMRN 158 (260)
T ss_pred CcchhHhHHHhHhhhhc
Confidence 34799999999998866
No 147
>PF13203 DUF2201_N: Putative metallopeptidase domain
Probab=20.47 E-value=53 Score=31.27 Aligned_cols=20 Identities=40% Similarity=0.677 Sum_probs=15.8
Q ss_pred chhHHHHHhhhcccccccCCC
Q psy2949 106 GKIQHEILHSLGFWHEHTRPD 126 (367)
Q Consensus 106 g~i~HEigHALG~~HEh~rpD 126 (367)
++++||++|.+ |.|-.-+..
T Consensus 62 ~~l~HevlH~~-~~H~~r~~~ 81 (292)
T PF13203_consen 62 GLLLHEVLHCL-LRHPWRRGG 81 (292)
T ss_pred HHHHHHHHHHH-ccchhhhcc
Confidence 78999999976 666666665
Done!