Query psy2976
Match_columns 184
No_of_seqs 146 out of 1128
Neff 6.8
Searched_HMMs 46136
Date Fri Aug 16 22:22:03 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2976.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2976hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd04281 ZnMc_BMP1_TLD Zinc-dep 100.0 3E-45 6.5E-50 298.0 12.7 121 62-184 2-122 (200)
2 cd04282 ZnMc_meprin Zinc-depen 100.0 1.1E-44 2.4E-49 299.7 13.3 134 35-184 22-155 (230)
3 PF01400 Astacin: Astacin (Pep 100.0 1.9E-41 4.1E-46 273.9 10.2 114 68-184 1-114 (191)
4 cd04283 ZnMc_hatching_enzyme Z 100.0 1.9E-40 4E-45 266.2 11.5 107 74-184 5-112 (182)
5 KOG3714|consensus 100.0 1.2E-40 2.5E-45 296.6 11.2 127 56-184 68-194 (411)
6 cd04280 ZnMc_astacin_like Zinc 100.0 6.1E-40 1.3E-44 262.9 11.4 109 72-184 1-109 (180)
7 smart00235 ZnMc Zinc-dependent 100.0 1.3E-32 2.9E-37 210.7 12.2 115 67-183 2-120 (140)
8 cd04327 ZnMc_MMP_like_3 Zinc-d 100.0 2.6E-31 5.6E-36 215.5 11.7 110 68-182 1-121 (198)
9 cd04268 ZnMc_MMP_like Zinc-dep 99.9 7E-25 1.5E-29 171.3 8.4 103 73-176 2-121 (165)
10 cd04278 ZnMc_MMP Zinc-dependen 99.8 6.9E-19 1.5E-23 137.4 10.2 103 69-171 1-129 (157)
11 cd00203 ZnMc Zinc-dependent me 99.7 9E-17 2E-21 125.3 9.9 106 74-179 2-126 (167)
12 cd04279 ZnMc_MMP_like_1 Zinc-d 99.6 5.4E-15 1.2E-19 115.2 10.3 97 74-170 3-125 (156)
13 PF00413 Peptidase_M10: Matrix 99.6 3.4E-14 7.5E-19 109.4 11.5 102 69-170 1-126 (154)
14 cd04277 ZnMc_serralysin_like Z 99.6 3.1E-14 6.8E-19 113.9 10.8 93 82-175 30-139 (186)
15 KOG1565|consensus 98.9 2.9E-09 6.2E-14 96.8 7.5 102 67-168 103-230 (469)
16 PF12388 Peptidase_M57: Dual-a 98.8 3.8E-08 8.3E-13 80.7 9.7 127 39-167 5-151 (211)
17 cd04276 ZnMc_MMP_like_2 Zinc-d 98.6 6.5E-07 1.4E-11 73.0 10.6 99 71-170 6-137 (197)
18 PF02031 Peptidase_M7: Strepto 98.3 1.1E-06 2.3E-11 66.7 4.7 86 74-165 3-93 (132)
19 COG5549 Predicted Zn-dependent 97.3 0.00038 8.3E-09 57.1 4.9 82 86-170 104-209 (236)
20 PF13583 Reprolysin_4: Metallo 96.7 0.0008 1.7E-08 55.0 2.0 20 149-168 137-156 (206)
21 PF13582 Reprolysin_3: Metallo 96.7 0.00073 1.6E-08 50.0 1.6 18 149-166 107-124 (124)
22 cd04267 ZnMc_ADAM_like Zinc-de 96.6 0.0016 3.5E-08 52.0 2.6 21 149-169 133-153 (192)
23 PF05572 Peptidase_M43: Pregna 96.5 0.0016 3.5E-08 51.0 2.0 21 150-170 70-90 (154)
24 PF13688 Reprolysin_5: Metallo 96.2 0.0028 6E-08 50.6 2.0 23 148-170 141-163 (196)
25 PF05548 Peptidase_M11: Gameto 96.1 0.02 4.3E-07 49.9 6.9 81 82-168 84-169 (314)
26 PF13574 Reprolysin_2: Metallo 96.0 0.0042 9.1E-08 49.3 2.3 21 149-169 111-131 (173)
27 cd04272 ZnMc_salivary_gland_MP 95.9 0.0072 1.6E-07 49.5 3.0 21 149-169 145-165 (220)
28 cd04269 ZnMc_adamalysin_II_lik 95.7 0.006 1.3E-07 48.8 2.0 19 150-168 132-150 (194)
29 cd04270 ZnMc_TACE_like Zinc-de 95.3 0.0094 2E-07 49.9 1.8 19 150-168 168-186 (244)
30 cd04275 ZnMc_pappalysin_like Z 95.3 0.0018 3.9E-08 53.8 -2.6 22 149-170 137-158 (225)
31 cd04271 ZnMc_ADAM_fungal Zinc- 95.2 0.0073 1.6E-07 50.2 0.8 18 151-168 147-164 (228)
32 cd04273 ZnMc_ADAMTS_like Zinc- 95.1 0.006 1.3E-07 49.5 0.1 20 149-168 140-159 (207)
33 COG1913 Predicted Zn-dependent 95.1 0.01 2.2E-07 47.5 1.3 20 150-169 125-144 (181)
34 PF01421 Reprolysin: Reprolysi 95.1 0.021 4.5E-07 45.9 3.1 19 150-168 132-150 (199)
35 PRK13267 archaemetzincin-like 94.7 0.017 3.8E-07 46.4 1.8 16 151-166 127-142 (179)
36 PF07998 Peptidase_M54: Peptid 94.4 0.021 4.5E-07 46.6 1.6 19 150-168 146-164 (194)
37 PF11350 DUF3152: Protein of u 94.3 0.02 4.3E-07 46.9 1.2 95 73-168 29-158 (203)
38 TIGR03296 M6dom_TIGR03296 M6 f 93.5 0.017 3.6E-07 49.5 -0.7 21 149-169 165-185 (286)
39 PF12044 Metallopep: Putative 93.1 0.063 1.4E-06 48.6 2.4 30 149-178 316-345 (423)
40 PF10462 Peptidase_M66: Peptid 92.7 0.063 1.4E-06 46.7 1.7 17 149-165 193-209 (305)
41 PF01457 Peptidase_M8: Leishma 92.4 0.074 1.6E-06 49.3 2.0 17 149-165 210-226 (521)
42 cd06459 M3B_Oligoendopeptidase 91.7 0.44 9.6E-06 42.1 6.0 21 142-162 215-235 (427)
43 PTZ00337 surface protease GP63 91.5 0.11 2.3E-06 48.8 1.9 18 148-165 228-245 (567)
44 TIGR00181 pepF oligoendopeptid 91.4 0.45 9.7E-06 44.5 6.0 22 142-163 371-392 (591)
45 cd02641 R3H_Smubp-2_like R3H d 91.1 0.35 7.7E-06 31.9 3.6 41 139-179 18-59 (60)
46 TIGR02290 M3_fam_3 oligoendope 90.7 0.57 1.2E-05 43.9 5.9 22 142-163 368-389 (587)
47 PTZ00257 Glycoprotein GP63 (le 90.3 0.16 3.5E-06 48.0 2.0 18 148-165 255-272 (622)
48 TIGR02289 M3_not_pepF oligoend 88.9 0.84 1.8E-05 42.4 5.5 17 147-163 335-351 (549)
49 KOG4525|consensus 88.6 0.27 5.8E-06 44.8 1.9 20 149-168 301-320 (614)
50 PF12725 DUF3810: Protein of u 87.9 0.35 7.6E-06 42.2 2.2 21 147-167 194-214 (318)
51 PF05547 Peptidase_M6: Immune 87.1 0.074 1.6E-06 50.6 -2.6 19 149-167 221-239 (645)
52 cd06007 R3H_DEXH_helicase R3H 86.3 1.4 3E-05 29.1 3.9 42 138-179 16-58 (59)
53 KOG3607|consensus 85.9 0.45 9.8E-06 45.9 1.9 19 149-167 323-341 (716)
54 cd02640 R3H_NRF R3H domain of 85.4 1.4 3E-05 29.1 3.6 42 138-179 17-59 (60)
55 PF09471 Peptidase_M64: IgA Pe 82.7 0.86 1.9E-05 38.8 2.0 14 149-162 216-229 (264)
56 PF04298 Zn_peptidase_2: Putat 80.3 1.8 3.8E-05 36.1 3.1 26 137-162 71-102 (222)
57 cd02642 R3H_encore_like R3H do 80.0 3.4 7.4E-05 27.2 3.9 41 139-180 22-62 (63)
58 TIGR00043 metalloprotein, YbeY 79.2 1.4 3.1E-05 32.5 2.0 19 150-168 74-92 (110)
59 PF01863 DUF45: Protein of unk 79.0 5.7 0.00012 31.5 5.6 18 149-166 164-181 (205)
60 COG2856 Predicted Zn peptidase 78.7 1.1 2.3E-05 37.1 1.3 12 150-161 73-84 (213)
61 cd02646 R3H_G-patch R3H domain 78.5 4.8 0.0001 26.1 4.1 41 139-179 17-57 (58)
62 PF02130 UPF0054: Uncharacteri 76.2 1.9 4.1E-05 33.3 1.9 17 150-166 109-125 (145)
63 PF06114 DUF955: Domain of unk 75.8 1.7 3.6E-05 30.8 1.5 24 138-161 28-54 (122)
64 COG0319 Predicted metal-depend 73.6 2.4 5.2E-05 33.3 2.0 19 150-168 109-127 (153)
65 PF06262 DUF1025: Possibl zinc 73.3 2.3 4.9E-05 30.9 1.6 17 150-166 74-90 (97)
66 COG1164 Oligoendopeptidase F [ 73.2 7.8 0.00017 36.7 5.6 17 147-163 378-394 (598)
67 PRK00016 metal-binding heat sh 72.4 2.4 5.3E-05 33.3 1.8 18 150-167 114-131 (159)
68 PF11667 DUF3267: Protein of u 71.7 1.8 3.9E-05 31.5 0.8 20 151-170 6-25 (111)
69 cd06258 Peptidase_M3_like The 70.7 2.5 5.5E-05 36.8 1.7 14 148-161 153-166 (365)
70 PF01432 Peptidase_M3: Peptida 69.3 2.9 6.3E-05 37.7 1.8 15 149-163 242-256 (458)
71 PF14247 DUF4344: Domain of un 69.0 3.1 6.7E-05 34.6 1.8 13 149-161 92-104 (220)
72 PRK13963 unkown domain/putativ 68.9 3.4 7.3E-05 35.1 2.0 17 150-166 216-232 (258)
73 cd06455 M3A_TOP Peptidase M3 T 65.7 3.7 7.9E-05 37.5 1.7 13 149-161 263-275 (472)
74 COG2738 Predicted Zn-dependent 65.0 3.7 8.1E-05 33.7 1.4 24 139-162 76-105 (226)
75 PF01447 Peptidase_M4: Thermol 64.8 3.9 8.5E-05 31.8 1.5 12 150-161 136-147 (150)
76 COG4783 Putative Zn-dependent 63.6 4.1 8.9E-05 37.6 1.6 16 147-162 128-143 (484)
77 COG3824 Predicted Zn-dependent 63.4 3.3 7.1E-05 31.5 0.8 30 137-166 83-126 (136)
78 cd06457 M3A_MIP Peptidase M3 m 63.0 4.4 9.6E-05 36.9 1.7 17 149-165 248-264 (458)
79 cd06162 S2P-M50_PDZ_SREBP Ster 61.2 5.3 0.00011 34.4 1.8 16 149-164 135-150 (277)
80 cd06159 S2P-M50_PDZ_Arch Uncha 60.5 5.5 0.00012 33.9 1.8 23 149-171 118-140 (263)
81 cd06161 S2P-M50_SpoIVFB SpoIVF 58.9 6.2 0.00013 32.1 1.8 24 148-171 37-60 (208)
82 PF01435 Peptidase_M48: Peptid 58.6 6.8 0.00015 31.2 1.9 14 149-162 89-102 (226)
83 PF13485 Peptidase_MA_2: Pepti 58.2 6.5 0.00014 27.9 1.6 12 150-161 26-37 (128)
84 cd06460 M32_Taq Peptidase fami 58.0 7.2 0.00016 35.2 2.1 20 148-170 158-177 (396)
85 cd06456 M3A_DCP_Oligopeptidase 57.6 6.3 0.00014 35.6 1.7 13 149-161 208-220 (422)
86 cd06164 S2P-M50_SpoIVFB_CBS Sp 54.2 8.2 0.00018 31.9 1.8 22 149-170 53-74 (227)
87 PF14521 Aspzincin_M35: Lysine 54.1 7.9 0.00017 29.9 1.5 14 149-162 96-111 (148)
88 smart00731 SprT SprT homologue 53.0 8.5 0.00018 29.4 1.5 15 149-163 59-73 (146)
89 PF10263 SprT-like: SprT-like 52.8 9.3 0.0002 29.1 1.7 18 148-165 59-76 (157)
90 KOG2921|consensus 51.5 8.4 0.00018 34.9 1.5 18 150-167 132-149 (484)
91 PLN02887 hydrolase family prot 51.0 10 0.00022 35.9 2.0 19 150-168 231-249 (580)
92 PF12315 DUF3633: Protein of u 49.2 10 0.00022 31.3 1.5 13 149-161 93-105 (212)
93 KOG3658|consensus 48.1 6.1 0.00013 38.0 0.0 18 151-168 394-411 (764)
94 PRK10911 oligopeptidase A; Pro 48.1 10 0.00022 36.4 1.6 13 149-161 463-475 (680)
95 PRK03982 heat shock protein Ht 48.0 11 0.00024 32.1 1.6 14 148-161 124-137 (288)
96 PRK03072 heat shock protein Ht 48.0 11 0.00024 32.3 1.6 14 148-161 126-139 (288)
97 PF13699 DUF4157: Domain of un 47.4 12 0.00025 26.0 1.4 14 149-162 61-74 (79)
98 COG0501 HtpX Zn-dependent prot 47.3 12 0.00026 31.3 1.7 14 148-161 156-169 (302)
99 KOG3538|consensus 47.3 4 8.8E-05 40.1 -1.3 21 150-170 318-338 (845)
100 PRK05457 heat shock protein Ht 46.1 12 0.00026 32.0 1.6 13 148-160 133-145 (284)
101 PRK03001 M48 family peptidase; 46.1 12 0.00026 31.8 1.6 14 148-161 123-136 (283)
102 PF06167 Peptidase_M90: Glucos 45.2 12 0.00027 31.6 1.5 17 148-164 153-169 (253)
103 PHA02456 zinc metallopeptidase 44.9 13 0.00027 28.0 1.3 36 124-163 58-93 (141)
104 PRK01345 heat shock protein Ht 44.8 13 0.00028 32.3 1.6 14 148-161 123-136 (317)
105 cd06160 S2P-M50_like_2 Unchara 44.2 15 0.00032 29.5 1.8 25 149-173 41-65 (183)
106 PRK04897 heat shock protein Ht 44.0 14 0.0003 31.8 1.6 13 149-161 137-149 (298)
107 cd02639 R3H_RRM R3H domain of 44.0 16 0.00034 24.1 1.5 28 139-166 18-46 (60)
108 PF15639 Tox-MPTase3: Metallop 43.7 46 0.001 25.5 4.2 70 89-160 23-111 (135)
109 KOG2556|consensus 43.5 13 0.00029 34.5 1.5 18 150-167 256-273 (666)
110 PF07108 PipA: PipA protein; 42.8 9.5 0.00021 30.4 0.4 44 115-164 122-167 (200)
111 PRK02870 heat shock protein Ht 42.8 15 0.00033 32.3 1.7 13 148-160 172-184 (336)
112 PF13203 DUF2201_N: Putative m 42.3 18 0.00038 30.6 2.0 22 149-171 60-81 (292)
113 PF04228 Zn_peptidase: Putativ 41.7 15 0.00032 31.9 1.4 12 150-161 171-182 (292)
114 PRK02391 heat shock protein Ht 40.9 16 0.00036 31.4 1.6 13 148-160 132-144 (296)
115 PRK01265 heat shock protein Ht 40.6 17 0.00036 32.0 1.6 13 148-160 139-151 (324)
116 KOG3624|consensus 39.1 21 0.00045 34.1 2.2 30 151-182 520-549 (687)
117 PF02102 Peptidase_M35: Deuter 38.8 22 0.00047 31.8 2.0 20 145-164 290-312 (359)
118 cd02325 R3H R3H domain. The na 38.4 50 0.0011 20.0 3.2 41 139-179 18-58 (59)
119 cd06006 R3H_unknown_2 R3H doma 38.1 33 0.00071 22.5 2.3 39 141-179 20-58 (59)
120 PRK04351 hypothetical protein; 36.2 22 0.00048 27.6 1.5 13 149-161 61-73 (149)
121 PF01431 Peptidase_M13: Peptid 36.2 19 0.00041 28.6 1.2 20 151-172 38-57 (206)
122 PF14891 Peptidase_M91: Effect 35.4 25 0.00054 27.7 1.7 19 150-168 104-122 (174)
123 PRK10280 dipeptidyl carboxypep 35.1 23 0.00049 34.2 1.7 13 150-162 465-477 (681)
124 PRK04860 hypothetical protein; 34.3 26 0.00057 27.6 1.7 13 149-161 63-75 (160)
125 PLN02791 Nudix hydrolase homol 33.8 20 0.00044 35.2 1.1 20 150-169 586-605 (770)
126 PRK15410 DgsA anti-repressor M 32.8 21 0.00045 30.4 1.0 18 148-165 141-158 (260)
127 PF10460 Peptidase_M30: Peptid 32.6 28 0.00061 31.2 1.8 16 149-164 139-154 (366)
128 COG2317 Zn-dependent carboxype 30.2 24 0.00052 32.7 0.9 11 151-161 261-271 (497)
129 KOG0201|consensus 27.4 20 0.00044 32.9 -0.0 30 150-183 113-142 (467)
130 smart00393 R3H Putative single 27.2 73 0.0016 21.5 2.8 39 141-179 39-77 (79)
131 PF04450 BSP: Peptidase of pla 26.0 44 0.00095 27.3 1.7 16 149-164 96-111 (205)
132 PF01434 Peptidase_M41: Peptid 24.2 45 0.00098 27.0 1.5 13 149-161 28-40 (213)
133 PF02074 Peptidase_M32: Carbox 24.2 43 0.00094 31.1 1.5 13 149-161 259-271 (494)
134 PF09768 Peptidase_M76: Peptid 23.2 48 0.001 26.5 1.4 14 149-162 71-84 (173)
135 COG0339 Dcp Zn-dependent oligo 22.6 49 0.0011 32.0 1.5 24 89-112 365-388 (683)
136 KOG2719|consensus 22.4 46 0.001 30.4 1.3 13 148-160 279-291 (428)
137 KOG3314|consensus 21.7 56 0.0012 26.3 1.5 24 150-183 92-115 (194)
138 PF07023 DUF1315: Protein of u 21.6 1.3E+02 0.0028 21.7 3.2 47 67-121 22-68 (93)
139 cd06461 M2_ACE Peptidase famil 21.3 58 0.0013 30.1 1.7 16 148-163 246-261 (477)
140 KOG3854|consensus 21.0 84 0.0018 29.2 2.6 40 124-163 321-365 (505)
141 PF05452 Clavanin: Clavanin; 20.8 28 0.00061 23.7 -0.3 13 150-162 41-53 (80)
142 KOG2661|consensus 20.8 56 0.0012 29.2 1.4 12 150-161 276-287 (424)
143 COG4900 Predicted metallopepti 20.5 55 0.0012 24.6 1.1 15 146-160 76-91 (133)
144 PF10023 DUF2265: Predicted am 20.1 79 0.0017 28.0 2.2 13 149-161 165-177 (337)
No 1
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=100.00 E-value=3e-45 Score=297.95 Aligned_cols=121 Identities=80% Similarity=1.446 Sum_probs=112.4
Q ss_pred cccCCCCCcCCCceEEEEcCCCCHHHHHHHHHHHHHhhcccceEEEEcCCCCCCcEEEEecCCCCcccccCCCCCCccee
Q psy2976 62 ATARKERIWDYGVIPYEIDSNFGGQHKALFKQAMRHWENFTCVKFVERSSTEHPNYILFTEKPCGCCSFVGKRGNGQQAI 141 (184)
Q Consensus 62 a~~~~~~~Wp~g~IpY~~~~~~~~~~r~~I~~A~~~we~~TcV~F~e~~~~~~~~~I~f~~~~~gC~S~vG~~~~g~q~i 141 (184)
|......+||+++|||.|+++|+...+..|++||++|++.|||+|+|++. +.+||.|...+.||||+||+.+.|.|.|
T Consensus 2 ~~~~~~~~Wp~~~VpY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~r~~--~~~yi~f~~~~~GC~S~vG~~~~g~q~i 79 (200)
T cd04281 2 ATARKERIWPGGVIPYVIDGNFTGSQRAMFKQAMRHWENFTCVTFVERTP--EENYIVFTYRPCGCCSYVGRRGNGPQAI 79 (200)
T ss_pred CccCccCcCCCCEEEEEECCCCCHHHHHHHHHHHHHHHhCCceEEEECCC--CCCEEEEEECCCCeeEcCCCcCCCceee
Confidence 34456789999999999999999999999999999999999999999987 5799999865679999999988789999
Q ss_pred ecCCCCchHHHHHHHHHHhhCCCCCCCCCCCCCceEeecccCC
Q psy2976 142 SIGKNCDKFGIVVHELGHVVGFWHEHTRPDRDNHVQIIRENIM 184 (184)
Q Consensus 142 sl~~~c~~~g~i~HElgHALG~~HEh~RpDRD~yV~I~~~NI~ 184 (184)
+|+.+|...|+|+|||||||||+|||+|||||+||+|+|+||+
T Consensus 80 sl~~~C~~~Gti~HEl~HaLGf~HEhsRpDRD~yV~I~~~nI~ 122 (200)
T cd04281 80 SIGKNCDKFGIVVHELGHVIGFWHEHTRPDRDDHVTIIRENIQ 122 (200)
T ss_pred ecCCCcCcCchHHHHHHHHhcCcchhccccccceEEEeecccC
Confidence 9999999999999999999999999999999999999999995
No 2
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=100.00 E-value=1.1e-44 Score=299.74 Aligned_cols=134 Identities=32% Similarity=0.601 Sum_probs=121.0
Q ss_pred CCCCCceEEccCCcccccccCCCCCcccccCCCCCcCCCceEEEEcCCCCHHHHHHHHHHHHHhhcccceEEEEcCCCCC
Q psy2976 35 PPSPSTIVVSQTSLPEAYTVHHRDTRAATARKERIWDYGVIPYEIDSNFGGQHKALFKQAMRHWENFTCVKFVERSSTEH 114 (184)
Q Consensus 35 ~~~~~~i~~~~~~~~~~~~~~~R~kR~a~~~~~~~Wp~g~IpY~~~~~~~~~~r~~I~~A~~~we~~TcV~F~e~~~~~~ 114 (184)
..-.+||.+.. .++|.|+.....+||+ +|||.|+++|+...++.|++||++|++.|||+|++++. +
T Consensus 22 ~~~eGDI~l~~-----------~~~R~a~~~~~~~Wp~-~VPY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~rt~--e 87 (230)
T cd04282 22 DLFEGDILLDE-----------GQSRNGLIGDTYRWPF-PIPYILDDSLDLNAKGVILKAFEMYRLKSCVDFKPYEG--E 87 (230)
T ss_pred ccccccccCCc-----------ccccccccCcccCCCc-ceeEEECCCCCHHHHHHHHHHHHHHHhCCCeeEEECCC--C
Confidence 35566666652 2367888888899999 89999999999999999999999999999999999987 4
Q ss_pred CcEEEEecCCCCcccccCCCCCCcceeecCCCCchHHHHHHHHHHhhCCCCCCCCCCCCCceEeecccCC
Q psy2976 115 PNYILFTEKPCGCCSFVGKRGNGQQAISIGKNCDKFGIVVHELGHVVGFWHEHTRPDRDNHVQIIRENIM 184 (184)
Q Consensus 115 ~~~I~f~~~~~gC~S~vG~~~~g~q~isl~~~c~~~g~i~HElgHALG~~HEh~RpDRD~yV~I~~~NI~ 184 (184)
.+||.|.. +.||||+||+.+ |.|.|+|+.+|...|+|+|||||||||+|||+|||||+||+|+|+||.
T Consensus 88 ~~yi~i~~-~~GC~S~vG~~g-g~q~isl~~~C~~~Gti~HEl~HalGf~HEqsRpDRD~yV~I~~~nI~ 155 (230)
T cd04282 88 SNYIFFFK-GSGCWSMVGDQQ-GGQNLSIGAGCDYKATVEHEFLHALGFYHEQSRSDRDDYVKIWWDQIL 155 (230)
T ss_pred CcEEEEEc-CCCeeeccCccC-CeEEEEECCCcCCCchHHHHHHHHhCCcccccccccccceEEeecccC
Confidence 79999984 789999999987 899999999999999999999999999999999999999999999995
No 3
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=100.00 E-value=1.9e-41 Score=273.85 Aligned_cols=114 Identities=46% Similarity=1.008 Sum_probs=96.4
Q ss_pred CCcCCCceEEEEcCCCCHHHHHHHHHHHHHhhcccceEEEEcCCCCCCcEEEEecCCCCcccccCCCCCCcceeecCCCC
Q psy2976 68 RIWDYGVIPYEIDSNFGGQHKALFKQAMRHWENFTCVKFVERSSTEHPNYILFTEKPCGCCSFVGKRGNGQQAISIGKNC 147 (184)
Q Consensus 68 ~~Wp~g~IpY~~~~~~~~~~r~~I~~A~~~we~~TcV~F~e~~~~~~~~~I~f~~~~~gC~S~vG~~~~g~q~isl~~~c 147 (184)
++||+++|||.|+++|++..++.|++||+.|++.|||+|++++ .....+|.|. .+.||||+||+.+ |.|.|+|+.+|
T Consensus 1 ~~Wp~~~IpY~~~~~~~~~~~~~I~~Am~~~e~~TcI~F~~~~-~~~~~~i~~~-~~~gC~S~vG~~~-g~q~i~l~~~c 77 (191)
T PF01400_consen 1 KKWPNGTIPYYIDPSFSSSQRQRIRKAMDEWEKNTCIRFVERT-ENEDDYISFS-NGSGCWSYVGRQG-GEQTINLGDGC 77 (191)
T ss_dssp -S-GGGEEEEEEETTS-HHHHHHHHHHHHHHHHHSSEEEEE-S-SSSSSEEEEE-SSSSEEEESS--S-SEEEEEE-TTC
T ss_pred CcCCCCEEEEEECCCCCHHHHHHHHHHHHHHHhCCCEEEEECC-CCCceEEEee-cCccccchhhhcC-cceeEEeccee
Confidence 5899999999999999999999999999999999999999999 3367899997 4789999999995 99999999999
Q ss_pred chHHHHHHHHHHhhCCCCCCCCCCCCCceEeecccCC
Q psy2976 148 DKFGIVVHELGHVVGFWHEHTRPDRDNHVQIIRENIM 184 (184)
Q Consensus 148 ~~~g~i~HElgHALG~~HEh~RpDRD~yV~I~~~NI~ 184 (184)
...|+|+|||||||||.|||+|||||+||+|+|+||+
T Consensus 78 ~~~~~i~HEl~HaLG~~HEh~RpDRd~yi~i~~~~i~ 114 (191)
T PF01400_consen 78 FSVGTILHELGHALGFWHEHQRPDRDNYITINWDNIQ 114 (191)
T ss_dssp -SHHHHHHHHHHHHTB--GGGSTTGGGTEEE-GGGB-
T ss_pred CCccchHHHHHHHHhhhhhhhccccccEEEEehhcch
Confidence 9999999999999999999999999999999999984
No 4
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=100.00 E-value=1.9e-40 Score=266.24 Aligned_cols=107 Identities=40% Similarity=0.835 Sum_probs=100.8
Q ss_pred ceEEEEcCCCCHHHHHHHHHHHHHhhcccceEEEEcCCCCCCcEEEEecCCCCcccccCCCCCCcceeecC-CCCchHHH
Q psy2976 74 VIPYEIDSNFGGQHKALFKQAMRHWENFTCVKFVERSSTEHPNYILFTEKPCGCCSFVGKRGNGQQAISIG-KNCDKFGI 152 (184)
Q Consensus 74 ~IpY~~~~~~~~~~r~~I~~A~~~we~~TcV~F~e~~~~~~~~~I~f~~~~~gC~S~vG~~~~g~q~isl~-~~c~~~g~ 152 (184)
+|||.|+++|++..++.|++||++|++.|||+|++++. +.+||.|.. +.||||+||+.+ |.|.|+|+ .+|...|+
T Consensus 5 ~IpY~i~~~~~~~~~~~I~~A~~~~~~~TCirF~~~~~--~~~yi~~~~-~~gC~S~vG~~g-g~q~i~l~~~~C~~~G~ 80 (182)
T cd04283 5 YVPYVISPQYSENERAVIEKAMQEFETLTCVRFVPRTT--ERDYLNIES-RSGCWSYIGRQG-GRQTVSLQKQGCMYKGI 80 (182)
T ss_pred EEEEEECCCCCHHHHHHHHHHHHHHHhCCceeeEECCC--CCcEEEEEc-CCCceEecCccC-CceeEecCCCCcCccch
Confidence 79999999999999999999999999999999999876 578999984 789999999997 89999997 58999999
Q ss_pred HHHHHHHhhCCCCCCCCCCCCCceEeecccCC
Q psy2976 153 VVHELGHVVGFWHEHTRPDRDNHVQIIRENIM 184 (184)
Q Consensus 153 i~HElgHALG~~HEh~RpDRD~yV~I~~~NI~ 184 (184)
|+|||||||||+|||+|||||+||+|+|+||.
T Consensus 81 i~HEl~HaLG~~HEhsRpDRD~yV~I~~~nI~ 112 (182)
T cd04283 81 IQHELLHALGFYHEQTRSDRDKYVRINWENII 112 (182)
T ss_pred HHHHHHHHhCCcccccccccCceEEEehhhcC
Confidence 99999999999999999999999999999984
No 5
>KOG3714|consensus
Probab=100.00 E-value=1.2e-40 Score=296.55 Aligned_cols=127 Identities=52% Similarity=0.987 Sum_probs=116.3
Q ss_pred CCCCcccccCCCCCcCCCceEEEEcCCCCHHHHHHHHHHHHHhhcccceEEEEcCCCCCCcEEEEecCCCCcccccCCCC
Q psy2976 56 HRDTRAATARKERIWDYGVIPYEIDSNFGGQHKALFKQAMRHWENFTCVKFVERSSTEHPNYILFTEKPCGCCSFVGKRG 135 (184)
Q Consensus 56 ~R~kR~a~~~~~~~Wp~g~IpY~~~~~~~~~~r~~I~~A~~~we~~TcV~F~e~~~~~~~~~I~f~~~~~gC~S~vG~~~ 135 (184)
.|.+|.++.....+||+++|||.|+++|+...|++|+.||++|+++|||+|+|++.. ..+|+.+.... ||||+||+.|
T Consensus 68 ~~~~r~~~~~~~~~Wp~~~ipY~i~~~~~~~~r~~i~~A~~~~~~~TCi~F~~~~~~-~~~~~~~~~~~-gC~S~VGr~g 145 (411)
T KOG3714|consen 68 KRSRRNGTSNPERRWPNGVIPYYIDGSFTSSQRALIRQAMREIENHTCIRFVERTTP-DKDYLIVFTGG-GCYSYVGRRG 145 (411)
T ss_pred chhhhhcccChhhcCCCCeeeeEECCCCCHHHHHHHHHHHHHHhhCcceEEEeCCCC-CcceEEEeCCC-cceeeeCccC
Confidence 467788888899999999999999999999999999999999999999999999854 45677666444 9999999999
Q ss_pred CCcceeecCCCCchHHHHHHHHHHhhCCCCCCCCCCCCCceEeecccCC
Q psy2976 136 NGQQAISIGKNCDKFGIVVHELGHVVGFWHEHTRPDRDNHVQIIRENIM 184 (184)
Q Consensus 136 ~g~q~isl~~~c~~~g~i~HElgHALG~~HEh~RpDRD~yV~I~~~NI~ 184 (184)
+++|.++|+.+|..+|+|+|||||||||+|||+|||||+||+|+|+||.
T Consensus 146 g~~q~~sl~~~C~~~G~i~HEl~HaLGf~HehsR~DRD~yV~I~~~ni~ 194 (411)
T KOG3714|consen 146 GGQQLLSLGDGCDRFGTIVHELMHALGFWHEHSRPDRDNYVSINWDNID 194 (411)
T ss_pred CCccceecCCCcCcCchhHHHHHHHhhhhhccCcccccCceEEeeccCC
Confidence 6679999999999999999999999999999999999999999999995
No 6
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=100.00 E-value=6.1e-40 Score=262.86 Aligned_cols=109 Identities=45% Similarity=0.872 Sum_probs=102.9
Q ss_pred CCceEEEEcCCCCHHHHHHHHHHHHHhhcccceEEEEcCCCCCCcEEEEecCCCCcccccCCCCCCcceeecCCCCchHH
Q psy2976 72 YGVIPYEIDSNFGGQHKALFKQAMRHWENFTCVKFVERSSTEHPNYILFTEKPCGCCSFVGKRGNGQQAISIGKNCDKFG 151 (184)
Q Consensus 72 ~g~IpY~~~~~~~~~~r~~I~~A~~~we~~TcV~F~e~~~~~~~~~I~f~~~~~gC~S~vG~~~~g~q~isl~~~c~~~g 151 (184)
+++|||.|+++|++.+++.|++||+.|++.|||+|+|++. +.++|.|.. ..||||+||+.+ +.|.|+|+.+|...|
T Consensus 1 ~~~VpY~i~~~~~~~~~~~I~~A~~~w~~~TcIrF~~~~~--~~~~I~f~~-~~Gc~S~vG~~~-~~q~i~l~~~c~~~g 76 (180)
T cd04280 1 NGTVPYVIDGSFDESDRSLILRAMREIESNTCIRFVPRTT--EKDYIRIVK-GSGCWSYVGRVG-GRQVVSLGSGCFSLG 76 (180)
T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHHHhCCcceEEECCC--CCcEEEEEc-CCCcceecCccC-CceeEEeCCCcCcCc
Confidence 4689999999999999999999999999999999999987 578999986 489999999996 899999999999999
Q ss_pred HHHHHHHHhhCCCCCCCCCCCCCceEeecccCC
Q psy2976 152 IVVHELGHVVGFWHEHTRPDRDNHVQIIRENIM 184 (184)
Q Consensus 152 ~i~HElgHALG~~HEh~RpDRD~yV~I~~~NI~ 184 (184)
+++|||||||||.|||+|||||+||+|+|+||.
T Consensus 77 ~v~HE~~HalG~~HEh~R~DRD~yv~i~~~ni~ 109 (180)
T cd04280 77 TIVHELMHALGFYHEQSRPDRDDYVTINWENIQ 109 (180)
T ss_pred hhHHHHHHHhcCcchhcccccCCeEEEeecccC
Confidence 999999999999999999999999999999984
No 7
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=100.00 E-value=1.3e-32 Score=210.65 Aligned_cols=115 Identities=41% Similarity=0.815 Sum_probs=103.9
Q ss_pred CCCcCCCceEEEEc-CCCCHHH-HHHHHHHHHHhhcccceEEEEcCCCCCCcEEEEecCCC-Ccc-cccCCCCCCcceee
Q psy2976 67 ERIWDYGVIPYEID-SNFGGQH-KALFKQAMRHWENFTCVKFVERSSTEHPNYILFTEKPC-GCC-SFVGKRGNGQQAIS 142 (184)
Q Consensus 67 ~~~Wp~g~IpY~~~-~~~~~~~-r~~I~~A~~~we~~TcV~F~e~~~~~~~~~I~f~~~~~-gC~-S~vG~~~~g~q~is 142 (184)
..+|++++|+|+|+ +.+++.+ ++.|++||+.|+..+||+|++++.. ...+|.|..... ||+ +++|+.+ |.+.++
T Consensus 2 ~~~W~~~~v~Y~i~~~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~~-ad~~I~f~~~~~~g~~~a~~g~~~-g~~~~~ 79 (140)
T smart00235 2 SKKWPKGTVPYVIDSSSLSPEEVREAIARAFAEWSDVTCLRFVERTST-ADIYISFGKGDGSGCTLSHAGRPG-GDQHFS 79 (140)
T ss_pred CCcCCCCcEEEEEcCCCCCHHHHHHHHHHHHHHHhcCCeeEEEECCCC-CCeEEEEEECCCCCcceeeeecCC-CceEEE
Confidence 46999999999997 8888876 9999999999999999999998752 567899986443 998 9999985 889999
Q ss_pred cCCCCchHHHHHHHHHHhhCCCCCCCCCCCCCceEeecccC
Q psy2976 143 IGKNCDKFGIVVHELGHVVGFWHEHTRPDRDNHVQIIRENI 183 (184)
Q Consensus 143 l~~~c~~~g~i~HElgHALG~~HEh~RpDRD~yV~I~~~NI 183 (184)
++.+|...++++|||||||||.|||+|||||+||.|+|+||
T Consensus 80 ~~~~~~~~~~~~HEigHaLGl~H~~~~~drd~~~~~~~~~~ 120 (140)
T smart00235 80 LGNGCINTGVAAHELGHALGLYHEQSRSDRDNYMYINYTNI 120 (140)
T ss_pred ccCCcCCcccHHHHHHHHhcCCcCCCCCcccCeEEEehhhh
Confidence 98899999999999999999999999999999999999987
No 8
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.97 E-value=2.6e-31 Score=215.52 Aligned_cols=110 Identities=25% Similarity=0.442 Sum_probs=92.1
Q ss_pred CCcCCCc-eEEEEcCCCCHHHHHHHHHHHHHhhcccceEEEEcCCCCCCcEEEEecCCCCcccccCCCC----CCcceee
Q psy2976 68 RIWDYGV-IPYEIDSNFGGQHKALFKQAMRHWENFTCVKFVERSSTEHPNYILFTEKPCGCCSFVGKRG----NGQQAIS 142 (184)
Q Consensus 68 ~~Wp~g~-IpY~~~~~~~~~~r~~I~~A~~~we~~TcV~F~e~~~~~~~~~I~f~~~~~gC~S~vG~~~----~g~q~is 142 (184)
++||+++ ++|.|.+++++..|+.|++||++|+.+|||+|++++......+|.|.. +.||||+||+.+ .+.++|+
T Consensus 1 ~~W~~~~~~~~~f~~~~~~~~r~~I~~A~~~W~~~t~i~F~~~~~~~adi~I~f~~-~~Gc~S~vG~~~~~~~~~~~t~~ 79 (198)
T cd04327 1 KLWRNGTVLRIAFLGGPDAFLKDKVRAAAREWLPYANLKFKFVTDADADIRISFTP-GDGYWSYVGTDALLIGADAPTMN 79 (198)
T ss_pred CCCCCCCeEEEEeCCCCcHHHHHHHHHHHHHHhhhcCeEEEECCCCCCCEEEEEec-CCCCCCCcCCcccccCCCCceee
Confidence 5899985 557888888888999999999999999999999987643455777764 579999999964 2467888
Q ss_pred cCCC------CchHHHHHHHHHHhhCCCCCCCCCCCCCceEeeccc
Q psy2976 143 IGKN------CDKFGIVVHELGHVVGFWHEHTRPDRDNHVQIIREN 182 (184)
Q Consensus 143 l~~~------c~~~g~i~HElgHALG~~HEh~RpDRD~yV~I~~~N 182 (184)
|+.. |...++|+|||||||||.|||+||||| |.|+|
T Consensus 80 l~~~~~~~~~~~~~~~i~HElgHaLG~~HEh~rpdrd----i~w~~ 121 (198)
T cd04327 80 LGWFTDDTPDPEFSRVVLHEFGHALGFIHEHQSPAAN----IPWDK 121 (198)
T ss_pred eeeecCCCchhhHHHHHHHHHHHHhcCcccccCCCCC----CCcCH
Confidence 8753 566799999999999999999999999 77774
No 9
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.91 E-value=7e-25 Score=171.25 Aligned_cols=103 Identities=23% Similarity=0.343 Sum_probs=85.9
Q ss_pred CceEEEEcCCCCHHHHHHHHHHHHHhhcccceEEEEcCCCCCCcEEEEec-----CCCCcccccCCCC-CCcceeecCCC
Q psy2976 73 GVIPYEIDSNFGGQHKALFKQAMRHWENFTCVKFVERSSTEHPNYILFTE-----KPCGCCSFVGKRG-NGQQAISIGKN 146 (184)
Q Consensus 73 g~IpY~~~~~~~~~~r~~I~~A~~~we~~TcV~F~e~~~~~~~~~I~f~~-----~~~gC~S~vG~~~-~g~q~isl~~~ 146 (184)
..|+|.|+++++...|+.|++||+.|++.+||+|++.+.. ....|.|.. .+.||++++|... .+.+.|.++..
T Consensus 2 ~~I~y~i~~~~~~~~r~~i~~A~~~W~~~~~i~F~e~~~~-~~~di~i~~~~~~~~~~~~~~~~~~~~~~~~g~i~~~~~ 80 (165)
T cd04268 2 KPITYYIDDSVPDKLRAAILDAIEAWNKAFAIGFKNANDV-DPADIRYSVIRWIPYNDGTWSYGPSQVDPLTGEILLARV 80 (165)
T ss_pred CCEEEEEcCCCCHHHHHHHHHHHHHHHHHhCcCceecccc-CccCceEEEEEeecCCCCccccCCccCCCCCccEEeeEE
Confidence 4799999999999999999999999999999999998764 233444432 2578999999862 35678888776
Q ss_pred Cc-----------hHHHHHHHHHHhhCCCCCCCCCCCCCce
Q psy2976 147 CD-----------KFGIVVHELGHVVGFWHEHTRPDRDNHV 176 (184)
Q Consensus 147 c~-----------~~g~i~HElgHALG~~HEh~RpDRD~yV 176 (184)
|. ..++++|||||||||.|+++|||||.|+
T Consensus 81 ~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~~~~~~ 121 (165)
T cd04268 81 YLYSSFVEYSGARLRNTAEHELGHALGLRHNFAASDRDDNV 121 (165)
T ss_pred EEchhHHHHHHHHHHHHHHHHHHHHhcccccCcCCccCCcc
Confidence 64 5799999999999999999999999876
No 10
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=99.79 E-value=6.9e-19 Score=137.44 Aligned_cols=103 Identities=26% Similarity=0.451 Sum_probs=73.1
Q ss_pred CcCCCceEEEEcCCC----CHHHHHHHHHHHHHhhcccceEEEEcCCC-CCCcEEEEecCCCCc-ccccCCCCC------
Q psy2976 69 IWDYGVIPYEIDSNF----GGQHKALFKQAMRHWENFTCVKFVERSST-EHPNYILFTEKPCGC-CSFVGKRGN------ 136 (184)
Q Consensus 69 ~Wp~g~IpY~~~~~~----~~~~r~~I~~A~~~we~~TcV~F~e~~~~-~~~~~I~f~~~~~gC-~S~vG~~~~------ 136 (184)
|||+++|+|+|++.. ....++.|++||+.|+..+||+|++++.. .....|.|.....+| +++.|..+.
T Consensus 1 kW~~~~itY~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~g~l~~a~~ 80 (157)
T cd04278 1 KWSKTNLTYRILNYPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPGGTLAHAFF 80 (157)
T ss_pred CCCCCceeEEEECCCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCccccccccC
Confidence 799999999998654 36788999999999999999999998764 224456666544555 344443220
Q ss_pred -----C------cceeecC---CCCchHHHHHHHHHHhhCCCCCCCCCC
Q psy2976 137 -----G------QQAISIG---KNCDKFGIVVHELGHVVGFWHEHTRPD 171 (184)
Q Consensus 137 -----g------~q~isl~---~~c~~~g~i~HElgHALG~~HEh~RpD 171 (184)
+ ....++. .++....+++|||||||||.|++.+.+
T Consensus 81 p~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~ 129 (157)
T cd04278 81 PGGIGGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHALGLGHSSDPDS 129 (157)
T ss_pred CCCcceeEEECCCcccccCCCCccchHHHHHHHHhccccccCCCCCCcC
Confidence 1 0111221 235678899999999999999987644
No 11
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.70 E-value=9e-17 Score=125.28 Aligned_cols=106 Identities=29% Similarity=0.511 Sum_probs=79.8
Q ss_pred ceEEEEcCC--------CCHHHHHHHHHHHHHhhcccceEEEEcCCC--CCCcEEEEecC----CCCcccccCCCC-CCc
Q psy2976 74 VIPYEIDSN--------FGGQHKALFKQAMRHWENFTCVKFVERSST--EHPNYILFTEK----PCGCCSFVGKRG-NGQ 138 (184)
Q Consensus 74 ~IpY~~~~~--------~~~~~r~~I~~A~~~we~~TcV~F~e~~~~--~~~~~I~f~~~----~~gC~S~vG~~~-~g~ 138 (184)
.|+|.|... ++...++.|+.|++.|+..+||+|++.... .....+.+... +.+|++++|... .+.
T Consensus 2 ~i~~~i~~d~~~~~~~~~~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~~~~~~g~a~~~~~c~~~~ 81 (167)
T cd00203 2 VIPYVVVADDRDVEEENLSAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDFDGGTGGWAYLGRVCDSLR 81 (167)
T ss_pred EEEEEEEecccccchhhhHHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccCCCCceEEEecCCccCCCC
Confidence 577777654 667788999999999999999999998763 22333444321 357889998863 233
Q ss_pred ceeecCCCC----chHHHHHHHHHHhhCCCCCCCCCCCCCceEee
Q psy2976 139 QAISIGKNC----DKFGIVVHELGHVVGFWHEHTRPDRDNHVQII 179 (184)
Q Consensus 139 q~isl~~~c----~~~g~i~HElgHALG~~HEh~RpDRD~yV~I~ 179 (184)
+.+.+...+ ....+++|||||||||.|++.++|||+||++.
T Consensus 82 ~~~~~~~~~~~~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~ 126 (167)
T cd00203 82 GVGVLQDNQSGTKEGAQTIAHELGHALGFYHDHDRKDRDDYPTID 126 (167)
T ss_pred CcEEEecCCcccccchhhHHHHHHHHhCCCccCcCCCCCCCcccc
Confidence 455555443 56789999999999999999999999998873
No 12
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.61 E-value=5.4e-15 Score=115.23 Aligned_cols=97 Identities=29% Similarity=0.391 Sum_probs=64.4
Q ss_pred ceEEEEcCC------CCHHHHHHHHHHHHHhhcccceEEEEcCCC--CCCcEEEEec----C--CCCcccccCCC-----
Q psy2976 74 VIPYEIDSN------FGGQHKALFKQAMRHWENFTCVKFVERSST--EHPNYILFTE----K--PCGCCSFVGKR----- 134 (184)
Q Consensus 74 ~IpY~~~~~------~~~~~r~~I~~A~~~we~~TcV~F~e~~~~--~~~~~I~f~~----~--~~gC~S~vG~~----- 134 (184)
.|+|++++. ...+.++.|++||+.|++.+||+|++.... +....|.|.. . ++.|.+..+..
T Consensus 3 ~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~~~~~~g~~~a~a~~p~~~~~~ 82 (156)
T cd04279 3 PIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPPPVGGAGGGLARAGFPLISDGN 82 (156)
T ss_pred CeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCCCCCCCCCceEEecccccCCCc
Confidence 578888754 345788999999999999999999998652 2334455642 1 22343322211
Q ss_pred --CCCcceeecCC-----CCchHHHHHHHHHHhhCCCCCCCCC
Q psy2976 135 --GNGQQAISIGK-----NCDKFGIVVHELGHVVGFWHEHTRP 170 (184)
Q Consensus 135 --~~g~q~isl~~-----~c~~~g~i~HElgHALG~~HEh~Rp 170 (184)
+...-.++++. .....++++|||||||||.|+++++
T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~ 125 (156)
T cd04279 83 RKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSDRP 125 (156)
T ss_pred ccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCCCc
Confidence 11122333322 1245789999999999999999998
No 13
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=99.57 E-value=3.4e-14 Score=109.43 Aligned_cols=102 Identities=24% Similarity=0.431 Sum_probs=67.2
Q ss_pred CcCCCceEEEEcCCC---C-HHHHHHHHHHHHHhhcccceEEEEcCCCCCCcEEEEecCC-CCcccccCCCCC-------
Q psy2976 69 IWDYGVIPYEIDSNF---G-GQHKALFKQAMRHWENFTCVKFVERSSTEHPNYILFTEKP-CGCCSFVGKRGN------- 136 (184)
Q Consensus 69 ~Wp~g~IpY~~~~~~---~-~~~r~~I~~A~~~we~~TcV~F~e~~~~~~~~~I~f~~~~-~gC~S~vG~~~~------- 136 (184)
||+..+|.|+|.... + .+.+++|++||+.|+..++|+|++.......-.|.|.... ..+..+-|..+.
T Consensus 1 ~W~~~~ity~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~~~adi~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (154)
T PF00413_consen 1 KWPKKTITYSISNTTPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSDGNADIRISFGSNNHGDGYSFDGSGGTLAHAYFP 80 (154)
T ss_dssp SSSSSEEEEEESSBCTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESSSSCSEEEEEESSSSSSSS-CSSSSSESEEEEES
T ss_pred CCCCCcEEEEEECCCCCCCHHHHHHHHHHHHHHHHhcCCceEEeccCCCcceeeeeeccccCcccccccceeeeeccccc
Confidence 799999999997532 3 4588899999999999999999999863234456665322 223222222210
Q ss_pred C---cceeecCC--------CC-chHHHHHHHHHHhhCCCCCCCCC
Q psy2976 137 G---QQAISIGK--------NC-DKFGIVVHELGHVVGFWHEHTRP 170 (184)
Q Consensus 137 g---~q~isl~~--------~c-~~~g~i~HElgHALG~~HEh~Rp 170 (184)
+ .-.+.+.. .. ....+++|||||||||.|++.+.
T Consensus 81 ~~~~~~~i~~~~~~~~~~~~~~~~~~~v~~HEiGHaLGL~H~~~~~ 126 (154)
T PF00413_consen 81 NNIVSGDIHFNDDESWTIDDSGNDLQSVAIHEIGHALGLDHSNDPN 126 (154)
T ss_dssp SSTTTTEEEEETTSHEESSSSSEEHHHHHHHHHHHHTTBESSSSTT
T ss_pred cccccccccccccccchhhhhhhhhhhhhhhccccccCcCcCCCcc
Confidence 0 11222221 11 35689999999999999997653
No 14
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.56 E-value=3.1e-14 Score=113.90 Aligned_cols=93 Identities=20% Similarity=0.364 Sum_probs=66.0
Q ss_pred CCCHHHHHHHHHHHHHhhcccceEEEEcCCCCCCcEEEEecCC-----CCcccccCCCC-----CCcceeecC-------
Q psy2976 82 NFGGQHKALFKQAMRHWENFTCVKFVERSSTEHPNYILFTEKP-----CGCCSFVGKRG-----NGQQAISIG------- 144 (184)
Q Consensus 82 ~~~~~~r~~I~~A~~~we~~TcV~F~e~~~~~~~~~I~f~~~~-----~gC~S~vG~~~-----~g~q~isl~------- 144 (184)
.+++.+|+.|++||+.|+..+||+|++++.. ....|.|.... .+++++.+... .|.-.++..
T Consensus 30 ~~~~~~~~~i~~A~~~w~~~~~l~F~e~~~~-~~adI~i~~~~~~~~~~~g~a~~p~~~~~~~~~g~i~~~~~~~~~~~~ 108 (186)
T cd04277 30 ALSAAQQAAARDALEAWEDVADIDFVEVSDN-SGADIRFGNSSDPDGNTAGYAYYPGSGSGTAYGGDIWFNSSYDTNSDS 108 (186)
T ss_pred cCCHHHHHHHHHHHHHHHhhcCceeEECCCC-CcceEEEEeccCCCCCccEEEECCCCCccccccceeEEecCcccccCC
Confidence 5677899999999999999999999999875 33456665322 23444444321 122122221
Q ss_pred CCCchHHHHHHHHHHhhCCCCCCCCCCCCCc
Q psy2976 145 KNCDKFGIVVHELGHVVGFWHEHTRPDRDNH 175 (184)
Q Consensus 145 ~~c~~~g~i~HElgHALG~~HEh~RpDRD~y 175 (184)
.++....+++|||||||||.|++..++.+.+
T Consensus 109 ~g~~~~~t~~HEiGHaLGL~H~~~~~~~~~~ 139 (186)
T cd04277 109 PGSYGYQTIIHEIGHALGLEHPGDYNGGDPV 139 (186)
T ss_pred CChhhHHHHHHHHHHHhcCCCCCcCCCCCCC
Confidence 2567789999999999999999999887654
No 15
>KOG1565|consensus
Probab=98.91 E-value=2.9e-09 Score=96.77 Aligned_cols=102 Identities=24% Similarity=0.418 Sum_probs=71.3
Q ss_pred CCCcCCCceEEEEcCC---C-CHHHHHHHHHHHHHhhcccceEEEEcCC-CCCCcEEEEecCCCCc-ccccCCCCC----
Q psy2976 67 ERIWDYGVIPYEIDSN---F-GGQHKALFKQAMRHWENFTCVKFVERSS-TEHPNYILFTEKPCGC-CSFVGKRGN---- 136 (184)
Q Consensus 67 ~~~Wp~g~IpY~~~~~---~-~~~~r~~I~~A~~~we~~TcV~F~e~~~-~~~~~~I~f~~~~~gC-~S~vG~~~~---- 136 (184)
..+|++..++|+|.+. + ..+.++++.+|+..|+++|.++|.|... ...+..|.|.....|+ +.+-|..+.
T Consensus 103 ~~kW~k~~lT~ri~n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~~h~d~~PFDG~~g~laHA 182 (469)
T KOG1565|consen 103 KPKWNKEHLTYRIKNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADIRISFFPGDHGDGFPFDGPGGVLAHA 182 (469)
T ss_pred cCcccccccceeccccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCceeeeeeccCCCCCCcccCCCCceecc
Confidence 5699999999999853 2 2468899999999999999999999987 3244467776433333 123333220
Q ss_pred --------------CcceeecC--CCCchHHHHHHHHHHhhCCCCCCC
Q psy2976 137 --------------GQQAISIG--KNCDKFGIVVHELGHVVGFWHEHT 168 (184)
Q Consensus 137 --------------g~q~isl~--~~c~~~g~i~HElgHALG~~HEh~ 168 (184)
......++ .+-+.+.|++|||||||||.|...
T Consensus 183 f~Pg~~~~G~~hfD~dE~Wt~~~~~g~~l~~Va~HEiGH~LGL~HS~~ 230 (469)
T KOG1565|consen 183 FFPGPGIGGDLHFDKDETWTYGDSNGVDLFLVAAHEIGHALGLGHSSD 230 (469)
T ss_pred cCCCCCCCCccccCcccceeccCCccchhHHHhhhhcccccccCCCCC
Confidence 11233332 244678999999999999999943
No 16
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=98.80 E-value=3.8e-08 Score=80.72 Aligned_cols=127 Identities=18% Similarity=0.303 Sum_probs=72.5
Q ss_pred CceEEccCCcccccccCCCCCcccccCCCCCcC--CCceEEEEcC----CCCHHHHHHHHHHHHHhhcc-cceEEEEc--
Q psy2976 39 STIVVSQTSLPEAYTVHHRDTRAATARKERIWD--YGVIPYEIDS----NFGGQHKALFKQAMRHWENF-TCVKFVER-- 109 (184)
Q Consensus 39 ~~i~~~~~~~~~~~~~~~R~kR~a~~~~~~~Wp--~g~IpY~~~~----~~~~~~r~~I~~A~~~we~~-TcV~F~e~-- 109 (184)
+||.++.+.+.++.....-..++- .+..+|. .++|...+.. .++..++..+.+||+.|+.. ..|+|.-.
T Consensus 5 gDi~~s~~~l~~~~~~~~~~~eqY--rTtnlV~~~~~~I~i~~~~~~~~~l~s~~~~al~~AI~~yNalgl~i~F~~t~g 82 (211)
T PF12388_consen 5 GDIFVSKKQLREMLQSDDGTSEQY--RTTNLVGSSPRTITIIGYTGGSQSLNSAWRTALDEAINNYNALGLSITFTLTFG 82 (211)
T ss_pred cceEEcHHHHhhhhccCCcchhhe--eeeeeecCCCCEEEEEeCCCccccccHHHHHHHHHHHHHHHhhCCceEEEEecC
Confidence 688898887776652221112221 3355673 3578777776 67778899999999999995 56899521
Q ss_pred CCCCCCcEEEEecCC---CCcccccCCC---CCCcceeecCC---C-C-chHHHHHHHHHHhhCCCCCC
Q psy2976 110 SSTEHPNYILFTEKP---CGCCSFVGKR---GNGQQAISIGK---N-C-DKFGIVVHELGHVVGFWHEH 167 (184)
Q Consensus 110 ~~~~~~~~I~f~~~~---~gC~S~vG~~---~~g~q~isl~~---~-c-~~~g~i~HElgHALG~~HEh 167 (184)
......+.+++.... .|-...-|.. |...-.|.+.. . - ....+++|||||+|||.|+-
T Consensus 83 ~~~~~~di~v~~~~~~~~~G~ggsAGFP~s~G~P~~~I~I~~~~~~~~~~~~hvi~HEiGH~IGfRHTD 151 (211)
T PF12388_consen 83 TNYQNADIIVYSNSSNNPSGAGGSAGFPTSNGNPYKFIQIYGLSNYSVNVIEHVITHEIGHCIGFRHTD 151 (211)
T ss_pred CCcCCCceEEEeccCCCCCCcceeccCCCCCCCCCceEEEEecCCCchhHHHHHHHHHhhhhccccccC
Confidence 222123444543211 1111112222 11112333311 1 1 13569999999999999974
No 17
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=98.58 E-value=6.5e-07 Score=72.98 Aligned_cols=99 Identities=18% Similarity=0.346 Sum_probs=59.4
Q ss_pred CCCceEEEEcCCCCHHHHHHHHHHHHHhhcc------cceEEEEcCCCC---CCc-E--EEEecCCCCcccccCC-----
Q psy2976 71 DYGVIPYEIDSNFGGQHKALFKQAMRHWENF------TCVKFVERSSTE---HPN-Y--ILFTEKPCGCCSFVGK----- 133 (184)
Q Consensus 71 p~g~IpY~~~~~~~~~~r~~I~~A~~~we~~------TcV~F~e~~~~~---~~~-~--I~f~~~~~gC~S~vG~----- 133 (184)
|+..|.|+|++.++.+.++.|++|++.|++. ..+.+++..+.+ ... | |.+.....+-+++ |.
T Consensus 6 pk~pItyyI~~~~p~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~~~~~~~~-gps~~dP 84 (197)
T cd04276 6 PKEPIVYYLDNTFPEKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSPNGGWAY-GPSVVDP 84 (197)
T ss_pred CCCCEEEEecCCCcHHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEecCCCccee-cccccCC
Confidence 4568999999888888999999999999986 455455443321 111 2 4443322122211 21
Q ss_pred -CCC-CcceeecCCCC--------------chHHHHHHHHHHhhCCCCCCCCC
Q psy2976 134 -RGN-GQQAISIGKNC--------------DKFGIVVHELGHVVGFWHEHTRP 170 (184)
Q Consensus 134 -~~~-g~q~isl~~~c--------------~~~g~i~HElgHALG~~HEh~Rp 170 (184)
.|. ..-.|.+..+- ....+++||+||||||.|.....
T Consensus 85 rTGeIl~a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lGl~hn~~~s 137 (197)
T cd04276 85 RTGEILKADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLGLRHNFKAS 137 (197)
T ss_pred CCCCeEEEEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhcCccccccc
Confidence 110 01244444321 12358999999999999976543
No 18
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=98.29 E-value=1.1e-06 Score=66.71 Aligned_cols=86 Identities=26% Similarity=0.417 Sum_probs=49.8
Q ss_pred ceEEEEcCCCCHHHHHHHHHHHHHhhc-ccceEEEEcCCCCCCcEEEEecCCCCcccccCCCCCCcceeecCC----CCc
Q psy2976 74 VIPYEIDSNFGGQHKALFKQAMRHWEN-FTCVKFVERSSTEHPNYILFTEKPCGCCSFVGKRGNGQQAISIGK----NCD 148 (184)
Q Consensus 74 ~IpY~~~~~~~~~~r~~I~~A~~~we~-~TcV~F~e~~~~~~~~~I~f~~~~~gC~S~vG~~~~g~q~isl~~----~c~ 148 (184)
+|+|.-.. -++.+..|.+|.+.|+. +++|+|++.... + +.+......-+|+-...+.|...|-|+. +-.
T Consensus 3 tv~Ydas~--A~~f~~~i~~aa~iWN~sV~NV~L~~~s~a---~-~~~~~~~~~~~sya~~~g~G~G~I~l~~~~~qgy~ 76 (132)
T PF02031_consen 3 TVYYDASR--APEFRSAIAQAAQIWNSSVSNVRLVEGSSA---D-IRYYEGNDPRGSYASTDGLGSGYIFLDYQQNQGYN 76 (132)
T ss_dssp EEEEEEEE---GGGHHHHHHHHHHHHHH-SSEEEEE-SS----S-EEEEEE--TT--EEEE-SSS-EEEEEEHHHHHHS-
T ss_pred EEEEeCCC--CchhHHHHHHHHHHHhcccCceEEeecCCC---c-EEEEEecCCCCcccccCCCCcEEEEechHHhhCCc
Confidence 45565442 34567789999999997 689999998773 3 2232112334555555554556777764 234
Q ss_pred hHHHHHHHHHHhhCCCC
Q psy2976 149 KFGIVVHELGHVVGFWH 165 (184)
Q Consensus 149 ~~g~i~HElgHALG~~H 165 (184)
...+++|||||.|||.-
T Consensus 77 ~~RIaaHE~GHiLGLPD 93 (132)
T PF02031_consen 77 STRIAAHELGHILGLPD 93 (132)
T ss_dssp HHHHHHHHHHHHHT---
T ss_pred cceeeeehhccccCCCC
Confidence 56799999999999963
No 19
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=97.30 E-value=0.00038 Score=57.09 Aligned_cols=82 Identities=23% Similarity=0.389 Sum_probs=48.1
Q ss_pred HHHHHHHHHHHHhhcccceEEEEcCCCCCCcEEEEec-CCCC-cccccCCCCC------------C----c--ceeecCC
Q psy2976 86 QHKALFKQAMRHWENFTCVKFVERSSTEHPNYILFTE-KPCG-CCSFVGKRGN------------G----Q--QAISIGK 145 (184)
Q Consensus 86 ~~r~~I~~A~~~we~~TcV~F~e~~~~~~~~~I~f~~-~~~g-C~S~vG~~~~------------g----~--q~isl~~ 145 (184)
..++.+.+|++.|++.-++..++...+ .+ |++.. ...| .|...|+... | . -.|.+.+
T Consensus 104 ~wq~a~~tava~wa~~fpl~ive~~ee--aD-Itie~~n~pgtg~~~~grArtal~~~efl~~~~~~~~lnhr~~i~~~p 180 (236)
T COG5549 104 RWQGAYLTAVAGWAKTFPLIIVERFEE--AD-ITIEVGNPPGTGWRQYGRARTALIAYEFLGHALGLGHLNHRGDIMYPP 180 (236)
T ss_pred hHHHHHHHHHHHHHHhCCceeeeccee--ee-EEEEecCCCCCcccccchHHHHHHHHHHhhhccCcccccccccccCCc
Confidence 457889999999999877766776663 34 44432 1222 2222222110 0 0 1223333
Q ss_pred CC---chHHHHHHHHHHhhCCC-CCCCCC
Q psy2976 146 NC---DKFGIVVHELGHVVGFW-HEHTRP 170 (184)
Q Consensus 146 ~c---~~~g~i~HElgHALG~~-HEh~Rp 170 (184)
+- ...+++-||+|||||++ |.---.
T Consensus 181 g~~~e~L~~tarhElGhaLgi~ghsd~ks 209 (236)
T COG5549 181 GELRENLNPTARHELGHALGIWGHSDLKS 209 (236)
T ss_pred ccchhhhhHHHHHhhcchheecccccchh
Confidence 31 24689999999999999 865433
No 20
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=96.74 E-value=0.0008 Score=54.98 Aligned_cols=20 Identities=30% Similarity=0.713 Sum_probs=17.7
Q ss_pred hHHHHHHHHHHhhCCCCCCC
Q psy2976 149 KFGIVVHELGHVVGFWHEHT 168 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HEh~ 168 (184)
...+++|||||+||+.|+..
T Consensus 137 ~~~~~aHEiGH~lGl~H~~~ 156 (206)
T PF13583_consen 137 GYQTFAHEIGHNLGLRHDFD 156 (206)
T ss_pred cchHHHHHHHHHhcCCCCcc
Confidence 34679999999999999986
No 21
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=96.74 E-value=0.00073 Score=49.99 Aligned_cols=18 Identities=33% Similarity=0.685 Sum_probs=12.7
Q ss_pred hHHHHHHHHHHhhCCCCC
Q psy2976 149 KFGIVVHELGHVVGFWHE 166 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HE 166 (184)
...+++|||||.||+.|.
T Consensus 107 ~~~~~~HEiGH~lGl~Hd 124 (124)
T PF13582_consen 107 GVDTFAHEIGHNLGLNHD 124 (124)
T ss_dssp STTHHHHHHHHHTT----
T ss_pred cceEeeehhhHhcCCCCC
Confidence 458999999999999994
No 22
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=96.56 E-value=0.0016 Score=51.96 Aligned_cols=21 Identities=33% Similarity=0.417 Sum_probs=18.2
Q ss_pred hHHHHHHHHHHhhCCCCCCCC
Q psy2976 149 KFGIVVHELGHVVGFWHEHTR 169 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HEh~R 169 (184)
...+++||+||.||+.|....
T Consensus 133 ~~~~~aHElGH~lG~~HD~~~ 153 (192)
T cd04267 133 TALTMAHELGHNLGAEHDGGD 153 (192)
T ss_pred ehhhhhhhHHhhcCCcCCCCC
Confidence 356999999999999999863
No 23
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=96.47 E-value=0.0016 Score=51.02 Aligned_cols=21 Identities=29% Similarity=0.523 Sum_probs=14.0
Q ss_pred HHHHHHHHHHhhCCCCCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHEHTRP 170 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh~Rp 170 (184)
..+++||+||-|||.|..+-.
T Consensus 70 g~TltHEvGH~LGL~HtF~~~ 90 (154)
T PF05572_consen 70 GKTLTHEVGHWLGLYHTFGGG 90 (154)
T ss_dssp SHHHHHHHHHHTT---TT---
T ss_pred ccchhhhhhhhhcccccccCC
Confidence 579999999999999998654
No 24
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=96.19 E-value=0.0028 Score=50.58 Aligned_cols=23 Identities=26% Similarity=0.345 Sum_probs=15.1
Q ss_pred chHHHHHHHHHHhhCCCCCCCCC
Q psy2976 148 DKFGIVVHELGHVVGFWHEHTRP 170 (184)
Q Consensus 148 ~~~g~i~HElgHALG~~HEh~Rp 170 (184)
....+++|||||.||..|.+.-+
T Consensus 141 ~~~~~~AHEiGH~lGa~HD~~~~ 163 (196)
T PF13688_consen 141 NGAITFAHEIGHNLGAPHDGDYS 163 (196)
T ss_dssp HHHHHHHHHHHHHTT-----SSS
T ss_pred ceehhhHHhHHHhcCCCCCCCCC
Confidence 34679999999999999988864
No 25
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=96.07 E-value=0.02 Score=49.92 Aligned_cols=81 Identities=22% Similarity=0.352 Sum_probs=44.7
Q ss_pred CCCHHHHHHHHHHHHHhhccc-ce-EEEEcCCCCCCcEEEEecCCCCc-ccccCCCCCCcce-eec-CCCCchHHHHHHH
Q psy2976 82 NFGGQHKALFKQAMRHWENFT-CV-KFVERSSTEHPNYILFTEKPCGC-CSFVGKRGNGQQA-ISI-GKNCDKFGIVVHE 156 (184)
Q Consensus 82 ~~~~~~r~~I~~A~~~we~~T-cV-~F~e~~~~~~~~~I~f~~~~~gC-~S~vG~~~~g~q~-isl-~~~c~~~g~i~HE 156 (184)
..+....-.+..+.+.+.+.. -+ .| .. ....|.+...+..| |.-+|..+ |.+. +.. +.++...++++||
T Consensus 84 ~C~~~d~~~~~~~Ad~~a~~~lG~~~~---s~--y~h~vyvlP~~~~C~w~Gla~v~-G~~~~~~~~~~~~~~~~~~~HE 157 (314)
T PF05548_consen 84 SCSTDDWYGWADAADAAARAQLGVNAF---SS--YTHRVYVLPPGFACGWAGLATVP-GSQSWLWISGYGVQDWATIMHE 157 (314)
T ss_pred ccchHHHHHHHHHHHHHHHhhcCcccc---cc--cceEEEEcCCCCCCCceEEeecC-CcceeeeecCcccccHHHHHHH
Confidence 333233334566666666554 33 12 11 23345554443346 33333333 4442 222 3356778999999
Q ss_pred HHHhhCCCCCCC
Q psy2976 157 LGHVVGFWHEHT 168 (184)
Q Consensus 157 lgHALG~~HEh~ 168 (184)
+||-|||.|...
T Consensus 158 lgHN~GL~Ha~~ 169 (314)
T PF05548_consen 158 LGHNLGLWHAGR 169 (314)
T ss_pred hhhhccccccCC
Confidence 999999999974
No 26
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=96.02 E-value=0.0042 Score=49.32 Aligned_cols=21 Identities=33% Similarity=0.545 Sum_probs=15.7
Q ss_pred hHHHHHHHHHHhhCCCCCCCC
Q psy2976 149 KFGIVVHELGHVVGFWHEHTR 169 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HEh~R 169 (184)
...+++|||||.||..|...-
T Consensus 111 ~~~~~aHElGH~lGa~Hd~~~ 131 (173)
T PF13574_consen 111 GIDTFAHELGHQLGAPHDFDN 131 (173)
T ss_dssp HHHHHHHHHHHHHT---SSSS
T ss_pred eeeeehhhhHhhcCCCCCCCC
Confidence 467899999999999999874
No 27
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=95.85 E-value=0.0072 Score=49.52 Aligned_cols=21 Identities=29% Similarity=0.428 Sum_probs=18.1
Q ss_pred hHHHHHHHHHHhhCCCCCCCC
Q psy2976 149 KFGIVVHELGHVVGFWHEHTR 169 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HEh~R 169 (184)
...+++|||||.||+.|....
T Consensus 145 ~~~~~AHElGH~lG~~HD~~~ 165 (220)
T cd04272 145 GVYTMTHELAHLLGAPHDGSP 165 (220)
T ss_pred cHHHHHHHHHHHhCCCCCCCC
Confidence 367999999999999998654
No 28
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=95.73 E-value=0.006 Score=48.75 Aligned_cols=19 Identities=37% Similarity=0.583 Sum_probs=17.0
Q ss_pred HHHHHHHHHHhhCCCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHEHT 168 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh~ 168 (184)
..+++|||||.||+.|...
T Consensus 132 a~~~AHElGH~lG~~HD~~ 150 (194)
T cd04269 132 AVTMAHELGHNLGMEHDDG 150 (194)
T ss_pred HHHHHHHHHhhcCCCcCCC
Confidence 5799999999999999764
No 29
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=95.29 E-value=0.0094 Score=49.90 Aligned_cols=19 Identities=37% Similarity=0.436 Sum_probs=16.4
Q ss_pred HHHHHHHHHHhhCCCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHEHT 168 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh~ 168 (184)
..+++|||||.||+.|--.
T Consensus 168 a~t~AHElGHnlGm~HD~~ 186 (244)
T cd04270 168 DLVTAHELGHNFGSPHDPD 186 (244)
T ss_pred HHHHHHHHHHhcCCCCCCC
Confidence 4789999999999999554
No 30
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=95.26 E-value=0.0018 Score=53.84 Aligned_cols=22 Identities=27% Similarity=0.477 Sum_probs=19.0
Q ss_pred hHHHHHHHHHHhhCCCCCCCCC
Q psy2976 149 KFGIVVHELGHVVGFWHEHTRP 170 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HEh~Rp 170 (184)
...+++||+||.|||+|..+--
T Consensus 137 ~g~t~~HEvGH~lGL~HtF~~~ 158 (225)
T cd04275 137 LGDTATHEVGHWLGLYHTFQGG 158 (225)
T ss_pred ccceeEEeccceeeeeeeecCC
Confidence 3469999999999999999764
No 31
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=95.19 E-value=0.0073 Score=50.17 Aligned_cols=18 Identities=39% Similarity=0.722 Sum_probs=15.9
Q ss_pred HHHHHHHHHhhCCCCCCC
Q psy2976 151 GIVVHELGHVVGFWHEHT 168 (184)
Q Consensus 151 g~i~HElgHALG~~HEh~ 168 (184)
.+++|||||.||+.|--.
T Consensus 147 ~t~AHElGHnLGm~HD~~ 164 (228)
T cd04271 147 QVFAHEIGHTFGAVHDCT 164 (228)
T ss_pred eehhhhhhhhcCCCCCCC
Confidence 589999999999999754
No 32
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=95.13 E-value=0.006 Score=49.54 Aligned_cols=20 Identities=40% Similarity=0.619 Sum_probs=17.6
Q ss_pred hHHHHHHHHHHhhCCCCCCC
Q psy2976 149 KFGIVVHELGHVVGFWHEHT 168 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HEh~ 168 (184)
...+++|||||.||+.|...
T Consensus 140 ~a~~~aHElGH~LG~~HD~~ 159 (207)
T cd04273 140 SAFTIAHELGHVLGMPHDGD 159 (207)
T ss_pred eEEeeeeechhhcCCCCCCC
Confidence 34689999999999999986
No 33
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=95.09 E-value=0.01 Score=47.54 Aligned_cols=20 Identities=40% Similarity=0.635 Sum_probs=16.8
Q ss_pred HHHHHHHHHHhhCCCCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHEHTR 169 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh~R 169 (184)
+--++||+||.|||.|=-++
T Consensus 125 ~KEv~HElGH~~GL~HC~N~ 144 (181)
T COG1913 125 VKEVLHELGHLLGLSHCPNP 144 (181)
T ss_pred HHHHHHHhhhhcCcccCCCC
Confidence 45699999999999997654
No 34
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=95.06 E-value=0.021 Score=45.88 Aligned_cols=19 Identities=42% Similarity=0.665 Sum_probs=14.4
Q ss_pred HHHHHHHHHHhhCCCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHEHT 168 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh~ 168 (184)
..+++|||||.||+.|...
T Consensus 132 a~~~AHelGH~lGm~HD~~ 150 (199)
T PF01421_consen 132 AVIIAHELGHNLGMPHDGD 150 (199)
T ss_dssp HHHHHHHHHHHTT---TTT
T ss_pred HHHHHHHHHHhcCCCCCCC
Confidence 4688999999999999988
No 35
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=94.71 E-value=0.017 Score=46.40 Aligned_cols=16 Identities=50% Similarity=0.727 Sum_probs=14.1
Q ss_pred HHHHHHHHHhhCCCCC
Q psy2976 151 GIVVHELGHVVGFWHE 166 (184)
Q Consensus 151 g~i~HElgHALG~~HE 166 (184)
..++||+||.|||.|=
T Consensus 127 k~~~HElGH~lGL~HC 142 (179)
T PRK13267 127 KEVTHELGHTLGLEHC 142 (179)
T ss_pred HHHHHHHHHHcCCccC
Confidence 4589999999999993
No 36
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=94.44 E-value=0.021 Score=46.58 Aligned_cols=19 Identities=42% Similarity=0.602 Sum_probs=12.7
Q ss_pred HHHHHHHHHHhhCCCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHEHT 168 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh~ 168 (184)
.-.++||+||++||.|=..
T Consensus 146 ~Kea~HElGH~~GL~HC~~ 164 (194)
T PF07998_consen 146 CKEAVHELGHLFGLDHCEN 164 (194)
T ss_dssp HHHHHHHHHHHTT----SS
T ss_pred HHHHHHHHHHHcCCcCCCC
Confidence 4689999999999999543
No 37
>PF11350 DUF3152: Protein of unknown function (DUF3152); InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=94.29 E-value=0.02 Score=46.87 Aligned_cols=95 Identities=21% Similarity=0.385 Sum_probs=54.6
Q ss_pred CceEEEEc--CCCC-------HHHHHHHHHHH---HHhhcccceEEEEcCCCCCCc-EEEEecC---CCCcc-ccc-CC-
Q psy2976 73 GVIPYEID--SNFG-------GQHKALFKQAM---RHWENFTCVKFVERSSTEHPN-YILFTEK---PCGCC-SFV-GK- 133 (184)
Q Consensus 73 g~IpY~~~--~~~~-------~~~r~~I~~A~---~~we~~TcV~F~e~~~~~~~~-~I~f~~~---~~gC~-S~v-G~- 133 (184)
..+.|++. .++. .+.-..|.+.+ +-|....-++|+.+.... .+ .|.+... ..-|. ..+ |.
T Consensus 29 ~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~~~-~Df~I~Lasp~T~~~lC~g~~~~~e~ 107 (203)
T PF11350_consen 29 RLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDSGA-PDFRISLASPGTTDRLCAGLDTSGET 107 (203)
T ss_pred ceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCCCC-CCEEEEECCcchhhhhccCcCcCcee
Confidence 37888875 3332 23344555555 378888889999998863 45 3666532 23461 111 11
Q ss_pred ---CC-CCcceeecCCC------------CchHHHHHHHHHHhhCCCCCCC
Q psy2976 134 ---RG-NGQQAISIGKN------------CDKFGIVVHELGHVVGFWHEHT 168 (184)
Q Consensus 134 ---~~-~g~q~isl~~~------------c~~~g~i~HElgHALG~~HEh~ 168 (184)
.. .+.=.||+.++ -...-+|.||.||+||..||-.
T Consensus 108 SC~~~~g~~VvIN~~RWv~Ga~~f~gdl~~YRqYvINHEVGH~LGh~H~~C 158 (203)
T PF11350_consen 108 SCRNPAGGRVVINLARWVRGAPAFGGDLASYRQYVINHEVGHALGHGHEPC 158 (203)
T ss_pred EeecCCCCeEEEehHHhhccCcccCCcHHHHHHHhhhhhhhhhcccCCCcC
Confidence 00 12223455432 1134689999999999988853
No 38
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=93.48 E-value=0.017 Score=49.53 Aligned_cols=21 Identities=33% Similarity=0.569 Sum_probs=18.1
Q ss_pred hHHHHHHHHHHhhCCCCCCCC
Q psy2976 149 KFGIVVHELGHVVGFWHEHTR 169 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HEh~R 169 (184)
..|+++||+||+|||.+.-..
T Consensus 165 ~igv~~HE~gH~lGLPDlY~~ 185 (286)
T TIGR03296 165 GVGVIAHELGHDLGLPDLYDT 185 (286)
T ss_pred ceeeeehhhhcccCCCccccC
Confidence 479999999999999887654
No 39
>PF12044 Metallopep: Putative peptidase family; InterPro: IPR021917 This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length.
Probab=93.08 E-value=0.063 Score=48.65 Aligned_cols=30 Identities=23% Similarity=0.398 Sum_probs=21.6
Q ss_pred hHHHHHHHHHHhhCCCCCCCCCCCCCceEe
Q psy2976 149 KFGIVVHELGHVVGFWHEHTRPDRDNHVQI 178 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HEh~RpDRD~yV~I 178 (184)
..|.++||+||+||+.|.-+=-=-.+|+.+
T Consensus 316 ~iGA~lHEiGH~fg~pH~~~GIM~Rdy~~~ 345 (423)
T PF12044_consen 316 GIGAFLHEIGHLFGCPHQEDGIMLRDYDRL 345 (423)
T ss_pred hhHHHHHHHHHhcCCCCCCCCeeeCCchhh
Confidence 468899999999999998653333335444
No 40
>PF10462 Peptidase_M66: Peptidase M66; InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=92.68 E-value=0.063 Score=46.65 Aligned_cols=17 Identities=41% Similarity=0.475 Sum_probs=13.6
Q ss_pred hHHHHHHHHHHhhCCCC
Q psy2976 149 KFGIVVHELGHVVGFWH 165 (184)
Q Consensus 149 ~~g~i~HElgHALG~~H 165 (184)
..++.+||+||++||.|
T Consensus 193 ~~~~f~HE~GH~~GL~H 209 (305)
T PF10462_consen 193 YGNEFSHELGHNFGLGH 209 (305)
T ss_dssp SHHHHHHHHHHTTT--S
T ss_pred ccceeehhhhhhcCCCC
Confidence 46799999999999999
No 41
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=92.43 E-value=0.074 Score=49.29 Aligned_cols=17 Identities=41% Similarity=0.774 Sum_probs=14.0
Q ss_pred hHHHHHHHHHHhhCCCC
Q psy2976 149 KFGIVVHELGHVVGFWH 165 (184)
Q Consensus 149 ~~g~i~HElgHALG~~H 165 (184)
.+.+++|||+|||||.-
T Consensus 210 ~~~~~~HEi~HaLGFs~ 226 (521)
T PF01457_consen 210 FFRTVIHEIAHALGFSS 226 (521)
T ss_dssp HHHHHHHHHHHHTT-SH
T ss_pred ccceeeeeeeeeeeecc
Confidence 46799999999999965
No 42
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=91.72 E-value=0.44 Score=42.14 Aligned_cols=21 Identities=29% Similarity=0.423 Sum_probs=15.6
Q ss_pred ecCCCCchHHHHHHHHHHhhC
Q psy2976 142 SIGKNCDKFGIVVHELGHVVG 162 (184)
Q Consensus 142 sl~~~c~~~g~i~HElgHALG 162 (184)
++......+-+++||+|||+=
T Consensus 215 n~~~~~~~v~tl~HE~GHa~h 235 (427)
T cd06459 215 NFNGTLDDVFTLAHELGHAFH 235 (427)
T ss_pred cCCCChhhHHHHHHHhhHHHH
Confidence 334445668899999999973
No 43
>PTZ00337 surface protease GP63; Provisional
Probab=91.50 E-value=0.11 Score=48.84 Aligned_cols=18 Identities=39% Similarity=0.791 Sum_probs=15.3
Q ss_pred chHHHHHHHHHHhhCCCC
Q psy2976 148 DKFGIVVHELGHVVGFWH 165 (184)
Q Consensus 148 ~~~g~i~HElgHALG~~H 165 (184)
..+.+++|||+|||||.-
T Consensus 228 ~~~~v~~HEi~HALGFs~ 245 (567)
T PTZ00337 228 GDVRVAAHELGHALGFVR 245 (567)
T ss_pred HHHHHHHHHHHHHHccCH
Confidence 357899999999999943
No 44
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=91.45 E-value=0.45 Score=44.48 Aligned_cols=22 Identities=23% Similarity=0.277 Sum_probs=15.8
Q ss_pred ecCCCCchHHHHHHHHHHhhCC
Q psy2976 142 SIGKNCDKFGIVVHELGHVVGF 163 (184)
Q Consensus 142 sl~~~c~~~g~i~HElgHALG~ 163 (184)
|....-..+.|++||||||+=.
T Consensus 371 N~~~~~~dv~TLaHElGHa~H~ 392 (591)
T TIGR00181 371 NWDGTLNSVFTLAHELGHSMHS 392 (591)
T ss_pred ecCCCcchHHHHHHHhhhHHHH
Confidence 3333345688999999999843
No 45
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=91.11 E-value=0.35 Score=31.91 Aligned_cols=41 Identities=22% Similarity=0.340 Sum_probs=33.2
Q ss_pred ceeecCC-CCchHHHHHHHHHHhhCCCCCCCCCCCCCceEee
Q psy2976 139 QAISIGK-NCDKFGIVVHELGHVVGFWHEHTRPDRDNHVQII 179 (184)
Q Consensus 139 q~isl~~-~c~~~g~i~HElgHALG~~HEh~RpDRD~yV~I~ 179 (184)
..+.+.+ =+.....++|++.+.+||.|+..-..-+.||.|.
T Consensus 18 ~~l~F~p~ls~~eR~~vH~lA~~~gL~s~S~G~g~~R~v~v~ 59 (60)
T cd02641 18 TELEFPPTLSSHDRLLVHELAEELGLRHESTGEGSDRVITVS 59 (60)
T ss_pred CcEECCCCCCHHHHHHHHHHHHHcCCceEeeCCCCceEEEee
Confidence 4556665 2455689999999999999999988888898874
No 46
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=90.72 E-value=0.57 Score=43.87 Aligned_cols=22 Identities=23% Similarity=0.318 Sum_probs=15.9
Q ss_pred ecCCCCchHHHHHHHHHHhhCC
Q psy2976 142 SIGKNCDKFGIVVHELGHVVGF 163 (184)
Q Consensus 142 sl~~~c~~~g~i~HElgHALG~ 163 (184)
++......+.|++||+|||+=.
T Consensus 368 N~~~~~~~v~TL~HE~GHa~H~ 389 (587)
T TIGR02290 368 NYDGSRRDVSTLAHELGHAYHS 389 (587)
T ss_pred ecCCCchhHHHHHHHhhHHHHH
Confidence 3333345678999999999943
No 47
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=90.33 E-value=0.16 Score=47.99 Aligned_cols=18 Identities=33% Similarity=0.551 Sum_probs=15.3
Q ss_pred chHHHHHHHHHHhhCCCC
Q psy2976 148 DKFGIVVHELGHVVGFWH 165 (184)
Q Consensus 148 ~~~g~i~HElgHALG~~H 165 (184)
..+.+++|||.|||||.-
T Consensus 255 ~~~rv~~HEi~HALGFS~ 272 (622)
T PTZ00257 255 GTTRTVTHEVAHALGFSS 272 (622)
T ss_pred HHHHHHHHHHHHHhcCCH
Confidence 357899999999999943
No 48
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=88.86 E-value=0.84 Score=42.44 Aligned_cols=17 Identities=24% Similarity=0.522 Sum_probs=13.8
Q ss_pred CchHHHHHHHHHHhhCC
Q psy2976 147 CDKFGIVVHELGHVVGF 163 (184)
Q Consensus 147 c~~~g~i~HElgHALG~ 163 (184)
-..+.|++||+|||+=.
T Consensus 335 ~~dv~TL~HElGHa~H~ 351 (549)
T TIGR02289 335 SGDIDVLTHEAGHAFHV 351 (549)
T ss_pred hhHHHHHHHHhhHHHHH
Confidence 45678999999999843
No 49
>KOG4525|consensus
Probab=88.58 E-value=0.27 Score=44.80 Aligned_cols=20 Identities=45% Similarity=0.792 Sum_probs=17.4
Q ss_pred hHHHHHHHHHHhhCCCCCCC
Q psy2976 149 KFGIVVHELGHVVGFWHEHT 168 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HEh~ 168 (184)
..|.+.|||||+||..|.-.
T Consensus 301 tlGA~~HElGH~lgcpHq~~ 320 (614)
T KOG4525|consen 301 TLGAVCHELGHCLGCPHQSE 320 (614)
T ss_pred HHHHHHHHhhhccCCCCCCC
Confidence 46899999999999999753
No 50
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=87.87 E-value=0.35 Score=42.17 Aligned_cols=21 Identities=33% Similarity=0.407 Sum_probs=18.4
Q ss_pred CchHHHHHHHHHHhhCCCCCC
Q psy2976 147 CDKFGIVVHELGHVVGFWHEH 167 (184)
Q Consensus 147 c~~~g~i~HElgHALG~~HEh 167 (184)
.....|++||++|-+||..|.
T Consensus 194 ~~~P~T~~HElAHq~G~a~E~ 214 (318)
T PF12725_consen 194 YSLPFTICHELAHQLGFASED 214 (318)
T ss_pred ccccHHHHHHHHHHhCCCCHH
Confidence 456789999999999999884
No 51
>PF05547 Peptidase_M6: Immune inhibitor A peptidase M6; InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=87.12 E-value=0.074 Score=50.58 Aligned_cols=19 Identities=32% Similarity=0.560 Sum_probs=16.2
Q ss_pred hHHHHHHHHHHhhCCCCCC
Q psy2976 149 KFGIVVHELGHVVGFWHEH 167 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HEh 167 (184)
..|+++|||||.|||.-+=
T Consensus 221 giGVfaHEfGH~LGLPDlY 239 (645)
T PF05547_consen 221 GIGVFAHEFGHDLGLPDLY 239 (645)
T ss_pred ceEEEEeeccccCCCCCcc
Confidence 4799999999999997543
No 52
>cd06007 R3H_DEXH_helicase R3H domain of a group of proteins which also contain a DEXH-box helicase domain, and may function as ATP-dependent DNA or RNA helicases. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=86.33 E-value=1.4 Score=29.05 Aligned_cols=42 Identities=10% Similarity=0.211 Sum_probs=33.4
Q ss_pred cceeecCCC-CchHHHHHHHHHHhhCCCCCCCCCCCCCceEee
Q psy2976 138 QQAISIGKN-CDKFGIVVHELGHVVGFWHEHTRPDRDNHVQII 179 (184)
Q Consensus 138 ~q~isl~~~-c~~~g~i~HElgHALG~~HEh~RpDRD~yV~I~ 179 (184)
.+.+.++++ ......++|++.+.+||.|+..=...+.||+|.
T Consensus 16 ~~~l~Fpp~ls~~eR~~vH~~a~~~gL~s~S~G~g~~R~v~v~ 58 (59)
T cd06007 16 NEEYEFPSSLTNHERAVIHRLCRKLGLKSKSKGKGSNRRLSVY 58 (59)
T ss_pred ccEEEcCCCCCHHHHHHHHHHHHHcCCCceeecCCCCeEEEEe
Confidence 456666664 445689999999999999998888888888774
No 53
>KOG3607|consensus
Probab=85.89 E-value=0.45 Score=45.92 Aligned_cols=19 Identities=37% Similarity=0.656 Sum_probs=16.9
Q ss_pred hHHHHHHHHHHhhCCCCCC
Q psy2976 149 KFGIVVHELGHVVGFWHEH 167 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HEh 167 (184)
...+++|||||.||+.|.-
T Consensus 323 ~a~v~AhelgH~lGm~hD~ 341 (716)
T KOG3607|consen 323 FAVVLAHELGHNLGMIHDE 341 (716)
T ss_pred HHHHHHHHHHhhcCccccc
Confidence 3579999999999999986
No 54
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=85.42 E-value=1.4 Score=29.14 Aligned_cols=42 Identities=10% Similarity=0.170 Sum_probs=34.3
Q ss_pred cceeecCCC-CchHHHHHHHHHHhhCCCCCCCCCCCCCceEee
Q psy2976 138 QQAISIGKN-CDKFGIVVHELGHVVGFWHEHTRPDRDNHVQII 179 (184)
Q Consensus 138 ~q~isl~~~-c~~~g~i~HElgHALG~~HEh~RpDRD~yV~I~ 179 (184)
.+.+.+.+. ......++|++.+.+||.|...=...+.||.|.
T Consensus 17 ~~~l~f~p~lt~~eR~~vH~~a~~~gL~s~S~G~g~~R~v~v~ 59 (60)
T cd02640 17 IRDMVFSPEFSKEERALIHQIAQKYGLKSRSYGSGNDRYLVIS 59 (60)
T ss_pred cceEEcCCCCCHHHHHHHHHHHHHcCCceeeEeCCCCeEEEEe
Confidence 466777764 345689999999999999998888888888874
No 55
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=82.66 E-value=0.86 Score=38.83 Aligned_cols=14 Identities=50% Similarity=0.767 Sum_probs=11.9
Q ss_pred hHHHHHHHHHHhhC
Q psy2976 149 KFGIVVHELGHVVG 162 (184)
Q Consensus 149 ~~g~i~HElgHALG 162 (184)
...+++|||||++|
T Consensus 216 ~~~v~vHE~GHsf~ 229 (264)
T PF09471_consen 216 FKQVVVHEFGHSFG 229 (264)
T ss_dssp HHHHHHHHHHHHTT
T ss_pred ccceeeeecccccc
Confidence 45799999999876
No 56
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=80.33 E-value=1.8 Score=36.09 Aligned_cols=26 Identities=15% Similarity=0.502 Sum_probs=18.6
Q ss_pred CcceeecCCC------CchHHHHHHHHHHhhC
Q psy2976 137 GQQAISIGKN------CDKFGIVVHELGHVVG 162 (184)
Q Consensus 137 g~q~isl~~~------c~~~g~i~HElgHALG 162 (184)
....+.|+.. ....++++||.|||+-
T Consensus 71 ~~k~vrLS~~vy~~~SiaAvaVAAHEvGHAiQ 102 (222)
T PF04298_consen 71 RNKVVRLSEDVYNGRSIAAVAVAAHEVGHAIQ 102 (222)
T ss_pred CCCEEEeCCccCCCCCHHHHHHHHHHHhHHHh
Confidence 3456777653 2347899999999984
No 57
>cd02642 R3H_encore_like R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=80.05 E-value=3.4 Score=27.25 Aligned_cols=41 Identities=12% Similarity=0.258 Sum_probs=32.3
Q ss_pred ceeecCCCCchHHHHHHHHHHhhCCCCCCCCCCCCCceEeec
Q psy2976 139 QAISIGKNCDKFGIVVHELGHVVGFWHEHTRPDRDNHVQIIR 180 (184)
Q Consensus 139 q~isl~~~c~~~g~i~HElgHALG~~HEh~RpDRD~yV~I~~ 180 (184)
+.+.+.+-......++|++....||.|+.+... +.+|.|..
T Consensus 22 ~~~~f~pm~sy~RllvH~la~~~gL~s~s~~~~-~r~vvv~k 62 (63)
T cd02642 22 QSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSG-GKCVIVNK 62 (63)
T ss_pred CeeEcCCCCcHHHHHHHHHHHHhCCeeEeecCC-ceEEEEEe
Confidence 344554434456899999999999999999998 88888753
No 58
>TIGR00043 metalloprotein, YbeY/UPF0054 family. This minimally protein family is represented by a single member sequence only in nearly every bacterium. Crystallography demonstrates metal-binding activity, possibly to nickel. It is a predicted to be a metallohydrolase, and somewhat more weakly to be a metalloproteinase.
Probab=79.16 E-value=1.4 Score=32.50 Aligned_cols=19 Identities=37% Similarity=0.573 Sum_probs=16.4
Q ss_pred HHHHHHHHHHhhCCCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHEHT 168 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh~ 168 (184)
.-.++|=++|.|||.|+..
T Consensus 74 ~~l~iHG~LHLlGyDH~~~ 92 (110)
T TIGR00043 74 AHLTVHGLLHLLGYDHETE 92 (110)
T ss_pred HHHHHHHHHHHcCCCCCCh
Confidence 4579999999999999853
No 59
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=78.98 E-value=5.7 Score=31.52 Aligned_cols=18 Identities=39% Similarity=0.263 Sum_probs=15.7
Q ss_pred hHHHHHHHHHHhhCCCCC
Q psy2976 149 KFGIVVHELGHVVGFWHE 166 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HE 166 (184)
...+|+|||.|..-..|.
T Consensus 164 idYVvvHEL~Hl~~~nHs 181 (205)
T PF01863_consen 164 IDYVVVHELCHLRHPNHS 181 (205)
T ss_pred HHHHHHHHHHHhccCCCC
Confidence 357999999999998886
No 60
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=78.73 E-value=1.1 Score=37.08 Aligned_cols=12 Identities=42% Similarity=0.584 Sum_probs=10.6
Q ss_pred HHHHHHHHHHhh
Q psy2976 150 FGIVVHELGHVV 161 (184)
Q Consensus 150 ~g~i~HElgHAL 161 (184)
.=+++|||||+|
T Consensus 73 rFtlAHELGH~l 84 (213)
T COG2856 73 RFTLAHELGHAL 84 (213)
T ss_pred HHHHHHHHhHHH
Confidence 459999999997
No 61
>cd02646 R3H_G-patch R3H domain of a group of fungal and plant proteins with unknown function, who also contain a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the R3H domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=78.52 E-value=4.8 Score=26.08 Aligned_cols=41 Identities=10% Similarity=0.127 Sum_probs=32.1
Q ss_pred ceeecCCCCchHHHHHHHHHHhhCCCCCCCCCCCCCceEee
Q psy2976 139 QAISIGKNCDKFGIVVHELGHVVGFWHEHTRPDRDNHVQII 179 (184)
Q Consensus 139 q~isl~~~c~~~g~i~HElgHALG~~HEh~RpDRD~yV~I~ 179 (184)
+.+.+.+=....-.++|+|..++||.++..-...+.||.|.
T Consensus 17 ~~~~fppm~~~~R~~vH~lA~~~~L~S~S~G~g~~R~v~v~ 57 (58)
T cd02646 17 DSLSFPPMDKHGRKTIHKLANCYNLKSKSRGKGKKRFVTVT 57 (58)
T ss_pred ceEecCCCCHHHHHHHHHHHHHcCCcccccccCCceEEEEE
Confidence 45555553334568999999999999999988888888874
No 62
>PF02130 UPF0054: Uncharacterized protein family UPF0054; InterPro: IPR002036 These, as yet, uncharacterised proteins are of 17 to 21 kDa. They contain a conserved region with three histidines at the C terminus. The protein family is represented by a single member sequence only in nearly every bacterium. The crystal structure of the protein from the hyperthermophilic bacteria Aquifex aeolicus has been determined. The overall fold consists of one central alpha-helix surrounded by a four-stranded beta-sheet and four other alpha-helices. Structure-based homology analysis reveals a good resemblance to the metal-dependent proteinases such as collagenases and gelatinases. However, experimental tests for collagenase and gelatinase-type function show no detectable activity under standard assay conditions [].; GO: 0046872 metal ion binding; PDB: 1TVI_A 1OZ9_A 1XM5_A 1XAX_A.
Probab=76.17 E-value=1.9 Score=33.31 Aligned_cols=17 Identities=41% Similarity=0.690 Sum_probs=14.7
Q ss_pred HHHHHHHHHHhhCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHE 166 (184)
Q Consensus 150 ~g~i~HElgHALG~~HE 166 (184)
.-.++|=++|.|||.|+
T Consensus 109 ~~l~vHG~LHLlGyDH~ 125 (145)
T PF02130_consen 109 ARLLVHGLLHLLGYDHE 125 (145)
T ss_dssp HHHHHHHHHHHTT-SST
T ss_pred hHHHHHHHHHHcCCCCC
Confidence 46899999999999999
No 63
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=75.84 E-value=1.7 Score=30.76 Aligned_cols=24 Identities=33% Similarity=0.458 Sum_probs=15.6
Q ss_pred cceeecCCCC---chHHHHHHHHHHhh
Q psy2976 138 QQAISIGKNC---DKFGIVVHELGHVV 161 (184)
Q Consensus 138 ~q~isl~~~c---~~~g~i~HElgHAL 161 (184)
...|-+..+. ...-+++|||||.+
T Consensus 28 ~~~I~in~~~~~~~~~f~laHELgH~~ 54 (122)
T PF06114_consen 28 NPIIFINSNLSPERQRFTLAHELGHIL 54 (122)
T ss_dssp TTEEEEESSS-HHHHHHHHHHHHHHHH
T ss_pred CCEEEECCCCCHHHHHHHHHHHHHHHH
Confidence 3455554442 23569999999986
No 64
>COG0319 Predicted metal-dependent hydrolase [General function prediction only]
Probab=73.57 E-value=2.4 Score=33.29 Aligned_cols=19 Identities=26% Similarity=0.560 Sum_probs=16.7
Q ss_pred HHHHHHHHHHhhCCCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHEHT 168 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh~ 168 (184)
.-.++|=++|.|||.|+..
T Consensus 109 ~~l~vHG~LHLlGYDH~~~ 127 (153)
T COG0319 109 AHLTIHGILHLLGYDHEED 127 (153)
T ss_pred HHHHHHHHHHHccCCCCCc
Confidence 3589999999999999875
No 65
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=73.28 E-value=2.3 Score=30.89 Aligned_cols=17 Identities=35% Similarity=0.526 Sum_probs=12.6
Q ss_pred HHHHHHHHHHhhCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHE 166 (184)
Q Consensus 150 ~g~i~HElgHALG~~HE 166 (184)
..+++||++|-+|+.-|
T Consensus 74 ~~tlvhEiah~fG~~~e 90 (97)
T PF06262_consen 74 RDTLVHEIAHHFGISDE 90 (97)
T ss_dssp HHHHHHHHHHHTT--HH
T ss_pred HHHHHHHHHHHcCCCHH
Confidence 36899999999998643
No 66
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=73.24 E-value=7.8 Score=36.73 Aligned_cols=17 Identities=35% Similarity=0.380 Sum_probs=13.9
Q ss_pred CchHHHHHHHHHHhhCC
Q psy2976 147 CDKFGIVVHELGHVVGF 163 (184)
Q Consensus 147 c~~~g~i~HElgHALG~ 163 (184)
-..+-|++||+||++=.
T Consensus 378 ~~dV~TLaHElGHs~Hs 394 (598)
T COG1164 378 LRDVFTLAHELGHSVHS 394 (598)
T ss_pred hhHHHHHHHHccHHHHH
Confidence 45688999999999743
No 67
>PRK00016 metal-binding heat shock protein; Provisional
Probab=72.42 E-value=2.4 Score=33.28 Aligned_cols=18 Identities=28% Similarity=0.481 Sum_probs=15.8
Q ss_pred HHHHHHHHHHhhCCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHEH 167 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh 167 (184)
.-.++|=++|.|||.|+.
T Consensus 114 ~~l~iHG~LHLlGYDH~~ 131 (159)
T PRK00016 114 AHLTVHGILHLLGYDHIE 131 (159)
T ss_pred HHHHHHhhHHhcCCCCCC
Confidence 357999999999999974
No 68
>PF11667 DUF3267: Protein of unknown function (DUF3267); InterPro: IPR021683 This family of proteins has no known function.
Probab=71.69 E-value=1.8 Score=31.54 Aligned_cols=20 Identities=35% Similarity=0.247 Sum_probs=17.7
Q ss_pred HHHHHHHHHhhCCCCCCCCC
Q psy2976 151 GIVVHELGHVVGFWHEHTRP 170 (184)
Q Consensus 151 g~i~HElgHALG~~HEh~Rp 170 (184)
..++||+.|+|.+..--.++
T Consensus 6 ~~~~HEliH~l~~~~~~~~~ 25 (111)
T PF11667_consen 6 LIPLHELIHGLFFKLFGKKP 25 (111)
T ss_pred eHHHHHHHHHHHHHHhCCCC
Confidence 46899999999999888877
No 69
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=70.65 E-value=2.5 Score=36.78 Aligned_cols=14 Identities=43% Similarity=0.847 Sum_probs=12.0
Q ss_pred chHHHHHHHHHHhh
Q psy2976 148 DKFGIVVHELGHVV 161 (184)
Q Consensus 148 ~~~g~i~HElgHAL 161 (184)
..+.++.||||||+
T Consensus 153 ~~v~tl~HE~GHa~ 166 (365)
T cd06258 153 DDINTLFHEFGHAV 166 (365)
T ss_pred HHHHHHHHHHhHHH
Confidence 34679999999998
No 70
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=69.35 E-value=2.9 Score=37.69 Aligned_cols=15 Identities=27% Similarity=0.507 Sum_probs=12.4
Q ss_pred hHHHHHHHHHHhhCC
Q psy2976 149 KFGIVVHELGHVVGF 163 (184)
Q Consensus 149 ~~g~i~HElgHALG~ 163 (184)
.+.|+.||||||+=.
T Consensus 242 ~v~tLfHE~GHa~H~ 256 (458)
T PF01432_consen 242 DVETLFHEFGHAMHS 256 (458)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHhHHHHH
Confidence 467999999999843
No 71
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=68.99 E-value=3.1 Score=34.56 Aligned_cols=13 Identities=38% Similarity=0.598 Sum_probs=11.0
Q ss_pred hHHHHHHHHHHhh
Q psy2976 149 KFGIVVHELGHVV 161 (184)
Q Consensus 149 ~~g~i~HElgHAL 161 (184)
...++.||+||||
T Consensus 92 ~~~~l~HE~GHAl 104 (220)
T PF14247_consen 92 VLFTLYHELGHAL 104 (220)
T ss_pred HHHHHHHHHHHHH
Confidence 3578999999997
No 72
>PRK13963 unkown domain/putative metalloprotease fusion protein; Provisional
Probab=68.89 E-value=3.4 Score=35.15 Aligned_cols=17 Identities=35% Similarity=0.483 Sum_probs=15.6
Q ss_pred HHHHHHHHHHhhCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHE 166 (184)
Q Consensus 150 ~g~i~HElgHALG~~HE 166 (184)
.-.++|=++|.|||.|+
T Consensus 216 ~~LlIHGlLHLLGYDHe 232 (258)
T PRK13963 216 AHLLVHGALHAQGYDHE 232 (258)
T ss_pred HHHHHHHHHHHcCCCCC
Confidence 46899999999999999
No 73
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=65.71 E-value=3.7 Score=37.52 Aligned_cols=13 Identities=38% Similarity=0.790 Sum_probs=11.5
Q ss_pred hHHHHHHHHHHhh
Q psy2976 149 KFGIVVHELGHVV 161 (184)
Q Consensus 149 ~~g~i~HElgHAL 161 (184)
.+-|+.|||||||
T Consensus 263 ~V~TLfHEfGHal 275 (472)
T cd06455 263 EVETFFHEFGHVI 275 (472)
T ss_pred HHHHHHHHHHHHH
Confidence 4678999999999
No 74
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=64.95 E-value=3.7 Score=33.73 Aligned_cols=24 Identities=17% Similarity=0.542 Sum_probs=16.9
Q ss_pred ceeecCC------CCchHHHHHHHHHHhhC
Q psy2976 139 QAISIGK------NCDKFGIVVHELGHVVG 162 (184)
Q Consensus 139 q~isl~~------~c~~~g~i~HElgHALG 162 (184)
+.+.|+. .....++++||.|||+-
T Consensus 76 kvvrLSe~~y~g~Sia~~aVAAHEVGHAiQ 105 (226)
T COG2738 76 KVVRLSEANYYGPSIAAIAVAAHEVGHAIQ 105 (226)
T ss_pred heeeccccccCCccHHHHHHHHHHhhHHHh
Confidence 4566643 23457899999999984
No 75
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=64.76 E-value=3.9 Score=31.84 Aligned_cols=12 Identities=50% Similarity=0.758 Sum_probs=10.5
Q ss_pred HHHHHHHHHHhh
Q psy2976 150 FGIVVHELGHVV 161 (184)
Q Consensus 150 ~g~i~HElgHAL 161 (184)
.-+++|||.|++
T Consensus 136 lDVvaHEltHGV 147 (150)
T PF01447_consen 136 LDVVAHELTHGV 147 (150)
T ss_dssp HHHHHHHHHHHH
T ss_pred cceeeecccccc
Confidence 569999999985
No 76
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=63.58 E-value=4.1 Score=37.56 Aligned_cols=16 Identities=38% Similarity=0.808 Sum_probs=13.6
Q ss_pred CchHHHHHHHHHHhhC
Q psy2976 147 CDKFGIVVHELGHVVG 162 (184)
Q Consensus 147 c~~~g~i~HElgHALG 162 (184)
-...|+++||+||..|
T Consensus 128 sElagViAHEigHv~q 143 (484)
T COG4783 128 SELAGVIAHEIGHVAQ 143 (484)
T ss_pred HHHHHHHHHHHHHHhh
Confidence 3467999999999987
No 77
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=63.42 E-value=3.3 Score=31.45 Aligned_cols=30 Identities=20% Similarity=0.400 Sum_probs=20.9
Q ss_pred CcceeecCCC-----Cc---------hHHHHHHHHHHhhCCCCC
Q psy2976 137 GQQAISIGKN-----CD---------KFGIVVHELGHVVGFWHE 166 (184)
Q Consensus 137 g~q~isl~~~-----c~---------~~g~i~HElgHALG~~HE 166 (184)
++..|.|.+. |. ...+++||+||-.|+.-+
T Consensus 83 ~P~rItlYRrailDywae~eetlgd~vthvliHEIgHhFGLsDd 126 (136)
T COG3824 83 LPDRITLYRRALLDYWAENEETLGDQVTHVLIHEIGHHFGLSDD 126 (136)
T ss_pred CCceeeeeHHHHHHHHhhhhhhHhhHhhhhhhhhhhhhcCCChh
Confidence 5677777642 32 135899999999998643
No 78
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=62.96 E-value=4.4 Score=36.90 Aligned_cols=17 Identities=24% Similarity=0.421 Sum_probs=13.2
Q ss_pred hHHHHHHHHHHhhCCCC
Q psy2976 149 KFGIVVHELGHVVGFWH 165 (184)
Q Consensus 149 ~~g~i~HElgHALG~~H 165 (184)
.+.|+.|||||||=..-
T Consensus 248 ~v~TLfHEfGHalH~~l 264 (458)
T cd06457 248 EVETLFHEMGHAMHSML 264 (458)
T ss_pred HHHHHHHHHhHHHHHHH
Confidence 36799999999985433
No 79
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=61.19 E-value=5.3 Score=34.35 Aligned_cols=16 Identities=44% Similarity=0.466 Sum_probs=12.4
Q ss_pred hHHHHHHHHHHhhCCC
Q psy2976 149 KFGIVVHELGHVVGFW 164 (184)
Q Consensus 149 ~~g~i~HElgHALG~~ 164 (184)
..++++||++||+=-.
T Consensus 135 ~isvvvHElgHal~A~ 150 (277)
T cd06162 135 LISGVVHEMGHGVAAV 150 (277)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4578999999997433
No 80
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=60.49 E-value=5.5 Score=33.87 Aligned_cols=23 Identities=35% Similarity=0.398 Sum_probs=15.8
Q ss_pred hHHHHHHHHHHhhCCCCCCCCCC
Q psy2976 149 KFGIVVHELGHVVGFWHEHTRPD 171 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HEh~RpD 171 (184)
...+++||+||++=-..+..+.+
T Consensus 118 ~isv~iHElgHa~~Ar~~G~~V~ 140 (263)
T cd06159 118 VVGVVVHELSHGILARVEGIKVK 140 (263)
T ss_pred HHHHHHHHHHHHHHHHHcCCEEC
Confidence 45789999999975554444433
No 81
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=58.94 E-value=6.2 Score=32.06 Aligned_cols=24 Identities=25% Similarity=0.296 Sum_probs=16.3
Q ss_pred chHHHHHHHHHHhhCCCCCCCCCC
Q psy2976 148 DKFGIVVHELGHVVGFWHEHTRPD 171 (184)
Q Consensus 148 ~~~g~i~HElgHALG~~HEh~RpD 171 (184)
....++.||+||++=..+-..|..
T Consensus 37 l~~~v~iHElgH~~~A~~~G~~v~ 60 (208)
T cd06161 37 LFLSVLLHELGHALVARRYGIRVR 60 (208)
T ss_pred HHHHHHHHHHHHHHHHHHcCCCcc
Confidence 346789999999975544444433
No 82
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=58.55 E-value=6.8 Score=31.17 Aligned_cols=14 Identities=36% Similarity=0.650 Sum_probs=12.0
Q ss_pred hHHHHHHHHHHhhC
Q psy2976 149 KFGIVVHELGHVVG 162 (184)
Q Consensus 149 ~~g~i~HElgHALG 162 (184)
..++++||+||..+
T Consensus 89 l~aVlaHElgH~~~ 102 (226)
T PF01435_consen 89 LAAVLAHELGHIKH 102 (226)
T ss_dssp HHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHc
Confidence 46899999999874
No 83
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=58.23 E-value=6.5 Score=27.86 Aligned_cols=12 Identities=33% Similarity=0.432 Sum_probs=10.8
Q ss_pred HHHHHHHHHHhh
Q psy2976 150 FGIVVHELGHVV 161 (184)
Q Consensus 150 ~g~i~HElgHAL 161 (184)
..+++||+.|++
T Consensus 26 ~~~l~HE~~H~~ 37 (128)
T PF13485_consen 26 DRVLAHELAHQW 37 (128)
T ss_pred HHHHHHHHHHHH
Confidence 489999999996
No 84
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=58.00 E-value=7.2 Score=35.17 Aligned_cols=20 Identities=30% Similarity=0.466 Sum_probs=15.0
Q ss_pred chHHHHHHHHHHhhCCCCCCCCC
Q psy2976 148 DKFGIVVHELGHVVGFWHEHTRP 170 (184)
Q Consensus 148 ~~~g~i~HElgHALG~~HEh~Rp 170 (184)
...-+++||+|||| |||+-|
T Consensus 158 ~~l~t~iHE~GHal---ye~~l~ 177 (396)
T cd06460 158 SALFSTIHETGHAL---YEQGLP 177 (396)
T ss_pred HHHHHHHHHhhHHH---HHhcCC
Confidence 34568999999997 666544
No 85
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=57.59 E-value=6.3 Score=35.61 Aligned_cols=13 Identities=31% Similarity=0.708 Sum_probs=11.5
Q ss_pred hHHHHHHHHHHhh
Q psy2976 149 KFGIVVHELGHVV 161 (184)
Q Consensus 149 ~~g~i~HElgHAL 161 (184)
.+-|+.|||||||
T Consensus 208 ~v~tLfHEfGHal 220 (422)
T cd06456 208 EVTTLFHEFGHAL 220 (422)
T ss_pred HHHHHHHHHHHHH
Confidence 4678999999999
No 86
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=54.23 E-value=8.2 Score=31.93 Aligned_cols=22 Identities=23% Similarity=0.222 Sum_probs=14.8
Q ss_pred hHHHHHHHHHHhhCCCCCCCCC
Q psy2976 149 KFGIVVHELGHVVGFWHEHTRP 170 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HEh~Rp 170 (184)
...+++||+||++=......++
T Consensus 53 ~~~v~iHElgH~~~A~~~G~~v 74 (227)
T cd06164 53 FASVLLHELGHSLVARRYGIPV 74 (227)
T ss_pred HHHHHHHHHHHHHHHHHcCCeE
Confidence 3578999999997444443333
No 87
>PF14521 Aspzincin_M35: Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=54.05 E-value=7.9 Score=29.86 Aligned_cols=14 Identities=50% Similarity=0.847 Sum_probs=11.9
Q ss_pred hHHHHHHHHHH--hhC
Q psy2976 149 KFGIVVHELGH--VVG 162 (184)
Q Consensus 149 ~~g~i~HElgH--ALG 162 (184)
..++++||+-| +.|
T Consensus 96 k~~TLiHE~SHf~~~~ 111 (148)
T PF14521_consen 96 KEGTLIHEWSHFTAVG 111 (148)
T ss_dssp HHHHHHHHHHHSCCCT
T ss_pred hHHHHHHhhhhhhhhc
Confidence 57999999999 655
No 88
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=52.97 E-value=8.5 Score=29.42 Aligned_cols=15 Identities=27% Similarity=0.344 Sum_probs=12.4
Q ss_pred hHHHHHHHHHHhhCC
Q psy2976 149 KFGIVVHELGHVVGF 163 (184)
Q Consensus 149 ~~g~i~HElgHALG~ 163 (184)
...|++|||.|++=+
T Consensus 59 l~~~l~HEm~H~~~~ 73 (146)
T smart00731 59 LRETLLHELCHAALY 73 (146)
T ss_pred HHhhHHHHHHHHHHH
Confidence 457999999999854
No 89
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=52.78 E-value=9.3 Score=29.07 Aligned_cols=18 Identities=17% Similarity=0.495 Sum_probs=14.4
Q ss_pred chHHHHHHHHHHhhCCCC
Q psy2976 148 DKFGIVVHELGHVVGFWH 165 (184)
Q Consensus 148 ~~~g~i~HElgHALG~~H 165 (184)
....|++|||.|+.=+..
T Consensus 59 ~~~~tL~HEm~H~~~~~~ 76 (157)
T PF10263_consen 59 ELIDTLLHEMAHAAAYVF 76 (157)
T ss_pred HHHHHHHHHHHHHHhhhc
Confidence 346899999999987654
No 90
>KOG2921|consensus
Probab=51.54 E-value=8.4 Score=34.95 Aligned_cols=18 Identities=44% Similarity=0.639 Sum_probs=14.0
Q ss_pred HHHHHHHHHHhhCCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHEH 167 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh 167 (184)
...++||+||||--.-|+
T Consensus 132 i~~vvHElGHalAA~seg 149 (484)
T KOG2921|consen 132 ITVVVHELGHALAAASEG 149 (484)
T ss_pred HHHHHHHhhHHHHHHhcC
Confidence 568999999999655554
No 91
>PLN02887 hydrolase family protein
Probab=51.01 E-value=10 Score=35.91 Aligned_cols=19 Identities=47% Similarity=0.590 Sum_probs=16.7
Q ss_pred HHHHHHHHHHhhCCCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHEHT 168 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh~ 168 (184)
.-.++|=++|.|||.|+..
T Consensus 231 ~~l~vHG~LHLlGyDH~~~ 249 (580)
T PLN02887 231 RILVVHGLLHLLGFDHEIS 249 (580)
T ss_pred HHHHHHHHHHhcCCCCCCC
Confidence 3579999999999999974
No 92
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=49.22 E-value=10 Score=31.34 Aligned_cols=13 Identities=38% Similarity=0.381 Sum_probs=10.8
Q ss_pred hHHHHHHHHHHhh
Q psy2976 149 KFGIVVHELGHVV 161 (184)
Q Consensus 149 ~~g~i~HElgHAL 161 (184)
...+++||+|||-
T Consensus 93 ~gsiLAHE~mHa~ 105 (212)
T PF12315_consen 93 TGSILAHELMHAW 105 (212)
T ss_pred HhhHHHHHHHHHH
Confidence 3468999999995
No 93
>KOG3658|consensus
Probab=48.12 E-value=6.1 Score=37.98 Aligned_cols=18 Identities=33% Similarity=0.486 Sum_probs=15.4
Q ss_pred HHHHHHHHHhhCCCCCCC
Q psy2976 151 GIVVHELGHVVGFWHEHT 168 (184)
Q Consensus 151 g~i~HElgHALG~~HEh~ 168 (184)
-|++|||||..|-.|.-.
T Consensus 394 lt~AHEiGHNfGSpHDp~ 411 (764)
T KOG3658|consen 394 LTLAHEIGHNFGSPHDPD 411 (764)
T ss_pred eeehhhhccccCCCCCCC
Confidence 489999999999999643
No 94
>PRK10911 oligopeptidase A; Provisional
Probab=48.12 E-value=10 Score=36.45 Aligned_cols=13 Identities=31% Similarity=0.534 Sum_probs=11.5
Q ss_pred hHHHHHHHHHHhh
Q psy2976 149 KFGIVVHELGHVV 161 (184)
Q Consensus 149 ~~g~i~HElgHAL 161 (184)
.+.|+.|||||||
T Consensus 463 ~v~tlfHEfGHal 475 (680)
T PRK10911 463 EVITLFHEFGHGL 475 (680)
T ss_pred HHHHHHHHHhHHH
Confidence 4679999999998
No 95
>PRK03982 heat shock protein HtpX; Provisional
Probab=48.04 E-value=11 Score=32.14 Aligned_cols=14 Identities=36% Similarity=0.703 Sum_probs=11.6
Q ss_pred chHHHHHHHHHHhh
Q psy2976 148 DKFGIVVHELGHVV 161 (184)
Q Consensus 148 ~~~g~i~HElgHAL 161 (184)
+..++++||+||.-
T Consensus 124 El~AVlAHElgHi~ 137 (288)
T PRK03982 124 ELEGVIAHELTHIK 137 (288)
T ss_pred HHHHHHHHHHHHHH
Confidence 35799999999964
No 96
>PRK03072 heat shock protein HtpX; Provisional
Probab=47.96 E-value=11 Score=32.28 Aligned_cols=14 Identities=43% Similarity=0.665 Sum_probs=11.6
Q ss_pred chHHHHHHHHHHhh
Q psy2976 148 DKFGIVVHELGHVV 161 (184)
Q Consensus 148 ~~~g~i~HElgHAL 161 (184)
+..++++||+||.-
T Consensus 126 El~aVlAHElgHi~ 139 (288)
T PRK03072 126 ELRGVLGHELSHVY 139 (288)
T ss_pred HHHHHHHHHHHHHh
Confidence 35799999999964
No 97
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=47.42 E-value=12 Score=25.97 Aligned_cols=14 Identities=43% Similarity=0.638 Sum_probs=11.9
Q ss_pred hHHHHHHHHHHhhC
Q psy2976 149 KFGIVVHELGHVVG 162 (184)
Q Consensus 149 ~~g~i~HElgHALG 162 (184)
....++|||.|++-
T Consensus 61 ~~~llaHEl~Hv~Q 74 (79)
T PF13699_consen 61 GRALLAHELAHVVQ 74 (79)
T ss_pred cchhHhHHHHHHHh
Confidence 46799999999973
No 98
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=47.34 E-value=12 Score=31.34 Aligned_cols=14 Identities=36% Similarity=0.710 Sum_probs=11.7
Q ss_pred chHHHHHHHHHHhh
Q psy2976 148 DKFGIVVHELGHVV 161 (184)
Q Consensus 148 ~~~g~i~HElgHAL 161 (184)
+..++++||+||..
T Consensus 156 El~aVlaHElgHi~ 169 (302)
T COG0501 156 ELEAVLAHELGHIK 169 (302)
T ss_pred HHHHHHHHHHHHHh
Confidence 35799999999975
No 99
>KOG3538|consensus
Probab=47.31 E-value=4 Score=40.11 Aligned_cols=21 Identities=33% Similarity=0.450 Sum_probs=17.3
Q ss_pred HHHHHHHHHHhhCCCCCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHEHTRP 170 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh~Rp 170 (184)
.-+++||+||.+|+.|--.-.
T Consensus 318 aftiahE~GH~~gm~hd~~~~ 338 (845)
T KOG3538|consen 318 AFTIAHELGHNFGMEHDGRGN 338 (845)
T ss_pred ceeeeeccccccCcccCCcCC
Confidence 458999999999999975444
No 100
>PRK05457 heat shock protein HtpX; Provisional
Probab=46.13 E-value=12 Score=31.99 Aligned_cols=13 Identities=23% Similarity=0.629 Sum_probs=11.1
Q ss_pred chHHHHHHHHHHh
Q psy2976 148 DKFGIVVHELGHV 160 (184)
Q Consensus 148 ~~~g~i~HElgHA 160 (184)
+..++++||+||.
T Consensus 133 El~aVlAHElgHi 145 (284)
T PRK05457 133 EVEAVLAHEISHI 145 (284)
T ss_pred HHHHHHHHHHHHH
Confidence 3579999999995
No 101
>PRK03001 M48 family peptidase; Provisional
Probab=46.11 E-value=12 Score=31.75 Aligned_cols=14 Identities=43% Similarity=0.729 Sum_probs=11.6
Q ss_pred chHHHHHHHHHHhh
Q psy2976 148 DKFGIVVHELGHVV 161 (184)
Q Consensus 148 ~~~g~i~HElgHAL 161 (184)
+..++++||+||.-
T Consensus 123 El~aVlAHElgHi~ 136 (283)
T PRK03001 123 EIRGVMAHELAHVK 136 (283)
T ss_pred HHHHHHHHHHHHHh
Confidence 35799999999974
No 102
>PF06167 Peptidase_M90: Glucose-regulated metallo-peptidase M90; InterPro: IPR010384 MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=45.17 E-value=12 Score=31.56 Aligned_cols=17 Identities=29% Similarity=0.571 Sum_probs=12.0
Q ss_pred chHHHHHHHHHHhhCCC
Q psy2976 148 DKFGIVVHELGHVVGFW 164 (184)
Q Consensus 148 ~~~g~i~HElgHALG~~ 164 (184)
+..-+++|||.|+|=+.
T Consensus 153 dg~NVviHEfAH~LD~~ 169 (253)
T PF06167_consen 153 DGHNVVIHEFAHKLDME 169 (253)
T ss_dssp SS--HHHHHHHHHHHCT
T ss_pred CCcchHHHHHHHHHHhh
Confidence 34679999999999443
No 103
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=44.87 E-value=13 Score=28.02 Aligned_cols=36 Identities=31% Similarity=0.398 Sum_probs=22.2
Q ss_pred CCCcccccCCCCCCcceeecCCCCchHHHHHHHHHHhhCC
Q psy2976 124 PCGCCSFVGKRGNGQQAISIGKNCDKFGIVVHELGHVVGF 163 (184)
Q Consensus 124 ~~gC~S~vG~~~~g~q~isl~~~c~~~g~i~HElgHALG~ 163 (184)
++-|.+++|..--.+ -+-.+||. -++.||+.|+--|
T Consensus 58 ~~LCG~~~~~i~IDP--~~~~KGC~--~TL~HEL~H~WQ~ 93 (141)
T PHA02456 58 QDLCGQFVGWIEIDP--DYANKGCR--DTLAHELNHAWQF 93 (141)
T ss_pred cchhhcceeEEEECC--cccccchH--HHHHHHHHHHHhh
Confidence 455888877642111 11134674 6899999998654
No 104
>PRK01345 heat shock protein HtpX; Provisional
Probab=44.79 E-value=13 Score=32.34 Aligned_cols=14 Identities=43% Similarity=0.759 Sum_probs=11.8
Q ss_pred chHHHHHHHHHHhh
Q psy2976 148 DKFGIVVHELGHVV 161 (184)
Q Consensus 148 ~~~g~i~HElgHAL 161 (184)
+..++++|||||..
T Consensus 123 EL~aVlAHElgHi~ 136 (317)
T PRK01345 123 EVAGVMAHELAHVK 136 (317)
T ss_pred HHHHHHHHHHHHHH
Confidence 35799999999975
No 105
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=44.24 E-value=15 Score=29.46 Aligned_cols=25 Identities=16% Similarity=0.218 Sum_probs=18.5
Q ss_pred hHHHHHHHHHHhhCCCCCCCCCCCC
Q psy2976 149 KFGIVVHELGHVVGFWHEHTRPDRD 173 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HEh~RpDRD 173 (184)
....+.||+||++-......+..+-
T Consensus 41 ~~~l~iHElgH~~~A~~~G~~~~~~ 65 (183)
T cd06160 41 LAILGIHEMGHYLAARRHGVKASLP 65 (183)
T ss_pred HHHHHHHHHHHHHHHHHCCCCccce
Confidence 3568899999998776666666553
No 106
>PRK04897 heat shock protein HtpX; Provisional
Probab=44.01 E-value=14 Score=31.83 Aligned_cols=13 Identities=31% Similarity=0.601 Sum_probs=11.1
Q ss_pred hHHHHHHHHHHhh
Q psy2976 149 KFGIVVHELGHVV 161 (184)
Q Consensus 149 ~~g~i~HElgHAL 161 (184)
..++++||+||.-
T Consensus 137 l~aVlAHElgHi~ 149 (298)
T PRK04897 137 LEGVIGHEISHIR 149 (298)
T ss_pred HHHHHHHHHHHHh
Confidence 5799999999964
No 107
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=43.99 E-value=16 Score=24.14 Aligned_cols=28 Identities=18% Similarity=0.218 Sum_probs=20.6
Q ss_pred ceeecCCCC-chHHHHHHHHHHhhCCCCC
Q psy2976 139 QAISIGKNC-DKFGIVVHELGHVVGFWHE 166 (184)
Q Consensus 139 q~isl~~~c-~~~g~i~HElgHALG~~HE 166 (184)
..+.++++- .....++|.+.|-||+.|.
T Consensus 18 ~eL~Fp~~ls~~eRriih~la~~lGL~~~ 46 (60)
T cd02639 18 DELAFPSSLSPAERRIVHLLASRLGLNHV 46 (60)
T ss_pred eEEEcCCCCCHHHHHHHHHHHHHcCCceE
Confidence 345554442 3467899999999999997
No 108
>PF15639 Tox-MPTase3: Metallopeptidase toxin 3
Probab=43.75 E-value=46 Score=25.49 Aligned_cols=70 Identities=23% Similarity=0.218 Sum_probs=40.0
Q ss_pred HHHHHHHHHhhcccceEEEEcCCCCCCcEEEEecCCCCcc--cccCCCCCCcceeecCC--------CC---------ch
Q psy2976 89 ALFKQAMRHWENFTCVKFVERSSTEHPNYILFTEKPCGCC--SFVGKRGNGQQAISIGK--------NC---------DK 149 (184)
Q Consensus 89 ~~I~~A~~~we~~TcV~F~e~~~~~~~~~I~f~~~~~gC~--S~vG~~~~g~q~isl~~--------~c---------~~ 149 (184)
..|-.||+.|-..+-+...+.-.=....+|.|.....||+ .+.+.. +.|.|.+.. +- ..
T Consensus 23 p~iv~A~~K~~~l~~~~l~~~l~WGkGP~IkI~~~~~~~A~G~t~p~~--~~~~i~I~~~lV~~~Ek~~~~~r~~~~~~v 100 (135)
T PF15639_consen 23 PAIVKAYKKWGGLNKAKLQEALTWGKGPYIKITPLVGGTAYGYTTPDD--GSQVIRIDGDLVNMFEKGPGERRAGNVYLV 100 (135)
T ss_pred hHHHHHHHHHcccchhhhhhccccCCCCEEEEeecccceeeeeecCCC--CCcEEEeeHHHhhhhhcCCcccccceEEEe
Confidence 3467888888776555444332111345788886667784 222222 345665521 11 12
Q ss_pred HHHHHHHHHHh
Q psy2976 150 FGIVVHELGHV 160 (184)
Q Consensus 150 ~g~i~HElgHA 160 (184)
..|++||+-|=
T Consensus 101 ~~TlLHEliHw 111 (135)
T PF15639_consen 101 GSTLLHELIHW 111 (135)
T ss_pred ehHHHHHHHHh
Confidence 47999999984
No 109
>KOG2556|consensus
Probab=43.50 E-value=13 Score=34.48 Aligned_cols=18 Identities=33% Similarity=0.397 Sum_probs=14.8
Q ss_pred HHHHHHHHHHhhCCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHEH 167 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh 167 (184)
..+|.||+.|||||.--+
T Consensus 256 ~stVkhE~~halgfsaGl 273 (666)
T KOG2556|consen 256 ASTVKHERDHALGFSAGL 273 (666)
T ss_pred HHHHHHHHHhhhccccee
Confidence 578999999999996433
No 110
>PF07108 PipA: PipA protein; InterPro: IPR010777 This family consists of several Salmonella PipA (pathogenicity island-encoded protein A) and related phage sequences. PipA is thought to contribute to enteric but not to systemic salmonellosis [].
Probab=42.84 E-value=9.5 Score=30.39 Aligned_cols=44 Identities=16% Similarity=0.153 Sum_probs=26.5
Q ss_pred CcEEEEecCCCCcccccCCCC--CCcceeecCCCCchHHHHHHHHHHhhCCC
Q psy2976 115 PNYILFTEKPCGCCSFVGKRG--NGQQAISIGKNCDKFGIVVHELGHVVGFW 164 (184)
Q Consensus 115 ~~~I~f~~~~~gC~S~vG~~~--~g~q~isl~~~c~~~g~i~HElgHALG~~ 164 (184)
...|.++.....|.++--.-. .|+|.+. .....+||+.|||-..
T Consensus 122 rkvIcLnlDd~dd~~~~E~Yes~~G~qlFd------~~RsFiHEiVHALt~L 167 (200)
T PF07108_consen 122 RKVICLNLDDSDDDSYPEHYESNEGPQLFD------TKRSFIHEIVHALTHL 167 (200)
T ss_pred ceEEEeecCCcccccchhhhhccCCccchh------HHHHHHHHHHHHHhhc
Confidence 445666555566766543321 2444443 3568999999999643
No 111
>PRK02870 heat shock protein HtpX; Provisional
Probab=42.75 E-value=15 Score=32.33 Aligned_cols=13 Identities=31% Similarity=0.677 Sum_probs=11.3
Q ss_pred chHHHHHHHHHHh
Q psy2976 148 DKFGIVVHELGHV 160 (184)
Q Consensus 148 ~~~g~i~HElgHA 160 (184)
+..++++||+||.
T Consensus 172 EL~aVlAHELgHi 184 (336)
T PRK02870 172 ELQAVMAHELSHI 184 (336)
T ss_pred HHHHHHHHHHHHH
Confidence 3579999999997
No 112
>PF13203 DUF2201_N: Putative metallopeptidase domain
Probab=42.25 E-value=18 Score=30.64 Aligned_cols=22 Identities=27% Similarity=0.587 Sum_probs=17.7
Q ss_pred hHHHHHHHHHHhhCCCCCCCCCC
Q psy2976 149 KFGIVVHELGHVVGFWHEHTRPD 171 (184)
Q Consensus 149 ~~g~i~HElgHALG~~HEh~RpD 171 (184)
..++++||++|++ |.|-.-+..
T Consensus 60 ~~~~l~HevlH~~-~~H~~r~~~ 81 (292)
T PF13203_consen 60 RVGLLLHEVLHCL-LRHPWRRGG 81 (292)
T ss_pred HHHHHHHHHHHHH-ccchhhhcc
Confidence 4578999999987 777777776
No 113
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=41.74 E-value=15 Score=31.86 Aligned_cols=12 Identities=42% Similarity=0.620 Sum_probs=9.8
Q ss_pred HHHHHHHHHHhh
Q psy2976 150 FGIVVHELGHVV 161 (184)
Q Consensus 150 ~g~i~HElgHAL 161 (184)
..+++||+||.+
T Consensus 171 ayVlAHEyGHHV 182 (292)
T PF04228_consen 171 AYVLAHEYGHHV 182 (292)
T ss_pred HHHHHHHHHHHH
Confidence 358999999974
No 114
>PRK02391 heat shock protein HtpX; Provisional
Probab=40.91 E-value=16 Score=31.42 Aligned_cols=13 Identities=38% Similarity=0.657 Sum_probs=11.0
Q ss_pred chHHHHHHHHHHh
Q psy2976 148 DKFGIVVHELGHV 160 (184)
Q Consensus 148 ~~~g~i~HElgHA 160 (184)
+..++++||+||.
T Consensus 132 El~aVlaHElgHi 144 (296)
T PRK02391 132 ELEAVLAHELSHV 144 (296)
T ss_pred HHHHHHHHHHHHH
Confidence 3579999999995
No 115
>PRK01265 heat shock protein HtpX; Provisional
Probab=40.64 E-value=17 Score=31.96 Aligned_cols=13 Identities=38% Similarity=0.659 Sum_probs=11.0
Q ss_pred chHHHHHHHHHHh
Q psy2976 148 DKFGIVVHELGHV 160 (184)
Q Consensus 148 ~~~g~i~HElgHA 160 (184)
+..++++||+||.
T Consensus 139 El~aVlAHElgHi 151 (324)
T PRK01265 139 EIKAVAGHELGHL 151 (324)
T ss_pred HHHHHHHHHHHHH
Confidence 3579999999995
No 116
>KOG3624|consensus
Probab=39.07 E-value=21 Score=34.10 Aligned_cols=30 Identities=23% Similarity=0.335 Sum_probs=23.1
Q ss_pred HHHHHHHHHhhCCCCCCCCCCCCCceEeeccc
Q psy2976 151 GIVVHELGHVVGFWHEHTRPDRDNHVQIIREN 182 (184)
Q Consensus 151 g~i~HElgHALG~~HEh~RpDRD~yV~I~~~N 182 (184)
.+++|||+| ||....-..|..-.+.-.|++
T Consensus 520 ~vigHEl~H--~FD~~G~~~D~~Gn~~~ww~~ 549 (687)
T KOG3624|consen 520 FVIGHELTH--GFDDQGRQYDESGNLRDWWDT 549 (687)
T ss_pred HHHHHHHhh--cccccccccCCCcccccCCCH
Confidence 378999999 888888888877776665553
No 117
>PF02102 Peptidase_M35: Deuterolysin metalloprotease (M35) family; InterPro: IPR001384 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M35 (deuterolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Deuterolysin is a microbial zinc-containing metalloprotease that shows some similarity to thermolysin []. The protein is expressed with a possible 19-residue signal sequence, a 155-residue propeptide, and an active peptide of 177 residues []. The latter contains an HEXXH motif towards the C terminus, but the other zinc ligands are as yet undetermined [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 1EB6_A.
Probab=38.76 E-value=22 Score=31.78 Aligned_cols=20 Identities=20% Similarity=0.544 Sum_probs=14.2
Q ss_pred CCCc---hHHHHHHHHHHhhCCC
Q psy2976 145 KNCD---KFGIVVHELGHVVGFW 164 (184)
Q Consensus 145 ~~c~---~~g~i~HElgHALG~~ 164 (184)
..|. ..++++|||-|+.+..
T Consensus 290 ~~C~~qDqatt~LHE~TH~~~V~ 312 (359)
T PF02102_consen 290 NRCHAQDQATTTLHEMTHAPAVY 312 (359)
T ss_dssp -STT---HHHHHHHHHHT-TTTS
T ss_pred ccccCCCccchhhhhhhcccccc
Confidence 4574 3579999999999863
No 118
>cd02325 R3H R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. R3H domains are found in proteins together with ATPase domains, SF1 helicase domains, SF2 DEAH helicase domains, Cys-rich repeats, ring-type zinc fingers, and KH domains. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=38.44 E-value=50 Score=19.97 Aligned_cols=41 Identities=15% Similarity=0.199 Sum_probs=31.5
Q ss_pred ceeecCCCCchHHHHHHHHHHhhCCCCCCCCCCCCCceEee
Q psy2976 139 QAISIGKNCDKFGIVVHELGHVVGFWHEHTRPDRDNHVQII 179 (184)
Q Consensus 139 q~isl~~~c~~~g~i~HElgHALG~~HEh~RpDRD~yV~I~ 179 (184)
..+.+.+-....--++|++.-..|+.++..-.+-+.+|.|.
T Consensus 18 ~~~~~~p~~~~~R~~vH~la~~~~L~s~s~g~~~~r~v~i~ 58 (59)
T cd02325 18 KSLELPPMNSYERKLIHDLAEYYGLKSESEGEGPNRRVVIT 58 (59)
T ss_pred CeEEcCCCCHHHHHHHHHHHHHCCCEEEEecCCCCcEEEEe
Confidence 34444443345678999999999999999998888888774
No 119
>cd06006 R3H_unknown_2 R3H domain of a group of fungal proteins with unknown function. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=38.09 E-value=33 Score=22.54 Aligned_cols=39 Identities=18% Similarity=0.290 Sum_probs=30.2
Q ss_pred eecCCCCchHHHHHHHHHHhhCCCCCCCCCCCCCceEee
Q psy2976 141 ISIGKNCDKFGIVVHELGHVVGFWHEHTRPDRDNHVQII 179 (184)
Q Consensus 141 isl~~~c~~~g~i~HElgHALG~~HEh~RpDRD~yV~I~ 179 (184)
+.+.+=......++||+..+.||.=|..-++-+.||-|.
T Consensus 20 ~~f~pM~~~~R~~vHdla~~~gl~SeS~d~Ep~R~V~v~ 58 (59)
T cd06006 20 LRFPPMRSPQRAFIHELAKDYGLYSESQDPEPKRSVFVK 58 (59)
T ss_pred eeCCCCCHHHHHHHHHHHHHcCCeeEecCCCCCcEEEEe
Confidence 344332334678999999999999999999888888763
No 120
>PRK04351 hypothetical protein; Provisional
Probab=36.18 E-value=22 Score=27.65 Aligned_cols=13 Identities=46% Similarity=0.652 Sum_probs=11.2
Q ss_pred hHHHHHHHHHHhh
Q psy2976 149 KFGIVVHELGHVV 161 (184)
Q Consensus 149 ~~g~i~HElgHAL 161 (184)
..+||.|||.|.+
T Consensus 61 l~~vv~HElcH~~ 73 (149)
T PRK04351 61 LIGIIKHELCHYH 73 (149)
T ss_pred HHhhHHHHHHHHH
Confidence 4689999999985
No 121
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=36.16 E-value=19 Score=28.65 Aligned_cols=20 Identities=35% Similarity=0.333 Sum_probs=12.5
Q ss_pred HHHHHHHHHhhCCCCCCCCCCC
Q psy2976 151 GIVVHELGHVVGFWHEHTRPDR 172 (184)
Q Consensus 151 g~i~HElgHALG~~HEh~RpDR 172 (184)
.+++|||+||+. .+....|-
T Consensus 38 ~ilahel~hafd--~~g~~~D~ 57 (206)
T PF01431_consen 38 FILAHELMHAFD--PEGINYDE 57 (206)
T ss_dssp HHHHHHHHHCTS--TTGGGB-T
T ss_pred HHHHHHHHHHHH--HhHhhcCc
Confidence 479999999654 34433443
No 122
>PF14891 Peptidase_M91: Effector protein
Probab=35.42 E-value=25 Score=27.69 Aligned_cols=19 Identities=26% Similarity=0.445 Sum_probs=15.0
Q ss_pred HHHHHHHHHHhhCCCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHEHT 168 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh~ 168 (184)
+.++.|||.||.-+.+=-.
T Consensus 104 ~v~L~HEL~HA~~~~~Gt~ 122 (174)
T PF14891_consen 104 FVVLYHELIHAYDYMNGTM 122 (174)
T ss_pred HHHHHHHHHHHHHHHCCCC
Confidence 5689999999987766544
No 123
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=35.12 E-value=23 Score=34.21 Aligned_cols=13 Identities=31% Similarity=0.552 Sum_probs=11.2
Q ss_pred HHHHHHHHHHhhC
Q psy2976 150 FGIVVHELGHVVG 162 (184)
Q Consensus 150 ~g~i~HElgHALG 162 (184)
+-|+-|||||||=
T Consensus 465 V~TlFHEfGHalH 477 (681)
T PRK10280 465 VITLFHEFGHTLH 477 (681)
T ss_pred HHHHHHHHHHHHH
Confidence 5689999999984
No 124
>PRK04860 hypothetical protein; Provisional
Probab=34.25 E-value=26 Score=27.59 Aligned_cols=13 Identities=38% Similarity=0.539 Sum_probs=11.1
Q ss_pred hHHHHHHHHHHhh
Q psy2976 149 KFGIVVHELGHVV 161 (184)
Q Consensus 149 ~~g~i~HElgHAL 161 (184)
..++|.||+.|.+
T Consensus 63 l~~~v~HEl~H~~ 75 (160)
T PRK04860 63 IDEVVPHELAHLL 75 (160)
T ss_pred HHhHHHHHHHHHH
Confidence 4689999999975
No 125
>PLN02791 Nudix hydrolase homolog
Probab=33.78 E-value=20 Score=35.15 Aligned_cols=20 Identities=20% Similarity=0.124 Sum_probs=16.5
Q ss_pred HHHHHHHHHHhhCCCCCCCC
Q psy2976 150 FGIVVHELGHVVGFWHEHTR 169 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh~R 169 (184)
..|+.||+||.+|=.+....
T Consensus 586 ~~v~lHElgHGsG~~~~~~~ 605 (770)
T PLN02791 586 THTICHECCHGIGPHTITLP 605 (770)
T ss_pred HHHHHHHhhccccccceecC
Confidence 57999999999998776554
No 126
>PRK15410 DgsA anti-repressor MtfA; Provisional
Probab=32.82 E-value=21 Score=30.45 Aligned_cols=18 Identities=22% Similarity=0.650 Sum_probs=14.9
Q ss_pred chHHHHHHHHHHhhCCCC
Q psy2976 148 DKFGIVVHELGHVVGFWH 165 (184)
Q Consensus 148 ~~~g~i~HElgHALG~~H 165 (184)
...-+++|||.|.|=+..
T Consensus 141 dg~NVvIHEFAH~LDm~~ 158 (260)
T PRK15410 141 SGFNLIIHEVAHKLDMRN 158 (260)
T ss_pred CCcchhHhHHHhHhhhhc
Confidence 345699999999998866
No 127
>PF10460 Peptidase_M30: Peptidase M30; InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue.
Probab=32.58 E-value=28 Score=31.15 Aligned_cols=16 Identities=25% Similarity=0.789 Sum_probs=13.6
Q ss_pred hHHHHHHHHHHhhCCC
Q psy2976 149 KFGIVVHELGHVVGFW 164 (184)
Q Consensus 149 ~~g~i~HElgHALG~~ 164 (184)
..+|++|||.|.+=|.
T Consensus 139 ~~sTlAHEfQHmInfy 154 (366)
T PF10460_consen 139 VYSTLAHEFQHMINFY 154 (366)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4689999999999665
No 128
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=30.22 E-value=24 Score=32.68 Aligned_cols=11 Identities=36% Similarity=0.492 Sum_probs=9.2
Q ss_pred HHHHHHHHHhh
Q psy2976 151 GIVVHELGHVV 161 (184)
Q Consensus 151 g~i~HElgHAL 161 (184)
-.++||.||||
T Consensus 261 ~g~iHE~GHAl 271 (497)
T COG2317 261 FGTIHETGHAL 271 (497)
T ss_pred HHHHHhhhhHH
Confidence 35789999997
No 129
>KOG0201|consensus
Probab=27.43 E-value=20 Score=32.88 Aligned_cols=30 Identities=23% Similarity=0.450 Sum_probs=26.1
Q ss_pred HHHHHHHHHHhhCCCCCCCCCCCCCceEeecccC
Q psy2976 150 FGIVVHELGHVVGFWHEHTRPDRDNHVQIIRENI 183 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh~RpDRD~yV~I~~~NI 183 (184)
.+++++|+++||+|.|.+.---|| |.=.||
T Consensus 113 i~~ilre~l~~l~ylH~~~kiHrD----IKaani 142 (467)
T KOG0201|consen 113 IAVILREVLKGLDYLHSEKKIHRD----IKAANI 142 (467)
T ss_pred eeeehHHHHHHhhhhhhcceeccc----ccccce
Confidence 578999999999999999988888 666665
No 130
>smart00393 R3H Putative single-stranded nucleic acids-binding domain.
Probab=27.16 E-value=73 Score=21.52 Aligned_cols=39 Identities=21% Similarity=0.156 Sum_probs=28.5
Q ss_pred eecCCCCchHHHHHHHHHHhhCCCCCCCCCCCCCceEee
Q psy2976 141 ISIGKNCDKFGIVVHELGHVVGFWHEHTRPDRDNHVQII 179 (184)
Q Consensus 141 isl~~~c~~~g~i~HElgHALG~~HEh~RpDRD~yV~I~ 179 (184)
+.|.+=.....-++|+++..+|+.++..-..-+.+|.|.
T Consensus 39 ~~~~pm~~~~R~~iH~~a~~~~l~s~S~g~g~~R~vvv~ 77 (79)
T smart00393 39 VELPPMNSYERKIVHELAEKYGLESESFGEGPKRRVVIS 77 (79)
T ss_pred EEcCCCCHHHHHHHHHHHHHcCCEEEEEcCCCCcEEEEE
Confidence 333332235678999999999999998777777777764
No 131
>PF04450 BSP: Peptidase of plants and bacteria; InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=26.00 E-value=44 Score=27.35 Aligned_cols=16 Identities=25% Similarity=0.401 Sum_probs=13.0
Q ss_pred hHHHHHHHHHHhhCCC
Q psy2976 149 KFGIVVHELGHVVGFW 164 (184)
Q Consensus 149 ~~g~i~HElgHALG~~ 164 (184)
..|++.||+.|+.-..
T Consensus 96 i~Gvl~HE~~H~~Q~~ 111 (205)
T PF04450_consen 96 IIGVLYHEMVHCWQWD 111 (205)
T ss_pred HHHHHHHHHHHHhhcC
Confidence 4799999999996543
No 132
>PF01434 Peptidase_M41: Peptidase family M41 This is family M41 in the peptidase classification. ; InterPro: IPR000642 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The peptidase M41 family belong to a larger family of zinc metalloproteases. This family includes the cell division protein FtsH, and the yeast mitochondrial respiratory chain complexes assembly protein, which is a putative ATP-dependent protease required for assembly of the mitochondrial respiratory chain and ATPase complexes. FtsH is an integral membrane protein, which seems to act as an ATP-dependent zinc metallopeptidase that binds one zinc ion.; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0006508 proteolysis; PDB: 4EIW_C 2DHR_E 1IY1_A 1IY2_A 1IY0_A 1IXZ_A 2CE7_F 2CEA_F 3KDS_E 2QZ4_A ....
Probab=24.24 E-value=45 Score=27.04 Aligned_cols=13 Identities=54% Similarity=0.680 Sum_probs=11.3
Q ss_pred hHHHHHHHHHHhh
Q psy2976 149 KFGIVVHELGHVV 161 (184)
Q Consensus 149 ~~g~i~HElgHAL 161 (184)
..-++.||-||||
T Consensus 28 ~~~~A~HEAGhAv 40 (213)
T PF01434_consen 28 KRRIAYHEAGHAV 40 (213)
T ss_dssp HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHH
Confidence 4679999999997
No 133
>PF02074 Peptidase_M32: Carboxypeptidase Taq (M32) metallopeptidase; InterPro: IPR001333 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M32 (carboxypeptidase Taq family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Carboxypeptidase Taq is a zinc-containing thermostable metallopeptidase. It was originally discovered and purified from Thermus aquaticus; optimal enzymatic activity occurs at 80 celcius. Although very little is known about this enzyme, it is thought either to be associated with a membrane or to be particle bound.; GO: 0004181 metallocarboxypeptidase activity, 0006508 proteolysis; PDB: 1K9X_A 1KA4_A 1KA2_A 3DWC_A 1WGZ_A 3HQ2_A 3HOA_B.
Probab=24.20 E-value=43 Score=31.15 Aligned_cols=13 Identities=31% Similarity=0.526 Sum_probs=10.7
Q ss_pred hHHHHHHHHHHhh
Q psy2976 149 KFGIVVHELGHVV 161 (184)
Q Consensus 149 ~~g~i~HElgHAL 161 (184)
..-.++||.||||
T Consensus 259 ~l~s~iHE~GHal 271 (494)
T PF02074_consen 259 ALFSTIHETGHAL 271 (494)
T ss_dssp HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHH
Confidence 3567899999997
No 134
>PF09768 Peptidase_M76: Peptidase M76 family; InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=23.17 E-value=48 Score=26.47 Aligned_cols=14 Identities=29% Similarity=0.228 Sum_probs=11.7
Q ss_pred hHHHHHHHHHHhhC
Q psy2976 149 KFGIVVHELGHVVG 162 (184)
Q Consensus 149 ~~g~i~HElgHALG 162 (184)
...+++|||-||--
T Consensus 71 l~~~l~HELIHayD 84 (173)
T PF09768_consen 71 LEDTLTHELIHAYD 84 (173)
T ss_pred HHHHHHHHHHHHHH
Confidence 35799999999964
No 135
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=22.56 E-value=49 Score=32.04 Aligned_cols=24 Identities=17% Similarity=0.279 Sum_probs=16.4
Q ss_pred HHHHHHHHHhhcccceEEEEcCCC
Q psy2976 89 ALFKQAMRHWENFTCVKFVERSST 112 (184)
Q Consensus 89 ~~I~~A~~~we~~TcV~F~e~~~~ 112 (184)
++|.--++..+.-=.|+|+|..+.
T Consensus 365 ~Vl~GlF~~~~rLfGI~~~e~~~~ 388 (683)
T COG0339 365 KVLEGLFEVAKRLFGITFVERKDI 388 (683)
T ss_pred HHHHHHHHHHHHHcCeEEEECCCC
Confidence 455445567777778999997543
No 136
>KOG2719|consensus
Probab=22.40 E-value=46 Score=30.44 Aligned_cols=13 Identities=38% Similarity=0.764 Sum_probs=10.8
Q ss_pred chHHHHHHHHHHh
Q psy2976 148 DKFGIVVHELGHV 160 (184)
Q Consensus 148 ~~~g~i~HElgHA 160 (184)
+..++++||+||-
T Consensus 279 el~AVl~HELGHW 291 (428)
T KOG2719|consen 279 ELVAVLAHELGHW 291 (428)
T ss_pred HHHHHHHHHhhHH
Confidence 3578999999994
No 137
>KOG3314|consensus
Probab=21.66 E-value=56 Score=26.27 Aligned_cols=24 Identities=29% Similarity=0.439 Sum_probs=16.9
Q ss_pred HHHHHHHHHHhhCCCCCCCCCCCCCceEeecccC
Q psy2976 150 FGIVVHELGHVVGFWHEHTRPDRDNHVQIIRENI 183 (184)
Q Consensus 150 ~g~i~HElgHALG~~HEh~RpDRD~yV~I~~~NI 183 (184)
.-+++|||.||.- +-|. .|+|.|+
T Consensus 92 n~vv~HElIH~fD----d~r~------~vDW~Nl 115 (194)
T KOG3314|consen 92 NQVVIHELIHAFD----DCRA------KVDWTNL 115 (194)
T ss_pred HHHHHHHHHHHHH----hhhh------hcccchh
Confidence 4699999999973 2222 2778876
No 138
>PF07023 DUF1315: Protein of unknown function (DUF1315); InterPro: IPR009749 This family consists of several bacterial proteins of around 90 residues in length. The function of this family is unknown.
Probab=21.59 E-value=1.3e+02 Score=21.67 Aligned_cols=47 Identities=15% Similarity=0.160 Sum_probs=33.2
Q ss_pred CCCcCCCceEEEEcCCCCHHHHHHHHHHHHHhhcccceEEEEcCCCCCCcEEEEe
Q psy2976 67 ERIWDYGVIPYEIDSNFGGQHKALFKQAMRHWENFTCVKFVERSSTEHPNYILFT 121 (184)
Q Consensus 67 ~~~Wp~g~IpY~~~~~~~~~~r~~I~~A~~~we~~TcV~F~e~~~~~~~~~I~f~ 121 (184)
..|||+| ..+++++|+.-.+|+=.|+...++.=.+.+-. ...||...
T Consensus 22 lGKWpDG-------~~LT~eQre~~mQaVm~yq~~~n~~~e~~ti~-~~G~I~~k 68 (93)
T PF07023_consen 22 LGKWPDG-------RALTPEQRESCMQAVMLYQARHNLPEEHRTIG-ADGYIDMK 68 (93)
T ss_pred hCcCCCC-------CCCCHHHHHHHHHHHHHHHHHcCCCHHHcccC-CCCEEccc
Confidence 6799998 46899999999999999998765532333321 23566664
No 139
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=21.25 E-value=58 Score=30.11 Aligned_cols=16 Identities=38% Similarity=0.738 Sum_probs=13.3
Q ss_pred chHHHHHHHHHHhhCC
Q psy2976 148 DKFGIVVHELGHVVGF 163 (184)
Q Consensus 148 ~~~g~i~HElgHALG~ 163 (184)
..+-++.||+||+..|
T Consensus 246 ~D~~t~~HE~GH~~yy 261 (477)
T cd06461 246 EDFVTVHHEMGHIQYY 261 (477)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3578999999999954
No 140
>KOG3854|consensus
Probab=21.02 E-value=84 Score=29.20 Aligned_cols=40 Identities=23% Similarity=0.570 Sum_probs=24.5
Q ss_pred CCCcccccCCCCCCcceeecCCC-Cch----HHHHHHHHHHhhCC
Q psy2976 124 PCGCCSFVGKRGNGQQAISIGKN-CDK----FGIVVHELGHVVGF 163 (184)
Q Consensus 124 ~~gC~S~vG~~~~g~q~isl~~~-c~~----~g~i~HElgHALG~ 163 (184)
..||....-+.+.+...|.|+.. |.+ .-|++||+.||.-+
T Consensus 321 tag~cl~~~~~~kr~A~IeLs~kV~tTAERir~TLiHEmCHaAaW 365 (505)
T KOG3854|consen 321 TAGCCLNGDAHGKRYAKIELSDKVCTTAERIRDTLIHEMCHAAAW 365 (505)
T ss_pred cccccccchhccCceeEEEehhhhhhHHHHHHHHHHHHHHHHHHh
Confidence 45665543333334556777654 543 34799999999744
No 141
>PF05452 Clavanin: Clavanin; InterPro: IPR008453 This family consists of clavanin proteins from the haemocytes of the invertebrate Styela clava (Sea squirt), a solitary tunicate. The family is made up of four alpha-helical antimicrobial peptides, clavanins A, B, C and D. The tunicate peptides resemble magainins in size, primary sequence and antibacterial activity. Synthetic clavanin A displays comparable antimicrobial activity to magainins and cecropins. The presence of alpha-helical antimicrobial peptides in the haemocytes of a urochordate suggests that such peptides are primeval effectors of innate immunity in the vertebrate lineage [].; GO: 0005576 extracellular region
Probab=20.76 E-value=28 Score=23.74 Aligned_cols=13 Identities=46% Similarity=0.797 Sum_probs=8.1
Q ss_pred HHHHHHHHHHhhC
Q psy2976 150 FGIVVHELGHVVG 162 (184)
Q Consensus 150 ~g~i~HElgHALG 162 (184)
.|..+|-|+||||
T Consensus 41 VGnfVhgfshvfg 53 (80)
T PF05452_consen 41 VGNFVHGFSHVFG 53 (80)
T ss_pred hhhhhhcceeecc
Confidence 3555666667766
No 142
>KOG2661|consensus
Probab=20.76 E-value=56 Score=29.23 Aligned_cols=12 Identities=33% Similarity=0.573 Sum_probs=10.3
Q ss_pred HHHHHHHHHHhh
Q psy2976 150 FGIVVHELGHVV 161 (184)
Q Consensus 150 ~g~i~HElgHAL 161 (184)
..+++||++||+
T Consensus 276 AtvLgHE~aHaV 287 (424)
T KOG2661|consen 276 ATVLGHEIAHAV 287 (424)
T ss_pred HHHHHHHHHHHH
Confidence 468999999996
No 143
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=20.47 E-value=55 Score=24.63 Aligned_cols=15 Identities=40% Similarity=0.997 Sum_probs=11.8
Q ss_pred CC-chHHHHHHHHHHh
Q psy2976 146 NC-DKFGIVVHELGHV 160 (184)
Q Consensus 146 ~c-~~~g~i~HElgHA 160 (184)
.| +...+++||++|-
T Consensus 76 s~~ekvKviiHEllHI 91 (133)
T COG4900 76 SCAEKVKVIIHELLHI 91 (133)
T ss_pred ChHHHHHHHHHHHhcC
Confidence 35 3578999999994
No 144
>PF10023 DUF2265: Predicted aminopeptidase (DUF2265); InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=20.08 E-value=79 Score=28.04 Aligned_cols=13 Identities=31% Similarity=0.493 Sum_probs=11.1
Q ss_pred hHHHHHHHHHHhh
Q psy2976 149 KFGIVVHELGHVV 161 (184)
Q Consensus 149 ~~g~i~HElgHAL 161 (184)
....|.|||.|..
T Consensus 165 LA~LIfHELaHq~ 177 (337)
T PF10023_consen 165 LARLIFHELAHQT 177 (337)
T ss_pred HHHHHHHHHhhce
Confidence 4689999999975
Done!