Query psy14250
Match_columns 154
No_of_seqs 110 out of 1100
Neff 7.5
Searched_HMMs 46136
Date Fri Aug 16 23:12:18 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14250.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14250hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd04272 ZnMc_salivary_gland_MP 99.9 1.9E-26 4.1E-31 179.7 9.3 103 2-111 116-220 (220)
2 KOG3607|consensus 99.9 7.2E-26 1.6E-30 200.5 8.8 124 2-141 292-418 (716)
3 PF01421 Reprolysin: Reprolysi 99.9 3.6E-26 7.9E-31 175.3 4.9 96 4-114 102-199 (199)
4 cd04269 ZnMc_adamalysin_II_lik 99.9 2E-25 4.3E-30 170.5 8.4 93 2-112 100-194 (194)
5 cd04273 ZnMc_ADAMTS_like Zinc- 99.9 9.3E-26 2E-30 174.3 6.5 93 2-111 112-207 (207)
6 cd04271 ZnMc_ADAM_fungal Zinc- 99.9 9.2E-25 2E-29 171.4 7.4 103 2-111 109-228 (228)
7 cd04267 ZnMc_ADAM_like Zinc-de 99.9 2.6E-22 5.6E-27 152.9 8.0 89 2-103 103-192 (192)
8 cd04270 ZnMc_TACE_like Zinc-de 99.9 3E-22 6.5E-27 158.6 7.7 105 2-114 114-241 (244)
9 KOG3658|consensus 99.9 5.2E-22 1.1E-26 171.5 7.2 128 2-142 339-487 (764)
10 KOG3538|consensus 99.8 1.1E-19 2.3E-24 165.0 7.8 131 3-151 290-422 (845)
11 PF13574 Reprolysin_2: Metallo 99.8 5.1E-20 1.1E-24 138.9 1.9 92 2-103 81-172 (173)
12 cd00203 ZnMc Zinc-dependent me 99.6 7.4E-15 1.6E-19 108.8 7.0 93 3-101 66-166 (167)
13 PF13583 Reprolysin_4: Metallo 99.5 5.2E-14 1.1E-18 109.0 7.8 98 2-109 107-205 (206)
14 PF13688 Reprolysin_5: Metallo 99.5 1E-14 2.2E-19 111.2 0.8 86 5-91 108-196 (196)
15 PF13582 Reprolysin_3: Metallo 98.8 1.2E-09 2.6E-14 77.3 2.2 46 3-54 77-124 (124)
16 PF00413 Peptidase_M10: Matrix 97.9 7.7E-06 1.7E-10 59.6 3.0 48 36-100 104-151 (154)
17 PRK13267 archaemetzincin-like 97.7 3.8E-05 8.1E-10 58.4 3.6 44 6-55 85-143 (179)
18 cd04279 ZnMc_MMP_like_1 Zinc-d 97.7 4.2E-05 9.2E-10 56.3 3.6 49 37-101 104-154 (156)
19 cd04278 ZnMc_MMP Zinc-dependen 97.4 0.00017 3.7E-09 53.1 3.4 46 37-99 107-153 (157)
20 cd04268 ZnMc_MMP_like Zinc-dep 97.3 0.00021 4.6E-09 52.6 3.5 21 37-57 94-114 (165)
21 PF02031 Peptidase_M7: Strepto 97.3 0.00011 2.4E-09 52.7 1.3 24 35-58 75-98 (132)
22 COG1913 Predicted Zn-dependent 97.2 0.00021 4.5E-09 53.9 1.8 41 37-94 124-166 (181)
23 smart00235 ZnMc Zinc-dependent 97.0 0.00031 6.7E-09 50.6 1.5 20 38-57 87-106 (140)
24 cd04277 ZnMc_serralysin_like Z 96.6 0.00092 2E-08 50.5 1.6 22 37-58 113-134 (186)
25 PF05572 Peptidase_M43: Pregna 96.6 0.001 2.2E-08 49.4 1.4 24 35-58 67-90 (154)
26 PF07998 Peptidase_M54: Peptid 96.4 0.0014 3E-08 50.5 1.3 27 37-81 145-171 (194)
27 cd04283 ZnMc_hatching_enzyme Z 96.3 0.0022 4.8E-08 48.9 2.0 21 38-58 78-98 (182)
28 cd04281 ZnMc_BMP1_TLD Zinc-dep 96.2 0.0022 4.9E-08 49.5 1.6 21 38-58 88-108 (200)
29 PF05548 Peptidase_M11: Gameto 96.2 0.0043 9.3E-08 51.1 3.2 20 38-57 151-170 (314)
30 PF12388 Peptidase_M57: Dual-a 96.1 0.0025 5.4E-08 49.6 1.3 20 37-56 133-152 (211)
31 cd04280 ZnMc_astacin_like Zinc 96.1 0.0032 7E-08 47.7 1.8 21 38-58 75-95 (180)
32 cd04282 ZnMc_meprin Zinc-depen 95.9 0.0048 1E-07 48.7 2.0 21 38-58 121-141 (230)
33 cd04327 ZnMc_MMP_like_3 Zinc-d 95.7 0.0047 1E-07 47.3 1.4 20 39-58 94-113 (198)
34 cd04275 ZnMc_pappalysin_like Z 95.7 0.0024 5.2E-08 50.3 -0.3 24 35-58 135-158 (225)
35 PF01400 Astacin: Astacin (Pep 95.6 0.0036 7.8E-08 47.8 0.2 21 38-58 80-100 (191)
36 cd04276 ZnMc_MMP_like_2 Zinc-d 95.4 0.0074 1.6E-07 46.6 1.5 20 38-57 117-136 (197)
37 KOG3714|consensus 95.3 0.0081 1.8E-07 51.2 1.4 21 38-58 160-180 (411)
38 KOG1565|consensus 94.9 0.016 3.5E-07 50.2 2.2 48 35-102 209-257 (469)
39 PF10462 Peptidase_M66: Peptid 94.7 0.015 3.2E-07 47.8 1.3 18 36-53 192-209 (305)
40 TIGR03296 M6dom_TIGR03296 M6 f 93.9 0.018 4E-07 46.6 0.3 39 37-81 165-203 (286)
41 COG3824 Predicted Zn-dependent 90.5 0.097 2.1E-06 37.3 0.5 20 37-56 109-128 (136)
42 PF12725 DUF3810: Protein of u 90.5 0.11 2.4E-06 42.8 0.9 26 31-56 190-215 (318)
43 PF12044 Metallopep: Putative 90.0 0.19 4.1E-06 43.1 1.9 20 38-57 317-336 (423)
44 PF06114 DUF955: Domain of unk 89.3 0.17 3.8E-06 34.2 1.0 24 26-49 31-54 (122)
45 COG2856 Predicted Zn peptidase 87.2 0.24 5.2E-06 38.7 0.7 16 35-50 70-85 (213)
46 TIGR02232 myxo_disulf_rpt Myxo 86.5 0.32 6.9E-06 27.7 0.8 16 122-137 2-17 (38)
47 PF06262 DUF1025: Possibl zinc 85.3 0.28 6.1E-06 33.7 0.1 18 37-54 73-90 (97)
48 PF05547 Peptidase_M6: Immune 85.0 0.22 4.8E-06 44.9 -0.6 40 36-81 220-259 (645)
49 PF09471 Peptidase_M64: IgA Pe 82.6 0.6 1.3E-05 37.6 1.0 20 37-56 216-235 (264)
50 KOG4525|consensus 82.6 0.79 1.7E-05 39.6 1.8 20 38-57 302-321 (614)
51 COG4783 Putative Zn-dependent 82.4 0.56 1.2E-05 40.7 0.8 14 37-50 130-143 (484)
52 TIGR00181 pepF oligoendopeptid 81.0 1.3 2.9E-05 39.2 2.7 22 28-49 369-390 (591)
53 PF01435 Peptidase_M48: Peptid 79.1 0.9 1.9E-05 34.5 0.9 14 37-50 89-102 (226)
54 PF13948 DUF4215: Domain of un 77.4 1.3 2.9E-05 26.1 1.1 18 121-138 18-35 (47)
55 TIGR02289 M3_not_pepF oligoend 76.4 2.2 4.9E-05 37.6 2.7 22 28-49 328-349 (549)
56 PF11350 DUF3152: Protein of u 74.7 1.6 3.5E-05 33.9 1.2 19 37-55 139-157 (203)
57 COG1164 Oligoendopeptidase F [ 74.5 2.6 5.6E-05 37.8 2.6 19 32-50 375-393 (598)
58 PF13699 DUF4157: Domain of un 74.4 1.3 2.9E-05 29.0 0.6 13 37-49 61-73 (79)
59 PF01863 DUF45: Protein of unk 74.2 1.9 4E-05 32.6 1.5 20 37-56 164-183 (205)
60 cd06459 M3B_Oligoendopeptidase 73.8 3.1 6.7E-05 34.9 2.8 17 33-49 218-234 (427)
61 PF01431 Peptidase_M13: Peptid 72.7 1.6 3.5E-05 33.1 0.8 13 37-49 36-48 (206)
62 PF04228 Zn_peptidase: Putativ 71.5 1.6 3.5E-05 35.7 0.6 14 35-48 168-181 (292)
63 PRK03982 heat shock protein Ht 70.4 2 4.3E-05 34.8 0.9 13 37-49 125-137 (288)
64 PRK03001 M48 family peptidase; 70.2 2 4.4E-05 34.6 0.9 13 37-49 124-136 (283)
65 COG0501 HtpX Zn-dependent prot 69.9 2.1 4.5E-05 34.2 0.9 13 37-49 157-169 (302)
66 PF01457 Peptidase_M8: Leishma 69.8 2.3 5E-05 37.4 1.2 17 37-53 210-226 (521)
67 PRK01345 heat shock protein Ht 69.0 2.2 4.8E-05 35.1 0.9 13 37-49 124-136 (317)
68 TIGR02290 M3_fam_3 oligoendope 68.4 3.3 7.2E-05 36.7 1.9 18 32-49 370-387 (587)
69 PRK03072 heat shock protein Ht 68.1 2.4 5.2E-05 34.4 0.9 14 37-50 127-140 (288)
70 PF13485 Peptidase_MA_2: Pepti 67.4 2.4 5.1E-05 28.7 0.6 14 37-50 25-38 (128)
71 PF01447 Peptidase_M4: Thermol 67.3 3 6.4E-05 30.8 1.2 12 38-49 136-147 (150)
72 PRK04897 heat shock protein Ht 65.6 2.8 6E-05 34.2 0.8 13 37-49 137-149 (298)
73 PF01432 Peptidase_M3: Peptida 65.4 2.6 5.7E-05 36.0 0.7 15 35-49 240-254 (458)
74 COG5549 Predicted Zn-dependent 65.3 2.8 6E-05 32.9 0.7 21 36-56 186-207 (236)
75 PRK05457 heat shock protein Ht 65.0 3 6.5E-05 33.8 0.9 13 37-49 134-146 (284)
76 cd06455 M3A_TOP Peptidase M3 T 64.2 2.8 6.2E-05 36.2 0.7 15 35-49 261-275 (472)
77 COG3590 PepO Predicted metallo 63.9 3.2 6.8E-05 37.1 0.9 22 37-58 487-511 (654)
78 PRK02391 heat shock protein Ht 63.5 3.2 6.9E-05 33.9 0.8 13 37-49 133-145 (296)
79 PRK01265 heat shock protein Ht 62.1 3.5 7.6E-05 34.2 0.8 12 37-48 140-151 (324)
80 KOG3624|consensus 61.7 4.4 9.6E-05 36.6 1.5 21 37-57 518-541 (687)
81 COG2321 Predicted metalloprote 61.4 3.4 7.4E-05 33.6 0.6 16 34-49 164-179 (295)
82 PRK02870 heat shock protein Ht 61.3 3.8 8.3E-05 34.2 0.9 12 37-48 173-184 (336)
83 KOG2719|consensus 61.0 4.9 0.00011 34.6 1.5 36 6-48 249-291 (428)
84 PTZ00337 surface protease GP63 60.6 3.5 7.5E-05 36.8 0.5 17 37-53 229-245 (567)
85 PF08325 WLM: WLM domain; Int 58.5 5.5 0.00012 30.4 1.3 21 36-56 81-101 (186)
86 PF12315 DUF3633: Protein of u 57.2 6.6 0.00014 30.7 1.5 20 31-50 87-106 (212)
87 cd06457 M3A_MIP Peptidase M3 m 56.3 4.8 0.0001 34.8 0.7 14 36-49 247-260 (458)
88 COG1451 Predicted metal-depend 55.0 5.6 0.00012 31.2 0.8 19 37-55 175-193 (223)
89 PF13402 M60-like: Peptidase M 54.8 4.9 0.00011 32.3 0.5 15 38-52 220-234 (307)
90 KOG2414|consensus 54.7 5.2 0.00011 34.4 0.6 19 40-58 399-418 (488)
91 PF04298 Zn_peptidase_2: Putat 54.6 5.5 0.00012 31.4 0.7 25 25-49 74-101 (222)
92 PTZ00257 Glycoprotein GP63 (le 54.6 6.2 0.00013 35.6 1.1 16 37-52 256-271 (622)
93 smart00050 DISIN Homologues of 54.3 5.8 0.00012 25.9 0.7 12 130-141 1-12 (75)
94 PF13398 Peptidase_M50B: Pepti 52.7 5.4 0.00012 30.6 0.4 18 35-52 20-37 (200)
95 PF08219 TOM13: Outer membrane 51.8 7.3 0.00016 25.5 0.8 17 36-52 51-67 (77)
96 cd02639 R3H_RRM R3H domain of 50.8 8.7 0.00019 23.9 1.1 20 38-57 30-49 (60)
97 cd06258 Peptidase_M3_like The 50.7 6.5 0.00014 32.4 0.6 15 35-49 152-166 (365)
98 PF14247 DUF4344: Domain of un 49.9 6.8 0.00015 30.8 0.6 13 37-49 92-104 (220)
99 PF14891 Peptidase_M91: Effect 48.9 8.2 0.00018 28.8 0.8 20 37-56 103-122 (174)
100 KOG1047|consensus 47.5 8.3 0.00018 34.4 0.8 13 37-49 288-300 (613)
101 PF02163 Peptidase_M50: Peptid 46.3 8.8 0.00019 28.6 0.7 20 38-57 8-27 (192)
102 cd06456 M3A_DCP_Oligopeptidase 45.8 8.8 0.00019 32.9 0.7 15 35-49 206-220 (422)
103 cd06460 M32_Taq Peptidase fami 44.7 9.4 0.0002 32.6 0.7 15 35-49 157-171 (396)
104 COG2738 Predicted Zn-dependent 44.2 9.4 0.0002 29.7 0.5 20 35-54 90-109 (226)
105 TIGR02411 leuko_A4_hydro leuko 42.0 12 0.00025 33.7 0.8 13 37-49 279-291 (601)
106 cd06163 S2P-M50_PDZ_RseP-like 42.0 11 0.00024 28.6 0.6 15 38-52 10-24 (182)
107 KOG2661|consensus 41.3 12 0.00025 31.6 0.7 13 37-49 275-287 (424)
108 smart00731 SprT SprT homologue 41.2 11 0.00025 27.2 0.6 14 37-50 59-72 (146)
109 PF10463 Peptidase_U49: Peptid 40.7 12 0.00025 29.2 0.6 20 36-56 100-119 (206)
110 PRK04351 hypothetical protein; 39.7 12 0.00027 27.5 0.5 13 37-49 61-73 (149)
111 PRK04860 hypothetical protein; 39.7 12 0.00027 27.8 0.6 13 37-49 63-75 (160)
112 cd02641 R3H_Smubp-2_like R3H d 39.2 16 0.00035 22.5 1.0 21 37-57 29-49 (60)
113 cd05709 S2P-M50 Site-2 proteas 39.2 13 0.00029 27.4 0.7 19 38-56 9-27 (180)
114 cd06164 S2P-M50_SpoIVFB_CBS Sp 36.0 16 0.00034 28.6 0.7 20 37-56 53-72 (227)
115 PRK10911 oligopeptidase A; Pro 35.8 15 0.00033 33.5 0.6 14 36-49 462-475 (680)
116 cd06161 S2P-M50_SpoIVFB SpoIVF 35.0 17 0.00037 27.9 0.7 20 37-56 38-57 (208)
117 COG3091 SprT Zn-dependent meta 34.8 17 0.00036 27.1 0.6 13 37-49 61-73 (156)
118 PF10263 SprT-like: SprT-like 34.8 17 0.00038 26.1 0.7 15 37-51 60-74 (157)
119 KOG1046|consensus 34.6 18 0.00038 34.1 0.8 13 37-49 324-336 (882)
120 PF02130 UPF0054: Uncharacteri 33.6 29 0.00063 25.2 1.7 20 37-56 108-127 (145)
121 TIGR00043 metalloprotein, YbeY 33.3 29 0.00062 24.1 1.6 21 37-57 73-93 (110)
122 PRK00016 metal-binding heat sh 32.8 26 0.00056 26.0 1.4 21 37-57 113-133 (159)
123 COG4784 Putative Zn-dependent 32.6 18 0.0004 30.6 0.6 13 37-49 124-136 (479)
124 PF14521 Aspzincin_M35: Lysine 32.6 23 0.0005 25.7 1.0 21 35-55 94-116 (148)
125 PF01433 Peptidase_M1: Peptida 32.3 21 0.00046 29.4 0.9 14 37-50 295-309 (390)
126 TIGR02412 pepN_strep_liv amino 31.9 21 0.00046 33.3 0.9 13 37-49 287-299 (831)
127 PHA02456 zinc metallopeptidase 30.6 24 0.00052 25.0 0.8 13 37-49 79-91 (141)
128 cd06160 S2P-M50_like_2 Unchara 30.5 22 0.00049 26.9 0.7 21 37-57 41-61 (183)
129 cd06159 S2P-M50_PDZ_Arch Uncha 30.1 22 0.00048 28.6 0.6 20 37-56 118-137 (263)
130 TIGR00054 RIP metalloprotease 29.9 21 0.00046 30.4 0.5 15 37-51 14-28 (420)
131 COG0308 PepN Aminopeptidase N 27.2 27 0.00059 32.7 0.8 13 37-49 307-319 (859)
132 KOG2737|consensus 27.0 21 0.00045 30.7 -0.0 21 37-57 359-380 (492)
133 COG4900 Predicted metallopepti 26.7 30 0.00065 24.5 0.7 12 37-48 80-91 (133)
134 PRK10779 zinc metallopeptidase 26.5 27 0.00059 30.0 0.6 15 37-51 15-29 (449)
135 PRK09672 phage exclusion prote 26.2 30 0.00065 28.5 0.8 19 36-55 164-182 (305)
136 PLN02791 Nudix hydrolase homol 25.7 31 0.00068 32.1 0.8 16 38-53 586-601 (770)
137 cd02640 R3H_NRF R3H domain of 25.4 41 0.00089 20.8 1.1 21 37-57 29-49 (60)
138 PRK14015 pepN aminopeptidase N 25.3 32 0.0007 32.4 0.9 13 37-49 296-308 (875)
139 cd06162 S2P-M50_PDZ_SREBP Ster 24.0 33 0.00071 27.9 0.6 18 38-55 136-153 (277)
140 TIGR02414 pepN_proteo aminopep 22.8 39 0.00084 31.9 0.9 13 37-49 283-295 (863)
141 PRK10280 dipeptidyl carboxypep 22.4 37 0.0008 31.1 0.7 13 37-49 464-476 (681)
142 cd06007 R3H_DEXH_helicase R3H 21.4 54 0.0012 20.2 1.1 21 37-57 28-48 (59)
143 PF05569 Peptidase_M56: BlaR1 21.2 45 0.00097 26.7 0.9 12 38-49 197-208 (299)
144 cd06461 M2_ACE Peptidase famil 21.0 41 0.00089 29.4 0.6 14 37-50 247-260 (477)
145 COG4219 MecR1 Antirepressor re 20.3 45 0.00099 27.8 0.7 12 37-48 190-201 (337)
146 PF07607 DUF1570: Protein of u 20.1 46 0.00099 23.9 0.6 22 38-59 6-27 (128)
No 1
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=99.93 E-value=1.9e-26 Score=179.65 Aligned_cols=103 Identities=46% Similarity=0.858 Sum_probs=88.1
Q ss_pred CceeeeeccCCcccCCccCCccceEEEEeeCCC-ccchhhhHHhhhhhcCCCCCCCCCC-CCCCCCCCCCCCCCCCceec
Q psy14250 2 PVTHSFAYVGGACVVNKRLEKVNSVAIIEDTGG-FSGIIVAAHEVGHLLGAVHDGSPPP-TYLGGPGAEKCRWEDGFIMS 79 (154)
Q Consensus 2 ~~~~G~A~~g~~C~~~~~~~~~~s~~i~~~~~~-~~~a~~~AHElGH~lG~~HD~~~~~-~~~~~~~~~~C~~~~~~IMs 79 (154)
..+.|+||+|++|+.. +++++++.+. +.++.++||||||+|||.||++..+ .+.+++++..|+++++|||+
T Consensus 116 ~~~~G~A~~g~~C~~~-------~~~~~~d~~~~~~~~~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~~C~~~~~~IM~ 188 (220)
T cd04272 116 TGTGGYAYVGGACTEN-------RVAMGEDTPGSYYGVYTMTHELAHLLGAPHDGSPPPSWVKGHPGSLDCPWDDGYIMS 188 (220)
T ss_pred cCccceEeecCccCCC-------ceeEeecCCCCcccHHHHHHHHHHHhCCCCCCCCCCccCCCCCccCcCCCCCCeEEe
Confidence 3567999999999954 5788888753 6779999999999999999998755 35667888999988899999
Q ss_pred cCCCCCCCCccccCcHHHHHhhhcCCCCcccc
Q psy14250 80 DLRHTEKGFKWSPCSVSSFHHFLNGDTATCLF 111 (154)
Q Consensus 80 ~~~~~~~~~~fS~CS~~~i~~~l~~~~~~CL~ 111 (154)
+.....+++.||+||+++|++||.+..+.||.
T Consensus 189 ~~~~~~~~~~fS~CS~~~~~~~l~~~~~~CL~ 220 (220)
T cd04272 189 YVVNGERQYRFSQCSQRQIRNVFRRLGASCLH 220 (220)
T ss_pred eccCCCCCCccCcchHHHHHHHHcCCCccccC
Confidence 97655677899999999999999998899984
No 2
>KOG3607|consensus
Probab=99.93 E-value=7.2e-26 Score=200.46 Aligned_cols=124 Identities=27% Similarity=0.361 Sum_probs=103.7
Q ss_pred CceeeeeccCCcccCCccCCccceEEEEeeCC--CccchhhhHHhhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCceec
Q psy14250 2 PVTHSFAYVGGACVVNKRLEKVNSVAIIEDTG--GFSGIIVAAHEVGHLLGAVHDGSPPPTYLGGPGAEKCRWEDGFIMS 79 (154)
Q Consensus 2 ~~~~G~A~~g~~C~~~~~~~~~~s~~i~~~~~--~~~~a~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~C~~~~~~IMs 79 (154)
.+++|+||+|+||++.+ +.+|+.+.. ....|.|+||||||+|||.||... +.|.....|||.
T Consensus 292 ~~~~G~a~~~~mCs~~~------s~gv~~~~~~~~~~~a~v~AhelgH~lGm~hD~~~----------c~C~~~~~CiM~ 355 (716)
T KOG3607|consen 292 GKYVGLAYFGGMCSPGH------SGGVNKFHSDILLAFAVVLAHELGHNLGMIHDESS----------CSCPSSGLCIMP 355 (716)
T ss_pred CceeceeecccccCccc------ccceeecCcccchhHHHHHHHHHHhhcCccccccc----------ccCCCCCccccc
Confidence 57899999999999975 577776653 567899999999999999999965 678656789994
Q ss_pred cCCCCCCCCccccCcHHHHHhhhcCCCCcccccCCCCCC-CCCCCCCCeEECcccccccccCC
Q psy14250 80 DLRHTEKGFKWSPCSVSSFHHFLNGDTATCLFNSPHEDE-SLPRVLPGKLLSLDAQCRKDRGT 141 (154)
Q Consensus 80 ~~~~~~~~~~fS~CS~~~i~~~l~~~~~~CL~~~p~~~~-~~~~~cgn~~~e~~eqCd~~~g~ 141 (154)
...+...+..||+||++++..|+.+++..||.|.|.+.. ...++|||++||.+|||||..-.
T Consensus 356 ~~~~~~~~~~FS~CS~~~~~~~~~~~~~~cl~n~p~p~~~~~~~~CGNg~vE~gEECDCG~~~ 418 (716)
T KOG3607|consen 356 EETGFYIPKKFSNCSYQDFESFLLKGGGSCLLNLPSPEKIFSGPVCGNGVVEEGEECDCGTPE 418 (716)
T ss_pred cccCcCcccccccchHHHHHHHHhcCCcceEecCCCcccccCCCccCCcccccCCcCCCCChH
Confidence 445557788999999999999999998999999876543 34789999999999999997544
No 3
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=99.92 E-value=3.6e-26 Score=175.27 Aligned_cols=96 Identities=43% Similarity=0.724 Sum_probs=74.7
Q ss_pred eeeeeccCCcccCCccCCccceEEEEeeCC--CccchhhhHHhhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCceeccC
Q psy14250 4 THSFAYVGGACVVNKRLEKVNSVAIIEDTG--GFSGIIVAAHEVGHLLGAVHDGSPPPTYLGGPGAEKCRWEDGFIMSDL 81 (154)
Q Consensus 4 ~~G~A~~g~~C~~~~~~~~~~s~~i~~~~~--~~~~a~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~C~~~~~~IMs~~ 81 (154)
+.|+||+|+||++.+ +++|+++.+ .+..|.++||||||+|||.||+... |.|.++.+|||++.
T Consensus 102 ~~G~a~~~~~C~~~~------s~~i~~~~~~~~~~~a~~~AHelGH~lGm~HD~~~~---------C~C~~~~~cIM~~~ 166 (199)
T PF01421_consen 102 TVGLAYVGGMCSPSR------SCGIVEDHSRSGLSFAVIIAHELGHNLGMPHDGDNG---------CKCPRRDGCIMSPS 166 (199)
T ss_dssp BSEEE-TT-TTSTTT------SEEEEE-SSSSHHHHHHHHHHHHHHHTT---TTTCT---------HCSSSSSSSTTSSS
T ss_pred eeeeEeCCCCCCcCC------CCcEeeeccchhHHHHHHHHHHHHHhcCCCCCCCCc---------CCcCCCCCcccccc
Confidence 789999999999974 688998873 4678999999999999999999932 78977899999996
Q ss_pred CCCCCCCccccCcHHHHHhhhcCCCCcccccCC
Q psy14250 82 RHTEKGFKWSPCSVSSFHHFLNGDTATCLFNSP 114 (154)
Q Consensus 82 ~~~~~~~~fS~CS~~~i~~~l~~~~~~CL~~~p 114 (154)
........||+||+++|.+|+.++.+.||+|.|
T Consensus 167 ~~~~~~~~fS~CS~~~~~~~l~~~~~~CL~n~P 199 (199)
T PF01421_consen 167 IGPSSSDKFSNCSRRQFEEFLSSGKSSCLLNKP 199 (199)
T ss_dssp SSSSSTSSS-HHHHHHHHHHHHHHTTGGGSS--
T ss_pred ccCCCCCCcCHHHHHHHHHHHccCCCCccCCCC
Confidence 544455589999999999999988899999987
No 4
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=99.92 E-value=2e-25 Score=170.47 Aligned_cols=93 Identities=31% Similarity=0.495 Sum_probs=80.9
Q ss_pred CceeeeeccCCcccCCccCCccceEEEEeeCC--CccchhhhHHhhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCceec
Q psy14250 2 PVTHSFAYVGGACVVNKRLEKVNSVAIIEDTG--GFSGIIVAAHEVGHLLGAVHDGSPPPTYLGGPGAEKCRWEDGFIMS 79 (154)
Q Consensus 2 ~~~~G~A~~g~~C~~~~~~~~~~s~~i~~~~~--~~~~a~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~C~~~~~~IMs 79 (154)
+...|+||+|++|++.+ +++++++.+ .+..|.++||||||+|||.||++. |.|++ ++|||+
T Consensus 100 ~~~~G~A~~g~iC~~~~------~~~v~~~~~~~~~~~a~~~AHElGH~lG~~HD~~~----------c~C~~-~~cIM~ 162 (194)
T cd04269 100 GNTVGLAYVGGMCSPKY------SGGVVQDHSRNLLLFAVTMAHELGHNLGMEHDDGG----------CTCGR-STCIMA 162 (194)
T ss_pred CCceeeeecCCccCCCc------ceEEEEeCCcchHHHHHHHHHHHHhhcCCCcCCCC----------CCCCC-CCeEec
Confidence 56799999999999753 688888876 466799999999999999999986 68986 889999
Q ss_pred cCCCCCCCCccccCcHHHHHhhhcCCCCccccc
Q psy14250 80 DLRHTEKGFKWSPCSVSSFHHFLNGDTATCLFN 112 (154)
Q Consensus 80 ~~~~~~~~~~fS~CS~~~i~~~l~~~~~~CL~~ 112 (154)
+.... +...||+||+++|.+++.++...||++
T Consensus 163 ~~~~~-~~~~fS~CS~~~~~~~l~~~~~~CL~~ 194 (194)
T cd04269 163 PSPSS-LTDAFSNCSYEDYQKFLSRGGGQCLLN 194 (194)
T ss_pred CCCCC-CCCCCChhhHHHHHHHHhcCCCCccCC
Confidence 98654 677999999999999999988999974
No 5
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=99.92 E-value=9.3e-26 Score=174.30 Aligned_cols=93 Identities=42% Similarity=0.687 Sum_probs=80.2
Q ss_pred CceeeeeccCCcccCCccCCccceEEEEeeCCCccchhhhHHhhhhhcCCCCCCCCCCCCCCCCCCCCCCCC--CCceec
Q psy14250 2 PVTHSFAYVGGACVVNKRLEKVNSVAIIEDTGGFSGIIVAAHEVGHLLGAVHDGSPPPTYLGGPGAEKCRWE--DGFIMS 79 (154)
Q Consensus 2 ~~~~G~A~~g~~C~~~~~~~~~~s~~i~~~~~~~~~a~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~C~~~--~~~IMs 79 (154)
..++|+||+|++|++.+ +++++++.+ +..|.++||||||+|||.||++. +.|+++ ++|||+
T Consensus 112 ~~~~G~A~~ggiC~~~~------s~~i~~~~~-~~~a~~~aHElGH~LG~~HD~~~----------~~C~~~~~~~~IM~ 174 (207)
T cd04273 112 CDTLGLAPVGGMCSPSR------SCSINEDTG-LSSAFTIAHELGHVLGMPHDGDG----------NSCGPEGKDGHIMS 174 (207)
T ss_pred CCceEEeccccCCCCCc------ceEEEcCCC-ceeEEeeeeechhhcCCCCCCCC----------CCCCCCCCCceeec
Confidence 36799999999999874 688998875 66799999999999999999886 578764 589999
Q ss_pred cCCCC-CCCCccccCcHHHHHhhhcCCCCcccc
Q psy14250 80 DLRHT-EKGFKWSPCSVSSFHHFLNGDTATCLF 111 (154)
Q Consensus 80 ~~~~~-~~~~~fS~CS~~~i~~~l~~~~~~CL~ 111 (154)
+.... .+++.||+||+++|++|+.++...||+
T Consensus 175 ~~~~~~~~~~~fS~CS~~~~~~~l~~~~~~CL~ 207 (207)
T cd04273 175 PTLGANTGPFTWSKCSRRYLTSFLDTGDGNCLL 207 (207)
T ss_pred ccccCCCCCCCcCHHHHHHHHHHHccCCCcccC
Confidence 98654 377899999999999999998889984
No 6
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=99.91 E-value=9.2e-25 Score=171.41 Aligned_cols=103 Identities=25% Similarity=0.387 Sum_probs=76.8
Q ss_pred CceeeeeccCCcccCCccCCccceEEEEeeCCC-------ccchhhhHHhhhhhcCCCCCCCCCCCC-CCCCCCCCC---
Q psy14250 2 PVTHSFAYVGGACVVNKRLEKVNSVAIIEDTGG-------FSGIIVAAHEVGHLLGAVHDGSPPPTY-LGGPGAEKC--- 70 (154)
Q Consensus 2 ~~~~G~A~~g~~C~~~~~~~~~~s~~i~~~~~~-------~~~a~~~AHElGH~lG~~HD~~~~~~~-~~~~~~~~C--- 70 (154)
..++|+||+|+||++.. ++++.++... ...+.||||||||+|||.||.+...|. ++++++..|
T Consensus 109 ~~~lGlA~vg~mC~~~~------s~~~~~~~~~~~~~~~~~~~~~t~AHElGHnLGm~HD~~~~~C~~~~~~~~~~Cp~~ 182 (228)
T cd04271 109 GSEVGVAWLGQLCRTGA------SDQGNETVAGTNVVVRTSNEWQVFAHEIGHTFGAVHDCTSGTCSDGSVGSQQCCPLS 182 (228)
T ss_pred CCceeEeeccCCCCCCc------CCCccccccccceeecccccceehhhhhhhhcCCCCCCCCCccccccccccccCCCc
Confidence 46899999999999974 4555554321 112369999999999999998764332 334444444
Q ss_pred ----CCCCCceeccCCCCCCCCccccCcHHHHHhhhcCCC--Ccccc
Q psy14250 71 ----RWEDGFIMSDLRHTEKGFKWSPCSVSSFHHFLNGDT--ATCLF 111 (154)
Q Consensus 71 ----~~~~~~IMs~~~~~~~~~~fS~CS~~~i~~~l~~~~--~~CL~ 111 (154)
.++++|||++..+. ++++||+||+++|++||.++. +.||.
T Consensus 183 ~~~C~~~~~~IM~~~~~~-~~~~FS~CS~~~~~~~l~~~~~~~~CL~ 228 (228)
T cd04271 183 TSTCDANGQYIMNPSSSS-GITEFSPCTIGNICSLLGRNPVRTSCLS 228 (228)
T ss_pred ccccCCCCCEEeecCcCC-CCCccChhhHHHHHHHHhcCCCCCcccC
Confidence 44689999998655 778999999999999998754 78983
No 7
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=99.87 E-value=2.6e-22 Score=152.91 Aligned_cols=89 Identities=35% Similarity=0.615 Sum_probs=73.1
Q ss_pred CceeeeeccCCcccCCccCCccceEEEEeeCC-CccchhhhHHhhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCceecc
Q psy14250 2 PVTHSFAYVGGACVVNKRLEKVNSVAIIEDTG-GFSGIIVAAHEVGHLLGAVHDGSPPPTYLGGPGAEKCRWEDGFIMSD 80 (154)
Q Consensus 2 ~~~~G~A~~g~~C~~~~~~~~~~s~~i~~~~~-~~~~a~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~C~~~~~~IMs~ 80 (154)
..+.|+||++++|++.. +++++++.+ .+..+.++||||||+|||.||...+++ +.|+++++|||++
T Consensus 103 ~~~~G~a~~~~~C~~~~------~~~v~~~~~~~~~~~~~~aHElGH~lG~~HD~~~~~~-------~~c~~~~~~IM~~ 169 (192)
T cd04267 103 GDILGLAYVGSMCNPYS------SVGVVEDTGFTLLTALTMAHELGHNLGAEHDGGDELA-------FECDGGGNYIMAP 169 (192)
T ss_pred CCeeeeecccCCCCCCC------CeEEEecCCcceeehhhhhhhHHhhcCCcCCCCCCcc-------ccCCCCCCeEEcc
Confidence 35789999999999863 578888775 466799999999999999999976421 2677778999999
Q ss_pred CCCCCCCCccccCcHHHHHhhhc
Q psy14250 81 LRHTEKGFKWSPCSVSSFHHFLN 103 (154)
Q Consensus 81 ~~~~~~~~~fS~CS~~~i~~~l~ 103 (154)
.....+...||+||+++|++||.
T Consensus 170 ~~~~~~~~~FS~CS~~~i~~~l~ 192 (192)
T cd04267 170 VDSGLNSYRFSQCSIGSIREFLD 192 (192)
T ss_pred cccCCCCCccChhhHHHHHHHhC
Confidence 76545678899999999999884
No 8
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=99.87 E-value=3e-22 Score=158.56 Aligned_cols=105 Identities=28% Similarity=0.392 Sum_probs=73.7
Q ss_pred CceeeeeccC--------CcccCCccC--Ccc--ceEEEEe--eCCC----ccchhhhHHhhhhhcCCCCCCCCCCCCCC
Q psy14250 2 PVTHSFAYVG--------GACVVNKRL--EKV--NSVAIIE--DTGG----FSGIIVAAHEVGHLLGAVHDGSPPPTYLG 63 (154)
Q Consensus 2 ~~~~G~A~~g--------~~C~~~~~~--~~~--~s~~i~~--~~~~----~~~a~~~AHElGH~lG~~HD~~~~~~~~~ 63 (154)
..++|+||+| |||++.... +.. ...|++. +.+. ...+.+|||||||+|||+||+...
T Consensus 114 ~~tlGlA~vg~~~~~~~gGIC~~~~s~~~~~~~~~n~gl~t~~~~~~~~~~~~~a~t~AHElGHnlGm~HD~~~~----- 188 (244)
T cd04270 114 MGTLGLAYVGSPRDNSAGGICEKAYYYSNGKKKYLNTGLTTTVNYGKRVPTKESDLVTAHELGHNFGSPHDPDIA----- 188 (244)
T ss_pred CCceeeeeecccccCCCCcccCCcccccCCcceeeecceEeeeccCCccchhHHHHHHHHHHHHhcCCCCCCCCC-----
Confidence 3579999996 589887431 000 0013332 2221 124689999999999999999852
Q ss_pred CCCCCCCCC--CCCceeccCC---CCCCCCccccCcHHHHHhhhcCCCCcccccCC
Q psy14250 64 GPGAEKCRW--EDGFIMSDLR---HTEKGFKWSPCSVSSFHHFLNGDTATCLFNSP 114 (154)
Q Consensus 64 ~~~~~~C~~--~~~~IMs~~~---~~~~~~~fS~CS~~~i~~~l~~~~~~CL~~~p 114 (154)
.|.|.. .++|||++.. ...+++.||+||+++|.+||..+.+.||.+..
T Consensus 189 ---~C~~~~~~~~~cIM~~~~~~~~~~~~~~FS~CS~~~~~~~L~~~~~~CL~~~~ 241 (244)
T cd04270 189 ---ECAPGESQGGNYIMYARATSGDKENNKKFSPCSKKSISKVLEVKSNSCFVERS 241 (244)
T ss_pred ---CCCCCCCCCCceEecCccccCCCCCCCCCCHhHHHHHHHHHccCCCCCcCCCC
Confidence 144532 3689999874 23567899999999999999998899998663
No 9
>KOG3658|consensus
Probab=99.86 E-value=5.2e-22 Score=171.53 Aligned_cols=128 Identities=24% Similarity=0.346 Sum_probs=93.6
Q ss_pred CceeeeeccC--------CcccCCccCC--ccce--EEEEee--CCC----ccchhhhHHhhhhhcCCCCCCCCCCCCCC
Q psy14250 2 PVTHSFAYVG--------GACVVNKRLE--KVNS--VAIIED--TGG----FSGIIVAAHEVGHLLGAVHDGSPPPTYLG 63 (154)
Q Consensus 2 ~~~~G~A~~g--------~~C~~~~~~~--~~~s--~~i~~~--~~~----~~~a~~~AHElGH~lG~~HD~~~~~~~~~ 63 (154)
..++|||||| |+|.+.+..- +.++ .||+.. ++. --.-.|+|||||||||.+||.+.-| .|+
T Consensus 339 ~GtLGLAwVgsp~~~saGGIC~k~~~y~~G~~~sLNtGi~T~~NYg~~Vp~kvs~lt~AHEiGHNfGSpHDp~~EC-sPg 417 (764)
T KOG3658|consen 339 GGTLGLAWVGSPRSNSAGGICEKAYTYNEGKKRSLNTGISTSVNYGKRVPTKVSDLTLAHEIGHNFGSPHDPDIEC-SPG 417 (764)
T ss_pred cceeeeEEeccCccCCCCcccccceeccCcceEEeecceeeeeecCCccCcchhheeehhhhccccCCCCCCCCcc-CCC
Confidence 4689999988 7999876531 1111 233321 221 1135799999999999999998521 111
Q ss_pred CCCCCCCCCCCCceeccCC--C-CCCCCccccCcHHHHHhhhcCCCCcccccCCCCCCCCCCCCCCeEECcccccccccC
Q psy14250 64 GPGAEKCRWEDGFIMSDLR--H-TEKGFKWSPCSVSSFHHFLNGDTATCLFNSPHEDESLPRVLPGKLLSLDAQCRKDRG 140 (154)
Q Consensus 64 ~~~~~~C~~~~~~IMs~~~--~-~~~~~~fS~CS~~~i~~~l~~~~~~CL~~~p~~~~~~~~~cgn~~~e~~eqCd~~~g 140 (154)
. ...+.+||++.. + ..+..+||.||++.|.+.|..++..|+... ....|||++||.+|||||.|-
T Consensus 418 ~------~~~GnyiM~a~AtSGd~~NN~kFSpCS~ksI~~VL~~k~~~CFqE~------~~sfCGN~iVE~GEECDcG~~ 485 (764)
T KOG3658|consen 418 E------SQGGNYIMYAYATSGDKPNNKKFSPCSLKSISKVLESKKRNCFQER------ESSFCGNGIVEPGEECDCGFS 485 (764)
T ss_pred C------CCCCcEEEEEecccCcCccCcccCcchHHHHHHHHHHhhhhhccCc------cccccCCccccCCcccCcccc
Confidence 1 115789999873 3 467889999999999999999889999655 357899999999999999887
Q ss_pred CC
Q psy14250 141 TS 142 (154)
Q Consensus 141 ~~ 142 (154)
.+
T Consensus 486 ~~ 487 (764)
T KOG3658|consen 486 TD 487 (764)
T ss_pred cc
Confidence 53
No 10
>KOG3538|consensus
Probab=99.79 E-value=1.1e-19 Score=165.00 Aligned_cols=131 Identities=32% Similarity=0.495 Sum_probs=111.3
Q ss_pred ceeeeeccCCcccCCccCCccceEEEEeeCCCccchhhhHHhhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCceeccCC
Q psy14250 3 VTHSFAYVGGACVVNKRLEKVNSVAIIEDTGGFSGIIVAAHEVGHLLGAVHDGSPPPTYLGGPGAEKCRWEDGFIMSDLR 82 (154)
Q Consensus 3 ~~~G~A~~g~~C~~~~~~~~~~s~~i~~~~~~~~~a~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~C~~~~~~IMs~~~ 82 (154)
.+.|+|.+++||.+.+ ++.|+++.++- .|+++|||+||+|||.||+.. ..|.... +||++..
T Consensus 290 ~~~G~A~v~gmC~~~~------sC~i~e~~gl~-~aftiahE~GH~~gm~hd~~~----------~~C~~~~-~im~~~~ 351 (845)
T KOG3538|consen 290 DTLGVAPVGGMCSPSR------SCSINEDTGLP-SAFTIAHELGHNFGMEHDGRG----------NSCGPTF-HVMSSDT 351 (845)
T ss_pred ccccceeecceeCCcc------ceeeecCCCCc-cceeeeeccccccCcccCCcC----------CCCCCcc-cccCccc
Confidence 4689999999999984 79999999854 799999999999999999987 3475333 8999974
Q ss_pred -CCCCCCccccCcHHHHHhhhcCCCCcccccCCC-CCCCCCCCCCCeEECcccccccccCCCceEeccccc
Q psy14250 83 -HTEKGFKWSPCSVSSFHHFLNGDTATCLFNSPH-EDESLPRVLPGKLLSLDAQCRKDRGTSACFIRYVQT 151 (154)
Q Consensus 83 -~~~~~~~fS~CS~~~i~~~l~~~~~~CL~~~p~-~~~~~~~~cgn~~~e~~eqCd~~~g~~~~~~~~~~~ 151 (154)
.......||.||.+++..||.+....||.+.|. .....+...|+..|+.+|||...||..+.+|..++.
T Consensus 352 ~~~~~~~~WS~CS~~~~t~fl~~~~~~Cl~d~p~~~~~~~p~~lPg~~y~~~~qC~~~fg~~~~~c~~~~~ 422 (845)
T KOG3538|consen 352 CGGNHELTWSECSREYLTRFLDDGEGNCLSDQPFSALPKLPSQLPGQTYDADEQCEFQFGPKSKECPDMRA 422 (845)
T ss_pred ccCCCCccCCcchHHHHHHHhccCccceeecCCccccCCCCccCCccccCCCCccccccCCcceeccccCc
Confidence 334556699999999999999887899999984 334456668899999999999999999999999875
No 11
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=99.78 E-value=5.1e-20 Score=138.94 Aligned_cols=92 Identities=36% Similarity=0.560 Sum_probs=45.5
Q ss_pred CceeeeeccCCcccCCccCCccceEEEEeeCCCccchhhhHHhhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCceeccC
Q psy14250 2 PVTHSFAYVGGACVVNKRLEKVNSVAIIEDTGGFSGIIVAAHEVGHLLGAVHDGSPPPTYLGGPGAEKCRWEDGFIMSDL 81 (154)
Q Consensus 2 ~~~~G~A~~g~~C~~~~~~~~~~s~~i~~~~~~~~~a~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~C~~~~~~IMs~~ 81 (154)
...+|+||+|++|+..+. ..|+..+.+.+..+.++||||||+||+.||.+..+ .+++..|++++++||++.
T Consensus 81 ~g~~Gla~~g~~c~~~~~-----~~g~~~~~~~~~~~~~~aHElGH~lGa~Hd~~~~~----~~~~~~~~~sg~~IMsy~ 151 (173)
T PF13574_consen 81 GGGGGLAYVGGVCSKGYK-----ATGLSSPYGDFFGIDTFAHELGHQLGAPHDFDNGC----NPGSPTCPGSGGYIMSYA 151 (173)
T ss_dssp SGCCSSEEE-CCCCTCCC-----E-CCCS--TTSHHHHHHHHHHHHHHT---SSSS--------SS--T-STTT-TT-SS
T ss_pred CCCccEEEEeEeCcCCCE-----EEEEEcCCCceeeeeeehhhhHhhcCCCCCCCCCC----CCCCCcCCCCCCEEEeeC
Confidence 357899999999988753 23344444433368999999999999999998631 123346888999999987
Q ss_pred CCCCCCCccccCcHHHHHhhhc
Q psy14250 82 RHTEKGFKWSPCSVSSFHHFLN 103 (154)
Q Consensus 82 ~~~~~~~~fS~CS~~~i~~~l~ 103 (154)
+..+..+||+||+++|+.+|.
T Consensus 152 -~~~~~~~fS~cS~~~i~~~l~ 172 (173)
T PF13574_consen 152 -GICNNDKFSPCSIRQIRAVLG 172 (173)
T ss_dssp --GGGG----------------
T ss_pred -CCCCCcccccccccccccccC
Confidence 446777899999999999885
No 12
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.56 E-value=7.4e-15 Score=108.79 Aligned_cols=93 Identities=23% Similarity=0.308 Sum_probs=66.1
Q ss_pred ceeeeeccCCcccCCccCCccceEEEEeeCC--CccchhhhHHhhhhhcCCCCCCCCCCCC--C-CCCCCCCCCCCCCce
Q psy14250 3 VTHSFAYVGGACVVNKRLEKVNSVAIIEDTG--GFSGIIVAAHEVGHLLGAVHDGSPPPTY--L-GGPGAEKCRWEDGFI 77 (154)
Q Consensus 3 ~~~G~A~~g~~C~~~~~~~~~~s~~i~~~~~--~~~~a~~~AHElGH~lG~~HD~~~~~~~--~-~~~~~~~C~~~~~~I 77 (154)
...|+||++++|++.. +.+++.+.. ....+.+++|||||.|||.||...++.+ + .......+..+..||
T Consensus 66 ~~~g~a~~~~~c~~~~------~~~~~~~~~~~~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~~~~~~~~~~~~si 139 (167)
T cd00203 66 GTGGWAYLGRVCDSLR------GVGVLQDNQSGTKEGAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSV 139 (167)
T ss_pred CceEEEecCCccCCCC------CcEEEecCCcccccchhhHHHHHHHHhCCCccCcCCCCCCCccccccccCCCCCCCeE
Confidence 4689999999998763 455554432 2356899999999999999999864311 0 011112334567899
Q ss_pred eccCCCC---CCCCccccCcHHHHHhh
Q psy14250 78 MSDLRHT---EKGFKWSPCSVSSFHHF 101 (154)
Q Consensus 78 Ms~~~~~---~~~~~fS~CS~~~i~~~ 101 (154)
|++.... ...+.||+|++++|+.+
T Consensus 140 M~y~~~~~~~~~~~~fS~~d~~~i~~~ 166 (167)
T cd00203 140 MSYTKGSFSDGQRKDFSQCDIDQINKL 166 (167)
T ss_pred eccCccccCcccCCCcCHHHHHHHHhh
Confidence 9997643 46779999999999865
No 13
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=99.51 E-value=5.2e-14 Score=108.98 Aligned_cols=98 Identities=27% Similarity=0.210 Sum_probs=64.4
Q ss_pred CceeeeeccCCcccCCccCCccceEEEEeeC-CCccchhhhHHhhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCceecc
Q psy14250 2 PVTHSFAYVGGACVVNKRLEKVNSVAIIEDT-GGFSGIIVAAHEVGHLLGAVHDGSPPPTYLGGPGAEKCRWEDGFIMSD 80 (154)
Q Consensus 2 ~~~~G~A~~g~~C~~~~~~~~~~s~~i~~~~-~~~~~a~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~C~~~~~~IMs~ 80 (154)
....|+||+++.|.... ..+++.+. .......++||||||+||+.||....... .+.+. +.+.+..||++
T Consensus 107 ~~~~GlA~~~~~~~~~~------~~~~~~~~~~~~~~~~~~aHEiGH~lGl~H~~~~~~~~--~~~~~-~~~~~~tIMsY 177 (206)
T PF13583_consen 107 SGTCGLAYVGSVCWDNA------KAGAYSDPDLSANGYQTFAHEIGHNLGLRHDFDYSGDP--YSYAY-ECDSGQTIMSY 177 (206)
T ss_pred CCCCCeeeecccccccc------ceEEEEecccccccchHHHHHHHHHhcCCCCcccCCCc--ccccc-cCCCCCEEeCC
Confidence 46789999999995442 12233322 22234578999999999999999853211 11222 33456799997
Q ss_pred CCCCCCCCccccCcHHHHHhhhcCCCCcc
Q psy14250 81 LRHTEKGFKWSPCSVSSFHHFLNGDTATC 109 (154)
Q Consensus 81 ~~~~~~~~~fS~CS~~~i~~~l~~~~~~C 109 (154)
...... ..||+|++.+|++++..+...|
T Consensus 178 ~~~~~~-~~FS~~~i~~i~~~~~~~~~~~ 205 (206)
T PF13583_consen 178 ASRPSR-PYFSPPSIEYIREVVICGDASY 205 (206)
T ss_pred CCCCCC-CCCCchhHHHHhCCCcCccccc
Confidence 544333 3599999999998886655444
No 14
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=99.46 E-value=1e-14 Score=111.19 Aligned_cols=86 Identities=34% Similarity=0.455 Sum_probs=41.7
Q ss_pred eeeeccCCcccCCccCCccceEEEE-ee-CCCccchhhhHHhhhhhcCCCCCCCCCCCC-CCCCCCCCCCCCCCceeccC
Q psy14250 5 HSFAYVGGACVVNKRLEKVNSVAII-ED-TGGFSGIIVAAHEVGHLLGAVHDGSPPPTY-LGGPGAEKCRWEDGFIMSDL 81 (154)
Q Consensus 5 ~G~A~~g~~C~~~~~~~~~~s~~i~-~~-~~~~~~a~~~AHElGH~lG~~HD~~~~~~~-~~~~~~~~C~~~~~~IMs~~ 81 (154)
+|+||+|++|.......-....... .. ......+.++||||||+||+.||....... ...+++..|++++++||++.
T Consensus 108 lGlA~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~AHEiGH~lGa~HD~~~~~~~~~c~~~~~~~~~~~~~IM~y~ 187 (196)
T PF13688_consen 108 LGLAYVGSSCSNTSGGICSSSVSGSYGQVPPTYNGAITFAHEIGHNLGAPHDGDYSSTSSSCPPSSSTCPAGGGYIMSYC 187 (196)
T ss_dssp EEEESSSSSSTT-SSSSS--E-EEETTEE--HHHHHHHHHHHHHHHTT-----SSS---TTTS-----GGGT---TTSSS
T ss_pred ceeEeccccCcCCccceeccCcccccceECCCCceehhhHHhHHHhcCCCCCCCCCCCCCCCCCCccccCCCCCeEEccc
Confidence 6999999999876421110011100 00 123456899999999999999999874110 01222345777899999998
Q ss_pred CCCCCCCccc
Q psy14250 82 RHTEKGFKWS 91 (154)
Q Consensus 82 ~~~~~~~~fS 91 (154)
.. .+..+||
T Consensus 188 ~~-~n~~~FS 196 (196)
T PF13688_consen 188 AS-PNNNKFS 196 (196)
T ss_dssp ---GGGGS--
T ss_pred CC-CCcccCC
Confidence 65 6677787
No 15
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=98.85 E-value=1.2e-09 Score=77.31 Aligned_cols=46 Identities=41% Similarity=0.615 Sum_probs=25.3
Q ss_pred ceeeeeccCCcccCCccCCccceEEEEeeC--CCccchhhhHHhhhhhcCCCCC
Q psy14250 3 VTHSFAYVGGACVVNKRLEKVNSVAIIEDT--GGFSGIIVAAHEVGHLLGAVHD 54 (154)
Q Consensus 3 ~~~G~A~~g~~C~~~~~~~~~~s~~i~~~~--~~~~~a~~~AHElGH~lG~~HD 54 (154)
...|+||++++|..... .++..+. .......++||||||+||+.||
T Consensus 77 ~~~G~A~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~HEiGH~lGl~Hd 124 (124)
T PF13582_consen 77 GAGGIAYIGGVCSNNSG------YGVNYDSSPYGDSGVDTFAHEIGHNLGLNHD 124 (124)
T ss_dssp GB-EEEECT---TTT--------TT-SEEEEE--TTSTTHHHHHHHHHTT----
T ss_pred CCceEEEeCCccccCCc------eEEEcccCCCCcccceEeeehhhHhcCCCCC
Confidence 47899999999999853 2233222 1234569999999999999997
No 16
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=97.92 E-value=7.7e-06 Score=59.56 Aligned_cols=48 Identities=25% Similarity=0.297 Sum_probs=33.1
Q ss_pred cchhhhHHhhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCceeccCCCCCCCCccccCcHHHHHh
Q psy14250 36 SGIIVAAHEVGHLLGAVHDGSPPPTYLGGPGAEKCRWEDGFIMSDLRHTEKGFKWSPCSVSSFHH 100 (154)
Q Consensus 36 ~~a~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~C~~~~~~IMs~~~~~~~~~~fS~CS~~~i~~ 100 (154)
....|++|||||.||+.|.... .-||++.........++..-++.|+.
T Consensus 104 ~~~~v~~HEiGHaLGL~H~~~~-----------------~svM~~~~~~~~~~~l~~~Di~~i~~ 151 (154)
T PF00413_consen 104 DLQSVAIHEIGHALGLDHSNDP-----------------NSVMYPYYRGPDNKTLSEDDIDGIQY 151 (154)
T ss_dssp EHHHHHHHHHHHHTTBESSSST-----------------TSTTSSSCTSSSSTSTTHHHHHHHHH
T ss_pred hhhhhhhhccccccCcCcCCCc-----------------ccceeeecccCCCCCCCHHHHHHHHH
Confidence 3468999999999999998754 34899875432223566655555544
No 17
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=97.70 E-value=3.8e-05 Score=58.43 Aligned_cols=44 Identities=30% Similarity=0.331 Sum_probs=29.1
Q ss_pred eeeccCCcccCCccCCccceEEEEeeCC---C----------c--cchhhhHHhhhhhcCCCCCC
Q psy14250 6 SFAYVGGACVVNKRLEKVNSVAIIEDTG---G----------F--SGIIVAAHEVGHLLGAVHDG 55 (154)
Q Consensus 6 G~A~~g~~C~~~~~~~~~~s~~i~~~~~---~----------~--~~a~~~AHElGH~lG~~HD~ 55 (154)
|++|+.|.|.+.. .++|+--.. . + -....++|||||+||+.|-.
T Consensus 85 ~~nfvfG~a~~~~------~~aVvS~~Rl~~~~~~~~~~~~l~~~R~~k~~~HElGH~lGL~HC~ 143 (179)
T PRK13267 85 GLNFVFGLAYPNL------RGAVISTYRLRPEFYGNKPDSELFEERVRKEVTHELGHTLGLEHCD 143 (179)
T ss_pred CCCceeEEeecCC------CEEEEEccccChhhccccchHHHHHHHHHHHHHHHHHHHcCCccCC
Confidence 5778889998864 344442211 1 1 13455899999999999943
No 18
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=97.68 E-value=4.2e-05 Score=56.28 Aligned_cols=49 Identities=22% Similarity=0.224 Sum_probs=34.6
Q ss_pred chhhhHHhhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCceeccCCCCCC--CCccccCcHHHHHhh
Q psy14250 37 GIIVAAHEVGHLLGAVHDGSPPPTYLGGPGAEKCRWEDGFIMSDLRHTEK--GFKWSPCSVSSFHHF 101 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~C~~~~~~IMs~~~~~~~--~~~fS~CS~~~i~~~ 101 (154)
...|+.|||||.||+.|.... ..-||++...... ....+...++-|+.+
T Consensus 104 ~~~~~~HEiGHaLGL~H~~~~----------------~~siM~p~~~~~~~~~~~l~~~D~~~i~~l 154 (156)
T cd04279 104 LQAIALHELGHALGLWHHSDR----------------PEDAMYPSQGQGPDGNPTLSARDVATLKRL 154 (156)
T ss_pred HHHHHHHHhhhhhcCCCCCCC----------------ccceeeeecccCCcCcCCCCHHHHHHHHHH
Confidence 457899999999999999875 1359998743222 245677766666543
No 19
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=97.38 E-value=0.00017 Score=53.12 Aligned_cols=46 Identities=35% Similarity=0.447 Sum_probs=30.8
Q ss_pred chhhhHHhhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCceeccCCCCC-CCCccccCcHHHHH
Q psy14250 37 GIIVAAHEVGHLLGAVHDGSPPPTYLGGPGAEKCRWEDGFIMSDLRHTE-KGFKWSPCSVSSFH 99 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~C~~~~~~IMs~~~~~~-~~~~fS~CS~~~i~ 99 (154)
.-.|+.|||||.||+.|..+. .-||.+..... ....++.--+..|+
T Consensus 107 ~~~~~~HEiGHaLGL~H~~~~-----------------~~vM~~~~~~~~~~~~l~~~D~~~~~ 153 (157)
T cd04278 107 LFSVAAHEIGHALGLGHSSDP-----------------DSIMYPYYQGPVPKFKLSQDDIRGIQ 153 (157)
T ss_pred HHHHHHHHhccccccCCCCCC-----------------cCeecccccCCCcCCCCCHHHHHHHH
Confidence 457999999999999997654 35999864322 23455555444443
No 20
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=97.34 E-value=0.00021 Score=52.55 Aligned_cols=21 Identities=38% Similarity=0.436 Sum_probs=18.8
Q ss_pred chhhhHHhhhhhcCCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGSP 57 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~~ 57 (154)
.-.+++|||||.||+.|....
T Consensus 94 ~~~~~~HEiGHaLGL~H~~~~ 114 (165)
T cd04268 94 LRNTAEHELGHALGLRHNFAA 114 (165)
T ss_pred HHHHHHHHHHHHhcccccCcC
Confidence 358999999999999999876
No 21
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=97.27 E-value=0.00011 Score=52.72 Aligned_cols=24 Identities=38% Similarity=0.679 Sum_probs=16.0
Q ss_pred ccchhhhHHhhhhhcCCCCCCCCC
Q psy14250 35 FSGIIVAAHEVGHLLGAVHDGSPP 58 (154)
Q Consensus 35 ~~~a~~~AHElGH~lG~~HD~~~~ 58 (154)
+....|+||||||+||++....+.
T Consensus 75 y~~~RIaaHE~GHiLGLPD~y~Gp 98 (132)
T PF02031_consen 75 YNSTRIAAHELGHILGLPDHYPGP 98 (132)
T ss_dssp S-HHHHHHHHHHHHHT----TTS-
T ss_pred CccceeeeehhccccCCCCCCCCc
Confidence 556789999999999999888774
No 22
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=97.16 E-value=0.00021 Score=53.91 Aligned_cols=41 Identities=24% Similarity=0.257 Sum_probs=28.0
Q ss_pred chhhhHHhhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCceeccCCC--CCCCCccccCc
Q psy14250 37 GIIVAAHEVGHLLGAVHDGSPPPTYLGGPGAEKCRWEDGFIMSDLRH--TEKGFKWSPCS 94 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~C~~~~~~IMs~~~~--~~~~~~fS~CS 94 (154)
...-+.|||||.||+.|=. +..|+|+.... ....+.|+-|.
T Consensus 124 v~KEv~HElGH~~GL~HC~-----------------N~~CVM~FSnSl~dvDrKs~~fC~ 166 (181)
T COG1913 124 VVKEVLHELGHLLGLSHCP-----------------NPRCVMNFSNSLRDVDRKSPNFCN 166 (181)
T ss_pred HHHHHHHHhhhhcCcccCC-----------------CCCcEEeCCccHHhhhccchhhhH
Confidence 4577899999999997744 34689998632 23444555554
No 23
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=97.01 E-value=0.00031 Score=50.59 Aligned_cols=20 Identities=55% Similarity=0.720 Sum_probs=18.1
Q ss_pred hhhhHHhhhhhcCCCCCCCC
Q psy14250 38 IIVAAHEVGHLLGAVHDGSP 57 (154)
Q Consensus 38 a~~~AHElGH~lG~~HD~~~ 57 (154)
..+++|||||.||+.|....
T Consensus 87 ~~~~~HEigHaLGl~H~~~~ 106 (140)
T smart00235 87 TGVAAHELGHALGLYHEQSR 106 (140)
T ss_pred cccHHHHHHHHhcCCcCCCC
Confidence 35999999999999999986
No 24
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=96.64 E-value=0.00092 Score=50.50 Aligned_cols=22 Identities=41% Similarity=0.643 Sum_probs=19.3
Q ss_pred chhhhHHhhhhhcCCCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGSPP 58 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~~~ 58 (154)
.-.++.|||||.||+.|.....
T Consensus 113 ~~~t~~HEiGHaLGL~H~~~~~ 134 (186)
T cd04277 113 GYQTIIHEIGHALGLEHPGDYN 134 (186)
T ss_pred hHHHHHHHHHHHhcCCCCCcCC
Confidence 3589999999999999998764
No 25
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=96.57 E-value=0.001 Score=49.37 Aligned_cols=24 Identities=33% Similarity=0.346 Sum_probs=15.5
Q ss_pred ccchhhhHHhhhhhcCCCCCCCCC
Q psy14250 35 FSGIIVAAHEVGHLLGAVHDGSPP 58 (154)
Q Consensus 35 ~~~a~~~AHElGH~lG~~HD~~~~ 58 (154)
+..+.|++||+||-||+.|-..+.
T Consensus 67 ~~~g~TltHEvGH~LGL~HtF~~~ 90 (154)
T PF05572_consen 67 YNFGKTLTHEVGHWLGLYHTFGGG 90 (154)
T ss_dssp S-SSHHHHHHHHHHTT---TT---
T ss_pred cccccchhhhhhhhhcccccccCC
Confidence 345899999999999999998764
No 26
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=96.42 E-value=0.0014 Score=50.51 Aligned_cols=27 Identities=37% Similarity=0.560 Sum_probs=18.0
Q ss_pred chhhhHHhhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCceeccC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGSPPPTYLGGPGAEKCRWEDGFIMSDL 81 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~C~~~~~~IMs~~ 81 (154)
....+.|||||+||+.| |. + .|+|+..
T Consensus 145 ~~Kea~HElGH~~GL~H----------------C~-~-~CvM~~s 171 (194)
T PF07998_consen 145 VCKEAVHELGHLFGLDH----------------CE-N-RCVMNFS 171 (194)
T ss_dssp HHHHHHHHHHHHTT--------------------S-S-TSTTS--
T ss_pred HHHHHHHHHHHHcCCcC----------------CC-C-CCccCCC
Confidence 56899999999999976 54 2 7899975
No 27
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=96.32 E-value=0.0022 Score=48.89 Aligned_cols=21 Identities=29% Similarity=0.369 Sum_probs=18.5
Q ss_pred hhhhHHhhhhhcCCCCCCCCC
Q psy14250 38 IIVAAHEVGHLLGAVHDGSPP 58 (154)
Q Consensus 38 a~~~AHElGH~lG~~HD~~~~ 58 (154)
..++.|||||.||+.|.....
T Consensus 78 ~G~i~HEl~HaLG~~HEhsRp 98 (182)
T cd04283 78 KGIIQHELLHALGFYHEQTRS 98 (182)
T ss_pred cchHHHHHHHHhCCccccccc
Confidence 368999999999999998773
No 28
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=96.24 E-value=0.0022 Score=49.54 Aligned_cols=21 Identities=33% Similarity=0.724 Sum_probs=18.5
Q ss_pred hhhhHHhhhhhcCCCCCCCCC
Q psy14250 38 IIVAAHEVGHLLGAVHDGSPP 58 (154)
Q Consensus 38 a~~~AHElGH~lG~~HD~~~~ 58 (154)
..+++|||+|.||+.|.....
T Consensus 88 ~Gti~HEl~HaLGf~HEhsRp 108 (200)
T cd04281 88 FGIVVHELGHVIGFWHEHTRP 108 (200)
T ss_pred CchHHHHHHHHhcCcchhccc
Confidence 468999999999999998763
No 29
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=96.21 E-value=0.0043 Score=51.13 Aligned_cols=20 Identities=40% Similarity=0.582 Sum_probs=17.3
Q ss_pred hhhhHHhhhhhcCCCCCCCC
Q psy14250 38 IIVAAHEVGHLLGAVHDGSP 57 (154)
Q Consensus 38 a~~~AHElGH~lG~~HD~~~ 57 (154)
..++.|||||||||.|-...
T Consensus 151 ~~~~~HElgHN~GL~Ha~~~ 170 (314)
T PF05548_consen 151 WATIMHELGHNLGLWHAGRN 170 (314)
T ss_pred HHHHHHHhhhhccccccCCC
Confidence 45999999999999998754
No 30
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=96.11 E-value=0.0025 Score=49.63 Aligned_cols=20 Identities=35% Similarity=0.449 Sum_probs=17.8
Q ss_pred chhhhHHhhhhhcCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGS 56 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~ 56 (154)
.+.+|+|||||.+|+.|.+-
T Consensus 133 ~~hvi~HEiGH~IGfRHTD~ 152 (211)
T PF12388_consen 133 IEHVITHEIGHCIGFRHTDY 152 (211)
T ss_pred HHHHHHHHhhhhccccccCc
Confidence 57899999999999999763
No 31
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=96.08 E-value=0.0032 Score=47.66 Aligned_cols=21 Identities=38% Similarity=0.542 Sum_probs=18.6
Q ss_pred hhhhHHhhhhhcCCCCCCCCC
Q psy14250 38 IIVAAHEVGHLLGAVHDGSPP 58 (154)
Q Consensus 38 a~~~AHElGH~lG~~HD~~~~ 58 (154)
..++.|||||.||+.|.....
T Consensus 75 ~g~v~HE~~HalG~~HEh~R~ 95 (180)
T cd04280 75 LGTIVHELMHALGFYHEQSRP 95 (180)
T ss_pred CchhHHHHHHHhcCcchhccc
Confidence 479999999999999998763
No 32
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=95.90 E-value=0.0048 Score=48.73 Aligned_cols=21 Identities=33% Similarity=0.381 Sum_probs=18.5
Q ss_pred hhhhHHhhhhhcCCCCCCCCC
Q psy14250 38 IIVAAHEVGHLLGAVHDGSPP 58 (154)
Q Consensus 38 a~~~AHElGH~lG~~HD~~~~ 58 (154)
..+++|||+|.||+.|.....
T Consensus 121 ~Gti~HEl~HalGf~HEqsRp 141 (230)
T cd04282 121 KATVEHEFLHALGFYHEQSRS 141 (230)
T ss_pred CchHHHHHHHHhCCccccccc
Confidence 468999999999999998773
No 33
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=95.74 E-value=0.0047 Score=47.32 Aligned_cols=20 Identities=45% Similarity=0.798 Sum_probs=17.9
Q ss_pred hhhHHhhhhhcCCCCCCCCC
Q psy14250 39 IVAAHEVGHLLGAVHDGSPP 58 (154)
Q Consensus 39 ~~~AHElGH~lG~~HD~~~~ 58 (154)
.++.|||||.||+.|.....
T Consensus 94 ~~i~HElgHaLG~~HEh~rp 113 (198)
T cd04327 94 RVVLHEFGHALGFIHEHQSP 113 (198)
T ss_pred HHHHHHHHHHhcCcccccCC
Confidence 58999999999999998763
No 34
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=95.73 E-value=0.0024 Score=50.25 Aligned_cols=24 Identities=38% Similarity=0.380 Sum_probs=20.6
Q ss_pred ccchhhhHHhhhhhcCCCCCCCCC
Q psy14250 35 FSGIIVAAHEVGHLLGAVHDGSPP 58 (154)
Q Consensus 35 ~~~a~~~AHElGH~lG~~HD~~~~ 58 (154)
+....+++|||||-|||.|...+.
T Consensus 135 ~n~g~t~~HEvGH~lGL~HtF~~~ 158 (225)
T cd04275 135 YNLGDTATHEVGHWLGLYHTFQGG 158 (225)
T ss_pred ccccceeEEeccceeeeeeeecCC
Confidence 345689999999999999998774
No 35
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=95.56 E-value=0.0036 Score=47.84 Aligned_cols=21 Identities=38% Similarity=0.624 Sum_probs=16.7
Q ss_pred hhhhHHhhhhhcCCCCCCCCC
Q psy14250 38 IIVAAHEVGHLLGAVHDGSPP 58 (154)
Q Consensus 38 a~~~AHElGH~lG~~HD~~~~ 58 (154)
..+++|||+|.||+.|.....
T Consensus 80 ~~~i~HEl~HaLG~~HEh~Rp 100 (191)
T PF01400_consen 80 VGTILHELGHALGFWHEHQRP 100 (191)
T ss_dssp HHHHHHHHHHHHTB--GGGST
T ss_pred ccchHHHHHHHHhhhhhhhcc
Confidence 568999999999999998763
No 36
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=95.43 E-value=0.0074 Score=46.56 Aligned_cols=20 Identities=45% Similarity=0.584 Sum_probs=17.6
Q ss_pred hhhhHHhhhhhcCCCCCCCC
Q psy14250 38 IIVAAHEVGHLLGAVHDGSP 57 (154)
Q Consensus 38 a~~~AHElGH~lG~~HD~~~ 57 (154)
..+++|||||.||+.|....
T Consensus 117 ~~~~~he~gh~lGl~hn~~~ 136 (197)
T cd04276 117 RYLLAHEVGHTLGLRHNFKA 136 (197)
T ss_pred HHHHHHHHHHHhcCcccccc
Confidence 46899999999999998765
No 37
>KOG3714|consensus
Probab=95.30 E-value=0.0081 Score=51.20 Aligned_cols=21 Identities=38% Similarity=0.483 Sum_probs=18.5
Q ss_pred hhhhHHhhhhhcCCCCCCCCC
Q psy14250 38 IIVAAHEVGHLLGAVHDGSPP 58 (154)
Q Consensus 38 a~~~AHElGH~lG~~HD~~~~ 58 (154)
..|++|||+|.||..|.....
T Consensus 160 ~G~i~HEl~HaLGf~HehsR~ 180 (411)
T KOG3714|consen 160 FGTIVHELMHALGFWHEHSRP 180 (411)
T ss_pred CchhHHHHHHHhhhhhccCcc
Confidence 578999999999999998764
No 38
>KOG1565|consensus
Probab=94.91 E-value=0.016 Score=50.24 Aligned_cols=48 Identities=31% Similarity=0.452 Sum_probs=32.7
Q ss_pred ccchhhhHHhhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCceeccCCC-CCCCCccccCcHHHHHhhh
Q psy14250 35 FSGIIVAAHEVGHLLGAVHDGSPPPTYLGGPGAEKCRWEDGFIMSDLRH-TEKGFKWSPCSVSSFHHFL 102 (154)
Q Consensus 35 ~~~a~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~C~~~~~~IMs~~~~-~~~~~~fS~CS~~~i~~~l 102 (154)
+....|++|||||.||+.|.... +-||+|... ..+.+..+ .++++...
T Consensus 209 ~~l~~Va~HEiGH~LGL~HS~~~-----------------~aiM~P~y~~~~~~~~L~---~DDv~giq 257 (469)
T KOG1565|consen 209 VDLFLVAAHEIGHALGLGHSSDP-----------------DAIMYPFYQPDSGNFDLS---QDDVRGIQ 257 (469)
T ss_pred chhHHHhhhhcccccccCCCCCc-----------------ccccccccccCCCCcccC---hhhhhhhH
Confidence 45578999999999999998654 358998754 23333334 66665433
No 39
>PF10462 Peptidase_M66: Peptidase M66; InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=94.71 E-value=0.015 Score=47.82 Aligned_cols=18 Identities=39% Similarity=0.351 Sum_probs=13.8
Q ss_pred cchhhhHHhhhhhcCCCC
Q psy14250 36 SGIIVAAHEVGHLLGAVH 53 (154)
Q Consensus 36 ~~a~~~AHElGH~lG~~H 53 (154)
+...+++|||||+||+.|
T Consensus 192 s~~~~f~HE~GH~~GL~H 209 (305)
T PF10462_consen 192 SYGNEFSHELGHNFGLGH 209 (305)
T ss_dssp -SHHHHHHHHHHTTT--S
T ss_pred CccceeehhhhhhcCCCC
Confidence 346899999999999999
No 40
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=93.87 E-value=0.018 Score=46.65 Aligned_cols=39 Identities=33% Similarity=0.285 Sum_probs=25.4
Q ss_pred chhhhHHhhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCceeccC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGSPPPTYLGGPGAEKCRWEDGFIMSDL 81 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~C~~~~~~IMs~~ 81 (154)
...|++||+||.||+++--+.... +. .. .-...+||+..
T Consensus 165 ~igv~~HE~gH~lGLPDlY~~~~~--~~---~~-~vG~w~lM~~G 203 (286)
T TIGR03296 165 GVGVIAHELGHDLGLPDLYDTSYD--GG---GE-PVGYWSLMSSG 203 (286)
T ss_pred ceeeeehhhhcccCCCccccCcCC--CC---CC-CccccccccCC
Confidence 468999999999999976543100 00 00 22456899984
No 41
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=90.48 E-value=0.097 Score=37.34 Aligned_cols=20 Identities=35% Similarity=0.471 Sum_probs=17.0
Q ss_pred chhhhHHhhhhhcCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGS 56 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~ 56 (154)
...++-|||||-||+..|.-
T Consensus 109 vthvliHEIgHhFGLsDdd~ 128 (136)
T COG3824 109 VTHVLIHEIGHHFGLSDDDL 128 (136)
T ss_pred hhhhhhhhhhhhcCCChhHH
Confidence 46889999999999988763
No 42
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=90.47 E-value=0.11 Score=42.85 Aligned_cols=26 Identities=27% Similarity=0.369 Sum_probs=20.0
Q ss_pred eCCCccchhhhHHhhhhhcCCCCCCC
Q psy14250 31 DTGGFSGIIVAAHEVGHLLGAVHDGS 56 (154)
Q Consensus 31 ~~~~~~~a~~~AHElGH~lG~~HD~~ 56 (154)
+...+....|++||++|.+|....+.
T Consensus 190 ~~p~~~~P~T~~HElAHq~G~a~E~E 215 (318)
T PF12725_consen 190 DLPPYSLPFTICHELAHQLGFASEDE 215 (318)
T ss_pred CCCcccccHHHHHHHHHHhCCCCHHH
Confidence 33345678999999999999876653
No 43
>PF12044 Metallopep: Putative peptidase family; InterPro: IPR021917 This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length.
Probab=90.04 E-value=0.19 Score=43.13 Aligned_cols=20 Identities=40% Similarity=0.629 Sum_probs=17.2
Q ss_pred hhhhHHhhhhhcCCCCCCCC
Q psy14250 38 IIVAAHEVGHLLGAVHDGSP 57 (154)
Q Consensus 38 a~~~AHElGH~lG~~HD~~~ 57 (154)
...+-|||||.||++|-.++
T Consensus 317 iGA~lHEiGH~fg~pH~~~G 336 (423)
T PF12044_consen 317 IGAFLHEIGHLFGCPHQEDG 336 (423)
T ss_pred hHHHHHHHHHhcCCCCCCCC
Confidence 34578999999999999876
No 44
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=89.35 E-value=0.17 Score=34.21 Aligned_cols=24 Identities=29% Similarity=0.382 Sum_probs=16.3
Q ss_pred EEEEeeCCCccchhhhHHhhhhhc
Q psy14250 26 VAIIEDTGGFSGIIVAAHEVGHLL 49 (154)
Q Consensus 26 ~~i~~~~~~~~~a~~~AHElGH~l 49 (154)
+.+........-..++||||||.+
T Consensus 31 I~in~~~~~~~~~f~laHELgH~~ 54 (122)
T PF06114_consen 31 IFINSNLSPERQRFTLAHELGHIL 54 (122)
T ss_dssp EEEESSS-HHHHHHHHHHHHHHHH
T ss_pred EEECCCCCHHHHHHHHHHHHHHHH
Confidence 444444444455789999999987
No 45
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=87.17 E-value=0.24 Score=38.70 Aligned_cols=16 Identities=38% Similarity=0.355 Sum_probs=12.6
Q ss_pred ccchhhhHHhhhhhcC
Q psy14250 35 FSGIIVAAHEVGHLLG 50 (154)
Q Consensus 35 ~~~a~~~AHElGH~lG 50 (154)
..--+++||||||.|=
T Consensus 70 ~r~rFtlAHELGH~ll 85 (213)
T COG2856 70 ERKRFTLAHELGHALL 85 (213)
T ss_pred HHHHHHHHHHHhHHHh
Confidence 3345899999999883
No 46
>TIGR02232 myxo_disulf_rpt Myxococcus cysteine-rich repeat. This model represents a sequence region shared between several proteins of Myxococcus xanthus DK 1622 and some eukaryotic proteins that include human pappalysin-1. The region of about 40 amino acids contains several conserved Cys residues presumed to form disulfide bonds. The region appears in up to 13 repeats in Myxococcus.
Probab=86.54 E-value=0.32 Score=27.65 Aligned_cols=16 Identities=6% Similarity=0.252 Sum_probs=14.3
Q ss_pred CCCCCeEECccccccc
Q psy14250 122 RVLPGKLLSLDAQCRK 137 (154)
Q Consensus 122 ~~cgn~~~e~~eqCd~ 137 (154)
+.||+++++..||||=
T Consensus 2 ~~CGdG~~~~~EeCDD 17 (38)
T TIGR02232 2 PTCGDGIIEPGEECDD 17 (38)
T ss_pred CcCCCCcCcCCccccC
Confidence 5799999999999984
No 47
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=85.30 E-value=0.28 Score=33.66 Aligned_cols=18 Identities=22% Similarity=0.287 Sum_probs=13.8
Q ss_pred chhhhHHhhhhhcCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHD 54 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD 54 (154)
...++-|||||.||++.+
T Consensus 73 I~~tlvhEiah~fG~~~e 90 (97)
T PF06262_consen 73 IRDTLVHEIAHHFGISDE 90 (97)
T ss_dssp HHHHHHHHHHHHTT--HH
T ss_pred HHHHHHHHHHHHcCCCHH
Confidence 468999999999998754
No 48
>PF05547 Peptidase_M6: Immune inhibitor A peptidase M6; InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=84.96 E-value=0.22 Score=44.89 Aligned_cols=40 Identities=35% Similarity=0.194 Sum_probs=25.3
Q ss_pred cchhhhHHhhhhhcCCCCCCCCCCCCCCCCCCCCCCCCCCceeccC
Q psy14250 36 SGIIVAAHEVGHLLGAVHDGSPPPTYLGGPGAEKCRWEDGFIMSDL 81 (154)
Q Consensus 36 ~~a~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~C~~~~~~IMs~~ 81 (154)
.++.++|||+||.||++..-+... .+. .=+-....||+..
T Consensus 220 ~giGVfaHEfGH~LGLPDlYDT~~--~g~----~~~vg~WsLMs~G 259 (645)
T PF05547_consen 220 GGIGVFAHEFGHDLGLPDLYDTDY--SGG----GNGVGFWSLMSSG 259 (645)
T ss_pred CceEEEEeeccccCCCCCcccCCC--CCC----CCCcccceecCCC
Confidence 357899999999999996554310 000 0011356799985
No 49
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=82.63 E-value=0.6 Score=37.62 Aligned_cols=20 Identities=35% Similarity=0.446 Sum_probs=12.7
Q ss_pred chhhhHHhhhhhcCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGS 56 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~ 56 (154)
...++.||+||.||---|.-
T Consensus 216 ~~~v~vHE~GHsf~~LaDEY 235 (264)
T PF09471_consen 216 FKQVVVHEFGHSFGGLADEY 235 (264)
T ss_dssp HHHHHHHHHHHHTT------
T ss_pred ccceeeeecccccccccccc
Confidence 46899999999998655543
No 50
>KOG4525|consensus
Probab=82.60 E-value=0.79 Score=39.60 Aligned_cols=20 Identities=35% Similarity=0.612 Sum_probs=17.4
Q ss_pred hhhhHHhhhhhcCCCCCCCC
Q psy14250 38 IIVAAHEVGHLLGAVHDGSP 57 (154)
Q Consensus 38 a~~~AHElGH~lG~~HD~~~ 57 (154)
...+-|||||.||.+|..++
T Consensus 302 lGA~~HElGH~lgcpHq~~G 321 (614)
T KOG4525|consen 302 LGAVCHELGHCLGCPHQSEG 321 (614)
T ss_pred HHHHHHHhhhccCCCCCCCc
Confidence 45678999999999999876
No 51
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=82.40 E-value=0.56 Score=40.70 Aligned_cols=14 Identities=43% Similarity=0.482 Sum_probs=12.5
Q ss_pred chhhhHHhhhhhcC
Q psy14250 37 GIIVAAHEVGHLLG 50 (154)
Q Consensus 37 ~a~~~AHElGH~lG 50 (154)
.|.||||||||+.|
T Consensus 130 lagViAHEigHv~q 143 (484)
T COG4783 130 LAGVIAHEIGHVAQ 143 (484)
T ss_pred HHHHHHHHHHHHhh
Confidence 47899999999887
No 52
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=81.02 E-value=1.3 Score=39.20 Aligned_cols=22 Identities=27% Similarity=0.604 Sum_probs=16.8
Q ss_pred EEeeCCCccchhhhHHhhhhhc
Q psy14250 28 IIEDTGGFSGIIVAAHEVGHLL 49 (154)
Q Consensus 28 i~~~~~~~~~a~~~AHElGH~l 49 (154)
+..-.+.+..+.|+|||+||.+
T Consensus 369 l~N~~~~~~dv~TLaHElGHa~ 390 (591)
T TIGR00181 369 LMNWDGTLNSVFTLAHELGHSM 390 (591)
T ss_pred EEecCCCcchHHHHHHHhhhHH
Confidence 3343456777899999999987
No 53
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=79.09 E-value=0.9 Score=34.53 Aligned_cols=14 Identities=43% Similarity=0.522 Sum_probs=12.1
Q ss_pred chhhhHHhhhhhcC
Q psy14250 37 GIIVAAHEVGHLLG 50 (154)
Q Consensus 37 ~a~~~AHElGH~lG 50 (154)
.+.++||||||...
T Consensus 89 l~aVlaHElgH~~~ 102 (226)
T PF01435_consen 89 LAAVLAHELGHIKH 102 (226)
T ss_dssp HHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHc
Confidence 57899999999873
No 54
>PF13948 DUF4215: Domain of unknown function (DUF4215)
Probab=77.36 E-value=1.3 Score=26.07 Aligned_cols=18 Identities=17% Similarity=0.316 Sum_probs=15.8
Q ss_pred CCCCCCeEECcccccccc
Q psy14250 121 PRVLPGKLLSLDAQCRKD 138 (154)
Q Consensus 121 ~~~cgn~~~e~~eqCd~~ 138 (154)
.++||++++...||||-.
T Consensus 18 ~~~CGDgii~~~E~CDD~ 35 (47)
T PF13948_consen 18 EPICGDGIIVGDEQCDDG 35 (47)
T ss_pred ccccCCCeEECCcccCCC
Confidence 478999999999999954
No 55
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=76.39 E-value=2.2 Score=37.56 Aligned_cols=22 Identities=32% Similarity=0.386 Sum_probs=16.7
Q ss_pred EEeeCCCccchhhhHHhhhhhc
Q psy14250 28 IIEDTGGFSGIIVAAHEVGHLL 49 (154)
Q Consensus 28 i~~~~~~~~~a~~~AHElGH~l 49 (154)
+....+.+.-+.|+|||+||.+
T Consensus 328 ~~Nf~~t~~dv~TL~HElGHa~ 349 (549)
T TIGR02289 328 FSNFNGTSGDIDVLTHEAGHAF 349 (549)
T ss_pred EEeCCCChhHHHHHHHHhhHHH
Confidence 3344455667899999999987
No 56
>PF11350 DUF3152: Protein of unknown function (DUF3152); InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=74.70 E-value=1.6 Score=33.85 Aligned_cols=19 Identities=47% Similarity=0.644 Sum_probs=15.2
Q ss_pred chhhhHHhhhhhcCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDG 55 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~ 55 (154)
-.++|=||+||.||-.|..
T Consensus 139 RqYvINHEVGH~LGh~H~~ 157 (203)
T PF11350_consen 139 RQYVINHEVGHALGHGHEP 157 (203)
T ss_pred HHHhhhhhhhhhcccCCCc
Confidence 3689999999999965543
No 57
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=74.53 E-value=2.6 Score=37.82 Aligned_cols=19 Identities=32% Similarity=0.614 Sum_probs=15.1
Q ss_pred CCCccchhhhHHhhhhhcC
Q psy14250 32 TGGFSGIIVAAHEVGHLLG 50 (154)
Q Consensus 32 ~~~~~~a~~~AHElGH~lG 50 (154)
.+...-+.|+||||||.+=
T Consensus 375 ~gt~~dV~TLaHElGHs~H 393 (598)
T COG1164 375 DGTLRDVFTLAHELGHSVH 393 (598)
T ss_pred CCchhHHHHHHHHccHHHH
Confidence 3455668999999999874
No 58
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=74.39 E-value=1.3 Score=29.03 Aligned_cols=13 Identities=38% Similarity=0.715 Sum_probs=11.1
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
.-.++||||.|.+
T Consensus 61 ~~~llaHEl~Hv~ 73 (79)
T PF13699_consen 61 GRALLAHELAHVV 73 (79)
T ss_pred cchhHhHHHHHHH
Confidence 4679999999975
No 59
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=74.24 E-value=1.9 Score=32.59 Aligned_cols=20 Identities=30% Similarity=0.230 Sum_probs=16.7
Q ss_pred chhhhHHhhhhhcCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGS 56 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~ 56 (154)
.-.+++|||.|+.-+.|...
T Consensus 164 idYVvvHEL~Hl~~~nHs~~ 183 (205)
T PF01863_consen 164 IDYVVVHELCHLRHPNHSKR 183 (205)
T ss_pred HHHHHHHHHHHhccCCCCHH
Confidence 45899999999998888753
No 60
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=73.77 E-value=3.1 Score=34.90 Aligned_cols=17 Identities=35% Similarity=0.749 Sum_probs=13.7
Q ss_pred CCccchhhhHHhhhhhc
Q psy14250 33 GGFSGIIVAAHEVGHLL 49 (154)
Q Consensus 33 ~~~~~a~~~AHElGH~l 49 (154)
+.+....+++||+||.+
T Consensus 218 ~~~~~v~tl~HE~GHa~ 234 (427)
T cd06459 218 GTLDDVFTLAHELGHAF 234 (427)
T ss_pred CChhhHHHHHHHhhHHH
Confidence 44666899999999965
No 61
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=72.69 E-value=1.6 Score=33.12 Aligned_cols=13 Identities=31% Similarity=0.286 Sum_probs=11.7
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
...+|||||.|.|
T Consensus 36 lG~ilahel~haf 48 (206)
T PF01431_consen 36 LGFILAHELMHAF 48 (206)
T ss_dssp HHHHHHHHHHHCT
T ss_pred HHHHHHHHHHHHH
Confidence 5789999999988
No 62
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=71.54 E-value=1.6 Score=35.73 Aligned_cols=14 Identities=50% Similarity=0.700 Sum_probs=11.5
Q ss_pred ccchhhhHHhhhhh
Q psy14250 35 FSGIIVAAHEVGHL 48 (154)
Q Consensus 35 ~~~a~~~AHElGH~ 48 (154)
|..+.+||||.||-
T Consensus 168 ~a~ayVlAHEyGHH 181 (292)
T PF04228_consen 168 FAQAYVLAHEYGHH 181 (292)
T ss_pred HHHHHHHHHHHHHH
Confidence 44678999999994
No 63
>PRK03982 heat shock protein HtpX; Provisional
Probab=70.43 E-value=2 Score=34.75 Aligned_cols=13 Identities=38% Similarity=0.268 Sum_probs=11.3
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
...++||||||.-
T Consensus 125 l~AVlAHElgHi~ 137 (288)
T PRK03982 125 LEGVIAHELTHIK 137 (288)
T ss_pred HHHHHHHHHHHHH
Confidence 5789999999975
No 64
>PRK03001 M48 family peptidase; Provisional
Probab=70.24 E-value=2 Score=34.62 Aligned_cols=13 Identities=38% Similarity=0.291 Sum_probs=11.2
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
...++||||||.-
T Consensus 124 l~aVlAHElgHi~ 136 (283)
T PRK03001 124 IRGVMAHELAHVK 136 (283)
T ss_pred HHHHHHHHHHHHh
Confidence 5789999999965
No 65
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=69.91 E-value=2.1 Score=34.17 Aligned_cols=13 Identities=46% Similarity=0.549 Sum_probs=11.3
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
...++|||+||..
T Consensus 157 l~aVlaHElgHi~ 169 (302)
T COG0501 157 LEAVLAHELGHIK 169 (302)
T ss_pred HHHHHHHHHHHHh
Confidence 5789999999975
No 66
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=69.77 E-value=2.3 Score=37.37 Aligned_cols=17 Identities=29% Similarity=0.350 Sum_probs=13.8
Q ss_pred chhhhHHhhhhhcCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVH 53 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~H 53 (154)
...+++|||+|.||...
T Consensus 210 ~~~~~~HEi~HaLGFs~ 226 (521)
T PF01457_consen 210 FFRTVIHEIAHALGFSS 226 (521)
T ss_dssp HHHHHHHHHHHHTT-SH
T ss_pred ccceeeeeeeeeeeecc
Confidence 45799999999999765
No 67
>PRK01345 heat shock protein HtpX; Provisional
Probab=68.96 E-value=2.2 Score=35.15 Aligned_cols=13 Identities=38% Similarity=0.363 Sum_probs=11.4
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
...++||||||.-
T Consensus 124 L~aVlAHElgHi~ 136 (317)
T PRK01345 124 VAGVMAHELAHVK 136 (317)
T ss_pred HHHHHHHHHHHHH
Confidence 5789999999976
No 68
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=68.42 E-value=3.3 Score=36.75 Aligned_cols=18 Identities=33% Similarity=0.538 Sum_probs=14.9
Q ss_pred CCCccchhhhHHhhhhhc
Q psy14250 32 TGGFSGIIVAAHEVGHLL 49 (154)
Q Consensus 32 ~~~~~~a~~~AHElGH~l 49 (154)
.+.+..+.|+|||+||.+
T Consensus 370 ~~~~~~v~TL~HE~GHa~ 387 (587)
T TIGR02290 370 DGSRRDVSTLAHELGHAY 387 (587)
T ss_pred CCCchhHHHHHHHhhHHH
Confidence 345667899999999988
No 69
>PRK03072 heat shock protein HtpX; Provisional
Probab=68.14 E-value=2.4 Score=34.44 Aligned_cols=14 Identities=29% Similarity=0.273 Sum_probs=11.7
Q ss_pred chhhhHHhhhhhcC
Q psy14250 37 GIIVAAHEVGHLLG 50 (154)
Q Consensus 37 ~a~~~AHElGH~lG 50 (154)
...++||||||...
T Consensus 127 l~aVlAHElgHi~~ 140 (288)
T PRK03072 127 LRGVLGHELSHVYN 140 (288)
T ss_pred HHHHHHHHHHHHhc
Confidence 57899999999753
No 70
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=67.39 E-value=2.4 Score=28.70 Aligned_cols=14 Identities=36% Similarity=0.107 Sum_probs=11.9
Q ss_pred chhhhHHhhhhhcC
Q psy14250 37 GIIVAAHEVGHLLG 50 (154)
Q Consensus 37 ~a~~~AHElGH~lG 50 (154)
...+++||++|.+=
T Consensus 25 ~~~~l~HE~~H~~~ 38 (128)
T PF13485_consen 25 LDRVLAHELAHQWF 38 (128)
T ss_pred HHHHHHHHHHHHHH
Confidence 45899999999983
No 71
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=67.31 E-value=3 Score=30.76 Aligned_cols=12 Identities=42% Similarity=0.517 Sum_probs=10.3
Q ss_pred hhhhHHhhhhhc
Q psy14250 38 IIVAAHEVGHLL 49 (154)
Q Consensus 38 a~~~AHElGH~l 49 (154)
..|+||||+|-+
T Consensus 136 lDVvaHEltHGV 147 (150)
T PF01447_consen 136 LDVVAHELTHGV 147 (150)
T ss_dssp HHHHHHHHHHHH
T ss_pred cceeeecccccc
Confidence 689999999953
No 72
>PRK04897 heat shock protein HtpX; Provisional
Probab=65.56 E-value=2.8 Score=34.19 Aligned_cols=13 Identities=31% Similarity=0.289 Sum_probs=11.2
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
...++||||||.-
T Consensus 137 l~aVlAHElgHi~ 149 (298)
T PRK04897 137 LEGVIGHEISHIR 149 (298)
T ss_pred HHHHHHHHHHHHh
Confidence 5789999999965
No 73
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=65.42 E-value=2.6 Score=36.01 Aligned_cols=15 Identities=27% Similarity=0.474 Sum_probs=12.6
Q ss_pred ccchhhhHHhhhhhc
Q psy14250 35 FSGIIVAAHEVGHLL 49 (154)
Q Consensus 35 ~~~a~~~AHElGH~l 49 (154)
+..+.|++||+||.+
T Consensus 240 ~~~v~tLfHE~GHa~ 254 (458)
T PF01432_consen 240 HDDVETLFHEFGHAM 254 (458)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred hhhHHHHHHHHhHHH
Confidence 345899999999976
No 74
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=65.31 E-value=2.8 Score=32.87 Aligned_cols=21 Identities=38% Similarity=0.436 Sum_probs=16.7
Q ss_pred cchhhhHHhhhhhcCCC-CCCC
Q psy14250 36 SGIIVAAHEVGHLLGAV-HDGS 56 (154)
Q Consensus 36 ~~a~~~AHElGH~lG~~-HD~~ 56 (154)
+.-.+.-|||||.||+. |.+-
T Consensus 186 ~L~~tarhElGhaLgi~ghsd~ 207 (236)
T COG5549 186 NLNPTARHELGHALGIWGHSDL 207 (236)
T ss_pred hhhHHHHHhhcchheecccccc
Confidence 45678899999999998 5543
No 75
>PRK05457 heat shock protein HtpX; Provisional
Probab=64.99 E-value=3 Score=33.83 Aligned_cols=13 Identities=38% Similarity=0.475 Sum_probs=11.1
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
...++||||||.-
T Consensus 134 l~aVlAHElgHi~ 146 (284)
T PRK05457 134 VEAVLAHEISHIA 146 (284)
T ss_pred HHHHHHHHHHHHH
Confidence 5789999999964
No 76
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=64.25 E-value=2.8 Score=36.23 Aligned_cols=15 Identities=27% Similarity=0.496 Sum_probs=12.9
Q ss_pred ccchhhhHHhhhhhc
Q psy14250 35 FSGIIVAAHEVGHLL 49 (154)
Q Consensus 35 ~~~a~~~AHElGH~l 49 (154)
+..+.|++||+||.+
T Consensus 261 ~~~V~TLfHEfGHal 275 (472)
T cd06455 261 HDEVETFFHEFGHVI 275 (472)
T ss_pred HHHHHHHHHHHHHHH
Confidence 456899999999987
No 77
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=63.90 E-value=3.2 Score=37.11 Aligned_cols=22 Identities=41% Similarity=0.496 Sum_probs=18.4
Q ss_pred chhhhHHhhhhhc---CCCCCCCCC
Q psy14250 37 GIIVAAHEVGHLL---GAVHDGSPP 58 (154)
Q Consensus 37 ~a~~~AHElGH~l---G~~HD~~~~ 58 (154)
...+|+|||||-| |+..|.++.
T Consensus 487 IGaVIgHEI~HgFDdqGakfD~~Gn 511 (654)
T COG3590 487 IGAVIGHEIGHGFDDQGAKFDGDGN 511 (654)
T ss_pred ccceehhhhcccccCCccccCCCCc
Confidence 3579999999999 788888775
No 78
>PRK02391 heat shock protein HtpX; Provisional
Probab=63.52 E-value=3.2 Score=33.89 Aligned_cols=13 Identities=38% Similarity=0.365 Sum_probs=10.9
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
...++||||||.-
T Consensus 133 l~aVlaHElgHi~ 145 (296)
T PRK02391 133 LEAVLAHELSHVK 145 (296)
T ss_pred HHHHHHHHHHHHH
Confidence 5689999999954
No 79
>PRK01265 heat shock protein HtpX; Provisional
Probab=62.13 E-value=3.5 Score=34.22 Aligned_cols=12 Identities=58% Similarity=0.747 Sum_probs=10.5
Q ss_pred chhhhHHhhhhh
Q psy14250 37 GIIVAAHEVGHL 48 (154)
Q Consensus 37 ~a~~~AHElGH~ 48 (154)
...++||||||.
T Consensus 140 l~aVlAHElgHi 151 (324)
T PRK01265 140 IKAVAGHELGHL 151 (324)
T ss_pred HHHHHHHHHHHH
Confidence 568999999995
No 80
>KOG3624|consensus
Probab=61.70 E-value=4.4 Score=36.64 Aligned_cols=21 Identities=33% Similarity=0.252 Sum_probs=15.1
Q ss_pred chhhhHHhhhhhc---CCCCCCCC
Q psy14250 37 GIIVAAHEVGHLL---GAVHDGSP 57 (154)
Q Consensus 37 ~a~~~AHElGH~l---G~~HD~~~ 57 (154)
.+.+|+|||+|.| |...|..+
T Consensus 518 iG~vigHEl~H~FD~~G~~~D~~G 541 (687)
T KOG3624|consen 518 IGFVIGHELTHGFDDQGRQYDESG 541 (687)
T ss_pred HHHHHHHHHhhcccccccccCCCc
Confidence 5789999999988 44444443
No 81
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=61.42 E-value=3.4 Score=33.60 Aligned_cols=16 Identities=50% Similarity=0.754 Sum_probs=12.7
Q ss_pred CccchhhhHHhhhhhc
Q psy14250 34 GFSGIIVAAHEVGHLL 49 (154)
Q Consensus 34 ~~~~a~~~AHElGH~l 49 (154)
-|+-++++|||+||-+
T Consensus 164 dfAqaYViAHEVGHHV 179 (295)
T COG2321 164 DFAQAYVIAHEVGHHV 179 (295)
T ss_pred cHHHHHHHHhhhhHHH
Confidence 3667899999999843
No 82
>PRK02870 heat shock protein HtpX; Provisional
Probab=61.34 E-value=3.8 Score=34.17 Aligned_cols=12 Identities=42% Similarity=0.517 Sum_probs=10.8
Q ss_pred chhhhHHhhhhh
Q psy14250 37 GIIVAAHEVGHL 48 (154)
Q Consensus 37 ~a~~~AHElGH~ 48 (154)
...++||||||.
T Consensus 173 L~aVlAHELgHi 184 (336)
T PRK02870 173 LQAVMAHELSHI 184 (336)
T ss_pred HHHHHHHHHHHH
Confidence 578999999997
No 83
>KOG2719|consensus
Probab=61.02 E-value=4.9 Score=34.63 Aligned_cols=36 Identities=33% Similarity=0.418 Sum_probs=21.9
Q ss_pred eeeccCCcccCCccCCccceEEEEeeCC-------CccchhhhHHhhhhh
Q psy14250 6 SFAYVGGACVVNKRLEKVNSVAIIEDTG-------GFSGIIVAAHEVGHL 48 (154)
Q Consensus 6 G~A~~g~~C~~~~~~~~~~s~~i~~~~~-------~~~~a~~~AHElGH~ 48 (154)
--||+-|.|... ...+....- ..-.+.+++|||||-
T Consensus 249 sNAyfyG~~~~K-------RIvIyDtLl~~~~~~~~eel~AVl~HELGHW 291 (428)
T KOG2719|consen 249 SNAYFYGLCKNK-------RIVIYDTLLLEEEHLNNEELVAVLAHELGHW 291 (428)
T ss_pred CCeeeeeccccc-------eEEEehhhhhhhhccccHHHHHHHHHHhhHH
Confidence 347777888765 243332211 123578999999993
No 84
>PTZ00337 surface protease GP63; Provisional
Probab=60.58 E-value=3.5 Score=36.85 Aligned_cols=17 Identities=59% Similarity=0.864 Sum_probs=14.6
Q ss_pred chhhhHHhhhhhcCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVH 53 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~H 53 (154)
...+++|||.|.||...
T Consensus 229 ~~~v~~HEi~HALGFs~ 245 (567)
T PTZ00337 229 DVRVAAHELGHALGFVR 245 (567)
T ss_pred HHHHHHHHHHHHHccCH
Confidence 46899999999999854
No 85
>PF08325 WLM: WLM domain; InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=58.52 E-value=5.5 Score=30.42 Aligned_cols=21 Identities=24% Similarity=0.292 Sum_probs=18.0
Q ss_pred cchhhhHHhhhhhcCCCCCCC
Q psy14250 36 SGIIVAAHEVGHLLGAVHDGS 56 (154)
Q Consensus 36 ~~a~~~AHElGH~lG~~HD~~ 56 (154)
....|+-|||+|+.=.+||..
T Consensus 81 ~i~~t~lHELaH~~~~~H~~~ 101 (186)
T PF08325_consen 81 TILGTMLHELAHNVHGPHDDK 101 (186)
T ss_pred HHHHHHHHHHHhcccCCccHH
Confidence 357899999999998888874
No 86
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=57.23 E-value=6.6 Score=30.67 Aligned_cols=20 Identities=20% Similarity=0.013 Sum_probs=14.3
Q ss_pred eCCCccchhhhHHhhhhhcC
Q psy14250 31 DTGGFSGIIVAAHEVGHLLG 50 (154)
Q Consensus 31 ~~~~~~~a~~~AHElGH~lG 50 (154)
......+..++|||++|..=
T Consensus 87 GLPrll~gsiLAHE~mHa~L 106 (212)
T PF12315_consen 87 GLPRLLTGSILAHELMHAWL 106 (212)
T ss_pred CCCHHHHhhHHHHHHHHHHh
Confidence 33334457899999999873
No 87
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=56.26 E-value=4.8 Score=34.77 Aligned_cols=14 Identities=29% Similarity=0.508 Sum_probs=12.1
Q ss_pred cchhhhHHhhhhhc
Q psy14250 36 SGIIVAAHEVGHLL 49 (154)
Q Consensus 36 ~~a~~~AHElGH~l 49 (154)
.-+.|++||+||.+
T Consensus 247 ~~v~TLfHEfGHal 260 (458)
T cd06457 247 HEVETLFHEMGHAM 260 (458)
T ss_pred HHHHHHHHHHhHHH
Confidence 45789999999987
No 88
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=55.00 E-value=5.6 Score=31.20 Aligned_cols=19 Identities=32% Similarity=0.289 Sum_probs=15.6
Q ss_pred chhhhHHhhhhhcCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDG 55 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~ 55 (154)
.-.+++|||+|+.=+.|-.
T Consensus 175 i~YVvvHELaHLke~nHs~ 193 (223)
T COG1451 175 IDYVVVHELAHLKEKNHSK 193 (223)
T ss_pred HHHHHHHHHHHHhhhhccH
Confidence 3589999999999887654
No 89
>PF13402 M60-like: Peptidase M60-like family; PDB: 4FCA_A.
Probab=54.83 E-value=4.9 Score=32.31 Aligned_cols=15 Identities=27% Similarity=0.040 Sum_probs=10.6
Q ss_pred hhhhHHhhhhhcCCC
Q psy14250 38 IIVAAHEVGHLLGAV 52 (154)
Q Consensus 38 a~~~AHElGH~lG~~ 52 (154)
..-++|||||++=..
T Consensus 220 ~WG~~HE~GH~~Q~~ 234 (307)
T PF13402_consen 220 GWGPWHELGHNHQQG 234 (307)
T ss_dssp -HHHHHHHHHHH-BG
T ss_pred CeeehhhhhhhcCcc
Confidence 346899999999544
No 90
>KOG2414|consensus
Probab=54.75 E-value=5.2 Score=34.45 Aligned_cols=19 Identities=42% Similarity=0.691 Sum_probs=14.8
Q ss_pred hhHHhhhhhcCCC-CCCCCC
Q psy14250 40 VAAHEVGHLLGAV-HDGSPP 58 (154)
Q Consensus 40 ~~AHElGH~lG~~-HD~~~~ 58 (154)
.--|-+||.|||+ ||.+..
T Consensus 399 lcPHhVgHyLGmDVHD~p~v 418 (488)
T KOG2414|consen 399 LCPHHVGHYLGMDVHDCPTV 418 (488)
T ss_pred cCCcccchhcCcccccCCCC
Confidence 3358899999998 887653
No 91
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=54.63 E-value=5.5 Score=31.36 Aligned_cols=25 Identities=44% Similarity=0.678 Sum_probs=16.9
Q ss_pred eEEEEeeC---CCccchhhhHHhhhhhc
Q psy14250 25 SVAIIEDT---GGFSGIIVAAHEVGHLL 49 (154)
Q Consensus 25 s~~i~~~~---~~~~~a~~~AHElGH~l 49 (154)
.+-+.++. .+.+.+.+.|||.||.+
T Consensus 74 ~vrLS~~vy~~~SiaAvaVAAHEvGHAi 101 (222)
T PF04298_consen 74 VVRLSEDVYNGRSIAAVAVAAHEVGHAI 101 (222)
T ss_pred EEEeCCccCCCCCHHHHHHHHHHHhHHH
Confidence 34444443 23567889999999976
No 92
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=54.56 E-value=6.2 Score=35.61 Aligned_cols=16 Identities=38% Similarity=0.463 Sum_probs=14.1
Q ss_pred chhhhHHhhhhhcCCC
Q psy14250 37 GIIVAAHEVGHLLGAV 52 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~ 52 (154)
...+++|||.|.||..
T Consensus 256 ~~rv~~HEi~HALGFS 271 (622)
T PTZ00257 256 TTRTVTHEVAHALGFS 271 (622)
T ss_pred HHHHHHHHHHHHhcCC
Confidence 4689999999999984
No 93
>smart00050 DISIN Homologues of snake disintegrins. Snake disintegrins inhibit the binding of ligands to integrin receptors. They contain a 'RGD' sequence, identical to the recognition site of many adhesion proteins. Molecules containing both disintegrin and metalloprotease domains are known as ADAMs.
Probab=54.35 E-value=5.8 Score=25.89 Aligned_cols=12 Identities=8% Similarity=0.169 Sum_probs=9.0
Q ss_pred CcccccccccCC
Q psy14250 130 SLDAQCRKDRGT 141 (154)
Q Consensus 130 e~~eqCd~~~g~ 141 (154)
|.+||||+.+..
T Consensus 1 E~gEeCDCG~~~ 12 (75)
T smart00050 1 EEGEECDCGSPK 12 (75)
T ss_pred CcCcccCCCChh
Confidence 568888887764
No 94
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=52.70 E-value=5.4 Score=30.59 Aligned_cols=18 Identities=39% Similarity=0.696 Sum_probs=13.7
Q ss_pred ccchhhhHHhhhhhcCCC
Q psy14250 35 FSGIIVAAHEVGHLLGAV 52 (154)
Q Consensus 35 ~~~a~~~AHElGH~lG~~ 52 (154)
+....++.||+||.+-+-
T Consensus 20 ~~~l~t~~HE~gHal~a~ 37 (200)
T PF13398_consen 20 FRLLVTFVHELGHALAAL 37 (200)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 344679999999988643
No 95
>PF08219 TOM13: Outer membrane protein TOM13; InterPro: IPR013262 The TOM13 family of proteins are mitochondrial outer membrane proteins that mediate the assembly of beta-barrel proteins [].; GO: 0005741 mitochondrial outer membrane
Probab=51.75 E-value=7.3 Score=25.51 Aligned_cols=17 Identities=29% Similarity=0.135 Sum_probs=14.2
Q ss_pred cchhhhHHhhhhhcCCC
Q psy14250 36 SGIIVAAHEVGHLLGAV 52 (154)
Q Consensus 36 ~~a~~~AHElGH~lG~~ 52 (154)
...-++|||+++.+|-.
T Consensus 51 GFGEl~AhE~~fr~gW~ 67 (77)
T PF08219_consen 51 GFGELFAHEIAFRLGWS 67 (77)
T ss_pred hHHHHHHHHHHHHhccC
Confidence 45789999999999873
No 96
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=50.77 E-value=8.7 Score=23.90 Aligned_cols=20 Identities=20% Similarity=0.084 Sum_probs=17.1
Q ss_pred hhhhHHhhhhhcCCCCCCCC
Q psy14250 38 IIVAAHEVGHLLGAVHDGSP 57 (154)
Q Consensus 38 a~~~AHElGH~lG~~HD~~~ 57 (154)
=..+.|.|+|.||+.|-..+
T Consensus 30 eRriih~la~~lGL~~~s~G 49 (60)
T cd02639 30 ERRIVHLLASRLGLNHVSDG 49 (60)
T ss_pred HHHHHHHHHHHcCCceEEeC
Confidence 47789999999999998665
No 97
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=50.71 E-value=6.5 Score=32.44 Aligned_cols=15 Identities=33% Similarity=0.470 Sum_probs=12.7
Q ss_pred ccchhhhHHhhhhhc
Q psy14250 35 FSGIIVAAHEVGHLL 49 (154)
Q Consensus 35 ~~~a~~~AHElGH~l 49 (154)
+....+++||+||.+
T Consensus 152 ~~~v~tl~HE~GHa~ 166 (365)
T cd06258 152 HDDINTLFHEFGHAV 166 (365)
T ss_pred HHHHHHHHHHHhHHH
Confidence 456789999999987
No 98
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=49.89 E-value=6.8 Score=30.78 Aligned_cols=13 Identities=38% Similarity=0.613 Sum_probs=11.2
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
...|+-||+||.|
T Consensus 92 ~~~~l~HE~GHAl 104 (220)
T PF14247_consen 92 VLFTLYHELGHAL 104 (220)
T ss_pred HHHHHHHHHHHHH
Confidence 5689999999976
No 99
>PF14891 Peptidase_M91: Effector protein
Probab=48.91 E-value=8.2 Score=28.84 Aligned_cols=20 Identities=20% Similarity=0.210 Sum_probs=14.7
Q ss_pred chhhhHHhhhhhcCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGS 56 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~ 56 (154)
.+.+++|||+|..-.-+...
T Consensus 103 p~v~L~HEL~HA~~~~~Gt~ 122 (174)
T PF14891_consen 103 PFVVLYHELIHAYDYMNGTM 122 (174)
T ss_pred HHHHHHHHHHHHHHHHCCCC
Confidence 36899999999875544443
No 100
>KOG1047|consensus
Probab=47.49 E-value=8.3 Score=34.42 Aligned_cols=13 Identities=38% Similarity=0.419 Sum_probs=11.2
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
.+.+|||||+|+-
T Consensus 288 l~~vIaHEIAHSW 300 (613)
T KOG1047|consen 288 LVDVIAHEIAHSW 300 (613)
T ss_pred hhhHHHHHhhhhh
Confidence 4899999999964
No 101
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=46.27 E-value=8.8 Score=28.62 Aligned_cols=20 Identities=40% Similarity=0.562 Sum_probs=14.2
Q ss_pred hhhhHHhhhhhcCCCCCCCC
Q psy14250 38 IIVAAHEVGHLLGAVHDGSP 57 (154)
Q Consensus 38 a~~~AHElGH~lG~~HD~~~ 57 (154)
..++.||+||.+-+..-+..
T Consensus 8 i~i~~HE~gH~~~a~~~G~~ 27 (192)
T PF02163_consen 8 ISIVLHELGHALAARLYGDK 27 (192)
T ss_dssp HHHHHHHHHHHHHHHTTT--
T ss_pred cccccccccccccccccccc
Confidence 56889999999976654443
No 102
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=45.83 E-value=8.8 Score=32.87 Aligned_cols=15 Identities=33% Similarity=0.527 Sum_probs=12.8
Q ss_pred ccchhhhHHhhhhhc
Q psy14250 35 FSGIIVAAHEVGHLL 49 (154)
Q Consensus 35 ~~~a~~~AHElGH~l 49 (154)
+..+.|++||+||.+
T Consensus 206 ~~~v~tLfHEfGHal 220 (422)
T cd06456 206 HDEVTTLFHEFGHAL 220 (422)
T ss_pred HHHHHHHHHHHHHHH
Confidence 456789999999988
No 103
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=44.68 E-value=9.4 Score=32.59 Aligned_cols=15 Identities=40% Similarity=0.603 Sum_probs=12.6
Q ss_pred ccchhhhHHhhhhhc
Q psy14250 35 FSGIIVAAHEVGHLL 49 (154)
Q Consensus 35 ~~~a~~~AHElGH~l 49 (154)
.....+++||+||.+
T Consensus 157 ~~~l~t~iHE~GHal 171 (396)
T cd06460 157 RSALFSTIHETGHAL 171 (396)
T ss_pred HHHHHHHHHHhhHHH
Confidence 456789999999987
No 104
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=44.18 E-value=9.4 Score=29.69 Aligned_cols=20 Identities=50% Similarity=0.725 Sum_probs=15.1
Q ss_pred ccchhhhHHhhhhhcCCCCC
Q psy14250 35 FSGIIVAAHEVGHLLGAVHD 54 (154)
Q Consensus 35 ~~~a~~~AHElGH~lG~~HD 54 (154)
.+.+.+.|||.||.+-=.||
T Consensus 90 ia~~aVAAHEVGHAiQd~~~ 109 (226)
T COG2738 90 IAAIAVAAHEVGHAIQDQED 109 (226)
T ss_pred HHHHHHHHHHhhHHHhhhcc
Confidence 55678999999998754443
No 105
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=42.00 E-value=12 Score=33.68 Aligned_cols=13 Identities=38% Similarity=0.509 Sum_probs=11.0
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
.+.++||||+|+-
T Consensus 279 ~~~viaHElAHqW 291 (601)
T TIGR02411 279 NVDVIAHELAHSW 291 (601)
T ss_pred hhhhHHHHHHhhc
Confidence 4679999999975
No 106
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=41.97 E-value=11 Score=28.57 Aligned_cols=15 Identities=47% Similarity=0.780 Sum_probs=11.8
Q ss_pred hhhhHHhhhhhcCCC
Q psy14250 38 IIVAAHEVGHLLGAV 52 (154)
Q Consensus 38 a~~~AHElGH~lG~~ 52 (154)
..++-||+||.+-+.
T Consensus 10 ~~v~iHElGH~~~Ar 24 (182)
T cd06163 10 ILIFVHELGHFLVAK 24 (182)
T ss_pred HHHHHHHHHHHHHHH
Confidence 578999999987443
No 107
>KOG2661|consensus
Probab=41.28 E-value=12 Score=31.62 Aligned_cols=13 Identities=31% Similarity=0.429 Sum_probs=11.7
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
.|.+++||+||.+
T Consensus 275 lAtvLgHE~aHaV 287 (424)
T KOG2661|consen 275 LATVLGHEIAHAV 287 (424)
T ss_pred HHHHHHHHHHHHH
Confidence 5899999999987
No 108
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=41.15 E-value=11 Score=27.17 Aligned_cols=14 Identities=21% Similarity=-0.076 Sum_probs=11.6
Q ss_pred chhhhHHhhhhhcC
Q psy14250 37 GIIVAAHEVGHLLG 50 (154)
Q Consensus 37 ~a~~~AHElGH~lG 50 (154)
...|+.|||.|.+=
T Consensus 59 l~~~l~HEm~H~~~ 72 (146)
T smart00731 59 LRETLLHELCHAAL 72 (146)
T ss_pred HHhhHHHHHHHHHH
Confidence 45799999999873
No 109
>PF10463 Peptidase_U49: Peptidase U49; InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death.
Probab=40.66 E-value=12 Score=29.19 Aligned_cols=20 Identities=25% Similarity=0.506 Sum_probs=15.0
Q ss_pred cchhhhHHhhhhhcCCCCCCC
Q psy14250 36 SGIIVAAHEVGHLLGAVHDGS 56 (154)
Q Consensus 36 ~~a~~~AHElGH~lG~~HD~~ 56 (154)
+.+.+++||++|.. +.|...
T Consensus 100 A~~fil~HE~~Hv~-~~h~~~ 119 (206)
T PF10463_consen 100 AIAFILLHELAHVV-LGHEGD 119 (206)
T ss_pred HHHHHHHHHHHHHH-HcCccc
Confidence 56899999999954 555554
No 110
>PRK04351 hypothetical protein; Provisional
Probab=39.69 E-value=12 Score=27.50 Aligned_cols=13 Identities=31% Similarity=0.271 Sum_probs=10.9
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
...|++|||.|..
T Consensus 61 l~~vv~HElcH~~ 73 (149)
T PRK04351 61 LIGIIKHELCHYH 73 (149)
T ss_pred HHhhHHHHHHHHH
Confidence 4689999999974
No 111
>PRK04860 hypothetical protein; Provisional
Probab=39.68 E-value=12 Score=27.81 Aligned_cols=13 Identities=46% Similarity=0.480 Sum_probs=10.3
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
...||+|||+|.+
T Consensus 63 l~~~v~HEl~H~~ 75 (160)
T PRK04860 63 IDEVVPHELAHLL 75 (160)
T ss_pred HHhHHHHHHHHHH
Confidence 4678899999864
No 112
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=39.25 E-value=16 Score=22.53 Aligned_cols=21 Identities=24% Similarity=0.303 Sum_probs=17.7
Q ss_pred chhhhHHhhhhhcCCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGSP 57 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~~ 57 (154)
.-..+.|+|++.+|+.|...+
T Consensus 29 ~eR~~vH~lA~~~gL~s~S~G 49 (60)
T cd02641 29 HDRLLVHELAEELGLRHESTG 49 (60)
T ss_pred HHHHHHHHHHHHcCCceEeeC
Confidence 347789999999999998765
No 113
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=39.16 E-value=13 Score=27.44 Aligned_cols=19 Identities=47% Similarity=0.569 Sum_probs=14.2
Q ss_pred hhhhHHhhhhhcCCCCCCC
Q psy14250 38 IIVAAHEVGHLLGAVHDGS 56 (154)
Q Consensus 38 a~~~AHElGH~lG~~HD~~ 56 (154)
..++.||+||.+-+..-+.
T Consensus 9 i~i~iHE~gH~~~A~~~G~ 27 (180)
T cd05709 9 ISVTVHELGHALVARRLGV 27 (180)
T ss_pred HHHHHHHHHHHHHHHHcCC
Confidence 4688999999986654443
No 114
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=36.00 E-value=16 Score=28.62 Aligned_cols=20 Identities=40% Similarity=0.315 Sum_probs=14.4
Q ss_pred chhhhHHhhhhhcCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGS 56 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~ 56 (154)
...++.||+||.+=+..-+.
T Consensus 53 ~~~v~iHElgH~~~A~~~G~ 72 (227)
T cd06164 53 FASVLLHELGHSLVARRYGI 72 (227)
T ss_pred HHHHHHHHHHHHHHHHHcCC
Confidence 35688999999986554443
No 115
>PRK10911 oligopeptidase A; Provisional
Probab=35.83 E-value=15 Score=33.47 Aligned_cols=14 Identities=43% Similarity=0.600 Sum_probs=12.2
Q ss_pred cchhhhHHhhhhhc
Q psy14250 36 SGIIVAAHEVGHLL 49 (154)
Q Consensus 36 ~~a~~~AHElGH~l 49 (154)
.-+.|++||+||.|
T Consensus 462 ~~v~tlfHEfGHal 475 (680)
T PRK10911 462 DEVITLFHEFGHGL 475 (680)
T ss_pred HHHHHHHHHHhHHH
Confidence 45789999999988
No 116
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=34.97 E-value=17 Score=27.93 Aligned_cols=20 Identities=40% Similarity=0.390 Sum_probs=14.7
Q ss_pred chhhhHHhhhhhcCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGS 56 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~ 56 (154)
...++.||+||.+=+.+-+.
T Consensus 38 ~~~v~iHElgH~~~A~~~G~ 57 (208)
T cd06161 38 FLSVLLHELGHALVARRYGI 57 (208)
T ss_pred HHHHHHHHHHHHHHHHHcCC
Confidence 46789999999886554443
No 117
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=34.84 E-value=17 Score=27.05 Aligned_cols=13 Identities=38% Similarity=0.378 Sum_probs=11.0
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
...||.|||+|.+
T Consensus 61 ~~~vV~HELaHl~ 73 (156)
T COG3091 61 IEQVVPHELAHLH 73 (156)
T ss_pred HHHHHHHHHHHHH
Confidence 4689999999975
No 118
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=34.80 E-value=17 Score=26.09 Aligned_cols=15 Identities=27% Similarity=0.158 Sum_probs=12.3
Q ss_pred chhhhHHhhhhhcCC
Q psy14250 37 GIIVAAHEVGHLLGA 51 (154)
Q Consensus 37 ~a~~~AHElGH~lG~ 51 (154)
...|+.|||.|..--
T Consensus 60 ~~~tL~HEm~H~~~~ 74 (157)
T PF10263_consen 60 LIDTLLHEMAHAAAY 74 (157)
T ss_pred HHHHHHHHHHHHHhh
Confidence 568999999998753
No 119
>KOG1046|consensus
Probab=34.57 E-value=18 Score=34.11 Aligned_cols=13 Identities=38% Similarity=0.355 Sum_probs=11.5
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
.|.+|||||+|.-
T Consensus 324 va~vIaHElAHQW 336 (882)
T KOG1046|consen 324 VAEVIAHELAHQW 336 (882)
T ss_pred HHHHHHHHHHHHH
Confidence 6899999999974
No 120
>PF02130 UPF0054: Uncharacterized protein family UPF0054; InterPro: IPR002036 These, as yet, uncharacterised proteins are of 17 to 21 kDa. They contain a conserved region with three histidines at the C terminus. The protein family is represented by a single member sequence only in nearly every bacterium. The crystal structure of the protein from the hyperthermophilic bacteria Aquifex aeolicus has been determined. The overall fold consists of one central alpha-helix surrounded by a four-stranded beta-sheet and four other alpha-helices. Structure-based homology analysis reveals a good resemblance to the metal-dependent proteinases such as collagenases and gelatinases. However, experimental tests for collagenase and gelatinase-type function show no detectable activity under standard assay conditions [].; GO: 0046872 metal ion binding; PDB: 1TVI_A 1OZ9_A 1XM5_A 1XAX_A.
Probab=33.56 E-value=29 Score=25.19 Aligned_cols=20 Identities=30% Similarity=0.318 Sum_probs=16.6
Q ss_pred chhhhHHhhhhhcCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGS 56 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~ 56 (154)
.+.+++|=+-|++|-.|...
T Consensus 108 l~~l~vHG~LHLlGyDH~~~ 127 (145)
T PF02130_consen 108 LARLLVHGLLHLLGYDHETE 127 (145)
T ss_dssp HHHHHHHHHHHHTT-SSTTT
T ss_pred HhHHHHHHHHHHcCCCCCCH
Confidence 57899999999999998853
No 121
>TIGR00043 metalloprotein, YbeY/UPF0054 family. This minimally protein family is represented by a single member sequence only in nearly every bacterium. Crystallography demonstrates metal-binding activity, possibly to nickel. It is a predicted to be a metallohydrolase, and somewhat more weakly to be a metalloproteinase.
Probab=33.29 E-value=29 Score=24.11 Aligned_cols=21 Identities=29% Similarity=0.324 Sum_probs=18.2
Q ss_pred chhhhHHhhhhhcCCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGSP 57 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~~ 57 (154)
.+.+++|=+-|++|-.|..+.
T Consensus 73 ~~~l~iHG~LHLlGyDH~~~~ 93 (110)
T TIGR00043 73 LAHLTVHGLLHLLGYDHETED 93 (110)
T ss_pred HHHHHHHHHHHHcCCCCCChH
Confidence 578999999999999997653
No 122
>PRK00016 metal-binding heat shock protein; Provisional
Probab=32.84 E-value=26 Score=25.95 Aligned_cols=21 Identities=29% Similarity=0.254 Sum_probs=18.1
Q ss_pred chhhhHHhhhhhcCCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGSP 57 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~~ 57 (154)
.+.+++|=+-|+||-.|..+.
T Consensus 113 ~~~l~iHG~LHLlGYDH~~~~ 133 (159)
T PRK00016 113 LAHLTVHGILHLLGYDHIEDE 133 (159)
T ss_pred HHHHHHHhhHHhcCCCCCChH
Confidence 578999999999999997653
No 123
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=32.64 E-value=18 Score=30.65 Aligned_cols=13 Identities=46% Similarity=0.616 Sum_probs=11.2
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
.|.+++||+||.-
T Consensus 124 vAAVl~HEmgHVt 136 (479)
T COG4784 124 VAAVLAHEMGHVT 136 (479)
T ss_pred HHHHHHhhhhhee
Confidence 4789999999975
No 124
>PF14521 Aspzincin_M35: Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=32.56 E-value=23 Score=25.75 Aligned_cols=21 Identities=29% Similarity=0.154 Sum_probs=13.5
Q ss_pred ccchhhhHHhhhh--hcCCCCCC
Q psy14250 35 FSGIIVAAHEVGH--LLGAVHDG 55 (154)
Q Consensus 35 ~~~a~~~AHElGH--~lG~~HD~ 55 (154)
.+-+.||.||+.| .+|-..|.
T Consensus 94 ~Sk~~TLiHE~SHf~~~~~T~D~ 116 (148)
T PF14521_consen 94 DSKEGTLIHEWSHFTAVGGTDDY 116 (148)
T ss_dssp T-HHHHHHHHHHHSCCCT--B-S
T ss_pred CchHHHHHHhhhhhhhhcCCCcc
Confidence 4568999999999 55555554
No 125
>PF01433 Peptidase_M1: Peptidase family M1 This is family M1 in the peptidase classification.; InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=32.34 E-value=21 Score=29.35 Aligned_cols=14 Identities=36% Similarity=0.506 Sum_probs=11.5
Q ss_pred chhhhHHhhhhh-cC
Q psy14250 37 GIIVAAHEVGHL-LG 50 (154)
Q Consensus 37 ~a~~~AHElGH~-lG 50 (154)
.+.++||||+|. ||
T Consensus 295 ~~~~iahElahqWfG 309 (390)
T PF01433_consen 295 IASLIAHELAHQWFG 309 (390)
T ss_dssp HHHHHHHHHHTTTBT
T ss_pred hHHHHHHHHHHHHhc
Confidence 468899999998 44
No 126
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=31.88 E-value=21 Score=33.26 Aligned_cols=13 Identities=31% Similarity=0.383 Sum_probs=10.7
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
.+.++||||+|+-
T Consensus 287 ~~~viaHElAHqW 299 (831)
T TIGR02412 287 RAGVILHEMAHMW 299 (831)
T ss_pred HHHHHHHHHHHHH
Confidence 3579999999964
No 127
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=30.61 E-value=24 Score=25.02 Aligned_cols=13 Identities=31% Similarity=0.230 Sum_probs=10.6
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
--.+++|||.|.-
T Consensus 79 C~~TL~HEL~H~W 91 (141)
T PHA02456 79 CRDTLAHELNHAW 91 (141)
T ss_pred hHHHHHHHHHHHH
Confidence 3579999999964
No 128
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=30.47 E-value=22 Score=26.91 Aligned_cols=21 Identities=38% Similarity=0.556 Sum_probs=15.6
Q ss_pred chhhhHHhhhhhcCCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGSP 57 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~~ 57 (154)
...++.||+||.+-+..-+.+
T Consensus 41 ~~~l~iHElgH~~~A~~~G~~ 61 (183)
T cd06160 41 LAILGIHEMGHYLAARRHGVK 61 (183)
T ss_pred HHHHHHHHHHHHHHHHHCCCC
Confidence 357889999999977655443
No 129
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=30.07 E-value=22 Score=28.60 Aligned_cols=20 Identities=30% Similarity=0.278 Sum_probs=14.7
Q ss_pred chhhhHHhhhhhcCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGS 56 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~ 56 (154)
...++.||+||.+=+..-+.
T Consensus 118 ~isv~iHElgHa~~Ar~~G~ 137 (263)
T cd06159 118 VVGVVVHELSHGILARVEGI 137 (263)
T ss_pred HHHHHHHHHHHHHHHHHcCC
Confidence 35688999999986655443
No 130
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=29.89 E-value=21 Score=30.43 Aligned_cols=15 Identities=40% Similarity=0.893 Sum_probs=12.0
Q ss_pred chhhhHHhhhhhcCC
Q psy14250 37 GIIVAAHEVGHLLGA 51 (154)
Q Consensus 37 ~a~~~AHElGH~lG~ 51 (154)
...++.||+||-|=+
T Consensus 14 ~~~v~~HE~gH~~~a 28 (420)
T TIGR00054 14 AVLIFVHELGHFLAA 28 (420)
T ss_pred HHHHHHHhHHHHHHH
Confidence 468899999998643
No 131
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=27.22 E-value=27 Score=32.75 Aligned_cols=13 Identities=38% Similarity=0.307 Sum_probs=10.3
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
.+.++||||+|+=
T Consensus 307 ~~~viaHElaHqW 319 (859)
T COG0308 307 VEEVIAHELAHQW 319 (859)
T ss_pred HHHHHHHHHhhhc
Confidence 3458999999963
No 132
>KOG2737|consensus
Probab=27.01 E-value=21 Score=30.67 Aligned_cols=21 Identities=38% Similarity=0.681 Sum_probs=17.6
Q ss_pred chhhhHHhhhhhcCCC-CCCCC
Q psy14250 37 GIIVAAHEVGHLLGAV-HDGSP 57 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~-HD~~~ 57 (154)
++.-+-|-|||.||++ ||..+
T Consensus 359 ~~vF~PHGLGH~lGlDvHDvGG 380 (492)
T KOG2737|consen 359 GAVFMPHGLGHFLGLDVHDVGG 380 (492)
T ss_pred CeeeccccccccccccccccCC
Confidence 4677899999999998 88655
No 133
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=26.75 E-value=30 Score=24.53 Aligned_cols=12 Identities=33% Similarity=0.487 Sum_probs=10.1
Q ss_pred chhhhHHhhhhh
Q psy14250 37 GIIVAAHEVGHL 48 (154)
Q Consensus 37 ~a~~~AHElGH~ 48 (154)
-..+++|||.|.
T Consensus 80 kvKviiHEllHI 91 (133)
T COG4900 80 KVKVIIHELLHI 91 (133)
T ss_pred HHHHHHHHHhcC
Confidence 468999999994
No 134
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=26.52 E-value=27 Score=30.03 Aligned_cols=15 Identities=40% Similarity=0.866 Sum_probs=11.6
Q ss_pred chhhhHHhhhhhcCC
Q psy14250 37 GIIVAAHEVGHLLGA 51 (154)
Q Consensus 37 ~a~~~AHElGH~lG~ 51 (154)
+..++-||+||-+=+
T Consensus 15 ~~li~vHElGHfl~A 29 (449)
T PRK10779 15 GVLITVHEFGHFWVA 29 (449)
T ss_pred HHHHHHHHHHHHHHH
Confidence 467889999997533
No 135
>PRK09672 phage exclusion protein Lit; Provisional
Probab=26.15 E-value=30 Score=28.52 Aligned_cols=19 Identities=21% Similarity=0.284 Sum_probs=14.8
Q ss_pred cchhhhHHhhhhhcCCCCCC
Q psy14250 36 SGIIVAAHEVGHLLGAVHDG 55 (154)
Q Consensus 36 ~~a~~~AHElGH~lG~~HD~ 55 (154)
+.|.++.||++|+- ..|+.
T Consensus 164 A~a~i~~HEiaHv~-~~h~~ 182 (305)
T PRK09672 164 ALAWILLHEIAHVE-FQHSS 182 (305)
T ss_pred HHHHHHHHHHHHHH-hcccc
Confidence 57899999999976 44554
No 136
>PLN02791 Nudix hydrolase homolog
Probab=25.67 E-value=31 Score=32.09 Aligned_cols=16 Identities=25% Similarity=0.283 Sum_probs=14.1
Q ss_pred hhhhHHhhhhhcCCCC
Q psy14250 38 IIVAAHEVGHLLGAVH 53 (154)
Q Consensus 38 a~~~AHElGH~lG~~H 53 (154)
..|+.|||||-+|-.+
T Consensus 586 ~~v~lHElgHGsG~~~ 601 (770)
T PLN02791 586 THTICHECCHGIGPHT 601 (770)
T ss_pred HHHHHHHhhccccccc
Confidence 6899999999999855
No 137
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=25.36 E-value=41 Score=20.78 Aligned_cols=21 Identities=10% Similarity=0.045 Sum_probs=17.4
Q ss_pred chhhhHHhhhhhcCCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGSP 57 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~~ 57 (154)
.-..+-|+|++.+|+.+...+
T Consensus 29 ~eR~~vH~~a~~~gL~s~S~G 49 (60)
T cd02640 29 EERALIHQIAQKYGLKSRSYG 49 (60)
T ss_pred HHHHHHHHHHHHcCCceeeEe
Confidence 347789999999999988655
No 138
>PRK14015 pepN aminopeptidase N; Provisional
Probab=25.28 E-value=32 Score=32.41 Aligned_cols=13 Identities=38% Similarity=0.043 Sum_probs=10.9
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
...++||||+|+-
T Consensus 296 i~~vIaHElaHqW 308 (875)
T PRK14015 296 IESVIAHEYFHNW 308 (875)
T ss_pred HHHHHHHHHHHHH
Confidence 4579999999975
No 139
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=23.95 E-value=33 Score=27.93 Aligned_cols=18 Identities=39% Similarity=0.536 Sum_probs=13.5
Q ss_pred hhhhHHhhhhhcCCCCCC
Q psy14250 38 IIVAAHEVGHLLGAVHDG 55 (154)
Q Consensus 38 a~~~AHElGH~lG~~HD~ 55 (154)
..++.||+||.+=+...+
T Consensus 136 isvvvHElgHal~A~~~g 153 (277)
T cd06162 136 ISGVVHEMGHGVAAVREQ 153 (277)
T ss_pred HHHHHHHHHHHHHHHHcC
Confidence 467899999988665443
No 140
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=22.82 E-value=39 Score=31.87 Aligned_cols=13 Identities=38% Similarity=0.043 Sum_probs=10.9
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
...++|||++|+-
T Consensus 283 i~~VIaHElaHqW 295 (863)
T TIGR02414 283 IESVIAHEYFHNW 295 (863)
T ss_pred HHHHHHHHHHHHH
Confidence 4579999999975
No 141
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=22.38 E-value=37 Score=31.07 Aligned_cols=13 Identities=46% Similarity=0.736 Sum_probs=11.6
Q ss_pred chhhhHHhhhhhc
Q psy14250 37 GIIVAAHEVGHLL 49 (154)
Q Consensus 37 ~a~~~AHElGH~l 49 (154)
-+.|+.||.||.|
T Consensus 464 eV~TlFHEfGHal 476 (681)
T PRK10280 464 DVITLFHEFGHTL 476 (681)
T ss_pred HHHHHHHHHHHHH
Confidence 5789999999987
No 142
>cd06007 R3H_DEXH_helicase R3H domain of a group of proteins which also contain a DEXH-box helicase domain, and may function as ATP-dependent DNA or RNA helicases. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=21.44 E-value=54 Score=20.21 Aligned_cols=21 Identities=14% Similarity=0.074 Sum_probs=17.5
Q ss_pred chhhhHHhhhhhcCCCCCCCC
Q psy14250 37 GIIVAAHEVGHLLGAVHDGSP 57 (154)
Q Consensus 37 ~a~~~AHElGH~lG~~HD~~~ 57 (154)
.-..+-|++++.+|+.|...+
T Consensus 28 ~eR~~vH~~a~~~gL~s~S~G 48 (59)
T cd06007 28 HERAVIHRLCRKLGLKSKSKG 48 (59)
T ss_pred HHHHHHHHHHHHcCCCceeec
Confidence 357889999999999988655
No 143
>PF05569 Peptidase_M56: BlaR1 peptidase M56; InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription.
Probab=21.18 E-value=45 Score=26.75 Aligned_cols=12 Identities=25% Similarity=0.448 Sum_probs=10.3
Q ss_pred hhhhHHhhhhhc
Q psy14250 38 IIVAAHEVGHLL 49 (154)
Q Consensus 38 a~~~AHElGH~l 49 (154)
-.+++||+.|.-
T Consensus 197 ~~il~HEl~Hik 208 (299)
T PF05569_consen 197 RAILLHELAHIK 208 (299)
T ss_pred HHHHHHHHHHHH
Confidence 479999999965
No 144
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=20.96 E-value=41 Score=29.44 Aligned_cols=14 Identities=29% Similarity=0.584 Sum_probs=11.8
Q ss_pred chhhhHHhhhhhcC
Q psy14250 37 GIIVAAHEVGHLLG 50 (154)
Q Consensus 37 ~a~~~AHElGH~lG 50 (154)
-..++.||+||.+-
T Consensus 247 D~~t~~HE~GH~~y 260 (477)
T cd06461 247 DFVTVHHEMGHIQY 260 (477)
T ss_pred HHHHHHHHHHHHHH
Confidence 35789999999994
No 145
>COG4219 MecR1 Antirepressor regulating drug resistance, predicted signal transduction N-terminal membrane component [Transcription / Signal transduction mechanisms]
Probab=20.35 E-value=45 Score=27.85 Aligned_cols=12 Identities=33% Similarity=0.437 Sum_probs=10.5
Q ss_pred chhhhHHhhhhh
Q psy14250 37 GIIVAAHEVGHL 48 (154)
Q Consensus 37 ~a~~~AHElGH~ 48 (154)
...+++||++|.
T Consensus 190 ~~yIilHEl~Hl 201 (337)
T COG4219 190 LKYIILHELSHL 201 (337)
T ss_pred hhhhHhHHHhhh
Confidence 468999999995
No 146
>PF07607 DUF1570: Protein of unknown function (DUF1570); InterPro: IPR011464 This entry represents hypothetical proteins confined to bacteria.
Probab=20.07 E-value=46 Score=23.87 Aligned_cols=22 Identities=18% Similarity=0.067 Sum_probs=14.5
Q ss_pred hhhhHHhhhhhcCCCCCCCCCC
Q psy14250 38 IIVAAHEVGHLLGAVHDGSPPP 59 (154)
Q Consensus 38 a~~~AHElGH~lG~~HD~~~~~ 59 (154)
+.-.+|.|.+|.|+.......+
T Consensus 6 ~HEa~HQl~~N~Gl~~r~~~~P 27 (128)
T PF07607_consen 6 AHEATHQLAFNTGLHPRLADWP 27 (128)
T ss_pred HHHHHHHHHHHccccccCCCCc
Confidence 3445666777889887766543
Done!