Query 043639
Match_columns 261
No_of_seqs 254 out of 1490
Neff 7.0
Searched_HMMs 46136
Date Fri Mar 29 06:52:54 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/043639.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/043639hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1565 Gelatinase A and relat 100.0 3.2E-48 6.9E-53 370.5 14.3 187 12-230 79-268 (469)
2 cd04278 ZnMc_MMP Zinc-dependen 100.0 3.5E-38 7.7E-43 262.1 15.8 155 60-223 1-157 (157)
3 PF00413 Peptidase_M10: Matrix 100.0 8.7E-37 1.9E-41 251.3 15.7 151 60-223 1-154 (154)
4 cd04279 ZnMc_MMP_like_1 Zinc-d 100.0 2.9E-29 6.2E-34 208.3 14.0 150 67-223 2-156 (156)
5 smart00235 ZnMc Zinc-dependent 99.9 1E-24 2.2E-29 177.6 10.1 136 58-224 2-140 (140)
6 cd04277 ZnMc_serralysin_like Z 99.9 4.8E-24 1E-28 181.7 13.5 124 85-223 33-186 (186)
7 cd04268 ZnMc_MMP_like Zinc-dep 99.9 2.2E-24 4.8E-29 179.4 9.1 142 67-222 2-165 (165)
8 cd04327 ZnMc_MMP_like_3 Zinc-d 99.7 1.5E-17 3.3E-22 143.4 12.0 109 60-191 2-110 (198)
9 cd00203 ZnMc Zinc-dependent me 99.7 3.1E-17 6.8E-22 136.3 11.6 120 84-222 20-167 (167)
10 cd04276 ZnMc_MMP_like_2 Zinc-d 99.6 7E-15 1.5E-19 126.8 11.9 146 68-224 9-197 (197)
11 cd04280 ZnMc_astacin_like Zinc 99.6 2.4E-14 5.2E-19 121.9 12.1 91 68-192 3-93 (180)
12 PF01400 Astacin: Astacin (Pep 99.5 2.8E-14 6.2E-19 122.5 9.6 96 60-191 2-97 (191)
13 cd04281 ZnMc_BMP1_TLD Zinc-dep 99.5 1.1E-13 2.3E-18 119.7 13.1 99 58-191 7-105 (200)
14 PF12388 Peptidase_M57: Dual-a 99.5 1.3E-13 2.9E-18 119.1 12.4 136 66-222 38-211 (211)
15 cd04282 ZnMc_meprin Zinc-depen 99.4 1.4E-12 3E-17 114.9 12.8 103 52-192 37-139 (230)
16 cd04283 ZnMc_hatching_enzyme Z 99.4 3.3E-12 7.1E-17 108.9 11.8 92 67-191 4-95 (182)
17 KOG3714 Meprin A metalloprotea 99.4 5.8E-12 1.3E-16 119.9 11.7 107 50-191 71-177 (411)
18 PF02031 Peptidase_M7: Strepto 99.2 1.1E-10 2.5E-15 92.7 9.3 125 67-223 2-130 (132)
19 COG5549 Predicted Zn-dependent 98.9 3.8E-09 8.2E-14 90.5 8.8 148 57-225 77-235 (236)
20 PF11350 DUF3152: Protein of u 98.3 4.4E-06 9.5E-11 71.8 10.0 142 67-218 29-183 (203)
21 cd04267 ZnMc_ADAM_like Zinc-de 98.0 5.5E-05 1.2E-09 64.4 9.9 106 91-220 74-189 (192)
22 cd04273 ZnMc_ADAMTS_like Zinc- 98.0 4.2E-05 9E-10 66.3 8.8 120 91-224 72-200 (207)
23 PF13583 Reprolysin_4: Metallo 97.9 1.1E-05 2.4E-10 70.1 4.2 48 173-222 137-198 (206)
24 PF05572 Peptidase_M43: Pregna 97.7 2.6E-05 5.7E-10 64.9 2.9 25 170-194 66-90 (154)
25 PF13574 Reprolysin_2: Metallo 97.5 2.6E-05 5.6E-10 65.9 0.8 48 173-222 111-171 (173)
26 cd04275 ZnMc_pappalysin_like Z 97.4 5.4E-05 1.2E-09 66.8 1.6 26 170-195 134-159 (225)
27 COG1913 Predicted Zn-dependent 97.4 7.2E-05 1.6E-09 62.9 1.6 27 174-200 125-151 (181)
28 PF13688 Reprolysin_5: Metallo 97.3 7E-05 1.5E-09 63.8 0.9 22 172-193 141-162 (196)
29 PF13582 Reprolysin_3: Metallo 97.2 9.8E-05 2.1E-09 58.2 0.7 17 174-190 108-124 (124)
30 cd04271 ZnMc_ADAM_fungal Zinc- 97.2 0.0004 8.7E-09 61.3 4.1 49 175-225 147-220 (228)
31 PRK13267 archaemetzincin-like 97.1 0.00021 4.6E-09 60.9 1.9 26 174-199 126-151 (179)
32 cd04269 ZnMc_adamalysin_II_lik 97.1 0.00063 1.4E-08 58.0 4.2 50 173-224 131-186 (194)
33 PF07998 Peptidase_M54: Peptid 96.9 0.00039 8.5E-09 59.9 1.5 27 173-200 145-171 (194)
34 PF01421 Reprolysin: Reprolysi 96.5 0.0039 8.4E-08 53.4 4.5 52 171-223 129-188 (199)
35 cd04270 ZnMc_TACE_like Zinc-de 96.3 0.0045 9.9E-08 55.1 4.3 50 174-223 168-230 (244)
36 cd04272 ZnMc_salivary_gland_MP 96.0 0.007 1.5E-07 52.8 3.7 50 173-223 145-212 (220)
37 KOG4525 Jacalin-like lectin do 95.5 0.013 2.8E-07 55.9 3.5 58 174-233 302-381 (614)
38 PF11150 DUF2927: Protein of u 95.0 0.69 1.5E-05 40.5 12.8 150 59-225 28-196 (213)
39 PF04298 Zn_peptidase_2: Putat 94.6 0.027 5.9E-07 49.5 2.9 81 88-192 18-106 (222)
40 PF10462 Peptidase_M66: Peptid 94.6 0.015 3.2E-07 53.6 1.2 20 171-190 191-210 (305)
41 PF05548 Peptidase_M11: Gameto 94.5 0.02 4.3E-07 53.0 1.8 19 173-191 150-168 (314)
42 PF12044 Metallopep: Putative 93.9 0.029 6.2E-07 53.9 1.6 23 174-198 317-339 (423)
43 TIGR03296 M6dom_TIGR03296 M6 f 93.5 0.023 5E-07 51.8 0.2 19 174-192 166-184 (286)
44 COG2738 Predicted Zn-dependent 93.3 0.063 1.4E-06 46.2 2.5 78 87-186 20-105 (226)
45 PTZ00337 surface protease GP63 92.6 0.2 4.3E-06 50.0 5.4 36 145-190 211-246 (567)
46 PTZ00257 Glycoprotein GP63 (le 92.5 0.3 6.5E-06 49.0 6.4 96 114-225 205-313 (622)
47 COG3824 Predicted Zn-dependent 91.4 0.077 1.7E-06 42.2 0.8 20 170-189 106-125 (136)
48 PF01457 Peptidase_M8: Leishma 91.4 0.11 2.4E-06 51.3 2.0 97 114-226 159-268 (521)
49 PF05547 Peptidase_M6: Immune 88.8 0.11 2.3E-06 52.6 -0.5 51 173-223 221-286 (645)
50 KOG3607 Meltrins, fertilins an 88.0 0.46 1E-05 48.7 3.4 46 173-219 323-375 (716)
51 PF09471 Peptidase_M64: IgA Pe 86.6 0.27 5.8E-06 44.5 0.7 23 172-194 215-238 (264)
52 PF06262 DUF1025: Possibl zinc 85.8 0.31 6.8E-06 37.4 0.7 16 173-188 73-88 (97)
53 KOG3658 Tumor necrosis factor- 84.1 0.76 1.6E-05 46.3 2.6 48 174-221 393-452 (764)
54 COG4783 Putative Zn-dependent 82.8 0.45 9.8E-06 46.2 0.5 20 170-190 127-146 (484)
55 PF01863 DUF45: Protein of unk 78.7 7 0.00015 33.1 6.4 73 88-192 111-183 (205)
56 PF06114 DUF955: Domain of unk 78.1 0.81 1.8E-05 34.6 0.4 14 172-185 41-54 (122)
57 PF13398 Peptidase_M50B: Pepti 77.6 0.83 1.8E-05 39.4 0.4 14 173-186 22-35 (200)
58 PF04228 Zn_peptidase: Putativ 77.1 1.3 2.7E-05 40.7 1.4 40 143-184 141-181 (292)
59 COG2856 Predicted Zn peptidase 77.1 0.9 2E-05 39.8 0.4 18 173-192 72-89 (213)
60 PF12725 DUF3810: Protein of u 76.7 1.4 3E-05 40.9 1.6 23 170-192 193-215 (318)
61 COG5549 Predicted Zn-dependent 75.0 1.6 3.4E-05 38.2 1.4 29 174-202 154-182 (236)
62 PF14247 DUF4344: Domain of un 74.8 1 2.2E-05 39.6 0.2 14 172-185 91-104 (220)
63 KOG3538 Disintegrin metallopro 74.2 2.6 5.7E-05 44.1 3.1 48 171-218 315-368 (845)
64 cd06456 M3A_DCP_Oligopeptidase 74.1 15 0.00033 35.3 8.0 24 88-111 107-130 (422)
65 cd06457 M3A_MIP Peptidase M3 m 71.5 18 0.00039 35.1 7.9 25 87-111 132-156 (458)
66 PF01435 Peptidase_M48: Peptid 71.0 1.6 3.4E-05 37.3 0.5 21 170-191 86-106 (226)
67 PF13699 DUF4157: Domain of un 70.9 3.4 7.4E-05 30.3 2.2 30 147-185 44-73 (79)
68 PF02163 Peptidase_M50: Peptid 70.8 1.5 3.3E-05 36.9 0.3 20 174-193 8-27 (192)
69 cd06163 S2P-M50_PDZ_RseP-like 70.3 1.7 3.7E-05 37.1 0.5 12 174-185 10-21 (182)
70 cd05709 S2P-M50 Site-2 proteas 69.9 1.7 3.6E-05 36.4 0.4 16 174-189 9-24 (180)
71 PRK02870 heat shock protein Ht 67.8 3 6.6E-05 39.0 1.6 15 170-184 170-184 (336)
72 PRK03982 heat shock protein Ht 67.8 1.9 4.2E-05 39.2 0.3 16 170-185 122-137 (288)
73 COG0501 HtpX Zn-dependent prot 67.0 2.2 4.7E-05 38.3 0.5 16 170-185 154-169 (302)
74 cd06161 S2P-M50_SpoIVFB SpoIVF 66.2 2.2 4.8E-05 36.9 0.4 16 173-188 38-53 (208)
75 PRK03001 M48 family peptidase; 65.8 2.2 4.8E-05 38.7 0.3 16 170-185 121-136 (283)
76 PRK01345 heat shock protein Ht 64.9 2.4 5.1E-05 39.3 0.3 15 171-185 122-136 (317)
77 PRK03072 heat shock protein Ht 64.9 2.4 5.1E-05 38.7 0.3 16 170-185 124-139 (288)
78 cd06459 M3B_Oligoendopeptidase 63.8 2.6 5.7E-05 39.8 0.4 19 81-99 153-171 (427)
79 cd06258 Peptidase_M3_like The 62.6 2.8 6.1E-05 39.0 0.4 14 172-185 153-166 (365)
80 PF01432 Peptidase_M3: Peptida 62.2 2.9 6.3E-05 40.2 0.4 25 87-111 136-160 (458)
81 cd06164 S2P-M50_SpoIVFB_CBS Sp 61.6 3 6.6E-05 36.7 0.4 14 174-187 54-67 (227)
82 PRK04897 heat shock protein Ht 61.5 3 6.5E-05 38.2 0.3 16 170-185 134-149 (298)
83 COG4227 Antirestriction protei 60.4 4.9 0.00011 36.4 1.4 22 170-191 200-221 (316)
84 PRK05457 heat shock protein Ht 60.4 3.2 6.9E-05 37.8 0.3 15 170-184 131-145 (284)
85 TIGR02289 M3_not_pepF oligoend 60.1 3.3 7.2E-05 41.1 0.4 13 173-185 337-349 (549)
86 cd06455 M3A_TOP Peptidase M3 T 59.9 3.4 7.3E-05 40.2 0.4 12 174-185 264-275 (472)
87 PRK02391 heat shock protein Ht 59.5 3.4 7.3E-05 37.9 0.3 15 170-184 130-144 (296)
88 PRK01265 heat shock protein Ht 59.1 3.5 7.5E-05 38.4 0.3 15 170-184 137-151 (324)
89 cd06162 S2P-M50_PDZ_SREBP Ster 58.7 3.6 7.9E-05 37.5 0.4 13 174-186 136-148 (277)
90 cd06159 S2P-M50_PDZ_Arch Uncha 58.4 3.9 8.4E-05 37.0 0.5 17 174-190 119-135 (263)
91 TIGR00181 pepF oligoendopeptid 58.1 3.7 8.1E-05 41.0 0.4 13 173-185 378-390 (591)
92 TIGR02290 M3_fam_3 oligoendope 58.1 3.8 8.2E-05 41.0 0.4 12 174-185 376-387 (587)
93 PF13485 Peptidase_MA_2: Pepti 57.3 4 8.7E-05 31.0 0.4 14 172-185 24-37 (128)
94 PF01447 Peptidase_M4: Thermol 56.8 4.2 9.1E-05 33.6 0.4 12 174-185 136-147 (150)
95 cd06160 S2P-M50_like_2 Unchara 53.7 4.9 0.00011 34.3 0.4 20 174-193 42-61 (183)
96 KOG2921 Intramembrane metallop 53.5 5 0.00011 38.3 0.4 17 174-190 132-148 (484)
97 PF08434 CLCA_N: Calcium-activ 52.1 12 0.00025 33.9 2.5 15 174-188 150-165 (262)
98 COG1164 Oligoendopeptidase F [ 47.9 6.8 0.00015 39.5 0.4 12 174-185 381-392 (598)
99 PF03272 Enhancin: Viral enhan 47.9 27 0.00059 36.4 4.7 49 59-110 128-191 (775)
100 PLN02791 Nudix hydrolase homol 47.5 7.6 0.00017 40.3 0.7 20 173-192 585-604 (770)
101 cd06158 S2P-M50_like_1 Unchara 47.3 7.2 0.00016 33.1 0.4 12 174-185 10-21 (181)
102 cd06460 M32_Taq Peptidase fami 46.9 7.4 0.00016 37.3 0.4 12 174-185 160-171 (396)
103 TIGR00054 RIP metalloprotease 46.7 7.5 0.00016 37.3 0.4 12 174-185 15-26 (420)
104 PRK10779 zinc metallopeptidase 45.6 7.9 0.00017 37.5 0.4 11 174-184 16-26 (449)
105 COG0339 Dcp Zn-dependent oligo 44.3 68 0.0015 32.9 6.7 33 88-120 364-398 (683)
106 KOG2719 Metalloprotease [Gener 43.1 9.1 0.0002 36.9 0.4 15 170-184 277-291 (428)
107 PF01434 Peptidase_M41: Peptid 43.1 8.4 0.00018 33.4 0.1 16 170-185 25-40 (213)
108 PRK10911 oligopeptidase A; Pro 41.8 9.7 0.00021 39.0 0.4 22 89-110 361-382 (680)
109 KOG2089 Metalloendopeptidase f 39.8 79 0.0017 32.4 6.3 32 89-120 392-425 (718)
110 COG3590 PepO Predicted metallo 37.4 20 0.00044 35.9 1.8 12 174-185 488-499 (654)
111 cd02641 R3H_Smubp-2_like R3H d 36.2 15 0.00034 25.4 0.6 20 173-192 29-48 (60)
112 PF03571 Peptidase_M49: Peptid 35.8 14 0.0003 36.9 0.4 14 174-187 294-308 (549)
113 PF06167 Peptidase_M90: Glucos 34.6 17 0.00037 32.7 0.7 20 170-189 151-170 (253)
114 KOG2661 Peptidase family M48 [ 31.4 18 0.0004 34.0 0.4 20 172-192 274-293 (424)
115 PRK10280 dipeptidyl carboxypep 30.8 19 0.00041 37.0 0.4 34 87-120 362-398 (681)
116 COG2321 Predicted metalloprote 29.1 22 0.00047 32.5 0.5 41 143-185 138-179 (295)
117 cd02639 R3H_RRM R3H domain of 28.6 26 0.00055 24.4 0.7 21 174-194 30-50 (60)
118 PHA02150 hypothetical protein 27.8 34 0.00074 24.3 1.2 33 70-102 2-34 (77)
119 PF14891 Peptidase_M91: Effect 25.7 29 0.00063 29.0 0.6 15 173-187 103-117 (174)
120 PF02102 Peptidase_M35: Deuter 25.6 30 0.00066 32.7 0.8 50 136-187 262-311 (359)
121 PF12315 DUF3633: Protein of u 25.2 30 0.00065 30.3 0.6 12 174-185 94-105 (212)
122 COG3091 SprT Zn-dependent meta 24.4 17 0.00037 30.2 -1.0 39 138-185 35-73 (156)
123 PF01431 Peptidase_M13: Peptid 24.0 32 0.0007 29.1 0.6 15 174-188 37-51 (206)
124 PF02128 Peptidase_M36: Fungal 23.4 47 0.001 31.7 1.6 15 174-188 186-200 (378)
125 PRK10733 hflB ATP-dependent me 23.4 29 0.00064 35.3 0.3 13 173-185 408-420 (644)
126 cd06461 M2_ACE Peptidase famil 23.0 34 0.00074 33.6 0.6 14 174-187 248-261 (477)
127 PRK15410 DgsA anti-repressor M 22.9 36 0.00077 30.8 0.7 20 172-191 141-160 (260)
128 PF13402 M60-like: Peptidase M 22.4 33 0.00071 31.0 0.3 11 175-185 221-231 (307)
129 COG2317 Zn-dependent carboxype 22.0 34 0.00073 33.6 0.3 11 175-185 261-271 (497)
130 CHL00176 ftsH cell division pr 21.9 34 0.00074 34.9 0.4 14 172-185 437-450 (638)
131 COG4784 Putative Zn-dependent 21.3 33 0.00072 32.5 0.1 16 170-185 121-136 (479)
132 smart00731 SprT SprT homologue 21.2 34 0.00075 27.7 0.2 15 171-185 57-71 (146)
133 cd02640 R3H_NRF R3H domain of 20.8 48 0.001 23.0 0.8 19 173-191 29-47 (60)
134 PF02074 Peptidase_M32: Carbox 20.1 38 0.00083 33.4 0.3 16 174-189 260-278 (494)
No 1
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=100.00 E-value=3.2e-48 Score=370.50 Aligned_cols=187 Identities=43% Similarity=0.787 Sum_probs=172.2
Q ss_pred HHHHHHhcCCCCCCCCCCCcccccCCCCCChhhhhccccceeeCCCCCcccCCCCeeEEEEEecCCcCCcCCHHHHHHHH
Q 043639 12 IFIIFIHESPQCIAVRSTPKFKLKQSDKYDPAEATRHKKDYVLFPGKPRWTRHVPMTLSYAFSKENMINYLHITDIKAVF 91 (261)
Q Consensus 12 ~~~~~~m~~PRCGvpD~~~~~~~~~~~~~~~~~~~~~~kry~~~~~~~kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i 91 (261)
-.|+..|++|||||||. | |++++++.||.+. +|||+|. |++++++..+++.++
T Consensus 79 ~~Tl~~m~~prCgvpd~-------------------~---~~~~~~~~kW~k~---~lT~ri~--n~~~~l~~~~vr~~~ 131 (469)
T KOG1565|consen 79 NATLALMNKPRCGVPDG-------------------R---YRYFPGKPKWNKE---HLTYRIK--NYTPYLPQAEVRCAK 131 (469)
T ss_pred hhhhhhccCCCcCCCCC-------------------c---cccCcccCccccc---ccceecc--ccCCCCCHHHHHHHH
Confidence 46889999999999991 2 6778899999999 9999999 678899999999999
Q ss_pred HHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCCC---CceeeecccccccccCCCCCCC
Q 043639 92 KRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPE---SGKFHLDAAETWAVDFGSEKSP 168 (261)
Q Consensus 92 ~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~---~G~vhfd~~e~wt~~~~~~~~~ 168 (261)
++||+.|+++++|+|+|+.....|||+|.|..+.|||++||||++|+|||||+|+ .|++|||++|.|+++. .
T Consensus 132 ~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~~h~d~~PFDG~~g~laHAf~Pg~~~~G~~hfD~dE~Wt~~~-----~ 206 (469)
T KOG1565|consen 132 SEAFKLWSDVTPLTFQEVKEEGEADIRISFFPGDHGDGFPFDGPGGVLAHAFFPGPGIGGDLHFDKDETWTYGD-----S 206 (469)
T ss_pred HHHHhhcccCCCCccccCCCCCCCceeeeeeccCCCCCCcccCCCCceecccCCCCCCCCccccCcccceeccC-----C
Confidence 9999999999999999999778999999999999999999999999999999995 6899999999999862 2
Q ss_pred ccccchhhhhhhhhhhcCCCCCCCCCCccccccCCCCCCCCCCHHHHHHHHHHhCCCCCCCC
Q 043639 169 VAVDLESVAVHEIGHLLGLTHSPVKEAVMYPTLKPRDKKVDLALDDIQGVQALYGSNPNFTI 230 (261)
Q Consensus 169 ~~~~l~~Va~HEiGHaLGL~Hs~~~~svM~p~y~~~~~~~~l~~dDI~~iq~LYG~~~~~~~ 230 (261)
.+++|+.||+|||||+|||.||.++++||||+|++...++.|++|||.+||.|||.+++...
T Consensus 207 ~g~~l~~Va~HEiGH~LGL~HS~~~~aiM~P~y~~~~~~~~L~~DDv~giq~lYG~~~~~~~ 268 (469)
T KOG1565|consen 207 NGVDLFLVAAHEIGHALGLGHSSDPDAIMYPFYQPDSGNFDLSQDDVRGIQHLYGGPPRDYF 268 (469)
T ss_pred ccchhHHHhhhhcccccccCCCCCcccccccccccCCCCcccChhhhhhhHHHhCCCccccc
Confidence 48999999999999999999999999999999997667889999999999999999887543
No 2
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=100.00 E-value=3.5e-38 Score=262.12 Aligned_cols=155 Identities=44% Similarity=0.761 Sum_probs=138.0
Q ss_pred cccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCce
Q 043639 60 RWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVL 139 (261)
Q Consensus 60 kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~l 139 (261)
||+++ +|||+|.. ..+.++...++++|++||+.|+++++|+|+|+.+...|||+|.|....++++.+|+|.+|++
T Consensus 1 kW~~~---~itY~i~~--~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~g~l 75 (157)
T cd04278 1 KWSKT---NLTYRILN--YPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPGGTL 75 (157)
T ss_pred CCCCC---ceeEEEEC--CCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCcccc
Confidence 79999 99999974 34446678999999999999999999999999875589999999998888889999999999
Q ss_pred eeecCC--CCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCCCCCCccccccCCCCCCCCCCHHHHHH
Q 043639 140 AHSFSP--ESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPVKEAVMYPTLKPRDKKVDLALDDIQG 217 (261)
Q Consensus 140 AhA~~P--~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~svM~p~y~~~~~~~~l~~dDI~~ 217 (261)
+||++| ..|+++||.++.|..... ..+.+++.|++|||||+|||.|+.++++||||+|.+......|+++||.+
T Consensus 76 ~~a~~p~~~~g~i~~~~~~~~~~~~~----~~~~~~~~~~~HEiGHaLGL~H~~~~~~vM~~~~~~~~~~~~l~~~D~~~ 151 (157)
T cd04278 76 AHAFFPGGIGGDIHFDDDEQWTLGSD----SGGTDLFSVAAHEIGHALGLGHSSDPDSIMYPYYQGPVPKFKLSQDDIRG 151 (157)
T ss_pred ccccCCCCcceeEEECCCcccccCCC----CccchHHHHHHHHhccccccCCCCCCcCeecccccCCCcCCCCCHHHHHH
Confidence 999999 679999999999987541 23678999999999999999999999999999998643467899999999
Q ss_pred HHHHhC
Q 043639 218 VQALYG 223 (261)
Q Consensus 218 iq~LYG 223 (261)
||+|||
T Consensus 152 ~~~lYg 157 (157)
T cd04278 152 IQALYG 157 (157)
T ss_pred HHHhcC
Confidence 999998
No 3
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=100.00 E-value=8.7e-37 Score=251.27 Aligned_cols=151 Identities=44% Similarity=0.810 Sum_probs=134.9
Q ss_pred cccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCce
Q 043639 60 RWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVL 139 (261)
Q Consensus 60 kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~l 139 (261)
||+++ +|||+|... .+.++.+++++++++||+.|+++++|+|+|+.+ ..+||+|.|....+++...|++.++++
T Consensus 1 ~W~~~---~ity~i~~~--~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~-~~adi~i~~~~~~~~~~~~~~~~~~~~ 74 (154)
T PF00413_consen 1 KWPKK---TITYSISNT--TPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSD-GNADIRISFGSNNHGDGYSFDGSGGTL 74 (154)
T ss_dssp SSSSS---EEEEEESSB--CTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESS-SSCSEEEEEESSSSSSSS-CSSSSSES
T ss_pred CCCCC---cEEEEEECC--CCCCCHHHHHHHHHHHHHHHHhcCCceEEeccC-CCcceeeeeeccccCcccccccceeee
Confidence 79999 999999854 466778899999999999999999999999985 579999999998888888999999999
Q ss_pred eeecCCC---CceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCCCCCCccccccCCCCCCCCCCHHHHH
Q 043639 140 AHSFSPE---SGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPVKEAVMYPTLKPRDKKVDLALDDIQ 216 (261)
Q Consensus 140 AhA~~P~---~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~svM~p~y~~~~~~~~l~~dDI~ 216 (261)
++++.|. .|+++|+.++.|.... ...+++.|++|||||+|||.|++++++||||.+.+.. ...|+.+||+
T Consensus 75 ~~~~~~~~~~~~~i~~~~~~~~~~~~------~~~~~~~v~~HEiGHaLGL~H~~~~~svM~~~~~~~~-~~~l~~~Di~ 147 (154)
T PF00413_consen 75 AHAYFPNNIVSGDIHFNDDESWTIDD------SGNDLQSVAIHEIGHALGLDHSNDPNSVMYPYYRGPD-NKTLSEDDID 147 (154)
T ss_dssp EEEEESSSTTTTEEEEETTSHEESSS------SSEEHHHHHHHHHHHHTTBESSSSTTSTTSSSCTSSS-STSTTHHHHH
T ss_pred eccccccccccccccccccccchhhh------hhhhhhhhhhhccccccCcCcCCCcccceeeecccCC-CCCCCHHHHH
Confidence 9999997 6999999999998652 3678899999999999999999999999999998654 2489999999
Q ss_pred HHHHHhC
Q 043639 217 GVQALYG 223 (261)
Q Consensus 217 ~iq~LYG 223 (261)
+||+|||
T Consensus 148 ~i~~lYg 154 (154)
T PF00413_consen 148 GIQYLYG 154 (154)
T ss_dssp HHHHHHS
T ss_pred HHHHHhC
Confidence 9999998
No 4
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.96 E-value=2.9e-29 Score=208.26 Aligned_cols=150 Identities=25% Similarity=0.307 Sum_probs=114.8
Q ss_pred eeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCC-CCcceEEEeeecCCCCCCCccCCCCceeeecCC
Q 043639 67 MTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDY-DFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSP 145 (261)
Q Consensus 67 ~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~-~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P 145 (261)
..|+|++...+..++-....++++|++||+.|+++++|+|+++... ..+||+|.|.... +|+|.|+++|||++|
T Consensus 2 ~~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~-----~~~~~g~~~a~a~~p 76 (156)
T cd04279 2 SPIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPP-----PVGGAGGGLARAGFP 76 (156)
T ss_pred CCeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCC-----CCCCCCCceEEeccc
Confidence 3799999865332222356799999999999999999999998753 3899999998632 788999999999999
Q ss_pred CCceeeecccccccccCCCCC--CCccccchhhhhhhhhhhcCCCCCCCCC-CccccccCCCC-CCCCCCHHHHHHHHHH
Q 043639 146 ESGKFHLDAAETWAVDFGSEK--SPVAVDLESVAVHEIGHLLGLTHSPVKE-AVMYPTLKPRD-KKVDLALDDIQGVQAL 221 (261)
Q Consensus 146 ~~G~vhfd~~e~wt~~~~~~~--~~~~~~l~~Va~HEiGHaLGL~Hs~~~~-svM~p~y~~~~-~~~~l~~dDI~~iq~L 221 (261)
..|. ++....|........ ...+.+++.|++|||||+|||.|++++. ++|||++.... .+..|+++|+++||.|
T Consensus 77 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~siM~p~~~~~~~~~~~l~~~D~~~i~~l 154 (156)
T cd04279 77 LISD--GNRKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSDRPEDAMYPSQGQGPDGNPTLSARDVATLKRL 154 (156)
T ss_pred ccCC--CcccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCCCccceeeeecccCCcCcCCCCHHHHHHHHHH
Confidence 7543 333333322211000 0125678999999999999999999998 99999988642 3468999999999999
Q ss_pred hC
Q 043639 222 YG 223 (261)
Q Consensus 222 YG 223 (261)
||
T Consensus 155 Y~ 156 (156)
T cd04279 155 YG 156 (156)
T ss_pred hC
Confidence 97
No 5
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.92 E-value=1e-24 Score=177.55 Aligned_cols=136 Identities=32% Similarity=0.548 Sum_probs=109.8
Q ss_pred CCcccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCC
Q 043639 58 KPRWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLG 137 (261)
Q Consensus 58 ~~kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg 137 (261)
..+|++. +|+|+|.. +.++..+++++|++||+.|+++++|+|+|+.+ .+||.|.|...+++ |.
T Consensus 2 ~~~W~~~---~v~Y~i~~----~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~--~ad~~I~f~~~~~~--------g~ 64 (140)
T smart00235 2 SKKWPKG---TVPYVIDS----SSLSPEEVREAIARAFAEWSDVTCLRFVERTS--TADIYISFGKGDGS--------GC 64 (140)
T ss_pred CCcCCCC---cEEEEEcC----CCCCHHHHHHHHHHHHHHHhcCCeeEEEECCC--CCCeEEEEEECCCC--------Cc
Confidence 4689998 99999962 23445556999999999999999999999874 79999999986541 11
Q ss_pred ceeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCCCCCC---ccccccCCCCCCCCCCHHH
Q 043639 138 VLAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPVKEA---VMYPTLKPRDKKVDLALDD 214 (261)
Q Consensus 138 ~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~s---vM~p~y~~~~~~~~l~~dD 214 (261)
.+++++.| .|+.+++. +.|.. ...|++|||||+|||.|++++.+ +|++.|......+.++++|
T Consensus 65 ~~a~~g~~-~g~~~~~~-~~~~~------------~~~~~~HEigHaLGl~H~~~~~drd~~~~~~~~~~~~~~~~~~~~ 130 (140)
T smart00235 65 TLSHAGRP-GGDQHFSL-GNGCI------------NTGVAAHELGHALGLYHEQSRSDRDNYMYINYTNITRNFDLSNDD 130 (140)
T ss_pred ceeeeecC-CCceEEEc-cCCcC------------CcccHHHHHHHHhcCCcCCCCCcccCeEEEehhhhhhcccccccc
Confidence 26888888 78888876 55542 13599999999999999998887 9999987654567888889
Q ss_pred HHHHHHHhCC
Q 043639 215 IQGVQALYGS 224 (261)
Q Consensus 215 I~~iq~LYG~ 224 (261)
..+++++||+
T Consensus 131 ~~~~~~~yg~ 140 (140)
T smart00235 131 SLGIPYDYGS 140 (140)
T ss_pred CCCchhccCc
Confidence 9999999984
No 6
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.91 E-value=4.8e-24 Score=181.72 Aligned_cols=124 Identities=27% Similarity=0.331 Sum_probs=97.0
Q ss_pred HHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCCCC-------ceeeeccccc
Q 043639 85 TDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPES-------GKFHLDAAET 157 (261)
Q Consensus 85 ~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~-------G~vhfd~~e~ 157 (261)
..+++++++||+.|+++++|+|+|+.+...+||+|.+..... +++.+.|++|.. |+|+++.+..
T Consensus 33 ~~~~~~i~~A~~~w~~~~~l~F~e~~~~~~adI~i~~~~~~~---------~~~~g~a~~p~~~~~~~~~g~i~~~~~~~ 103 (186)
T cd04277 33 AAQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPD---------GNTAGYAYYPGSGSGTAYGGDIWFNSSYD 103 (186)
T ss_pred HHHHHHHHHHHHHHHhhcCceeEECCCCCcceEEEEeccCCC---------CCccEEEECCCCCccccccceeEEecCcc
Confidence 578999999999999999999999988668999998865321 356778888853 6899998877
Q ss_pred ccccCCCCCCCccccchhhhhhhhhhhcCCCCCCCC----------------CCccccccCCC-------CCCCCCCHHH
Q 043639 158 WAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPVK----------------EAVMYPTLKPR-------DKKVDLALDD 214 (261)
Q Consensus 158 wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~----------------~svM~p~y~~~-------~~~~~l~~dD 214 (261)
+... ..+...+.|++|||||+|||.|+++. .+||....... .....++..|
T Consensus 104 ~~~~------~~g~~~~~t~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SVMSY~~~~~~~~~~~~~~~~~~~~~D 177 (186)
T cd04277 104 TNSD------SPGSYGYQTIIHEIGHALGLEHPGDYNGGDPVPPTYALDSREYTVMSYNSGYGNGASAGGGYPQTPMLLD 177 (186)
T ss_pred cccC------CCChhhHHHHHHHHHHHhcCCCCCcCCCCCCCCccccccCcceEEEeecCCCCCCccccCcccCCccHHH
Confidence 6532 12556789999999999999999863 35885432221 2346799999
Q ss_pred HHHHHHHhC
Q 043639 215 IQGVQALYG 223 (261)
Q Consensus 215 I~~iq~LYG 223 (261)
|++||+|||
T Consensus 178 I~AlQ~lYG 186 (186)
T cd04277 178 IAALQYLYG 186 (186)
T ss_pred HHHHHHhhC
Confidence 999999998
No 7
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.91 E-value=2.2e-24 Score=179.41 Aligned_cols=142 Identities=23% Similarity=0.338 Sum_probs=103.5
Q ss_pred eeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCCC
Q 043639 67 MTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPE 146 (261)
Q Consensus 67 ~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~ 146 (261)
+.|+|+|... ++ ..+|+++++||+.|+++++|+|+|+.....+||+|.+.+..+.. +|..+...+...|.
T Consensus 2 ~~I~y~i~~~-~~-----~~~r~~i~~A~~~W~~~~~i~F~e~~~~~~~di~i~~~~~~~~~----~~~~~~~~~~~~~~ 71 (165)
T cd04268 2 KPITYYIDDS-VP-----DKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYN----DGTWSYGPSQVDPL 71 (165)
T ss_pred CCEEEEEcCC-CC-----HHHHHHHHHHHHHHHHHhCcCceeccccCccCceEEEEEeecCC----CCccccCCccCCCC
Confidence 3899999742 22 57899999999999999999999998755899999998643111 11111222222466
Q ss_pred CceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCCC----------------CCCccccccCCC------
Q 043639 147 SGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPV----------------KEAVMYPTLKPR------ 204 (261)
Q Consensus 147 ~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~----------------~~svM~p~y~~~------ 204 (261)
.|+|+++....|.... ...+..++.|++|||||+|||.|+++ ..|||.+.....
T Consensus 72 ~g~i~~~~~~~~~~~~----~~~~~~~~~~~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SvM~y~~~~~~~~~~~ 147 (165)
T cd04268 72 TGEILLARVYLYSSFV----EYSGARLRNTAEHELGHALGLRHNFAASDRDDNVDLLAEKGDTSSVMDYAPSNFSIQLGD 147 (165)
T ss_pred CccEEeeEEEEchhHH----HHHHHHHHHHHHHHHHHHhcccccCcCCccCCcchhhccCCCCcccCCCCcccccccccc
Confidence 7899998766543110 01245678999999999999999998 899998654421
Q ss_pred CCCCCCCHHHHHHHHHHh
Q 043639 205 DKKVDLALDDIQGVQALY 222 (261)
Q Consensus 205 ~~~~~l~~dDI~~iq~LY 222 (261)
.....|+.+||++||.||
T Consensus 148 ~~~~~~~~~Di~ai~~lY 165 (165)
T cd04268 148 GQKYTIGPYDIAAIKKLY 165 (165)
T ss_pred ccCCCCCHHHHHHHHhcC
Confidence 134689999999999999
No 8
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.74 E-value=1.5e-17 Score=143.36 Aligned_cols=109 Identities=28% Similarity=0.440 Sum_probs=75.6
Q ss_pred cccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCce
Q 043639 60 RWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVL 139 (261)
Q Consensus 60 kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~l 139 (261)
+|+.+. .|+|.|... . .+.+++.|++||+.|+++++|+|+|++. ..+||+|.|..+. | ++++.|..+..
T Consensus 2 ~W~~~~--~~~~~f~~~-~-----~~~~r~~I~~A~~~W~~~t~i~F~~~~~-~~adi~I~f~~~~-G-c~S~vG~~~~~ 70 (198)
T cd04327 2 LWRNGT--VLRIAFLGG-P-----DAFLKDKVRAAAREWLPYANLKFKFVTD-ADADIRISFTPGD-G-YWSYVGTDALL 70 (198)
T ss_pred CCCCCC--eEEEEeCCC-C-----cHHHHHHHHHHHHHHhhhcCeEEEECCC-CCCCEEEEEecCC-C-CCCCcCCcccc
Confidence 688873 688888742 1 3678999999999999999999999976 5799999998743 3 56666654321
Q ss_pred eeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCC
Q 043639 140 AHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSP 191 (261)
Q Consensus 140 AhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~ 191 (261)
..+..-.+.+. |.... ..+.+...|++|||||||||.|++
T Consensus 71 ---~~~~~~t~~l~----~~~~~-----~~~~~~~~~i~HElgHaLG~~HEh 110 (198)
T cd04327 71 ---IGADAPTMNLG----WFTDD-----TPDPEFSRVVLHEFGHALGFIHEH 110 (198)
T ss_pred ---cCCCCceeeee----eecCC-----CchhhHHHHHHHHHHHHhcCcccc
Confidence 11112234342 21110 113456789999999999999974
No 9
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.73 E-value=3.1e-17 Score=136.29 Aligned_cols=120 Identities=23% Similarity=0.269 Sum_probs=89.9
Q ss_pred HHHHHHHHHHHHHHhHhcCCceEEEcCCC-CCcceEEEeeecCCCCCCCccCCCCceeeecCCCC-----ceeeeccccc
Q 043639 84 ITDIKAVFKRAFSKWASVIPVSFIETDDY-DFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPES-----GKFHLDAAET 157 (261)
Q Consensus 84 ~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~-~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~-----G~vhfd~~e~ 157 (261)
...++++++.|++.|+++++++|+++... ..+||.+.+...+. .++.+++||.|.. |.+++.....
T Consensus 20 ~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~--------~~~~~g~a~~~~~c~~~~~~~~~~~~~~ 91 (167)
T cd00203 20 SAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDF--------DGGTGGWAYLGRVCDSLRGVGVLQDNQS 91 (167)
T ss_pred HHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccC--------CCCceEEEecCCccCCCCCcEEEecCCc
Confidence 46799999999999999999999999863 38999998875432 2245567776653 4555433222
Q ss_pred ccccCCCCCCCccccchhhhhhhhhhhcCCCCCCC--------------------CCCccccccCCC--CCCCCCCHHHH
Q 043639 158 WAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPV--------------------KEAVMYPTLKPR--DKKVDLALDDI 215 (261)
Q Consensus 158 wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~--------------------~~svM~p~y~~~--~~~~~l~~dDI 215 (261)
+ +.....|++|||||+|||.|.++ ..+||+|..... .....++..||
T Consensus 92 ~-----------~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~~~~~~~~~~~~siM~y~~~~~~~~~~~~fS~~d~ 160 (167)
T cd00203 92 G-----------TKEGAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSVMSYTKGSFSDGQRKDFSQCDI 160 (167)
T ss_pred c-----------cccchhhHHHHHHHHhCCCccCcCCCCCCCccccccccCCCCCCCeEeccCccccCcccCCCcCHHHH
Confidence 1 33567999999999999999876 358998765531 24568999999
Q ss_pred HHHHHHh
Q 043639 216 QGVQALY 222 (261)
Q Consensus 216 ~~iq~LY 222 (261)
+.||++|
T Consensus 161 ~~i~~~Y 167 (167)
T cd00203 161 DQINKLY 167 (167)
T ss_pred HHHHhhC
Confidence 9999998
No 10
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.61 E-value=7e-15 Score=126.76 Aligned_cols=146 Identities=18% Similarity=0.236 Sum_probs=88.5
Q ss_pred eEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhc------CCceEEEcCCC--CCcceEEEeeecCCCCCCCccCCCCce
Q 043639 68 TLSYAFSKENMINYLHITDIKAVFKRAFSKWASV------IPVSFIETDDY--DFADIRIGFYNGDHGDGEPFDGVLGVL 139 (261)
Q Consensus 68 ~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~v------t~l~F~ev~~~--~~aDI~I~f~~~~hgd~~~fdG~gg~l 139 (261)
.|||+|.. +++ .+++++|++||+.|+++ .++.+.++... ...||++...+-.+.+ ++..+..
T Consensus 9 pItyyI~~--~~p----~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~~----~~~~~~g 78 (197)
T cd04276 9 PIVYYLDN--TFP----EKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSP----NGGWAYG 78 (197)
T ss_pred CEEEEecC--CCc----HHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEecC----CCcceec
Confidence 89999984 333 78999999999999997 46666665421 2468985444333322 1222333
Q ss_pred eeecCCCCc-----eeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCCC---------------------C
Q 043639 140 AHSFSPESG-----KFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPV---------------------K 193 (261)
Q Consensus 140 AhA~~P~~G-----~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~---------------------~ 193 (261)
.+...|..| +|.++........... ....+..+..+++||+||+|||.|... .
T Consensus 79 ps~~dPrTGeIl~a~V~l~~~~~~~~~~~~-~~~~~~~~~~~~~he~gh~lGl~hn~~~s~~~~~~~l~~~~~~~~~~~~ 157 (197)
T cd04276 79 PSVVDPRTGEILKADVILYSGFLRQDQLWY-EDLLAASLRYLLAHEVGHTLGLRHNFKASSDGSNEELEDPLGTKEKGAT 157 (197)
T ss_pred ccccCCCCCCeEEEEEEeCchhhccchhHH-HHHHHHHHHHHHHHHHHHHhcCcccccccccCchhhhcchhhhhhcCCc
Confidence 444456554 5566553321110000 000123467899999999999999752 2
Q ss_pred CCcc-ccccCC--C------CCCCCCCHHHHHHHHHHhCC
Q 043639 194 EAVM-YPTLKP--R------DKKVDLALDDIQGVQALYGS 224 (261)
Q Consensus 194 ~svM-~p~y~~--~------~~~~~l~~dDI~~iq~LYG~ 224 (261)
.||| |+.+.. . .....+...||.+||.+||.
T Consensus 158 ~SVMdY~~~~~~~~~~~~~~~~~~~~g~yDi~Aiq~~Y~~ 197 (197)
T cd04276 158 SSVMDYPPPNVAAQGEDQGDYYPPTIGPYDKWAIEYGYTP 197 (197)
T ss_pred ceeecCccccccccCccccccccCCCchHHHHHHHHHcCC
Confidence 4677 322111 0 12346899999999999984
No 11
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=99.57 E-value=2.4e-14 Score=121.87 Aligned_cols=91 Identities=21% Similarity=0.274 Sum_probs=62.9
Q ss_pred eEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCCCC
Q 043639 68 TLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPES 147 (261)
Q Consensus 68 ~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~ 147 (261)
.|.|.|... ++ ...++.|++||+.|+++++|+|+|... ++..|.|..+ .| ++++.|..+ +.
T Consensus 3 ~VpY~i~~~-~~-----~~~~~~I~~A~~~w~~~TcIrF~~~~~---~~~~I~f~~~-~G-c~S~vG~~~--------~~ 63 (180)
T cd04280 3 TVPYVIDGS-FD-----ESDRSLILRAMREIESNTCIRFVPRTT---EKDYIRIVKG-SG-CWSYVGRVG--------GR 63 (180)
T ss_pred EEEEEECCC-CC-----HHHHHHHHHHHHHHHhCCcceEEECCC---CCcEEEEEcC-CC-cceecCccC--------Cc
Confidence 799999842 32 578999999999999999999999875 4455666554 23 444444311 11
Q ss_pred ceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCCC
Q 043639 148 GKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPV 192 (261)
Q Consensus 148 G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~ 192 (261)
-.+.+.. ++....+++|||||||||.|++.
T Consensus 64 q~i~l~~---------------~c~~~g~v~HE~~HalG~~HEh~ 93 (180)
T cd04280 64 QVVSLGS---------------GCFSLGTIVHELMHALGFYHEQS 93 (180)
T ss_pred eeEEeCC---------------CcCcCchhHHHHHHHhcCcchhc
Confidence 2232221 23336899999999999999874
No 12
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=99.54 E-value=2.8e-14 Score=122.49 Aligned_cols=96 Identities=24% Similarity=0.436 Sum_probs=60.5
Q ss_pred cccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCce
Q 043639 60 RWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVL 139 (261)
Q Consensus 60 kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~l 139 (261)
||+.. +|.|.|.. +++ +..++.|++||+.|++.++|+|+++. . .....|.|..+ .| ++++.|..+
T Consensus 2 ~Wp~~---~IpY~~~~-~~~-----~~~~~~I~~Am~~~e~~TcI~F~~~~-~-~~~~~i~~~~~-~g-C~S~vG~~~-- 66 (191)
T PF01400_consen 2 KWPNG---TIPYYIDP-SFS-----SSQRQRIRKAMDEWEKNTCIRFVERT-E-NEDDYISFSNG-SG-CWSYVGRQG-- 66 (191)
T ss_dssp S-GGG---EEEEEEET-TS------HHHHHHHHHHHHHHHHHSSEEEEE-S-S-SSSSEEEEESS-SS-EEEESS--S--
T ss_pred cCCCC---EEEEEECC-CCC-----HHHHHHHHHHHHHHHhCCCEEEEECC-C-CCceEEEeecC-cc-ccchhhhcC--
Confidence 89999 99999985 332 57889999999999999999999998 2 33344555442 22 344444311
Q ss_pred eeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCC
Q 043639 140 AHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSP 191 (261)
Q Consensus 140 AhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~ 191 (261)
+.=.+.+. .+.....+++|||||||||.|.+
T Consensus 67 ------g~q~i~l~---------------~~c~~~~~i~HEl~HaLG~~HEh 97 (191)
T PF01400_consen 67 ------GEQTINLG---------------DGCFSVGTILHELGHALGFWHEH 97 (191)
T ss_dssp ------SEEEEEE----------------TTC-SHHHHHHHHHHHHTB--GG
T ss_pred ------cceeEEec---------------ceeCCccchHHHHHHHHhhhhhh
Confidence 00012221 02234679999999999999986
No 13
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=99.54 E-value=1.1e-13 Score=119.70 Aligned_cols=99 Identities=24% Similarity=0.434 Sum_probs=66.7
Q ss_pred CCcccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCC
Q 043639 58 KPRWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLG 137 (261)
Q Consensus 58 ~~kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg 137 (261)
..+|++. .|.|.|.. +++ ...++.|++||+.|++.|+|+|++-.. ... .|.|..+..| ++++.|.-+
T Consensus 7 ~~~Wp~~---~VpY~i~~-~~~-----~~~~~~I~~Am~~~~~~TCirF~~r~~-~~~--yi~f~~~~~G-C~S~vG~~~ 73 (200)
T cd04281 7 ERIWPGG---VIPYVIDG-NFT-----GSQRAMFKQAMRHWENFTCVTFVERTP-EEN--YIVFTYRPCG-CCSYVGRRG 73 (200)
T ss_pred cCcCCCC---EEEEEECC-CCC-----HHHHHHHHHHHHHHHhCCceEEEECCC-CCC--EEEEEECCCC-eeEcCCCcC
Confidence 5699999 99999964 332 467889999999999999999998764 333 3444443333 556655422
Q ss_pred ceeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCC
Q 043639 138 VLAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSP 191 (261)
Q Consensus 138 ~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~ 191 (261)
.-.+ .+.+ + .+.....+++|||||||||-|++
T Consensus 74 ~g~q-------~isl--------~-------~~C~~~Gti~HEl~HaLGf~HEh 105 (200)
T cd04281 74 NGPQ-------AISI--------G-------KNCDKFGIVVHELGHVIGFWHEH 105 (200)
T ss_pred CCce-------eeec--------C-------CCcCcCchHHHHHHHHhcCcchh
Confidence 0011 1111 1 12334689999999999999986
No 14
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=99.52 E-value=1.3e-13 Score=119.15 Aligned_cols=136 Identities=20% Similarity=0.352 Sum_probs=89.0
Q ss_pred CeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhc-CCceEEEc--CCCCCcceEEEeeecCCCCCCCccCCCCceeee
Q 043639 66 PMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASV-IPVSFIET--DDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHS 142 (261)
Q Consensus 66 ~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~v-t~l~F~ev--~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA 142 (261)
+++|+..+.+.+. ..++ +.++.++++|++.|+++ ..|+|+-+ .....+||.|.-... +.+.|+.+-|
T Consensus 38 ~~~I~i~~~~~~~-~~l~-s~~~~al~~AI~~yNalgl~i~F~~t~g~~~~~~di~v~~~~~--------~~~~G~ggsA 107 (211)
T PF12388_consen 38 PRTITIIGYTGGS-QSLN-SAWRTALDEAINNYNALGLSITFTLTFGTNYQNADIIVYSNSS--------NNPSGAGGSA 107 (211)
T ss_pred CCEEEEEeCCCcc-cccc-HHHHHHHHHHHHHHHhhCCceEEEEecCCCcCCCceEEEeccC--------CCCCCcceec
Confidence 3488888875321 1233 67999999999999997 35677422 233466776632211 1223456678
Q ss_pred cCC-CCc----eeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCC--------------------------
Q 043639 143 FSP-ESG----KFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSP-------------------------- 191 (261)
Q Consensus 143 ~~P-~~G----~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~-------------------------- 191 (261)
.|| ..| .|.+ +.+. .........|++|||||+|||.|++
T Consensus 108 GFP~s~G~P~~~I~I-----~~~~-----~~~~~~~~hvi~HEiGH~IGfRHTD~~~R~SCG~~~nEg~~~vGAi~IpGT 177 (211)
T PF12388_consen 108 GFPTSNGNPYKFIQI-----YGLS-----NYSVNVIEHVITHEIGHCIGFRHTDYFNRSSCGSGGNEGSAGVGAIHIPGT 177 (211)
T ss_pred cCCCCCCCCCceEEE-----EecC-----CCchhHHHHHHHHHhhhhccccccCcCCcccccccCCcCccccceEECCCC
Confidence 889 444 3555 1111 1112335789999999999999986
Q ss_pred ----CCCCccccccCCCCCCCCCCHHHHHHHHHHh
Q 043639 192 ----VKEAVMYPTLKPRDKKVDLALDDIQGVQALY 222 (261)
Q Consensus 192 ----~~~svM~p~y~~~~~~~~l~~dDI~~iq~LY 222 (261)
|++|||...+... ....++..||.+|++||
T Consensus 178 Pt~~d~~SiM~ac~~~~-~~~~f~~~Di~Al~~lY 211 (211)
T PF12388_consen 178 PTGADPNSIMNACFSSG-EDGEFTSNDITALNYLY 211 (211)
T ss_pred CCCCCCchhhhccccCC-CCCCcChhhHHHHHhhC
Confidence 3457887766543 35579999999999999
No 15
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=99.44 E-value=1.4e-12 Score=114.88 Aligned_cols=103 Identities=21% Similarity=0.332 Sum_probs=67.9
Q ss_pred eeeCCCCCcccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCC
Q 043639 52 YVLFPGKPRWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEP 131 (261)
Q Consensus 52 y~~~~~~~kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~ 131 (261)
-++.....+|++ .|-|.|.. +++ ...+..|++||+.|++.|+|+|++... .. | .|.|..+. | +++
T Consensus 37 ~a~~~~~~~Wp~----~VPY~i~~-~~~-----~~~~~~I~~Am~~~~~~TCirF~~rt~-e~-~-yi~i~~~~-G-C~S 101 (230)
T cd04282 37 NGLIGDTYRWPF----PIPYILDD-SLD-----LNAKGVILKAFEMYRLKSCVDFKPYEG-ES-N-YIFFFKGS-G-CWS 101 (230)
T ss_pred ccccCcccCCCc----ceeEEECC-CCC-----HHHHHHHHHHHHHHHhCCCeeEEECCC-CC-c-EEEEEcCC-C-eee
Confidence 344445679987 49999964 343 467899999999999999999998764 33 3 24444432 3 555
Q ss_pred ccCCCCceeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCCC
Q 043639 132 FDGVLGVLAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPV 192 (261)
Q Consensus 132 fdG~gg~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~ 192 (261)
+.|..| +.-.+.+. .++....+++|||||||||-|++.
T Consensus 102 ~vG~~g--------g~q~isl~---------------~~C~~~Gti~HEl~HalGf~HEqs 139 (230)
T cd04282 102 MVGDQQ--------GGQNLSIG---------------AGCDYKATVEHEFLHALGFYHEQS 139 (230)
T ss_pred ccCccC--------CeEEEEEC---------------CCcCCCchHHHHHHHHhCCccccc
Confidence 555421 01112221 133446899999999999999863
No 16
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=99.40 E-value=3.3e-12 Score=108.89 Aligned_cols=92 Identities=20% Similarity=0.309 Sum_probs=59.6
Q ss_pred eeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCCC
Q 043639 67 MTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPE 146 (261)
Q Consensus 67 ~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~ 146 (261)
..|-|.|.. +++ ...++.|++|++.|++.|+|+|++... ....|+| ..+ .| ++++.|..|-
T Consensus 4 ~~IpY~i~~-~~~-----~~~~~~I~~A~~~~~~~TCirF~~~~~-~~~yi~~--~~~-~g-C~S~vG~~gg-------- 64 (182)
T cd04283 4 VYVPYVISP-QYS-----ENERAVIEKAMQEFETLTCVRFVPRTT-ERDYLNI--ESR-SG-CWSYIGRQGG-------- 64 (182)
T ss_pred EEEEEEECC-CCC-----HHHHHHHHHHHHHHHhCCceeeEECCC-CCcEEEE--EcC-CC-ceEecCccCC--------
Confidence 378899964 332 467899999999999999999998753 3334444 332 22 4555443210
Q ss_pred CceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCC
Q 043639 147 SGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSP 191 (261)
Q Consensus 147 ~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~ 191 (261)
.-.+.+. ..+.....+++|||||||||.|++
T Consensus 65 ~q~i~l~--------------~~~C~~~G~i~HEl~HaLG~~HEh 95 (182)
T cd04283 65 RQTVSLQ--------------KQGCMYKGIIQHELLHALGFYHEQ 95 (182)
T ss_pred ceeEecC--------------CCCcCccchHHHHHHHHhCCcccc
Confidence 0011111 013334689999999999999986
No 17
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=99.35 E-value=5.8e-12 Score=119.92 Aligned_cols=107 Identities=21% Similarity=0.291 Sum_probs=75.4
Q ss_pred cceeeCCCCCcccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCC
Q 043639 50 KDYVLFPGKPRWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDG 129 (261)
Q Consensus 50 kry~~~~~~~kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~ 129 (261)
+|........+|+.. .|.|.|... +. ...+.+|++||+.|++.|+|+|.|..+ .+.+..+.+... | +
T Consensus 71 ~r~~~~~~~~~Wp~~---~ipY~i~~~-~~-----~~~r~~i~~A~~~~~~~TCi~F~~~~~-~~~~~~~~~~~~--g-C 137 (411)
T KOG3714|consen 71 RRNGTSNPERRWPNG---VIPYYIDGS-FT-----SSQRALIRQAMREIENHTCIRFVERTT-PDKDYLIVFTGG--G-C 137 (411)
T ss_pred hhhcccChhhcCCCC---eeeeEECCC-CC-----HHHHHHHHHHHHHHhhCcceEEEeCCC-CCcceEEEeCCC--c-c
Confidence 345555667899998 999999853 43 589999999999999999999999874 445555554432 3 5
Q ss_pred CCccCCCCceeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCC
Q 043639 130 EPFDGVLGVLAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSP 191 (261)
Q Consensus 130 ~~fdG~gg~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~ 191 (261)
+++.|..|.- . ..-+++ .+...+.|++|||+||||+-|.+
T Consensus 138 ~S~VGr~gg~---------~------q~~sl~-------~~C~~~G~i~HEl~HaLGf~Heh 177 (411)
T KOG3714|consen 138 YSYVGRRGGG---------Q------QLLSLG-------DGCDRFGTIVHELMHALGFWHEH 177 (411)
T ss_pred eeeeCccCCC---------c------cceecC-------CCcCcCchhHHHHHHHhhhhhcc
Confidence 5665643311 0 011111 24455899999999999999986
No 18
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=99.19 E-value=1.1e-10 Score=92.72 Aligned_cols=125 Identities=26% Similarity=0.313 Sum_probs=75.9
Q ss_pred eeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHh-cCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCC
Q 043639 67 MTLSYAFSKENMINYLHITDIKAVFKRAFSKWAS-VIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSP 145 (261)
Q Consensus 67 ~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~-vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P 145 (261)
++|+|.-+. ..+.+..|.+|.+.|+. |.+++|++... ++|++.... + +++..+..--.
T Consensus 2 vtv~Ydas~--------A~~f~~~i~~aa~iWN~sV~NV~L~~~s~---a~~~~~~~~-~---------~~~sya~~~g~ 60 (132)
T PF02031_consen 2 VTVYYDASR--------APEFRSAIAQAAQIWNSSVSNVRLVEGSS---ADIRYYEGN-D---------PRGSYASTDGL 60 (132)
T ss_dssp EEEEEEEEE---------GGGHHHHHHHHHHHHHH-SSEEEEE-SS----SEEEEEE------------TT--EEEE-SS
T ss_pred EEEEEeCCC--------CchhHHHHHHHHHHHhcccCceEEeecCC---CcEEEEEec-C---------CCCcccccCCC
Confidence 378888763 24688999999999987 78999999863 888876432 1 12333333334
Q ss_pred CCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCC--CCCCCCccccccCCCC-CCCCCCHHHHHHHHHHh
Q 043639 146 ESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTH--SPVKEAVMYPTLKPRD-KKVDLALDDIQGVQALY 222 (261)
Q Consensus 146 ~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~H--s~~~~svM~p~y~~~~-~~~~l~~dDI~~iq~LY 222 (261)
+.|.|.+|.. +. .+++-..|++|||||.|||.. +...+.+|...-.+.. .+..+...++..+++||
T Consensus 61 G~G~I~l~~~--~~---------qgy~~~RIaaHE~GHiLGLPD~y~GpCS~LMSG~~aG~sCtN~~Pna~E~a~V~~~~ 129 (132)
T PF02031_consen 61 GSGYIFLDYQ--QN---------QGYNSTRIAAHELGHILGLPDHYPGPCSELMSGGSAGTSCTNAYPNAAERARVNSLF 129 (132)
T ss_dssp S-EEEEEEHH--HH---------HHS-HHHHHHHHHHHHHT----TTS-TT-GGGTTTT-TT----S--HHHHHHHHHHC
T ss_pred CcEEEEechH--Hh---------hCCccceeeeehhccccCCCCCCCCccHHhhcCCCCCCCCCCCCCCHHHHHHHHHHh
Confidence 5688888752 21 156667899999999999953 3445789975544322 34467889999999998
Q ss_pred C
Q 043639 223 G 223 (261)
Q Consensus 223 G 223 (261)
.
T Consensus 130 a 130 (132)
T PF02031_consen 130 A 130 (132)
T ss_dssp T
T ss_pred c
Confidence 5
No 19
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.94 E-value=3.8e-09 Score=90.53 Aligned_cols=148 Identities=22% Similarity=0.320 Sum_probs=86.3
Q ss_pred CCCcccCCCCeeEEEEEecC-CcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCC
Q 043639 57 GKPRWTRHVPMTLSYAFSKE-NMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGV 135 (261)
Q Consensus 57 ~~~kW~~~~~~~LTY~~~~~-~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~ 135 (261)
+..-|.+. .++.++.-. +...--....+++++..|++-|++.-|+..+|-. ..|||+|...+.. | .+.++.
T Consensus 77 ~~L~~S~f---~v~v~irwpq~l~n~~~ap~wq~a~~tava~wa~~fpl~ive~~--eeaDItie~~n~p-g--tg~~~~ 148 (236)
T COG5549 77 GYLIWSEF---PVDVRIRWPQNLENVEGAPRWQGAYLTAVAGWAKTFPLIIVERF--EEADITIEVGNPP-G--TGWRQY 148 (236)
T ss_pred ceeecccC---cceeeecccccccccccchhHHHHHHHHHHHHHHhCCceeeecc--eeeeEEEEecCCC-C--Cccccc
Confidence 34568776 555555411 1111111235789999999999999777666654 6899999875432 2 112222
Q ss_pred CC---------ceeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCC-CCCCCCCccccccCCCC
Q 043639 136 LG---------VLAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLT-HSPVKEAVMYPTLKPRD 205 (261)
Q Consensus 136 gg---------~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~-Hs~~~~svM~p~y~~~~ 205 (261)
|. .+.|+..|+ |++.. ..+... .+....++..+|.||+|||||+. |+..+ ++|.+..+
T Consensus 149 grArtal~~~efl~~~~~~~----~lnhr--~~i~~~--pg~~~e~L~~tarhElGhaLgi~ghsd~k-s~aly~~s--- 216 (236)
T COG5549 149 GRARTALIAYEFLGHALGLG----HLNHR--GDIMYP--PGELRENLNPTARHELGHALGIWGHSDLK-SDALYGIS--- 216 (236)
T ss_pred chHHHHHHHHHHhhhccCcc----ccccc--ccccCC--cccchhhhhHHHHHhhcchheecccccch-hhhhcchh---
Confidence 11 112222221 12110 011111 12335678899999999999997 98765 45543333
Q ss_pred CCCCCCHHHHHHHHHHhCCC
Q 043639 206 KKVDLALDDIQGVQALYGSN 225 (261)
Q Consensus 206 ~~~~l~~dDI~~iq~LYG~~ 225 (261)
...++..|+..+.++|-.+
T Consensus 217 -~~~is~rdv~tL~r~Yeqp 235 (236)
T COG5549 217 -WQGISRRDVKTLERKYEQP 235 (236)
T ss_pred -hcccCHHHHHHHHHHhcCC
Confidence 2238999999999999754
No 20
>PF11350 DUF3152: Protein of unknown function (DUF3152); InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=98.33 E-value=4.4e-06 Score=71.79 Aligned_cols=142 Identities=20% Similarity=0.270 Sum_probs=89.9
Q ss_pred eeEEEEEecCCcCCcC---CHHHHHHHHHHHH---HHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCC--CccCCCCc
Q 043639 67 MTLSYAFSKENMINYL---HITDIKAVFKRAF---SKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGE--PFDGVLGV 138 (261)
Q Consensus 67 ~~LTY~~~~~~~~~~l---~~~~~~~~i~~Af---~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~--~fdG~gg~ 138 (261)
+.++|++.-++-.... ..+++.++|.+.+ +-|.....+.|++|.. +.+|++|...+...-+.. +++-.+.
T Consensus 29 ~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~-~~~Df~I~Lasp~T~~~lC~g~~~~~e- 106 (203)
T PF11350_consen 29 RLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDS-GAPDFRISLASPGTTDRLCAGLDTSGE- 106 (203)
T ss_pred ceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCC-CCCCEEEEECCcchhhhhccCcCcCce-
Confidence 5899999866422211 1356667777766 4799888999999987 559999999876543321 2222211
Q ss_pred eeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCCCC-----CCccccccCCCCCCCCCCHH
Q 043639 139 LAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPVK-----EAVMYPTLKPRDKKVDLALD 213 (261)
Q Consensus 139 lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~-----~svM~p~y~~~~~~~~l~~d 213 (261)
. .-+.|..++|.+|. ..|..+.......-+.+...|+=||+||+||-.|..=+ --||- +..+.|.-+
T Consensus 107 ~-SC~~~~g~~VvIN~-~RWv~Ga~~f~gdl~~YRqYvINHEVGH~LGh~H~~Cpg~G~~APVMm------QQT~gL~g~ 178 (203)
T PF11350_consen 107 T-SCRNPAGGRVVINL-ARWVRGAPAFGGDLASYRQYVINHEVGHALGHGHEPCPGPGRPAPVMM------QQTLGLDGC 178 (203)
T ss_pred e-EeecCCCCeEEEeh-HHhhccCcccCCcHHHHHHHhhhhhhhhhcccCCCcCCCCCCcCCcch------hhhcccCCc
Confidence 1 12334456777765 46877655433333455678999999999999887643 33553 223455556
Q ss_pred HHHHH
Q 043639 214 DIQGV 218 (261)
Q Consensus 214 DI~~i 218 (261)
+|..|
T Consensus 179 ~~~~~ 183 (203)
T PF11350_consen 179 ELAKL 183 (203)
T ss_pred ccccc
Confidence 66655
No 21
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=97.98 E-value=5.5e-05 Score=64.40 Aligned_cols=106 Identities=16% Similarity=0.083 Sum_probs=59.7
Q ss_pred HHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCCCCceeeecccccccccCCCCCCCcc
Q 043639 91 FKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPESGKFHLDAAETWAVDFGSEKSPVA 170 (261)
Q Consensus 91 i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~ 170 (261)
..++|..|.... ....|+.+.+...+- +.+++++.||.++.-...+... ..... ...
T Consensus 74 ~l~~f~~~~~~~---------~~~~d~~~l~t~~~~-------~~~~~~G~a~~~~~C~~~~~~~--v~~~~-----~~~ 130 (192)
T cd04267 74 TLNSFSFWRAEG---------PIRHDNAVLLTAQDF-------IEGDILGLAYVGSMCNPYSSVG--VVEDT-----GFT 130 (192)
T ss_pred HHHHHHHHHhcC---------CCCCceEEEEeeecc-------CCCCeeeeecccCCCCCCCCeE--EEecC-----Ccc
Confidence 356788997644 145666666553321 2345667777665211000000 00000 001
Q ss_pred ccchhhhhhhhhhhcCCCCCCCC----------CCccccccCCCCCCCCCCHHHHHHHHH
Q 043639 171 VDLESVAVHEIGHLLGLTHSPVK----------EAVMYPTLKPRDKKVDLALDDIQGVQA 220 (261)
Q Consensus 171 ~~l~~Va~HEiGHaLGL~Hs~~~----------~svM~p~y~~~~~~~~l~~dDI~~iq~ 220 (261)
.....+++||+||.|||.|.... .-||.|.... .....++...+..|+.
T Consensus 131 ~~~~~~~aHElGH~lG~~HD~~~~~~~~c~~~~~~IM~~~~~~-~~~~~FS~CS~~~i~~ 189 (192)
T cd04267 131 LLTALTMAHELGHNLGAEHDGGDELAFECDGGGNYIMAPVDSG-LNSYRFSQCSIGSIRE 189 (192)
T ss_pred eeehhhhhhhHHhhcCCcCCCCCCccccCCCCCCeEEcccccC-CCCCccChhhHHHHHH
Confidence 23457999999999999998753 2389876543 2345678877777764
No 22
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=97.96 E-value=4.2e-05 Score=66.26 Aligned_cols=120 Identities=16% Similarity=0.212 Sum_probs=71.7
Q ss_pred HHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCCCCceeeecccccccccCCCCCCCcc
Q 043639 91 FKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPESGKFHLDAAETWAVDFGSEKSPVA 170 (261)
Q Consensus 91 i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~ 170 (261)
.-+.|..|.......-. .+....|+.+.+...+.... ++++++++.||..+-=. ......+.. ..+
T Consensus 72 ~L~~F~~w~~~~~~~~~--~~~~~~D~a~llt~~d~~~~---~~~~~~~G~A~~ggiC~----~~~s~~i~~-----~~~ 137 (207)
T cd04273 72 SLKSFCRWQKKLNPPND--SDPEHHDHAILLTRQDICRS---NGNCDTLGLAPVGGMCS----PSRSCSINE-----DTG 137 (207)
T ss_pred HHHHHHHHHHHcCCccc--ccccccceEEEEeeeccccc---CCCCCceEEeccccCCC----CCcceEEEc-----CCC
Confidence 35789999764332211 11246777777765443221 46677777777532100 000001110 113
Q ss_pred ccchhhhhhhhhhhcCCCCCCC---------CCCccccccCCCCCCCCCCHHHHHHHHHHhCC
Q 043639 171 VDLESVAVHEIGHLLGLTHSPV---------KEAVMYPTLKPRDKKVDLALDDIQGVQALYGS 224 (261)
Q Consensus 171 ~~l~~Va~HEiGHaLGL~Hs~~---------~~svM~p~y~~~~~~~~l~~dDI~~iq~LYG~ 224 (261)
.....|++|||||.||+.|-.+ .+-||+|..........++.--+..++.+...
T Consensus 138 ~~~a~~~aHElGH~LG~~HD~~~~~C~~~~~~~~IM~~~~~~~~~~~~fS~CS~~~~~~~l~~ 200 (207)
T cd04273 138 LSSAFTIAHELGHVLGMPHDGDGNSCGPEGKDGHIMSPTLGANTGPFTWSKCSRRYLTSFLDT 200 (207)
T ss_pred ceeEEeeeeechhhcCCCCCCCCCCCCCCCCCceeecccccCCCCCCCcCHHHHHHHHHHHcc
Confidence 3445799999999999999875 26799987765334667888888888776653
No 23
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=97.91 E-value=1.1e-05 Score=70.13 Aligned_cols=48 Identities=25% Similarity=0.306 Sum_probs=32.2
Q ss_pred chhhhhhhhhhhcCCCCCCCC--------------CCccccccCCCCCCCCCCHHHHHHHHHHh
Q 043639 173 LESVAVHEIGHLLGLTHSPVK--------------EAVMYPTLKPRDKKVDLALDDIQGVQALY 222 (261)
Q Consensus 173 l~~Va~HEiGHaLGL~Hs~~~--------------~svM~p~y~~~~~~~~l~~dDI~~iq~LY 222 (261)
...+++|||||+|||.|..+. .+||. |........++.-+|..|+...
T Consensus 137 ~~~~~aHEiGH~lGl~H~~~~~~~~~~~~~~~~~~~tIMs--Y~~~~~~~~FS~~~i~~i~~~~ 198 (206)
T PF13583_consen 137 GYQTFAHEIGHNLGLRHDFDYSGDPYSYAYECDSGQTIMS--YASRPSRPYFSPPSIEYIREVV 198 (206)
T ss_pred cchHHHHHHHHHhcCCCCcccCCCcccccccCCCCCEEeC--CCCCCCCCCCCchhHHHHhCCC
Confidence 357899999999999998752 25785 2211112237788888776543
No 24
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=97.68 E-value=2.6e-05 Score=64.86 Aligned_cols=25 Identities=32% Similarity=0.492 Sum_probs=15.9
Q ss_pred cccchhhhhhhhhhhcCCCCCCCCC
Q 043639 170 AVDLESVAVHEIGHLLGLTHSPVKE 194 (261)
Q Consensus 170 ~~~l~~Va~HEiGHaLGL~Hs~~~~ 194 (261)
..+...|++||+||-|||.|.....
T Consensus 66 ~~~~g~TltHEvGH~LGL~HtF~~~ 90 (154)
T PF05572_consen 66 QYNFGKTLTHEVGHWLGLYHTFGGG 90 (154)
T ss_dssp TS-SSHHHHHHHHHHTT---TT---
T ss_pred ccccccchhhhhhhhhcccccccCC
Confidence 4456789999999999999998654
No 25
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=97.54 E-value=2.6e-05 Score=65.94 Aligned_cols=48 Identities=21% Similarity=0.197 Sum_probs=19.3
Q ss_pred chhhhhhhhhhhcCCCCCCCCC-------------CccccccCCCCCCCCCCHHHHHHHHHHh
Q 043639 173 LESVAVHEIGHLLGLTHSPVKE-------------AVMYPTLKPRDKKVDLALDDIQGVQALY 222 (261)
Q Consensus 173 l~~Va~HEiGHaLGL~Hs~~~~-------------svM~p~y~~~~~~~~l~~dDI~~iq~LY 222 (261)
...|++|||||.||+.|..+.. .||++. +......++..-|..|+.+-
T Consensus 111 ~~~~~aHElGH~lGa~Hd~~~~~~~~~~~~~~sg~~IMsy~--~~~~~~~fS~cS~~~i~~~l 171 (173)
T PF13574_consen 111 GIDTFAHELGHQLGAPHDFDNGCNPGSPTCPGSGGYIMSYA--GICNNDKFSPCSIRQIRAVL 171 (173)
T ss_dssp HHHHHHHHHHHHHT---SSSS----SS--T-STTT-TT-SS---GGGG---------------
T ss_pred eeeeehhhhHhhcCCCCCCCCCCCCCCCcCCCCCCEEEeeC--CCCCCccccccccccccccc
Confidence 3578999999999999987642 588743 32344567777777776543
No 26
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=97.44 E-value=5.4e-05 Score=66.77 Aligned_cols=26 Identities=38% Similarity=0.593 Sum_probs=22.4
Q ss_pred cccchhhhhhhhhhhcCCCCCCCCCC
Q 043639 170 AVDLESVAVHEIGHLLGLTHSPVKEA 195 (261)
Q Consensus 170 ~~~l~~Va~HEiGHaLGL~Hs~~~~s 195 (261)
..++-.|++|||||.|||.|++...+
T Consensus 134 ~~n~g~t~~HEvGH~lGL~HtF~~~~ 159 (225)
T cd04275 134 PYNLGDTATHEVGHWLGLYHTFQGGS 159 (225)
T ss_pred cccccceeEEeccceeeeeeeecCCC
Confidence 45677899999999999999998764
No 27
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=97.38 E-value=7.2e-05 Score=62.91 Aligned_cols=27 Identities=44% Similarity=0.730 Sum_probs=24.2
Q ss_pred hhhhhhhhhhhcCCCCCCCCCCccccc
Q 043639 174 ESVAVHEIGHLLGLTHSPVKEAVMYPT 200 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~Hs~~~~svM~p~ 200 (261)
..=++||+||+|||.|=.++.=||++.
T Consensus 125 ~KEv~HElGH~~GL~HC~N~~CVM~FS 151 (181)
T COG1913 125 VKEVLHELGHLLGLSHCPNPRCVMNFS 151 (181)
T ss_pred HHHHHHHhhhhcCcccCCCCCcEEeCC
Confidence 456899999999999999999999854
No 28
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=97.32 E-value=7e-05 Score=63.84 Aligned_cols=22 Identities=36% Similarity=0.401 Sum_probs=14.5
Q ss_pred cchhhhhhhhhhhcCCCCCCCC
Q 043639 172 DLESVAVHEIGHLLGLTHSPVK 193 (261)
Q Consensus 172 ~l~~Va~HEiGHaLGL~Hs~~~ 193 (261)
....|++|||||.||+.|..+.
T Consensus 141 ~~~~~~AHEiGH~lGa~HD~~~ 162 (196)
T PF13688_consen 141 NGAITFAHEIGHNLGAPHDGDY 162 (196)
T ss_dssp HHHHHHHHHHHHHTT-----SS
T ss_pred ceehhhHHhHHHhcCCCCCCCC
Confidence 3468999999999999998775
No 29
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=97.23 E-value=9.8e-05 Score=58.23 Aligned_cols=17 Identities=53% Similarity=0.808 Sum_probs=12.3
Q ss_pred hhhhhhhhhhhcCCCCC
Q 043639 174 ESVAVHEIGHLLGLTHS 190 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~Hs 190 (261)
..+++|||||.|||.|+
T Consensus 108 ~~~~~HEiGH~lGl~Hd 124 (124)
T PF13582_consen 108 VDTFAHEIGHNLGLNHD 124 (124)
T ss_dssp TTHHHHHHHHHTT----
T ss_pred ceEeeehhhHhcCCCCC
Confidence 48999999999999995
No 30
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=97.18 E-value=0.0004 Score=61.34 Aligned_cols=49 Identities=29% Similarity=0.455 Sum_probs=38.5
Q ss_pred hhhhhhhhhhcCCCCCCCC-------------------------CCccccccCCCCCCCCCCHHHHHHHHHHhCCC
Q 043639 175 SVAVHEIGHLLGLTHSPVK-------------------------EAVMYPTLKPRDKKVDLALDDIQGVQALYGSN 225 (261)
Q Consensus 175 ~Va~HEiGHaLGL~Hs~~~-------------------------~svM~p~y~~~~~~~~l~~dDI~~iq~LYG~~ 225 (261)
.|++|||||.||+.|-.+. .=||+|.... ....++.--+..|+.+...+
T Consensus 147 ~t~AHElGHnLGm~HD~~~~~C~~~~~~~~~~Cp~~~~~C~~~~~~IM~~~~~~--~~~~FS~CS~~~~~~~l~~~ 220 (228)
T cd04271 147 QVFAHEIGHTFGAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPSSSS--GITEFSPCTIGNICSLLGRN 220 (228)
T ss_pred eehhhhhhhhcCCCCCCCCCccccccccccccCCCcccccCCCCCEEeecCcCC--CCCccChhhHHHHHHHHhcC
Confidence 6999999999999996431 1189987543 45789999999999888754
No 31
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=97.14 E-value=0.00021 Score=60.90 Aligned_cols=26 Identities=38% Similarity=0.551 Sum_probs=23.5
Q ss_pred hhhhhhhhhhhcCCCCCCCCCCcccc
Q 043639 174 ESVAVHEIGHLLGLTHSPVKEAVMYP 199 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~Hs~~~~svM~p 199 (261)
..+++||+||.|||.|-.++.-+|+.
T Consensus 126 ~k~~~HElGH~lGL~HC~~~~CvM~~ 151 (179)
T PRK13267 126 RKEVTHELGHTLGLEHCDNPRCVMNF 151 (179)
T ss_pred HHHHHHHHHHHcCCccCCCCCccCCC
Confidence 45699999999999999999999984
No 32
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=97.07 E-value=0.00063 Score=58.04 Aligned_cols=50 Identities=20% Similarity=0.321 Sum_probs=36.5
Q ss_pred chhhhhhhhhhhcCCCCCCC------CCCccccccCCCCCCCCCCHHHHHHHHHHhCC
Q 043639 173 LESVAVHEIGHLLGLTHSPV------KEAVMYPTLKPRDKKVDLALDDIQGVQALYGS 224 (261)
Q Consensus 173 l~~Va~HEiGHaLGL~Hs~~------~~svM~p~y~~~~~~~~l~~dDI~~iq~LYG~ 224 (261)
.-.+++|||||.||+.|-.+ ..-||+|.... ....++.--..-++++...
T Consensus 131 ~a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~~--~~~~fS~CS~~~~~~~l~~ 186 (194)
T cd04269 131 FAVTMAHELGHNLGMEHDDGGCTCGRSTCIMAPSPSS--LTDAFSNCSYEDYQKFLSR 186 (194)
T ss_pred HHHHHHHHHHhhcCCCcCCCCCCCCCCCeEecCCCCC--CCCCCChhhHHHHHHHHhc
Confidence 35799999999999999653 35689886543 3456777777777766654
No 33
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=96.91 E-value=0.00039 Score=59.91 Aligned_cols=27 Identities=44% Similarity=0.651 Sum_probs=19.2
Q ss_pred chhhhhhhhhhhcCCCCCCCCCCccccc
Q 043639 173 LESVAVHEIGHLLGLTHSPVKEAVMYPT 200 (261)
Q Consensus 173 l~~Va~HEiGHaLGL~Hs~~~~svM~p~ 200 (261)
+..+++||+||++||.|=.+ .-+|++.
T Consensus 145 ~~Kea~HElGH~~GL~HC~~-~CvM~~s 171 (194)
T PF07998_consen 145 VCKEAVHELGHLFGLDHCEN-RCVMNFS 171 (194)
T ss_dssp HHHHHHHHHHHHTT----SS-TSTTS--
T ss_pred HHHHHHHHHHHHcCCcCCCC-CCccCCC
Confidence 45799999999999999999 9999864
No 34
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=96.46 E-value=0.0039 Score=53.42 Aligned_cols=52 Identities=19% Similarity=0.415 Sum_probs=33.4
Q ss_pred ccchhhhhhhhhhhcCCCCCCC--------CCCccccccCCCCCCCCCCHHHHHHHHHHhC
Q 043639 171 VDLESVAVHEIGHLLGLTHSPV--------KEAVMYPTLKPRDKKVDLALDDIQGVQALYG 223 (261)
Q Consensus 171 ~~l~~Va~HEiGHaLGL~Hs~~--------~~svM~p~y~~~~~~~~l~~dDI~~iq~LYG 223 (261)
..+-.+++|||||.||+.|-.. ..-||.|...+. ....++.--+..++++..
T Consensus 129 ~~~a~~~AHelGH~lGm~HD~~~~C~C~~~~~cIM~~~~~~~-~~~~fS~CS~~~~~~~l~ 188 (199)
T PF01421_consen 129 LSFAVIIAHELGHNLGMPHDGDNGCKCPRRDGCIMSPSIGPS-SSDKFSNCSRRQFEEFLS 188 (199)
T ss_dssp HHHHHHHHHHHHHHTT---TTTCTHCSSSSSSSTTSSSSSSS-STSSS-HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhcCCCCCCCCcCCcCCCCCccccccccCC-CCCCcCHHHHHHHHHHHc
Confidence 3445789999999999999776 256899855432 233788877777776643
No 35
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=96.33 E-value=0.0045 Score=55.12 Aligned_cols=50 Identities=20% Similarity=0.408 Sum_probs=36.1
Q ss_pred hhhhhhhhhhhcCCCCCCC-----C------CCccccccCCC--CCCCCCCHHHHHHHHHHhC
Q 043639 174 ESVAVHEIGHLLGLTHSPV-----K------EAVMYPTLKPR--DKKVDLALDDIQGVQALYG 223 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~Hs~~-----~------~svM~p~y~~~--~~~~~l~~dDI~~iq~LYG 223 (261)
..|++|||||.||..|-.. + .=||+|.-... .....++.--++.++...-
T Consensus 168 a~t~AHElGHnlGm~HD~~~~~C~~~~~~~~~cIM~~~~~~~~~~~~~~FS~CS~~~~~~~L~ 230 (244)
T cd04270 168 DLVTAHELGHNFGSPHDPDIAECAPGESQGGNYIMYARATSGDKENNKKFSPCSKKSISKVLE 230 (244)
T ss_pred HHHHHHHHHHhcCCCCCCCCCCCCCCCCCCCceEecCccccCCCCCCCCCCHhHHHHHHHHHc
Confidence 4799999999999999665 1 24899864321 2356788888888876654
No 36
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=95.98 E-value=0.007 Score=52.83 Aligned_cols=50 Identities=16% Similarity=0.274 Sum_probs=35.5
Q ss_pred chhhhhhhhhhhcCCCCCCCC------------------CCccccccCCCCCCCCCCHHHHHHHHHHhC
Q 043639 173 LESVAVHEIGHLLGLTHSPVK------------------EAVMYPTLKPRDKKVDLALDDIQGVQALYG 223 (261)
Q Consensus 173 l~~Va~HEiGHaLGL~Hs~~~------------------~svM~p~y~~~~~~~~l~~dDI~~iq~LYG 223 (261)
...+++|||||.||+.|-.+. .-||.+... ....+.++.--+.-++++..
T Consensus 145 ~~~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~~C~~~~~~IM~~~~~-~~~~~~fS~CS~~~~~~~l~ 212 (220)
T cd04272 145 GVYTMTHELAHLLGAPHDGSPPPSWVKGHPGSLDCPWDDGYIMSYVVN-GERQYRFSQCSQRQIRNVFR 212 (220)
T ss_pred cHHHHHHHHHHHhCCCCCCCCCCccCCCCCccCcCCCCCCeEEeeccC-CCCCCccCcchHHHHHHHHc
Confidence 368999999999999995332 357886432 23455788877777777654
No 37
>KOG4525 consensus Jacalin-like lectin domain-containing protein [General function prediction only]
Probab=95.48 E-value=0.013 Score=55.88 Aligned_cols=58 Identities=24% Similarity=0.336 Sum_probs=40.6
Q ss_pred hhhhhhhhhhhcCCCCCCCCCCcc---ccccCC----CC------------C---CCCCCHHHHHHHHHHhCCCCCCCCC
Q 043639 174 ESVAVHEIGHLLGLTHSPVKEAVM---YPTLKP----RD------------K---KVDLALDDIQGVQALYGSNPNFTIG 231 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~Hs~~~~svM---~p~y~~----~~------------~---~~~l~~dDI~~iq~LYG~~~~~~~~ 231 (261)
...+.|||||.||+.|.. +.|| |+++.- .+ + +-+.++.-+++++-+|-+..+-..+
T Consensus 302 lGA~~HElGH~lgcpHq~--~GvMlrgy~~lNRsftt~e~~~vrt~s~g~aPv~~keec~w~~lD~~Rf~yhp~f~lp~d 379 (614)
T KOG4525|consen 302 LGAVCHELGHCLGCPHQS--EGVMLRGYDYLNRSFTTDEFECVRTGSKGLAPVLAKEECPWRILDARRFSYHPTFSLPGD 379 (614)
T ss_pred HHHHHHHhhhccCCCCCC--CceEecCchhhcceeeecceeeeecCCcCcCCccccccCchhhhhhhhhcccccccCCCC
Confidence 468899999999999975 5688 443321 00 1 2346778899999999877666555
Q ss_pred CC
Q 043639 232 SL 233 (261)
Q Consensus 232 ~~ 233 (261)
++
T Consensus 380 pt 381 (614)
T KOG4525|consen 380 PT 381 (614)
T ss_pred CC
Confidence 44
No 38
>PF11150 DUF2927: Protein of unknown function (DUF2927); InterPro: IPR021323 This family is conserved in Proteobacteria. Several members are described as being putative lipoproteins, but otherwise the function is not known.
Probab=95.03 E-value=0.69 Score=40.50 Aligned_cols=150 Identities=15% Similarity=0.127 Sum_probs=83.3
Q ss_pred CcccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCC---------C
Q 043639 59 PRWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGD---------G 129 (261)
Q Consensus 59 ~kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd---------~ 129 (261)
.||.+. |+..+.. .+. -......+.+..-++.-..++.+.+..+... .|+|.|.|...+... .
T Consensus 28 ~Rw~~P----Vrv~v~~--~~~-~~~~~d~~~v~~~~~rL~~itg~~I~~~~~~-~aN~~v~~~~~~~~~~~ir~~~p~~ 99 (213)
T PF11150_consen 28 RRWEGP----VRVRVEG--VPP-ADRARDLARVRAYLARLRRITGHPISQVSSP-NANFHVIFVSEDDWRPRIRRLLPGA 99 (213)
T ss_pred ccCCCC----eEEEEec--cCh-hhHHHHHHHHHHHHhhhccccCCceeeccCC-CCcEEEEEeccchhhHHHHHhcccc
Confidence 589976 8888872 111 1123445566666788888999999998864 999999998765411 0
Q ss_pred CCcc--------CCCCceeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCC-CC-CCCCcccc
Q 043639 130 EPFD--------GVLGVLAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTH-SP-VKEAVMYP 199 (261)
Q Consensus 130 ~~fd--------G~gg~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~H-s~-~~~svM~p 199 (261)
..+. -....+..++....+.+.....-.. +... ...--+...+..||-.+|||.- +. .+.||-+
T Consensus 100 ~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~~vi~-i~~~----~~~~~~~~Ci~EElaQ~LGl~NDs~~~~pSIFN- 173 (213)
T PF11150_consen 100 ACFVVPNFSSLPRYARCLVFARSNWSGEITRARAVIF-IRAD----HPPQERRDCIHEELAQGLGLPNDSPRARPSIFN- 173 (213)
T ss_pred ccccchhhhcccccceEEEEEEecCCCceEEEEEEEE-Eecc----CCchHHHHHHHHHHHHHcCCccCCCcCCCceee-
Confidence 0010 0011122222221111111000000 0000 0122356788889999999952 22 2233322
Q ss_pred ccCCCCCCCCCCHHHHHHHHHHhCCC
Q 043639 200 TLKPRDKKVDLALDDIQGVQALYGSN 225 (261)
Q Consensus 200 ~y~~~~~~~~l~~dDI~~iq~LYG~~ 225 (261)
-+.....|+..|..-|+.||-+.
T Consensus 174 ---Dd~~~~~LT~~D~lLLr~LYdP~ 196 (213)
T PF11150_consen 174 ---DDNEFALLTGFDELLLRMLYDPR 196 (213)
T ss_pred ---CCCccccccHHHHHHHHHHcCcc
Confidence 22345578999999999999864
No 39
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=94.63 E-value=0.027 Score=49.46 Aligned_cols=81 Identities=22% Similarity=0.339 Sum_probs=43.8
Q ss_pred HHHHHHHHHHhHhcC---CceEEEcC-----CCCCcceEEEeeecCCCCCCCccCCCCceeeecCCCCceeeeccccccc
Q 043639 88 KAVFKRAFSKWASVI---PVSFIETD-----DYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPESGKFHLDAAETWA 159 (261)
Q Consensus 88 ~~~i~~Af~~Ws~vt---~l~F~ev~-----~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~vhfd~~e~wt 159 (261)
+--++..|++|+++. +++=.|+. +.+-.|++|.-..|.-+|. |-|.+-.+.+.++.. .
T Consensus 18 q~~vk~~~~kYs~v~~~~g~TGae~Ar~iL~~~gl~~V~Ve~~~G~LtDH-------------YdP~~k~vrLS~~vy-~ 83 (222)
T PF04298_consen 18 QWWVKSTFKKYSQVRSSSGMTGAEVARHILDRNGLSDVRVERVPGELTDH-------------YDPRNKVVRLSEDVY-N 83 (222)
T ss_pred HHHHHHHHHhhCcCCCCCCCCHHHHHHHHHHHCCCCCeeEEEeCCCCCCC-------------cCCCCCEEEeCCccC-C
Confidence 345677788888763 23222332 2345677887666544442 344444444433211 0
Q ss_pred ccCCCCCCCccccchhhhhhhhhhhcCCCCCCC
Q 043639 160 VDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPV 192 (261)
Q Consensus 160 ~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~ 192 (261)
....--..|++||.|||+ .|...
T Consensus 84 --------~~SiaAvaVAAHEvGHAi--Q~a~~ 106 (222)
T PF04298_consen 84 --------GRSIAAVAVAAHEVGHAI--QHAEG 106 (222)
T ss_pred --------CCCHHHHHHHHHHHhHHH--hcccc
Confidence 012223579999999995 66654
No 40
>PF10462 Peptidase_M66: Peptidase M66; InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=94.59 E-value=0.015 Score=53.59 Aligned_cols=20 Identities=35% Similarity=0.532 Sum_probs=14.8
Q ss_pred ccchhhhhhhhhhhcCCCCC
Q 043639 171 VDLESVAVHEIGHLLGLTHS 190 (261)
Q Consensus 171 ~~l~~Va~HEiGHaLGL~Hs 190 (261)
.....+++||+||++||.|-
T Consensus 191 ~s~~~~f~HE~GH~~GL~H~ 210 (305)
T PF10462_consen 191 YSYGNEFSHELGHNFGLGHY 210 (305)
T ss_dssp S-SHHHHHHHHHHTTT--SS
T ss_pred cCccceeehhhhhhcCCCCC
Confidence 34568999999999999994
No 41
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=94.47 E-value=0.02 Score=53.02 Aligned_cols=19 Identities=42% Similarity=0.566 Sum_probs=17.0
Q ss_pred chhhhhhhhhhhcCCCCCC
Q 043639 173 LESVAVHEIGHLLGLTHSP 191 (261)
Q Consensus 173 l~~Va~HEiGHaLGL~Hs~ 191 (261)
-..|++||+||-|||.|+.
T Consensus 150 ~~~~~~HElgHN~GL~Ha~ 168 (314)
T PF05548_consen 150 DWATIMHELGHNLGLWHAG 168 (314)
T ss_pred cHHHHHHHhhhhccccccC
Confidence 3579999999999999995
No 42
>PF12044 Metallopep: Putative peptidase family; InterPro: IPR021917 This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length.
Probab=93.87 E-value=0.029 Score=53.90 Aligned_cols=23 Identities=39% Similarity=0.658 Sum_probs=19.5
Q ss_pred hhhhhhhhhhhcCCCCCCCCCCccc
Q 043639 174 ESVAVHEIGHLLGLTHSPVKEAVMY 198 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~Hs~~~~svM~ 198 (261)
....+|||||++||.|..+ .||-
T Consensus 317 iGA~lHEiGH~fg~pH~~~--GIM~ 339 (423)
T PF12044_consen 317 IGAFLHEIGHLFGCPHQED--GIML 339 (423)
T ss_pred hHHHHHHHHHhcCCCCCCC--Ceee
Confidence 4579999999999999965 7773
No 43
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=93.46 E-value=0.023 Score=51.80 Aligned_cols=19 Identities=42% Similarity=0.513 Sum_probs=16.6
Q ss_pred hhhhhhhhhhhcCCCCCCC
Q 043639 174 ESVAVHEIGHLLGLTHSPV 192 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~Hs~~ 192 (261)
..|++||+||.|||.+--+
T Consensus 166 igv~~HE~gH~lGLPDlY~ 184 (286)
T TIGR03296 166 VGVIAHELGHDLGLPDLYD 184 (286)
T ss_pred eeeeehhhhcccCCCcccc
Confidence 6899999999999987654
No 44
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=93.27 E-value=0.063 Score=46.20 Aligned_cols=78 Identities=19% Similarity=0.334 Sum_probs=41.1
Q ss_pred HHHHHHHHHHHhHhcC---CceEEEcC-----CCCCcceEEEeeecCCCCCCCccCCCCceeeecCCCCceeeecccccc
Q 043639 87 IKAVFKRAFSKWASVI---PVSFIETD-----DYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPESGKFHLDAAETW 158 (261)
Q Consensus 87 ~~~~i~~Af~~Ws~vt---~l~F~ev~-----~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~vhfd~~e~w 158 (261)
.+.-++.++++|++|. +++=.||. ..+--|+.+.-.+|...| -|-|..-.+.+.+ .++
T Consensus 20 AQ~kvks~y~KySkV~~s~g~TGaevAr~iLd~nGl~dV~Ve~v~G~LTD-------------HYDP~~kvvrLSe-~~y 85 (226)
T COG2738 20 AQSKVKSTYKKYSKVRASSGLTGAEVARMILDENGLYDVPVEEVPGTLTD-------------HYDPRRKVVRLSE-ANY 85 (226)
T ss_pred HHHHHHHHHHHHhhccCcCCCcHHHHHHHHHhhcCCccceeeeecCCccc-------------ccChhhheeeccc-ccc
Confidence 4567789999999984 34434443 124456666654443222 2333322222211 111
Q ss_pred cccCCCCCCCccccchhhhhhhhhhhcC
Q 043639 159 AVDFGSEKSPVAVDLESVAVHEIGHLLG 186 (261)
Q Consensus 159 t~~~~~~~~~~~~~l~~Va~HEiGHaLG 186 (261)
. ....--..|++||.|||+=
T Consensus 86 ~--------g~Sia~~aVAAHEVGHAiQ 105 (226)
T COG2738 86 Y--------GPSIAAIAVAAHEVGHAIQ 105 (226)
T ss_pred C--------CccHHHHHHHHHHhhHHHh
Confidence 0 0122236899999999964
No 45
>PTZ00337 surface protease GP63; Provisional
Probab=92.63 E-value=0.2 Score=49.98 Aligned_cols=36 Identities=31% Similarity=0.488 Sum_probs=24.3
Q ss_pred CCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCC
Q 043639 145 PESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHS 190 (261)
Q Consensus 145 P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs 190 (261)
|-.|.|.|+.... . .......|++|||.||||+...
T Consensus 211 Pi~G~in~np~~i---~-------~~~~~~~v~~HEi~HALGFs~~ 246 (567)
T PTZ00337 211 PFAAAVNFDPRQI---A-------VTNGDVRVAAHELGHALGFVRD 246 (567)
T ss_pred ceEEEEEECHHHc---c-------chhHHHHHHHHHHHHHHccCHH
Confidence 4447788876432 0 0223578999999999999654
No 46
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=92.46 E-value=0.3 Score=49.02 Aligned_cols=96 Identities=21% Similarity=0.230 Sum_probs=49.9
Q ss_pred CcceEEEeeecCCCCCCCccCCCCceeeec--------CCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhc
Q 043639 114 FADIRIGFYNGDHGDGEPFDGVLGVLAHSF--------SPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLL 185 (261)
Q Consensus 114 ~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~--------~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaL 185 (261)
++|+.|...... +. .+++|.|- .|..|.+.|+.... .. ........|++|||.|+|
T Consensus 205 naD~vLYVta~P------~~--~~~lA~A~~C~~~~d~RPi~G~iNinp~~i---~s-----~~~~~~~rv~~HEi~HAL 268 (622)
T PTZ00257 205 NADFVLYVASVP------SE--PGVLAWATTCQVFSDDHPAVGVMNIPAANI---VS-----RYDQGTTRTVTHEVAHAL 268 (622)
T ss_pred ccCEEEEEEEec------CC--CCeeEEEEEeccccCCCceEEEEeeCHHHC---CC-----ccchHHHHHHHHHHHHHh
Confidence 566665554332 21 25666553 24457788876432 00 012235789999999999
Q ss_pred CCCCCCCCCCccccccCC-CCCCC----CCCHHHHHHHHHHhCCC
Q 043639 186 GLTHSPVKEAVMYPTLKP-RDKKV----DLALDDIQGVQALYGSN 225 (261)
Q Consensus 186 GL~Hs~~~~svM~p~y~~-~~~~~----~l~~dDI~~iq~LYG~~ 225 (261)
|+.-+....--|.-.... ....+ ..++.-++..++-||.+
T Consensus 269 GFS~~~f~~~~~~~~~~~~rGk~~~~~vi~SP~v~~~Ar~hfgC~ 313 (622)
T PTZ00257 269 GFSSVFFENAGIVKSVTNLRGKPFAAPVINSSTAVAKAREQYGCP 313 (622)
T ss_pred cCCHHHHhhcCcccceeccCCCcceeeeecCHHHHHHHHHhcCCC
Confidence 996443222111111110 01111 12566677778888854
No 47
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=91.42 E-value=0.077 Score=42.20 Aligned_cols=20 Identities=40% Similarity=0.735 Sum_probs=16.2
Q ss_pred cccchhhhhhhhhhhcCCCC
Q 043639 170 AVDLESVAVHEIGHLLGLTH 189 (261)
Q Consensus 170 ~~~l~~Va~HEiGHaLGL~H 189 (261)
+.....|++|||||-.||+.
T Consensus 106 gd~vthvliHEIgHhFGLsD 125 (136)
T COG3824 106 GDQVTHVLIHEIGHHFGLSD 125 (136)
T ss_pred hhHhhhhhhhhhhhhcCCCh
Confidence 33456899999999999974
No 48
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=91.41 E-value=0.11 Score=51.29 Aligned_cols=97 Identities=16% Similarity=0.168 Sum_probs=40.8
Q ss_pred CcceEEEeeecCCCCCCCccCCCCceeeec--------CCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhc
Q 043639 114 FADIRIGFYNGDHGDGEPFDGVLGVLAHSF--------SPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLL 185 (261)
Q Consensus 114 ~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~--------~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaL 185 (261)
++|+.|.......+ .+++|.|. -|-.|.|.|+.... .. .....+..|++|||.|+|
T Consensus 159 ~~D~vlyV~~~~~~--------~~~~A~A~~C~~~~~~RP~~G~in~~p~~i---~~-----~~~~~~~~~~~HEi~HaL 222 (521)
T PF01457_consen 159 NADFVLYVTARPSS--------SSTLAWAAPCQQDSDGRPIAGVININPSYI---PS-----FYFQEFFRTVIHEIAHAL 222 (521)
T ss_dssp S-SEEEEEE----S--------TT---EEEEEEE-TTS-EEEEEEE--GGG-----------S--HHHHHHHHHHHHHHT
T ss_pred CccEEEEEEEeecc--------CCceEEEeehhhccCCCCeeEEEEEchhHc---cc-----hhhhcccceeeeeeeeee
Confidence 57776655433221 16777764 13337777766431 10 012345789999999999
Q ss_pred CCCCCCCCCCccc---cccCCCC--CCCCCCHHHHHHHHHHhCCCC
Q 043639 186 GLTHSPVKEAVMY---PTLKPRD--KKVDLALDDIQGVQALYGSNP 226 (261)
Q Consensus 186 GL~Hs~~~~svM~---p~y~~~~--~~~~l~~dDI~~iq~LYG~~~ 226 (261)
|+.-......-|. +...+.. ..+--++.=+.-.+.-||.+.
T Consensus 223 GFs~~~~~~~~~~~~~~~~~~~~~~~~~l~tp~v~~~ar~hf~C~~ 268 (521)
T PF01457_consen 223 GFSSSFFRYYGMPRTVSNVRGKPTTVTVLVTPNVVEFARKHFGCPT 268 (521)
T ss_dssp T-SHHHHHHTT-EEEES-GGG-SS-EEEE--HHHHHHHHHHHT-TT
T ss_pred eecccccccccccccccceeecCceeEEEeCcHHHHHHHHHhCCCc
Confidence 9975211111110 0111100 112235566667788998643
No 49
>PF05547 Peptidase_M6: Immune inhibitor A peptidase M6; InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=88.80 E-value=0.11 Score=52.62 Aligned_cols=51 Identities=20% Similarity=0.149 Sum_probs=33.4
Q ss_pred chhhhhhhhhhhcCCCCCCCC------C-----Ccccc-ccCCC---CCCCCCCHHHHHHHHHHhC
Q 043639 173 LESVAVHEIGHLLGLTHSPVK------E-----AVMYP-TLKPR---DKKVDLALDDIQGVQALYG 223 (261)
Q Consensus 173 l~~Va~HEiGHaLGL~Hs~~~------~-----svM~p-~y~~~---~~~~~l~~dDI~~iq~LYG 223 (261)
-..|++||+||.|||.---+. + ++|.. .|.+. .....++..-.+-+|..+|
T Consensus 221 giGVfaHEfGH~LGLPDlYDT~~~g~~~~vg~WsLMs~GSw~g~~~g~~p~~~~~~~~~w~k~~lG 286 (645)
T PF05547_consen 221 GIGVFAHEFGHDLGLPDLYDTDYSGGGNGVGFWSLMSSGSWNGKGSGTEPGGFPPYMKAWLKLQLG 286 (645)
T ss_pred ceEEEEeeccccCCCCCcccCCCCCCCCCcccceecCCCCCCCCCCCCCCCccChhHhHHHhhhcc
Confidence 368999999999999754432 1 57853 45432 1234566677777777777
No 50
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=87.96 E-value=0.46 Score=48.69 Aligned_cols=46 Identities=24% Similarity=0.304 Sum_probs=30.2
Q ss_pred chhhhhhhhhhhcCCCCCCC-------CCCccccccCCCCCCCCCCHHHHHHHH
Q 043639 173 LESVAVHEIGHLLGLTHSPV-------KEAVMYPTLKPRDKKVDLALDDIQGVQ 219 (261)
Q Consensus 173 l~~Va~HEiGHaLGL~Hs~~-------~~svM~p~y~~~~~~~~l~~dDI~~iq 219 (261)
...|++|||||.||+.|-.. ..=|| +.+.....+..++.--++.+.
T Consensus 323 ~a~v~AhelgH~lGm~hD~~~c~C~~~~~CiM-~~~~~~~~~~~FS~CS~~~~~ 375 (716)
T KOG3607|consen 323 FAVVLAHELGHNLGMIHDESSCSCPSSGLCIM-PEETGFYIPKKFSNCSYQDFE 375 (716)
T ss_pred HHHHHHHHHHhhcCcccccccccCCCCCcccc-ccccCcCcccccccchHHHHH
Confidence 45799999999999999762 22478 655533334456655555553
No 51
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=86.61 E-value=0.27 Score=44.50 Aligned_cols=23 Identities=35% Similarity=0.560 Sum_probs=12.9
Q ss_pred cchhhhhhhhhhhc-CCCCCCCCC
Q 043639 172 DLESVAVHEIGHLL-GLTHSPVKE 194 (261)
Q Consensus 172 ~l~~Va~HEiGHaL-GL~Hs~~~~ 194 (261)
....|++||+||++ ||+-.-..+
T Consensus 215 ~~~~v~vHE~GHsf~~LaDEY~~~ 238 (264)
T PF09471_consen 215 SFKQVVVHEFGHSFGGLADEYYYG 238 (264)
T ss_dssp THHHHHHHHHHHHTT------ECS
T ss_pred cccceeeeecccccccccccccCC
Confidence 45689999999965 566665443
No 52
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=85.79 E-value=0.31 Score=37.41 Aligned_cols=16 Identities=38% Similarity=0.771 Sum_probs=12.7
Q ss_pred chhhhhhhhhhhcCCC
Q 043639 173 LESVAVHEIGHLLGLT 188 (261)
Q Consensus 173 l~~Va~HEiGHaLGL~ 188 (261)
+..|++|||||.+|+.
T Consensus 73 I~~tlvhEiah~fG~~ 88 (97)
T PF06262_consen 73 IRDTLVHEIAHHFGIS 88 (97)
T ss_dssp HHHHHHHHHHHHTT--
T ss_pred HHHHHHHHHHHHcCCC
Confidence 4679999999999985
No 53
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=84.10 E-value=0.76 Score=46.34 Aligned_cols=48 Identities=23% Similarity=0.439 Sum_probs=33.5
Q ss_pred hhhhhhhhhhhcCCCCCCC----C------CCccccccCCCC--CCCCCCHHHHHHHHHH
Q 043639 174 ESVAVHEIGHLLGLTHSPV----K------EAVMYPTLKPRD--KKVDLALDDIQGVQAL 221 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~Hs~~----~------~svM~p~y~~~~--~~~~l~~dDI~~iq~L 221 (261)
.-|++|||||..|=.|-.+ | +=|||++-...+ .+-+++.--+..|...
T Consensus 393 ~lt~AHEiGHNfGSpHDp~~ECsPg~~~~GnyiM~a~AtSGd~~NN~kFSpCS~ksI~~V 452 (764)
T KOG3658|consen 393 DLTLAHEIGHNFGSPHDPDIECSPGESQGGNYIMYAYATSGDKPNNKKFSPCSLKSISKV 452 (764)
T ss_pred heeehhhhccccCCCCCCCCccCCCCCCCCcEEEEEecccCcCccCcccCcchHHHHHHH
Confidence 4599999999999999765 1 238998755432 3456777777766543
No 54
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=82.85 E-value=0.45 Score=46.16 Aligned_cols=20 Identities=40% Similarity=0.639 Sum_probs=15.9
Q ss_pred cccchhhhhhhhhhhcCCCCC
Q 043639 170 AVDLESVAVHEIGHLLGLTHS 190 (261)
Q Consensus 170 ~~~l~~Va~HEiGHaLGL~Hs 190 (261)
...+-+|++|||||..| .|-
T Consensus 127 esElagViAHEigHv~q-rH~ 146 (484)
T COG4783 127 ESELAGVIAHEIGHVAQ-RHL 146 (484)
T ss_pred HHHHHHHHHHHHHHHhh-hhH
Confidence 34678999999999977 554
No 55
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=78.73 E-value=7 Score=33.09 Aligned_cols=73 Identities=23% Similarity=0.370 Sum_probs=42.6
Q ss_pred HHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCCCCceeeecccccccccCCCCCC
Q 043639 88 KAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPESGKFHLDAAETWAVDFGSEKS 167 (261)
Q Consensus 88 ~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~ 167 (261)
++.+..-+..|++..++. ...|+|.-....-|.+ +..|.|.|+ |-+-.-+
T Consensus 111 ~~~l~~~~~~~~~~~~~~--------~~~i~ir~~ksrWGsc---------------~~~~~I~ln----~~L~~~P--- 160 (205)
T PF01863_consen 111 KEYLPERLKKYAKKLGLP--------PPKIKIRDMKSRWGSC---------------SSKGNITLN----WRLVMAP--- 160 (205)
T ss_pred HHHHHHHHHHHHHHcCCC--------cceEEEeehhhccccC---------------CCCCcEEee----cccccCC---
Confidence 344445555666655553 3456665444332221 335667774 3332221
Q ss_pred CccccchhhhhhhhhhhcCCCCCCC
Q 043639 168 PVAVDLESVAVHEIGHLLGLTHSPV 192 (261)
Q Consensus 168 ~~~~~l~~Va~HEiGHaLGL~Hs~~ 192 (261)
..-+..|++||+-|..-..||..
T Consensus 161 --~~~idYVvvHEL~Hl~~~nHs~~ 183 (205)
T PF01863_consen 161 --PEVIDYVVVHELCHLRHPNHSKR 183 (205)
T ss_pred --ccHHHHHHHHHHHHhccCCCCHH
Confidence 33457899999999999999865
No 56
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=78.11 E-value=0.81 Score=34.60 Aligned_cols=14 Identities=36% Similarity=0.629 Sum_probs=11.7
Q ss_pred cchhhhhhhhhhhc
Q 043639 172 DLESVAVHEIGHLL 185 (261)
Q Consensus 172 ~l~~Va~HEiGHaL 185 (261)
....+++|||||.+
T Consensus 41 ~~~f~laHELgH~~ 54 (122)
T PF06114_consen 41 RQRFTLAHELGHIL 54 (122)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHH
Confidence 34689999999985
No 57
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=77.63 E-value=0.83 Score=39.42 Aligned_cols=14 Identities=50% Similarity=0.705 Sum_probs=12.0
Q ss_pred chhhhhhhhhhhcC
Q 043639 173 LESVAVHEIGHLLG 186 (261)
Q Consensus 173 l~~Va~HEiGHaLG 186 (261)
+..|+.||+||++-
T Consensus 22 ~l~t~~HE~gHal~ 35 (200)
T PF13398_consen 22 LLVTFVHELGHALA 35 (200)
T ss_pred HHHHHHHHHHHHHH
Confidence 46799999999975
No 58
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=77.15 E-value=1.3 Score=40.75 Aligned_cols=40 Identities=20% Similarity=0.247 Sum_probs=21.3
Q ss_pred cCCCCceeeeccccccc-ccCCCCCCCccccchhhhhhhhhhh
Q 043639 143 FSPESGKFHLDAAETWA-VDFGSEKSPVAVDLESVAVHEIGHL 184 (261)
Q Consensus 143 ~~P~~G~vhfd~~e~wt-~~~~~~~~~~~~~l~~Va~HEiGHa 184 (261)
|-|.+..|.+|... |. +... ....+..-...|++||+||.
T Consensus 141 YCp~D~tIYlD~~f-~~~L~~~-~ga~G~~a~ayVlAHEyGHH 181 (292)
T PF04228_consen 141 YCPADQTIYLDLSF-FDELQQR-FGASGDFAQAYVLAHEYGHH 181 (292)
T ss_pred eCCCCCEEEechHH-HHHHHHH-hCCccHHHHHHHHHHHHHHH
Confidence 34667788888642 11 1100 00011122356999999996
No 59
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=77.09 E-value=0.9 Score=39.80 Aligned_cols=18 Identities=33% Similarity=0.547 Sum_probs=13.8
Q ss_pred chhhhhhhhhhhcCCCCCCC
Q 043639 173 LESVAVHEIGHLLGLTHSPV 192 (261)
Q Consensus 173 l~~Va~HEiGHaLGL~Hs~~ 192 (261)
-.-|++||+||+| -|...
T Consensus 72 ~rFtlAHELGH~l--lH~~~ 89 (213)
T COG2856 72 KRFTLAHELGHAL--LHTDL 89 (213)
T ss_pred HHHHHHHHHhHHH--hcccc
Confidence 3579999999995 56554
No 60
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=76.69 E-value=1.4 Score=40.92 Aligned_cols=23 Identities=26% Similarity=0.400 Sum_probs=19.6
Q ss_pred cccchhhhhhhhhhhcCCCCCCC
Q 043639 170 AVDLESVAVHEIGHLLGLTHSPV 192 (261)
Q Consensus 170 ~~~l~~Va~HEiGHaLGL~Hs~~ 192 (261)
...+..|++||++|-+|...+.+
T Consensus 193 ~~~~P~T~~HElAHq~G~a~E~E 215 (318)
T PF12725_consen 193 PYSLPFTICHELAHQLGFASEDE 215 (318)
T ss_pred cccccHHHHHHHHHHhCCCCHHH
Confidence 45567899999999999998865
No 61
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=75.03 E-value=1.6 Score=38.15 Aligned_cols=29 Identities=34% Similarity=0.561 Sum_probs=25.3
Q ss_pred hhhhhhhhhhhcCCCCCCCCCCccccccC
Q 043639 174 ESVAVHEIGHLLGLTHSPVKEAVMYPTLK 202 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~Hs~~~~svM~p~y~ 202 (261)
..+.-|++||++||+|.+.+..||+|.+.
T Consensus 154 al~~~efl~~~~~~~~lnhr~~i~~~pg~ 182 (236)
T COG5549 154 ALIAYEFLGHALGLGHLNHRGDIMYPPGE 182 (236)
T ss_pred HHHHHHHhhhccCcccccccccccCCccc
Confidence 46788999999999999999999986554
No 62
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=74.75 E-value=1 Score=39.65 Aligned_cols=14 Identities=36% Similarity=0.570 Sum_probs=11.9
Q ss_pred cchhhhhhhhhhhc
Q 043639 172 DLESVAVHEIGHLL 185 (261)
Q Consensus 172 ~l~~Va~HEiGHaL 185 (261)
.+..|+.||+||+|
T Consensus 91 ~~~~~l~HE~GHAl 104 (220)
T PF14247_consen 91 NVLFTLYHELGHAL 104 (220)
T ss_pred HHHHHHHHHHHHHH
Confidence 45789999999985
No 63
>KOG3538 consensus Disintegrin metalloproteinases with thrombospondin repeats [Posttranslational modification, protein turnover, chaperones]
Probab=74.22 E-value=2.6 Score=44.10 Aligned_cols=48 Identities=17% Similarity=0.196 Sum_probs=31.1
Q ss_pred ccchhhhhhhhhhhcCCCCCCCCC------CccccccCCCCCCCCCCHHHHHHH
Q 043639 171 VDLESVAVHEIGHLLGLTHSPVKE------AVMYPTLKPRDKKVDLALDDIQGV 218 (261)
Q Consensus 171 ~~l~~Va~HEiGHaLGL~Hs~~~~------svM~p~y~~~~~~~~l~~dDI~~i 218 (261)
...-.+++||+||.+|+.|-...+ .||.|...........+..+..-+
T Consensus 315 l~~aftiahE~GH~~gm~hd~~~~~C~~~~~im~~~~~~~~~~~~WS~CS~~~~ 368 (845)
T KOG3538|consen 315 LPSAFTIAHELGHNFGMEHDGRGNSCGPTFHVMSSDTCGGNHELTWSECSREYL 368 (845)
T ss_pred CccceeeeeccccccCcccCCcCCCCCCcccccCcccccCCCCccCCcchHHHH
Confidence 334579999999999999986543 588886654333333444433333
No 64
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=74.06 E-value=15 Score=35.29 Aligned_cols=24 Identities=21% Similarity=0.319 Sum_probs=16.9
Q ss_pred HHHHHHHHHHhHhcCCceEEEcCC
Q 043639 88 KAVFKRAFSKWASVIPVSFIETDD 111 (261)
Q Consensus 88 ~~~i~~Af~~Ws~vt~l~F~ev~~ 111 (261)
..+++.-|..-+.+=+++|+++..
T Consensus 107 ~~v~~gl~~~~~~lfgi~~~~~~~ 130 (422)
T cd06456 107 EKVLDGLFELAERLYGITFKERTD 130 (422)
T ss_pred HHHHHHHHHHHHHHcCeeEEECCC
Confidence 345556667777788899988764
No 65
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=71.49 E-value=18 Score=35.11 Aligned_cols=25 Identities=4% Similarity=0.109 Sum_probs=18.9
Q ss_pred HHHHHHHHHHHhHhcCCceEEEcCC
Q 043639 87 IKAVFKRAFSKWASVIPVSFIETDD 111 (261)
Q Consensus 87 ~~~~i~~Af~~Ws~vt~l~F~ev~~ 111 (261)
+..+++--|..-+.+=+|+|+++..
T Consensus 132 l~~vl~gl~~~~~~lfgi~~~~~~~ 156 (458)
T cd06457 132 LGTVMEGLSRLFSRLYGIRLVPVPL 156 (458)
T ss_pred HHHHHHHHHHHHHHHhCeEEEecCC
Confidence 3456666777788888999999863
No 66
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=70.98 E-value=1.6 Score=37.30 Aligned_cols=21 Identities=33% Similarity=0.520 Sum_probs=16.0
Q ss_pred cccchhhhhhhhhhhcCCCCCC
Q 043639 170 AVDLESVAVHEIGHLLGLTHSP 191 (261)
Q Consensus 170 ~~~l~~Va~HEiGHaLGL~Hs~ 191 (261)
...+..|++||+||.. .+|.-
T Consensus 86 ~~el~aVlaHElgH~~-~~h~~ 106 (226)
T PF01435_consen 86 EDELAAVLAHELGHIK-HRHIL 106 (226)
T ss_dssp HHHHHHHHHHHHHHHH-TTHCC
T ss_pred HHHHHHHHHHHHHHHH-cCCcc
Confidence 3457899999999997 35554
No 67
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=70.89 E-value=3.4 Score=30.32 Aligned_cols=30 Identities=13% Similarity=0.319 Sum_probs=18.9
Q ss_pred CceeeecccccccccCCCCCCCccccchhhhhhhhhhhc
Q 043639 147 SGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLL 185 (261)
Q Consensus 147 ~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaL 185 (261)
..+|+|...+. . .....-..+++||+.|++
T Consensus 44 G~~I~f~~g~~-~--------~~s~~~~~llaHEl~Hv~ 73 (79)
T PF13699_consen 44 GNDIYFAPGKY-N--------PDSPEGRALLAHELAHVV 73 (79)
T ss_pred CCEEEEcCCCc-C--------CCCCCcchhHhHHHHHHH
Confidence 46788865442 1 012223689999999985
No 68
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=70.77 E-value=1.5 Score=36.87 Aligned_cols=20 Identities=35% Similarity=0.439 Sum_probs=14.5
Q ss_pred hhhhhhhhhhhcCCCCCCCC
Q 043639 174 ESVAVHEIGHLLGLTHSPVK 193 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~Hs~~~ 193 (261)
..+++||+||++=-....++
T Consensus 8 i~i~~HE~gH~~~a~~~G~~ 27 (192)
T PF02163_consen 8 ISIVLHELGHALAARLYGDK 27 (192)
T ss_dssp HHHHHHHHHHHHHHHTTT--
T ss_pred cccccccccccccccccccc
Confidence 57899999999866655544
No 69
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=70.32 E-value=1.7 Score=37.07 Aligned_cols=12 Identities=50% Similarity=0.844 Sum_probs=10.4
Q ss_pred hhhhhhhhhhhc
Q 043639 174 ESVAVHEIGHLL 185 (261)
Q Consensus 174 ~~Va~HEiGHaL 185 (261)
..|++||+||.+
T Consensus 10 ~~v~iHElGH~~ 21 (182)
T cd06163 10 ILIFVHELGHFL 21 (182)
T ss_pred HHHHHHHHHHHH
Confidence 478999999985
No 70
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=69.91 E-value=1.7 Score=36.40 Aligned_cols=16 Identities=50% Similarity=0.638 Sum_probs=12.4
Q ss_pred hhhhhhhhhhhcCCCC
Q 043639 174 ESVAVHEIGHLLGLTH 189 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~H 189 (261)
..+.+||+||++=-..
T Consensus 9 i~i~iHE~gH~~~A~~ 24 (180)
T cd05709 9 ISVTVHELGHALVARR 24 (180)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 5789999999984433
No 71
>PRK02870 heat shock protein HtpX; Provisional
Probab=67.80 E-value=3 Score=39.01 Aligned_cols=15 Identities=33% Similarity=0.676 Sum_probs=12.8
Q ss_pred cccchhhhhhhhhhh
Q 043639 170 AVDLESVAVHEIGHL 184 (261)
Q Consensus 170 ~~~l~~Va~HEiGHa 184 (261)
...+..|++||+||.
T Consensus 170 ~dEL~aVlAHELgHi 184 (336)
T PRK02870 170 RDELQAVMAHELSHI 184 (336)
T ss_pred HHHHHHHHHHHHHHH
Confidence 345789999999998
No 72
>PRK03982 heat shock protein HtpX; Provisional
Probab=67.77 E-value=1.9 Score=39.18 Aligned_cols=16 Identities=38% Similarity=0.594 Sum_probs=13.1
Q ss_pred cccchhhhhhhhhhhc
Q 043639 170 AVDLESVAVHEIGHLL 185 (261)
Q Consensus 170 ~~~l~~Va~HEiGHaL 185 (261)
...+..|++||+||.-
T Consensus 122 ~~El~AVlAHElgHi~ 137 (288)
T PRK03982 122 EDELEGVIAHELTHIK 137 (288)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3457899999999974
No 73
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=66.95 E-value=2.2 Score=38.34 Aligned_cols=16 Identities=44% Similarity=0.754 Sum_probs=13.2
Q ss_pred cccchhhhhhhhhhhc
Q 043639 170 AVDLESVAVHEIGHLL 185 (261)
Q Consensus 170 ~~~l~~Va~HEiGHaL 185 (261)
...+..|++||+||..
T Consensus 154 ~dEl~aVlaHElgHi~ 169 (302)
T COG0501 154 DDELEAVLAHELGHIK 169 (302)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 3457899999999983
No 74
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=66.16 E-value=2.2 Score=36.94 Aligned_cols=16 Identities=44% Similarity=0.594 Sum_probs=12.4
Q ss_pred chhhhhhhhhhhcCCC
Q 043639 173 LESVAVHEIGHLLGLT 188 (261)
Q Consensus 173 l~~Va~HEiGHaLGL~ 188 (261)
+..|+.||+||++=-.
T Consensus 38 ~~~v~iHElgH~~~A~ 53 (208)
T cd06161 38 FLSVLLHELGHALVAR 53 (208)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4589999999986433
No 75
>PRK03001 M48 family peptidase; Provisional
Probab=65.84 E-value=2.2 Score=38.67 Aligned_cols=16 Identities=25% Similarity=0.536 Sum_probs=13.1
Q ss_pred cccchhhhhhhhhhhc
Q 043639 170 AVDLESVAVHEIGHLL 185 (261)
Q Consensus 170 ~~~l~~Va~HEiGHaL 185 (261)
...+..|++||+||.-
T Consensus 121 ~~El~aVlAHElgHi~ 136 (283)
T PRK03001 121 EREIRGVMAHELAHVK 136 (283)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 3457899999999973
No 76
>PRK01345 heat shock protein HtpX; Provisional
Probab=64.90 E-value=2.4 Score=39.33 Aligned_cols=15 Identities=27% Similarity=0.543 Sum_probs=13.0
Q ss_pred ccchhhhhhhhhhhc
Q 043639 171 VDLESVAVHEIGHLL 185 (261)
Q Consensus 171 ~~l~~Va~HEiGHaL 185 (261)
..+..|++|||||.-
T Consensus 122 dEL~aVlAHElgHi~ 136 (317)
T PRK01345 122 EEVAGVMAHELAHVK 136 (317)
T ss_pred HHHHHHHHHHHHHHH
Confidence 457899999999985
No 77
>PRK03072 heat shock protein HtpX; Provisional
Probab=64.88 E-value=2.4 Score=38.71 Aligned_cols=16 Identities=31% Similarity=0.522 Sum_probs=13.2
Q ss_pred cccchhhhhhhhhhhc
Q 043639 170 AVDLESVAVHEIGHLL 185 (261)
Q Consensus 170 ~~~l~~Va~HEiGHaL 185 (261)
...+..|++||+||.-
T Consensus 124 ~~El~aVlAHElgHi~ 139 (288)
T PRK03072 124 ERELRGVLGHELSHVY 139 (288)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 3457899999999974
No 78
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=63.79 E-value=2.6 Score=39.82 Aligned_cols=19 Identities=16% Similarity=0.141 Sum_probs=13.3
Q ss_pred cCCHHHHHHHHHHHHHHhH
Q 043639 81 YLHITDIKAVFKRAFSKWA 99 (261)
Q Consensus 81 ~l~~~~~~~~i~~Af~~Ws 99 (261)
..+.++..+.+.++++.|.
T Consensus 153 ~~~~~~~~~~v~~~~~~l~ 171 (427)
T cd06459 153 KYTYEEAKELVLEALSPLG 171 (427)
T ss_pred cCcHHHHHHHHHHHHHccC
Confidence 3455677788888887763
No 79
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=62.64 E-value=2.8 Score=38.98 Aligned_cols=14 Identities=36% Similarity=0.515 Sum_probs=11.7
Q ss_pred cchhhhhhhhhhhc
Q 043639 172 DLESVAVHEIGHLL 185 (261)
Q Consensus 172 ~l~~Va~HEiGHaL 185 (261)
+-..++.||+||++
T Consensus 153 ~~v~tl~HE~GHa~ 166 (365)
T cd06258 153 DDINTLFHEFGHAV 166 (365)
T ss_pred HHHHHHHHHHhHHH
Confidence 34579999999998
No 80
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=62.22 E-value=2.9 Score=40.24 Aligned_cols=25 Identities=20% Similarity=0.344 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHhHhcCCceEEEcCC
Q 043639 87 IKAVFKRAFSKWASVIPVSFIETDD 111 (261)
Q Consensus 87 ~~~~i~~Af~~Ws~vt~l~F~ev~~ 111 (261)
+.++++.-+...+.+=+|+|+++..
T Consensus 136 ~~~~~~g~~~l~~~lfgi~~~~~~~ 160 (458)
T PF01432_consen 136 LEEVLEGLFELAERLFGIRFEEVPD 160 (458)
T ss_dssp HHHHHHHHHHHHHHHHTEEEEECTC
T ss_pred HHHHHHHHHHHHHHHhcEEEEeccc
Confidence 4456666677888888899998865
No 81
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=61.56 E-value=3 Score=36.72 Aligned_cols=14 Identities=50% Similarity=0.681 Sum_probs=11.5
Q ss_pred hhhhhhhhhhhcCC
Q 043639 174 ESVAVHEIGHLLGL 187 (261)
Q Consensus 174 ~~Va~HEiGHaLGL 187 (261)
..|++||+||++=-
T Consensus 54 ~~v~iHElgH~~~A 67 (227)
T cd06164 54 ASVLLHELGHSLVA 67 (227)
T ss_pred HHHHHHHHHHHHHH
Confidence 57999999998643
No 82
>PRK04897 heat shock protein HtpX; Provisional
Probab=61.46 E-value=3 Score=38.23 Aligned_cols=16 Identities=44% Similarity=0.586 Sum_probs=13.2
Q ss_pred cccchhhhhhhhhhhc
Q 043639 170 AVDLESVAVHEIGHLL 185 (261)
Q Consensus 170 ~~~l~~Va~HEiGHaL 185 (261)
...+..|++||+||.-
T Consensus 134 ~~El~aVlAHElgHi~ 149 (298)
T PRK04897 134 REELEGVIGHEISHIR 149 (298)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 3457899999999974
No 83
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=60.43 E-value=4.9 Score=36.40 Aligned_cols=22 Identities=27% Similarity=0.471 Sum_probs=17.0
Q ss_pred cccchhhhhhhhhhhcCCCCCC
Q 043639 170 AVDLESVAVHEIGHLLGLTHSP 191 (261)
Q Consensus 170 ~~~l~~Va~HEiGHaLGL~Hs~ 191 (261)
..+.+.++.||+||.-|=.+--
T Consensus 200 a~~yyaTl~HElghwtgh~~rl 221 (316)
T COG4227 200 AINYYATLLHELGHWTGHEARL 221 (316)
T ss_pred hHhHHHHHHHHhccccCchhhh
Confidence 4567899999999998755443
No 84
>PRK05457 heat shock protein HtpX; Provisional
Probab=60.39 E-value=3.2 Score=37.83 Aligned_cols=15 Identities=40% Similarity=0.685 Sum_probs=12.7
Q ss_pred cccchhhhhhhhhhh
Q 043639 170 AVDLESVAVHEIGHL 184 (261)
Q Consensus 170 ~~~l~~Va~HEiGHa 184 (261)
...+..|++||+||.
T Consensus 131 ~~El~aVlAHElgHi 145 (284)
T PRK05457 131 RDEVEAVLAHEISHI 145 (284)
T ss_pred HHHHHHHHHHHHHHH
Confidence 346789999999996
No 85
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=60.07 E-value=3.3 Score=41.07 Aligned_cols=13 Identities=38% Similarity=0.468 Sum_probs=11.3
Q ss_pred chhhhhhhhhhhc
Q 043639 173 LESVAVHEIGHLL 185 (261)
Q Consensus 173 l~~Va~HEiGHaL 185 (261)
=..|++||+||++
T Consensus 337 dv~TL~HElGHa~ 349 (549)
T TIGR02289 337 DIDVLTHEAGHAF 349 (549)
T ss_pred HHHHHHHHhhHHH
Confidence 3579999999997
No 86
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=59.95 E-value=3.4 Score=40.25 Aligned_cols=12 Identities=33% Similarity=0.664 Sum_probs=10.7
Q ss_pred hhhhhhhhhhhc
Q 043639 174 ESVAVHEIGHLL 185 (261)
Q Consensus 174 ~~Va~HEiGHaL 185 (261)
..|+.||+||+|
T Consensus 264 V~TLfHEfGHal 275 (472)
T cd06455 264 VETFFHEFGHVI 275 (472)
T ss_pred HHHHHHHHHHHH
Confidence 568999999998
No 87
>PRK02391 heat shock protein HtpX; Provisional
Probab=59.50 E-value=3.4 Score=37.92 Aligned_cols=15 Identities=40% Similarity=0.705 Sum_probs=12.6
Q ss_pred cccchhhhhhhhhhh
Q 043639 170 AVDLESVAVHEIGHL 184 (261)
Q Consensus 170 ~~~l~~Va~HEiGHa 184 (261)
...+..|++||+||.
T Consensus 130 ~~El~aVlaHElgHi 144 (296)
T PRK02391 130 PDELEAVLAHELSHV 144 (296)
T ss_pred HHHHHHHHHHHHHHH
Confidence 345789999999995
No 88
>PRK01265 heat shock protein HtpX; Provisional
Probab=59.07 E-value=3.5 Score=38.43 Aligned_cols=15 Identities=47% Similarity=0.795 Sum_probs=12.6
Q ss_pred cccchhhhhhhhhhh
Q 043639 170 AVDLESVAVHEIGHL 184 (261)
Q Consensus 170 ~~~l~~Va~HEiGHa 184 (261)
...+..|++||+||.
T Consensus 137 ~~El~aVlAHElgHi 151 (324)
T PRK01265 137 RDEIKAVAGHELGHL 151 (324)
T ss_pred HHHHHHHHHHHHHHH
Confidence 345789999999995
No 89
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=58.69 E-value=3.6 Score=37.49 Aligned_cols=13 Identities=46% Similarity=0.572 Sum_probs=11.1
Q ss_pred hhhhhhhhhhhcC
Q 043639 174 ESVAVHEIGHLLG 186 (261)
Q Consensus 174 ~~Va~HEiGHaLG 186 (261)
..|++||+||++=
T Consensus 136 isvvvHElgHal~ 148 (277)
T cd06162 136 ISGVVHEMGHGVA 148 (277)
T ss_pred HHHHHHHHHHHHH
Confidence 5799999999963
No 90
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=58.41 E-value=3.9 Score=36.98 Aligned_cols=17 Identities=29% Similarity=0.296 Sum_probs=12.8
Q ss_pred hhhhhhhhhhhcCCCCC
Q 043639 174 ESVAVHEIGHLLGLTHS 190 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~Hs 190 (261)
..+++||+||++=-.+.
T Consensus 119 isv~iHElgHa~~Ar~~ 135 (263)
T cd06159 119 VGVVVHELSHGILARVE 135 (263)
T ss_pred HHHHHHHHHHHHHHHHc
Confidence 57999999999644433
No 91
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=58.09 E-value=3.7 Score=40.97 Aligned_cols=13 Identities=31% Similarity=0.457 Sum_probs=11.3
Q ss_pred chhhhhhhhhhhc
Q 043639 173 LESVAVHEIGHLL 185 (261)
Q Consensus 173 l~~Va~HEiGHaL 185 (261)
-..|++||+||++
T Consensus 378 dv~TLaHElGHa~ 390 (591)
T TIGR00181 378 SVFTLAHELGHSM 390 (591)
T ss_pred hHHHHHHHhhhHH
Confidence 3689999999996
No 92
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=58.05 E-value=3.8 Score=40.99 Aligned_cols=12 Identities=42% Similarity=0.639 Sum_probs=10.9
Q ss_pred hhhhhhhhhhhc
Q 043639 174 ESVAVHEIGHLL 185 (261)
Q Consensus 174 ~~Va~HEiGHaL 185 (261)
..|++||+||++
T Consensus 376 v~TL~HE~GHa~ 387 (587)
T TIGR02290 376 VSTLAHELGHAY 387 (587)
T ss_pred HHHHHHHhhHHH
Confidence 579999999998
No 93
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=57.34 E-value=4 Score=30.96 Aligned_cols=14 Identities=36% Similarity=0.482 Sum_probs=12.0
Q ss_pred cchhhhhhhhhhhc
Q 043639 172 DLESVAVHEIGHLL 185 (261)
Q Consensus 172 ~l~~Va~HEiGHaL 185 (261)
.+..|++||+.|.+
T Consensus 24 ~~~~~l~HE~~H~~ 37 (128)
T PF13485_consen 24 WLDRVLAHELAHQW 37 (128)
T ss_pred HHHHHHHHHHHHHH
Confidence 35699999999996
No 94
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=56.82 E-value=4.2 Score=33.60 Aligned_cols=12 Identities=33% Similarity=0.445 Sum_probs=10.2
Q ss_pred hhhhhhhhhhhc
Q 043639 174 ESVAVHEIGHLL 185 (261)
Q Consensus 174 ~~Va~HEiGHaL 185 (261)
.-|++|||+|.+
T Consensus 136 lDVvaHEltHGV 147 (150)
T PF01447_consen 136 LDVVAHELTHGV 147 (150)
T ss_dssp HHHHHHHHHHHH
T ss_pred cceeeecccccc
Confidence 469999999974
No 95
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=53.73 E-value=4.9 Score=34.25 Aligned_cols=20 Identities=35% Similarity=0.614 Sum_probs=15.0
Q ss_pred hhhhhhhhhhhcCCCCCCCC
Q 043639 174 ESVAVHEIGHLLGLTHSPVK 193 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~Hs~~~ 193 (261)
..+.+||+||++--..-.++
T Consensus 42 ~~l~iHElgH~~~A~~~G~~ 61 (183)
T cd06160 42 AILGIHEMGHYLAARRHGVK 61 (183)
T ss_pred HHHHHHHHHHHHHHHHCCCC
Confidence 46889999999876554444
No 96
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=53.54 E-value=5 Score=38.33 Aligned_cols=17 Identities=41% Similarity=0.655 Sum_probs=12.8
Q ss_pred hhhhhhhhhhhcCCCCC
Q 043639 174 ESVAVHEIGHLLGLTHS 190 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~Hs 190 (261)
..+.+||+||||--.-.
T Consensus 132 i~~vvHElGHalAA~se 148 (484)
T KOG2921|consen 132 ITVVVHELGHALAAASE 148 (484)
T ss_pred HHHHHHHhhHHHHHHhc
Confidence 56889999999854433
No 97
>PF08434 CLCA_N: Calcium-activated chloride channel; InterPro: IPR013642 The CLCA family of calcium-activated chloride channels has been identified in many epithelial and endothelial cell types as well as in smooth muscle cells [] and has four or five putative transmembrane regions. Additionally to their role as chloride channels some CLCA proteins function as adhesion molecules and may also have roles as tumour suppressors []. The domain described here is found at the N terminus of CLCAs.
Probab=52.05 E-value=12 Score=33.94 Aligned_cols=15 Identities=47% Similarity=0.607 Sum_probs=11.4
Q ss_pred hhhhhhhhhhh-cCCC
Q 043639 174 ESVAVHEIGHL-LGLT 188 (261)
Q Consensus 174 ~~Va~HEiGHa-LGL~ 188 (261)
-.|++||.+|. =|+-
T Consensus 150 grv~VhEWAhlRWGVF 165 (262)
T PF08434_consen 150 GRVFVHEWAHLRWGVF 165 (262)
T ss_pred ceeeeehhhhhccccc
Confidence 47999999996 3553
No 98
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=47.93 E-value=6.8 Score=39.50 Aligned_cols=12 Identities=33% Similarity=0.526 Sum_probs=10.7
Q ss_pred hhhhhhhhhhhc
Q 043639 174 ESVAVHEIGHLL 185 (261)
Q Consensus 174 ~~Va~HEiGHaL 185 (261)
..|++||+||++
T Consensus 381 V~TLaHElGHs~ 392 (598)
T COG1164 381 VFTLAHELGHSV 392 (598)
T ss_pred HHHHHHHccHHH
Confidence 589999999994
No 99
>PF03272 Enhancin: Viral enhancin protein; InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=47.92 E-value=27 Score=36.44 Aligned_cols=49 Identities=14% Similarity=0.281 Sum_probs=31.5
Q ss_pred CcccCCCCeeEEEEEecCCcC----Cc-----------CCHHHHHHHHHHHHHHhHhcCCceEEEcC
Q 043639 59 PRWTRHVPMTLSYAFSKENMI----NY-----------LHITDIKAVFKRAFSKWASVIPVSFIETD 110 (261)
Q Consensus 59 ~kW~~~~~~~LTY~~~~~~~~----~~-----------l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~ 110 (261)
..|.+. +-.|.+...... |. .+..+..+--.+.|+...+.+.|.|....
T Consensus 128 ~~w~~~---~~~fa~ve~~~i~lLVP~~Dk~~l~~~~~~~l~~L~~~Y~~i~~~Yd~l~Gl~~~~~~ 191 (775)
T PF03272_consen 128 SEWDDS---DSSFALVELDYIQLLVPPADKPNLNNKDFKSLDELIDFYNDIFKFYDDLTGLSDDPSD 191 (775)
T ss_pred hhhhhc---CCceEEEECCEEEEEeCcchHHHHhhhcccCHHHHHHHHHHHHHHHHhhhCCCCCCCc
Confidence 478877 788887754321 11 12233445557888999999999987543
No 100
>PLN02791 Nudix hydrolase homolog
Probab=47.55 E-value=7.6 Score=40.32 Aligned_cols=20 Identities=20% Similarity=0.262 Sum_probs=16.4
Q ss_pred chhhhhhhhhhhcCCCCCCC
Q 043639 173 LESVAVHEIGHLLGLTHSPV 192 (261)
Q Consensus 173 l~~Va~HEiGHaLGL~Hs~~ 192 (261)
...|++||+||.+|=.+.-.
T Consensus 585 ~~~v~lHElgHGsG~~~~~~ 604 (770)
T PLN02791 585 FTHTICHECCHGIGPHTITL 604 (770)
T ss_pred HHHHHHHHhhccccccceec
Confidence 36899999999999877543
No 101
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=47.35 E-value=7.2 Score=33.09 Aligned_cols=12 Identities=25% Similarity=0.548 Sum_probs=10.8
Q ss_pred hhhhhhhhhhhc
Q 043639 174 ESVAVHEIGHLL 185 (261)
Q Consensus 174 ~~Va~HEiGHaL 185 (261)
.++.+||+||++
T Consensus 10 ~~i~~HE~aHa~ 21 (181)
T cd06158 10 LAITLHEFAHAY 21 (181)
T ss_pred HHHHHHHHHHHH
Confidence 578999999997
No 102
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=46.91 E-value=7.4 Score=37.30 Aligned_cols=12 Identities=42% Similarity=0.598 Sum_probs=10.1
Q ss_pred hhhhhhhhhhhc
Q 043639 174 ESVAVHEIGHLL 185 (261)
Q Consensus 174 ~~Va~HEiGHaL 185 (261)
..+++||+||+|
T Consensus 160 l~t~iHE~GHal 171 (396)
T cd06460 160 LFSTIHETGHAL 171 (396)
T ss_pred HHHHHHHhhHHH
Confidence 458999999985
No 103
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=46.68 E-value=7.5 Score=37.35 Aligned_cols=12 Identities=50% Similarity=0.866 Sum_probs=10.1
Q ss_pred hhhhhhhhhhhc
Q 043639 174 ESVAVHEIGHLL 185 (261)
Q Consensus 174 ~~Va~HEiGHaL 185 (261)
..|++||+||.|
T Consensus 15 ~~v~~HE~gH~~ 26 (420)
T TIGR00054 15 VLIFVHELGHFL 26 (420)
T ss_pred HHHHHHhHHHHH
Confidence 469999999964
No 104
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=45.65 E-value=7.9 Score=37.45 Aligned_cols=11 Identities=45% Similarity=0.857 Sum_probs=9.2
Q ss_pred hhhhhhhhhhh
Q 043639 174 ESVAVHEIGHL 184 (261)
Q Consensus 174 ~~Va~HEiGHa 184 (261)
.-|++||+||-
T Consensus 16 ~li~vHElGHf 26 (449)
T PRK10779 16 VLITVHEFGHF 26 (449)
T ss_pred HHHHHHHHHHH
Confidence 35889999995
No 105
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=44.25 E-value=68 Score=32.91 Aligned_cols=33 Identities=21% Similarity=0.358 Sum_probs=23.1
Q ss_pred HHHHHHHHHHhHhcCCceEEEcCCC--CCcceEEE
Q 043639 88 KAVFKRAFSKWASVIPVSFIETDDY--DFADIRIG 120 (261)
Q Consensus 88 ~~~i~~Af~~Ws~vt~l~F~ev~~~--~~aDI~I~ 120 (261)
..+|.-.|..-+..=+|+|+|..+. -..|+++.
T Consensus 364 ~~Vl~GlF~~~~rLfGI~~~e~~~~~vwHpDVr~~ 398 (683)
T COG0339 364 NKVLEGLFEVAKRLFGITFVERKDIPVWHPDVRVF 398 (683)
T ss_pred hHHHHHHHHHHHHHcCeEEEECCCCCccCCCceEE
Confidence 3566677788888889999997431 35677654
No 106
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=43.14 E-value=9.1 Score=36.93 Aligned_cols=15 Identities=40% Similarity=0.674 Sum_probs=12.8
Q ss_pred cccchhhhhhhhhhh
Q 043639 170 AVDLESVAVHEIGHL 184 (261)
Q Consensus 170 ~~~l~~Va~HEiGHa 184 (261)
...+..|+.||+||.
T Consensus 277 ~eel~AVl~HELGHW 291 (428)
T KOG2719|consen 277 NEELVAVLAHELGHW 291 (428)
T ss_pred cHHHHHHHHHHhhHH
Confidence 446789999999997
No 107
>PF01434 Peptidase_M41: Peptidase family M41 This is family M41 in the peptidase classification. ; InterPro: IPR000642 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The peptidase M41 family belong to a larger family of zinc metalloproteases. This family includes the cell division protein FtsH, and the yeast mitochondrial respiratory chain complexes assembly protein, which is a putative ATP-dependent protease required for assembly of the mitochondrial respiratory chain and ATPase complexes. FtsH is an integral membrane protein, which seems to act as an ATP-dependent zinc metallopeptidase that binds one zinc ion.; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0006508 proteolysis; PDB: 4EIW_C 2DHR_E 1IY1_A 1IY2_A 1IY0_A 1IXZ_A 2CE7_F 2CEA_F 3KDS_E 2QZ4_A ....
Probab=43.12 E-value=8.4 Score=33.41 Aligned_cols=16 Identities=31% Similarity=0.528 Sum_probs=13.1
Q ss_pred cccchhhhhhhhhhhc
Q 043639 170 AVDLESVAVHEIGHLL 185 (261)
Q Consensus 170 ~~~l~~Va~HEiGHaL 185 (261)
......+|+||-||||
T Consensus 25 ~~~~~~~A~HEAGhAv 40 (213)
T PF01434_consen 25 EEEKRRIAYHEAGHAV 40 (213)
T ss_dssp HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3456789999999986
No 108
>PRK10911 oligopeptidase A; Provisional
Probab=41.81 E-value=9.7 Score=39.00 Aligned_cols=22 Identities=9% Similarity=0.087 Sum_probs=14.9
Q ss_pred HHHHHHHHHhHhcCCceEEEcC
Q 043639 89 AVFKRAFSKWASVIPVSFIETD 110 (261)
Q Consensus 89 ~~i~~Af~~Ws~vt~l~F~ev~ 110 (261)
.+++.-|..-+..=+++|.|+.
T Consensus 361 ~v~~gl~~~~~~Lfgi~~~e~~ 382 (680)
T PRK10911 361 KAVNGLFEVVKRIYGITAKERK 382 (680)
T ss_pred HHHHHHHHHHHHHcCeeEEecC
Confidence 4445556666677788998864
No 109
>KOG2089 consensus Metalloendopeptidase family - saccharolysin & thimet oligopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=39.78 E-value=79 Score=32.37 Aligned_cols=32 Identities=25% Similarity=0.386 Sum_probs=25.4
Q ss_pred HHHHHHHHHhHhcCCceEEEcCCC--CCcceEEE
Q 043639 89 AVFKRAFSKWASVIPVSFIETDDY--DFADIRIG 120 (261)
Q Consensus 89 ~~i~~Af~~Ws~vt~l~F~ev~~~--~~aDI~I~ 120 (261)
.++..-|...+..-+|+|.|+++. -.+|+++.
T Consensus 392 ~v~~Gl~~i~q~LFglkf~e~~da~vWh~dVr~y 425 (718)
T KOG2089|consen 392 VVLSGLFGIYQTLFGLKFEEATDAEVWHADVRVY 425 (718)
T ss_pred HHHHHHHHHHHHHhCceeeecCCchhcccceeEE
Confidence 466777888899999999999873 36788854
No 110
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=37.45 E-value=20 Score=35.91 Aligned_cols=12 Identities=50% Similarity=0.650 Sum_probs=10.4
Q ss_pred hhhhhhhhhhhc
Q 043639 174 ESVAVHEIGHLL 185 (261)
Q Consensus 174 ~~Va~HEiGHaL 185 (261)
-.|+.|||||+.
T Consensus 488 GaVIgHEI~HgF 499 (654)
T COG3590 488 GAVIGHEIGHGF 499 (654)
T ss_pred cceehhhhcccc
Confidence 489999999974
No 111
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=36.18 E-value=15 Score=25.38 Aligned_cols=20 Identities=35% Similarity=0.602 Sum_probs=16.8
Q ss_pred chhhhhhhhhhhcCCCCCCC
Q 043639 173 LESVAVHEIGHLLGLTHSPV 192 (261)
Q Consensus 173 l~~Va~HEiGHaLGL~Hs~~ 192 (261)
...-++||+.+.+||.|.+.
T Consensus 29 ~eR~~vH~lA~~~gL~s~S~ 48 (60)
T cd02641 29 HDRLLVHELAEELGLRHEST 48 (60)
T ss_pred HHHHHHHHHHHHcCCceEee
Confidence 35789999999999998653
No 112
>PF03571 Peptidase_M49: Peptidase family M49; InterPro: IPR005317 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M49 (dipeptidyl-peptidase III family, clan M-). The predicted active site residues occur in the motif HEXXXH which is unlike that in any other family. The dipeptidyl peptidase III aminopeptidases cleave dipeptides from the N-terminal of peptides consisting of four or more amino acids and have a broad specificity.; GO: 0008239 dipeptidyl-peptidase activity, 0006508 proteolysis, 0005737 cytoplasm; PDB: 3T6B_A 3FVY_A 3T6J_A 3CSK_A.
Probab=35.82 E-value=14 Score=36.94 Aligned_cols=14 Identities=43% Similarity=0.643 Sum_probs=10.7
Q ss_pred hhhhhhhh-hhhcCC
Q 043639 174 ESVAVHEI-GHLLGL 187 (261)
Q Consensus 174 ~~Va~HEi-GHaLGL 187 (261)
..|++||+ ||..|=
T Consensus 294 ~~v~lHEllGHGsGk 308 (549)
T PF03571_consen 294 VQVGLHELLGHGSGK 308 (549)
T ss_dssp HHHHHHHHCCCCS--
T ss_pred hhhhHHhhccCcCcc
Confidence 46999998 999984
No 113
>PF06167 Peptidase_M90: Glucose-regulated metallo-peptidase M90; InterPro: IPR010384 MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=34.59 E-value=17 Score=32.66 Aligned_cols=20 Identities=30% Similarity=0.476 Sum_probs=14.2
Q ss_pred cccchhhhhhhhhhhcCCCC
Q 043639 170 AVDLESVAVHEIGHLLGLTH 189 (261)
Q Consensus 170 ~~~l~~Va~HEiGHaLGL~H 189 (261)
..|-.+|++||+.|.|=+..
T Consensus 151 ~~dg~NVviHEfAH~LD~~~ 170 (253)
T PF06167_consen 151 PNDGHNVVIHEFAHKLDMED 170 (253)
T ss_dssp SSSS--HHHHHHHHHHHCTT
T ss_pred CCCCcchHHHHHHHHHHhhc
Confidence 44567999999999996653
No 114
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=31.39 E-value=18 Score=33.96 Aligned_cols=20 Identities=35% Similarity=0.489 Sum_probs=15.2
Q ss_pred cchhhhhhhhhhhcCCCCCCC
Q 043639 172 DLESVAVHEIGHLLGLTHSPV 192 (261)
Q Consensus 172 ~l~~Va~HEiGHaLGL~Hs~~ 192 (261)
.+..|+.||++|++ +.|+.+
T Consensus 274 glAtvLgHE~aHaV-arH~AE 293 (424)
T KOG2661|consen 274 GLATVLGHEIAHAV-ARHAAE 293 (424)
T ss_pred HHHHHHHHHHHHHH-HHHHHH
Confidence 35789999999996 466543
No 115
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=30.75 E-value=19 Score=36.98 Aligned_cols=34 Identities=21% Similarity=0.358 Sum_probs=22.0
Q ss_pred HHHHH-HHHHHHhHhcCCceEEEcCCC--CCcceEEE
Q 043639 87 IKAVF-KRAFSKWASVIPVSFIETDDY--DFADIRIG 120 (261)
Q Consensus 87 ~~~~i-~~Af~~Ws~vt~l~F~ev~~~--~~aDI~I~ 120 (261)
+..++ +-.|..-+..=+|+|+|+.+. -..|++..
T Consensus 362 l~~Vl~~Glf~l~~~LfGi~f~~~~~~~vWh~dV~~~ 398 (681)
T PRK10280 362 LNTVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRVW 398 (681)
T ss_pred HHHHHHHhHHHHHHHHcCeEEEECCCCCCCCCCeeEE
Confidence 33555 356777788889999997531 34566543
No 116
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=29.14 E-value=22 Score=32.49 Aligned_cols=41 Identities=22% Similarity=0.250 Sum_probs=21.9
Q ss_pred cCCCCceeeecccccccccCCCCCCCcc-ccchhhhhhhhhhhc
Q 043639 143 FSPESGKFHLDAAETWAVDFGSEKSPVA-VDLESVAVHEIGHLL 185 (261)
Q Consensus 143 ~~P~~G~vhfd~~e~wt~~~~~~~~~~~-~~l~~Va~HEiGHaL 185 (261)
|-|.+.++.||....=.+... ....+ .-...|++||+||.+
T Consensus 138 YCP~D~kvYlDlsFf~~m~~~--fga~GdfAqaYViAHEVGHHV 179 (295)
T COG2321 138 YCPADTKVYLDLSFFDEMKTK--FGASGDFAQAYVIAHEVGHHV 179 (295)
T ss_pred ecCCCceEEEehhHHHHHHHH--hcCCccHHHHHHHHhhhhHHH
Confidence 457777888876321111100 01112 223579999999963
No 117
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=28.59 E-value=26 Score=24.45 Aligned_cols=21 Identities=33% Similarity=0.390 Sum_probs=17.9
Q ss_pred hhhhhhhhhhhcCCCCCCCCC
Q 043639 174 ESVAVHEIGHLLGLTHSPVKE 194 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~Hs~~~~ 194 (261)
..-++|.+-|-|||.|.++.+
T Consensus 30 eRriih~la~~lGL~~~s~G~ 50 (60)
T cd02639 30 ERRIVHLLASRLGLNHVSDGT 50 (60)
T ss_pred HHHHHHHHHHHcCCceEEeCC
Confidence 568999999999999987654
No 118
>PHA02150 hypothetical protein
Probab=27.75 E-value=34 Score=24.35 Aligned_cols=33 Identities=27% Similarity=0.516 Sum_probs=21.3
Q ss_pred EEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcC
Q 043639 70 SYAFSKENMINYLHITDIKAVFKRAFSKWASVI 102 (261)
Q Consensus 70 TY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt 102 (261)
.|+|.++..-...+.+.....+..||+.|.++-
T Consensus 2 ~yrfvseeqfnaw~~qnya~efa~~f~~wd~if 34 (77)
T PHA02150 2 LYRFVSEEQFNAWSSQNYAKEFAKHFKKWDDVF 34 (77)
T ss_pred ceeeccHHHhhHHHhhhHHHHHHHHhhhHhHhh
Confidence 477765432222334566778899999999873
No 119
>PF14891 Peptidase_M91: Effector protein
Probab=25.73 E-value=29 Score=29.02 Aligned_cols=15 Identities=27% Similarity=0.151 Sum_probs=12.2
Q ss_pred chhhhhhhhhhhcCC
Q 043639 173 LESVAVHEIGHLLGL 187 (261)
Q Consensus 173 l~~Va~HEiGHaLGL 187 (261)
-+.++.||+.||.--
T Consensus 103 p~v~L~HEL~HA~~~ 117 (174)
T PF14891_consen 103 PFVVLYHELIHAYDY 117 (174)
T ss_pred HHHHHHHHHHHHHHH
Confidence 368999999999644
No 120
>PF02102 Peptidase_M35: Deuterolysin metalloprotease (M35) family; InterPro: IPR001384 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M35 (deuterolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Deuterolysin is a microbial zinc-containing metalloprotease that shows some similarity to thermolysin []. The protein is expressed with a possible 19-residue signal sequence, a 155-residue propeptide, and an active peptide of 177 residues []. The latter contains an HEXXH motif towards the C terminus, but the other zinc ligands are as yet undetermined [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 1EB6_A.
Probab=25.63 E-value=30 Score=32.72 Aligned_cols=50 Identities=22% Similarity=0.161 Sum_probs=23.6
Q ss_pred CCceeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCC
Q 043639 136 LGVLAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGL 187 (261)
Q Consensus 136 gg~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL 187 (261)
.|++|++. |..+.|.+=....+.+.... ....+.|.-.+++||+-|+.+.
T Consensus 262 ~~vlAYT~-p~~~~I~~Cp~ff~~lp~~~-~~C~~qDqatt~LHE~TH~~~V 311 (359)
T PF02102_consen 262 SGVLAYTL-PSQNQIVNCPIFFSDLPALS-NRCHAQDQATTTLHEMTHAPAV 311 (359)
T ss_dssp TT--EEEE-GGGTEEEE-HHHHHH--SS---STT---HHHHHHHHHHT-TTT
T ss_pred CCeEEEEE-cCCCeEEECchhhccCCCcc-ccccCCCccchhhhhhhccccc
Confidence 47899886 44455543221111122111 1344567789999999999886
No 121
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=25.19 E-value=30 Score=30.29 Aligned_cols=12 Identities=33% Similarity=0.457 Sum_probs=10.3
Q ss_pred hhhhhhhhhhhc
Q 043639 174 ESVAVHEIGHLL 185 (261)
Q Consensus 174 ~~Va~HEiGHaL 185 (261)
-+|++||++||-
T Consensus 94 gsiLAHE~mHa~ 105 (212)
T PF12315_consen 94 GSILAHELMHAW 105 (212)
T ss_pred hhHHHHHHHHHH
Confidence 579999999983
No 122
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=24.41 E-value=17 Score=30.23 Aligned_cols=39 Identities=21% Similarity=0.207 Sum_probs=24.4
Q ss_pred ceeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhc
Q 043639 138 VLAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLL 185 (261)
Q Consensus 138 ~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaL 185 (261)
+.+-||++.+ +|-||....- . .+...+..|+.||+-|.+
T Consensus 35 taG~ayL~~~-~I~lNP~ll~--e------n~~~f~~~vV~HELaHl~ 73 (156)
T COG3091 35 TAGGAYLLKS-EIRLNPKLLE--E------NGEDFIEQVVPHELAHLH 73 (156)
T ss_pred hcchhhcccc-ccccCHHHHH--H------ccHHHHHHHHHHHHHHHH
Confidence 3444677766 7777653210 0 123457899999999974
No 123
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=24.02 E-value=32 Score=29.12 Aligned_cols=15 Identities=20% Similarity=0.264 Sum_probs=12.1
Q ss_pred hhhhhhhhhhhcCCC
Q 043639 174 ESVAVHEIGHLLGLT 188 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~ 188 (261)
-.+++|||.|++.-.
T Consensus 37 G~ilahel~hafd~~ 51 (206)
T PF01431_consen 37 GFILAHELMHAFDPE 51 (206)
T ss_dssp HHHHHHHHHHCTSTT
T ss_pred HHHHHHHHHHHHHHh
Confidence 479999999987553
No 124
>PF02128 Peptidase_M36: Fungalysin metallopeptidase (M36); InterPro: IPR001842 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M36 (fungalysin family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Fungalysin is produced by fungi, Aspergillus and other species, to aid degradation of host lung cell walls on infection. The enzyme is a 42kDa single chain protein, with a pH optimum of 7.5-8.0 and optimal temperature of 60 celcius [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0005615 extracellular space
Probab=23.42 E-value=47 Score=31.71 Aligned_cols=15 Identities=27% Similarity=0.463 Sum_probs=12.2
Q ss_pred hhhhhhhhhhhcCCC
Q 043639 174 ESVAVHEIGHLLGLT 188 (261)
Q Consensus 174 ~~Va~HEiGHaLGL~ 188 (261)
-.|++||.||.|--.
T Consensus 186 ~~Ii~HEy~HGiSnR 200 (378)
T PF02128_consen 186 NGIIAHEYGHGISNR 200 (378)
T ss_pred cCeeEEeeccccccc
Confidence 579999999987543
No 125
>PRK10733 hflB ATP-dependent metalloprotease; Reviewed
Probab=23.37 E-value=29 Score=35.25 Aligned_cols=13 Identities=54% Similarity=0.718 Sum_probs=11.4
Q ss_pred chhhhhhhhhhhc
Q 043639 173 LESVAVHEIGHLL 185 (261)
Q Consensus 173 l~~Va~HEiGHaL 185 (261)
...|+.||+||||
T Consensus 408 ~~~~a~he~gha~ 420 (644)
T PRK10733 408 KESTAYHEAGHAI 420 (644)
T ss_pred HHHHHHHHHHHHH
Confidence 4679999999997
No 126
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=23.04 E-value=34 Score=33.60 Aligned_cols=14 Identities=29% Similarity=0.361 Sum_probs=12.0
Q ss_pred hhhhhhhhhhhcCC
Q 043639 174 ESVAVHEIGHLLGL 187 (261)
Q Consensus 174 ~~Va~HEiGHaLGL 187 (261)
+.|+.||+||+...
T Consensus 248 ~~t~~HE~GH~~yy 261 (477)
T cd06461 248 FVTVHHEMGHIQYY 261 (477)
T ss_pred HHHHHHHHHHHHHH
Confidence 67899999999854
No 127
>PRK15410 DgsA anti-repressor MtfA; Provisional
Probab=22.88 E-value=36 Score=30.81 Aligned_cols=20 Identities=20% Similarity=0.366 Sum_probs=16.5
Q ss_pred cchhhhhhhhhhhcCCCCCC
Q 043639 172 DLESVAVHEIGHLLGLTHSP 191 (261)
Q Consensus 172 ~l~~Va~HEiGHaLGL~Hs~ 191 (261)
+-.+|++||+-|.|=+.+..
T Consensus 141 dg~NVvIHEFAH~LDm~~G~ 160 (260)
T PRK15410 141 SGFNLIIHEVAHKLDMRNGD 160 (260)
T ss_pred CCcchhHhHHHhHhhhhcCC
Confidence 45799999999999887643
No 128
>PF13402 M60-like: Peptidase M60-like family; PDB: 4FCA_A.
Probab=22.37 E-value=33 Score=30.96 Aligned_cols=11 Identities=36% Similarity=0.341 Sum_probs=9.0
Q ss_pred hhhhhhhhhhc
Q 043639 175 SVAVHEIGHLL 185 (261)
Q Consensus 175 ~Va~HEiGHaL 185 (261)
.-..|||||.+
T Consensus 221 WG~~HE~GH~~ 231 (307)
T PF13402_consen 221 WGPWHELGHNH 231 (307)
T ss_dssp HHHHHHHHHHH
T ss_pred eeehhhhhhhc
Confidence 35899999985
No 129
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=22.03 E-value=34 Score=33.57 Aligned_cols=11 Identities=45% Similarity=0.667 Sum_probs=9.0
Q ss_pred hhhhhhhhhhc
Q 043639 175 SVAVHEIGHLL 185 (261)
Q Consensus 175 ~Va~HEiGHaL 185 (261)
.=++||.||||
T Consensus 261 ~g~iHE~GHAl 271 (497)
T COG2317 261 FGTIHETGHAL 271 (497)
T ss_pred HHHHHhhhhHH
Confidence 34689999996
No 130
>CHL00176 ftsH cell division protein; Validated
Probab=21.89 E-value=34 Score=34.86 Aligned_cols=14 Identities=36% Similarity=0.707 Sum_probs=11.9
Q ss_pred cchhhhhhhhhhhc
Q 043639 172 DLESVAVHEIGHLL 185 (261)
Q Consensus 172 ~l~~Va~HEiGHaL 185 (261)
....||.||.||||
T Consensus 437 ~~~~vA~hEaGhA~ 450 (638)
T CHL00176 437 NKRLIAYHEVGHAI 450 (638)
T ss_pred HHHHHHHHhhhhHH
Confidence 34679999999998
No 131
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=21.30 E-value=33 Score=32.48 Aligned_cols=16 Identities=31% Similarity=0.692 Sum_probs=13.0
Q ss_pred cccchhhhhhhhhhhc
Q 043639 170 AVDLESVAVHEIGHLL 185 (261)
Q Consensus 170 ~~~l~~Va~HEiGHaL 185 (261)
...+-.|+.||+||..
T Consensus 121 ~sEvAAVl~HEmgHVt 136 (479)
T COG4784 121 SSEVAAVLAHEMGHVT 136 (479)
T ss_pred HHHHHHHHHhhhhhee
Confidence 3456799999999985
No 132
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=21.24 E-value=34 Score=27.68 Aligned_cols=15 Identities=27% Similarity=0.357 Sum_probs=12.8
Q ss_pred ccchhhhhhhhhhhc
Q 043639 171 VDLESVAVHEIGHLL 185 (261)
Q Consensus 171 ~~l~~Va~HEiGHaL 185 (261)
..+..|++||+-|++
T Consensus 57 ~~l~~~l~HEm~H~~ 71 (146)
T smart00731 57 DRLRETLLHELCHAA 71 (146)
T ss_pred HHHHhhHHHHHHHHH
Confidence 456789999999986
No 133
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=20.79 E-value=48 Score=23.00 Aligned_cols=19 Identities=26% Similarity=0.433 Sum_probs=16.2
Q ss_pred chhhhhhhhhhhcCCCCCC
Q 043639 173 LESVAVHEIGHLLGLTHSP 191 (261)
Q Consensus 173 l~~Va~HEiGHaLGL~Hs~ 191 (261)
...-++|++-+.+||.|.+
T Consensus 29 ~eR~~vH~~a~~~gL~s~S 47 (60)
T cd02640 29 EERALIHQIAQKYGLKSRS 47 (60)
T ss_pred HHHHHHHHHHHHcCCceee
Confidence 3578999999999998864
No 134
>PF02074 Peptidase_M32: Carboxypeptidase Taq (M32) metallopeptidase; InterPro: IPR001333 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M32 (carboxypeptidase Taq family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Carboxypeptidase Taq is a zinc-containing thermostable metallopeptidase. It was originally discovered and purified from Thermus aquaticus; optimal enzymatic activity occurs at 80 celcius. Although very little is known about this enzyme, it is thought either to be associated with a membrane or to be particle bound.; GO: 0004181 metallocarboxypeptidase activity, 0006508 proteolysis; PDB: 1K9X_A 1KA4_A 1KA2_A 3DWC_A 1WGZ_A 3HQ2_A 3HOA_B.
Probab=20.14 E-value=38 Score=33.44 Aligned_cols=16 Identities=44% Similarity=0.648 Sum_probs=11.5
Q ss_pred hhhhhhhhhhhc---CCCC
Q 043639 174 ESVAVHEIGHLL---GLTH 189 (261)
Q Consensus 174 ~~Va~HEiGHaL---GL~H 189 (261)
..-++||.||+| ||..
T Consensus 260 l~s~iHE~GHalYEq~~~~ 278 (494)
T PF02074_consen 260 LFSTIHETGHALYEQGLPP 278 (494)
T ss_dssp HHHHHHHHHHHHHHHTS-G
T ss_pred HHHHHHHHHHHHHHcCCCh
Confidence 346789999996 7753
Done!