Query psy12854
Match_columns 265
No_of_seqs 129 out of 1161
Neff 7.0
Searched_HMMs 46136
Date Fri Aug 16 21:30:04 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy12854.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12854hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG0339 Dcp Zn-dependent oligo 100.0 1.6E-72 3.6E-77 545.6 24.5 239 7-265 439-682 (683)
2 PRK10280 dipeptidyl carboxypep 100.0 5.5E-71 1.2E-75 549.9 25.5 238 8-265 437-680 (681)
3 PRK10911 oligopeptidase A; Pro 100.0 8.8E-68 1.9E-72 527.9 26.2 240 7-265 435-679 (680)
4 KOG2089|consensus 100.0 2.1E-68 4.5E-73 511.5 18.6 242 5-264 465-708 (718)
5 cd06456 M3A_DCP_Oligopeptidase 100.0 1.7E-64 3.6E-69 480.8 26.4 238 7-264 180-422 (422)
6 cd06457 M3A_MIP Peptidase M3 m 100.0 3.4E-62 7.4E-67 469.7 23.8 231 7-260 220-456 (458)
7 cd06455 M3A_TOP Peptidase M3 T 100.0 3.3E-60 7.2E-65 457.5 25.1 235 7-262 235-472 (472)
8 KOG2090|consensus 100.0 6.7E-56 1.4E-60 421.2 21.0 233 6-261 450-688 (704)
9 PF01432 Peptidase_M3: Peptida 100.0 5.6E-56 1.2E-60 426.2 20.1 239 6-264 213-458 (458)
10 cd06258 Peptidase_M3_like The 100.0 2.5E-49 5.5E-54 369.7 25.5 234 9-261 128-364 (365)
11 TIGR00181 pepF oligoendopeptid 100.0 2.4E-33 5.2E-38 277.3 18.3 208 4-261 354-576 (591)
12 TIGR02290 M3_fam_3 oligoendope 100.0 4.4E-33 9.6E-38 275.3 19.3 210 4-261 350-572 (587)
13 cd06459 M3B_Oligoendopeptidase 100.0 1.3E-32 2.8E-37 261.0 19.1 211 4-261 197-421 (427)
14 TIGR02289 M3_not_pepF oligoend 100.0 2E-29 4.3E-34 247.6 19.5 204 4-254 312-528 (549)
15 COG1164 Oligoendopeptidase F [ 99.9 2.5E-25 5.5E-30 219.9 19.2 212 3-261 354-580 (598)
16 cd06460 M32_Taq Peptidase fami 99.8 3.4E-17 7.3E-22 154.5 18.8 230 13-262 144-391 (396)
17 cd06461 M2_ACE Peptidase famil 99.3 1E-10 2.2E-15 113.3 17.1 214 12-260 233-464 (477)
18 PF02074 Peptidase_M32: Carbox 98.7 4.9E-07 1.1E-11 88.0 16.6 213 33-262 257-490 (494)
19 COG2317 Zn-dependent carboxype 98.3 1.7E-05 3.6E-10 75.8 13.8 226 14-262 245-491 (497)
20 PF01401 Peptidase_M2: Angiote 98.2 4.6E-05 1E-09 76.0 16.7 211 28-261 342-567 (595)
21 KOG3690|consensus 97.1 0.059 1.3E-06 53.4 17.9 214 28-261 364-596 (646)
22 COG2856 Predicted Zn peptidase 92.8 0.079 1.7E-06 46.5 2.6 32 8-49 55-86 (213)
23 PF14247 DUF4344: Domain of un 88.9 0.28 6.1E-06 43.2 2.3 18 34-51 91-108 (220)
24 PF00413 Peptidase_M10: Matrix 88.8 0.23 4.9E-06 40.2 1.5 13 35-47 105-117 (154)
25 PF06114 DUF955: Domain of unk 88.4 0.41 8.9E-06 36.4 2.7 22 32-53 39-60 (122)
26 cd04268 ZnMc_MMP_like Zinc-dep 85.7 0.45 9.8E-06 39.0 1.6 14 35-48 94-107 (165)
27 cd04279 ZnMc_MMP_like_1 Zinc-d 85.0 0.48 1E-05 38.8 1.5 15 34-48 103-117 (156)
28 cd04278 ZnMc_MMP Zinc-dependen 84.8 0.38 8.3E-06 39.6 0.8 16 33-48 105-120 (157)
29 PF13058 DUF3920: Protein of u 84.3 0.71 1.5E-05 36.2 2.0 16 34-49 75-90 (126)
30 cd04277 ZnMc_serralysin_like Z 83.3 0.66 1.4E-05 39.2 1.6 16 33-48 111-126 (186)
31 PF13688 Reprolysin_5: Metallo 81.9 0.73 1.6E-05 39.0 1.4 17 32-48 139-155 (196)
32 PF05572 Peptidase_M43: Pregna 81.9 1.4 2.9E-05 36.6 2.9 17 32-48 66-82 (154)
33 PF13582 Reprolysin_3: Metallo 81.9 0.7 1.5E-05 36.0 1.2 13 35-47 107-119 (124)
34 PF13583 Reprolysin_4: Metallo 81.8 1.3 2.8E-05 38.4 2.9 15 33-47 135-149 (206)
35 smart00235 ZnMc Zinc-dependent 81.6 0.71 1.5E-05 37.0 1.1 11 37-47 88-98 (140)
36 PF13398 Peptidase_M50B: Pepti 81.5 1.2 2.6E-05 38.4 2.6 24 33-56 20-43 (200)
37 PF13574 Reprolysin_2: Metallo 81.4 0.9 1.9E-05 38.3 1.7 14 35-48 111-124 (173)
38 cd04327 ZnMc_MMP_like_3 Zinc-d 79.4 1 2.2E-05 38.7 1.4 15 36-50 93-107 (198)
39 PF04298 Zn_peptidase_2: Putat 78.9 1.5 3.3E-05 38.6 2.4 19 32-50 86-104 (222)
40 cd00203 ZnMc Zinc-dependent me 78.7 1 2.2E-05 36.8 1.2 14 34-47 95-108 (167)
41 cd04267 ZnMc_ADAM_like Zinc-de 78.2 2.1 4.6E-05 36.1 3.1 16 32-47 130-145 (192)
42 PF14891 Peptidase_M91: Effect 78.2 1.6 3.4E-05 36.9 2.2 21 34-54 102-122 (174)
43 PF04228 Zn_peptidase: Putativ 76.1 1.7 3.6E-05 40.0 1.9 19 35-53 170-188 (292)
44 cd04272 ZnMc_salivary_gland_MP 75.0 1.8 4E-05 37.6 1.8 16 32-47 142-157 (220)
45 PF10460 Peptidase_M30: Peptid 74.4 4.3 9.2E-05 38.5 4.2 46 35-83 139-184 (366)
46 cd04271 ZnMc_ADAM_fungal Zinc- 72.2 1.2 2.6E-05 39.4 -0.1 14 34-47 144-157 (228)
47 PF13485 Peptidase_MA_2: Pepti 72.2 7.3 0.00016 29.5 4.5 35 37-78 27-61 (128)
48 cd04270 ZnMc_TACE_like Zinc-de 71.2 2.2 4.8E-05 38.0 1.4 12 36-47 168-179 (244)
49 cd04280 ZnMc_astacin_like Zinc 71.0 2.1 4.5E-05 36.3 1.2 13 36-48 75-87 (180)
50 PF02031 Peptidase_M7: Strepto 69.2 3.2 6.9E-05 33.5 1.8 17 32-48 74-90 (132)
51 PF01400 Astacin: Astacin (Pep 68.0 3.4 7.4E-05 35.3 1.9 15 35-49 79-93 (191)
52 PF12388 Peptidase_M57: Dual-a 67.6 2.5 5.5E-05 37.0 1.0 13 37-49 135-147 (211)
53 cd04269 ZnMc_adamalysin_II_lik 67.0 3 6.6E-05 35.3 1.4 14 34-47 130-143 (194)
54 cd04283 ZnMc_hatching_enzyme Z 66.1 3.2 7E-05 35.5 1.3 15 36-50 78-92 (182)
55 PF13699 DUF4157: Domain of un 66.0 3.9 8.4E-05 30.1 1.6 19 32-50 58-76 (79)
56 PF02163 Peptidase_M50: Peptid 64.0 4.5 9.7E-05 34.1 1.8 24 36-59 8-31 (192)
57 cd04281 ZnMc_BMP1_TLD Zinc-dep 60.8 4.5 9.8E-05 35.1 1.3 14 36-49 88-101 (200)
58 TIGR03296 M6dom_TIGR03296 M6 f 59.1 1.9 4.2E-05 39.3 -1.4 14 35-48 165-178 (286)
59 cd05709 S2P-M50 Site-2 proteas 58.7 6.2 0.00014 32.9 1.8 19 36-54 9-27 (180)
60 cd04276 ZnMc_MMP_like_2 Zinc-d 57.5 5.9 0.00013 34.3 1.5 12 37-48 118-129 (197)
61 PF09471 Peptidase_M64: IgA Pe 56.0 6 0.00013 35.8 1.3 19 35-53 216-234 (264)
62 cd06163 S2P-M50_PDZ_RseP-like 56.0 7.3 0.00016 33.2 1.8 20 36-55 10-29 (182)
63 cd06161 S2P-M50_SpoIVFB SpoIVF 53.5 8.5 0.00018 33.3 1.8 17 37-53 40-56 (208)
64 cd04282 ZnMc_meprin Zinc-depen 52.8 7.1 0.00015 34.6 1.2 16 36-51 121-136 (230)
65 COG2738 Predicted Zn-dependent 51.8 10 0.00022 33.0 1.9 18 32-49 89-106 (226)
66 PF01435 Peptidase_M48: Peptid 50.8 8.3 0.00018 32.9 1.3 21 31-51 84-105 (226)
67 PF01421 Reprolysin: Reprolysi 50.6 15 0.00033 31.1 2.9 15 33-47 129-143 (199)
68 PF07607 DUF1570: Protein of u 47.7 19 0.0004 29.1 2.8 41 35-81 1-42 (128)
69 cd06159 S2P-M50_PDZ_Arch Uncha 47.0 12 0.00026 33.8 1.8 16 36-51 119-134 (263)
70 cd04273 ZnMc_ADAMTS_like Zinc- 46.2 4 8.7E-05 35.1 -1.4 15 33-47 138-152 (207)
71 cd06164 S2P-M50_SpoIVFB_CBS Sp 44.4 14 0.00031 32.5 1.8 16 36-51 54-69 (227)
72 PF00427 PBS_linker_poly: Phyc 43.5 14 0.0003 29.9 1.5 31 226-256 58-89 (131)
73 COG0501 HtpX Zn-dependent prot 43.1 12 0.00026 33.5 1.2 20 30-49 151-171 (302)
74 PF05548 Peptidase_M11: Gameto 42.6 13 0.00027 34.6 1.2 14 34-47 149-162 (314)
75 PF10462 Peptidase_M66: Peptid 42.0 14 0.00031 34.2 1.4 18 30-47 188-205 (305)
76 PRK03982 heat shock protein Ht 40.4 14 0.00031 33.6 1.2 19 31-49 120-139 (288)
77 cd06162 S2P-M50_PDZ_SREBP Ster 40.3 18 0.00038 33.1 1.8 16 36-51 136-151 (277)
78 PF13402 M60-like: Peptidase M 39.5 16 0.00034 33.1 1.3 16 35-50 219-234 (307)
79 PRK13267 archaemetzincin-like 39.0 17 0.00038 30.9 1.5 17 31-47 121-137 (179)
80 cd04275 ZnMc_pappalysin_like Z 38.8 4.9 0.00011 35.5 -2.0 17 32-48 134-150 (225)
81 PRK03001 M48 family peptidase; 37.9 16 0.00035 33.1 1.1 19 31-49 119-138 (283)
82 cd06160 S2P-M50_like_2 Unchara 37.4 22 0.00047 30.3 1.8 17 37-53 43-59 (183)
83 KOG3714|consensus 37.4 16 0.00035 35.1 1.1 20 36-55 160-179 (411)
84 PF01447 Peptidase_M4: Thermol 36.8 23 0.0005 29.2 1.8 19 29-47 128-147 (150)
85 TIGR00054 RIP metalloprotease 36.1 21 0.00045 34.3 1.7 17 36-52 15-31 (420)
86 PF07998 Peptidase_M54: Peptid 35.8 21 0.00046 30.8 1.5 22 27-48 137-158 (194)
87 PRK02870 heat shock protein Ht 35.7 19 0.0004 33.8 1.2 20 30-49 167-187 (336)
88 PF01742 Peptidase_M27: Clostr 35.0 28 0.00062 33.3 2.3 24 31-54 210-233 (408)
89 PRK01345 heat shock protein Ht 35.0 19 0.00042 33.3 1.2 20 31-50 119-139 (317)
90 PRK04897 heat shock protein Ht 34.9 19 0.00042 33.0 1.2 19 31-49 132-151 (298)
91 PRK05457 heat shock protein Ht 31.4 24 0.00052 32.1 1.2 19 31-49 129-148 (284)
92 PRK10779 zinc metallopeptidase 31.3 28 0.00062 33.7 1.7 15 36-50 16-30 (449)
93 PF05547 Peptidase_M6: Immune 30.8 7.8 0.00017 39.5 -2.3 13 35-47 221-233 (645)
94 PRK03072 heat shock protein Ht 30.8 25 0.00054 32.1 1.2 19 31-49 122-141 (288)
95 KOG2921|consensus 29.7 31 0.00068 33.1 1.7 34 22-55 118-151 (484)
96 COG4227 Antirestriction protei 29.7 24 0.00051 32.1 0.8 11 36-46 204-214 (316)
97 COG5549 Predicted Zn-dependent 29.3 26 0.00057 30.8 1.0 22 27-48 179-200 (236)
98 PRK02391 heat shock protein Ht 27.9 30 0.00065 31.7 1.2 19 31-49 128-147 (296)
99 PF04539 Sigma70_r3: Sigma-70 27.9 50 0.0011 23.3 2.2 24 242-265 10-33 (78)
100 PRK01265 heat shock protein Ht 27.1 32 0.00069 32.1 1.2 19 31-49 135-154 (324)
101 PHA00527 hypothetical protein 26.9 1.5E+02 0.0033 23.2 4.6 22 36-57 74-95 (129)
102 TIGR00488 putative HD superfam 25.4 3.7E+02 0.0079 21.8 7.8 98 35-133 35-146 (158)
103 PHA02456 zinc metallopeptidase 24.6 1E+02 0.0022 24.5 3.4 13 37-49 81-93 (141)
104 KOG2719|consensus 24.3 47 0.001 32.2 1.8 18 36-53 281-298 (428)
105 PHA00657 crystallin beta/gamma 24.3 50 0.0011 36.4 2.1 24 28-51 784-807 (2052)
106 COG1913 Predicted Zn-dependent 23.2 45 0.00098 28.4 1.3 20 28-47 117-136 (181)
107 KOG1565|consensus 23.0 29 0.00062 34.1 0.1 18 35-52 211-228 (469)
108 PLN02791 Nudix hydrolase homol 22.2 37 0.00079 35.5 0.7 20 34-53 584-603 (770)
109 PF12315 DUF3633: Protein of u 22.0 51 0.0011 28.9 1.4 13 36-48 94-106 (212)
110 PF13946 DUF4214: Domain of un 21.9 72 0.0016 22.9 2.0 21 241-261 25-45 (75)
111 KOG2661|consensus 21.8 1.1E+02 0.0023 29.1 3.5 53 32-84 271-329 (424)
112 KOG3607|consensus 21.5 48 0.001 34.4 1.4 16 32-47 320-335 (716)
113 PF01457 Peptidase_M8: Leishma 21.2 52 0.0011 32.6 1.5 15 34-48 209-223 (521)
114 PF02128 Peptidase_M36: Fungal 21.1 16 0.00035 34.8 -2.0 40 14-53 148-203 (378)
115 COG2321 Predicted metalloprote 20.6 56 0.0012 29.9 1.4 20 34-53 166-185 (295)
116 PF10263 SprT-like: SprT-like 20.5 70 0.0015 25.8 1.9 17 36-52 61-77 (157)
No 1
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=100.00 E-value=1.6e-72 Score=545.64 Aligned_cols=239 Identities=34% Similarity=0.551 Sum_probs=227.3
Q ss_pred CccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHHH
Q psy12854 7 KTVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEERV 86 (265)
Q Consensus 7 ~~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~ 86 (265)
+.+|+||++++|||.+|++++|+||+|+||+||||||||+||++|++++|+.+|||+ |||||||+||||||||||+|++
T Consensus 439 ~~~q~PV~ylvCNF~~p~~gkPaLls~dEV~TLFHEfGHgLH~mlt~v~~~~vsGt~-v~wDfVElPSQ~mE~w~~~p~v 517 (683)
T COG0339 439 GGGQKPVIYLVCNFTKPVGGKPALLSHDEVTTLFHEFGHGLHHLLTRVKYPGVSGTN-VPWDFVELPSQFMENWCWEPEV 517 (683)
T ss_pred CCcccceEEEeccCCCCCCCCCceeeHHHHHHHHHHhhhHHHHHhhcCCccccCCCC-CCcchhhccHHHHHHhhcCHHH
Confidence 458999999999999999999999999999999999999999999999999999996 9999999999999999999999
Q ss_pred HHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC----CcHHHHHHHHccccCCCCCCCCCC
Q psy12854 87 FAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK----PHWSEISRELYPLHFGFPIDKYSN 161 (265)
Q Consensus 87 L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~----~~~~~l~~~l~~~~~~~~~~~~~~ 161 (265)
|..+++||+||++||++ +++++++++++.|+.++||+.+|+|||.+|...+ ..+.++.+++.++..-.+..++.+
T Consensus 518 L~~~a~Hy~TGe~lP~~ll~k~laaknf~~g~~t~rql~fal~Dm~~H~~~~~~~~~~i~~~~~~~~~~~~v~~~~~~~~ 597 (683)
T COG0339 518 LAKYARHYQTGEPLPKELLDKMLAAKNFQAGLFTLRQLEFALFDMRLHTEFDPDANADILEFEAEVLKKVAVLPSIPPRR 597 (683)
T ss_pred HHHHHHhhccCCcCHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHhhccCCcccccCHHHHHHHHHHHhCCCCCcchhh
Confidence 99999999999999999 9999999999999999999999999999999873 357889999998877665567788
Q ss_pred cCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccH
Q psy12854 162 LPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHS 241 (265)
Q Consensus 162 ~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~ 241 (265)
|+++|+|||+|||+||||||+||+|+++|.|.+|.+.|. +| +++|+|||+.||+.|||++|
T Consensus 598 ~~~~F~HIFagGYsAGYYSY~WaeVLsaDafa~Fee~g~----~~---------------~e~G~rfrd~ILs~GGS~dp 658 (683)
T COG0339 598 RPHSFGHIFAGGYSAGYYSYLWAEVLSADAFAAFEEEGP----FN---------------RETGQRFRDAILSRGGSRDP 658 (683)
T ss_pred ccccccceecCcccchhHHHHHHHHHhhHHHHHHHhcCC----CC---------------HHHHHHHHHHHHhccCCcCH
Confidence 999999999999999999999999999999999999876 66 99999999999999999999
Q ss_pred HHHHHHhhCCCCCchHHHHhcCCC
Q psy12854 242 SEVFRRFRGRDPCFKPFLDMFRLN 265 (265)
Q Consensus 242 ~e~l~~flGr~ps~~a~~~~~g~~ 265 (265)
++++++|+||+|+.||+|+++|+.
T Consensus 659 ~e~f~~frGrep~~dalLr~~Gl~ 682 (683)
T COG0339 659 MELFKAFRGREPSIDALLRHRGLA 682 (683)
T ss_pred HHHHHHHhcCCCChhHHHHhcCCC
Confidence 999999999999999999999974
No 2
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=100.00 E-value=5.5e-71 Score=549.94 Aligned_cols=238 Identities=26% Similarity=0.440 Sum_probs=218.5
Q ss_pred ccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHHHH
Q psy12854 8 TVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEERVF 87 (265)
Q Consensus 8 ~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~L 87 (265)
.+++|+++++|||++|++++|+||+|+||+||||||||+||++|++++|+++|||+ |+|||||+||||||+|||+|++|
T Consensus 437 ~~~~Pv~~lvcNf~~p~~~~p~LL~~~eV~TlFHEfGHalH~lls~~~y~~~sGt~-v~~DfVE~PSq~mE~w~~~~~vL 515 (681)
T PRK10280 437 NETRPVIYNVCNYQKPAAGQPALLLWDDVITLFHEFGHTLHGLFARQRYATLSGTN-TPRDFVEFPSQINEHWASHPQVF 515 (681)
T ss_pred CCCCCeEEEECCCCCCCCCCCCccCHHHHHHHHHHHHHHHHHHHhCCCccccCCCC-CCcchhcCcHHHHHHHhcCHHHH
Confidence 46789999999999999999999999999999999999999999999999999985 99999999999999999999999
Q ss_pred HHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC----CcHHHHHHHHccccCC-CCCCCCCC
Q psy12854 88 AELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK----PHWSEISRELYPLHFG-FPIDKYSN 161 (265)
Q Consensus 88 ~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~----~~~~~l~~~l~~~~~~-~~~~~~~~ 161 (265)
+.+++||+||+|||++ +++|+++++++.|+.+++|+.+|+|||.+|.... ....+..+++..+... ++..++++
T Consensus 516 ~~~a~Hy~TgepiP~~l~~~l~~ar~~~~g~~~~~ql~~al~D~~lH~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 595 (681)
T PRK10280 516 ARYARHYQSGEAMPDELQEKMRNASLFNKGYDMSELLSAALLDMRWHCLEENEAMQDVDDFELRALVAENLDLPAVPPRY 595 (681)
T ss_pred HHHhhccCCCCCCCHHHHHHHHHhhCcchHHHHHHHHHHHHHhHHHhccCcccccccHHHHHHHHHHHhCCCCCCCCCCC
Confidence 9999999999999999 9999999999999999999999999999997632 1344555555555543 56667788
Q ss_pred cCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccH
Q psy12854 162 LPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHS 241 (265)
Q Consensus 162 ~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~ 241 (265)
++++|+|||++||+|+||||+||+|+|+|+|++|++++. +| +++|++||++||++|||++|
T Consensus 596 ~~~~F~Hif~ggY~AgYYsYlwaevlaaD~f~~f~~~g~----~n---------------~~~G~~fr~~iL~~GGs~d~ 656 (681)
T PRK10280 596 RSSYFAHIFGGGYAAGYYAYLWTQMLADDGYQWFVEQGG----LT---------------RENGQRFREAILSRGNSTDL 656 (681)
T ss_pred CCCcccccccCCcchhhHHHHHHHHHHHHHHHHHHhcCC----CC---------------HHHHHHHHHHHhhcCCCcCH
Confidence 999999999889999999999999999999999988654 45 99999999999999999999
Q ss_pred HHHHHHhhCCCCCchHHHHhcCCC
Q psy12854 242 SEVFRRFRGRDPCFKPFLDMFRLN 265 (265)
Q Consensus 242 ~e~l~~flGr~ps~~a~~~~~g~~ 265 (265)
.++|++||||+|+++|||+++|++
T Consensus 657 ~~~~~~FlGR~P~~~alL~~~Gl~ 680 (681)
T PRK10280 657 ERLYRQWRGHAPQIMPMLQHRGLN 680 (681)
T ss_pred HHHHHHhcCCCCChHHHHHhcCCC
Confidence 999999999999999999999985
No 3
>PRK10911 oligopeptidase A; Provisional
Probab=100.00 E-value=8.8e-68 Score=527.94 Aligned_cols=240 Identities=30% Similarity=0.541 Sum_probs=226.2
Q ss_pred CccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHHH
Q psy12854 7 KTVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEERV 86 (265)
Q Consensus 7 ~~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~ 86 (265)
|.++.|+++|+|||++|.+++|+||+|+||.||||||||+||++|++++|+.++||++++|||||+|||+||+|+|+|++
T Consensus 435 g~~~~Pv~~l~~Nf~~p~~~~p~LL~~~~v~tlfHEfGHalH~~ls~~~~~~~sGt~~~~~D~vE~pS~~~E~~~~~~~v 514 (680)
T PRK10911 435 GSLQKPVAYLTCNFNRPVNGKPALFTHDEVITLFHEFGHGLHHMLTRIETAGVSGISGVPWDAVELPSQFMENWCWEPEA 514 (680)
T ss_pred CceeCCeEEEECCCCCCCCCCCcccCHHHHHHHHHHHhHHHHHHHhCCCcCcCCCcCCCCchHhhccHHHHHHHhcCHHH
Confidence 55688999999999999999999999999999999999999999999999999998889999999999999999999999
Q ss_pred HHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC----CcHHHHHHHHccccCCCCCCCCCC
Q psy12854 87 FAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK----PHWSEISRELYPLHFGFPIDKYSN 161 (265)
Q Consensus 87 L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~----~~~~~l~~~l~~~~~~~~~~~~~~ 161 (265)
|+.+++||+||++||++ +++++++++++.++.++||+.+|.||+.+|...+ ..+.++++++++++..++..++++
T Consensus 515 L~~~a~H~~tgeplp~~l~~~l~~~~~~~~~~~~~rql~~a~~D~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 594 (680)
T PRK10911 515 LAFISGHYETGEPLPKELLDKMLAAKNYQAALFILRQLEFGLFDFRLHAEFDPDQGAKILETLAEIKKQVAVVPSPSWGR 594 (680)
T ss_pred HHHHHHHhcCCCCCCHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHhhccccccccCHHHHHHHHHHHcCCCCCCCCCc
Confidence 99999999999999999 9999999999999999999999999999997532 157889999998887667667889
Q ss_pred cCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccH
Q psy12854 162 LPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHS 241 (265)
Q Consensus 162 ~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~ 241 (265)
++++|+|||++||+|+||||+||+++|+|+|++|++++. +| +++|++||++||++|||++|
T Consensus 595 ~~~~F~Hif~gGY~AgYYsYlwa~vla~d~~~~f~~~g~----~~---------------~~~g~~~r~~iL~~Ggs~~p 655 (680)
T PRK10911 595 FPHAFSHIFAGGYAAGYYSYLWADVLAADAFSRFEEEGI----FN---------------RETGQSFLDNILSRGGSEEP 655 (680)
T ss_pred CCccccccCcCCcccchHHHHHHHHHHHHHHHHHHhcCC----CC---------------HHHHHHHHHHHHhCcCCcCH
Confidence 999999999999999999999999999999999988765 56 99999999999999999999
Q ss_pred HHHHHHhhCCCCCchHHHHhcCCC
Q psy12854 242 SEVFRRFRGRDPCFKPFLDMFRLN 265 (265)
Q Consensus 242 ~e~l~~flGr~ps~~a~~~~~g~~ 265 (265)
.+++++||||+|+++|||+++|++
T Consensus 656 ~~~~~~F~GR~P~~~all~~~gl~ 679 (680)
T PRK10911 656 MELFKRFRGREPQLDAMLEHYGIK 679 (680)
T ss_pred HHHHHHhcCCCCChHHHHHhcCCC
Confidence 999999999999999999999985
No 4
>KOG2089|consensus
Probab=100.00 E-value=2.1e-68 Score=511.53 Aligned_cols=242 Identities=37% Similarity=0.649 Sum_probs=230.3
Q ss_pred CCCccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccH
Q psy12854 5 GTKTVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEE 84 (265)
Q Consensus 5 g~~~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~ 84 (265)
-++.+++||++++|||++|..++|+||+|+||+|+||||||+||++|+++.|+.+||+++++|||||+||||||||+|++
T Consensus 465 ~~ss~~~PVaalv~nfS~p~~~kpsll~~~ev~t~FheFGh~~q~ll~Qa~~~~fsG~~~vewDave~psq~Lenwv~~~ 544 (718)
T KOG2089|consen 465 KDSSRRIPVAALVCNFSKPQSDKPSLLGHDEVETLFHEFGHVLQHLLTQADFARFSGPRNVEWDAVEVPSQFLENWVWDP 544 (718)
T ss_pred cCCccccchHHHHHhcCCcccCCCCccchHHHHHHHHHHhHHHHHHHhcCccccccCcccCCcchhhchHHHHHHhccCc
Confidence 36788999999999999999999999999999999999999999999999999999998899999999999999999999
Q ss_pred HHHHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCCCcHHHHHHHHccccCCCCCCCCCCcC
Q psy12854 85 RVFAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPKPHWSEISRELYPLHFGFPIDKYSNLP 163 (265)
Q Consensus 85 ~~L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~ 163 (265)
++|..+|.||+||++||++ +++|+.+|..+.|+.++|||++|.||+.+|........+.++++++++.++|..+++++|
T Consensus 545 d~L~~lS~Hy~tge~l~eEl~~kl~~~r~~~~gl~tlrqL~~a~~D~~lht~~d~~~~~~~~~l~~~i~~~p~~~~d~~p 624 (718)
T KOG2089|consen 545 DTLRSLSKHYKTGEPLPEELLKKLILTRTVNAGLFTLRQLVLADFDLELHTKTDADLADTYRQLCQEISIVPATPGDNMP 624 (718)
T ss_pred hHhhhhcccccCCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHhhccccchHHHHHHhchhheecCCCCCCCCC
Confidence 9999999999999999999 999999999999999999999999999999988666678899999999999999999999
Q ss_pred ccccccCCCCccCCchhhHhHHHHHHHHHHH-HHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccHH
Q psy12854 164 CRFVEVGSGDLAAGYYSFLWSKLVSADIFYA-FKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHSS 242 (265)
Q Consensus 164 ~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~-f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~~ 242 (265)
|+|+|||++||+|+||+|+||+|+|+|||.. |+++|. +| ..++|+|||++||++||+.+++
T Consensus 625 csF~hifa~gy~A~yY~yLWsEv~aaDif~t~fe~~g~----~N--------------~~~~G~ryR~tiLa~GG~~~~~ 686 (718)
T KOG2089|consen 625 CSFGHIFAGGYAAGYYSYLWSEVLAADIFSTFFEQEGE----DN--------------IKEVGMRYRNTILAPGGGKDPM 686 (718)
T ss_pred ccccchhcCchHHHHHHHHHHHHHHHHHHHHHhhhcCC----cc--------------HHHHHHHHHHhhhcCCCCccHH
Confidence 9999999989999999999999999999986 665665 45 2399999999999999999999
Q ss_pred HHHHHhhCCCCCchHHHHhcCC
Q psy12854 243 EVFRRFRGRDPCFKPFLDMFRL 264 (265)
Q Consensus 243 e~l~~flGr~ps~~a~~~~~g~ 264 (265)
|+|++|+||+||.+|||+++||
T Consensus 687 e~f~~FlGRePS~~Afl~s~gl 708 (718)
T KOG2089|consen 687 EVFKRFLGREPSQEAFLKSLGL 708 (718)
T ss_pred HHHHHhhCCCCChhHHHHhhcc
Confidence 9999999999999999999987
No 5
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=100.00 E-value=1.7e-64 Score=480.83 Aligned_cols=238 Identities=36% Similarity=0.548 Sum_probs=222.6
Q ss_pred CccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHHH
Q psy12854 7 KTVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEERV 86 (265)
Q Consensus 7 ~~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~ 86 (265)
+.++.|+++++|||++|.+++|+||+++||.|||||||||||++|++++|+.++|+ ++++||||+|||+||+|+|||++
T Consensus 180 ~~~~~P~~~l~~nf~~~~~~~p~lL~~~~v~tLfHEfGHalH~~ls~~~~~~l~~~-~~~~d~~E~pS~~~E~~~~d~~v 258 (422)
T cd06456 180 GLGQKPVAYLVCNFTKPAGGKPALLTHDEVTTLFHEFGHALHHLLTDVEYPSLGGT-NVEWDFVELPSQFMENWAWEPEV 258 (422)
T ss_pred CCCCCCEEEEECCCCCCCCCCCCccCHHHHHHHHHHHHHHHHHHHhcCCccccCCC-cCchhHhhccHHHHHHHhcCHHH
Confidence 56789999999999999999999999999999999999999999999999999887 49999999999999999999999
Q ss_pred HHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC----CcHHHHHHHHccccCCCCCCCCCC
Q psy12854 87 FAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK----PHWSEISRELYPLHFGFPIDKYSN 161 (265)
Q Consensus 87 L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~----~~~~~l~~~l~~~~~~~~~~~~~~ 161 (265)
|+.+++|++|+++||++ +++++++++++.++.+++|+.+|.||+.+|...+ +.+.++++++.+++.+++..++++
T Consensus 259 L~~~s~h~~t~~~lp~~l~~~~~~~~~~~~~~~~~~ql~~a~fD~~lh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 338 (422)
T cd06456 259 LKLFAKHYETGEPLPDELIDKLLAARNFNSGFATVRQLEFALLDLALHSLTDPEILDVVQFELDALRKEGLVIPPPPPRY 338 (422)
T ss_pred HHHHHhhcCCCCCCCHHHHHHHHHHhhHhHHHHHHHHHHHHHHHHHHhcCCcccccchHHHHHHHHHHhcCCCCCCCCCC
Confidence 99999999999999999 9999999999999999999999999999998753 246778888887777777667889
Q ss_pred cCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccH
Q psy12854 162 LPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHS 241 (265)
Q Consensus 162 ~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~ 241 (265)
++++|+|||+++|+|+||||+||+++|+|+|++|++++. +| +++|++||++||++||+++|
T Consensus 339 ~~~~f~Hi~~~gY~A~YYsYlws~vla~di~~~f~~~~~----~~---------------~~~G~~~r~~iL~~Ggs~~~ 399 (422)
T cd06456 339 FSNYFSHIFSGGYAAGYYSYKWAEVLDADAFSAFEEEGI----FN---------------RETGRRFRDTILSKGGSRDP 399 (422)
T ss_pred CCCccCcCCCCCcchhhHHHHHHHHHHHHHHHHHHhcCC----CC---------------HHHHHHHHHHHhhcCCCcCH
Confidence 999999999879999999999999999999999888664 55 99999999999999999999
Q ss_pred HHHHHHhhCCCCCchHHHHhcCC
Q psy12854 242 SEVFRRFRGRDPCFKPFLDMFRL 264 (265)
Q Consensus 242 ~e~l~~flGr~ps~~a~~~~~g~ 264 (265)
.+++++||||+||++||++.+|+
T Consensus 400 ~e~~~~F~Gr~p~~~a~l~~~gl 422 (422)
T cd06456 400 MELFRAFRGRDPSIEALLRRRGL 422 (422)
T ss_pred HHHHHHhcCCCCChHHHHHHcCC
Confidence 99999999999999999999986
No 6
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=100.00 E-value=3.4e-62 Score=469.67 Aligned_cols=231 Identities=27% Similarity=0.409 Sum_probs=213.8
Q ss_pred CccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHHH
Q psy12854 7 KTVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEERV 86 (265)
Q Consensus 7 ~~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~ 86 (265)
|.+|+|+++|+|||++|++++|+||+|+||.||||||||+||++|++++|+.++||+ +|+||||+||||||+|+|++++
T Consensus 220 g~~q~Pv~~lvcnf~~p~~~~p~lL~~~~v~TLfHEfGHalH~~ls~~~~~~~sgt~-~~~d~vE~pS~~~E~~~~~~~~ 298 (458)
T cd06457 220 GTYQLPVVALMCNFPPPSPSGPTLLSPHEVETLFHEMGHAMHSMLGRTEYQHVSGTR-CATDFVEVPSILMEYFASDPRV 298 (458)
T ss_pred CceeCCeEEEECCCCCCCCCCCCCcCHHHHHHHHHHHhHHHHHHHcCCCccccCCCC-CCcchhhcCHHHHHHHHhhHHH
Confidence 677999999999999999999999999999999999999999999999999999985 9999999999999999999999
Q ss_pred HHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCCC---cHHHHHHHHccccCCCCCCCC-CC
Q psy12854 87 FAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPKP---HWSEISRELYPLHFGFPIDKY-SN 161 (265)
Q Consensus 87 L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~~---~~~~l~~~l~~~~~~~~~~~~-~~ 161 (265)
|+.+++|++|++++|++ +++++++++++.++.++||+.+|.||+.+|...+. .+.++++++.+++.+++..++ ++
T Consensus 299 L~~~a~h~~t~e~ip~~l~~~l~~~~~~~~~~~~~rq~~~a~fD~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 378 (458)
T cd06457 299 LKLFARHYSTGEPLPEEMLARLLASKNSFAALETQQQIVYALLDQELHGEQPLSPTFTSDVLRDSTEIFYGLPYVPGGTA 378 (458)
T ss_pred HHHHhcccCCCCcCcHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHhCCCcccccCHHHHHHHHHHHhcCCCCCCCCCC
Confidence 99999999999999999 99999999999999999999999999999976532 467999999999888776665 78
Q ss_pred cCccccccCCCCccCCchhhHhHHHHHHHHHHH-HHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCcc
Q psy12854 162 LPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYA-FKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCH 240 (265)
Q Consensus 162 ~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~-f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~ 240 (265)
++++|+||+ +|+|+||||+||+++|+|+|++ |++++ +| +++|++||++||++|||++
T Consensus 379 ~~~~f~Hl~--gy~a~YYsYl~a~vla~di~~~~f~~~~-----~n---------------~~~g~~y~~~iL~~Ggs~~ 436 (458)
T cd06457 379 WQLRFGHLV--GYGATYYSYLFDRAIASKIWQKLFAADP-----LS---------------REAGERLREELLKHGGGKD 436 (458)
T ss_pred Ccccccccc--CccccchHHHHHHHHHHHHHHHHHHhCC-----CC---------------HHHHHHHHHHHccCCCCcC
Confidence 899999998 6999999999999999999975 66654 45 8999999999999999999
Q ss_pred HHHHHHHhhCCCCCchHHHH
Q psy12854 241 SSEVFRRFRGRDPCFKPFLD 260 (265)
Q Consensus 241 ~~e~l~~flGr~ps~~a~~~ 260 (265)
|.+++++||||+|+.+....
T Consensus 437 p~e~l~~flGrdp~~~~~~~ 456 (458)
T cd06457 437 PWELLAGVLGKPPLVKGGAG 456 (458)
T ss_pred HHHHHHHHcCCCCCCCcccc
Confidence 99999999999999987643
No 7
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=100.00 E-value=3.3e-60 Score=457.51 Aligned_cols=235 Identities=34% Similarity=0.560 Sum_probs=219.0
Q ss_pred CccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHHH
Q psy12854 7 KTVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEERV 86 (265)
Q Consensus 7 ~~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~ 86 (265)
|.+|+|+++|+|||++|.+++|+||+++||.||||||||+||++|++++++.++|++ +|+||||+|||+||+|+|+|++
T Consensus 235 g~~~~P~~~i~~Nf~~~~~~~p~ll~~~~V~TLfHEfGHalH~~ls~~~~~~~sg~~-~~~d~aE~pS~~~E~~~~~~~~ 313 (472)
T cd06455 235 GSRQYPVAALVCNFPKPTADKPSLLRHDEVETFFHEFGHVIHHLLGRTKYARFSGTR-VERDFVEAPSQMLENWCWEPEV 313 (472)
T ss_pred CCEeCCEEEEECcCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHhcCCCccccCCCc-CChhhhhcchHHHHHHhcCHHH
Confidence 456789999999999999999999999999999999999999999999999999985 9999999999999999999999
Q ss_pred HHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCCC--cHHHHHHHHccccCCCCCCCCCCcC
Q psy12854 87 FAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPKP--HWSEISRELYPLHFGFPIDKYSNLP 163 (265)
Q Consensus 87 L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~~--~~~~l~~~l~~~~~~~~~~~~~~~~ 163 (265)
|+.+++|++|+++||++ +++++++++++.++.+++|+.+|.||+.+|...+. .+.+++.++++++.+.+..+++++|
T Consensus 314 l~~l~~h~~t~e~i~~~li~~~~~~~~~~~~~~~~~q~~~a~fd~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (472)
T cd06455 314 LKRLSKHYKTGEKIPDELIERLIASRHFNRGLFYLRQLFFALFDLALHTGDPADLDTTKLYNDLREEISLIPSTEGTHGP 393 (472)
T ss_pred HHHHhhccCCCCCCCHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHHHhCCCCCCCCCCcc
Confidence 99999999999999999 99999999999999999999999999999987653 6899999999999887666667899
Q ss_pred ccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccHHH
Q psy12854 164 CRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHSSE 243 (265)
Q Consensus 164 ~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~~e 243 (265)
++|+|++ ++|+|+||+|+||+++|+++|++|++++. +| +++|++||++||++|||++|.+
T Consensus 394 ~~f~H~~-~~Y~a~yY~Y~~a~~la~~~~~~~~~~~~----~~---------------~~~g~~y~~~~L~~Ggs~~p~e 453 (472)
T cd06455 394 ASFGHLA-GGYDAGYYGYLWSEVFAADMFSSFFKDGL----LN---------------PEVGLRYRDTVLAPGGSKDAAD 453 (472)
T ss_pred ccccccc-cCcccchHHHHHHHHHHHHHHHHHHhcCC----CC---------------HHHHHHHHHHHhcCcCCcCHHH
Confidence 9999999 46899999999999999999999877664 45 8999999989999999999999
Q ss_pred HHHHhhCCCCCchHHHHhc
Q psy12854 244 VFRRFRGRDPCFKPFLDMF 262 (265)
Q Consensus 244 ~l~~flGr~ps~~a~~~~~ 262 (265)
++++||||+|+.+||++++
T Consensus 454 ll~~flGr~~~~~a~~~~~ 472 (472)
T cd06455 454 MLKDFLGREPNNDAFLKSL 472 (472)
T ss_pred HHHHHhCCCCChhHHhhcC
Confidence 9999999999999999975
No 8
>KOG2090|consensus
Probab=100.00 E-value=6.7e-56 Score=421.17 Aligned_cols=233 Identities=24% Similarity=0.366 Sum_probs=219.2
Q ss_pred CCccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHH
Q psy12854 6 TKTVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEER 85 (265)
Q Consensus 6 ~~~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~ 85 (265)
+|.+|+||.+|+|||..+....|++|++.+|+|||||||||||++|++|+|++++||| ||.||||+||.+||.|++|+.
T Consensus 450 Dg~yQlPVi~L~cnf~rss~~s~t~L~~~~vetLFHEmGHAMHSmLGrT~YQhvtGTR-c~tDfaEiPSiLMEyFa~D~r 528 (704)
T KOG2090|consen 450 DGTYQLPVIVLVCNFVRSSQSSPTFLSLSEVETLFHEMGHAMHSMLGRTHYQHVTGTR-CPTDFAEIPSILMEYFANDYR 528 (704)
T ss_pred CCCeeceeeEEeecccccccCCCcccCHHHHHHHHHHHHHHHHHHhccchhccccCcc-cchhHhhhhHHHHHHHhcchH
Confidence 4677999999999999999999999999999999999999999999999999999997 999999999999999999999
Q ss_pred HHHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCC-C---CcHHHHHHHHccccCCCCCCCCC
Q psy12854 86 VFAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGP-K---PHWSEISRELYPLHFGFPIDKYS 160 (265)
Q Consensus 86 ~L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~-~---~~~~~l~~~l~~~~~~~~~~~~~ 160 (265)
+|..+++||.|++++|.+ +.++..+++.+.+.++.+|+++|++|+.+|... + ....+++.++.+++.+++..+++
T Consensus 529 Vl~~~aRhy~t~e~l~~~mv~~l~~s~n~~Aa~e~q~Qv~ya~~Dq~fhg~~~~~~~~~~~~~~~~v~~k~~~~~~~~~t 608 (704)
T KOG2090|consen 529 VLRFFARHYSTGEPLPEDMVNRLCESRNSFAAQETQRQVFYALLDQEFHGIACPLIAEDTTDLLSEVKRKFSGLLYVPPT 608 (704)
T ss_pred HHHHHHHHcCCCCCCCHHHHHHHHHHhccchhHHHHHHHHHHHHHHHHhcccccccccchhHHHHHHHHhcCCCCCCCCC
Confidence 999999999999999999 999999999999999999999999999999942 2 24678999999999998888899
Q ss_pred CcCccccccCCCCccCCchhhHhHHHHHHHHHHH-HHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCc
Q psy12854 161 NLPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYA-FKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSC 239 (265)
Q Consensus 161 ~~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~-f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~ 239 (265)
.|+.+|+|++ +|+|.||||++++++|.-||++ |.+||+ + +++|++||++||+.||++
T Consensus 609 aw~~rFsHl~--gYGA~YYSYL~~r~~AS~IWq~~Fe~dPf-----s---------------R~aGek~r~eil~hGG~~ 666 (704)
T KOG2090|consen 609 AWQLRFSHLV--GYGATYYSYLFARAIASLIWQQLFENDPF-----S---------------RKAGEKFRKEILKHGGGR 666 (704)
T ss_pred ccccchhhhh--ccCchHHHHHHHHHHHHHHHHHHHhcCcc-----c---------------hhhhHHHHHHHHHhcCCC
Confidence 9999999998 7999999999999999999986 888885 5 999999999999999999
Q ss_pred cHHHHHHHhhCCCCCchHHHHh
Q psy12854 240 HSSEVFRRFRGRDPCFKPFLDM 261 (265)
Q Consensus 240 ~~~e~l~~flGr~ps~~a~~~~ 261 (265)
+|+++++++||++|..+.+.+.
T Consensus 667 ~P~~lva~~L~~~~~~~g~~~A 688 (704)
T KOG2090|consen 667 DPAELVADILGKPPLENGGVDA 688 (704)
T ss_pred ChHHHHHHHhcCCCCccchHHH
Confidence 9999999999999998776653
No 9
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=100.00 E-value=5.6e-56 Score=426.16 Aligned_cols=239 Identities=33% Similarity=0.531 Sum_probs=216.5
Q ss_pred CCccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHH
Q psy12854 6 TKTVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEER 85 (265)
Q Consensus 6 ~~~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~ 85 (265)
+|++|+|+++|+|||++|.+++|+||+++||.|||||||||||++|++++++.++|+ ++|+|+||+||+|||+|+|+++
T Consensus 213 ~~~~~~~~~~i~~n~~~~~~~~~~ll~~~~v~tLfHE~GHa~H~~ls~~~~~~~sg~-~~~~d~aE~~S~~~E~~~~~~~ 291 (458)
T PF01432_consen 213 DGERQLPVPYIFCNFTGPSAGKPSLLSHDDVETLFHEFGHAMHSLLSRTKYQHLSGT-RVPMDFAEFPSQFMENWLWDPL 291 (458)
T ss_dssp TSTCECEEEEEEEEE-S-BTTC--B-SHHHHHHHHHHHHHHHHHHHCCCSSGGGSTT-SS-CHHCHHHHHHHHHHGGCHH
T ss_pred cccCCCCceEEEecCCCCCCCCCCccChhhHHHHHHHHhHHHHHHHhccccccccCC-chhHHHHhcchHHHHHhhhchh
Confidence 578999999999999999999999999999999999999999999999999999998 5999999999999999999999
Q ss_pred HHHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCCC------cHHHHHHHHccccCCCCCCC
Q psy12854 86 VFAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPKP------HWSEISRELYPLHFGFPIDK 158 (265)
Q Consensus 86 ~L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~~------~~~~l~~~l~~~~~~~~~~~ 158 (265)
+++.+++|++++++||++ +++++++++.+.+..+.+|+.++.||+.+|...+. .+.++++++++++.+.+..+
T Consensus 292 ~l~~~~~~~~~~~~i~~~~l~~~~~~~~~~~~~~~~~ql~~a~fd~~~h~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 371 (458)
T PF01432_consen 292 VLKAFSRHYETGEPIPEELLEDLIASRNFFAAIFLFRQLLFALFDQELHESPEDGEPESEDLNELYRELQKEYYGDPSDP 371 (458)
T ss_dssp HHHHH-BSTTTHHHHHHHHHHHHHHTTTTTHHHHHHHHHHHHHHHHHHCCHCCCTHHHHHHHHHHHHHHHHHCCTTTCBT
T ss_pred hhhhhccChhhhhhHHHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHhccccCCccchhHHHHHHHHHHHhcCcchhc
Confidence 999999999999999999 99999999999999999999999999999998763 36799999999999888778
Q ss_pred CCCcCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCC
Q psy12854 159 YSNLPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGS 238 (265)
Q Consensus 159 ~~~~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs 238 (265)
+..++++|.|+++++|+++||+|+||+++|+++|++|++++. +| +++|++||+++|++|||
T Consensus 372 ~~~~~~~f~hl~~~~Y~a~yY~Y~ya~~~a~~~~~~~~~~~~----~n---------------~~~g~~~~~~~L~~Ggs 432 (458)
T PF01432_consen 372 DDYFPASFSHLFSHFYAAGYYSYLYAEVLAADLFSQFFEEDP----LN---------------RETGRRFYKEFLSPGGS 432 (458)
T ss_dssp TCCGGGGGGGGHCCCGTTTTTCHHHHHHHHHHHHHHHHHCHT----TC---------------HHHHHHHHHHHCTTTTS
T ss_pred cccccceehhhccCccCCCCchhHHHHHHHHHHHHHHHhcCC----cc---------------hHHHHHHHHHHhcCCCC
Confidence 888999999997678999999999999999999999877543 45 89999999999999999
Q ss_pred ccHHHHHHHhhCCCCCchHHHHhcCC
Q psy12854 239 CHSSEVFRRFRGRDPCFKPFLDMFRL 264 (265)
Q Consensus 239 ~~~~e~l~~flGr~ps~~a~~~~~g~ 264 (265)
++|.+++++|+||+|+.++|++.+|+
T Consensus 433 ~~~~e~l~~~~g~~~~~~~~~~~~~~ 458 (458)
T PF01432_consen 433 KDPLELLKKFLGREPSPDAFLKALGL 458 (458)
T ss_dssp S-HHHHHHHCCSSTTSSHHHHHHHT-
T ss_pred CCHHHHHHHhCCCCCChHHHHHHcCC
Confidence 99999999999999999999999986
No 10
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=100.00 E-value=2.5e-49 Score=369.68 Aligned_cols=234 Identities=24% Similarity=0.329 Sum_probs=211.5
Q ss_pred cccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHHHHH
Q psy12854 9 VITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEERVFA 88 (265)
Q Consensus 9 ~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~L~ 88 (265)
++.|+++|+|||.+|.+++|+|+++++|.|||||||||+|+++++++++.++|+ ++++||+|+||++||+|++++++|+
T Consensus 128 ~~~~~~~i~~n~~~~~~~~~~ll~~~~v~tl~HE~GHa~h~~l~~~~~~~~~g~-~~~~~~~E~~S~~~E~~~~~~~~L~ 206 (365)
T cd06258 128 RQDKDVRILANFTSPAAPDPVLLGHDDINTLFHEFGHAVHFLLIQQRYPFQERT-PTSTDFAEAQSMFLESFATDPEWLE 206 (365)
T ss_pred CCCCeEEEEccCCCCCCCCCCcCCHHHHHHHHHHHhHHHHHHHhcCCCCcCCCC-CCCccHHhccHHHHHHHHCCHHHHH
Confidence 367999999999999999999999999999999999999999999998888887 5999999999999999999999999
Q ss_pred HHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCCC--cHHHHHHHHccccCCCCCCCCCCcCcc
Q psy12854 89 ELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPKP--HWSEISRELYPLHFGFPIDKYSNLPCR 165 (265)
Q Consensus 89 ~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~~--~~~~l~~~l~~~~~~~~~~~~~~~~~~ 165 (265)
.++.|++++ ++|.+ ++++++.+....+..+.+|+.++.||+.+|..... .+.++|.++.+++.+++..+.+.++++
T Consensus 207 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~l~~~~~~~~~~~~~~~~l~~~~~g~~~~~~~~~~~~ 285 (365)
T cd06258 207 RYARHYQGG-VVPDELIEKLIAARLPNTLYETRRILVVAKFEKALYENPDRELELQKLWRDLVKEILGVRPDPSTPDPAA 285 (365)
T ss_pred HHhhhcCCC-CCcHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHhCCCCccCCHHHHHHHHHHHhcCCCCCCCCCCCCc
Confidence 999999988 89999 99999998888889999999999999999986543 488999999999888766666788889
Q ss_pred ccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccHHHHH
Q psy12854 166 FVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHSSEVF 245 (265)
Q Consensus 166 f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~~e~l 245 (265)
|.|+++ ||+++||+|+||+++|+++|++|+++.. +...++++|++||++||++||++++.|++
T Consensus 286 ~~H~~~-gy~~yyy~Y~~~~v~a~qi~~~~~~~~~----------------~~~~~~~~g~~l~~~il~~G~s~~~~el~ 348 (365)
T cd06258 286 FPHLAG-GSPAYYYGYLLAEMLASQLRATFKKKVG----------------YLTDNPEAGPRLREHILRPGNSEPWKELL 348 (365)
T ss_pred cchhcc-CcccchHHHHHHHHHHHHHHHHHHhcCC----------------CCCCCHHHHHHHHHHHccCcCCcCHHHHH
Confidence 999985 4999999999999999999999877531 00123899999998899999999999999
Q ss_pred HHhhCCCCCchHHHHh
Q psy12854 246 RRFRGRDPCFKPFLDM 261 (265)
Q Consensus 246 ~~flGr~ps~~a~~~~ 261 (265)
++|+||+||.++|++.
T Consensus 349 ~~~~G~~~~~~a~~~~ 364 (365)
T cd06258 349 KRATGEDPNADAFLDH 364 (365)
T ss_pred HHHcCCCCChHHHHhc
Confidence 9999999999999985
No 11
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=100.00 E-value=2.4e-33 Score=277.33 Aligned_cols=208 Identities=16% Similarity=0.141 Sum_probs=164.6
Q ss_pred CCCCcccc--ceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHh
Q psy12854 4 FGTKTVIT--PLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFL 81 (265)
Q Consensus 4 ~g~~~~~~--p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~ 81 (265)
+|++|... +..+|+|||++ +++||.||||||||+||+++++..++..++ ++|+++||+||++||+|+
T Consensus 354 ~Ga~~~~~~~~~p~il~N~~~---------~~~dv~TLaHElGHa~H~~~~~~~~~~~~~--~~~~~~aE~aS~~~E~l~ 422 (591)
T TIGR00181 354 SGAYSIGGYKVKPYILMNWDG---------TLNSVFTLAHELGHSMHSYFSSKHQPYPNS--DYSIFYAEIASTFNELLL 422 (591)
T ss_pred CCcccCCCCCCCCeEEEecCC---------CcchHHHHHHHhhhHHHHHHHccCCCCccC--CCCchhhhHhHHHHHHHH
Confidence 46665444 56899999999 999999999999999999999866665555 589999999999999999
Q ss_pred ccHHHHHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC-------CcHHHHHHHHccccCC
Q psy12854 82 YEERVFAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK-------PHWSEISRELYPLHFG 153 (265)
Q Consensus 82 ~~~~~L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~-------~~~~~l~~~l~~~~~~ 153 (265)
+++ +..|+ ++++.+.. +++.+. +++. .+++|+.++.||+.+|...+ +.++++|.++.+++.|
T Consensus 423 ~~~-----l~~~~-~~~~~k~~~l~~~l~--~~~~--~~~~q~~~~~Fe~~~~~~~~~~~~lt~~~l~~~~~~~~~~~~g 492 (591)
T TIGR00181 423 ADY-----LLKNS-NDPEMKIYILLERIS--NFFG--TFTRQTLFAEFEYEAYELIEEGEPLTAETLNEIYANLLKKYFG 492 (591)
T ss_pred HHH-----HHhhC-CCHHHHHHHHHHHHH--HHHH--HHHHHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHcC
Confidence 987 55666 55667777 777776 2332 67899999999999996421 2467899999888754
Q ss_pred C--CCCCC-CCcCccccccCCCCccCCchhhHhHHHHHHHH--HHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHH
Q psy12854 154 F--PIDKY-SNLPCRFVEVGSGDLAAGYYSFLWSKLVSADI--FYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRL 228 (265)
Q Consensus 154 ~--~~~~~-~~~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv--~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~y 228 (265)
. ..... ..+|.+++|+ |.+ ||+|.||.++++++ |+++++++ +.++++|
T Consensus 493 ~~~~~~~~~~~~w~~~~Hf----y~~-fY~Y~Ya~g~~~a~~l~~~~~~~~----------------------~~~~~~Y 545 (591)
T TIGR00181 493 DLVKIDEGAGLTWMRIPHF----YMG-FYVYKYATGQVAATALYEKIKEEG----------------------KGAVEKY 545 (591)
T ss_pred CccccCCcccceeeecCcc----cCC-CcCHHHHHHHHHHHHHHHHHHHCC----------------------ccHHHHH
Confidence 2 22222 4567789997 665 99999988655555 88898876 3589999
Q ss_pred HHHHhcCCCCccHHHHHHHhhCCCCCchHHHHh
Q psy12854 229 RDTFLTFGGSCHSSEVFRRFRGRDPCFKPFLDM 261 (265)
Q Consensus 229 R~~iL~~Ggs~~~~e~l~~flGr~ps~~a~~~~ 261 (265)
+ ++|++|||++|.++++. +|-+++..++++.
T Consensus 546 ~-~~L~~Ggs~~p~ell~~-~Gvd~~~~~~~~~ 576 (591)
T TIGR00181 546 L-KFLKSGGSKYPLETLKI-AGVDLTKPQPWQA 576 (591)
T ss_pred H-HHHhCcCCCCHHHHHHH-cCcCCCChHHHHH
Confidence 7 99999999999999999 5999999888764
No 12
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=100.00 E-value=4.4e-33 Score=275.33 Aligned_cols=210 Identities=13% Similarity=0.152 Sum_probs=158.6
Q ss_pred CCCCccccce---EEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHH
Q psy12854 4 FGTKTVITPL---NTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHF 80 (265)
Q Consensus 4 ~g~~~~~~p~---~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~ 80 (265)
+|++|..+|. .+|+|||++ ++++|.||||||||++|++++++.... ++ ++|+|+||+||+|||+|
T Consensus 350 ~Ga~~~~~~~~~~p~i~~N~~~---------~~~~v~TL~HE~GHa~H~~ls~~~~~~--~~-~~~~~~aE~~S~~~E~l 417 (587)
T TIGR02290 350 GGAFCTGFPPSKEPRVLMNYDG---------SRRDVSTLAHELGHAYHSELAKDQPLL--NA-RYPMTLAETASIFAEML 417 (587)
T ss_pred CCcccCCCCCCCCCEEEEecCC---------CchhHHHHHHHhhHHHHHHHHccCCcc--cC-CCCchhhhHHHHHHHHH
Confidence 4666655432 579999999 899999999999999999999874432 32 48999999999999999
Q ss_pred hccHHHHHHHHhhhcCCcccHHHHHHHHHHHhhhhhHH-HHHHHHHHHHHHHHhcCC--C----CcHHHHHHHHccccCC
Q psy12854 81 LYEERVFAELNSHFATGEKLKMTDEQLKALRAHNAGID-VCSELFKANLDLQLHNGP--K----PHWSEISRELYPLHFG 153 (265)
Q Consensus 81 ~~~~~~L~~ls~h~~t~~~lp~~~~~l~~~~~~~~~~~-~~~ql~~a~~D~~lh~~~--~----~~~~~l~~~l~~~~~~ 153 (265)
+|++ .|+.+ .+ ++....++.. ....++. +++|+.++.||+.+|... + ..+.+++.++.+++.|
T Consensus 418 ~~~~-ll~~~----~~----~~~~~~~~~~-~l~~~~~~~~~q~~~~~fE~~l~~~~~~~~lt~~~l~~~~~~~~~~~~g 487 (587)
T TIGR02290 418 LFDA-LLKEA----KT----DEEKLSLLAE-KLEDAIATLVRIHARFLFERRFHEARKEGELSADDICDLMLEAQKESYG 487 (587)
T ss_pred HHHH-HHhhC----CC----HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHhC
Confidence 9997 34432 21 1113334433 4455553 689999999999999642 1 2367888888877644
Q ss_pred --CCCCC-CCCcCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHH
Q psy12854 154 --FPIDK-YSNLPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRD 230 (265)
Q Consensus 154 --~~~~~-~~~~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~ 230 (265)
++..+ ...+|.+++|++..+| .||.|+||.++|.++|.+|++++ +.+.++|+
T Consensus 488 ~~~~~~~~~~~~w~~~~H~y~~~f--Y~y~Y~~g~~~a~~l~~~~~~~~----------------------~~~~~~y~- 542 (587)
T TIGR02290 488 DALDESELHPYMWAYKPHFYHAPF--YNYPYTFGYLFVLGLYAKYREEG----------------------ESFVPKYI- 542 (587)
T ss_pred cccccCCCCCceeecCCeecCCCC--cChhhHHHHHHHHHHHHHHHHCC----------------------hhHHHHHH-
Confidence 23222 3567788999953222 23688899999999999998876 35778895
Q ss_pred HHhcCCCCccHHHHHHHhhCCCCCchHHHHh
Q psy12854 231 TFLTFGGSCHSSEVFRRFRGRDPCFKPFLDM 261 (265)
Q Consensus 231 ~iL~~Ggs~~~~e~l~~flGr~ps~~a~~~~ 261 (265)
+||+.||+++|.+++++ +|++|+.++|++.
T Consensus 543 ~~L~~Ggs~~p~ell~~-~G~d~~~~~~~~~ 572 (587)
T TIGR02290 543 ALLRDTGSMTPEELVKK-FGFDLTSPDFWQK 572 (587)
T ss_pred HHHHCcCCCCHHHHHHH-hCcCCCChHHHHH
Confidence 99999999999999999 8999999999974
No 13
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=100.00 E-value=1.3e-32 Score=261.03 Aligned_cols=211 Identities=18% Similarity=0.193 Sum_probs=162.4
Q ss_pred CCCCccccc---eEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHH
Q psy12854 4 FGTKTVITP---LNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHF 80 (265)
Q Consensus 4 ~g~~~~~~p---~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~ 80 (265)
+|++|...+ .++|+|||.+ ++++|.|||||||||+|.++++..++.+.+ ++|+|++|+||++||+|
T Consensus 197 ~gaf~~~~~~~~~p~i~~n~~~---------~~~~v~tl~HE~GHa~h~~~~~~~~~~~~~--~~~~~~~E~~S~~~E~~ 265 (427)
T cd06459 197 SGAYCTGLPPGKHPFILMNFNG---------TLDDVFTLAHELGHAFHSYLSRDNQPYLYS--DYPIFLAEIASTFNELL 265 (427)
T ss_pred CCeecCCCCCCCCCeEEecCCC---------ChhhHHHHHHHhhHHHHHHHHccCCCcccC--CCCchhhHHHHHHHHHH
Confidence 355555443 4689999998 899999999999999999999876553332 48999999999999999
Q ss_pred hccHHHHHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC-------CcHHHHHHHHccccC
Q psy12854 81 LYEERVFAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK-------PHWSEISRELYPLHF 152 (265)
Q Consensus 81 ~~~~~~L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~-------~~~~~l~~~l~~~~~ 152 (265)
++++ +..|+++.+ .+.. +..++.. .. ..+.+|+.++.||+.+|...+ +...++|+++.+++.
T Consensus 266 ~~~~-----l~~~~~~~~-~~~~~~~~~~~~---~~-~~~~~~~~~~~fe~~l~~~~~~~~~~~~~~~~~~~~~~~~~y~ 335 (427)
T cd06459 266 LFDY-----LLKFAKDPE-EKLYLLEHLLED---IR-ATLPRQTMFAEFEHEVYENPEEGEPLTAEELNEIYRELEKKYG 335 (427)
T ss_pred HHHH-----HHHhCCCHH-HHHHHHHHHHHH---HH-HHHHHHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHhc
Confidence 9985 456666531 2223 3333321 11 267899999999999997421 246788999988887
Q ss_pred CCCC---CCCCCcCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHH
Q psy12854 153 GFPI---DKYSNLPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLR 229 (265)
Q Consensus 153 ~~~~---~~~~~~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR 229 (265)
+... ......|.+|.|++.+++ +||+|+||.++|.++|++|++++ +.++++|+
T Consensus 336 g~~~~~~~~~~~~w~~~~H~~~~~f--Yyy~Y~~g~~~a~~l~~~~~~~~----------------------~~~~~~y~ 391 (427)
T cd06459 336 GDLVEIDEEHGYEWARIPHFYYVPF--YVYPYAFGQLAALALYAKYKEDG----------------------EGAVEKYL 391 (427)
T ss_pred CCccccCCccCeeeeecCeecCCCC--cChHHHHHHHHHHHHHHHHHHcC----------------------ccHHHHHH
Confidence 7532 233445668999975322 47899999999999999999876 46899995
Q ss_pred HHHhcCCCCccHHHHHHHhhCCCCCchHHHHh
Q psy12854 230 DTFLTFGGSCHSSEVFRRFRGRDPCFKPFLDM 261 (265)
Q Consensus 230 ~~iL~~Ggs~~~~e~l~~flGr~ps~~a~~~~ 261 (265)
++|+.||+++|.++++.+ |.+++...+++.
T Consensus 392 -~~L~~Ggs~~~~ell~~~-g~d~~~~~~~~~ 421 (427)
T cd06459 392 -ELLKAGGSKSPLELLKKA-GVDLTSPDFWEE 421 (427)
T ss_pred -HHHHccCCCCHHHHHHHc-CcCCCChHHHHH
Confidence 999999999999999975 999999988875
No 14
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=99.97 E-value=2e-29 Score=247.62 Aligned_cols=204 Identities=12% Similarity=0.077 Sum_probs=153.8
Q ss_pred CCCCccccce---EEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHH
Q psy12854 4 FGTKTVITPL---NTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHF 80 (265)
Q Consensus 4 ~g~~~~~~p~---~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~ 80 (265)
+|++|..+|. .+|.|||++ +++||.||+||||||+|+++++.... ...+..|+|+||+||++||+|
T Consensus 312 ~Gayc~~~~~~~~P~I~~Nf~~---------t~~dv~TL~HElGHa~H~~~s~~~~~--~~~~~~~~~~aE~aS~~~E~l 380 (549)
T TIGR02289 312 AGGYCTYLPKYKAPFIFSNFNG---------TSGDIDVLTHEAGHAFHVYESRKDLL--PEYRWPTYEAAELASMSMELL 380 (549)
T ss_pred CCcccCCCCCCCCcEEEEeCCC---------ChhHHHHHHHHhhHHHHHHHhcCCcc--cccccCcchhhhhhHHHHHHH
Confidence 5677766553 699999999 99999999999999999999986322 222346899999999999999
Q ss_pred hccHHHHHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC---CcHHHHHHHHccccCC-CC
Q psy12854 81 LYEERVFAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK---PHWSEISRELYPLHFG-FP 155 (265)
Q Consensus 81 ~~~~~~L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~---~~~~~l~~~l~~~~~~-~~ 155 (265)
+|++ +..+++++++.+.. ++.|..... .+.+|+....|...+|...+ +...++|.++.+++.+ ..
T Consensus 381 ~~~~-----l~~~~~~~e~~~~~~~~~L~~~~~-----~~~~~~~~d~fe~~vy~~~~lt~~e~~~~~~~l~~~y~~~~~ 450 (549)
T TIGR02289 381 SMPW-----WDLFYTDEKDLKKAKISHLKGALS-----FLPYGVIVDHFQHWVYENPNHTPEERDEKYAELEKRYQPSTV 450 (549)
T ss_pred HHHH-----HHHHcCCcchHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHccCCCCCHHHHHHHHHHHHHHhCCCcc
Confidence 9985 66778877777777 766654422 12466677777777776532 2467888888887653 11
Q ss_pred --C---CCCCCcCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHH
Q psy12854 156 --I---DKYSNLPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRD 230 (265)
Q Consensus 156 --~---~~~~~~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~ 230 (265)
. .....+|.+++|++. +...||+|.||+++|.++|.++++++ +.+.++| .
T Consensus 451 ~~~~~~~~~~~~W~~~~H~~~--~pFYyy~Y~~a~~~a~~l~~~~~~~~----------------------~~~~~~Y-~ 505 (549)
T TIGR02289 451 YIAGLELEIGTFWLRQLHIFE--VPFYYIEYTIAQIGALQIYKIYKEDP----------------------EKALKDY-K 505 (549)
T ss_pred cCCccccccCCcceeeceeec--CCCcchhhHHHHHHHHHHHHHHHhCH----------------------HHHHHHH-H
Confidence 1 122345678999973 22235799999999999999998876 4788999 6
Q ss_pred HHhcCCCCccHHHHHHHhhCCCCC
Q psy12854 231 TFLTFGGSCHSSEVFRRFRGRDPC 254 (265)
Q Consensus 231 ~iL~~Ggs~~~~e~l~~flGr~ps 254 (265)
++|+.|||++|.++++.+ |-+..
T Consensus 506 ~~L~~Ggs~~~~ell~~a-Gid~~ 528 (549)
T TIGR02289 506 KLCSAGGSQSFLELYETA-GLTFP 528 (549)
T ss_pred HHHhccCCcCHHHHHHHh-CCCCC
Confidence 999999999999999995 75544
No 15
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=99.93 E-value=2.5e-25 Score=219.86 Aligned_cols=212 Identities=14% Similarity=0.155 Sum_probs=157.5
Q ss_pred cCCCCccccce---EEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHH
Q psy12854 3 IFGTKTVITPL---NTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTH 79 (265)
Q Consensus 3 ~~g~~~~~~p~---~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~ 79 (265)
-+|++|...+. .+|.+||.+ +.+||.||+||+||++|+.+++.+.+.+.. ..|+.+||+||++||.
T Consensus 354 rsGaYs~~~~~~~~p~IlmN~~g---------t~~dV~TLaHElGHs~Hs~~s~~~qp~~~~--~~~i~~AEiAS~fnE~ 422 (598)
T COG1164 354 RSGAYSIGFYKGDHPFILMNYDG---------TLRDVFTLAHELGHSVHSYFSRKHQPYLYA--DYSIFLAEIASTFNEM 422 (598)
T ss_pred CCCcccCCCCCCCCCeEEEeCCC---------chhHHHHHHHHccHHHHHHHHhccCCcccc--CCchHHHHHHHHHHHH
Confidence 47888766555 799999999 999999999999999999999987764554 4789999999999999
Q ss_pred HhccHHHHHHHHhhhcCCcccHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC-------CcHHHHHHHHccccC
Q psy12854 80 FLYEERVFAELNSHFATGEKLKMTDEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK-------PHWSEISRELYPLHF 152 (265)
Q Consensus 80 ~~~~~~~L~~ls~h~~t~~~lp~~~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~-------~~~~~l~~~l~~~~~ 152 (265)
++++. +...+++. ++...++....-..--.+.||+.++.|++.+|...+ +.++++|.++.+++.
T Consensus 423 l~~~~-----ll~~~~~~----~~~~~il~~~l~~~~~t~~rq~~f~~FE~~~h~~~~~~~~lt~~~l~~~~~~l~~~y~ 493 (598)
T COG1164 423 LLFDY-----LLERFKDP----EERLAILEEKLEGFFATLFRQTLFAEFEHRVHELIEEGEELTAEELNELYLELQKEYY 493 (598)
T ss_pred HHHHH-----HHHHcCCH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHhc
Confidence 99985 33333331 222222222111111346799999999999999843 237889999999987
Q ss_pred CC--CCCCC-CCcCccccccCCCCccCCchhhHhHH--HHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHH
Q psy12854 153 GF--PIDKY-SNLPCRFVEVGSGDLAAGYYSFLWSK--LVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSR 227 (265)
Q Consensus 153 ~~--~~~~~-~~~~~~f~Hl~s~~Y~a~YYsYl~s~--~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 227 (265)
|- ..++. ..+|.+++|+ |.++||+|.|+. +.|..++..+++++. +.-++
T Consensus 494 gd~~~~d~~~~~~W~~ipHf----y~~pFYvy~Ya~G~~~a~~l~~~~~~~~~----------------------~~~~~ 547 (598)
T COG1164 494 GDAVKLDELSGLEWARIPHF----YHSPFYVYQYATGQLAALALYAKILTNDA----------------------EAFEK 547 (598)
T ss_pred CCccccCcccccchhhcCee----eecCceehHhHHHHHHHHHHHHHHHhccH----------------------HHHHH
Confidence 63 34443 4578889999 455666666555 677777888887753 34444
Q ss_pred HHHHHhcCCCCccHHHHHHHhhCCCCCchHHHHh
Q psy12854 228 LRDTFLTFGGSCHSSEVFRRFRGRDPCFKPFLDM 261 (265)
Q Consensus 228 yR~~iL~~Ggs~~~~e~l~~flGr~ps~~a~~~~ 261 (265)
|..++|+.|||.+|.|+++ -+|-+.+..++++.
T Consensus 548 ~y~~~Lk~GgS~~p~ell~-~~GiD~~~~~~~~~ 580 (598)
T COG1164 548 YYIAFLKSGGSKSPLELLK-IAGIDLTTPDPWEE 580 (598)
T ss_pred HHHHHHhCCCCCCHHHHHH-HcCCCCCCchHHHH
Confidence 4468999999999999999 99999888777763
No 16
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=99.76 E-value=3.4e-17 Score=154.51 Aligned_cols=230 Identities=12% Similarity=0.090 Sum_probs=151.7
Q ss_pred eEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHH-hhcC-CCCCCcccCCccchhHhhchHHHHHHhcc-HH----
Q psy12854 13 LNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRS-LTAT-HYSDVSGVNNVEWDSVYIINYFLTHFLYE-ER---- 85 (265)
Q Consensus 13 ~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~l-ls~~-~~~~~sg~~~~~~d~~E~pS~~~E~~~~~-~~---- 85 (265)
-+.|.+||.... -.++|.|+.||+|||+|.. ++.. ..+.+.. +.++-+.|.+|.|+|+.+.+ +.
T Consensus 144 dvRItt~y~~~d-------~~~~l~t~iHE~GHalye~~l~~~~~~~pl~~--~~sm~ihESqS~f~E~~vgrs~~F~~~ 214 (396)
T cd06460 144 DVRITTRYDEND-------FRSALFSTIHETGHALYEQGLPPELRGTPLGG--GASMGIHESQSRLWENQVGRSRAFWEF 214 (396)
T ss_pred CceEEeeeCCcc-------hHHHHHHHHHHhhHHHHHhcCCHhHccCCccc--CCCchhHHHHHHHHHHHHhcCHHHHHH
Confidence 467888888721 2788999999999999999 6531 1122322 36899999999999998866 22
Q ss_pred HHHHHHhhhcCC-cccHHH-HHHHHHHHh--hh-----hhHHHHHHHHHHHHHHHHhcCCC--CcHHHHHHHHccccCCC
Q psy12854 86 VFAELNSHFATG-EKLKMT-DEQLKALRA--HN-----AGIDVCSELFKANLDLQLHNGPK--PHWSEISRELYPLHFGF 154 (265)
Q Consensus 86 ~L~~ls~h~~t~-~~lp~~-~~~l~~~~~--~~-----~~~~~~~ql~~a~~D~~lh~~~~--~~~~~l~~~l~~~~~~~ 154 (265)
.+..+..++... .....+ +-.+.+... +. .-...+++++...|+..+....- +.+.++|++..+++.|+
T Consensus 215 l~~~l~~~f~~~~~~~~~e~~~~~~n~v~ps~iRveADEvty~lHiilR~e~Ek~l~~G~l~v~dLp~~W~~~~~eylG~ 294 (396)
T cd06460 215 LYPKLKKTFPEQLKDVSLENFYRAVNRVQPSLIRVEADEVTYNLHIMLRYELEKALIEGDLEVADLPEAWNEKMKEYLGI 294 (396)
T ss_pred HHHHHHHHcccccccCCHHHHHHHHhhcCccceeeecchhhHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHcCC
Confidence 333444444221 112222 333332211 01 00245789999999998887642 34778999999999886
Q ss_pred CCCCCCCcCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhc
Q psy12854 155 PIDKYSNLPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLT 234 (265)
Q Consensus 155 ~~~~~~~~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~ 234 (265)
........+.+-.|...+.+ +.||+|..+.++|+++|++++++... ++ . .-.+|. -....+=+|++|++
T Consensus 295 ~~~~d~~g~lQd~HW~~g~f-GYfptYalG~l~Aaql~~~~~~~~~~---~~---~-~i~~g~---f~~~~~wL~e~i~~ 363 (396)
T cd06460 295 RPPNDAEGCLQDIHWSGGSF-GYFPTYTLGNLYAAQLFAAAKKDLPD---ID---E-QIERGD---FSPLLEWLRENIHQ 363 (396)
T ss_pred CCCCCccccccccccccCcC-CcCchhHHHHHHHHHHHHHHHHhCcc---HH---H-HHhccC---hHHHHHHHHHHHHh
Confidence 54333334445677754322 24689999999999999987765320 11 0 000111 12255667788999
Q ss_pred CCCCccHHHHHHHhhCCCCCchHHHHhc
Q psy12854 235 FGGSCHSSEVFRRFRGRDPCFKPFLDMF 262 (265)
Q Consensus 235 ~Ggs~~~~e~l~~flGr~ps~~a~~~~~ 262 (265)
.|++.+|.|++++..|.+.+.++|++.|
T Consensus 364 ~G~~~~p~eLl~~atG~~l~~~~f~~yL 391 (396)
T cd06460 364 HGSRYSPDELLKKATGEPLNPEYFLEYL 391 (396)
T ss_pred ccCCCCHHHHHHHHhCCCCChHHHHHHH
Confidence 9999999999999999999999999876
No 17
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=99.30 E-value=1e-10 Score=113.29 Aligned_cols=214 Identities=13% Similarity=0.040 Sum_probs=133.0
Q ss_pred ceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhh-c-CCCCCCcccCCccc--hhHhhchHHH---HHHhccH
Q psy12854 12 PLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLT-A-THYSDVSGVNNVEW--DSVYIINYFL---THFLYEE 84 (265)
Q Consensus 12 p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls-~-~~~~~~sg~~~~~~--d~~E~pS~~~---E~~~~~~ 84 (265)
....|.+|+.. +..||.|+.||+||+. -.++ + .++-...|. +.+. .++|+.|+.| |++..-+
T Consensus 233 ~d~rI~~c~~~---------t~~D~~t~~HE~GH~~-yy~~y~~~p~~~r~~a-np~fheav~e~~smS~~tpe~L~~~~ 301 (477)
T cd06461 233 KDFRIKMCTKV---------NMEDFVTVHHEMGHIQ-YYLQYKDQPVLFREGA-NPGFHEAVGDAIALSVSTPKHLHKIG 301 (477)
T ss_pred CCcceeeCCCC---------CHHHHHHHHHHHHHHH-HHHHhccCCHHHhCCC-CCChHHHHHHHHHHhcCCHHHHhhcc
Confidence 35778888887 9999999999999999 4434 3 332223443 3677 8999999999 8887322
Q ss_pred HHHHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC-----CcHHHHHHHHccccCCC-CCC
Q psy12854 85 RVFAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK-----PHWSEISRELYPLHFGF-PID 157 (265)
Q Consensus 85 ~~L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~-----~~~~~l~~~l~~~~~~~-~~~ 157 (265)
.|... .+.....| .. +..++.. +....+...+|..-|.... +.+++.|.+|.++|.++ |+.
T Consensus 302 -ll~~~--~~~~~~~i-~~l~~~al~~--------i~fLP~~~~vD~fr~~vy~~~~~~~e~n~~WweL~~~yqgv~pP~ 369 (477)
T cd06461 302 -LLDSE--VDDEEADI-NFLLKMALDK--------IAFLPFGYLLDKWRWDVFDGEIPKDDYNKAWWELREKYQGVVPPV 369 (477)
T ss_pred -ccccc--CCCcHHHH-HHHHHHHHHH--------HHHhhHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhcCCCCCC
Confidence 22111 01111111 11 2222211 2233466777887665543 34788899998888775 222
Q ss_pred C---CCCcCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhc
Q psy12854 158 K---YSNLPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLT 234 (265)
Q Consensus 158 ~---~~~~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~ 234 (265)
+ ....+...-|+-. ...|..|..|.+++.++|+.+.+.....--++. =+-..++++|++|+ ++|+
T Consensus 370 ~r~~~~fd~gak~h~~~---~~PYi~YflA~ilqfQf~~~lc~~a~~~~plh~--------C~~~~s~~Ag~~l~-~~l~ 437 (477)
T cd06461 370 PRSEEDFDPGAKYHVPA---NTPYIRYFLSFILQFQFHKALCKAAGHTGPLHK--------CDIYGSKEAGKKLR-AMLS 437 (477)
T ss_pred CCCccccCccceeeccC---CCcHHHHHHHHHHHHHHHHHHHHhccCCCCccc--------ccccChHHHHHHHH-HHHh
Confidence 2 1223444556642 444789999999999999886543110000000 00012378999995 8999
Q ss_pred CCCCccHHHHHHHhhCC-CCCchHHHH
Q psy12854 235 FGGSCHSSEVFRRFRGR-DPCFKPFLD 260 (265)
Q Consensus 235 ~Ggs~~~~e~l~~flGr-~ps~~a~~~ 260 (265)
.|||++-.|+++...|. +-+.+++++
T Consensus 438 lG~S~~w~e~le~~~g~~~~s~~~l~e 464 (477)
T cd06461 438 LGSSKPWPEALEALTGEREMDASALLE 464 (477)
T ss_pred CcCCCCHHHHHHHhcCCCCCChHHHHH
Confidence 99999999999999985 566666665
No 18
>PF02074 Peptidase_M32: Carboxypeptidase Taq (M32) metallopeptidase; InterPro: IPR001333 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M32 (carboxypeptidase Taq family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Carboxypeptidase Taq is a zinc-containing thermostable metallopeptidase. It was originally discovered and purified from Thermus aquaticus; optimal enzymatic activity occurs at 80 celcius. Although very little is known about this enzyme, it is thought either to be associated with a membrane or to be particle bound.; GO: 0004181 metallocarboxypeptidase activity, 0006508 proteolysis; PDB: 1K9X_A 1KA4_A 1KA2_A 3DWC_A 1WGZ_A 3HQ2_A 3HOA_B.
Probab=98.73 E-value=4.9e-07 Score=88.03 Aligned_cols=213 Identities=15% Similarity=0.181 Sum_probs=119.8
Q ss_pred cccHHHHHhHHhHHHhHHhhcCC--CCCCcccCCccchhHhhchHHHHHHhcc-HHHHHHHH----hhhcCCcccHHHHH
Q psy12854 33 FKQVENLFFHFGSLLQRSLTATH--YSDVSGVNNVEWDSVYIINYFLTHFLYE-ERVFAELN----SHFATGEKLKMTDE 105 (265)
Q Consensus 33 ~~~v~tLfHEfGHalH~lls~~~--~~~~sg~~~~~~d~~E~pS~~~E~~~~~-~~~L~~ls----~h~~t~~~lp~~~~ 105 (265)
.+.+....||.||+|=-.-.... .+.+.+ ++++-+=|-.|-|+||.+.. .+..+.+. .++.....++.+
T Consensus 257 ~~~l~s~iHE~GHalYEq~~~~~~~~tpl~~--~~smgiHESQSrf~En~vgrS~~Fw~~~~p~l~~~f~~~~~v~~e-- 332 (494)
T PF02074_consen 257 LSALFSTIHETGHALYEQGLPPEYYGTPLGE--GVSMGIHESQSRFWENMVGRSRAFWEFLYPKLQEHFPQLKDVSLE-- 332 (494)
T ss_dssp HHHHHHHHHHHHHHHHHHTS-GGGTTSTTSS---S-HHHHHHHHHHHHTTTTTSHHHHHHHHHHHHHHCGGGTT-SHH--
T ss_pred HHHHHHHHHHHHHHHHHcCCChhHccCcccc--ccccchHHHHHHHHHHHHccCHHHHHHHHHHHHHHhCcccCCCHH--
Confidence 35677788999999976544332 223333 47788899999999996543 33333332 233221222222
Q ss_pred HHHHHHh-----hhhh-----HHHHHHHHHHHHHHHHhcCCC--CcHHHHHHHHccccCCCCCCCCCCcCccccccCCCC
Q psy12854 106 QLKALRA-----HNAG-----IDVCSELFKANLDLQLHNGPK--PHWSEISRELYPLHFGFPIDKYSNLPCRFVEVGSGD 173 (265)
Q Consensus 106 ~l~~~~~-----~~~~-----~~~~~ql~~a~~D~~lh~~~~--~~~~~l~~~l~~~~~~~~~~~~~~~~~~f~Hl~s~~ 173 (265)
.+.++.+ +.+. ..-+.-++.-.++..+-...- ..+-++|++-+++|.|+.+.........=-|=.+|.
T Consensus 333 ~~y~~~N~V~ps~IR~eADElTY~lHIilRyEiEk~li~g~l~v~dLP~~Wn~km~eyLGi~p~~d~eG~LQDvHWs~G~ 412 (494)
T PF02074_consen 333 NFYRAINRVKPSLIRVEADELTYPLHIILRYEIEKALINGELSVDDLPEAWNEKMEEYLGITPPNDAEGVLQDVHWSSGS 412 (494)
T ss_dssp HHHHHHT-----S-GGG--TTTHHHHHHHHHHHHHHHHTTSS-GGGHHHHHHHHHHHHHS---SSCTTTTTS-STTTTT-
T ss_pred HHHHHHccCCCcccchhhHHHHHHHHHHHHHHHHHHHHcCCCChhhhHHHHHHHHHHHcCCCCCCCCccccccccccCCc
Confidence 2222211 1110 123455566666666665442 356788888888888863321111111111322221
Q ss_pred ccCCch-hhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCC-CCCCCCchHHHHHHHHHHhcCCCCccHHHHHHHhhCC
Q psy12854 174 LAAGYY-SFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGV-SGGERTEPNVGSRLRDTFLTFGGSCHSSEVFRRFRGR 251 (265)
Q Consensus 174 Y~a~YY-sYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~~yR~~iL~~Ggs~~~~e~l~~flGr 251 (265)
+ ||+ +|..+.++|+++|.+.+++-. - ++ +.. .|. ......=.|++|.+.|+..+|.|++++..|.
T Consensus 413 f--GYFPtY~LG~~~AaQl~~~~~~~~p-d--~~-----~~i~~G~---f~~i~~WL~e~Ih~~G~~y~~~eLl~~aTGe 479 (494)
T PF02074_consen 413 F--GYFPTYALGNIYAAQLFAAMKKDIP-D--LD-----EQIAKGD---FSPIRDWLRENIHQYGSRYTPDELLKKATGE 479 (494)
T ss_dssp T--S-THHHHHHHHHHHHHHHHHHHHHT-T--HH-----HHHHTT-----HHHHHHHHHHTGGGTTSS-HHHHHHHHHSS
T ss_pred c--ccchhhHHHHHHHHHHHHHHHHHcc-C--HH-----HHHHcCC---hHHHHHHHHHHHHHhccCCCHHHHHHHHhCC
Confidence 2 344 999999999999987655321 0 10 000 111 1456677889999999999999999999999
Q ss_pred CCCchHHHHhc
Q psy12854 252 DPCFKPFLDMF 262 (265)
Q Consensus 252 ~ps~~a~~~~~ 262 (265)
++|++.|++.|
T Consensus 480 ~l~~~~~~~YL 490 (494)
T PF02074_consen 480 PLNPEYFIDYL 490 (494)
T ss_dssp S--THHHHHHH
T ss_pred CCCHHHHHHHH
Confidence 99999999875
No 19
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=98.28 E-value=1.7e-05 Score=75.83 Aligned_cols=226 Identities=13% Similarity=0.176 Sum_probs=125.9
Q ss_pred EEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCC--CCCCcccCCccchhHhhchHHHHHHhcc-HHHHHHH
Q psy12854 14 NTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATH--YSDVSGVNNVEWDSVYIINYFLTHFLYE-ERVFAEL 90 (265)
Q Consensus 14 ~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~--~~~~sg~~~~~~d~~E~pS~~~E~~~~~-~~~L~~l 90 (265)
+-|..++.... -.+.+....||+|||+=-.-.... ...+.+ +.++-+=|-.|-|+||.+-. +...+.+
T Consensus 245 VRITTRy~~~d-------f~~aL~g~iHE~GHAlYEqn~~~~l~gtPlg~--g~smgiHESQSrF~En~VGRs~aFw~~~ 315 (497)
T COG2317 245 VRITTRYNEQD-------FRSALFGTIHETGHALYEQNLDEALLGTPLGE--GRSMGIHESQSRFWENQVGRSRAFWEAI 315 (497)
T ss_pred eeEEeecCCcc-------HHHHHHHHHHhhhhHHHHcCCCHHHcCCcccC--CcccccccHHHHHHHHHhccCHHHHHHH
Confidence 45566666521 235577889999999865433221 222322 46788899999999999764 4343333
Q ss_pred Hh----hhcCCcccHHH-HHHHHHHHhhh-hhH-----H----HHHHHHHHHHHHHHhcCCC--CcHHHHHHHHccccCC
Q psy12854 91 NS----HFATGEKLKMT-DEQLKALRAHN-AGI-----D----VCSELFKANLDLQLHNGPK--PHWSEISRELYPLHFG 153 (265)
Q Consensus 91 s~----h~~t~~~lp~~-~~~l~~~~~~~-~~~-----~----~~~ql~~a~~D~~lh~~~~--~~~~~l~~~l~~~~~~ 153 (265)
.. |+. +.+++. .+.+....+.. .++ + -+.-++.-.++..+-+..- .++-++|++-+.+|.|
T Consensus 316 ~~~~~~~~~--~~~~~~s~dd~y~~vn~v~~~lIRveADElTY~lHIilRyEiEk~li~G~l~v~DlP~lWn~kme~yLG 393 (497)
T COG2317 316 YPLLRKHFP--EQFDKYSLDDFYRAVNRVEPSLIRVEADELTYPLHIILRYEIEKELISGELKVDDLPELWNDKMEEYLG 393 (497)
T ss_pred HHHHHHhhh--hhcccCCHHHHHHHHhcccCcceeeeccccceeeeeehhHHHHHHHHcCCcchhhhHHHHHHHHHHHcC
Confidence 22 221 112222 33332221110 000 0 0111222333333333221 2355778888888888
Q ss_pred CCCCCCCCcCccccccCCCCccCCch-hhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHH
Q psy12854 154 FPIDKYSNLPCRFVEVGSGDLAAGYY-SFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTF 232 (265)
Q Consensus 154 ~~~~~~~~~~~~f~Hl~s~~Y~a~YY-sYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~i 232 (265)
+.+.........=-|-..|.+ ||+ +|..+.++|+++|.+.+++-. .-|+...-| +-.....=+|++|
T Consensus 394 irp~~d~eG~LQDIHWs~Gsf--GYFPsYtlG~v~AAQ~~~a~~~~~p--------~~d~~i~~g--d~~~i~~WL~e~i 461 (497)
T COG2317 394 IRPKNDAEGVLQDIHWSHGSF--GYFPTYTLGNVYAAQLYAAMKKDIP--------DVDALIAKG--DFSPIKNWLRENI 461 (497)
T ss_pred CCCCCccccccccccccCCCc--CcCchhhhHHHHHHHHHHHHHhhcc--------cHHHHHHcC--CcHHHHHHHHHHH
Confidence 644321111110112111222 344 999999999999987665311 011111111 0145667788999
Q ss_pred hcCCCCccHHHHHHHhhCCCCCchHHHHhc
Q psy12854 233 LTFGGSCHSSEVFRRFRGRDPCFKPFLDMF 262 (265)
Q Consensus 233 L~~Ggs~~~~e~l~~flGr~ps~~a~~~~~ 262 (265)
-+.|...+|.|++++-.|.+-|++.|++++
T Consensus 462 h~~Gs~~~p~eLi~~atGE~lnp~y~i~yL 491 (497)
T COG2317 462 HRHGSRYPPKELLKRATGEALNPEYFIDYL 491 (497)
T ss_pred HHccCcCCcHHHHHHhhCCcCCHHHHHHHH
Confidence 999999999999999999999999999975
No 20
>PF01401 Peptidase_M2: Angiotensin-converting enzyme This Prosite motif covers only the active site.; InterPro: IPR001548 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M2 (clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The catalytic residues and zinc ligands have been identified, the zinc ion being ligated to two His residues within the motif HEXXH, showing that the enzyme belongs to the E sub-group of metalloproteases []. Pepetidyl-dipeptidase A (angiotensin-converting enzyme) is a mammalian enzyme responsible for cleavage of dipeptides from the C-termini of proteins, notably converting angiotensin I to angiotensin II []. The enzyme exists in two differentially transcribed forms, the most common of which is from lung endothelium; this contains two homologous domains that have arisen by gene duplication []. The testis-specific form contains only the C-terminal domain, arising from a duplicated promoter region present in intron 12 of the gene []. Both enzymatic forms are membrane proteins that are anchored by means of a C-terminal transmembrane domain. Both domains of the endothelial enzyme are active, but have differing kinetic constants []. ]. A number of insect enzymes have been shown to be similar to peptidyl-dipeptidase A, these containing a single catalytic domain.; GO: 0008237 metallopeptidase activity, 0008241 peptidyl-dipeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 2YDM_A 3BKL_A 2C6N_B 1UZE_A 3BKK_A 2C6F_B 2IUX_A 2IUL_A 2XYD_B 3NXQ_B ....
Probab=98.25 E-value=4.6e-05 Score=76.03 Aligned_cols=211 Identities=16% Similarity=0.164 Sum_probs=130.6
Q ss_pred CCccCcccHHHHHhHHhHHHhHHhhcC-CCCCCcccCCccchhHhhchHHHHHHhccHHHHHHHHhhhcCCcccHHH-HH
Q psy12854 28 PCLLTFKQVENLFFHFGSLLQRSLTAT-HYSDVSGVNNVEWDSVYIINYFLTHFLYEERVFAELNSHFATGEKLKMT-DE 105 (265)
Q Consensus 28 p~ll~~~~v~tLfHEfGHalH~lls~~-~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~L~~ls~h~~t~~~lp~~-~~ 105 (265)
++-++.+|+.|+.||+||....+.-+. +...-.| .---|-|..+-++-.-+..|.-|+++.---.. ..-.+. ++
T Consensus 342 Ct~v~~~df~t~hhemghi~Y~~~y~~qp~~fr~g---an~gfhEAigd~ials~~tp~hL~~~gLl~~~-~~~~~~~in 417 (595)
T PF01401_consen 342 CTEVTMEDFLTAHHEMGHIQYYMQYKDQPVLFREG---ANPGFHEAIGDTIALSVSTPKHLKRIGLLDNV-TDDEESDIN 417 (595)
T ss_dssp ---SSHHHHHHHHHHHHHHHHHHHTTTS-GGGSS----SSHHHHHHHHHHHHHHHTSHHHHHHTTSSSTT-TTSHHHHHH
T ss_pred cccCCHHHHHHHHHHHHHHHHHHHHhhCChhhhcC---CCchHHHHHHHHHHHHcCCHHHHHHcCCCCCC-CcchHHHHH
Confidence 566799999999999999998766543 2222223 23568899999999899899888887542111 111233 55
Q ss_pred HHHHHHhhhhhHHHH-HHHHHHHHHHH---HhcC--CCCcHHHHHHHHccccCCC-CCCC---CCCcCccccccCCCCcc
Q psy12854 106 QLKALRAHNAGIDVC-SELFKANLDLQ---LHNG--PKPHWSEISRELYPLHFGF-PIDK---YSNLPCRFVEVGSGDLA 175 (265)
Q Consensus 106 ~l~~~~~~~~~~~~~-~ql~~a~~D~~---lh~~--~~~~~~~l~~~l~~~~~~~-~~~~---~~~~~~~f~Hl~s~~Y~ 175 (265)
.|.. .++..+ ...+.-..|+- +... .++.+++-|-++..+|.|+ |+.+ .+.-|...-|+.+ .
T Consensus 418 ~L~~-----~AL~ki~~lPf~~~~d~wRw~vf~g~i~~~~~n~~wW~lr~~yqGi~pP~~r~~~~fD~~ak~hi~~---~ 489 (595)
T PF01401_consen 418 FLLK-----QALDKIAFLPFGYLMDKWRWDVFSGSIPPDQYNEAWWELREKYQGIVPPVERSESDFDPGAKYHIPA---N 489 (595)
T ss_dssp HHHH-----HHHHHCCHHHHHHHHHHHHHHHHTTSS-CCGHHHHHHHHHHHHHSEE-SS-EETTS-GGGGSHHHHT---T
T ss_pred HHHH-----HhhhHHHhccHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHcCCccCCCCCCcccCCcchhhhhcc---c
Confidence 4432 233322 22333344443 2222 1256899999999999885 3332 2222455668853 4
Q ss_pred CCchhhHhHHHHHHHHHHHHHHc-CC-CCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccHHHHHHHhhC-CC
Q psy12854 176 AGYYSFLWSKLVSADIFYAFKED-GE-GRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHSSEVFRRFRG-RD 252 (265)
Q Consensus 176 a~YYsYl~s~~~A~dv~~~f~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~~e~l~~flG-r~ 252 (265)
..|..|-.|.++--++++.+.+. |. +- +. +-| -.| ++++|++++ ++|+.|+|++=.++++.+.| ++
T Consensus 490 ~py~ryfls~ilqFQ~~~~lC~~ag~~~p--Lh--~Cd--iy~----s~~aG~~l~-~ml~~G~S~~W~~~l~~~tg~~~ 558 (595)
T PF01401_consen 490 TPYIRYFLSTILQFQFYKALCEAAGHEGP--LH--RCD--IYG----SKEAGRKLK-KMLSLGSSKPWPEALEELTGERE 558 (595)
T ss_dssp --SHHHHHHHHHHHHHHHHHHHHTTTTSS--GG--G-----TT-----HHHHHHHH-HHHTTTTSS-HHHHHHHHHSSSS
T ss_pred cccHHHHHHHHHHHHHHHHHHHhcCCCCC--cc--ccc--ccc----chhhHHHHH-HHhcCCCCCCHHHHHHHHhCCCC
Confidence 56899999999999999885543 21 00 00 000 012 389999995 89999999999999999999 68
Q ss_pred CCchHHHHh
Q psy12854 253 PCFKPFLDM 261 (265)
Q Consensus 253 ps~~a~~~~ 261 (265)
-|.+|+++.
T Consensus 559 ~~a~~ll~Y 567 (595)
T PF01401_consen 559 MSASALLEY 567 (595)
T ss_dssp --SHHHHHH
T ss_pred cChHHHHHH
Confidence 999999875
No 21
>KOG3690|consensus
Probab=97.05 E-value=0.059 Score=53.35 Aligned_cols=214 Identities=15% Similarity=0.112 Sum_probs=127.8
Q ss_pred CCccCcccHHHHHhHHhHHHhHHhhc-CCCCCCcccCCccchhHhhchHHHHHHhccHHHHHHHHhhhcCCcc-cHHHHH
Q psy12854 28 PCLLTFKQVENLFFHFGSLLQRSLTA-THYSDVSGVNNVEWDSVYIINYFLTHFLYEERVFAELNSHFATGEK-LKMTDE 105 (265)
Q Consensus 28 p~ll~~~~v~tLfHEfGHalH~lls~-~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~L~~ls~h~~t~~~-lp~~~~ 105 (265)
++.+...+..+..||+||.==.+.-+ .+.-.-.| .---|-|...-.+-.-+..|.-|+++.---.++.. .+..|+
T Consensus 364 Ct~vd~~~f~~~Hhem~hiQY~l~y~~qP~llre~---anPgFheAIgd~~als~stPrhL~slgLL~~~~~~d~~~~IN 440 (646)
T KOG3690|consen 364 CTKVDEEDFEQAHHEMGHIQYYLQYKQQPFLLREG---ANPGFHEAIGDAFALSVSTPRHLHSLGLLSEDLDEDDEVRIN 440 (646)
T ss_pred cCcCCHHHHHHHHHHHHHHHHHHHHHhccHHhhCC---CCCcHHHHHHHHHHHhcCCHHHHHHhcccccccccchHHHHH
Confidence 67789999999999999965555443 23222222 22345666666777777888777776431111111 111255
Q ss_pred HHHHHHhhhhhHHHH-HHHHHHHHHHHHhcCC----C-CcHHHHHHHHccccCCC-CCCCCCC---cCccccccCCCCcc
Q psy12854 106 QLKALRAHNAGIDVC-SELFKANLDLQLHNGP----K-PHWSEISRELYPLHFGF-PIDKYSN---LPCRFVEVGSGDLA 175 (265)
Q Consensus 106 ~l~~~~~~~~~~~~~-~ql~~a~~D~~lh~~~----~-~~~~~l~~~l~~~~~~~-~~~~~~~---~~~~f~Hl~s~~Y~ 175 (265)
.|.+ .++..+ ...+.-..|..-+..- + ..+++-|-+|..+|.|+ |+.+.+. -+..--|+ . =+
T Consensus 441 ~L~k-----~aL~~v~~LPf~y~~DkwR~~Vf~G~i~k~~~N~~~W~lr~ky~Gv~~P~~r~~~dfD~~akf~~-~--~d 512 (646)
T KOG3690|consen 441 RLFK-----MALDKVAFLPFTYALDKWRYEVFDGEIPKDQYNCAYWELRNKYEGVRPPVPRTNKDFDPPAKFHI-Q--VD 512 (646)
T ss_pred HHHH-----HHHHHHhhhhhhhhhhhhHHHHhcCCCCcchhhHHHHHHHHHhcCcCCCCcCccccCCchhhhcc-C--cc
Confidence 4432 233322 2223333443322211 1 35899999999999885 3333332 12223354 2 23
Q ss_pred CCchhhHhHHHHHHHHHHHHHH-cCC-----CCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccHHHHHHHhh
Q psy12854 176 AGYYSFLWSKLVSADIFYAFKE-DGE-----GRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHSSEVFRRFR 249 (265)
Q Consensus 176 a~YYsYl~s~~~A~dv~~~f~~-~~~-----~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~~e~l~~fl 249 (265)
..|-.|..|.++--++++.|.. .|. +|..++ .- +--.++++|+++| +.++.|.|++=.|+++.++
T Consensus 513 ~~~~ryf~s~vlqFQf~kalC~~agq~~pg~pr~pLh--~C------Di~~sk~aG~kL~-~~M~lG~S~~W~evLE~~t 583 (646)
T KOG3690|consen 513 VSYTRYFLSTVLQFQFLKALCQAAGQYGPGDPRKPLH--NC------DIYDSKEAGEKLR-TMMSLGSSRHWREVLEEIT 583 (646)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCCCCch--hc------cccCCHHHHHHHH-HHHHccCCccHHHHHHHHc
Confidence 3456888899998888877543 221 122221 00 1112379999995 8999999999999999999
Q ss_pred C-CCCCchHHHHh
Q psy12854 250 G-RDPCFKPFLDM 261 (265)
Q Consensus 250 G-r~ps~~a~~~~ 261 (265)
| ++.|.+++|+.
T Consensus 584 Ge~els~~~LLeY 596 (646)
T KOG3690|consen 584 GEGELSGRALLEY 596 (646)
T ss_pred CCCcCChHHHHHH
Confidence 9 56899988875
No 22
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=92.85 E-value=0.079 Score=46.46 Aligned_cols=32 Identities=16% Similarity=0.076 Sum_probs=23.3
Q ss_pred ccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhH
Q psy12854 8 TVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQR 49 (265)
Q Consensus 8 ~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~ 49 (265)
..+.|+.+++.|-+ ......||+||+||++=|
T Consensus 55 ~~~~~~I~iN~n~~----------~~r~rFtlAHELGH~llH 86 (213)
T COG2856 55 DEEKPVIYINANNS----------LERKRFTLAHELGHALLH 86 (213)
T ss_pred eccCceEEEeCCCC----------HHHHHHHHHHHHhHHHhc
Confidence 33457777777774 356789999999997643
No 23
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=88.91 E-value=0.28 Score=43.21 Aligned_cols=18 Identities=22% Similarity=0.383 Sum_probs=15.5
Q ss_pred ccHHHHHhHHhHHHhHHh
Q psy12854 34 KQVENLFFHFGSLLQRSL 51 (265)
Q Consensus 34 ~~v~tLfHEfGHalH~ll 51 (265)
.-+.||+||+|||+-..+
T Consensus 91 ~~~~~l~HE~GHAlI~~~ 108 (220)
T PF14247_consen 91 NVLFTLYHELGHALIDDL 108 (220)
T ss_pred HHHHHHHHHHHHHHHHHh
Confidence 346899999999999887
No 24
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=88.78 E-value=0.23 Score=40.24 Aligned_cols=13 Identities=15% Similarity=0.135 Sum_probs=11.1
Q ss_pred cHHHHHhHHhHHH
Q psy12854 35 QVENLFFHFGSLL 47 (265)
Q Consensus 35 ~v~tLfHEfGHal 47 (265)
-..++.||+||+|
T Consensus 105 ~~~v~~HEiGHaL 117 (154)
T PF00413_consen 105 LQSVAIHEIGHAL 117 (154)
T ss_dssp HHHHHHHHHHHHT
T ss_pred hhhhhhhcccccc
Confidence 3478999999997
No 25
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=88.35 E-value=0.41 Score=36.35 Aligned_cols=22 Identities=18% Similarity=0.120 Sum_probs=16.6
Q ss_pred CcccHHHHHhHHhHHHhHHhhc
Q psy12854 32 TFKQVENLFFHFGSLLQRSLTA 53 (265)
Q Consensus 32 ~~~~v~tLfHEfGHalH~lls~ 53 (265)
......|++||+||.+.+--..
T Consensus 39 ~~~~~f~laHELgH~~~~~~~~ 60 (122)
T PF06114_consen 39 PERQRFTLAHELGHILLHHGDE 60 (122)
T ss_dssp HHHHHHHHHHHHHHHHHHH-HH
T ss_pred HHHHHHHHHHHHHHHHhhhccc
Confidence 4566899999999988766543
No 26
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=85.68 E-value=0.45 Score=39.00 Aligned_cols=14 Identities=14% Similarity=0.078 Sum_probs=11.8
Q ss_pred cHHHHHhHHhHHHh
Q psy12854 35 QVENLFFHFGSLLQ 48 (265)
Q Consensus 35 ~v~tLfHEfGHalH 48 (265)
-..|+.||+||+|=
T Consensus 94 ~~~~~~HEiGHaLG 107 (165)
T cd04268 94 LRNTAEHELGHALG 107 (165)
T ss_pred HHHHHHHHHHHHhc
Confidence 46899999999874
No 27
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=85.04 E-value=0.48 Score=38.85 Aligned_cols=15 Identities=13% Similarity=0.140 Sum_probs=12.0
Q ss_pred ccHHHHHhHHhHHHh
Q psy12854 34 KQVENLFFHFGSLLQ 48 (265)
Q Consensus 34 ~~v~tLfHEfGHalH 48 (265)
+-..|+.||+||+|=
T Consensus 103 ~~~~~~~HEiGHaLG 117 (156)
T cd04279 103 NLQAIALHELGHALG 117 (156)
T ss_pred HHHHHHHHHhhhhhc
Confidence 345789999999984
No 28
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=84.81 E-value=0.38 Score=39.55 Aligned_cols=16 Identities=13% Similarity=0.019 Sum_probs=12.9
Q ss_pred cccHHHHHhHHhHHHh
Q psy12854 33 FKQVENLFFHFGSLLQ 48 (265)
Q Consensus 33 ~~~v~tLfHEfGHalH 48 (265)
.+-..|+.||+||+|=
T Consensus 105 ~~~~~~~~HEiGHaLG 120 (157)
T cd04278 105 TDLFSVAAHEIGHALG 120 (157)
T ss_pred chHHHHHHHHhccccc
Confidence 4457799999999974
No 29
>PF13058 DUF3920: Protein of unknown function (DUF3920)
Probab=84.29 E-value=0.71 Score=36.22 Aligned_cols=16 Identities=19% Similarity=0.428 Sum_probs=13.8
Q ss_pred ccHHHHHhHHhHHHhH
Q psy12854 34 KQVENLFFHFGSLLQR 49 (265)
Q Consensus 34 ~~v~tLfHEfGHalH~ 49 (265)
.-+.||+|||-|+||+
T Consensus 75 qvlktllhefrh~mQh 90 (126)
T PF13058_consen 75 QVLKTLLHEFRHAMQH 90 (126)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 4478999999999993
No 30
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=83.28 E-value=0.66 Score=39.20 Aligned_cols=16 Identities=13% Similarity=0.179 Sum_probs=13.4
Q ss_pred cccHHHHHhHHhHHHh
Q psy12854 33 FKQVENLFFHFGSLLQ 48 (265)
Q Consensus 33 ~~~v~tLfHEfGHalH 48 (265)
-....|+.||+||+|=
T Consensus 111 ~~~~~t~~HEiGHaLG 126 (186)
T cd04277 111 SYGYQTIIHEIGHALG 126 (186)
T ss_pred hhhHHHHHHHHHHHhc
Confidence 4568899999999974
No 31
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=81.95 E-value=0.73 Score=39.03 Aligned_cols=17 Identities=18% Similarity=0.251 Sum_probs=13.9
Q ss_pred CcccHHHHHhHHhHHHh
Q psy12854 32 TFKQVENLFFHFGSLLQ 48 (265)
Q Consensus 32 ~~~~v~tLfHEfGHalH 48 (265)
+...+.+++||+||.|=
T Consensus 139 ~~~~~~~~AHEiGH~lG 155 (196)
T PF13688_consen 139 TYNGAITFAHEIGHNLG 155 (196)
T ss_dssp HHHHHHHHHHHHHHHTT
T ss_pred CCceehhhHHhHHHhcC
Confidence 46778999999999653
No 32
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=81.89 E-value=1.4 Score=36.64 Aligned_cols=17 Identities=18% Similarity=0.249 Sum_probs=12.5
Q ss_pred CcccHHHHHhHHhHHHh
Q psy12854 32 TFKQVENLFFHFGSLLQ 48 (265)
Q Consensus 32 ~~~~v~tLfHEfGHalH 48 (265)
....-.||.||+||-|-
T Consensus 66 ~~~~g~TltHEvGH~LG 82 (154)
T PF05572_consen 66 QYNFGKTLTHEVGHWLG 82 (154)
T ss_dssp TS-SSHHHHHHHHHHTT
T ss_pred ccccccchhhhhhhhhc
Confidence 44556899999999653
No 33
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=81.88 E-value=0.7 Score=36.04 Aligned_cols=13 Identities=23% Similarity=0.360 Sum_probs=11.2
Q ss_pred cHHHHHhHHhHHH
Q psy12854 35 QVENLFFHFGSLL 47 (265)
Q Consensus 35 ~v~tLfHEfGHal 47 (265)
.+.|+.||+||.|
T Consensus 107 ~~~~~~HEiGH~l 119 (124)
T PF13582_consen 107 GVDTFAHEIGHNL 119 (124)
T ss_dssp STTHHHHHHHHHT
T ss_pred cceEeeehhhHhc
Confidence 4689999999976
No 34
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=81.81 E-value=1.3 Score=38.40 Aligned_cols=15 Identities=13% Similarity=0.157 Sum_probs=12.1
Q ss_pred cccHHHHHhHHhHHH
Q psy12854 33 FKQVENLFFHFGSLL 47 (265)
Q Consensus 33 ~~~v~tLfHEfGHal 47 (265)
-....+++||+||.+
T Consensus 135 ~~~~~~~aHEiGH~l 149 (206)
T PF13583_consen 135 ANGYQTFAHEIGHNL 149 (206)
T ss_pred cccchHHHHHHHHHh
Confidence 346688999999976
No 35
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=81.58 E-value=0.71 Score=37.00 Aligned_cols=11 Identities=18% Similarity=0.153 Sum_probs=10.0
Q ss_pred HHHHhHHhHHH
Q psy12854 37 ENLFFHFGSLL 47 (265)
Q Consensus 37 ~tLfHEfGHal 47 (265)
.++.||+||+|
T Consensus 88 ~~~~HEigHaL 98 (140)
T smart00235 88 GVAAHELGHAL 98 (140)
T ss_pred ccHHHHHHHHh
Confidence 48999999998
No 36
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=81.50 E-value=1.2 Score=38.43 Aligned_cols=24 Identities=21% Similarity=0.343 Sum_probs=19.4
Q ss_pred cccHHHHHhHHhHHHhHHhhcCCC
Q psy12854 33 FKQVENLFFHFGSLLQRSLTATHY 56 (265)
Q Consensus 33 ~~~v~tLfHEfGHalH~lls~~~~ 56 (265)
..-+.|++||+||++=.+++-.+.
T Consensus 20 ~~~l~t~~HE~gHal~a~l~G~~v 43 (200)
T PF13398_consen 20 FRLLVTFVHELGHALAALLTGGRV 43 (200)
T ss_pred HHHHHHHHHHHHHHHHHHHhCCCc
Confidence 345679999999999999986543
No 37
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=81.37 E-value=0.9 Score=38.25 Aligned_cols=14 Identities=14% Similarity=0.361 Sum_probs=12.2
Q ss_pred cHHHHHhHHhHHHh
Q psy12854 35 QVENLFFHFGSLLQ 48 (265)
Q Consensus 35 ~v~tLfHEfGHalH 48 (265)
++.+++||+||.+=
T Consensus 111 ~~~~~aHElGH~lG 124 (173)
T PF13574_consen 111 GIDTFAHELGHQLG 124 (173)
T ss_dssp HHHHHHHHHHHHHT
T ss_pred eeeeehhhhHhhcC
Confidence 78999999999663
No 38
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=79.36 E-value=1 Score=38.72 Aligned_cols=15 Identities=20% Similarity=0.257 Sum_probs=12.2
Q ss_pred HHHHHhHHhHHHhHH
Q psy12854 36 VENLFFHFGSLLQRS 50 (265)
Q Consensus 36 v~tLfHEfGHalH~l 50 (265)
..|+.|||||||=..
T Consensus 93 ~~~i~HElgHaLG~~ 107 (198)
T cd04327 93 SRVVLHEFGHALGFI 107 (198)
T ss_pred HHHHHHHHHHHhcCc
Confidence 468999999998544
No 39
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=78.88 E-value=1.5 Score=38.61 Aligned_cols=19 Identities=16% Similarity=0.219 Sum_probs=15.9
Q ss_pred CcccHHHHHhHHhHHHhHH
Q psy12854 32 TFKQVENLFFHFGSLLQRS 50 (265)
Q Consensus 32 ~~~~v~tLfHEfGHalH~l 50 (265)
+...|-.=+||.|||+|+-
T Consensus 86 SiaAvaVAAHEvGHAiQ~a 104 (222)
T PF04298_consen 86 SIAAVAVAAHEVGHAIQHA 104 (222)
T ss_pred CHHHHHHHHHHHhHHHhcc
Confidence 5667888899999999964
No 40
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=78.73 E-value=1 Score=36.79 Aligned_cols=14 Identities=14% Similarity=0.347 Sum_probs=12.2
Q ss_pred ccHHHHHhHHhHHH
Q psy12854 34 KQVENLFFHFGSLL 47 (265)
Q Consensus 34 ~~v~tLfHEfGHal 47 (265)
....++.||+||+|
T Consensus 95 ~~~~~~~HElGH~L 108 (167)
T cd00203 95 EGAQTIAHELGHAL 108 (167)
T ss_pred cchhhHHHHHHHHh
Confidence 45789999999998
No 41
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=78.23 E-value=2.1 Score=36.13 Aligned_cols=16 Identities=19% Similarity=0.218 Sum_probs=13.4
Q ss_pred CcccHHHHHhHHhHHH
Q psy12854 32 TFKQVENLFFHFGSLL 47 (265)
Q Consensus 32 ~~~~v~tLfHEfGHal 47 (265)
......+++||+||.+
T Consensus 130 ~~~~~~~~aHElGH~l 145 (192)
T cd04267 130 TLLTALTMAHELGHNL 145 (192)
T ss_pred ceeehhhhhhhHHhhc
Confidence 3566789999999987
No 42
>PF14891 Peptidase_M91: Effector protein
Probab=78.16 E-value=1.6 Score=36.86 Aligned_cols=21 Identities=5% Similarity=-0.088 Sum_probs=17.8
Q ss_pred ccHHHHHhHHhHHHhHHhhcC
Q psy12854 34 KQVENLFFHFGSLLQRSLTAT 54 (265)
Q Consensus 34 ~~v~tLfHEfGHalH~lls~~ 54 (265)
.-+..|+|||.||.|.+-...
T Consensus 102 ~p~v~L~HEL~HA~~~~~Gt~ 122 (174)
T PF14891_consen 102 PPFVVLYHELIHAYDYMNGTM 122 (174)
T ss_pred HHHHHHHHHHHHHHHHHCCCC
Confidence 447899999999999887765
No 43
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=76.12 E-value=1.7 Score=39.99 Aligned_cols=19 Identities=26% Similarity=0.252 Sum_probs=15.6
Q ss_pred cHHHHHhHHhHHHhHHhhc
Q psy12854 35 QVENLFFHFGSLLQRSLTA 53 (265)
Q Consensus 35 ~v~tLfHEfGHalH~lls~ 53 (265)
..-.|+||+||.++.++.-
T Consensus 170 ~ayVlAHEyGHHVQ~l~Gi 188 (292)
T PF04228_consen 170 QAYVLAHEYGHHVQNLLGI 188 (292)
T ss_pred HHHHHHHHHHHHHHHHhhh
Confidence 3446999999999998854
No 44
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=74.99 E-value=1.8 Score=37.59 Aligned_cols=16 Identities=19% Similarity=0.332 Sum_probs=13.9
Q ss_pred CcccHHHHHhHHhHHH
Q psy12854 32 TFKQVENLFFHFGSLL 47 (265)
Q Consensus 32 ~~~~v~tLfHEfGHal 47 (265)
++..+.|++||+||.|
T Consensus 142 ~~~~~~~~AHElGH~l 157 (220)
T cd04272 142 SYYGVYTMTHELAHLL 157 (220)
T ss_pred CcccHHHHHHHHHHHh
Confidence 4567999999999987
No 45
>PF10460 Peptidase_M30: Peptidase M30; InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue.
Probab=74.37 E-value=4.3 Score=38.52 Aligned_cols=46 Identities=4% Similarity=-0.065 Sum_probs=31.0
Q ss_pred cHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhcc
Q psy12854 35 QVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYE 83 (265)
Q Consensus 35 ~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~ 83 (265)
-+.||+|||=|.+|..-.......- ...+.-.=|.-|+++|.++..
T Consensus 139 ~~sTlAHEfQHmInfy~~~v~~g~~---~~~dtWLnE~lS~~aEdl~s~ 184 (366)
T PF10460_consen 139 VYSTLAHEFQHMINFYQRGVLHGKQ---YAMDTWLNEMLSMSAEDLYSS 184 (366)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCC---cccccHHHHHHHHHHHHHHhc
Confidence 3679999999999986443221101 124566778899999988654
No 46
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=72.21 E-value=1.2 Score=39.37 Aligned_cols=14 Identities=7% Similarity=0.180 Sum_probs=11.6
Q ss_pred ccHHHHHhHHhHHH
Q psy12854 34 KQVENLFFHFGSLL 47 (265)
Q Consensus 34 ~~v~tLfHEfGHal 47 (265)
..+.|++||+||.|
T Consensus 144 ~~~~t~AHElGHnL 157 (228)
T cd04271 144 NEWQVFAHEIGHTF 157 (228)
T ss_pred ccceehhhhhhhhc
Confidence 34579999999987
No 47
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=72.15 E-value=7.3 Score=29.52 Aligned_cols=35 Identities=6% Similarity=-0.107 Sum_probs=22.6
Q ss_pred HHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHH
Q psy12854 37 ENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLT 78 (265)
Q Consensus 37 ~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E 78 (265)
.+|.||++|.+|...... . ...|.-+.|=-+..+|
T Consensus 27 ~~l~HE~~H~~~~~~~~~-----~--~~~~~W~~EG~A~y~~ 61 (128)
T PF13485_consen 27 RVLAHELAHQWFGNYFGG-----D--DNAPRWFNEGLAEYVE 61 (128)
T ss_pred HHHHHHHHHHHHHHHcCC-----C--ccCchHHHHHHHHHHh
Confidence 889999999999877543 1 1245555555444444
No 48
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=71.15 E-value=2.2 Score=37.96 Aligned_cols=12 Identities=8% Similarity=-0.103 Sum_probs=10.3
Q ss_pred HHHHHhHHhHHH
Q psy12854 36 VENLFFHFGSLL 47 (265)
Q Consensus 36 v~tLfHEfGHal 47 (265)
..|++||+||.+
T Consensus 168 a~t~AHElGHnl 179 (244)
T cd04270 168 DLVTAHELGHNF 179 (244)
T ss_pred HHHHHHHHHHhc
Confidence 379999999975
No 49
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=71.01 E-value=2.1 Score=36.31 Aligned_cols=13 Identities=8% Similarity=0.125 Sum_probs=11.2
Q ss_pred HHHHHhHHhHHHh
Q psy12854 36 VENLFFHFGSLLQ 48 (265)
Q Consensus 36 v~tLfHEfGHalH 48 (265)
+.|+.|||||||=
T Consensus 75 ~g~v~HE~~HalG 87 (180)
T cd04280 75 LGTIVHELMHALG 87 (180)
T ss_pred CchhHHHHHHHhc
Confidence 5789999999874
No 50
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=69.17 E-value=3.2 Score=33.51 Aligned_cols=17 Identities=12% Similarity=0.268 Sum_probs=13.5
Q ss_pred CcccHHHHHhHHhHHHh
Q psy12854 32 TFKQVENLFFHFGSLLQ 48 (265)
Q Consensus 32 ~~~~v~tLfHEfGHalH 48 (265)
.++.+....||+||.|=
T Consensus 74 gy~~~RIaaHE~GHiLG 90 (132)
T PF02031_consen 74 GYNSTRIAAHELGHILG 90 (132)
T ss_dssp HS-HHHHHHHHHHHHHT
T ss_pred CCccceeeeehhccccC
Confidence 46678899999999875
No 51
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=67.97 E-value=3.4 Score=35.32 Aligned_cols=15 Identities=20% Similarity=0.343 Sum_probs=12.4
Q ss_pred cHHHHHhHHhHHHhH
Q psy12854 35 QVENLFFHFGSLLQR 49 (265)
Q Consensus 35 ~v~tLfHEfGHalH~ 49 (265)
.+.|+.|||||||=.
T Consensus 79 ~~~~i~HEl~HaLG~ 93 (191)
T PF01400_consen 79 SVGTILHELGHALGF 93 (191)
T ss_dssp SHHHHHHHHHHHHTB
T ss_pred CccchHHHHHHHHhh
Confidence 567999999999843
No 52
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=67.56 E-value=2.5 Score=36.97 Aligned_cols=13 Identities=8% Similarity=0.094 Sum_probs=10.7
Q ss_pred HHHHhHHhHHHhH
Q psy12854 37 ENLFFHFGSLLQR 49 (265)
Q Consensus 37 ~tLfHEfGHalH~ 49 (265)
.++.||+||+|=.
T Consensus 135 hvi~HEiGH~IGf 147 (211)
T PF12388_consen 135 HVITHEIGHCIGF 147 (211)
T ss_pred HHHHHHhhhhccc
Confidence 4789999999853
No 53
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=66.97 E-value=3 Score=35.26 Aligned_cols=14 Identities=14% Similarity=0.102 Sum_probs=12.1
Q ss_pred ccHHHHHhHHhHHH
Q psy12854 34 KQVENLFFHFGSLL 47 (265)
Q Consensus 34 ~~v~tLfHEfGHal 47 (265)
.-+.+++||+||.+
T Consensus 130 ~~a~~~AHElGH~l 143 (194)
T cd04269 130 LFAVTMAHELGHNL 143 (194)
T ss_pred HHHHHHHHHHHhhc
Confidence 44789999999987
No 54
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=66.13 E-value=3.2 Score=35.47 Aligned_cols=15 Identities=7% Similarity=-0.235 Sum_probs=12.0
Q ss_pred HHHHHhHHhHHHhHH
Q psy12854 36 VENLFFHFGSLLQRS 50 (265)
Q Consensus 36 v~tLfHEfGHalH~l 50 (265)
+-|+.|||||||=..
T Consensus 78 ~G~i~HEl~HaLG~~ 92 (182)
T cd04283 78 KGIIQHELLHALGFY 92 (182)
T ss_pred cchHHHHHHHHhCCc
Confidence 568999999998543
No 55
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=66.01 E-value=3.9 Score=30.06 Aligned_cols=19 Identities=11% Similarity=0.086 Sum_probs=15.6
Q ss_pred CcccHHHHHhHHhHHHhHH
Q psy12854 32 TFKQVENLFFHFGSLLQRS 50 (265)
Q Consensus 32 ~~~~v~tLfHEfGHalH~l 50 (265)
+..+...|+||+.|.++.-
T Consensus 58 s~~~~~llaHEl~Hv~Qq~ 76 (79)
T PF13699_consen 58 SPEGRALLAHELAHVVQQR 76 (79)
T ss_pred CCCcchhHhHHHHHHHhhc
Confidence 5567788999999999853
No 56
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=63.98 E-value=4.5 Score=34.07 Aligned_cols=24 Identities=13% Similarity=0.239 Sum_probs=15.6
Q ss_pred HHHHHhHHhHHHhHHhhcCCCCCC
Q psy12854 36 VENLFFHFGSLLQRSLTATHYSDV 59 (265)
Q Consensus 36 v~tLfHEfGHalH~lls~~~~~~~ 59 (265)
+..+.||+||++=......+....
T Consensus 8 i~i~~HE~gH~~~a~~~G~~~~~~ 31 (192)
T PF02163_consen 8 ISIVLHELGHALAARLYGDKVPRF 31 (192)
T ss_dssp HHHHHHHHHHHHHHHTTT--B--E
T ss_pred cccccccccccccccccccccccc
Confidence 457899999999877766554444
No 57
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=60.81 E-value=4.5 Score=35.08 Aligned_cols=14 Identities=7% Similarity=0.180 Sum_probs=11.6
Q ss_pred HHHHHhHHhHHHhH
Q psy12854 36 VENLFFHFGSLLQR 49 (265)
Q Consensus 36 v~tLfHEfGHalH~ 49 (265)
+-|+.||+||||=.
T Consensus 88 ~Gti~HEl~HaLGf 101 (200)
T cd04281 88 FGIVVHELGHVIGF 101 (200)
T ss_pred CchHHHHHHHHhcC
Confidence 56899999998853
No 58
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=59.14 E-value=1.9 Score=39.28 Aligned_cols=14 Identities=21% Similarity=0.183 Sum_probs=11.6
Q ss_pred cHHHHHhHHhHHHh
Q psy12854 35 QVENLFFHFGSLLQ 48 (265)
Q Consensus 35 ~v~tLfHEfGHalH 48 (265)
.+-|++|||||+|-
T Consensus 165 ~igv~~HE~gH~lG 178 (286)
T TIGR03296 165 GVGVIAHELGHDLG 178 (286)
T ss_pred ceeeeehhhhcccC
Confidence 37899999999763
No 59
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=58.69 E-value=6.2 Score=32.94 Aligned_cols=19 Identities=11% Similarity=0.109 Sum_probs=14.2
Q ss_pred HHHHHhHHhHHHhHHhhcC
Q psy12854 36 VENLFFHFGSLLQRSLTAT 54 (265)
Q Consensus 36 v~tLfHEfGHalH~lls~~ 54 (265)
+..+.||+||++=...-..
T Consensus 9 i~i~iHE~gH~~~A~~~G~ 27 (180)
T cd05709 9 ISVTVHELGHALVARRLGV 27 (180)
T ss_pred HHHHHHHHHHHHHHHHcCC
Confidence 4568999999986665544
No 60
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=57.49 E-value=5.9 Score=34.27 Aligned_cols=12 Identities=25% Similarity=0.127 Sum_probs=10.3
Q ss_pred HHHHhHHhHHHh
Q psy12854 37 ENLFFHFGSLLQ 48 (265)
Q Consensus 37 ~tLfHEfGHalH 48 (265)
.++.||+||+|=
T Consensus 118 ~~~~he~gh~lG 129 (197)
T cd04276 118 YLLAHEVGHTLG 129 (197)
T ss_pred HHHHHHHHHHhc
Confidence 589999999874
No 61
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=56.04 E-value=6 Score=35.83 Aligned_cols=19 Identities=11% Similarity=0.097 Sum_probs=12.3
Q ss_pred cHHHHHhHHhHHHhHHhhc
Q psy12854 35 QVENLFFHFGSLLQRSLTA 53 (265)
Q Consensus 35 ~v~tLfHEfGHalH~lls~ 53 (265)
-..++.|||||++-.|--+
T Consensus 216 ~~~v~vHE~GHsf~~LaDE 234 (264)
T PF09471_consen 216 FKQVVVHEFGHSFGGLADE 234 (264)
T ss_dssp HHHHHHHHHHHHTT-----
T ss_pred ccceeeeeccccccccccc
Confidence 3678999999999887654
No 62
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=56.04 E-value=7.3 Score=33.20 Aligned_cols=20 Identities=15% Similarity=0.255 Sum_probs=14.9
Q ss_pred HHHHHhHHhHHHhHHhhcCC
Q psy12854 36 VENLFFHFGSLLQRSLTATH 55 (265)
Q Consensus 36 v~tLfHEfGHalH~lls~~~ 55 (265)
+..+.||+||.+=.-....+
T Consensus 10 ~~v~iHElGH~~~Ar~~Gv~ 29 (182)
T cd06163 10 ILIFVHELGHFLVAKLFGVK 29 (182)
T ss_pred HHHHHHHHHHHHHHHHcCCe
Confidence 45789999998877665443
No 63
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=53.46 E-value=8.5 Score=33.33 Aligned_cols=17 Identities=18% Similarity=0.190 Sum_probs=12.6
Q ss_pred HHHHhHHhHHHhHHhhc
Q psy12854 37 ENLFFHFGSLLQRSLTA 53 (265)
Q Consensus 37 ~tLfHEfGHalH~lls~ 53 (265)
..++||+||++=...-.
T Consensus 40 ~v~iHElgH~~~A~~~G 56 (208)
T cd06161 40 SVLLHELGHALVARRYG 56 (208)
T ss_pred HHHHHHHHHHHHHHHcC
Confidence 56899999987655443
No 64
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=52.77 E-value=7.1 Score=34.64 Aligned_cols=16 Identities=13% Similarity=-0.118 Sum_probs=12.4
Q ss_pred HHHHHhHHhHHHhHHh
Q psy12854 36 VENLFFHFGSLLQRSL 51 (265)
Q Consensus 36 v~tLfHEfGHalH~ll 51 (265)
+-|+.||+||||=..-
T Consensus 121 ~Gti~HEl~HalGf~H 136 (230)
T cd04282 121 KATVEHEFLHALGFYH 136 (230)
T ss_pred CchHHHHHHHHhCCcc
Confidence 4689999999985433
No 65
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=51.84 E-value=10 Score=32.97 Aligned_cols=18 Identities=11% Similarity=0.189 Sum_probs=14.1
Q ss_pred CcccHHHHHhHHhHHHhH
Q psy12854 32 TFKQVENLFFHFGSLLQR 49 (265)
Q Consensus 32 ~~~~v~tLfHEfGHalH~ 49 (265)
+...+..-+||.|||+|+
T Consensus 89 Sia~~aVAAHEVGHAiQd 106 (226)
T COG2738 89 SIAAIAVAAHEVGHAIQD 106 (226)
T ss_pred cHHHHHHHHHHhhHHHhh
Confidence 345566779999999996
No 66
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=50.76 E-value=8.3 Score=32.87 Aligned_cols=21 Identities=19% Similarity=0.366 Sum_probs=14.8
Q ss_pred cCcccHH-HHHhHHhHHHhHHh
Q psy12854 31 LTFKQVE-NLFFHFGSLLQRSL 51 (265)
Q Consensus 31 l~~~~v~-tLfHEfGHalH~ll 51 (265)
++-+|+. .|+||+||..|.=.
T Consensus 84 ~~~~el~aVlaHElgH~~~~h~ 105 (226)
T PF01435_consen 84 LSEDELAAVLAHELGHIKHRHI 105 (226)
T ss_dssp SSHHHHHHHHHHHHHHHHTTHC
T ss_pred ccHHHHHHHHHHHHHHHHcCCc
Confidence 3445654 78999999987533
No 67
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=50.63 E-value=15 Score=31.12 Aligned_cols=15 Identities=13% Similarity=0.115 Sum_probs=11.7
Q ss_pred cccHHHHHhHHhHHH
Q psy12854 33 FKQVENLFFHFGSLL 47 (265)
Q Consensus 33 ~~~v~tLfHEfGHal 47 (265)
..-..+++||+||.+
T Consensus 129 ~~~a~~~AHelGH~l 143 (199)
T PF01421_consen 129 LSFAVIIAHELGHNL 143 (199)
T ss_dssp HHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHhc
Confidence 345678999999965
No 68
>PF07607 DUF1570: Protein of unknown function (DUF1570); InterPro: IPR011464 This entry represents hypothetical proteins confined to bacteria.
Probab=47.69 E-value=19 Score=29.06 Aligned_cols=41 Identities=2% Similarity=-0.112 Sum_probs=27.8
Q ss_pred cHHHHHhHHhHHHhHHhhcCCC-CCCcccCCccchhHhhchHHHHHHh
Q psy12854 35 QVENLFFHFGSLLQRSLTATHY-SDVSGVNNVEWDSVYIINYFLTHFL 81 (265)
Q Consensus 35 ~v~tLfHEfGHalH~lls~~~~-~~~sg~~~~~~d~~E~pS~~~E~~~ 81 (265)
++.|++|| +.|+++-++.. +.++ .+|+-+.|=-.+++|..-
T Consensus 1 ~~~T~~HE---a~HQl~~N~Gl~~r~~---~~P~Wv~EGlA~yFE~~~ 42 (128)
T PF07607_consen 1 NIATIAHE---ATHQLAFNTGLHPRLA---DWPRWVSEGLATYFETPG 42 (128)
T ss_pred CchHHHHH---HHHHHHHHccccccCC---CCchHHHHhHHHHcCCCc
Confidence 57899999 67777766542 3333 478888887777777553
No 69
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=46.98 E-value=12 Score=33.83 Aligned_cols=16 Identities=6% Similarity=0.056 Sum_probs=12.0
Q ss_pred HHHHHhHHhHHHhHHh
Q psy12854 36 VENLFFHFGSLLQRSL 51 (265)
Q Consensus 36 v~tLfHEfGHalH~ll 51 (265)
+..+.||+||++=...
T Consensus 119 isv~iHElgHa~~Ar~ 134 (263)
T cd06159 119 VGVVVHELSHGILARV 134 (263)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3467899999986544
No 70
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=46.20 E-value=4 Score=35.09 Aligned_cols=15 Identities=13% Similarity=0.312 Sum_probs=12.4
Q ss_pred cccHHHHHhHHhHHH
Q psy12854 33 FKQVENLFFHFGSLL 47 (265)
Q Consensus 33 ~~~v~tLfHEfGHal 47 (265)
+..+.+++||+||.|
T Consensus 138 ~~~a~~~aHElGH~L 152 (207)
T cd04273 138 LSSAFTIAHELGHVL 152 (207)
T ss_pred ceeEEeeeeechhhc
Confidence 445788999999987
No 71
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=44.41 E-value=14 Score=32.51 Aligned_cols=16 Identities=19% Similarity=0.214 Sum_probs=11.7
Q ss_pred HHHHHhHHhHHHhHHh
Q psy12854 36 VENLFFHFGSLLQRSL 51 (265)
Q Consensus 36 v~tLfHEfGHalH~ll 51 (265)
+..+.||+||++=...
T Consensus 54 ~~v~iHElgH~~~A~~ 69 (227)
T cd06164 54 ASVLLHELGHSLVARR 69 (227)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4557999999876443
No 72
>PF00427 PBS_linker_poly: Phycobilisome Linker polypeptide; InterPro: IPR001297 Phycobilisomes (PBSs) are the major light-harvesting systems in cyanobacteria and red algae. PBS is a supercomplex that is composed of a core complex and multiple peripheral rod complexes. Typically, the core consists of two or five cylinders lying on the membrane with, in most cases, multiple rods radiating from the core to form a hemidiscoidal structure. The building units of the core cylinders and the peripheral rods are trimeric and hexameric discs, in which a monomer consists of a pair of related phycobiliproteins (PBPs), such as phycorerythrins, phycoerythrocyanins, phycocyanins, and allophycocyanins. The discs are connected to each other via specific linker polypeptides to form peripheral rods or core cylinders. Linker polypeptides share a conserved domain of ~180 residues, which can be present in one or multiple copies [, , , , ].; GO: 0015979 photosynthesis, 0030089 phycobilisome; PDB: 2L8V_A 2KY4_A 3OSJ_D 2L06_A 3NPH_B 2L3W_A 3PRU_C 3OHW_A.
Probab=43.49 E-value=14 Score=29.94 Aligned_cols=31 Identities=19% Similarity=0.292 Sum_probs=20.6
Q ss_pred HHHHHHHhcC-CCCccHHHHHHHhhCCCCCch
Q psy12854 226 SRLRDTFLTF-GGSCHSSEVFRRFRGRDPCFK 256 (265)
Q Consensus 226 ~~yR~~iL~~-Ggs~~~~e~l~~flGr~ps~~ 256 (265)
+.||+.++.+ ...+-..-.++.+|||+|...
T Consensus 58 ~~yr~~f~~~~~~~R~iEl~~khlLGR~p~~~ 89 (131)
T PF00427_consen 58 ELYRKRFFEPNSNYRFIELAFKHLLGRAPYNQ 89 (131)
T ss_dssp HHHHHHHTTTS-HHHHHHHHHHHHCSS--SSH
T ss_pred HHHHHHHcccccchHHHHHHHHHHhCCCCCCH
Confidence 5688888874 344455668999999999843
No 73
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=43.13 E-value=12 Score=33.54 Aligned_cols=20 Identities=30% Similarity=0.433 Sum_probs=15.2
Q ss_pred ccCcccHH-HHHhHHhHHHhH
Q psy12854 30 LLTFKQVE-NLFFHFGSLLQR 49 (265)
Q Consensus 30 ll~~~~v~-tLfHEfGHalH~ 49 (265)
.|+-+|+. .|+||+||..|.
T Consensus 151 ~l~~dEl~aVlaHElgHi~~r 171 (302)
T COG0501 151 LLNDDELEAVLAHELGHIKNR 171 (302)
T ss_pred hCCHHHHHHHHHHHHHHHhcc
Confidence 45667765 489999998874
No 74
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=42.63 E-value=13 Score=34.61 Aligned_cols=14 Identities=14% Similarity=0.344 Sum_probs=11.2
Q ss_pred ccHHHHHhHHhHHH
Q psy12854 34 KQVENLFFHFGSLL 47 (265)
Q Consensus 34 ~~v~tLfHEfGHal 47 (265)
.+..++|||+||-+
T Consensus 149 ~~~~~~~HElgHN~ 162 (314)
T PF05548_consen 149 QDWATIMHELGHNL 162 (314)
T ss_pred ccHHHHHHHhhhhc
Confidence 45679999999954
No 75
>PF10462 Peptidase_M66: Peptidase M66; InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=41.95 E-value=14 Score=34.18 Aligned_cols=18 Identities=11% Similarity=0.216 Sum_probs=13.4
Q ss_pred ccCcccHHHHHhHHhHHH
Q psy12854 30 LLTFKQVENLFFHFGSLL 47 (265)
Q Consensus 30 ll~~~~v~tLfHEfGHal 47 (265)
.+....-.++.||+||++
T Consensus 188 t~~~s~~~~f~HE~GH~~ 205 (305)
T PF10462_consen 188 TLDYSYGNEFSHELGHNF 205 (305)
T ss_dssp E-SS-SHHHHHHHHHHTT
T ss_pred ecccCccceeehhhhhhc
Confidence 346667889999999986
No 76
>PRK03982 heat shock protein HtpX; Provisional
Probab=40.37 E-value=14 Score=33.56 Aligned_cols=19 Identities=11% Similarity=0.163 Sum_probs=14.0
Q ss_pred cCcccH-HHHHhHHhHHHhH
Q psy12854 31 LTFKQV-ENLFFHFGSLLQR 49 (265)
Q Consensus 31 l~~~~v-~tLfHEfGHalH~ 49 (265)
++-+|+ .-++||+||.-|.
T Consensus 120 l~~~El~AVlAHElgHi~~~ 139 (288)
T PRK03982 120 LNEDELEGVIAHELTHIKNR 139 (288)
T ss_pred CCHHHHHHHHHHHHHHHHcC
Confidence 455664 4589999998874
No 77
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=40.27 E-value=18 Score=33.07 Aligned_cols=16 Identities=6% Similarity=0.154 Sum_probs=12.1
Q ss_pred HHHHHhHHhHHHhHHh
Q psy12854 36 VENLFFHFGSLLQRSL 51 (265)
Q Consensus 36 v~tLfHEfGHalH~ll 51 (265)
+.-+.||+||++=...
T Consensus 136 isvvvHElgHal~A~~ 151 (277)
T cd06162 136 ISGVVHEMGHGVAAVR 151 (277)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3567899999986544
No 78
>PF13402 M60-like: Peptidase M60-like family; PDB: 4FCA_A.
Probab=39.49 E-value=16 Score=33.12 Aligned_cols=16 Identities=13% Similarity=0.034 Sum_probs=11.0
Q ss_pred cHHHHHhHHhHHHhHH
Q psy12854 35 QVENLFFHFGSLLQRS 50 (265)
Q Consensus 35 ~v~tLfHEfGHalH~l 50 (265)
+..=++||+||.++.-
T Consensus 219 ~~WG~~HE~GH~~Q~~ 234 (307)
T PF13402_consen 219 GGWGPWHELGHNHQQG 234 (307)
T ss_dssp H-HHHHHHHHHHH-BG
T ss_pred CCeeehhhhhhhcCcc
Confidence 3456899999999843
No 79
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=39.05 E-value=17 Score=30.88 Aligned_cols=17 Identities=12% Similarity=0.204 Sum_probs=11.9
Q ss_pred cCcccHHHHHhHHhHHH
Q psy12854 31 LTFKQVENLFFHFGSLL 47 (265)
Q Consensus 31 l~~~~v~tLfHEfGHal 47 (265)
+.-.-+.+..||+||.+
T Consensus 121 ~~~R~~k~~~HElGH~l 137 (179)
T PRK13267 121 FEERVRKEVTHELGHTL 137 (179)
T ss_pred HHHHHHHHHHHHHHHHc
Confidence 33344566899999975
No 80
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=38.78 E-value=4.9 Score=35.48 Aligned_cols=17 Identities=12% Similarity=0.198 Sum_probs=13.2
Q ss_pred CcccHHHHHhHHhHHHh
Q psy12854 32 TFKQVENLFFHFGSLLQ 48 (265)
Q Consensus 32 ~~~~v~tLfHEfGHalH 48 (265)
..+.-.|+.||+||.|-
T Consensus 134 ~~n~g~t~~HEvGH~lG 150 (225)
T cd04275 134 PYNLGDTATHEVGHWLG 150 (225)
T ss_pred cccccceeEEeccceee
Confidence 34556799999999764
No 81
>PRK03001 M48 family peptidase; Provisional
Probab=37.89 E-value=16 Score=33.10 Aligned_cols=19 Identities=11% Similarity=0.282 Sum_probs=14.4
Q ss_pred cCcccHH-HHHhHHhHHHhH
Q psy12854 31 LTFKQVE-NLFFHFGSLLQR 49 (265)
Q Consensus 31 l~~~~v~-tLfHEfGHalH~ 49 (265)
++-+|+. -++||+||.-|.
T Consensus 119 l~~~El~aVlAHElgHi~~~ 138 (283)
T PRK03001 119 LSEREIRGVMAHELAHVKHR 138 (283)
T ss_pred CCHHHHHHHHHHHHHHHhCC
Confidence 4666644 589999998875
No 82
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=37.43 E-value=22 Score=30.35 Aligned_cols=17 Identities=12% Similarity=0.006 Sum_probs=11.9
Q ss_pred HHHHhHHhHHHhHHhhc
Q psy12854 37 ENLFFHFGSLLQRSLTA 53 (265)
Q Consensus 37 ~tLfHEfGHalH~lls~ 53 (265)
..+.||+||++=...-.
T Consensus 43 ~l~iHElgH~~~A~~~G 59 (183)
T cd06160 43 ILGIHEMGHYLAARRHG 59 (183)
T ss_pred HHHHHHHHHHHHHHHCC
Confidence 34689999987654443
No 83
>KOG3714|consensus
Probab=37.35 E-value=16 Score=35.12 Aligned_cols=20 Identities=5% Similarity=-0.066 Sum_probs=15.9
Q ss_pred HHHHHhHHhHHHhHHhhcCC
Q psy12854 36 VENLFFHFGSLLQRSLTATH 55 (265)
Q Consensus 36 v~tLfHEfGHalH~lls~~~ 55 (265)
.-|..||+||||=..-.+++
T Consensus 160 ~G~i~HEl~HaLGf~HehsR 179 (411)
T KOG3714|consen 160 FGTIVHELMHALGFWHEHSR 179 (411)
T ss_pred CchhHHHHHHHhhhhhccCc
Confidence 66889999999977666554
No 84
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=36.83 E-value=23 Score=29.20 Aligned_cols=19 Identities=5% Similarity=0.105 Sum_probs=14.1
Q ss_pred CccCccc-HHHHHhHHhHHH
Q psy12854 29 CLLTFKQ-VENLFFHFGSLL 47 (265)
Q Consensus 29 ~ll~~~~-v~tLfHEfGHal 47 (265)
..-.+.. +-.++|||+|++
T Consensus 128 ~f~~~~~~lDVvaHEltHGV 147 (150)
T PF01447_consen 128 IFKPFASSLDVVAHELTHGV 147 (150)
T ss_dssp SBS-GGG-HHHHHHHHHHHH
T ss_pred ccccCccccceeeecccccc
Confidence 4445554 889999999987
No 85
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=36.09 E-value=21 Score=34.33 Aligned_cols=17 Identities=24% Similarity=0.337 Sum_probs=12.9
Q ss_pred HHHHHhHHhHHHhHHhh
Q psy12854 36 VENLFFHFGSLLQRSLT 52 (265)
Q Consensus 36 v~tLfHEfGHalH~lls 52 (265)
+..+.||+||.+=.-.-
T Consensus 15 ~~v~~HE~gH~~~a~~~ 31 (420)
T TIGR00054 15 VLIFVHELGHFLAARLC 31 (420)
T ss_pred HHHHHHhHHHHHHHHHc
Confidence 56789999998766543
No 86
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=35.80 E-value=21 Score=30.81 Aligned_cols=22 Identities=14% Similarity=0.209 Sum_probs=16.5
Q ss_pred CCCccCcccHHHHHhHHhHHHh
Q psy12854 27 KPCLLTFKQVENLFFHFGSLLQ 48 (265)
Q Consensus 27 ~p~ll~~~~v~tLfHEfGHalH 48 (265)
.+.++.-+-+.+..||+||.+-
T Consensus 137 ~~~l~~~R~~Kea~HElGH~~G 158 (194)
T PF07998_consen 137 DEELFLERVCKEAVHELGHLFG 158 (194)
T ss_dssp SHHHHHHHHHHHHHHHHHHHTT
T ss_pred cHHHHHHHHHHHHHHHHHHHcC
Confidence 3455556667899999999764
No 87
>PRK02870 heat shock protein HtpX; Provisional
Probab=35.72 E-value=19 Score=33.83 Aligned_cols=20 Identities=5% Similarity=0.102 Sum_probs=14.7
Q ss_pred ccCcccHH-HHHhHHhHHHhH
Q psy12854 30 LLTFKQVE-NLFFHFGSLLQR 49 (265)
Q Consensus 30 ll~~~~v~-tLfHEfGHalH~ 49 (265)
.|+-+|+. -++||+||.-|.
T Consensus 167 ~L~~dEL~aVlAHELgHik~~ 187 (336)
T PRK02870 167 KLDRDELQAVMAHELSHIRHG 187 (336)
T ss_pred hCCHHHHHHHHHHHHHHHHcc
Confidence 34666654 579999999875
No 88
>PF01742 Peptidase_M27: Clostridial neurotoxin zinc protease This family is a subset of the Prosite family; InterPro: IPR000395 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M27 (clan MA(E)). A number of the proteins have been classified as non-peptidase homologues as they have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity of peptidases in the family. There are seven antigenically distinct forms of botulinum neurotoxin, designated A, B, C1, D, E, F and G. The seven neurotoxins are potent protein toxins that inhibit neurotransmitter release from peripheral cholinergic synapses []. On binding to the neuronal synapses, the molecules are internalised and move by retrograde transport up the axon into the spinal cord, where they can move between post- and presynaptic neurons. The toxin inhibits neurotransmitter release by acting as a zinc endopeptidase that cleaves synaptic proteins such as synaptobrevins, syntaxin and SNAP-25 []. The protein toxins exist as disulphide-linked heterodimers of light and heavy chains. The light chain has the pharmacological activity, while the N- and C-termini of the heavy chain mediate channel formation and toxin binding []. The light chain exhibits a high level of sequence similarity to tetanus toxin (TeTx). Alignment of all characterised neurotoxin sequences reveals the presence of highly conserved amino acid domains interspersed with amino acid tracts with little overall similarity. The most divergent region corresponds to the C-terminal extremity of each toxin, which may reflect differences in specificity of binding to neurone acceptor sites []. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0009405 pathogenesis; PDB: 2QN0_A 3D3X_A 3NF3_A 2ISG_A 2ISE_B 2IMC_B 2IMA_B 2ILP_A 2IMB_A 2ISH_A ....
Probab=35.03 E-value=28 Score=33.26 Aligned_cols=24 Identities=8% Similarity=-0.029 Sum_probs=18.3
Q ss_pred cCcccHHHHHhHHhHHHhHHhhcC
Q psy12854 31 LTFKQVENLFFHFGSLLQRSLTAT 54 (265)
Q Consensus 31 l~~~~v~tLfHEfGHalH~lls~~ 54 (265)
.-.|=..+|+||+=|+||.|..-.
T Consensus 210 F~~DPAl~LmheLIh~Lh~LYGik 233 (408)
T PF01742_consen 210 FYADPALELMHELIHSLHGLYGIK 233 (408)
T ss_dssp EE--HHHHHHHHHHHHHHHHTTTH
T ss_pred eecCHHHHHHHHHHHHHHhhcccc
Confidence 334557899999999999998753
No 89
>PRK01345 heat shock protein HtpX; Provisional
Probab=34.97 E-value=19 Score=33.32 Aligned_cols=20 Identities=15% Similarity=0.243 Sum_probs=14.1
Q ss_pred cCcccHH-HHHhHHhHHHhHH
Q psy12854 31 LTFKQVE-NLFFHFGSLLQRS 50 (265)
Q Consensus 31 l~~~~v~-tLfHEfGHalH~l 50 (265)
++-+|+. .++||+||..|.=
T Consensus 119 L~~dEL~aVlAHElgHi~~~d 139 (317)
T PRK01345 119 LSPEEVAGVMAHELAHVKNRD 139 (317)
T ss_pred CCHHHHHHHHHHHHHHHHcCC
Confidence 3456644 5899999988753
No 90
>PRK04897 heat shock protein HtpX; Provisional
Probab=34.92 E-value=19 Score=32.96 Aligned_cols=19 Identities=5% Similarity=0.086 Sum_probs=13.7
Q ss_pred cCcccHH-HHHhHHhHHHhH
Q psy12854 31 LTFKQVE-NLFFHFGSLLQR 49 (265)
Q Consensus 31 l~~~~v~-tLfHEfGHalH~ 49 (265)
++-+|+. -++||+||.-|.
T Consensus 132 l~~~El~aVlAHElgHi~~~ 151 (298)
T PRK04897 132 MNREELEGVIGHEISHIRNY 151 (298)
T ss_pred CCHHHHHHHHHHHHHHHhcC
Confidence 3556644 589999998864
No 91
>PRK05457 heat shock protein HtpX; Provisional
Probab=31.38 E-value=24 Score=32.14 Aligned_cols=19 Identities=16% Similarity=0.223 Sum_probs=14.0
Q ss_pred cCcccHH-HHHhHHhHHHhH
Q psy12854 31 LTFKQVE-NLFFHFGSLLQR 49 (265)
Q Consensus 31 l~~~~v~-tLfHEfGHalH~ 49 (265)
|+.+|+. -++||+||.-|.
T Consensus 129 L~~~El~aVlAHElgHi~~~ 148 (284)
T PRK05457 129 MSRDEVEAVLAHEISHIANG 148 (284)
T ss_pred CCHHHHHHHHHHHHHHHHcC
Confidence 4556654 579999998774
No 92
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=31.29 E-value=28 Score=33.69 Aligned_cols=15 Identities=20% Similarity=0.208 Sum_probs=11.3
Q ss_pred HHHHHhHHhHHHhHH
Q psy12854 36 VENLFFHFGSLLQRS 50 (265)
Q Consensus 36 v~tLfHEfGHalH~l 50 (265)
+..++||+||-+=.-
T Consensus 16 ~li~vHElGHfl~Ar 30 (449)
T PRK10779 16 VLITVHEFGHFWVAR 30 (449)
T ss_pred HHHHHHHHHHHHHHH
Confidence 456799999987553
No 93
>PF05547 Peptidase_M6: Immune inhibitor A peptidase M6; InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=30.82 E-value=7.8 Score=39.51 Aligned_cols=13 Identities=23% Similarity=0.322 Sum_probs=11.1
Q ss_pred cHHHHHhHHhHHH
Q psy12854 35 QVENLFFHFGSLL 47 (265)
Q Consensus 35 ~v~tLfHEfGHal 47 (265)
-|-+++|||||+|
T Consensus 221 giGVfaHEfGH~L 233 (645)
T PF05547_consen 221 GIGVFAHEFGHDL 233 (645)
T ss_pred ceEEEEeeccccC
Confidence 3678999999987
No 94
>PRK03072 heat shock protein HtpX; Provisional
Probab=30.80 E-value=25 Score=32.10 Aligned_cols=19 Identities=16% Similarity=0.238 Sum_probs=13.9
Q ss_pred cCcccHH-HHHhHHhHHHhH
Q psy12854 31 LTFKQVE-NLFFHFGSLLQR 49 (265)
Q Consensus 31 l~~~~v~-tLfHEfGHalH~ 49 (265)
++.+|+. -++||+||.-|.
T Consensus 122 l~~~El~aVlAHElgHi~~~ 141 (288)
T PRK03072 122 LNERELRGVLGHELSHVYNR 141 (288)
T ss_pred CCHHHHHHHHHHHHHHHhcC
Confidence 4556644 589999998874
No 95
>KOG2921|consensus
Probab=29.74 E-value=31 Score=33.14 Aligned_cols=34 Identities=18% Similarity=0.214 Sum_probs=23.3
Q ss_pred CCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCC
Q psy12854 22 EPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATH 55 (265)
Q Consensus 22 ~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~ 55 (265)
-|-.+-+-.|+--=+.-.+||+|||+-...-..+
T Consensus 118 Lpl~~I~yf~t~lvi~~vvHElGHalAA~segV~ 151 (484)
T KOG2921|consen 118 LPLSGIAYFLTSLVITVVVHELGHALAAASEGVQ 151 (484)
T ss_pred cccccchhhhhhHHHHHHHHHhhHHHHHHhcCce
Confidence 3334445556666677789999999987765544
No 96
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=29.72 E-value=24 Score=32.12 Aligned_cols=11 Identities=18% Similarity=0.407 Sum_probs=9.7
Q ss_pred HHHHHhHHhHH
Q psy12854 36 VENLFFHFGSL 46 (265)
Q Consensus 36 v~tLfHEfGHa 46 (265)
..|++||+||.
T Consensus 204 yaTl~HElghw 214 (316)
T COG4227 204 YATLLHELGHW 214 (316)
T ss_pred HHHHHHHhccc
Confidence 68999999984
No 97
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=29.28 E-value=26 Score=30.76 Aligned_cols=22 Identities=18% Similarity=0.081 Sum_probs=17.0
Q ss_pred CCCccCcccHHHHHhHHhHHHh
Q psy12854 27 KPCLLTFKQVENLFFHFGSLLQ 48 (265)
Q Consensus 27 ~p~ll~~~~v~tLfHEfGHalH 48 (265)
.|..+.-+=+.|.-||+|||+=
T Consensus 179 ~pg~~~e~L~~tarhElGhaLg 200 (236)
T COG5549 179 PPGELRENLNPTARHELGHALG 200 (236)
T ss_pred CcccchhhhhHHHHHhhcchhe
Confidence 3555566668899999999984
No 98
>PRK02391 heat shock protein HtpX; Provisional
Probab=27.89 E-value=30 Score=31.74 Aligned_cols=19 Identities=16% Similarity=0.200 Sum_probs=13.5
Q ss_pred cCcccH-HHHHhHHhHHHhH
Q psy12854 31 LTFKQV-ENLFFHFGSLLQR 49 (265)
Q Consensus 31 l~~~~v-~tLfHEfGHalH~ 49 (265)
|+-+|+ .-++||+||.-|.
T Consensus 128 L~~~El~aVlaHElgHi~~~ 147 (296)
T PRK02391 128 LDPDELEAVLAHELSHVKNR 147 (296)
T ss_pred CCHHHHHHHHHHHHHHHHcC
Confidence 344554 4589999998775
No 99
>PF04539 Sigma70_r3: Sigma-70 region 3; InterPro: IPR007624 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ]. Region 3 forms a discrete compact three helical domain within the sigma-factor. Region is not normally involved in the recognition of promoter DNA, but in some specific bacterial promoters containing an extended -10 promoter element, residues within region 3 play an important role. Region 3 primarily is involved in binding the core RNA polymerase in the holoenzyme [].; GO: 0003677 DNA binding, 0003700 sequence-specific DNA binding transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, DNA-dependent, 0006355 regulation of transcription, DNA-dependent; PDB: 1L0O_C 1KU2_B 1RP3_C 1SC5_A 1TTY_A 2BE5_F 2A6E_F 2CW0_F 2A69_P 2A6H_P ....
Probab=27.88 E-value=50 Score=23.28 Aligned_cols=24 Identities=13% Similarity=0.260 Sum_probs=17.5
Q ss_pred HHHHHHhhCCCCCchHHHHhcCCC
Q psy12854 242 SEVFRRFRGRDPCFKPFLDMFRLN 265 (265)
Q Consensus 242 ~e~l~~flGr~ps~~a~~~~~g~~ 265 (265)
..-|..=+||+|+.+-..+.+|+.
T Consensus 10 ~~~L~~~lgr~Pt~eEiA~~lgis 33 (78)
T PF04539_consen 10 RRELEQELGREPTDEEIAEELGIS 33 (78)
T ss_dssp HHHHHHHHSS--BHHHHHHHHTS-
T ss_pred HHHHHHHhCCCCCHHHHHHHHccc
Confidence 456777889999999999998874
No 100
>PRK01265 heat shock protein HtpX; Provisional
Probab=27.05 E-value=32 Score=32.12 Aligned_cols=19 Identities=21% Similarity=0.277 Sum_probs=14.2
Q ss_pred cCcccHH-HHHhHHhHHHhH
Q psy12854 31 LTFKQVE-NLFFHFGSLLQR 49 (265)
Q Consensus 31 l~~~~v~-tLfHEfGHalH~ 49 (265)
++.+|+. .++||+||.-|.
T Consensus 135 l~~~El~aVlAHElgHik~~ 154 (324)
T PRK01265 135 LNRDEIKAVAGHELGHLKHR 154 (324)
T ss_pred CCHHHHHHHHHHHHHHHHcc
Confidence 4567754 589999998775
No 101
>PHA00527 hypothetical protein
Probab=26.89 E-value=1.5e+02 Score=23.18 Aligned_cols=22 Identities=5% Similarity=-0.047 Sum_probs=18.8
Q ss_pred HHHHHhHHhHHHhHHhhcCCCC
Q psy12854 36 VENLFFHFGSLLQRSLTATHYS 57 (265)
Q Consensus 36 v~tLfHEfGHalH~lls~~~~~ 57 (265)
+.||.||--|.--.++++..-.
T Consensus 74 ~~T~~HECAH~AF~vC~~VGV~ 95 (129)
T PHA00527 74 AATLVHECAHVAFYVCRDVGVT 95 (129)
T ss_pred HHHHHHHHHHHHHHHHHhcCcc
Confidence 5799999999999999986543
No 102
>TIGR00488 putative HD superfamily hydrolase of NAD metabolism. The function of this protein family is unknown. Members of this family of uncharacterized proteins from the Mycoplasmas are longer at the amino end, fused to a region of nicotinamide nucleotide adenylyltransferase, an NAD salvage biosynthesis enzyme. Members are putative metal-dependent phosphohydrolases for NAD metabolism.
Probab=25.35 E-value=3.7e+02 Score=21.76 Aligned_cols=98 Identities=13% Similarity=0.216 Sum_probs=52.5
Q ss_pred cHHHHHhHHhHHHhH--Hh---hcCCCC-CCcccCCccchhHhhchHHHHHHh-c-cHHHHHHHHhhhcCCcccHH--H-
Q psy12854 35 QVENLFFHFGSLLQR--SL---TATHYS-DVSGVNNVEWDSVYIINYFLTHFL-Y-EERVFAELNSHFATGEKLKM--T- 103 (265)
Q Consensus 35 ~v~tLfHEfGHalH~--ll---s~~~~~-~~sg~~~~~~d~~E~pS~~~E~~~-~-~~~~L~~ls~h~~t~~~lp~--~- 103 (265)
-+.-|.|.+|..+-. ++ .+...+ .... .....+-.|+-..+++.|. . +++++..+..|....+..+. .
T Consensus 35 ~~AGLLHDIGk~~~~~~~~~~~~~~~~~~~~~~-~~~~~~H~~vGa~ll~~w~~~~~~~i~~aI~~H~~~~~~~~~l~~i 113 (158)
T TIGR00488 35 EIAGAYHDLAKFLPKEQLKQIAKREKMPAHLLY-PSPKLLHAYVGAYILKREFGVQDEDILDAIRNHTSGPPGMSLLDMI 113 (158)
T ss_pred HHHHHHHHHhccCCHHHHHHHHHHcCCCchhhc-ccccccHHHHHHHHHHHHhCCCcHHHHHHHHHhCCCCCCCCHHHHH
Confidence 356789999984321 11 111100 0100 1134568899999999996 5 66788888777543332221 1
Q ss_pred --HH-HHHHHHhhhhhHHHHHHHHHHHHHHHHh
Q psy12854 104 --DE-QLKALRAHNAGIDVCSELFKANLDLQLH 133 (265)
Q Consensus 104 --~~-~l~~~~~~~~~~~~~~ql~~a~~D~~lh 133 (265)
++ .+...+....+.+-+|.+.+-.+|..+.
T Consensus 114 V~lAD~i~~~~~~~~~~~~~~~~~~~~l~~~~~ 146 (158)
T TIGR00488 114 IYVADKLEPNRGAGIEIDELRKLAKTDLKQAYI 146 (158)
T ss_pred HHhHHHHhhcccccccHHHHHHHHHHHHHHHHH
Confidence 12 2222222344556667776666666554
No 103
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=24.60 E-value=1e+02 Score=24.49 Aligned_cols=13 Identities=15% Similarity=0.130 Sum_probs=11.4
Q ss_pred HHHHhHHhHHHhH
Q psy12854 37 ENLFFHFGSLLQR 49 (265)
Q Consensus 37 ~tLfHEfGHalH~ 49 (265)
.||.||+-|+-|.
T Consensus 81 ~TL~HEL~H~WQ~ 93 (141)
T PHA02456 81 DTLAHELNHAWQF 93 (141)
T ss_pred HHHHHHHHHHHhh
Confidence 5899999999884
No 104
>KOG2719|consensus
Probab=24.34 E-value=47 Score=32.23 Aligned_cols=18 Identities=17% Similarity=0.139 Sum_probs=13.5
Q ss_pred HHHHHhHHhHHHhHHhhc
Q psy12854 36 VENLFFHFGSLLQRSLTA 53 (265)
Q Consensus 36 v~tLfHEfGHalH~lls~ 53 (265)
+..++||+||--|.-..+
T Consensus 281 ~AVl~HELGHW~~~H~~K 298 (428)
T KOG2719|consen 281 VAVLAHELGHWKLNHVLK 298 (428)
T ss_pred HHHHHHHhhHHHHhhHHH
Confidence 557899999988764443
No 105
>PHA00657 crystallin beta/gamma motif-containing protein
Probab=24.30 E-value=50 Score=36.42 Aligned_cols=24 Identities=13% Similarity=0.202 Sum_probs=19.9
Q ss_pred CCccCcccHHHHHhHHhHHHhHHh
Q psy12854 28 PCLLTFKQVENLFFHFGSLLQRSL 51 (265)
Q Consensus 28 p~ll~~~~v~tLfHEfGHalH~ll 51 (265)
-+|+.-.|+.|+.||+||.+--++
T Consensus 784 I~l~KnAD~STFvHE~gH~fLE~~ 807 (2052)
T PHA00657 784 VSLLKNADLSTFLHESGHFFLEVQ 807 (2052)
T ss_pred EEEeccccHHHHHHHHHHHHHHHH
Confidence 467778999999999999966555
No 106
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=23.23 E-value=45 Score=28.43 Aligned_cols=20 Identities=25% Similarity=0.426 Sum_probs=15.2
Q ss_pred CCccCcccHHHHHhHHhHHH
Q psy12854 28 PCLLTFKQVENLFFHFGSLL 47 (265)
Q Consensus 28 p~ll~~~~v~tLfHEfGHal 47 (265)
+-|+.-+-+....||+||.+
T Consensus 117 ~~lf~ERv~KEv~HElGH~~ 136 (181)
T COG1913 117 RELFKERVVKEVLHELGHLL 136 (181)
T ss_pred hHHHHHHHHHHHHHHhhhhc
Confidence 34555666888999999976
No 107
>KOG1565|consensus
Probab=23.02 E-value=29 Score=34.13 Aligned_cols=18 Identities=11% Similarity=-0.047 Sum_probs=12.7
Q ss_pred cHHHHHhHHhHHHhHHhh
Q psy12854 35 QVENLFFHFGSLLQRSLT 52 (265)
Q Consensus 35 ~v~tLfHEfGHalH~lls 52 (265)
=..+.+||+||+|=..-|
T Consensus 211 l~~Va~HEiGH~LGL~HS 228 (469)
T KOG1565|consen 211 LFLVAAHEIGHALGLGHS 228 (469)
T ss_pred hHHHhhhhcccccccCCC
Confidence 345789999999854333
No 108
>PLN02791 Nudix hydrolase homolog
Probab=22.17 E-value=37 Score=35.49 Aligned_cols=20 Identities=0% Similarity=-0.175 Sum_probs=15.0
Q ss_pred ccHHHHHhHHhHHHhHHhhc
Q psy12854 34 KQVENLFFHFGSLLQRSLTA 53 (265)
Q Consensus 34 ~~v~tLfHEfGHalH~lls~ 53 (265)
..+.|++||+||.+=-++..
T Consensus 584 f~~~v~lHElgHGsG~~~~~ 603 (770)
T PLN02791 584 FFTHTICHECCHGIGPHTIT 603 (770)
T ss_pred HHHHHHHHHhhcccccccee
Confidence 34899999999987655543
No 109
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=22.04 E-value=51 Score=28.88 Aligned_cols=13 Identities=8% Similarity=-0.230 Sum_probs=10.5
Q ss_pred HHHHHhHHhHHHh
Q psy12854 36 VENLFFHFGSLLQ 48 (265)
Q Consensus 36 v~tLfHEfGHalH 48 (265)
...|+||++||.=
T Consensus 94 gsiLAHE~mHa~L 106 (212)
T PF12315_consen 94 GSILAHELMHAWL 106 (212)
T ss_pred hhHHHHHHHHHHh
Confidence 4568999999865
No 110
>PF13946 DUF4214: Domain of unknown function (DUF4214)
Probab=21.91 E-value=72 Score=22.86 Aligned_cols=21 Identities=14% Similarity=0.300 Sum_probs=16.3
Q ss_pred HHHHHHHhhCCCCCchHHHHh
Q psy12854 241 SSEVFRRFRGRDPCFKPFLDM 261 (265)
Q Consensus 241 ~~e~l~~flGr~ps~~a~~~~ 261 (265)
...+++..|||+|+.+.+--|
T Consensus 25 V~~lY~~~LgR~~D~~Gl~yw 45 (75)
T PF13946_consen 25 VNSLYQNILGREPDPEGLAYW 45 (75)
T ss_pred HHHHHHHHHCCCCCHHHHHHH
Confidence 356888889999988877555
No 111
>KOG2661|consensus
Probab=21.77 E-value=1.1e+02 Score=29.08 Aligned_cols=53 Identities=13% Similarity=-0.035 Sum_probs=28.3
Q ss_pred CcccHH-HHHhHHhHHHhHHhhc--CCCCCC--cccCCccchhHhhch-HHHHHHhccH
Q psy12854 32 TFKQVE-NLFFHFGSLLQRSLTA--THYSDV--SGVNNVEWDSVYIIN-YFLTHFLYEE 84 (265)
Q Consensus 32 ~~~~v~-tLfHEfGHalH~lls~--~~~~~~--sg~~~~~~d~~E~pS-~~~E~~~~~~ 84 (265)
.-++|. .|.||+||++-.--++ +.-..+ -|.--+.++.+-.|+ .+.|.++.-|
T Consensus 271 ~ddglAtvLgHE~aHaVarH~AEki~k~~~~siLgLvlyt~~~a~~~n~~Ll~~flrlP 329 (424)
T KOG2661|consen 271 DDDGLATVLGHEIAHAVARHAAEKIGKVHLLSILGLVLYTMIWAICPNDKLLEYFLRLP 329 (424)
T ss_pred ChHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhhccchHHHHHHHhcCc
Confidence 344544 4789999987443332 111111 121123566666777 6666666665
No 112
>KOG3607|consensus
Probab=21.54 E-value=48 Score=34.35 Aligned_cols=16 Identities=19% Similarity=0.108 Sum_probs=13.8
Q ss_pred CcccHHHHHhHHhHHH
Q psy12854 32 TFKQVENLFFHFGSLL 47 (265)
Q Consensus 32 ~~~~v~tLfHEfGHal 47 (265)
....+.+++||+||.|
T Consensus 320 ~~~~a~v~AhelgH~l 335 (716)
T KOG3607|consen 320 LLAFAVVLAHELGHNL 335 (716)
T ss_pred chhHHHHHHHHHHhhc
Confidence 6677899999999965
No 113
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=21.18 E-value=52 Score=32.60 Aligned_cols=15 Identities=7% Similarity=0.131 Sum_probs=12.8
Q ss_pred ccHHHHHhHHhHHHh
Q psy12854 34 KQVENLFFHFGSLLQ 48 (265)
Q Consensus 34 ~~v~tLfHEfGHalH 48 (265)
.-+.++.||+.|+|=
T Consensus 209 ~~~~~~~HEi~HaLG 223 (521)
T PF01457_consen 209 EFFRTVIHEIAHALG 223 (521)
T ss_dssp HHHHHHHHHHHHHTT
T ss_pred cccceeeeeeeeeee
Confidence 458899999999984
No 114
>PF02128 Peptidase_M36: Fungalysin metallopeptidase (M36); InterPro: IPR001842 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M36 (fungalysin family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Fungalysin is produced by fungi, Aspergillus and other species, to aid degradation of host lung cell walls on infection. The enzyme is a 42kDa single chain protein, with a pH optimum of 7.5-8.0 and optimal temperature of 60 celcius [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0005615 extracellular space
Probab=21.08 E-value=16 Score=34.82 Aligned_cols=40 Identities=18% Similarity=0.242 Sum_probs=28.2
Q ss_pred EEEEeccCCCCCCCCCccCc----------------ccHHHHHhHHhHHHhHHhhc
Q psy12854 14 NTLILNLTEPQYGKPCLLTF----------------KQVENLFFHFGSLLQRSLTA 53 (265)
Q Consensus 14 ~~l~~nf~~p~~~~p~ll~~----------------~~v~tLfHEfGHalH~lls~ 53 (265)
..-..||.-|+++.|..|+. -|-..++||.||.|+.-|+-
T Consensus 148 g~NNAnf~TPpDG~~prMqMylw~~~~~~~p~rDg~~D~~Ii~HEy~HGiSnRLvg 203 (378)
T PF02128_consen 148 GTNNANFATPPDGQPPRMQMYLWTPSTPASPNRDGDFDNGIIAHEYGHGISNRLVG 203 (378)
T ss_pred CcCCceeecCCCCCCceeeEEEecCCCCcCCCCCcccccCeeEEeecccccccccC
Confidence 35567888777766554443 24556899999999987765
No 115
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=20.59 E-value=56 Score=29.88 Aligned_cols=20 Identities=20% Similarity=0.130 Sum_probs=15.7
Q ss_pred ccHHHHHhHHhHHHhHHhhc
Q psy12854 34 KQVENLFFHFGSLLQRSLTA 53 (265)
Q Consensus 34 ~~v~tLfHEfGHalH~lls~ 53 (265)
.+--.++||.||-+++++.-
T Consensus 166 AqaYViAHEVGHHVQnllGi 185 (295)
T COG2321 166 AQAYVIAHEVGHHVQNLLGI 185 (295)
T ss_pred HHHHHHHhhhhHHHHHHhhh
Confidence 34446899999999998864
No 116
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=20.49 E-value=70 Score=25.77 Aligned_cols=17 Identities=6% Similarity=0.198 Sum_probs=15.1
Q ss_pred HHHHHhHHhHHHhHHhh
Q psy12854 36 VENLFFHFGSLLQRSLT 52 (265)
Q Consensus 36 v~tLfHEfGHalH~lls 52 (265)
..||.||+=|++..+..
T Consensus 61 ~~tL~HEm~H~~~~~~~ 77 (157)
T PF10263_consen 61 IDTLLHEMAHAAAYVFG 77 (157)
T ss_pred HHHHHHHHHHHHhhhcc
Confidence 67999999999998884
Done!