Query 018718
Match_columns 351
No_of_seqs 234 out of 1483
Neff 6.5
Searched_HMMs 46136
Date Fri Mar 29 03:27:37 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/018718.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/018718hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG0339 Dcp Zn-dependent oligo 100.0 6.3E-78 1.4E-82 616.1 21.5 330 9-346 56-544 (683)
2 PRK10280 dipeptidyl carboxypep 100.0 1.4E-73 3E-78 600.7 23.0 332 9-351 55-546 (681)
3 KOG2089 Metalloendopeptidase f 100.0 4.7E-72 1E-76 566.5 17.0 338 9-351 79-578 (718)
4 PRK10911 oligopeptidase A; Pro 100.0 1.5E-69 3.2E-74 571.3 23.1 335 9-350 49-545 (680)
5 KOG2090 Metalloendopeptidase f 100.0 5.8E-54 1.3E-58 430.5 21.8 309 33-350 106-560 (704)
6 cd06457 M3A_MIP Peptidase M3 m 100.0 3.2E-52 7E-57 423.0 14.7 209 137-350 23-329 (458)
7 cd06456 M3A_DCP_Oligopeptidase 100.0 1.3E-49 2.8E-54 399.9 14.3 203 139-350 1-289 (422)
8 cd06455 M3A_TOP Peptidase M3 T 100.0 1.3E-46 2.8E-51 383.3 17.8 243 90-348 5-342 (472)
9 PF01432 Peptidase_M3: Peptida 100.0 2.7E-43 5.9E-48 357.0 8.7 203 139-350 30-323 (458)
10 cd06258 Peptidase_M3_like The 99.9 2.7E-26 5.9E-31 226.2 8.4 171 142-339 2-221 (365)
11 TIGR02289 M3_not_pepF oligoend 99.9 2.9E-22 6.3E-27 208.1 10.2 202 90-324 120-380 (549)
12 cd06459 M3B_Oligoendopeptidase 99.9 2.7E-23 6E-28 207.9 2.0 164 129-324 58-265 (427)
13 TIGR00181 pepF oligoendopeptid 99.8 4.1E-21 8.8E-26 201.0 6.1 274 9-324 50-421 (591)
14 TIGR02290 M3_fam_3 oligoendope 99.7 6.1E-19 1.3E-23 184.6 2.0 152 141-324 223-417 (587)
15 COG1164 Oligoendopeptidase F [ 99.0 2.4E-10 5.1E-15 120.2 6.2 79 221-323 343-422 (598)
16 cd06461 M2_ACE Peptidase famil 97.7 5.8E-05 1.3E-09 77.8 5.8 83 226-328 212-298 (477)
17 COG0339 Dcp Zn-dependent oligo 95.7 0.085 1.8E-06 56.2 11.1 76 50-127 58-134 (683)
18 cd06460 M32_Taq Peptidase fami 93.6 0.028 6E-07 56.9 0.9 42 281-324 161-204 (396)
19 COG2856 Predicted Zn peptidase 89.3 0.21 4.5E-06 46.4 1.8 30 256-294 57-87 (213)
20 PF06114 DUF955: Domain of unk 85.6 0.39 8.4E-06 38.6 1.2 32 257-297 28-60 (122)
21 PF00413 Peptidase_M10: Matrix 85.0 0.39 8.5E-06 41.1 1.0 11 281-291 107-117 (154)
22 PF14247 DUF4344: Domain of un 84.9 0.47 1E-05 44.3 1.5 32 281-319 94-125 (220)
23 PRK10280 dipeptidyl carboxypep 80.4 6.6 0.00014 42.6 8.3 48 49-96 56-104 (681)
24 PRK10911 oligopeptidase A; Pro 77.6 10 0.00022 41.1 8.7 46 49-94 50-96 (680)
25 cd04277 ZnMc_serralysin_like Z 77.1 1.1 2.4E-05 40.0 1.0 12 281-292 115-126 (186)
26 PF13398 Peptidase_M50B: Pepti 76.8 1.5 3.2E-05 40.1 1.8 20 281-300 24-43 (200)
27 cd04279 ZnMc_MMP_like_1 Zinc-d 76.1 1.2 2.5E-05 38.7 0.9 11 281-291 106-116 (156)
28 cd04278 ZnMc_MMP Zinc-dependen 76.1 0.95 2.1E-05 39.3 0.3 11 281-291 109-119 (157)
29 cd04268 ZnMc_MMP_like Zinc-dep 76.0 1.2 2.7E-05 38.5 1.0 11 281-291 96-106 (165)
30 PF14891 Peptidase_M91: Effect 75.4 2.5 5.4E-05 37.7 2.8 18 281-298 105-122 (174)
31 PF13058 DUF3920: Protein of u 72.2 2 4.4E-05 35.8 1.3 13 281-293 78-90 (126)
32 smart00235 ZnMc Zinc-dependent 70.9 1.9 4.2E-05 36.4 1.0 11 281-291 88-98 (140)
33 cd04327 ZnMc_MMP_like_3 Zinc-d 70.4 2.1 4.4E-05 38.9 1.1 12 281-292 94-105 (198)
34 PF04228 Zn_peptidase: Putativ 69.1 2.6 5.6E-05 41.1 1.5 17 281-297 172-188 (292)
35 PF13582 Reprolysin_3: Metallo 67.7 2.5 5.4E-05 34.8 1.0 11 281-291 109-119 (124)
36 cd00203 ZnMc Zinc-dependent me 65.1 3 6.4E-05 36.0 1.0 11 281-291 98-108 (167)
37 PF13688 Reprolysin_5: Metallo 60.2 4.1 8.9E-05 36.4 1.0 11 281-291 144-154 (196)
38 PF02128 Peptidase_M36: Fungal 59.1 2.5 5.4E-05 42.5 -0.7 17 281-297 187-203 (378)
39 cd04280 ZnMc_astacin_like Zinc 57.4 4.9 0.00011 36.0 1.0 11 281-291 76-86 (180)
40 PF13583 Reprolysin_4: Metallo 56.9 5 0.00011 36.8 1.0 11 281-291 139-149 (206)
41 PF05572 Peptidase_M43: Pregna 56.4 5.2 0.00011 35.1 1.0 12 281-292 71-82 (154)
42 KOG3658 Tumor necrosis factor- 56.3 11 0.00024 40.5 3.4 64 214-291 338-404 (764)
43 PF13574 Reprolysin_2: Metallo 53.0 6.8 0.00015 34.8 1.2 12 280-291 111-123 (173)
44 PF09471 Peptidase_M64: IgA Pe 52.0 6.5 0.00014 37.7 0.9 17 281-297 218-234 (264)
45 PF01400 Astacin: Astacin (Pep 51.6 7.4 0.00016 35.2 1.2 11 281-291 81-91 (191)
46 PF13485 Peptidase_MA_2: Pepti 51.0 8.7 0.00019 30.8 1.4 17 281-297 27-43 (128)
47 PF12388 Peptidase_M57: Dual-a 50.8 6.7 0.00015 36.4 0.8 12 281-292 135-146 (211)
48 cd04283 ZnMc_hatching_enzyme Z 49.8 7.9 0.00017 35.1 1.1 11 281-291 79-89 (182)
49 PF04298 Zn_peptidase_2: Putat 49.4 7.5 0.00016 36.4 0.9 15 281-295 91-105 (222)
50 cd04270 ZnMc_TACE_like Zinc-de 48.0 8.1 0.00018 36.4 0.9 11 281-291 169-179 (244)
51 cd04267 ZnMc_ADAM_like Zinc-de 45.4 8.8 0.00019 34.2 0.7 11 281-291 135-145 (192)
52 TIGR03296 M6dom_TIGR03296 M6 f 44.8 6.4 0.00014 38.0 -0.4 12 281-292 167-178 (286)
53 cd04271 ZnMc_ADAM_fungal Zinc- 44.2 8.2 0.00018 36.1 0.2 13 281-293 147-162 (228)
54 cd04281 ZnMc_BMP1_TLD Zinc-dep 43.1 12 0.00025 34.5 1.1 11 281-291 89-99 (200)
55 cd06161 S2P-M50_SpoIVFB SpoIVF 42.8 12 0.00026 34.4 1.1 16 281-296 40-55 (208)
56 cd05709 S2P-M50 Site-2 proteas 42.2 13 0.00027 32.9 1.1 18 281-298 10-27 (180)
57 PF02163 Peptidase_M50: Peptid 39.7 16 0.00036 32.4 1.5 24 281-304 9-32 (192)
58 cd04276 ZnMc_MMP_like_2 Zinc-d 39.5 14 0.0003 33.9 1.0 12 281-292 118-129 (197)
59 PF01435 Peptidase_M48: Peptid 37.7 16 0.00034 33.0 1.0 13 281-293 91-103 (226)
60 cd04269 ZnMc_adamalysin_II_lik 36.8 15 0.00033 32.7 0.8 11 281-291 133-143 (194)
61 cd04272 ZnMc_salivary_gland_MP 35.8 17 0.00038 33.3 1.0 11 281-291 147-157 (220)
62 PF13699 DUF4157: Domain of un 35.1 28 0.00061 27.1 2.0 14 281-294 63-76 (79)
63 cd04282 ZnMc_meprin Zinc-depen 34.9 18 0.0004 34.0 1.0 12 281-292 122-133 (230)
64 cd06164 S2P-M50_SpoIVFB_CBS Sp 34.4 19 0.00042 33.6 1.1 15 281-295 55-69 (227)
65 PF02031 Peptidase_M7: Strepto 34.1 22 0.00049 30.5 1.3 12 281-292 79-90 (132)
66 cd06159 S2P-M50_PDZ_Arch Uncha 33.5 20 0.00043 34.4 1.1 16 281-296 120-135 (263)
67 COG2738 Predicted Zn-dependent 32.0 21 0.00046 32.9 0.9 39 281-323 94-132 (226)
68 cd06163 S2P-M50_PDZ_RseP-like 31.2 28 0.00061 31.4 1.6 22 281-302 11-32 (182)
69 PF10460 Peptidase_M30: Peptid 31.1 24 0.00053 35.5 1.3 55 281-339 141-197 (366)
70 cd04275 ZnMc_pappalysin_like Z 29.6 21 0.00045 33.4 0.4 12 281-292 139-150 (225)
71 COG0501 HtpX Zn-dependent prot 28.9 24 0.00051 33.5 0.7 16 277-292 152-170 (302)
72 PF13402 M60-like: Peptidase M 28.9 27 0.00058 33.4 1.1 14 281-294 221-234 (307)
73 cd04273 ZnMc_ADAMTS_like Zinc- 28.8 14 0.00031 33.5 -0.7 11 281-291 142-152 (207)
74 PF05548 Peptidase_M11: Gameto 28.8 24 0.00051 34.8 0.7 11 281-291 152-162 (314)
75 cd06162 S2P-M50_PDZ_SREBP Ster 28.5 27 0.00059 33.8 1.1 16 281-296 137-152 (277)
76 cd06160 S2P-M50_like_2 Unchara 28.2 28 0.00062 31.4 1.1 17 281-297 43-59 (183)
77 PF01421 Reprolysin: Reprolysi 27.1 29 0.00064 31.1 1.0 11 281-291 133-143 (199)
78 PRK03982 heat shock protein Ht 26.7 29 0.00062 33.4 0.9 12 281-292 127-138 (288)
79 PF01742 Peptidase_M27: Clostr 26.6 37 0.0008 34.4 1.6 17 281-297 216-232 (408)
80 COG5549 Predicted Zn-dependent 23.5 33 0.00071 32.0 0.5 18 274-291 180-199 (236)
81 PRK02870 heat shock protein Ht 23.3 37 0.00079 33.8 0.9 13 281-293 175-187 (336)
82 PRK03001 M48 family peptidase; 23.3 36 0.00078 32.7 0.8 13 281-293 126-138 (283)
83 PF10462 Peptidase_M66: Peptid 23.1 35 0.00076 33.5 0.7 11 281-291 195-205 (305)
84 PF05547 Peptidase_M6: Immune 22.9 19 0.00041 39.0 -1.3 11 281-291 223-233 (645)
85 PHA03420 E4 protein; Provision 22.7 44 0.00095 28.4 1.1 13 178-190 12-24 (137)
86 PF01447 Peptidase_M4: Thermol 22.3 45 0.00097 29.2 1.2 11 281-291 137-147 (150)
87 COG4227 Antirestriction protei 22.2 33 0.00072 33.0 0.4 11 281-291 205-215 (316)
88 KOG2921 Intramembrane metallop 21.9 41 0.00089 34.2 0.9 45 256-300 104-152 (484)
89 PRK10779 zinc metallopeptidase 21.8 47 0.001 34.1 1.4 21 281-301 17-37 (449)
90 PRK04897 heat shock protein Ht 21.4 41 0.00089 32.6 0.8 12 281-292 139-150 (298)
91 PRK01345 heat shock protein Ht 21.3 42 0.00091 33.0 0.9 12 281-292 126-137 (317)
92 KOG3714 Meprin A metalloprotea 20.8 44 0.00096 34.0 1.0 13 281-293 161-173 (411)
93 TIGR00054 RIP metalloprotease 20.5 49 0.0011 33.7 1.2 19 281-299 16-34 (420)
No 1
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=100.00 E-value=6.3e-78 Score=616.05 Aligned_cols=330 Identities=34% Similarity=0.534 Sum_probs=300.5
Q ss_pred HHHHhcccccccccccccccceEE--EeCCCHHHHHHHHHHHHHHHhhhhhcccchhhHHHHHHHhh-------------
Q 018718 9 VVFLLLPPSAVQSRFKSRNTFLCF--SICPSPASAATSMAATGAVVDGLSLEDNPLLQDFEIIGHLK------------- 73 (351)
Q Consensus 9 ~~~~~~p~~~~en~~~~~~~~l~f--~~~~~~~~r~as~~a~~~~~~~~~~~~~~l~~v~~~~~~l~------------- 73 (351)
..|+|.|+-...+.....+.++.+ .+++|+++|+|..+..- .+.++.....+...+|..+.++.
T Consensus 56 ~~n~v~~Le~~~~~L~rv~~~~~~l~~v~~~~elr~a~e~~~p-klse~~~~l~~~~~Lf~~~~al~~~~e~~~ld~e~~ 134 (683)
T COG0339 56 WENTVEPLERAGDRLGRVWSVVSHLNSVHNSPELREAYEEILP-KLSEFSNDLGQNPGLFARYKALWQSPESAGLDPEQK 134 (683)
T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHhcCCCHHHHHHHHHhhH-HHHHHHHHhhcCHHHHHHHHHHhcCcccccCCHHHH
Confidence 468999999999999999999987 99999999999999999 99999999999999999999882
Q ss_pred -------------ccCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccccccccChhhhcCCCHHHH------------
Q 018718 74 -------------SVKNNPDLRSAYEEVR-ELGKLSIDFSENVLDATKRFETFVTDKKEIQGLPATAL------------ 127 (351)
Q Consensus 74 -------------~~~~~~~~R~~~~ei~-~ls~L~~~F~~Nl~~~~~~~~~v~~~~~eL~GlP~~~l------------ 127 (351)
|+..+++.|+++.+|+ |||+|+++|+||+++++.+|.+++.++++|+|||+..+
T Consensus 135 r~~e~~~rdF~~sGa~L~~~~k~r~~~I~~ela~L~~~Fs~nvL~at~~~~~~~~~~~~LaGlp~~~l~aa~~~A~~kg~ 214 (683)
T COG0339 135 RVVENALRDFVLSGAKLPPEKKARLAEINTELAELGAQFSNNVLDATKAWALIVTDEAELAGLPESLLAAAAAAAEAKGL 214 (683)
T ss_pred HHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeecCcHHHhcCCCHHHHHHHHHHHHhcCC
Confidence 6667778899999998 99999999999999999999999999999999999986
Q ss_pred ------------------------------HHHHhhCC-CCCCcHHHHHHHHHHHHHHHhhh------------------
Q 018718 128 ------------------------------AYVTCSFS-GDLDNTSIIDQILKLRLEKAKLL------------------ 158 (351)
Q Consensus 128 ------------------------------a~~~ra~~-~~~~N~~il~eil~LR~e~A~LL------------------ 158 (351)
|+.+|+++ +..+|.+|+.++++||+|+|+||
T Consensus 215 ~~~~itl~~p~~~p~l~~~~~r~lRe~~y~A~~~ra~~~~~~~n~~ii~~~L~LR~E~A~LLGf~~yA~~~L~~kMA~~p 294 (683)
T COG0339 215 EGYLITLDIPSYLPVLTYADNRALREKLYRAYVTRASEGGPNDNRAIIEEILKLRAELAKLLGFSNYAEYSLADKMAKTP 294 (683)
T ss_pred CCeEEeecccchhHHHHhcccHHHHHHHHHHHhhhccccCCcccHHHHHHHHHHHHHHHHHcCcccHHHHhHHHHhhcCH
Confidence 67778876 45679999999999999999999
Q ss_pred --------------------hHHHHHHHHHhc-CCCCCCccccCCHHHHHHHhhhhhcCCCC------------------
Q 018718 159 --------------------DIEDLRSFCKGQ-GALEPREVNHWDINFWSERLCESKYDINE------------------ 199 (351)
Q Consensus 159 --------------------El~~L~~~k~~~-~~~~~~~l~pWD~~Yy~~~~~~~~~~vd~------------------ 199 (351)
|+++|+++++++ + +..+++|||++||++|+|+++|++|+
T Consensus 295 ~~Vl~fL~~l~~ka~~~a~~e~a~L~~~~~~~~~--~~~~l~~WD~~yyaeK~r~~~y~~de~elrpYF~l~~Vl~GlF~ 372 (683)
T COG0339 295 EAVLNFLNDLAEKARPQAEKELAELQAFAAEEEG--GLPELQPWDWAYYAEKQRQEKYAFDEEELRPYFPLNKVLEGLFE 372 (683)
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhhcc--CCcccchhhHHHHHHHHHhhhcCCCHHHhhhcCChhHHHHHHHH
Confidence 899999999874 4 57789999999999999999999998
Q ss_pred -----------------ccCCceeEEEEEcCCCCCccccccCccCCCCCCCCCceeeecccccccccCCCCccccceEEE
Q 018718 200 -----------------VWNDDVRFYRVKDSSGSPIAYFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHM 262 (351)
Q Consensus 200 -----------------vWh~dV~~~~V~d~~g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~l 262 (351)
+||||||+|+|+|.+|..+|.||+|+|+|+| |+|||||..+++|... .+|+ .|+||+++
T Consensus 373 ~~~rLfGI~~~e~~~~~vwHpDVr~~~v~d~~g~~~g~fY~DlyaR~~-KrgGAWM~~~~~~~~~--~~~~-~q~PV~yl 448 (683)
T COG0339 373 VAKRLFGITFVERKDIPVWHPDVRVFEVFDENGELIGLFYLDLYARDG-KRGGAWMDDFVSQRRL--DDGG-GQKPVIYL 448 (683)
T ss_pred HHHHHcCeEEEECCCCCccCCCceEEEEEcCCCCEEEEEEeecccCCC-CccchHHHHhhhcccc--cCCC-cccceEEE
Confidence 9999999999999999999999999999999 7999999999987653 3454 58999999
Q ss_pred eeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh----hhhhc-----cc
Q 018718 263 VCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI----GVTRT-----QV 331 (351)
Q Consensus 263 v~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~----~~l~~-----~~ 331 (351)
||||++|.+|+|+||+|+ +||||||||+||+||++++|+.+|||+ |||||||+||||||+| ++|.. +|
T Consensus 449 vCNF~~p~~gkPaLls~dEV~TLFHEfGHgLH~mlt~v~~~~vsGt~-v~wDfVElPSQ~mE~w~~~p~vL~~~a~Hy~T 527 (683)
T COG0339 449 VCNFTKPVGGKPALLSHDEVTTLFHEFGHGLHHLLTRVKYPGVSGTN-VPWDFVELPSQFMENWCWEPEVLAKYARHYQT 527 (683)
T ss_pred eccCCCCCCCCCceeeHHHHHHHHHHhhhHHHHHhhcCCccccCCCC-CCcchhhccHHHHHHhhcCHHHHHHHHHhhcc
Confidence 999999999999999999 999999999999999999999999997 9999999999999654 56644 89
Q ss_pred CCCCcccc--chhhccc
Q 018718 332 IPTIPEDG--YFCSFRH 346 (351)
Q Consensus 332 ~~~~~~d~--~~~~~~~ 346 (351)
|+|||.++ .+.+.|+
T Consensus 528 Ge~lP~~ll~k~laakn 544 (683)
T COG0339 528 GEPLPKELLDKMLAAKN 544 (683)
T ss_pred CCcCHHHHHHHHHHHHh
Confidence 99999999 3444443
No 2
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=100.00 E-value=1.4e-73 Score=600.75 Aligned_cols=332 Identities=22% Similarity=0.317 Sum_probs=293.8
Q ss_pred HHHHhcccccccccccccccceEE--EeCCCHHHHHHHHHHHHHHHhhhhhcccchhhHHHHHHHhh-------------
Q 018718 9 VVFLLLPPSAVQSRFKSRNTFLCF--SICPSPASAATSMAATGAVVDGLSLEDNPLLQDFEIIGHLK------------- 73 (351)
Q Consensus 9 ~~~~~~p~~~~en~~~~~~~~l~f--~~~~~~~~r~as~~a~~~~~~~~~~~~~~l~~v~~~~~~l~------------- 73 (351)
.-|+|+|+...++.....+.++.+ .+++|+++|+|+.++.. .+.++..+..+..++|..+.++.
T Consensus 55 ~~n~i~~ld~~~~~l~~~~~~~~~l~~v~~~~~~r~a~~~~~~-~l~~~~~~l~~~~~Ly~~l~~~~~~~~~~~l~~e~~ 133 (681)
T PRK10280 55 FNNTILALEQSGELLTRVTSVFFAMTAAHTNDELQRLDEQFSA-ELAELANDIYLNGELFARVDAVWQQRESLGLDSESI 133 (681)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHH-HHHHHHHHhhCCHHHHHHHHHHHcCccccCCCHHHH
Confidence 368999999999999988888887 99999999999999999 99999999999999999999882
Q ss_pred -------------ccCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccccccccChhhhcCCCHHHH------------
Q 018718 74 -------------SVKNNPDLRSAYEEVR-ELGKLSIDFSENVLDATKRFETFVTDKKEIQGLPATAL------------ 127 (351)
Q Consensus 74 -------------~~~~~~~~R~~~~ei~-~ls~L~~~F~~Nl~~~~~~~~~v~~~~~eL~GlP~~~l------------ 127 (351)
|+..+++.|+++++|+ +|++|+++|++|++++++.|.+++.+++||+|||++++
T Consensus 134 r~l~~~l~dF~~sG~~L~~~~r~r~~~l~~~l~~L~~~F~~n~~~~~~~~~~~~~~~~eL~Glp~~~~~~~~~~a~~~g~ 213 (681)
T PRK10280 134 RLVEVIHQRFVLAGAKLAQADKAKLKVLNTEAATLTSQFNQRLLAANKSGGLVVNDIHQLAGLSEQEIALAAEAAREKGL 213 (681)
T ss_pred HHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhcCeeecCCHHHHcCCCHHHHHHHHHHHHhcCC
Confidence 4455556788888887 99999999999999999999888889999999999877
Q ss_pred -------------------------------HHHHhhCC-CCCCcHHHHHHHHHHHHHHHhhh-----------------
Q 018718 128 -------------------------------AYVTCSFS-GDLDNTSIIDQILKLRLEKAKLL----------------- 158 (351)
Q Consensus 128 -------------------------------a~~~ra~~-~~~~N~~il~eil~LR~e~A~LL----------------- 158 (351)
|+.+|++. +..+|.+++++|+++|+|+|+||
T Consensus 214 ~g~~~itl~~~~~~p~l~~~~dr~~Re~~~~A~~~r~~~~~~~dn~~il~~ll~lR~e~A~lLGf~~yA~~~l~~kMa~s 293 (681)
T PRK10280 214 DNRWLIPLLNTTQQPALAELRDRQTRENLFAAGWTRAEKGDANDTRAIIQRLVEIRAQQAKLLGFPHYAAWKIADQMAKT 293 (681)
T ss_pred CCcEEEeCcCCcHhHHHhcCCCHHHHHHHHHHHHhhhccCCchhHHHHHHHHHHHHHHHHHHcCCCcHHHHHHHHHHCCC
Confidence 33333432 34689999999999999999999
Q ss_pred ---------------------hHHHHHHHHHhcCCCCCCccccCCHHHHHHHhhhhhcCCCC------------------
Q 018718 159 ---------------------DIEDLRSFCKGQGALEPREVNHWDINFWSERLCESKYDINE------------------ 199 (351)
Q Consensus 159 ---------------------El~~L~~~k~~~~~~~~~~l~pWD~~Yy~~~~~~~~~~vd~------------------ 199 (351)
|+++|+++|++.+ +...|+|||++||++|+|+++|++|+
T Consensus 294 pe~V~~FL~~L~~~~~~~a~~E~~~L~~~~~~~~--g~~~l~pWD~~yy~ek~r~~~~~~d~~~l~~YFpl~~Vl~~Glf 371 (681)
T PRK10280 294 PEAALNFMREIVPAARQRASDELASIQAVIDKQQ--GGFSAQAWDWAFYAEQVRREKYALDEAQLKPYFELNTVLNEGVF 371 (681)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCCCCChhHHHHHHHHHHHHhcCCCHHHcCCcCcHHHHHHHhHH
Confidence 8889999987654 34579999999999999999999987
Q ss_pred ------------------ccCCceeEEEEEcCCCCCccccccCccCCCCCCCCCceeeecccccccccCCCCccccceEE
Q 018718 200 ------------------VWNDDVRFYRVKDSSGSPIAYFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVH 261 (351)
Q Consensus 200 ------------------vWh~dV~~~~V~d~~g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~ 261 (351)
+|||||++|+|+|.+|.+||+||+|+|+|+| |++||||+.++.++. .+ +|+|+++
T Consensus 372 ~l~~~LfGi~f~~~~~~~vWh~dV~~~~V~d~~g~~lG~fY~Dl~~R~g-K~~gawm~~~~~~~~---~~---~~~Pv~~ 444 (681)
T PRK10280 372 WTANQLFGIKFVERFDIPVYHPDVRVWEIFDHNGVGLALFYGDFFARDS-KSGGAWMGNFVEQST---LN---ETRPVIY 444 (681)
T ss_pred HHHHHHcCeEEEECCCCCCCCCCeeEEEEEcCCCCEEEEEEecCCCCCC-CCCCCCCcccccccc---cC---CCCCeEE
Confidence 8999999999999888999999999999999 688999988876542 12 3679999
Q ss_pred EeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh----hhhhc-----c
Q 018718 262 MVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI----GVTRT-----Q 330 (351)
Q Consensus 262 lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~----~~l~~-----~ 330 (351)
|||||++|++++|+||+|+ +|||||||||||+|||+++|+++|||+ |+|||||+||||||+| +||+. .
T Consensus 445 lvcNf~~p~~~~p~LL~~~eV~TlFHEfGHalH~lls~~~y~~~sGt~-v~~DfVE~PSq~mE~w~~~~~vL~~~a~Hy~ 523 (681)
T PRK10280 445 NVCNYQKPAAGQPALLLWDDVITLFHEFGHTLHGLFARQRYATLSGTN-TPRDFVEFPSQINEHWASHPQVFARYARHYQ 523 (681)
T ss_pred EECCCCCCCCCCCCccCHHHHHHHHHHHHHHHHHHHhCCCccccCCCC-CCcchhcCcHHHHHHHhcCHHHHHHHhhccC
Confidence 9999999999999999999 999999999999999999999999986 7999999999999554 67765 7
Q ss_pred cCCCCcccc--chhhcccccccC
Q 018718 331 VIPTIPEDG--YFCSFRHKFASL 351 (351)
Q Consensus 331 ~~~~~~~d~--~~~~~~~~~~~~ 351 (351)
||+|||+|+ .++..|+.++|+
T Consensus 524 TgepiP~~l~~~l~~ar~~~~g~ 546 (681)
T PRK10280 524 SGEAMPDELQEKMRNASLFNKGY 546 (681)
T ss_pred CCCCCCHHHHHHHHHhhCcchHH
Confidence 999999999 789999988764
No 3
>KOG2089 consensus Metalloendopeptidase family - saccharolysin & thimet oligopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=4.7e-72 Score=566.46 Aligned_cols=338 Identities=37% Similarity=0.588 Sum_probs=300.4
Q ss_pred HHHHhcccccccccccccccceEE--EeCCCHHHHHHHHHHHHHHHhhhhhcccchhhHHHHHHHhh-------------
Q 018718 9 VVFLLLPPSAVQSRFKSRNTFLCF--SICPSPASAATSMAATGAVVDGLSLEDNPLLQDFEIIGHLK------------- 73 (351)
Q Consensus 9 ~~~~~~p~~~~en~~~~~~~~l~f--~~~~~~~~r~as~~a~~~~~~~~~~~~~~l~~v~~~~~~l~------------- 73 (351)
..|+++|++.+|+++...++.|.| +|++++++|+||.++.+ -++++.+.++|+.++|..+.++.
T Consensus 79 ~en~vepla~ie~el~~~~~~L~f~~~vs~~~~~R~as~~~~~-~~~~~~~r~~~r~di~~~~~~i~e~~~~dslspe~~ 157 (718)
T KOG2089|consen 79 YENVVEPLAKIEVELTVVIGMLVFPQHVSPDKELRKASTEADK-KLDEFDLRLSMRKDIYNRFQAIYEKPAKDSLSPEAQ 157 (718)
T ss_pred hHHHHhHHHHHHHHHHHHHhhhhhhhhcCCchhhhhhhhcchH-HHHHHHHHhhhhhhHHHHHHHHHHhcccccCCHHHH
Confidence 579999999999999999999999 88899999999999999 99999999999999998888773
Q ss_pred -------------ccCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccccccccChhhhcCCCHHHH------------
Q 018718 74 -------------SVKNNPDLRSAYEEVR-ELGKLSIDFSENVLDATKRFETFVTDKKEIQGLPATAL------------ 127 (351)
Q Consensus 74 -------------~~~~~~~~R~~~~ei~-~ls~L~~~F~~Nl~~~~~~~~~v~~~~~eL~GlP~~~l------------ 127 (351)
|++.+++.|+..++|+ +|++++.+|++||+++++.|++++||.+||+|+|+++|
T Consensus 158 RylE~~ike~k~nGL~L~~~kr~~ik~ikk~l~~l~~~f~~nvne~~~~~t~litd~~el~glPps~L~~la~~~~~k~s 237 (718)
T KOG2089|consen 158 RYLEKLIKEGKLNGLHLDDDKREEIKEIKKELSELSIKFSKNVNEDTKKFTFLITDKKELEGLPPSFLQSLAKDEDDKGS 237 (718)
T ss_pred HHHHHHHHHHHhcCCCCChhHHHHHHHHHHHHHHHHHHHHhhhhhcccccceeeecHHHhccCCHHHHHHHhhcccCCCC
Confidence 5555666788888887 99999999999999999999999999999999999987
Q ss_pred ------------------------------HHHHhhCCCCCCcHHHHHHHHHHHHHHHhhh-------------------
Q 018718 128 ------------------------------AYVTCSFSGDLDNTSIIDQILKLRLEKAKLL------------------- 158 (351)
Q Consensus 128 ------------------------------a~~~ra~~~~~~N~~il~eil~LR~e~A~LL------------------- 158 (351)
||..||..+..+|..+|++|++||.++|+||
T Consensus 238 gpwkiTl~~p~~~pvmk~c~~r~tR~~v~~A~~~ra~~~~~eNs~ii~~l~~lR~~lAklLGY~t~Ad~~L~~kMa~ss~ 317 (718)
T KOG2089|consen 238 GPWKITLKYPHYFPVMKHCKIRQTRKTVWRAYNSRASEGDLENSAIIEQLLKLRLELAKLLGYSTYADYSLAMKMAKSSE 317 (718)
T ss_pred CceEEEecccchhHHHHhCCcHHHHHHHHHHHhccCCcccccccHHHHHHHHHHHHHHHHhCchhHHHHHHHHHhhcCcH
Confidence 5566666556789999999999999999999
Q ss_pred -----------------------hHHHHHHHHHhcCCCCCCccccCCHHHHHHHhhhhhcCCCC----------------
Q 018718 159 -----------------------DIEDLRSFCKGQGALEPREVNHWDINFWSERLCESKYDINE---------------- 199 (351)
Q Consensus 159 -----------------------El~~L~~~k~~~~~~~~~~l~pWD~~Yy~~~~~~~~~~vd~---------------- 199 (351)
+++.++.++++.|.+.++.++.||+.||.+++.+..|+||+
T Consensus 318 tv~~fl~dL~~kL~pl~~~e~~v~~elk~~e~k~~g~~~~~~~~~wD~~yy~~~~~e~~f~vd~~~LreyFPl~~v~~Gl 397 (718)
T KOG2089|consen 318 TVVEFLDDLSQKLRPLGIDERSVLLELKKGEAKDRGAPFDGKLTAWDLRYYMKRVEESKFDVDQEDLREYFPLPVVLSGL 397 (718)
T ss_pred HHHHHHHHHHHHhhhcchhHHHHHHHHHHHHHHhcCCCccccchhhHHHHHHHHHHHHhcCCCHHHHHhhCCcHHHHHHH
Confidence 34445556666777788899999999999999999999997
Q ss_pred -------------------ccCCceeEEEEEcC-CCCCccccccCccCCCCCCCCCceeeecccccccccCCCCccccce
Q 018718 200 -------------------VWNDDVRFYRVKDS-SGSPIAYFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPV 259 (351)
Q Consensus 200 -------------------vWh~dV~~~~V~d~-~g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~ 259 (351)
|||+|||+|.|.|. +|+++|+||+|+|+|+||+++.+.+.. ++.+...+|+ +++||
T Consensus 398 ~~i~q~LFglkf~e~~da~vWh~dVr~y~v~D~~Sg~~vG~fY~D~y~RegK~gh~~~f~l---~~~~~~~~ss-~~~PV 473 (718)
T KOG2089|consen 398 FGIYQTLFGLKFEEATDAEVWHADVRVYTVKDSASGNPVGYFYLDPYPREGKYGHAAVFGL---QPGCLQKDSS-RRIPV 473 (718)
T ss_pred HHHHHHHhCceeeecCCchhcccceeEEeccCCCCCceeeEEEeccCCCccccchhhhhcc---chhhhccCCc-cccch
Confidence 99999999999997 599999999999999997776554333 3335556676 79999
Q ss_pred EEEeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh----hhhhc----
Q 018718 260 VHMVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI----GVTRT---- 329 (351)
Q Consensus 260 ~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~----~~l~~---- 329 (351)
++|||||++|.+++|+||.|+ +|+||||||+||+||++++|+++||+++|+|||||+||||||+| ++|+.
T Consensus 474 aalv~nfS~p~~~kpsll~~~ev~t~FheFGh~~q~ll~Qa~~~~fsG~~~vewDave~psq~Lenwv~~~d~L~~lS~H 553 (718)
T KOG2089|consen 474 AALVCNFSKPQSDKPSLLGHDEVETLFHEFGHVLQHLLTQADFARFSGPRNVEWDAVEVPSQFLENWVWDPDTLRSLSKH 553 (718)
T ss_pred HHHHHhcCCcccCCCCccchHHHHHHHHHHhHHHHHHHhcCccccccCcccCCcchhhchHHHHHHhccCchHhhhhccc
Confidence 999999999999999999999 99999999999999999999999999999999999999999554 56644
Q ss_pred -ccCCCCcccc--chhhcccccccC
Q 018718 330 -QVIPTIPEDG--YFCSFRHKFASL 351 (351)
Q Consensus 330 -~~~~~~~~d~--~~~~~~~~~~~~ 351 (351)
.||+||||+. .+...|+.++||
T Consensus 554 y~tge~l~eEl~~kl~~~r~~~~gl 578 (718)
T KOG2089|consen 554 YKTGEPLPEELLKKLILTRTVNAGL 578 (718)
T ss_pred ccCCCccHHHHHHHHHHHHHHHHHH
Confidence 7999999999 788999988875
No 4
>PRK10911 oligopeptidase A; Provisional
Probab=100.00 E-value=1.5e-69 Score=571.28 Aligned_cols=335 Identities=30% Similarity=0.489 Sum_probs=294.9
Q ss_pred HHHHhcccccccccccccccceEE--EeCCCHHHHHHHHHHHHHHHhhhhhcccchhhHHHHHHHhh-------------
Q 018718 9 VVFLLLPPSAVQSRFKSRNTFLCF--SICPSPASAATSMAATGAVVDGLSLEDNPLLQDFEIIGHLK------------- 73 (351)
Q Consensus 9 ~~~~~~p~~~~en~~~~~~~~l~f--~~~~~~~~r~as~~a~~~~~~~~~~~~~~l~~v~~~~~~l~------------- 73 (351)
.-|+|.|+...++++.....++.+ +|++|+++|+|+.++.. .+.++.....+..++|..+.++.
T Consensus 49 ~~n~i~~l~~~~~~l~~~~~~~~~l~~v~~~~~~r~a~~~~~~-~l~~~~~~~~~~~~Ly~~~~~~~~~~~~~~l~~e~~ 127 (680)
T PRK10911 49 WENLCQPLAEVDDVLGRIFSPVSHLNSVKNSPELREAYEQTLP-LLSEYSTWVGQHEGLYQAYRDLRDGDHYATLNTAQK 127 (680)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHH-HHHHHHHHHhCCHHHHHHHHHHhccccccCCCHHHH
Confidence 368999999999999988888886 99999999999999999 99999999999999999988772
Q ss_pred -------------ccCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccccccccChhhhcCCCHHHH------------
Q 018718 74 -------------SVKNNPDLRSAYEEVR-ELGKLSIDFSENVLDATKRFETFVTDKKEIQGLPATAL------------ 127 (351)
Q Consensus 74 -------------~~~~~~~~R~~~~ei~-~ls~L~~~F~~Nl~~~~~~~~~v~~~~~eL~GlP~~~l------------ 127 (351)
|++.+++.|+++++|+ +|++|+++|++|++++++.|..++++++||+|||++++
T Consensus 128 r~l~~~~~~F~~sG~~L~~~~r~~~~~i~~~l~~l~~~F~~n~~~~~~~~~~~~~~~~eL~Glp~~~~~~~~~~a~~~~~ 207 (680)
T PRK10911 128 KAVDNALRDFELSGIGLPKEKQQRYGEIAARLSELGNQYSNNVLDATMGWTKLITDEAELAGMPESALAAAKAQAEAKEQ 207 (680)
T ss_pred HHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccceeecCCHHHHCCCCHHHHHHHHHHHHhcCC
Confidence 4444556777788777 99999999999999999999999999999999999876
Q ss_pred ------------------------------HHHHhhCC-----CCCCcHHHHHHHHHHHHHHHhhh--------------
Q 018718 128 ------------------------------AYVTCSFS-----GDLDNTSIIDQILKLRLEKAKLL-------------- 158 (351)
Q Consensus 128 ------------------------------a~~~ra~~-----~~~~N~~il~eil~LR~e~A~LL-------------- 158 (351)
|+.+|++. +..+|.+++.+|+++|+|+|+||
T Consensus 208 ~g~~~tl~~p~~~p~L~~~~drelRk~~y~A~~~r~~~~~~~~~~~dN~~il~eil~lR~e~AkLLGy~syAd~~L~~km 287 (680)
T PRK10911 208 EGYLLTLDIPSYLPVMTYCDNQALREEMYRAYSTRASDQGPNAGKWDNSEVMEEILALRHELAQLLGFENYADKSLATKM 287 (680)
T ss_pred CCeEEEeecCcHHHHHhhCCCHHHHHHHHHHHHhhhhhccCcccccchHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHh
Confidence 33334432 23589999999999999999999
Q ss_pred ------------------------hHHHHHHHHHhcCCCCCCccccCCHHHHHHHhhhhhcCCCC---------------
Q 018718 159 ------------------------DIEDLRSFCKGQGALEPREVNHWDINFWSERLCESKYDINE--------------- 199 (351)
Q Consensus 159 ------------------------El~~L~~~k~~~~~~~~~~l~pWD~~Yy~~~~~~~~~~vd~--------------- 199 (351)
|+++|+++|++.. +...++|||+.||++++++++|++|+
T Consensus 288 a~spe~V~~fL~~l~~~~~p~a~~El~~L~~~~k~~~--g~~~L~pWD~~yy~~~~~~~~~~~d~~~l~~YFpl~~v~~g 365 (680)
T PRK10911 288 AENPQQVLDFLTDLAKRARPQGEKELAQLRAFAKAEF--GVDELQPWDIAYYSEKQKQHLYSISDEQLRPYFPENKAVNG 365 (680)
T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--CCCCCChhhHHHHhhHHHHhhcCCCHHHhcccCcHHHHHHH
Confidence 7778888887653 45679999999999999999999987
Q ss_pred --------------------ccCCceeEEEEEcCCCCCccccccCccCCCCCCCCCceeeecccccccccCCCCccccce
Q 018718 200 --------------------VWNDDVRFYRVKDSSGSPIAYFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPV 259 (351)
Q Consensus 200 --------------------vWh~dV~~~~V~d~~g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~ 259 (351)
+|||||++|+|+|++|++||+||+|+|+|+| |++||||+.+.++.. ..+|+ +|.|+
T Consensus 366 l~~~~~~Lfgi~~~e~~~~~vwh~dV~~~~v~d~~~~~iG~~y~D~~~R~g-K~~ga~~~~~~~~~~--~~~g~-~~~Pv 441 (680)
T PRK10911 366 LFEVVKRIYGITAKERKDVDVWHPDVRFFELYDENNELRGSFYLDLYAREN-KRGGAWMDDCVGQMR--KADGS-LQKPV 441 (680)
T ss_pred HHHHHHHHcCeeEEecCCCCCCCCcceEEEEEeCCCCeEEEEEeeccCCCC-CCCCccccccccccc--cCCCc-eeCCe
Confidence 9999999999999888999999999999999 688889988765433 34665 68999
Q ss_pred EEEeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh----hhhhc----
Q 018718 260 VHMVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI----GVTRT---- 329 (351)
Q Consensus 260 ~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~----~~l~~---- 329 (351)
++++|||++|++++|+||+|+ +||||||||++|++|++++|++++||++++|||||+|||+||+| +||+.
T Consensus 442 ~~l~~Nf~~p~~~~p~LL~~~~v~tlfHEfGHalH~~ls~~~~~~~sGt~~~~~D~vE~pS~~~E~~~~~~~vL~~~a~H 521 (680)
T PRK10911 442 AYLTCNFNRPVNGKPALFTHDEVITLFHEFGHGLHHMLTRIETAGVSGISGVPWDAVELPSQFMENWCWEPEALAFISGH 521 (680)
T ss_pred EEEECCCCCCCCCCCcccCHHHHHHHHHHHhHHHHHHHhCCCcCcCCCcCCCCchHhhccHHHHHHHhcCHHHHHHHHHH
Confidence 999999999999999999999 99999999999999999999999999899999999999999554 67765
Q ss_pred -ccCCCCcccc--chhhccccccc
Q 018718 330 -QVIPTIPEDG--YFCSFRHKFAS 350 (351)
Q Consensus 330 -~~~~~~~~d~--~~~~~~~~~~~ 350 (351)
.|++|||+++ .++..|+.++|
T Consensus 522 ~~tgeplp~~l~~~l~~~~~~~~~ 545 (680)
T PRK10911 522 YETGEPLPKELLDKMLAAKNYQAA 545 (680)
T ss_pred hcCCCCCCHHHHHHHHHhhhhhhH
Confidence 7999999999 67777777765
No 5
>KOG2090 consensus Metalloendopeptidase family - mitochondrial intermediate peptidase [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=5.8e-54 Score=430.48 Aligned_cols=309 Identities=23% Similarity=0.375 Sum_probs=253.5
Q ss_pred EeCCCHHHHHHHHHHHHHHHhhhhhcccchhhHHHHHHHh--------------------------hccCCCHHHHHHHH
Q 018718 33 SICPSPASAATSMAATGAVVDGLSLEDNPLLQDFEIIGHL--------------------------KSVKNNPDLRSAYE 86 (351)
Q Consensus 33 ~~~~~~~~r~as~~a~~~~~~~~~~~~~~l~~v~~~~~~l--------------------------~~~~~~~~~R~~~~ 86 (351)
-.||+++.+.|+.+|..++. .+-...+--..+|..+..+ .|++.+++.|+.+-
T Consensus 106 ~aHPd~~fv~aAe~a~~~~~-e~ve~LNTn~~LY~~Lk~~l~~~~~l~~~d~e~~v~~lll~DFE~sGIhL~~~kr~kfv 184 (704)
T KOG2090|consen 106 QAHPDPEFVEAAEEACRSMF-ELVESLNTNVALYQKLKKVLQDSSRLDDLDPETYVARLLLDDFEKSGIHLDPEKREKFV 184 (704)
T ss_pred HhCCCHHHHHHHHHHHHHHH-HHHHHhccCHHHHHHHHHHhcCcccccccCHHHHHHHHHHHHHHhhcccCCHHHHHHHH
Confidence 67999999999999998443 4444445555555555443 37788888999988
Q ss_pred HHH-HHHHHHHHHHHHHHHhhccccccccChhhhcCCCHHHH-------------------------------HHHHhhC
Q 018718 87 EVR-ELGKLSIDFSENVLDATKRFETFVTDKKEIQGLPATAL-------------------------------AYVTCSF 134 (351)
Q Consensus 87 ei~-~ls~L~~~F~~Nl~~~~~~~~~v~~~~~eL~GlP~~~l-------------------------------a~~~ra~ 134 (351)
+++ +|..|+.+|.+|...-.+. .+.-..+-|-+.-+-.+ .|. ...
T Consensus 185 ~Ls~eI~~lg~~F~~n~~~~P~~--~l~~s~~~l~~~~~~~~~~k~knF~~~t~~~~i~~ll~~~~d~~vRk~vY~-~~~ 261 (704)
T KOG2090|consen 185 QLSSEIFDLGREFQNNTDRFPDN--KLPNSLERLPFSKKNFIESKRKNFDQATDPYIIYGLLSSSEDESVRKMVYN-TFH 261 (704)
T ss_pred HHhHHHHHHHHHHHhccccCCcc--ccchhhhhccCcchhhHHHHhcCCCcCCCCeeeehhhhcCCCHHHHHHHHH-Hhc
Confidence 887 9999999999998743211 12222222222111111 111 122
Q ss_pred CCCCCcHHHHHHHHHHHHHHHhhh--------------------------------------hHHHHHHHHHhcCCCCCC
Q 018718 135 SGDLDNTSIIDQILKLRLEKAKLL--------------------------------------DIEDLRSFCKGQGALEPR 176 (351)
Q Consensus 135 ~~~~~N~~il~eil~LR~e~A~LL--------------------------------------El~~L~~~k~~~~~~~~~ 176 (351)
.....+..+|++++..|+|+|+|. |++.|..+|++++..+..
T Consensus 262 ~~~~~q~~~L~~Li~sRheLa~l~G~~SfA~~al~~~~a~~pk~V~~Fl~~Ls~k~~~~~~kel~~i~~mk~k~~~~~~~ 341 (704)
T KOG2090|consen 262 SPSDIQVKLLEHLISSRHELAKLVGKSSFAHRALEGKLAKNPKTVRSFLEELSEKLSEKTDKELAVIRDMKKKENNNNNA 341 (704)
T ss_pred CCcHHHHHHHHHHHHHHHHHHHHhCcchHHHHHhhccccCChHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhhcCCCCC
Confidence 245789999999999999999999 788889998887654567
Q ss_pred ccccCCHHHHHHHhhhhhcCCCC-------------------------------------ccCCceeEEEEEcCCCCCcc
Q 018718 177 EVNHWDINFWSERLCESKYDINE-------------------------------------VWNDDVRFYRVKDSSGSPIA 219 (351)
Q Consensus 177 ~l~pWD~~Yy~~~~~~~~~~vd~-------------------------------------vWh~dV~~~~V~d~~g~~lG 219 (351)
++.|||..||..++|++.++++. +|||||+...|.+++.+.+|
T Consensus 342 e~~~WD~~YYT~~~r~~~~~~~~~~~~~fFslg~~ieGLs~L~~~LyGirl~~~~l~pGE~WhpdV~KL~vv~E~eg~lG 421 (704)
T KOG2090|consen 342 EIEPWDRPYYTSMYRQSNNSLNPSIYSPFFSLGSCIEGLSTLFQRLYGIRLIPEPLAPGEVWHPDVRKLNVVHEQEGLLG 421 (704)
T ss_pred ccccccchhhhhHhhcccCCCCcceeeccccHHHHHHHHHHHHHHHhCeeeeccCCCCccccCchhhhheeecCCCCcee
Confidence 89999999999999999988876 99999999999999889999
Q ss_pred ccccCccCCCCCCCCCceeeecccccccccCCCCccccceEEEeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhc
Q 018718 220 YFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHMVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTK 297 (351)
Q Consensus 220 ~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~ 297 (351)
|+|+|+|.|+||..|.|++++.++|.. +||+ +|.||++|+|||+.+....|++|++. +|||||||||||+||++
T Consensus 422 ~IY~Dlf~R~gK~~g~aHFTIr~sr~l---~Dg~-yQlPVi~L~cnf~rss~~s~t~L~~~~vetLFHEmGHAMHSmLGr 497 (704)
T KOG2090|consen 422 YIYCDLFERPGKTVGDAHFTIRGSRQL---SDGT-YQLPVIVLVCNFVRSSQSSPTFLSLSEVETLFHEMGHAMHSMLGR 497 (704)
T ss_pred EEEEEeeccCCCCCCCceEEeeccccC---CCCC-eeceeeEEeecccccccCCCcccCHHHHHHHHHHHHHHHHHHhcc
Confidence 999999999998888899999988653 7898 69999999999999999999999999 99999999999999999
Q ss_pred CCccccccCCCccceeeeechhhch----hhhhhhc-----ccCCCCcccc--chhhccccccc
Q 018718 298 QDEGLVAGSRGIEWDAVELSTNNSW----KIGVTRT-----QVIPTIPEDG--YFCSFRHKFAS 350 (351)
Q Consensus 298 ~~y~~~sGt~~v~~DfvE~PSq~~e----~~~~l~~-----~~~~~~~~d~--~~~~~~~~~~~ 350 (351)
|+||+++|||| |.||+|+||.+|| +|+|++. .+++|||+|| .+|.+++.||+
T Consensus 498 T~YQhvtGTRc-~tDfaEiPSiLMEyFa~D~rVl~~~aRhy~t~e~l~~~mv~~l~~s~n~~Aa 560 (704)
T KOG2090|consen 498 THYQHVTGTRC-PTDFAEIPSILMEYFANDYRVLRFFARHYSTGEPLPEDMVNRLCESRNSFAA 560 (704)
T ss_pred chhccccCccc-chhHhhhhHHHHHHHhcchHHHHHHHHHcCCCCCCCHHHHHHHHHHhccchh
Confidence 99999999986 6999999999984 6788865 7999999999 79999999986
No 6
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=100.00 E-value=3.2e-52 Score=422.97 Aligned_cols=209 Identities=28% Similarity=0.464 Sum_probs=179.4
Q ss_pred CCCcHHHHHHHHHHHHHHHhhh--------------------------------------hHHHHHHHHHhcCCCCCCcc
Q 018718 137 DLDNTSIIDQILKLRLEKAKLL--------------------------------------DIEDLRSFCKGQGALEPREV 178 (351)
Q Consensus 137 ~~~N~~il~eil~LR~e~A~LL--------------------------------------El~~L~~~k~~~~~~~~~~l 178 (351)
..+|.+++.+|+++|+++|+|| |+++|++++++.+......|
T Consensus 23 ~~~n~~il~~l~~lR~e~A~llGy~sya~~~l~~~ma~spe~v~~fL~~l~~~~~p~~~~e~~~L~~~~~~~~~~~~~~L 102 (458)
T cd06457 23 SAEQLELLEELLSSRAELAQLLGFESYAHRALRGKMAKNPENVMEFLTELSEKLRPRAEEELAVLRDMKRKVLGKPLSSL 102 (458)
T ss_pred hhhHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCCC
Confidence 4689999999999999999999 77788888876531124579
Q ss_pred ccCCHHHHHHHhhhhhcCCCC-------------------------------------ccCCceeEEEEEcCCCCCcccc
Q 018718 179 NHWDINFWSERLCESKYDINE-------------------------------------VWNDDVRFYRVKDSSGSPIAYF 221 (351)
Q Consensus 179 ~pWD~~Yy~~~~~~~~~~vd~-------------------------------------vWh~dV~~~~V~d~~g~~lG~~ 221 (351)
.|||+.||.+++|+++|++|+ +|||||++|+|+|++|++||+|
T Consensus 103 ~pwD~~yy~~~~~~~~~~~d~~~l~~YFpl~~vl~gl~~~~~~lfgi~~~~~~~~~~~~Wh~dV~~~~v~d~~~~~lG~~ 182 (458)
T cd06457 103 APWDRDYYTGQYRQSRFDSEPSNLSPYFSLGTVMEGLSRLFSRLYGIRLVPVPLAPGEVWHPDVRKLDVVHEDEGLLGVI 182 (458)
T ss_pred CHHHHHHHHHHHHHhhcCCChHHhcccCcHHHHHHHHHHHHHHHhCeEEEecCCCCCCCcCccceEEEEEeCCCCEEEEE
Confidence 999999999999999999886 8999999999999888899999
Q ss_pred ccCccCCCCCCCCCceeeecccccccccCC----------CCccccceEEEeeccCCCCCCCCCCCCCc--hhhHhhhhH
Q 018718 222 YFDPYSCPSEKKGSARICEVFSQSRVLTLD----------GATTRLPVVHMVCNQTPPVGDKPSLMTFS--ETVFHEFGH 289 (351)
Q Consensus 222 YlDl~~R~gKk~~ga~~~~~~~~~~~~~~d----------G~~~~~P~~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGH 289 (351)
|||+|+|+| |++||||..++++.. ..+ |+ +|+|+++|+|||++|++++|+||+|+ .||||||||
T Consensus 183 YlDl~~R~~-K~~~a~~~~~~~~~~--~~~~~~~~~~~~~g~-~q~Pv~~lvcnf~~p~~~~p~lL~~~~v~TLfHEfGH 258 (458)
T cd06457 183 YCDLFSRPG-KPPGAAHFTIRCSRR--LDDDDVAERGGRGGT-YQLPVVALMCNFPPPSPSGPTLLSPHEVETLFHEMGH 258 (458)
T ss_pred EeecCCCCC-CCCCcceeccccccc--cCcccccccccCCCc-eeCCeEEEECCCCCCCCCCCCCcCHHHHHHHHHHHhH
Confidence 999999999 577888877665322 123 55 68999999999999999999999999 999999999
Q ss_pred HHhhhhhcCCccccccCCCccceeeeechhhchhh----hhhhc-----ccCCCCcccc--chhhccccccc
Q 018718 290 ALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI----GVTRT-----QVIPTIPEDG--YFCSFRHKFAS 350 (351)
Q Consensus 290 alH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~----~~l~~-----~~~~~~~~d~--~~~~~~~~~~~ 350 (351)
+||++|++++|+.++||+ +||||||+||||||.| .+|+. .+++|||+++ .+...|+.++|
T Consensus 259 alH~~ls~~~~~~~sgt~-~~~d~vE~pS~~~E~~~~~~~~L~~~a~h~~t~e~ip~~l~~~l~~~~~~~~~ 329 (458)
T cd06457 259 AMHSMLGRTEYQHVSGTR-CATDFVEVPSILMEYFASDPRVLKLFARHYSTGEPLPEEMLARLLASKNSFAA 329 (458)
T ss_pred HHHHHHcCCCccccCCCC-CCcchhhcCHHHHHHHHhhHHHHHHHhcccCCCCcCcHHHHHHHHHhhhhhHH
Confidence 999999999999999986 7999999999999654 56543 6899999999 56666666543
No 7
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=100.00 E-value=1.3e-49 Score=399.89 Aligned_cols=203 Identities=40% Similarity=0.710 Sum_probs=179.2
Q ss_pred CcHHHHHHHHHHHHHHHhhh--------------------------------------hHHHHHHHHHhcCCCCCCcccc
Q 018718 139 DNTSIIDQILKLRLEKAKLL--------------------------------------DIEDLRSFCKGQGALEPREVNH 180 (351)
Q Consensus 139 ~N~~il~eil~LR~e~A~LL--------------------------------------El~~L~~~k~~~~~~~~~~l~p 180 (351)
+|.+++++|+++|+|+|++| |++.|+++|++.+ +...|+|
T Consensus 1 ~N~~~l~~l~~lR~e~A~llG~~s~a~~~l~~~ma~sp~~v~~Fl~~l~~~~~p~~~~e~~~L~~~~~~~~--~~~~l~~ 78 (422)
T cd06456 1 DNRPLIEEILALRAEKAKLLGFENYAEYSLADKMAKSPEAVLEFLEDLAPKAKPQAKKELAELQAFAKEEG--GEDELEP 78 (422)
T ss_pred ChHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CCCCCCH
Confidence 69999999999999999999 8888999998876 4457999
Q ss_pred CCHHHHHHHhhhhhcCCCC-----------------------------------ccCCceeEEEEEcCCCCCccccccCc
Q 018718 181 WDINFWSERLCESKYDINE-----------------------------------VWNDDVRFYRVKDSSGSPIAYFYFDP 225 (351)
Q Consensus 181 WD~~Yy~~~~~~~~~~vd~-----------------------------------vWh~dV~~~~V~d~~g~~lG~~YlDl 225 (351)
||+.||.+++|+++|++|+ +|||||++|+|+|.+|++||+||+|+
T Consensus 79 wD~~yy~~~~~~~~~~~d~~~l~~YFpl~~v~~gl~~~~~~lfgi~~~~~~~~~~Wh~dV~~~~v~d~~~~~lG~~ylDl 158 (422)
T cd06456 79 WDWAYYSEKLRKEKYDLDEEELRPYFPLEKVLDGLFELAERLYGITFKERTDLPVWHPDVRVYEVFDKDGSHIGLFYLDL 158 (422)
T ss_pred HHHHHHHHHHHHHhcCCCHHHHhccCCHHHHHHHHHHHHHHHcCeeEEECCCCCCCCCCceEEEEEeCCCCeEEEEEEec
Confidence 9999999999999988875 99999999999998889999999999
Q ss_pred cCCCCCCCCCceeeecccccccccCCCCccccceEEEeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCcccc
Q 018718 226 YSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHMVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLV 303 (351)
Q Consensus 226 ~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~ 303 (351)
|+|+| |++||||+..+++.. +|. +|.|+++++|||++|++++|+||+++ .|||||||||+|++|++++|+.+
T Consensus 159 ~~R~~-K~~ga~~~~~~~~~~----~~~-~~~P~~~l~~nf~~~~~~~p~lL~~~~v~tLfHEfGHalH~~ls~~~~~~l 232 (422)
T cd06456 159 YAREG-KRGGAWMNNLRSQSK----NGL-GQKPVAYLVCNFTKPAGGKPALLTHDEVTTLFHEFGHALHHLLTDVEYPSL 232 (422)
T ss_pred cCCCC-CCCCceeeccccccc----CCC-CCCCEEEEECCCCCCCCCCCCccCHHHHHHHHHHHHHHHHHHHhcCCcccc
Confidence 99999 577889988765321 454 68999999999999999999999999 99999999999999999999999
Q ss_pred ccCCCccceeeeechhhchhh----hhhhc-----ccCCCCcccc--chhhccccccc
Q 018718 304 AGSRGIEWDAVELSTNNSWKI----GVTRT-----QVIPTIPEDG--YFCSFRHKFAS 350 (351)
Q Consensus 304 sGt~~v~~DfvE~PSq~~e~~----~~l~~-----~~~~~~~~d~--~~~~~~~~~~~ 350 (351)
+|++ +++||||+|||+||.+ .+|+. .+++|||+++ .+...|+.++|
T Consensus 233 ~~~~-~~~d~~E~pS~~~E~~~~d~~vL~~~s~h~~t~~~lp~~l~~~~~~~~~~~~~ 289 (422)
T cd06456 233 GGTN-VEWDFVELPSQFMENWAWEPEVLKLFAKHYETGEPLPDELIDKLLAARNFNSG 289 (422)
T ss_pred CCCc-CchhHhhccHHHHHHHhcCHHHHHHHHhhcCCCCCCCHHHHHHHHHHhhHhHH
Confidence 9985 7899999999999654 56655 6999999999 56676666654
No 8
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=100.00 E-value=1.3e-46 Score=383.27 Aligned_cols=243 Identities=31% Similarity=0.534 Sum_probs=194.4
Q ss_pred HHHHHHHHHHHHHHHhhccccccccChhhhcCCCHHHH------HHHHhhCCCCCCcHHHHHHHHHHHHHHHhhh-----
Q 018718 90 ELGKLSIDFSENVLDATKRFETFVTDKKEIQGLPATAL------AYVTCSFSGDLDNTSIIDQILKLRLEKAKLL----- 158 (351)
Q Consensus 90 ~ls~L~~~F~~Nl~~~~~~~~~v~~~~~eL~GlP~~~l------a~~~ra~~~~~~N~~il~eil~LR~e~A~LL----- 158 (351)
+|.+|+++|++|+.++...+ ..-|.-.|+..+ ++.++ +..+|.+++.+|+++|+++|+++
T Consensus 5 ~~~~~~~~f~~~~~~~~~~~------~~~l~~~~d~~~Rk~~~~a~~~~---~~~~n~~~l~~ll~~R~~~A~l~Gy~s~ 75 (472)
T cd06455 5 ELSELCQEFSKNLNEKYPDY------FPVMKYAKNAETRKAMWVAFQNR---GGPENVPLLEELVALRHELARLLGYKSH 75 (472)
T ss_pred HHHHHHHHHHHhhhhcCccH------HHHHhcCCCHHHHHHHHHHHHhh---cchhHHHHHHHHHHHHHHHHHHcCCCCH
Confidence 67899999999999843221 223444433322 33333 24799999999999999999999
Q ss_pred ---------------------------------hHHHHHHHHHhcCCCCCC--ccccCCHHHHHHHhhhhhcCCCC----
Q 018718 159 ---------------------------------DIEDLRSFCKGQGALEPR--EVNHWDINFWSERLCESKYDINE---- 199 (351)
Q Consensus 159 ---------------------------------El~~L~~~k~~~~~~~~~--~l~pWD~~Yy~~~~~~~~~~vd~---- 199 (351)
|++.|.++|++.. +.+ +++|||+.||.++++++.+++|+
T Consensus 76 a~~~l~~~m~~s~e~v~~fl~~l~~~~~p~~~~e~~~l~~~k~~~~--~~~~~~l~~wD~~y~~~~~~~~~~~~~~~~~~ 153 (472)
T cd06455 76 ADYVLEDRMAKSPETVRDFLEDLSKKLKPLAEKELAELLELKKKEV--PEAGDRIYPWDLAYYMERVEEEKYDVDQEKIR 153 (472)
T ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCCCCCCCHhhHHHHHHHHHHHhcCCCHHHHh
Confidence 5667888887653 222 69999999999999887666553
Q ss_pred -------------------------------ccCCceeEEEEEcCC-CCCccccccCccCCCCCCCCCceeeeccccccc
Q 018718 200 -------------------------------VWNDDVRFYRVKDSS-GSPIAYFYFDPYSCPSEKKGSARICEVFSQSRV 247 (351)
Q Consensus 200 -------------------------------vWh~dV~~~~V~d~~-g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~ 247 (351)
+|||||++|+|+|++ |+++|++|+|+|+|+| |++||||...++..
T Consensus 154 ~yf~~~~~~~~i~~~~~~lfg~~~~~~~~~~~w~~dv~~~~v~d~~~~~~~g~~ylD~~~R~g-K~~Ga~~~~~~~~~-- 230 (472)
T cd06455 154 EYFPLEVVIEGMLDIYQRLFGLRFEEVPDASVWHEDVRLYSVWDADTGEFLGYFYLDLHPREG-KYGHAANFGLQPGF-- 230 (472)
T ss_pred ccCcHHHHHHHHHHHHHHHhCeEEEeCCCCCCCCCcceEEEEEECCCCCEEEEEEeecCCCCC-CCCCccccccccce--
Confidence 899999999999975 7899999999999999 57788987765432
Q ss_pred ccCCCCccccceEEEeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh-
Q 018718 248 LTLDGATTRLPVVHMVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI- 324 (351)
Q Consensus 248 ~~~dG~~~~~P~~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~- 324 (351)
...+|. +|.|+++++|||++|.+++|+||+++ .||||||||++|++|++++|+.++|++ +++||||+|||+||.+
T Consensus 231 ~~~~g~-~~~P~~~i~~Nf~~~~~~~p~ll~~~~V~TLfHEfGHalH~~ls~~~~~~~sg~~-~~~d~aE~pS~~~E~~~ 308 (472)
T cd06455 231 LLPDGS-RQYPVAALVCNFPKPTADKPSLLRHDEVETFFHEFGHVIHHLLGRTKYARFSGTR-VERDFVEAPSQMLENWC 308 (472)
T ss_pred ecCCCC-EeCCEEEEECcCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHhcCCCccccCCCc-CChhhhhcchHHHHHHh
Confidence 235665 68899999999999999999999999 999999999999999999999999985 7899999999999654
Q ss_pred ---hhhhc-----ccCCCCcccc--chhhccccc
Q 018718 325 ---GVTRT-----QVIPTIPEDG--YFCSFRHKF 348 (351)
Q Consensus 325 ---~~l~~-----~~~~~~~~d~--~~~~~~~~~ 348 (351)
++++. .+++|||+++ .+...|+.+
T Consensus 309 ~~~~~l~~l~~h~~t~e~i~~~li~~~~~~~~~~ 342 (472)
T cd06455 309 WEPEVLKRLSKHYKTGEKIPDELIERLIASRHFN 342 (472)
T ss_pred cCHHHHHHHhhccCCCCCCCHHHHHHHHHhhhhh
Confidence 45644 5899999998 445554433
No 9
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=100.00 E-value=2.7e-43 Score=356.95 Aligned_cols=203 Identities=35% Similarity=0.588 Sum_probs=163.8
Q ss_pred CcHHHHHHHHHHHHHHHhhh--------------------------------------hHHHHHHHHHhcCCCCCC-ccc
Q 018718 139 DNTSIIDQILKLRLEKAKLL--------------------------------------DIEDLRSFCKGQGALEPR-EVN 179 (351)
Q Consensus 139 ~N~~il~eil~LR~e~A~LL--------------------------------------El~~L~~~k~~~~~~~~~-~l~ 179 (351)
+|..++.+++++|+++|+++ +++.|++++++.. +.+ +|+
T Consensus 30 ~n~~~l~~l~~~R~~~A~llGy~s~~~~~l~~~m~~~~~~v~~fL~~~~~~~~p~~~~~~~~l~~~~~~~~--g~~~~l~ 107 (458)
T PF01432_consen 30 DNAAILNELLKLRNELAKLLGYPSYAEYSLQDKMAKNPENVLDFLDELVKKLKPLLERELELLKKLKKKRL--GLEKKLR 107 (458)
T ss_dssp HHHHHHHHHHHHHHHHHHHTT-SSHHHHHHTTSTTTSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHT--T-SSSBB
T ss_pred hHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCccccc
Confidence 79999999999999999999 6777777776653 344 899
Q ss_pred cCCHHHHHHHhhhhhcCCCC-------------------------------------ccCCceeEEEEEcC--CCCCccc
Q 018718 180 HWDINFWSERLCESKYDINE-------------------------------------VWNDDVRFYRVKDS--SGSPIAY 220 (351)
Q Consensus 180 pWD~~Yy~~~~~~~~~~vd~-------------------------------------vWh~dV~~~~V~d~--~g~~lG~ 220 (351)
|||++||.+++++..+++|+ +|||||++|+|+|. ++.++|+
T Consensus 108 ~wD~~y~~~~~~~~~~~~~~~~~~~~f~~~~~~~g~~~l~~~lfgi~~~~~~~~~~~~w~~dv~~~~v~d~~~~~~~ig~ 187 (458)
T PF01432_consen 108 PWDVAYYMEQYRQERYDLDEEELSPYFPLEEVLEGLFELAERLFGIRFEEVPDADGEVWHPDVRKFEVWDEIFSGMFIGY 187 (458)
T ss_dssp GGGHHHHHHHHHHHHTSSSHHHHGGG-BHHHHHHHHHHHHHHHHTEEEEECTCGGHHHSSTT-EEEEEEETHTTHCECEE
T ss_pred cchhHHHhhHHHHHHhccchhhcCCcCcHHHHHHHHHHHHHHHhcEEEEecccccccceecceeEEEEeehhhcccchhc
Confidence 99999999999998888775 69999999999997 6889999
Q ss_pred cccCccCCCCCCCCCceeeecccccccccCCCCccccceEEEeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcC
Q 018718 221 FYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHMVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQ 298 (351)
Q Consensus 221 ~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~ 298 (351)
+|+|+|+|+| |++||||+..+.. + .+|+ +|.|+++++|||++|.+++|+||+++ .|||||||||+|++|+++
T Consensus 188 ~ylDl~~R~g-K~~ga~~~~~~~~--~--~~~~-~~~~~~~i~~n~~~~~~~~~~ll~~~~v~tLfHE~GHa~H~~ls~~ 261 (458)
T PF01432_consen 188 IYLDLYPRPG-KRSGAFCFTLRPS--R--SDGE-RQLPVPYIFCNFTGPSAGKPSLLSHDDVETLFHEFGHAMHSLLSRT 261 (458)
T ss_dssp EEEEES--TT-S-SS-EEEEEEC---B--TTST-CECEEEEEEEEE-S-BTTC--B-SHHHHHHHHHHHHHHHHHHHCCC
T ss_pred ccccchhcCC-CCCCceeCCccCc--c--cccc-CCCCceEEEecCCCCCCCCCCccChhhHHHHHHHHhHHHHHHHhcc
Confidence 9999999999 5777888776542 1 4676 79999999999999999999999999 999999999999999999
Q ss_pred CccccccCCCccceeeeechhhchhh----hhhhc-----ccCCCCcccc--chhhccccccc
Q 018718 299 DEGLVAGSRGIEWDAVELSTNNSWKI----GVTRT-----QVIPTIPEDG--YFCSFRHKFAS 350 (351)
Q Consensus 299 ~y~~~sGt~~v~~DfvE~PSq~~e~~----~~l~~-----~~~~~~~~d~--~~~~~~~~~~~ 350 (351)
+++.++|+ .+|+||||+||||||.+ .++.. .++++||+++ .+..+|+.+++
T Consensus 262 ~~~~~sg~-~~~~d~aE~~S~~~E~~~~~~~~l~~~~~~~~~~~~i~~~~l~~~~~~~~~~~~ 323 (458)
T PF01432_consen 262 KYQHLSGT-RVPMDFAEFPSQFMENWLWDPLVLKAFSRHYETGEPIPEELLEDLIASRNFFAA 323 (458)
T ss_dssp SSGGGSTT-SS-CHHCHHHHHHHHHHGGCHHHHHHH-BSTTTHHHHHHHHHHHHHHTTTTTHH
T ss_pred ccccccCC-chhHHHHhcchHHHHHhhhchhhhhhhccChhhhhhHHHHHHHhhhhhhhhhHH
Confidence 99999999 57899999999999664 45543 5889999998 45556665543
No 10
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=99.93 E-value=2.7e-26 Score=226.17 Aligned_cols=171 Identities=20% Similarity=0.270 Sum_probs=122.2
Q ss_pred HHHHHHHHHHHHHHhhh--------------------------------------hHHHHHHHHHhcCCCCCCccccCCH
Q 018718 142 SIIDQILKLRLEKAKLL--------------------------------------DIEDLRSFCKGQGALEPREVNHWDI 183 (351)
Q Consensus 142 ~il~eil~LR~e~A~LL--------------------------------------El~~L~~~k~~~~~~~~~~l~pWD~ 183 (351)
++|++|+.+|+|+|+++ +++.|+++++++. +...+.||..
T Consensus 2 ~~l~~lv~lr~e~A~~lGy~~~~d~~l~~~~~~~~e~v~~f~~~l~~~~~p~~~~~~~~l~~~~~~~~--~~~~~~~~~~ 79 (365)
T cd06258 2 ALLEELVSLRNQLARLLGYENFADYKLALQEAKSPETVEGFFEELKRKLRPLLAKLREEISAAKQKEE--EIYGELPARY 79 (365)
T ss_pred hHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--CCcccccccc
Confidence 68999999999999999 4555555554432 2345778811
Q ss_pred HHHHHHhhhhhcCCCCccCCceeEEEEEcCCCCCccccccCccCCCCCCCCCceeeecccccccccCCCCccccceEEEe
Q 018718 184 NFWSERLCESKYDINEVWNDDVRFYRVKDSSGSPIAYFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHMV 263 (351)
Q Consensus 184 ~Yy~~~~~~~~~~vd~vWh~dV~~~~V~d~~g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~lv 263 (351)
. .+.....+.|.++.+|.- ....+|+|+++|+| |++|+||+...+.. + ++.|.++++
T Consensus 80 ~-~~~~~~~~~f~~~~~~~~-------------~~~~~~lD~~~R~g-K~~~a~~~~~~~~~-------~-~~~~~~~i~ 136 (365)
T cd06258 80 D-VDSALLKEFFDAERPWEG-------------ALPFFYLDLYDRKG-KYPHGFCTGLDPGF-------N-RQDKDVRIL 136 (365)
T ss_pred c-CCHhhccCcCChHHHHHH-------------HHHhHhccCCCCCC-CCCCCeeccccCCC-------C-CCCCeEEEE
Confidence 1 011111112232222221 11222999999999 56677886654311 1 478999999
Q ss_pred eccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh----hhhhc-----ccC
Q 018718 264 CNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI----GVTRT-----QVI 332 (351)
Q Consensus 264 ~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~----~~l~~-----~~~ 332 (351)
|||++|++++|+|++++ .||||||||++|+++++++|+.++|+ ++++||+|+||++||.+ .+|+. .++
T Consensus 137 ~n~~~~~~~~~~ll~~~~v~tl~HE~GHa~h~~l~~~~~~~~~g~-~~~~~~~E~~S~~~E~~~~~~~~L~~~~~~~~~~ 215 (365)
T cd06258 137 ANFTSPAAPDPVLLGHDDINTLFHEFGHAVHFLLIQQRYPFQERT-PTSTDFAEAQSMFLESFATDPEWLERYARHYQGG 215 (365)
T ss_pred ccCCCCCCCCCCcCCHHHHHHHHHHHhHHHHHHHhcCCCCcCCCC-CCCccHHhccHHHHHHHHCCHHHHHHHhhhcCCC
Confidence 99999999999999999 99999999999999999999988997 47899999999999754 45543 344
Q ss_pred CCCcccc
Q 018718 333 PTIPEDG 339 (351)
Q Consensus 333 ~~~~~d~ 339 (351)
|+|++.
T Consensus 216 -~~~~~~ 221 (365)
T cd06258 216 -VVPDEL 221 (365)
T ss_pred -CCcHHH
Confidence 788777
No 11
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=99.87 E-value=2.9e-22 Score=208.09 Aligned_cols=202 Identities=16% Similarity=0.152 Sum_probs=126.0
Q ss_pred HHHHHHHHHHHHHHHhhccccccccChhhhcCCCHH----------HH-----HHHHhhCCCCCCcHHHHHHHHHHHHHH
Q 018718 90 ELGKLSIDFSENVLDATKRFETFVTDKKEIQGLPAT----------AL-----AYVTCSFSGDLDNTSIIDQILKLRLEK 154 (351)
Q Consensus 90 ~ls~L~~~F~~Nl~~~~~~~~~v~~~~~eL~GlP~~----------~l-----a~~~ra~~~~~~N~~il~eil~LR~e~ 154 (351)
+-.++++.|.+.+...+ |.++.+++. +++- +. ++.++......+|..++.+++++|++.
T Consensus 120 ~~~~~~~~y~~l~~~~~-----~~~~Ge~~~-l~~~~~~l~~~dr~~Rk~a~~a~~~~~~~~~~~~~~il~~lv~~R~~~ 193 (549)
T TIGR02289 120 KENILSTKYREIISNIS-----IIFEGEEKT-LSQLKPYLQDPNRSTRKKAWYARSEFFAVVEEELDRIYDELVQIRTQI 193 (549)
T ss_pred HHHHHHHHHHHHHhccE-----EEECCEecc-HHHhhHHhhCCCHHHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHH
Confidence 44567788888777554 455555432 2221 11 233333334567889999999999999
Q ss_pred Hhhh------------------------------------hHHHHHHHHHhcCCCCCCccccCCHHHHH--HHhhhhhcC
Q 018718 155 AKLL------------------------------------DIEDLRSFCKGQGALEPREVNHWDINFWS--ERLCESKYD 196 (351)
Q Consensus 155 A~LL------------------------------------El~~L~~~k~~~~~~~~~~l~pWD~~Yy~--~~~~~~~~~ 196 (351)
|+++ -...+.+++++.. |.+.+.+||..|.. ... ...++
T Consensus 194 Akl~Gy~s~~e~~~~~~~r~~~~~e~v~~~l~~v~~~~~p~~~~~~~~k~k~l--G~~~l~~wD~~~~~~~~~~-~~~~~ 270 (549)
T TIGR02289 194 ALNLGFSNYRDYMYKLKNRTDYSAEDCYKYRESILKYVVPLWTKLRKIKKKRL--GIKTLRPWDESAVFLDGNV-KPFGN 270 (549)
T ss_pred HHHCCCCCHHHHHHHHhcccCCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHhC--CCCcCChhhccCCCCCCCc-CCCCC
Confidence 9999 0112233344332 45679999987532 111 12234
Q ss_pred CCCccCCceeEEEEEcCC-----CCCccccccCccCCCCCCCCCceeeecccccccccCCCCccccceEEEeeccCCCCC
Q 018718 197 INEVWNDDVRFYRVKDSS-----GSPIAYFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHMVCNQTPPVG 271 (351)
Q Consensus 197 vd~vWh~dV~~~~V~d~~-----g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~lv~Nf~~p~~ 271 (351)
.+++..--.+.|...+.. ..+++..|+|+++|+| |++||||+.++. ...|. ++|||++.
T Consensus 271 ~e~~~~~~~~~~~~l~~~~~e~~~~~~~~~~iD~~~r~g-K~~Gayc~~~~~-----------~~~P~--I~~Nf~~t-- 334 (549)
T TIGR02289 271 VDFLLEKAEKMYKELSLEFYEFFNFMREKNLLDLESRKG-KAAGGYCTYLPK-----------YKAPF--IFSNFNGT-- 334 (549)
T ss_pred HHHHHHHHHHHhcCCCcHHHHHHHHHHhcCccccCCCCC-CCCCcccCCCCC-----------CCCcE--EEEeCCCC--
Confidence 333222112333333321 2257788999999999 688889877532 23674 46999871
Q ss_pred CCCCCCCCc-hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh
Q 018718 272 DKPSLMTFS-ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI 324 (351)
Q Consensus 272 ~~P~LL~~~-~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~ 324 (351)
..| .||+||||||+|+++++.. .....+..|+|++|+|||+||.+
T Consensus 335 ------~~dv~TL~HElGHa~H~~~s~~~--~~~~~~~~~~~~aE~aS~~~E~l 380 (549)
T TIGR02289 335 ------SGDIDVLTHEAGHAFHVYESRKD--LLPEYRWPTYEAAELASMSMELL 380 (549)
T ss_pred ------hhHHHHHHHHhhHHHHHHHhcCC--cccccccCcchhhhhhHHHHHHH
Confidence 344 9999999999999999852 22333345799999999999764
No 12
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=99.87 E-value=2.7e-23 Score=207.91 Aligned_cols=164 Identities=17% Similarity=0.136 Sum_probs=114.8
Q ss_pred HHHhhCCCCCCcHHHHHHHHHHHHHHHhhh--------------------------------hHHHHHHHHHhcCCCCCC
Q 018718 129 YVTCSFSGDLDNTSIIDQILKLRLEKAKLL--------------------------------DIEDLRSFCKGQGALEPR 176 (351)
Q Consensus 129 ~~~ra~~~~~~N~~il~eil~LR~e~A~LL--------------------------------El~~L~~~k~~~~~~~~~ 176 (351)
+.++......+|.+++.+++++|.+.|+++ -+..+.+.+++.. +.+
T Consensus 58 ~~~~~~~~~~~~~~~l~~lv~~r~~~A~~~Gy~~~~~~~l~~~~~~~~~~~~l~~v~~~~~p~~~~~~~~~~~~l--g~~ 135 (427)
T cd06459 58 LYKAYEKYENTLAAILNTLVKLRLTLAKLRGYDSYLEAALFNNNIPEDVYDFLIAVVKENVPLLHRYLKLKKKLL--GLD 135 (427)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHhhccCCCCHHHHHHHHHHHHHhcHHHHHHHHHHHHHh--CCC
Confidence 333333234678999999999999999999 1222223333322 356
Q ss_pred ccccCCHHHHHHHhhhhhcCCCC----------ccCCceeEEEEEcCCCCCccccccCccCCCCCCCCCceeeecccccc
Q 018718 177 EVNHWDINFWSERLCESKYDINE----------VWNDDVRFYRVKDSSGSPIAYFYFDPYSCPSEKKGSARICEVFSQSR 246 (351)
Q Consensus 177 ~l~pWD~~Yy~~~~~~~~~~vd~----------vWh~dV~~~~V~d~~g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~ 246 (351)
.+.|||+.|+..+.++..|++++ .|++++. ++. +..++..|+|+++|+| |++||||+...+
T Consensus 136 ~l~~wD~~~~~~~~~~~~~~~~~~~~~v~~~~~~l~~~~~--~~~---~~~~~~~~iD~~~r~g-K~~gaf~~~~~~--- 206 (427)
T cd06459 136 KLRPYDLYAPLVSGNPPKYTYEEAKELVLEALSPLGPEYA--EFA---KRAFEERWIDVEPRKG-KRSGAYCTGLPP--- 206 (427)
T ss_pred cCcHhhcCCCCCCCCCCcCcHHHHHHHHHHHHHccCHHHH--HHH---HHHhhCCCeeccCCCC-CCCCeecCCCCC---
Confidence 79999999987777666777776 3443321 111 2355677999999999 677888866421
Q ss_pred cccCCCCccccceEEEeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh
Q 018718 247 VLTLDGATTRLPVVHMVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI 324 (351)
Q Consensus 247 ~~~~dG~~~~~P~~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~ 324 (351)
...| +++|||.+ +++ .||||||||++|.++++..++ +.+. ++|+||+|+|||+||.+
T Consensus 207 --------~~~p--~i~~n~~~---------~~~~v~tl~HE~GHa~h~~~~~~~~~-~~~~-~~~~~~~E~~S~~~E~~ 265 (427)
T cd06459 207 --------GKHP--FILMNFNG---------TLDDVFTLAHELGHAFHSYLSRDNQP-YLYS-DYPIFLAEIASTFNELL 265 (427)
T ss_pred --------CCCC--eEEecCCC---------ChhhHHHHHHHhhHHHHHHHHccCCC-cccC-CCCchhhHHHHHHHHHH
Confidence 1235 56899976 345 999999999999999987554 3333 47899999999999765
No 13
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=99.83 E-value=4.1e-21 Score=200.95 Aligned_cols=274 Identities=14% Similarity=0.137 Sum_probs=155.9
Q ss_pred HHHHhcccccccccccccccceEE--EeC-CCHHHHHHHHHHHHHHHhhhhhcccc-hhhHHHHHHHh-hc-cCCCHH--
Q 018718 9 VVFLLLPPSAVQSRFKSRNTFLCF--SIC-PSPASAATSMAATGAVVDGLSLEDNP-LLQDFEIIGHL-KS-VKNNPD-- 80 (351)
Q Consensus 9 ~~~~~~p~~~~en~~~~~~~~l~f--~~~-~~~~~r~as~~a~~~~~~~~~~~~~~-l~~v~~~~~~l-~~-~~~~~~-- 80 (351)
+.+.+.++-+.+............ .+. .|++.++...++.. ++.++...... -..++.+-... .. ....++
T Consensus 50 l~~~l~~~e~~~~~~~~l~~Ya~l~~s~d~~d~~~~~~~~~~~~-l~s~~~~~~sf~~~eL~~l~~~~~~~~l~~~~~L~ 128 (591)
T TIGR00181 50 FLEALALEEKILILLNRLYNYASMKLSTDVTDPEANAISQKLSN-LYTKVASATSFFEPEILEIEEKIIKEWLKDPEELA 128 (591)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhccccccCCCCHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhCCHHHHHHHHhcChhhH
Confidence 344555555555555444444444 444 55788998888888 88777666552 22233322111 11 112221
Q ss_pred -HHHHHHHH-------------------H-HHHHHHHHHHHHHHHhhcccccccc-ChhhhcCCCHHH----H-------
Q 018718 81 -LRSAYEEV-------------------R-ELGKLSIDFSENVLDATKRFETFVT-DKKEIQGLPATA----L------- 127 (351)
Q Consensus 81 -~R~~~~ei-------------------~-~ls~L~~~F~~Nl~~~~~~~~~v~~-~~~eL~GlP~~~----l------- 127 (351)
.+..+.++ + .++..+..|.+..+.+. .|..+.. +++++ .+.... +
T Consensus 129 ~y~~~l~~~~r~k~h~Ls~e~E~lLa~~~~~~~a~~~~f~~l~~ad~-~f~~~~~~~G~~~-~l~~~~~~~~l~~~~dr~ 206 (591)
T TIGR00181 129 DYKRALEEIFRDKPHILSEEVEKLLSALSEVFGSPSDIYSTLTNADM-DFGSIEDYKGKKY-PITNSTYENFLQKNKDRE 206 (591)
T ss_pred HHHHHHHHHHHhCCCCCCHHHHHHHHhCchhhccHHHHHHHHHhccC-CCCceecCCCcEE-eccHHHHHHHHhcCCCHH
Confidence 22222222 1 34566677877655443 2222211 12221 122211 1
Q ss_pred -------HHHHhhCCCCCCcHHHHHHHHHHHHHHHhhh--------------------------------hHHHHHHHHH
Q 018718 128 -------AYVTCSFSGDLDNTSIIDQILKLRLEKAKLL--------------------------------DIEDLRSFCK 168 (351)
Q Consensus 128 -------a~~~ra~~~~~~N~~il~eil~LR~e~A~LL--------------------------------El~~L~~~k~ 168 (351)
++.++.......|..++.+++++|.+.|+++ .+....++|+
T Consensus 207 ~Rk~a~~a~~~~~~~~~~~~a~~l~~~v~~r~~~A~lrGy~s~~~~~l~~~~i~~~v~~~l~~~v~~~~~~~~~y~~~k~ 286 (591)
T TIGR00181 207 IRKKAYESFYKAYRKHKNTFAALYYGNVQKNVFLAKLRNYESYIDASLFSDEVPREVYDNLYDTIKKNAPVLQRYYKLRK 286 (591)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHhccCCCCHHHHHHHHHHHHHhhHHHHHHHHHHH
Confidence 2333332234578999999999999999998 1223333444
Q ss_pred hcCCCCCCccccCCHHHHHHHhhh--hhcCCCC--------------ccCCceeEEEEEcCCCCCccccccCccCCCCCC
Q 018718 169 GQGALEPREVNHWDINFWSERLCE--SKYDINE--------------VWNDDVRFYRVKDSSGSPIAYFYFDPYSCPSEK 232 (351)
Q Consensus 169 ~~~~~~~~~l~pWD~~Yy~~~~~~--~~~~vd~--------------vWh~dV~~~~V~d~~g~~lG~~YlDl~~R~gKk 232 (351)
+.. |.+++++||+.. ...+. ..|++++ .|.+-|+. ..+.=|+|+++|+| |
T Consensus 287 k~l--g~~~l~~~D~~~--~~~~~~~~~~~~~~a~~~v~~~~~~~g~~~~~~~~~---------~~~~~wiD~~~r~g-K 352 (591)
T TIGR00181 287 KVL--KLDKMEPYDLYL--PLVKEKNPKFSIEEAKELILKSLEPLGEEYIKILKR---------AFNERWVDYAENKG-K 352 (591)
T ss_pred HHh--CCCcccHHHccC--CCCCCcCCcccHHHHHHHHHHHHhccCHHHHHHHHH---------HhhCCCeeecCCCC-C
Confidence 433 456799999872 22222 2233333 12222211 01112999999999 6
Q ss_pred CCCceeeecccccccccCCCCccccceEEEeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCccccccCCCcc
Q 018718 233 KGSARICEVFSQSRVLTLDGATTRLPVVHMVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLVAGSRGIE 310 (351)
Q Consensus 233 ~~ga~~~~~~~~~~~~~~dG~~~~~P~~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~sGt~~v~ 310 (351)
++||||+... ...|. ++|||++ +++ .||||||||++|+++++...+...+ .+|
T Consensus 353 ~~Ga~~~~~~------------~~~p~--il~N~~~---------~~~dv~TLaHElGHa~H~~~~~~~~~~~~~--~~~ 407 (591)
T TIGR00181 353 RSGAYSIGGY------------KVKPY--ILMNWDG---------TLNSVFTLAHELGHSMHSYFSSKHQPYPNS--DYS 407 (591)
T ss_pred CCCcccCCCC------------CCCCe--EEEecCC---------CcchHHHHHHHhhhHHHHHHHccCCCCccC--CCC
Confidence 7788875541 12454 4789987 555 9999999999999999765554554 478
Q ss_pred ceeeeechhhchhh
Q 018718 311 WDAVELSTNNSWKI 324 (351)
Q Consensus 311 ~DfvE~PSq~~e~~ 324 (351)
+|+||+||++||.+
T Consensus 408 ~~~aE~aS~~~E~l 421 (591)
T TIGR00181 408 IFYAEIASTFNELL 421 (591)
T ss_pred chhhhHhHHHHHHH
Confidence 99999999999764
No 14
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=99.73 E-value=6.1e-19 Score=184.58 Aligned_cols=152 Identities=16% Similarity=0.197 Sum_probs=97.7
Q ss_pred HHHHHHHHHHHHHHHhhh-----------------------------hHHHHHH---HHHhcCCCCCCccccCCHHHHHH
Q 018718 141 TSIIDQILKLRLEKAKLL-----------------------------DIEDLRS---FCKGQGALEPREVNHWDINFWSE 188 (351)
Q Consensus 141 ~~il~eil~LR~e~A~LL-----------------------------El~~L~~---~k~~~~~~~~~~l~pWD~~Yy~~ 188 (351)
..++..+++.|...|++. .+..+++ .+++.. |.+.+++||..+...
T Consensus 223 a~~l~~~v~~~~~~a~~rgy~s~~e~~l~~~~i~~~~~~~l~~~v~~~~p~~~~y~~~k~~~l--gl~~l~~~D~~~p~~ 300 (587)
T TIGR02290 223 AAILNALAGDRLTEYRLRGYDHPLEPRLLYNRIDQETLDAMLEAIKENYPLFRRYYKLKAKLL--GKEKLDFYDLYAPLG 300 (587)
T ss_pred HHHHHHHHHHHHHHHHhhCCCCHHHHhhccCCCCHHHHHHHHHHHHHhhHHHHHHHHHHHHHh--CCCcccHHHcCCCCC
Confidence 346789999999999997 1112222 222222 457799999865433
Q ss_pred HhhhhhcCCCC----------ccCCceeEEEEEcCCCCCccccccCccCCCCCCCCCceeeecccccccccCCCCccccc
Q 018718 189 RLCESKYDINE----------VWNDDVRFYRVKDSSGSPIAYFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLP 258 (351)
Q Consensus 189 ~~~~~~~~vd~----------vWh~dV~~~~V~d~~g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P 258 (351)
...+..|.+++ .|.|+..-+.. .....-++|+++|+| |++||||+...+ + ..|
T Consensus 301 ~~~~~~~~~e~a~~~v~~~~~~l~~e~~~~~~-----~~~~~~~iD~~~r~g-K~~Ga~~~~~~~--------~---~~p 363 (587)
T TIGR02290 301 DSSAKTYTFDEAKELVLEAFGKFSPEMADFAE-----KAFEEGWIDAEPRPG-KRGGAFCTGFPP--------S---KEP 363 (587)
T ss_pred CCCCCeecHHHHHHHHHHHHHhcCHHHHHHHH-----HHHHcCCcccCCCCC-CCCCcccCCCCC--------C---CCC
Confidence 22234455543 33333321110 011222899999999 678888866421 1 236
Q ss_pred eEEEeeccCCCCCCCCCCCCCc-hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh
Q 018718 259 VVHMVCNQTPPVGDKPSLMTFS-ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI 324 (351)
Q Consensus 259 ~~~lv~Nf~~p~~~~P~LL~~~-~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~ 324 (351)
+++|||++. ..+ .||||||||++|++++++.. ..++ ++++|+||+||||||.+
T Consensus 364 --~i~~N~~~~--------~~~v~TL~HE~GHa~H~~ls~~~~--~~~~-~~~~~~aE~~S~~~E~l 417 (587)
T TIGR02290 364 --RVLMNYDGS--------RRDVSTLAHELGHAYHSELAKDQP--LLNA-RYPMTLAETASIFAEML 417 (587)
T ss_pred --EEEEecCCC--------chhHHHHHHHhhHHHHHHHHccCC--cccC-CCCchhhhHHHHHHHHH
Confidence 477999872 234 99999999999999998643 4443 47899999999999754
No 15
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=99.03 E-value=2.4e-10 Score=120.21 Aligned_cols=79 Identities=16% Similarity=0.244 Sum_probs=58.3
Q ss_pred cccCccCCCCCCCCCceeeecccccccccCCCCccccceEEEeeccCCCCCCCCCCCCCc-hhhHhhhhHHHhhhhhcCC
Q 018718 221 FYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHMVCNQTPPVGDKPSLMTFS-ETVFHEFGHALQRMLTKQD 299 (351)
Q Consensus 221 ~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~lv~Nf~~p~~~~P~LL~~~-~TLFHEfGHalH~lls~~~ 299 (351)
=++|+++|+| |++||++++.. +| ..|.+. +|+.+- -.| .||+||+||++|+.+++..
T Consensus 343 ~WiD~~~~~g-KrsGaYs~~~~--------~~---~~p~Il--mN~~gt--------~~dV~TLaHElGHs~Hs~~s~~~ 400 (598)
T COG1164 343 RWIDVYPRKG-KRSGAYSIGFY--------KG---DHPFIL--MNYDGT--------LRDVFTLAHELGHSVHSYFSRKH 400 (598)
T ss_pred CCeeccCCCC-CCCCcccCCCC--------CC---CCCeEE--EeCCCc--------hhHHHHHHHHccHHHHHHHHhcc
Confidence 3799999999 68888876542 11 357654 677651 345 8999999999999999854
Q ss_pred ccccccCCCccceeeeechhhchh
Q 018718 300 EGLVAGSRGIEWDAVELSTNNSWK 323 (351)
Q Consensus 300 y~~~sGt~~v~~DfvE~PSq~~e~ 323 (351)
-+-+.+. .++.++|+||+++|.
T Consensus 401 qp~~~~~--~~i~~AEiAS~fnE~ 422 (598)
T COG1164 401 QPYLYAD--YSIFLAEIASTFNEM 422 (598)
T ss_pred CCccccC--CchHHHHHHHHHHHH
Confidence 4424442 469999999999865
No 16
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=97.67 E-value=5.8e-05 Score=77.76 Aligned_cols=83 Identities=11% Similarity=-0.025 Sum_probs=48.9
Q ss_pred cCCCCCCCCCceeeecccccccccCCCCccccceEEEeeccCCCCCCCCCCCCCc-hhhHhhhhHHHhhhh-hcCCcccc
Q 018718 226 YSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHMVCNQTPPVGDKPSLMTFS-ETVFHEFGHALQRML-TKQDEGLV 303 (351)
Q Consensus 226 ~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~lv~Nf~~p~~~~P~LL~~~-~TLFHEfGHalH~ll-s~~~y~~~ 303 (351)
++|+|| ++| .|+...- .+ .. ..-|.+. +|+.. -..| .|+.||+||+.-.+. ...++--.
T Consensus 212 ~~rp~k-~~~-~C~~saw--d~---~~--~~d~rI~--~c~~~--------t~~D~~t~~HE~GH~~yy~~y~~~p~~~r 272 (477)
T cd06461 212 LEKPTD-REV-VCHASAW--DF---YN--GKDFRIK--MCTKV--------NMEDFVTVHHEMGHIQYYLQYKDQPVLFR 272 (477)
T ss_pred ccCCCC-CCC-ccCcccc--cC---CC--CCCccee--eCCCC--------CHHHHHHHHHHHHHHHHHHHhccCCHHHh
Confidence 999995 544 5555211 00 01 2356555 44433 1456 999999999994434 33333323
Q ss_pred ccCCCccc--eeeeechhhchhhhhhh
Q 018718 304 AGSRGIEW--DAVELSTNNSWKIGVTR 328 (351)
Q Consensus 304 sGt~~v~~--DfvE~PSq~~e~~~~l~ 328 (351)
.|. +... +.+|+.|+.|-.|+.|.
T Consensus 273 ~~a-np~fheav~e~~smS~~tpe~L~ 298 (477)
T cd06461 273 EGA-NPGFHEAVGDAIALSVSTPKHLH 298 (477)
T ss_pred CCC-CCChHHHHHHHHHHhcCCHHHHh
Confidence 443 2446 88999999995555554
No 17
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=95.73 E-value=0.085 Score=56.20 Aligned_cols=76 Identities=29% Similarity=0.449 Sum_probs=58.6
Q ss_pred HHHhhhhhcccchhhHHHHHHHhhccCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccccccccChhhhcCCCHHHH
Q 018718 50 AVVDGLSLEDNPLLQDFEIIGHLKSVKNNPDLRSAYEEVR-ELGKLSIDFSENVLDATKRFETFVTDKKEIQGLPATAL 127 (351)
Q Consensus 50 ~~~~~~~~~~~~l~~v~~~~~~l~~~~~~~~~R~~~~ei~-~ls~L~~~F~~Nl~~~~~~~~~v~~~~~eL~GlP~~~l 127 (351)
|++.+++..+.+|.+||.+|.+++++++++++|+.++++. +++++.+.-++|..=-. .+..++.+++ .+|+.+...
T Consensus 58 n~v~~Le~~~~~L~rv~~~~~~l~~v~~~~elr~a~e~~~pklse~~~~l~~~~~Lf~-~~~al~~~~e-~~~ld~e~~ 134 (683)
T COG0339 58 NTVEPLERAGDRLGRVWSVVSHLNSVHNSPELREAYEEILPKLSEFSNDLGQNPGLFA-RYKALWQSPE-SAGLDPEQK 134 (683)
T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHhcCCCHHHHHHHHHhhHHHHHHHHHhhcCHHHHH-HHHHHhcCcc-cccCCHHHH
Confidence 5788899999999999999999999999999999999998 77777777666644222 2333555554 677776644
No 18
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=93.55 E-value=0.028 Score=56.86 Aligned_cols=42 Identities=24% Similarity=0.192 Sum_probs=29.9
Q ss_pred hhhHhhhhHHHhhh-hhcCCc-cccccCCCccceeeeechhhchhh
Q 018718 281 ETVFHEFGHALQRM-LTKQDE-GLVAGSRGIEWDAVELSTNNSWKI 324 (351)
Q Consensus 281 ~TLFHEfGHalH~l-ls~~~y-~~~sGt~~v~~DfvE~PSq~~e~~ 324 (351)
.|+.||+|||+|.. +...-. +.+.+ ++++=+.|..|.|+|+.
T Consensus 161 ~t~iHE~GHalye~~l~~~~~~~pl~~--~~sm~ihESqS~f~E~~ 204 (396)
T cd06460 161 FSTIHETGHALYEQGLPPELRGTPLGG--GASMGIHESQSRLWENQ 204 (396)
T ss_pred HHHHHHhhHHHHHhcCCHhHccCCccc--CCCchhHHHHHHHHHHH
Confidence 68999999999998 652111 22222 34688999999999863
No 19
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=89.30 E-value=0.21 Score=46.43 Aligned_cols=30 Identities=33% Similarity=0.430 Sum_probs=23.1
Q ss_pred ccceEEEeeccCCCCCCCCCCCCCc-hhhHhhhhHHHhhh
Q 018718 256 RLPVVHMVCNQTPPVGDKPSLMTFS-ETVFHEFGHALQRM 294 (351)
Q Consensus 256 ~~P~~~lv~Nf~~p~~~~P~LL~~~-~TLFHEfGHalH~l 294 (351)
+.|+++++.|-+. ... .||+||+||++=|-
T Consensus 57 ~~~~I~iN~n~~~---------~r~rFtlAHELGH~llH~ 87 (213)
T COG2856 57 EKPVIYINANNSL---------ERKRFTLAHELGHALLHT 87 (213)
T ss_pred cCceEEEeCCCCH---------HHHHHHHHHHHhHHHhcc
Confidence 4599999888743 345 89999999997543
No 20
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=85.60 E-value=0.39 Score=38.58 Aligned_cols=32 Identities=28% Similarity=0.410 Sum_probs=21.9
Q ss_pred cceEEEeeccCCCCCCCCCCCCCc-hhhHhhhhHHHhhhhhc
Q 018718 257 LPVVHMVCNQTPPVGDKPSLMTFS-ETVFHEFGHALQRMLTK 297 (351)
Q Consensus 257 ~P~~~lv~Nf~~p~~~~P~LL~~~-~TLFHEfGHalH~lls~ 297 (351)
.|++++..|-++ ... .|++||+||.+.+--..
T Consensus 28 ~~~I~in~~~~~---------~~~~f~laHELgH~~~~~~~~ 60 (122)
T PF06114_consen 28 NPIIFINSNLSP---------ERQRFTLAHELGHILLHHGDE 60 (122)
T ss_dssp TTEEEEESSS-H---------HHHHHHHHHHHHHHHHHH-HH
T ss_pred CCEEEECCCCCH---------HHHHHHHHHHHHHHHhhhccc
Confidence 577777776321 233 89999999999876553
No 21
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=84.98 E-value=0.39 Score=41.08 Aligned_cols=11 Identities=55% Similarity=0.936 Sum_probs=10.3
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.|+.||+||+|
T Consensus 107 ~v~~HEiGHaL 117 (154)
T PF00413_consen 107 SVAIHEIGHAL 117 (154)
T ss_dssp HHHHHHHHHHT
T ss_pred hhhhhcccccc
Confidence 79999999997
No 22
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=84.94 E-value=0.47 Score=44.33 Aligned_cols=32 Identities=41% Similarity=0.453 Sum_probs=22.2
Q ss_pred hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechh
Q 018718 281 ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTN 319 (351)
Q Consensus 281 ~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq 319 (351)
.||+||+||++-+.+.= + +.| -+=|.|-.-+.
T Consensus 94 ~~l~HE~GHAlI~~~~l---P-v~G---rEEDaaD~lAa 125 (220)
T PF14247_consen 94 FTLYHELGHALIDDLDL---P-VLG---REEDAADQLAA 125 (220)
T ss_pred HHHHHHHHHHHHHHhcC---C-ccc---chhhHHHHHHH
Confidence 79999999999998732 2 445 24676664443
No 23
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=80.44 E-value=6.6 Score=42.62 Aligned_cols=48 Identities=17% Similarity=0.194 Sum_probs=41.0
Q ss_pred HHHHhhhhhcccchhhHHHHHHHhhccCCCHHHHHHHHHHH-HHHHHHH
Q 018718 49 GAVVDGLSLEDNPLLQDFEIIGHLKSVKNNPDLRSAYEEVR-ELGKLSI 96 (351)
Q Consensus 49 ~~~~~~~~~~~~~l~~v~~~~~~l~~~~~~~~~R~~~~ei~-~ls~L~~ 96 (351)
+|++.+++..+..+.+++.++.++++++.++++|+.+.++. ++++...
T Consensus 56 ~n~i~~ld~~~~~l~~~~~~~~~l~~v~~~~~~r~a~~~~~~~l~~~~~ 104 (681)
T PRK10280 56 NNTILALEQSGELLTRVTSVFFAMTAAHTNDELQRLDEQFSAELAELAN 104 (681)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHH
Confidence 35888999999999999999999999999999999999776 4444433
No 24
>PRK10911 oligopeptidase A; Provisional
Probab=77.55 E-value=10 Score=41.13 Aligned_cols=46 Identities=37% Similarity=0.512 Sum_probs=39.9
Q ss_pred HHHHhhhhhcccchhhHHHHHHHhhccCCCHHHHHHHHHHH-HHHHH
Q 018718 49 GAVVDGLSLEDNPLLQDFEIIGHLKSVKNNPDLRSAYEEVR-ELGKL 94 (351)
Q Consensus 49 ~~~~~~~~~~~~~l~~v~~~~~~l~~~~~~~~~R~~~~ei~-~ls~L 94 (351)
+|++.+++..+..+.++++++.+++++++++++|+...++. ++++.
T Consensus 50 ~n~i~~l~~~~~~l~~~~~~~~~l~~v~~~~~~r~a~~~~~~~l~~~ 96 (680)
T PRK10911 50 ENLCQPLAEVDDVLGRIFSPVSHLNSVKNSPELREAYEQTLPLLSEY 96 (680)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHH
Confidence 35888999999999999999999999999999999999776 44433
No 25
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=77.11 E-value=1.1 Score=40.02 Aligned_cols=12 Identities=58% Similarity=1.024 Sum_probs=10.9
Q ss_pred hhhHhhhhHHHh
Q 018718 281 ETVFHEFGHALQ 292 (351)
Q Consensus 281 ~TLFHEfGHalH 292 (351)
.|+.||+||+|=
T Consensus 115 ~t~~HEiGHaLG 126 (186)
T cd04277 115 QTIIHEIGHALG 126 (186)
T ss_pred HHHHHHHHHHhc
Confidence 899999999974
No 26
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=76.80 E-value=1.5 Score=40.11 Aligned_cols=20 Identities=45% Similarity=0.594 Sum_probs=16.7
Q ss_pred hhhHhhhhHHHhhhhhcCCc
Q 018718 281 ETVFHEFGHALQRMLTKQDE 300 (351)
Q Consensus 281 ~TLFHEfGHalH~lls~~~y 300 (351)
.|++||+||++=.+++-.+.
T Consensus 24 ~t~~HE~gHal~a~l~G~~v 43 (200)
T PF13398_consen 24 VTFVHELGHALAALLTGGRV 43 (200)
T ss_pred HHHHHHHHHHHHHHHhCCCc
Confidence 78999999999999875433
No 27
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=76.15 E-value=1.2 Score=38.69 Aligned_cols=11 Identities=55% Similarity=0.972 Sum_probs=10.2
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.|+.||+||+|
T Consensus 106 ~~~~HEiGHaL 116 (156)
T cd04279 106 AIALHELGHAL 116 (156)
T ss_pred HHHHHHhhhhh
Confidence 78999999997
No 28
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=76.12 E-value=0.95 Score=39.34 Aligned_cols=11 Identities=55% Similarity=0.836 Sum_probs=10.3
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.|+.||+||+|
T Consensus 109 ~~~~HEiGHaL 119 (157)
T cd04278 109 SVAAHEIGHAL 119 (157)
T ss_pred HHHHHHhcccc
Confidence 79999999996
No 29
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=76.00 E-value=1.2 Score=38.48 Aligned_cols=11 Identities=64% Similarity=0.915 Sum_probs=10.3
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.|+.||+||+|
T Consensus 96 ~~~~HEiGHaL 106 (165)
T cd04268 96 NTAEHELGHAL 106 (165)
T ss_pred HHHHHHHHHHh
Confidence 79999999997
No 30
>PF14891 Peptidase_M91: Effector protein
Probab=75.39 E-value=2.5 Score=37.71 Aligned_cols=18 Identities=28% Similarity=0.351 Sum_probs=16.0
Q ss_pred hhhHhhhhHHHhhhhhcC
Q 018718 281 ETVFHEFGHALQRMLTKQ 298 (351)
Q Consensus 281 ~TLFHEfGHalH~lls~~ 298 (351)
+.|+||+.||.|.+-+..
T Consensus 105 v~L~HEL~HA~~~~~Gt~ 122 (174)
T PF14891_consen 105 VVLYHELIHAYDYMNGTM 122 (174)
T ss_pred HHHHHHHHHHHHHHCCCC
Confidence 789999999999988764
No 31
>PF13058 DUF3920: Protein of unknown function (DUF3920)
Probab=72.19 E-value=2 Score=35.77 Aligned_cols=13 Identities=54% Similarity=0.963 Sum_probs=12.1
Q ss_pred hhhHhhhhHHHhh
Q 018718 281 ETVFHEFGHALQR 293 (351)
Q Consensus 281 ~TLFHEfGHalH~ 293 (351)
.||+|||-|++++
T Consensus 78 ktllhefrh~mQh 90 (126)
T PF13058_consen 78 KTLLHEFRHAMQH 90 (126)
T ss_pred HHHHHHHHHHHHh
Confidence 8999999999994
No 32
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=70.92 E-value=1.9 Score=36.44 Aligned_cols=11 Identities=55% Similarity=0.836 Sum_probs=9.9
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
-|+.||+||+|
T Consensus 88 ~~~~HEigHaL 98 (140)
T smart00235 88 GVAAHELGHAL 98 (140)
T ss_pred ccHHHHHHHHh
Confidence 47999999997
No 33
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=70.36 E-value=2.1 Score=38.94 Aligned_cols=12 Identities=67% Similarity=1.121 Sum_probs=10.4
Q ss_pred hhhHhhhhHHHh
Q 018718 281 ETVFHEFGHALQ 292 (351)
Q Consensus 281 ~TLFHEfGHalH 292 (351)
.|+.|||||+|=
T Consensus 94 ~~i~HElgHaLG 105 (198)
T cd04327 94 RVVLHEFGHALG 105 (198)
T ss_pred HHHHHHHHHHhc
Confidence 689999999974
No 34
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=69.06 E-value=2.6 Score=41.06 Aligned_cols=17 Identities=35% Similarity=0.622 Sum_probs=14.9
Q ss_pred hhhHhhhhHHHhhhhhc
Q 018718 281 ETVFHEFGHALQRMLTK 297 (351)
Q Consensus 281 ~TLFHEfGHalH~lls~ 297 (351)
-+|.||+||.+++++.-
T Consensus 172 yVlAHEyGHHVQ~l~Gi 188 (292)
T PF04228_consen 172 YVLAHEYGHHVQNLLGI 188 (292)
T ss_pred HHHHHHHHHHHHHHhhh
Confidence 57999999999999863
No 35
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=67.67 E-value=2.5 Score=34.80 Aligned_cols=11 Identities=55% Similarity=0.906 Sum_probs=9.9
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.|+.||+||.|
T Consensus 109 ~~~~HEiGH~l 119 (124)
T PF13582_consen 109 DTFAHEIGHNL 119 (124)
T ss_dssp THHHHHHHHHT
T ss_pred eEeeehhhHhc
Confidence 78999999986
No 36
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=65.15 E-value=3 Score=36.00 Aligned_cols=11 Identities=64% Similarity=1.078 Sum_probs=10.4
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.|+.||+||+|
T Consensus 98 ~~~~HElGH~L 108 (167)
T cd00203 98 QTIAHELGHAL 108 (167)
T ss_pred hhHHHHHHHHh
Confidence 88999999998
No 37
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=60.18 E-value=4.1 Score=36.39 Aligned_cols=11 Identities=55% Similarity=0.742 Sum_probs=10.0
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.|+.||+||.|
T Consensus 144 ~~~AHEiGH~l 154 (196)
T PF13688_consen 144 ITFAHEIGHNL 154 (196)
T ss_dssp HHHHHHHHHHT
T ss_pred hhhHHhHHHhc
Confidence 89999999975
No 38
>PF02128 Peptidase_M36: Fungalysin metallopeptidase (M36); InterPro: IPR001842 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M36 (fungalysin family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Fungalysin is produced by fungi, Aspergillus and other species, to aid degradation of host lung cell walls on infection. The enzyme is a 42kDa single chain protein, with a pH optimum of 7.5-8.0 and optimal temperature of 60 celcius [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0005615 extracellular space
Probab=59.14 E-value=2.5 Score=42.53 Aligned_cols=17 Identities=29% Similarity=0.622 Sum_probs=14.9
Q ss_pred hhhHhhhhHHHhhhhhc
Q 018718 281 ETVFHEFGHALQRMLTK 297 (351)
Q Consensus 281 ~TLFHEfGHalH~lls~ 297 (351)
..+.||.||+|+.-|+-
T Consensus 187 ~Ii~HEy~HGiSnRLvg 203 (378)
T PF02128_consen 187 GIIAHEYGHGISNRLVG 203 (378)
T ss_pred CeeEEeecccccccccC
Confidence 78999999999988754
No 39
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=57.36 E-value=4.9 Score=36.00 Aligned_cols=11 Identities=55% Similarity=0.770 Sum_probs=9.8
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
-|+.|||||||
T Consensus 76 g~v~HE~~Hal 86 (180)
T cd04280 76 GTIVHELMHAL 86 (180)
T ss_pred chhHHHHHHHh
Confidence 67899999997
No 40
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=56.90 E-value=5 Score=36.81 Aligned_cols=11 Identities=55% Similarity=0.875 Sum_probs=9.9
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.|+.||+||.|
T Consensus 139 ~~~aHEiGH~l 149 (206)
T PF13583_consen 139 QTFAHEIGHNL 149 (206)
T ss_pred hHHHHHHHHHh
Confidence 67999999986
No 41
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=56.35 E-value=5.2 Score=35.11 Aligned_cols=12 Identities=50% Similarity=0.695 Sum_probs=10.6
Q ss_pred hhhHhhhhHHHh
Q 018718 281 ETVFHEFGHALQ 292 (351)
Q Consensus 281 ~TLFHEfGHalH 292 (351)
.||-||+||-|-
T Consensus 71 ~TltHEvGH~LG 82 (154)
T PF05572_consen 71 KTLTHEVGHWLG 82 (154)
T ss_dssp HHHHHHHHHHTT
T ss_pred cchhhhhhhhhc
Confidence 899999999764
No 42
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=56.25 E-value=11 Score=40.55 Aligned_cols=64 Identities=17% Similarity=0.240 Sum_probs=33.0
Q ss_pred CCCCccccccCccCCCCCCCCCceeeecccccccccCCCCccccceEEE--eeccCCCCCCCCCCCCCc-hhhHhhhhHH
Q 018718 214 SGSPIAYFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHM--VCNQTPPVGDKPSLMTFS-ETVFHEFGHA 290 (351)
Q Consensus 214 ~g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~l--v~Nf~~p~~~~P~LL~~~-~TLFHEfGHa 290 (351)
+|+.||.-|.-- ||.| ..|| +|..... ..+|..+ +=-+-+ +-|+. ++..-.-. .||.||+||.
T Consensus 338 e~GtLGLAwVgs-p~~~-saGG-----IC~k~~~-y~~G~~~-sLNtGi~T~~NYg-----~~Vp~kvs~lt~AHEiGHN 403 (764)
T KOG3658|consen 338 EGGTLGLAWVGS-PRSN-SAGG-----ICEKAYT-YNEGKKR-SLNTGISTSVNYG-----KRVPTKVSDLTLAHEIGHN 403 (764)
T ss_pred ccceeeeEEecc-CccC-CCCc-----cccccee-ccCcceE-EeecceeeeeecC-----CccCcchhheeehhhhccc
Confidence 578889888865 6665 3444 3332211 1224211 111111 13343 33333444 8999999997
Q ss_pred H
Q 018718 291 L 291 (351)
Q Consensus 291 l 291 (351)
+
T Consensus 404 f 404 (764)
T KOG3658|consen 404 F 404 (764)
T ss_pred c
Confidence 5
No 43
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=53.02 E-value=6.8 Score=34.82 Aligned_cols=12 Identities=50% Similarity=0.872 Sum_probs=10.6
Q ss_pred c-hhhHhhhhHHH
Q 018718 280 S-ETVFHEFGHAL 291 (351)
Q Consensus 280 ~-~TLFHEfGHal 291 (351)
+ .|+.||+||.+
T Consensus 111 ~~~~~aHElGH~l 123 (173)
T PF13574_consen 111 GIDTFAHELGHQL 123 (173)
T ss_dssp HHHHHHHHHHHHH
T ss_pred eeeeehhhhHhhc
Confidence 6 78999999976
No 44
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=51.98 E-value=6.5 Score=37.71 Aligned_cols=17 Identities=35% Similarity=0.751 Sum_probs=11.5
Q ss_pred hhhHhhhhHHHhhhhhc
Q 018718 281 ETVFHEFGHALQRMLTK 297 (351)
Q Consensus 281 ~TLFHEfGHalH~lls~ 297 (351)
.++-|||||++-.|--.
T Consensus 218 ~v~vHE~GHsf~~LaDE 234 (264)
T PF09471_consen 218 QVVVHEFGHSFGGLADE 234 (264)
T ss_dssp HHHHHHHHHHTT-----
T ss_pred ceeeeeccccccccccc
Confidence 89999999999988754
No 45
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=51.60 E-value=7.4 Score=35.21 Aligned_cols=11 Identities=64% Similarity=1.132 Sum_probs=10.1
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
-|+.|||||||
T Consensus 81 ~~i~HEl~HaL 91 (191)
T PF01400_consen 81 GTILHELGHAL 91 (191)
T ss_dssp HHHHHHHHHHH
T ss_pred cchHHHHHHHH
Confidence 68999999997
No 46
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=50.98 E-value=8.7 Score=30.78 Aligned_cols=17 Identities=18% Similarity=0.159 Sum_probs=14.8
Q ss_pred hhhHhhhhHHHhhhhhc
Q 018718 281 ETVFHEFGHALQRMLTK 297 (351)
Q Consensus 281 ~TLFHEfGHalH~lls~ 297 (351)
.+|.||++|.+|.-...
T Consensus 27 ~~l~HE~~H~~~~~~~~ 43 (128)
T PF13485_consen 27 RVLAHELAHQWFGNYFG 43 (128)
T ss_pred HHHHHHHHHHHHHHHcC
Confidence 89999999999987754
No 47
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=50.76 E-value=6.7 Score=36.43 Aligned_cols=12 Identities=33% Similarity=0.744 Sum_probs=10.8
Q ss_pred hhhHhhhhHHHh
Q 018718 281 ETVFHEFGHALQ 292 (351)
Q Consensus 281 ~TLFHEfGHalH 292 (351)
.++-||+||+|=
T Consensus 135 hvi~HEiGH~IG 146 (211)
T PF12388_consen 135 HVITHEIGHCIG 146 (211)
T ss_pred HHHHHHhhhhcc
Confidence 689999999985
No 48
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=49.84 E-value=7.9 Score=35.06 Aligned_cols=11 Identities=45% Similarity=0.567 Sum_probs=9.7
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
-|+.|||||||
T Consensus 79 G~i~HEl~HaL 89 (182)
T cd04283 79 GIIQHELLHAL 89 (182)
T ss_pred chHHHHHHHHh
Confidence 47899999997
No 49
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=49.35 E-value=7.5 Score=36.38 Aligned_cols=15 Identities=40% Similarity=0.558 Sum_probs=11.9
Q ss_pred hhhHhhhhHHHhhhh
Q 018718 281 ETVFHEFGHALQRML 295 (351)
Q Consensus 281 ~TLFHEfGHalH~ll 295 (351)
-+=.||.|||+|+--
T Consensus 91 aVAAHEvGHAiQ~a~ 105 (222)
T PF04298_consen 91 AVAAHEVGHAIQHAE 105 (222)
T ss_pred HHHHHHHhHHHhccc
Confidence 345799999999763
No 50
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=48.05 E-value=8.1 Score=36.38 Aligned_cols=11 Identities=36% Similarity=0.631 Sum_probs=10.1
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.|+.||+||.+
T Consensus 169 ~t~AHElGHnl 179 (244)
T cd04270 169 LVTAHELGHNF 179 (244)
T ss_pred HHHHHHHHHhc
Confidence 79999999986
No 51
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=45.43 E-value=8.8 Score=34.20 Aligned_cols=11 Identities=55% Similarity=0.815 Sum_probs=10.3
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.|+.||+||.|
T Consensus 135 ~~~aHElGH~l 145 (192)
T cd04267 135 LTMAHELGHNL 145 (192)
T ss_pred hhhhhhHHhhc
Confidence 89999999997
No 52
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=44.81 E-value=6.4 Score=37.98 Aligned_cols=12 Identities=42% Similarity=0.747 Sum_probs=10.0
Q ss_pred hhhHhhhhHHHh
Q 018718 281 ETVFHEFGHALQ 292 (351)
Q Consensus 281 ~TLFHEfGHalH 292 (351)
-|+.|||||+|-
T Consensus 167 gv~~HE~gH~lG 178 (286)
T TIGR03296 167 GVIAHELGHDLG 178 (286)
T ss_pred eeeehhhhcccC
Confidence 678999999763
No 53
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=44.18 E-value=8.2 Score=36.06 Aligned_cols=13 Identities=31% Similarity=0.672 Sum_probs=11.6
Q ss_pred hhhHhhhhHHH---hh
Q 018718 281 ETVFHEFGHAL---QR 293 (351)
Q Consensus 281 ~TLFHEfGHal---H~ 293 (351)
.|+.||+||.| |+
T Consensus 147 ~t~AHElGHnLGm~HD 162 (228)
T cd04271 147 QVFAHEIGHTFGAVHD 162 (228)
T ss_pred eehhhhhhhhcCCCCC
Confidence 78999999998 65
No 54
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=43.06 E-value=12 Score=34.50 Aligned_cols=11 Identities=45% Similarity=0.884 Sum_probs=9.7
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
-|+.||+||+|
T Consensus 89 Gti~HEl~HaL 99 (200)
T cd04281 89 GIVVHELGHVI 99 (200)
T ss_pred chHHHHHHHHh
Confidence 57899999997
No 55
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=42.76 E-value=12 Score=34.35 Aligned_cols=16 Identities=38% Similarity=0.557 Sum_probs=12.4
Q ss_pred hhhHhhhhHHHhhhhh
Q 018718 281 ETVFHEFGHALQRMLT 296 (351)
Q Consensus 281 ~TLFHEfGHalH~lls 296 (351)
..++||+||++=...-
T Consensus 40 ~v~iHElgH~~~A~~~ 55 (208)
T cd06161 40 SVLLHELGHALVARRY 55 (208)
T ss_pred HHHHHHHHHHHHHHHc
Confidence 5689999999865543
No 56
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=42.22 E-value=13 Score=32.92 Aligned_cols=18 Identities=33% Similarity=0.390 Sum_probs=13.2
Q ss_pred hhhHhhhhHHHhhhhhcC
Q 018718 281 ETVFHEFGHALQRMLTKQ 298 (351)
Q Consensus 281 ~TLFHEfGHalH~lls~~ 298 (351)
..+.||+||++=...-..
T Consensus 10 ~i~iHE~gH~~~A~~~G~ 27 (180)
T cd05709 10 SVTVHELGHALVARRLGV 27 (180)
T ss_pred HHHHHHHHHHHHHHHcCC
Confidence 457899999986655443
No 57
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=39.69 E-value=16 Score=32.38 Aligned_cols=24 Identities=33% Similarity=0.416 Sum_probs=15.5
Q ss_pred hhhHhhhhHHHhhhhhcCCccccc
Q 018718 281 ETVFHEFGHALQRMLTKQDEGLVA 304 (351)
Q Consensus 281 ~TLFHEfGHalH~lls~~~y~~~s 304 (351)
..+.||+||++=..+...+...+.
T Consensus 9 ~i~~HE~gH~~~a~~~G~~~~~~~ 32 (192)
T PF02163_consen 9 SIVLHELGHALAARLYGDKVPRFE 32 (192)
T ss_dssp HHHHHHHHHHHHHHTTT--B--EE
T ss_pred cccccccccccccccccccccccc
Confidence 357899999998877665555553
No 58
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=39.53 E-value=14 Score=33.89 Aligned_cols=12 Identities=42% Similarity=0.595 Sum_probs=10.6
Q ss_pred hhhHhhhhHHHh
Q 018718 281 ETVFHEFGHALQ 292 (351)
Q Consensus 281 ~TLFHEfGHalH 292 (351)
.++.||+||+|=
T Consensus 118 ~~~~he~gh~lG 129 (197)
T cd04276 118 YLLAHEVGHTLG 129 (197)
T ss_pred HHHHHHHHHHhc
Confidence 789999999973
No 59
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=37.74 E-value=16 Score=32.96 Aligned_cols=13 Identities=38% Similarity=0.685 Sum_probs=11.4
Q ss_pred hhhHhhhhHHHhh
Q 018718 281 ETVFHEFGHALQR 293 (351)
Q Consensus 281 ~TLFHEfGHalH~ 293 (351)
-.|.||+||..|.
T Consensus 91 aVlaHElgH~~~~ 103 (226)
T PF01435_consen 91 AVLAHELGHIKHR 103 (226)
T ss_dssp HHHHHHHHHHHTT
T ss_pred HHHHHHHHHHHcC
Confidence 6899999999874
No 60
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=36.76 E-value=15 Score=32.73 Aligned_cols=11 Identities=55% Similarity=0.842 Sum_probs=10.4
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.++.||+||.+
T Consensus 133 ~~~AHElGH~l 143 (194)
T cd04269 133 VTMAHELGHNL 143 (194)
T ss_pred HHHHHHHHhhc
Confidence 89999999997
No 61
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=35.82 E-value=17 Score=33.28 Aligned_cols=11 Identities=45% Similarity=0.609 Sum_probs=10.4
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.|+.||+||.|
T Consensus 147 ~~~AHElGH~l 157 (220)
T cd04272 147 YTMTHELAHLL 157 (220)
T ss_pred HHHHHHHHHHh
Confidence 89999999987
No 62
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=35.10 E-value=28 Score=27.07 Aligned_cols=14 Identities=29% Similarity=0.494 Sum_probs=11.8
Q ss_pred hhhHhhhhHHHhhh
Q 018718 281 ETVFHEFGHALQRM 294 (351)
Q Consensus 281 ~TLFHEfGHalH~l 294 (351)
..|.||+.|.++.-
T Consensus 63 ~llaHEl~Hv~Qq~ 76 (79)
T PF13699_consen 63 ALLAHELAHVVQQR 76 (79)
T ss_pred hhHhHHHHHHHhhc
Confidence 56899999999864
No 63
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=34.91 E-value=18 Score=33.99 Aligned_cols=12 Identities=67% Similarity=0.711 Sum_probs=10.0
Q ss_pred hhhHhhhhHHHh
Q 018718 281 ETVFHEFGHALQ 292 (351)
Q Consensus 281 ~TLFHEfGHalH 292 (351)
-|..||+||+|=
T Consensus 122 Gti~HEl~HalG 133 (230)
T cd04282 122 ATVEHEFLHALG 133 (230)
T ss_pred chHHHHHHHHhC
Confidence 578999999973
No 64
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=34.36 E-value=19 Score=33.57 Aligned_cols=15 Identities=33% Similarity=0.607 Sum_probs=11.8
Q ss_pred hhhHhhhhHHHhhhh
Q 018718 281 ETVFHEFGHALQRML 295 (351)
Q Consensus 281 ~TLFHEfGHalH~ll 295 (351)
..++||+||++=...
T Consensus 55 ~v~iHElgH~~~A~~ 69 (227)
T cd06164 55 SVLLHELGHSLVARR 69 (227)
T ss_pred HHHHHHHHHHHHHHH
Confidence 567999999986544
No 65
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=34.05 E-value=22 Score=30.51 Aligned_cols=12 Identities=42% Similarity=0.667 Sum_probs=10.0
Q ss_pred hhhHhhhhHHHh
Q 018718 281 ETVFHEFGHALQ 292 (351)
Q Consensus 281 ~TLFHEfGHalH 292 (351)
....||+||.|=
T Consensus 79 RIaaHE~GHiLG 90 (132)
T PF02031_consen 79 RIAAHELGHILG 90 (132)
T ss_dssp HHHHHHHHHHHT
T ss_pred eeeeehhccccC
Confidence 678899999864
No 66
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=33.52 E-value=20 Score=34.35 Aligned_cols=16 Identities=25% Similarity=0.455 Sum_probs=12.6
Q ss_pred hhhHhhhhHHHhhhhh
Q 018718 281 ETVFHEFGHALQRMLT 296 (351)
Q Consensus 281 ~TLFHEfGHalH~lls 296 (351)
..+.||+||++=...-
T Consensus 120 sv~iHElgHa~~Ar~~ 135 (263)
T cd06159 120 GVVVHELSHGILARVE 135 (263)
T ss_pred HHHHHHHHHHHHHHHc
Confidence 5789999999876543
No 67
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=32.04 E-value=21 Score=32.87 Aligned_cols=39 Identities=23% Similarity=0.229 Sum_probs=21.8
Q ss_pred hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchh
Q 018718 281 ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWK 323 (351)
Q Consensus 281 ~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~ 323 (351)
-.-.||-||++++- .+|+.+-= |+.=.-.+-+-|++-|-
T Consensus 94 aVAAHEVGHAiQd~---~~Y~~L~~-R~~lvPv~~~gSn~a~~ 132 (226)
T COG2738 94 AVAAHEVGHAIQDQ---EDYAFLVL-RHALVPVANFGSNLAPL 132 (226)
T ss_pred HHHHHHhhHHHhhh---cccHHHHH-hhcccceeccccchhHH
Confidence 34689999999974 34543211 11113445566776543
No 68
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=31.17 E-value=28 Score=31.43 Aligned_cols=22 Identities=27% Similarity=0.205 Sum_probs=16.1
Q ss_pred hhhHhhhhHHHhhhhhcCCccc
Q 018718 281 ETVFHEFGHALQRMLTKQDEGL 302 (351)
Q Consensus 281 ~TLFHEfGHalH~lls~~~y~~ 302 (351)
..+.||+||.+=......+-..
T Consensus 11 ~v~iHElGH~~~Ar~~Gv~v~~ 32 (182)
T cd06163 11 LIFVHELGHFLVAKLFGVKVEE 32 (182)
T ss_pred HHHHHHHHHHHHHHHcCCeeeE
Confidence 4678999999987776554443
No 69
>PF10460 Peptidase_M30: Peptidase M30; InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue.
Probab=31.10 E-value=24 Score=35.46 Aligned_cols=55 Identities=15% Similarity=0.102 Sum_probs=31.3
Q ss_pred hhhHhhhhHHHhhhhhcCCccccccC-CCccceeeeechhhchhhhhhhc-ccCCCCcccc
Q 018718 281 ETVFHEFGHALQRMLTKQDEGLVAGS-RGIEWDAVELSTNNSWKIGVTRT-QVIPTIPEDG 339 (351)
Q Consensus 281 ~TLFHEfGHalH~lls~~~y~~~sGt-~~v~~DfvE~PSq~~e~~~~l~~-~~~~~~~~d~ 339 (351)
.||.|||-|.+|.--.... .|. ...+.=.=|.-|+++|++-..+. ....+++..+
T Consensus 141 sTlAHEfQHmInfy~~~v~----~g~~~~~dtWLnE~lS~~aEdl~s~~~~~~~n~i~d~R 197 (366)
T PF10460_consen 141 STLAHEFQHMINFYQRGVL----HGKQYAMDTWLNEMLSMSAEDLYSSKIDPGYNNIRDSR 197 (366)
T ss_pred HHHHHHHHHHHHHHHHHHh----cCCCcccccHHHHHHHHHHHHHHhcCCCcccCcccccc
Confidence 8999999999996432211 121 11223345888999988643333 2334444433
No 70
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=29.57 E-value=21 Score=33.41 Aligned_cols=12 Identities=50% Similarity=0.689 Sum_probs=10.7
Q ss_pred hhhHhhhhHHHh
Q 018718 281 ETVFHEFGHALQ 292 (351)
Q Consensus 281 ~TLFHEfGHalH 292 (351)
.|+-||+||.|-
T Consensus 139 ~t~~HEvGH~lG 150 (225)
T cd04275 139 DTATHEVGHWLG 150 (225)
T ss_pred ceeEEeccceee
Confidence 899999999874
No 71
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=28.90 E-value=24 Score=33.48 Aligned_cols=16 Identities=25% Similarity=0.434 Sum_probs=13.3
Q ss_pred CCCc---hhhHhhhhHHHh
Q 018718 277 MTFS---ETVFHEFGHALQ 292 (351)
Q Consensus 277 L~~~---~TLFHEfGHalH 292 (351)
|+.+ -.|.||+||..|
T Consensus 152 l~~dEl~aVlaHElgHi~~ 170 (302)
T COG0501 152 LNDDELEAVLAHELGHIKN 170 (302)
T ss_pred CCHHHHHHHHHHHHHHHhc
Confidence 4555 689999999998
No 72
>PF13402 M60-like: Peptidase M60-like family; PDB: 4FCA_A.
Probab=28.88 E-value=27 Score=33.43 Aligned_cols=14 Identities=36% Similarity=0.456 Sum_probs=10.3
Q ss_pred hhhHhhhhHHHhhh
Q 018718 281 ETVFHEFGHALQRM 294 (351)
Q Consensus 281 ~TLFHEfGHalH~l 294 (351)
--+.||+||.++.-
T Consensus 221 WG~~HE~GH~~Q~~ 234 (307)
T PF13402_consen 221 WGPWHELGHNHQQG 234 (307)
T ss_dssp HHHHHHHHHHH-BG
T ss_pred eeehhhhhhhcCcc
Confidence 35799999998843
No 73
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=28.82 E-value=14 Score=33.50 Aligned_cols=11 Identities=55% Similarity=0.839 Sum_probs=10.0
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.++.||+||.|
T Consensus 142 ~~~aHElGH~L 152 (207)
T cd04273 142 FTIAHELGHVL 152 (207)
T ss_pred Eeeeeechhhc
Confidence 78999999987
No 74
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=28.78 E-value=24 Score=34.75 Aligned_cols=11 Identities=55% Similarity=1.063 Sum_probs=9.6
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.|+|||+||-+
T Consensus 152 ~~~~HElgHN~ 162 (314)
T PF05548_consen 152 ATIMHELGHNL 162 (314)
T ss_pred HHHHHHhhhhc
Confidence 68999999965
No 75
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=28.50 E-value=27 Score=33.77 Aligned_cols=16 Identities=31% Similarity=0.567 Sum_probs=12.3
Q ss_pred hhhHhhhhHHHhhhhh
Q 018718 281 ETVFHEFGHALQRMLT 296 (351)
Q Consensus 281 ~TLFHEfGHalH~lls 296 (351)
..+.||+||++=...-
T Consensus 137 svvvHElgHal~A~~~ 152 (277)
T cd06162 137 SGVVHEMGHGVAAVRE 152 (277)
T ss_pred HHHHHHHHHHHHHHHc
Confidence 5678999999865543
No 76
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=28.23 E-value=28 Score=31.42 Aligned_cols=17 Identities=29% Similarity=0.261 Sum_probs=12.7
Q ss_pred hhhHhhhhHHHhhhhhc
Q 018718 281 ETVFHEFGHALQRMLTK 297 (351)
Q Consensus 281 ~TLFHEfGHalH~lls~ 297 (351)
..+.||+||++=...-.
T Consensus 43 ~l~iHElgH~~~A~~~G 59 (183)
T cd06160 43 ILGIHEMGHYLAARRHG 59 (183)
T ss_pred HHHHHHHHHHHHHHHCC
Confidence 56799999998655543
No 77
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=27.09 E-value=29 Score=31.07 Aligned_cols=11 Identities=45% Similarity=0.794 Sum_probs=9.9
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.+++||+||.|
T Consensus 133 ~~~AHelGH~l 143 (199)
T PF01421_consen 133 VIIAHELGHNL 143 (199)
T ss_dssp HHHHHHHHHHT
T ss_pred HHHHHHHHHhc
Confidence 89999999975
No 78
>PRK03982 heat shock protein HtpX; Provisional
Probab=26.67 E-value=29 Score=33.43 Aligned_cols=12 Identities=25% Similarity=0.398 Sum_probs=10.9
Q ss_pred hhhHhhhhHHHh
Q 018718 281 ETVFHEFGHALQ 292 (351)
Q Consensus 281 ~TLFHEfGHalH 292 (351)
-.|.||+||.-|
T Consensus 127 AVlAHElgHi~~ 138 (288)
T PRK03982 127 GVIAHELTHIKN 138 (288)
T ss_pred HHHHHHHHHHHc
Confidence 689999999987
No 79
>PF01742 Peptidase_M27: Clostridial neurotoxin zinc protease This family is a subset of the Prosite family; InterPro: IPR000395 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M27 (clan MA(E)). A number of the proteins have been classified as non-peptidase homologues as they have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity of peptidases in the family. There are seven antigenically distinct forms of botulinum neurotoxin, designated A, B, C1, D, E, F and G. The seven neurotoxins are potent protein toxins that inhibit neurotransmitter release from peripheral cholinergic synapses []. On binding to the neuronal synapses, the molecules are internalised and move by retrograde transport up the axon into the spinal cord, where they can move between post- and presynaptic neurons. The toxin inhibits neurotransmitter release by acting as a zinc endopeptidase that cleaves synaptic proteins such as synaptobrevins, syntaxin and SNAP-25 []. The protein toxins exist as disulphide-linked heterodimers of light and heavy chains. The light chain has the pharmacological activity, while the N- and C-termini of the heavy chain mediate channel formation and toxin binding []. The light chain exhibits a high level of sequence similarity to tetanus toxin (TeTx). Alignment of all characterised neurotoxin sequences reveals the presence of highly conserved amino acid domains interspersed with amino acid tracts with little overall similarity. The most divergent region corresponds to the C-terminal extremity of each toxin, which may reflect differences in specificity of binding to neurone acceptor sites []. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0009405 pathogenesis; PDB: 2QN0_A 3D3X_A 3NF3_A 2ISG_A 2ISE_B 2IMC_B 2IMA_B 2ILP_A 2IMB_A 2ISH_A ....
Probab=26.58 E-value=37 Score=34.44 Aligned_cols=17 Identities=24% Similarity=0.364 Sum_probs=15.6
Q ss_pred hhhHhhhhHHHhhhhhc
Q 018718 281 ETVFHEFGHALQRMLTK 297 (351)
Q Consensus 281 ~TLFHEfGHalH~lls~ 297 (351)
.+|.||+=|+||.|.+-
T Consensus 216 l~LmheLIh~Lh~LYGi 232 (408)
T PF01742_consen 216 LELMHELIHSLHGLYGI 232 (408)
T ss_dssp HHHHHHHHHHHHHHTTT
T ss_pred HHHHHHHHHHHHhhccc
Confidence 79999999999999874
No 80
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=23.48 E-value=33 Score=32.01 Aligned_cols=18 Identities=44% Similarity=0.635 Sum_probs=13.2
Q ss_pred CCCCCCc--hhhHhhhhHHH
Q 018718 274 PSLMTFS--ETVFHEFGHAL 291 (351)
Q Consensus 274 P~LL~~~--~TLFHEfGHal 291 (351)
|..+... .|.-||+||+|
T Consensus 180 pg~~~e~L~~tarhElGhaL 199 (236)
T COG5549 180 PGELRENLNPTARHELGHAL 199 (236)
T ss_pred cccchhhhhHHHHHhhcchh
Confidence 3444444 78999999998
No 81
>PRK02870 heat shock protein HtpX; Provisional
Probab=23.32 E-value=37 Score=33.78 Aligned_cols=13 Identities=23% Similarity=0.373 Sum_probs=11.6
Q ss_pred hhhHhhhhHHHhh
Q 018718 281 ETVFHEFGHALQR 293 (351)
Q Consensus 281 ~TLFHEfGHalH~ 293 (351)
-.+.||+||.-|.
T Consensus 175 aVlAHELgHik~~ 187 (336)
T PRK02870 175 AVMAHELSHIRHG 187 (336)
T ss_pred HHHHHHHHHHHcc
Confidence 6899999999885
No 82
>PRK03001 M48 family peptidase; Provisional
Probab=23.27 E-value=36 Score=32.66 Aligned_cols=13 Identities=31% Similarity=0.526 Sum_probs=11.5
Q ss_pred hhhHhhhhHHHhh
Q 018718 281 ETVFHEFGHALQR 293 (351)
Q Consensus 281 ~TLFHEfGHalH~ 293 (351)
-.|.||+||.-|.
T Consensus 126 aVlAHElgHi~~~ 138 (283)
T PRK03001 126 GVMAHELAHVKHR 138 (283)
T ss_pred HHHHHHHHHHhCC
Confidence 6899999999884
No 83
>PF10462 Peptidase_M66: Peptidase M66; InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=23.14 E-value=35 Score=33.50 Aligned_cols=11 Identities=36% Similarity=0.727 Sum_probs=10.2
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.++.||+||++
T Consensus 195 ~~f~HE~GH~~ 205 (305)
T PF10462_consen 195 NEFSHELGHNF 205 (305)
T ss_dssp HHHHHHHHHTT
T ss_pred ceeehhhhhhc
Confidence 89999999986
No 84
>PF05547 Peptidase_M6: Immune inhibitor A peptidase M6; InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=22.88 E-value=19 Score=38.95 Aligned_cols=11 Identities=55% Similarity=0.905 Sum_probs=9.5
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
=++.|||||+|
T Consensus 223 GVfaHEfGH~L 233 (645)
T PF05547_consen 223 GVFAHEFGHDL 233 (645)
T ss_pred EEEEeeccccC
Confidence 57889999987
No 85
>PHA03420 E4 protein; Provisional
Probab=22.73 E-value=44 Score=28.36 Aligned_cols=13 Identities=23% Similarity=0.685 Sum_probs=10.5
Q ss_pred cccCCHHHHHHHh
Q 018718 178 VNHWDINFWSERL 190 (351)
Q Consensus 178 l~pWD~~Yy~~~~ 190 (351)
-.|||..||.+.+
T Consensus 12 ~hPWdTPYYrrll 24 (137)
T PHA03420 12 HHPWDTPYYRRLL 24 (137)
T ss_pred CCCcccHHHHHHH
Confidence 3599999997764
No 86
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=22.29 E-value=45 Score=29.20 Aligned_cols=11 Identities=36% Similarity=0.685 Sum_probs=9.8
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
-.+.|||+|++
T Consensus 137 DVvaHEltHGV 147 (150)
T PF01447_consen 137 DVVAHELTHGV 147 (150)
T ss_dssp HHHHHHHHHHH
T ss_pred ceeeecccccc
Confidence 67899999997
No 87
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=22.21 E-value=33 Score=33.04 Aligned_cols=11 Identities=45% Similarity=0.776 Sum_probs=10.0
Q ss_pred hhhHhhhhHHH
Q 018718 281 ETVFHEFGHAL 291 (351)
Q Consensus 281 ~TLFHEfGHal 291 (351)
.|++||+||..
T Consensus 205 aTl~HElghwt 215 (316)
T COG4227 205 ATLLHELGHWT 215 (316)
T ss_pred HHHHHHhcccc
Confidence 89999999975
No 88
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=21.88 E-value=41 Score=34.25 Aligned_cols=45 Identities=24% Similarity=0.258 Sum_probs=25.4
Q ss_pred ccceEEEe--eccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCc
Q 018718 256 RLPVVHMV--CNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDE 300 (351)
Q Consensus 256 ~~P~~~lv--~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y 300 (351)
..|+..-+ =.+.-|-++-|-.++-- ...+||+|||+-.....++.
T Consensus 104 hnpv~le~iiPg~nLpl~~I~yf~t~lvi~~vvHElGHalAA~segV~v 152 (484)
T KOG2921|consen 104 HNPVFLESIIPGTNLPLSGIAYFLTSLVITVVVHELGHALAAASEGVQV 152 (484)
T ss_pred CCceEEEeecCccccccccchhhhhhHHHHHHHHHhhHHHHHHhcCcee
Confidence 45655322 22222434444334333 56799999999877765533
No 89
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=21.76 E-value=47 Score=34.08 Aligned_cols=21 Identities=24% Similarity=0.173 Sum_probs=14.7
Q ss_pred hhhHhhhhHHHhhhhhcCCcc
Q 018718 281 ETVFHEFGHALQRMLTKQDEG 301 (351)
Q Consensus 281 ~TLFHEfGHalH~lls~~~y~ 301 (351)
..++||+||.+=.-+...+-.
T Consensus 17 li~vHElGHfl~Ar~~gv~V~ 37 (449)
T PRK10779 17 LITVHEFGHFWVARRCGVRVE 37 (449)
T ss_pred HHHHHHHHHHHHHHHcCCeee
Confidence 467999999987665544333
No 90
>PRK04897 heat shock protein HtpX; Provisional
Probab=21.41 E-value=41 Score=32.61 Aligned_cols=12 Identities=25% Similarity=0.329 Sum_probs=10.8
Q ss_pred hhhHhhhhHHHh
Q 018718 281 ETVFHEFGHALQ 292 (351)
Q Consensus 281 ~TLFHEfGHalH 292 (351)
-.+.||+||.-|
T Consensus 139 aVlAHElgHi~~ 150 (298)
T PRK04897 139 GVIGHEISHIRN 150 (298)
T ss_pred HHHHHHHHHHhc
Confidence 689999999986
No 91
>PRK01345 heat shock protein HtpX; Provisional
Probab=21.32 E-value=42 Score=32.96 Aligned_cols=12 Identities=25% Similarity=0.426 Sum_probs=11.0
Q ss_pred hhhHhhhhHHHh
Q 018718 281 ETVFHEFGHALQ 292 (351)
Q Consensus 281 ~TLFHEfGHalH 292 (351)
-.|.||+||.-|
T Consensus 126 aVlAHElgHi~~ 137 (317)
T PRK01345 126 GVMAHELAHVKN 137 (317)
T ss_pred HHHHHHHHHHHc
Confidence 689999999988
No 92
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=20.82 E-value=44 Score=34.02 Aligned_cols=13 Identities=46% Similarity=0.544 Sum_probs=10.5
Q ss_pred hhhHhhhhHHHhh
Q 018718 281 ETVFHEFGHALQR 293 (351)
Q Consensus 281 ~TLFHEfGHalH~ 293 (351)
-|..||+||||=.
T Consensus 161 G~i~HEl~HaLGf 173 (411)
T KOG3714|consen 161 GTIVHELMHALGF 173 (411)
T ss_pred chhHHHHHHHhhh
Confidence 4688999999853
No 93
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=20.50 E-value=49 Score=33.71 Aligned_cols=19 Identities=32% Similarity=0.373 Sum_probs=14.3
Q ss_pred hhhHhhhhHHHhhhhhcCC
Q 018718 281 ETVFHEFGHALQRMLTKQD 299 (351)
Q Consensus 281 ~TLFHEfGHalH~lls~~~ 299 (351)
..++||+||.+=.-....+
T Consensus 16 ~v~~HE~gH~~~a~~~g~~ 34 (420)
T TIGR00054 16 LIFVHELGHFLAARLCGIK 34 (420)
T ss_pred HHHHHhHHHHHHHHHcCCE
Confidence 5789999999876655433
Done!