Query 043086
Match_columns 257
No_of_seqs 210 out of 1117
Neff 6.4
Searched_HMMs 46136
Date Fri Mar 29 12:43:33 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/043086.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/043086hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1565 Gelatinase A and relat 100.0 9.9E-44 2.2E-48 342.1 16.0 174 57-257 29-229 (469)
2 cd04278 ZnMc_MMP Zinc-dependen 99.8 1.1E-21 2.4E-26 163.6 6.9 99 155-257 1-125 (157)
3 PF00413 Peptidase_M10: Matrix 99.8 2.8E-19 6.1E-24 147.2 5.0 97 155-257 1-123 (154)
4 PF01471 PG_binding_1: Putativ 99.5 6.4E-14 1.4E-18 97.8 6.0 56 55-119 2-57 (57)
5 cd04279 ZnMc_MMP_like_1 Zinc-d 99.4 8.2E-13 1.8E-17 109.9 5.9 68 181-257 49-122 (156)
6 smart00235 ZnMc Zinc-dependent 99.3 6.3E-12 1.4E-16 102.5 5.8 81 154-256 3-103 (140)
7 cd04268 ZnMc_MMP_like Zinc-dep 99.0 5.7E-11 1.2E-15 98.8 1.4 71 181-257 42-112 (165)
8 cd04277 ZnMc_serralysin_like Z 98.9 1.4E-09 3E-14 93.0 5.1 63 181-256 61-130 (186)
9 TIGR02869 spore_SleB spore cor 98.9 5.9E-09 1.3E-13 91.0 7.1 72 48-129 6-77 (201)
10 cd04327 ZnMc_MMP_like_3 Zinc-d 98.5 1.2E-07 2.6E-12 82.2 5.9 89 155-256 2-109 (198)
11 cd00203 ZnMc Zinc-dependent me 98.3 6.6E-07 1.4E-11 74.2 4.7 59 181-256 50-113 (167)
12 cd04276 ZnMc_MMP_like_2 Zinc-d 98.3 6.8E-07 1.5E-11 77.9 3.5 91 160-256 7-133 (197)
13 PRK10594 murein L,D-transpepti 98.2 2.3E-06 5.1E-11 85.3 6.6 83 40-122 226-343 (608)
14 COG3409 Putative peptidoglycan 97.9 1.7E-05 3.6E-10 66.7 4.8 65 49-122 39-105 (185)
15 PF12388 Peptidase_M57: Dual-a 97.7 7.3E-05 1.6E-09 65.8 5.4 44 207-256 102-150 (211)
16 COG2989 Uncharacterized protei 97.5 0.00019 4.1E-09 70.4 5.9 68 47-121 230-298 (561)
17 PF05572 Peptidase_M43: Pregna 97.1 0.00025 5.4E-09 59.5 2.1 21 236-256 66-86 (154)
18 PF13582 Reprolysin_3: Metallo 97.0 0.00024 5.2E-09 56.2 1.2 18 239-256 107-124 (124)
19 COG3023 ampD N-acetyl-anhydrom 96.9 0.0016 3.6E-08 58.5 5.0 56 53-119 195-251 (257)
20 PF13583 Reprolysin_4: Metallo 96.7 0.00073 1.6E-08 59.1 1.4 19 238-256 136-154 (206)
21 PF08823 PG_binding_2: Putativ 96.6 0.0057 1.2E-07 45.3 5.6 57 53-118 13-73 (74)
22 COG5549 Predicted Zn-dependent 96.6 0.0013 2.7E-08 57.8 2.4 23 235-257 183-206 (236)
23 cd04273 ZnMc_ADAMTS_like Zinc- 96.6 0.0012 2.5E-08 57.5 2.0 66 181-256 92-157 (207)
24 cd04275 ZnMc_pappalysin_like Z 96.5 0.00051 1.1E-08 61.1 -0.8 62 181-256 84-154 (225)
25 PF13688 Reprolysin_5: Metallo 96.3 0.0018 3.9E-08 55.3 1.5 21 236-256 139-159 (196)
26 COG3409 Putative peptidoglycan 96.2 0.011 2.3E-07 49.5 5.7 62 51-120 123-184 (185)
27 cd04280 ZnMc_astacin_like Zinc 96.2 0.0019 4E-08 55.3 1.0 17 240-256 75-91 (180)
28 PF02031 Peptidase_M7: Strepto 96.2 0.002 4.4E-08 52.4 1.1 41 207-254 52-92 (132)
29 COG1913 Predicted Zn-dependent 96.2 0.0021 4.5E-08 54.9 1.1 19 238-256 123-141 (181)
30 PRK13267 archaemetzincin-like 96.1 0.003 6.5E-08 54.3 1.9 19 238-256 124-142 (179)
31 PF07998 Peptidase_M54: Peptid 96.0 0.0034 7.3E-08 54.7 1.6 19 238-256 144-162 (194)
32 PF13574 Reprolysin_2: Metallo 96.0 0.003 6.5E-08 53.7 1.2 18 239-256 111-128 (173)
33 PF01400 Astacin: Astacin (Pep 95.9 0.003 6.5E-08 54.5 1.2 17 239-255 79-95 (191)
34 cd04267 ZnMc_ADAM_like Zinc-de 95.9 0.0034 7.3E-08 53.6 1.2 20 237-256 131-150 (192)
35 cd04283 ZnMc_hatching_enzyme Z 95.8 0.0036 7.8E-08 54.0 1.2 17 240-256 78-94 (182)
36 cd04269 ZnMc_adamalysin_II_lik 95.8 0.005 1.1E-07 52.7 1.9 19 237-255 129-147 (194)
37 cd04281 ZnMc_BMP1_TLD Zinc-dep 95.4 0.0067 1.4E-07 53.1 1.2 17 240-256 88-104 (200)
38 cd04271 ZnMc_ADAM_fungal Zinc- 95.0 0.0083 1.8E-07 53.4 0.7 15 241-255 147-161 (228)
39 PF01421 Reprolysin: Reprolysi 95.0 0.01 2.2E-07 51.0 1.2 20 236-255 128-147 (199)
40 cd04270 ZnMc_TACE_like Zinc-de 94.7 0.017 3.6E-07 51.8 1.9 18 239-256 167-184 (244)
41 cd04272 ZnMc_salivary_gland_MP 94.6 0.017 3.7E-07 50.6 1.6 18 238-255 144-161 (220)
42 cd04282 ZnMc_meprin Zinc-depen 94.5 0.016 3.5E-07 51.8 1.2 17 240-256 121-137 (230)
43 PF11350 DUF3152: Protein of u 93.5 0.084 1.8E-06 46.2 3.7 73 179-256 78-156 (203)
44 PF05548 Peptidase_M11: Gameto 93.3 0.042 9.2E-07 51.2 1.7 20 238-257 149-168 (314)
45 PF10462 Peptidase_M66: Peptid 92.8 0.061 1.3E-06 50.0 1.9 20 238-257 192-211 (305)
46 KOG3714 Meprin A metalloprotea 92.3 0.048 1E-06 52.7 0.5 17 240-256 160-176 (411)
47 PF09374 PG_binding_3: Predict 91.2 0.15 3.2E-06 37.4 2.0 26 94-119 1-30 (72)
48 TIGR03296 M6dom_TIGR03296 M6 f 90.4 0.075 1.6E-06 48.7 -0.2 16 240-255 166-181 (286)
49 PF04298 Zn_peptidase_2: Putat 90.2 0.21 4.5E-06 44.5 2.4 16 236-251 86-101 (222)
50 COG3824 Predicted Zn-dependent 88.8 0.14 3E-06 41.4 0.2 19 236-254 106-124 (136)
51 PF12044 Metallopep: Putative 87.0 0.34 7.4E-06 47.0 1.7 17 240-256 317-333 (423)
52 PTZ00257 Glycoprotein GP63 (le 86.0 0.92 2E-05 46.0 4.3 36 215-254 236-271 (622)
53 PTZ00337 surface protease GP63 85.1 0.56 1.2E-05 47.2 2.3 35 215-255 211-245 (567)
54 PF09471 Peptidase_M64: IgA Pe 85.1 0.49 1.1E-05 43.2 1.7 16 238-253 215-230 (264)
55 PF06262 DUF1025: Possibl zinc 84.1 0.62 1.3E-05 36.2 1.6 17 238-254 72-88 (97)
56 KOG3607 Meltrins, fertilins an 83.2 0.62 1.3E-05 48.1 1.7 20 237-256 321-340 (716)
57 PF05547 Peptidase_M6: Immune 81.5 0.28 6E-06 50.1 -1.6 15 240-254 222-236 (645)
58 PF01457 Peptidase_M8: Leishma 80.7 0.91 2E-05 45.1 1.7 36 215-254 190-225 (521)
59 KOG4525 Jacalin-like lectin do 78.2 1.3 2.9E-05 43.1 1.9 18 240-257 302-319 (614)
60 COG4783 Putative Zn-dependent 75.6 1.4 3.1E-05 43.2 1.4 17 236-252 127-143 (484)
61 PF07172 GRP: Glycine rich pro 73.5 2.3 4.9E-05 32.9 1.8 9 9-17 4-12 (95)
62 PF06114 DUF955: Domain of unk 72.6 1.7 3.6E-05 32.9 0.8 27 217-251 28-54 (122)
63 PF14247 DUF4344: Domain of un 72.5 1.9 4.1E-05 38.3 1.3 14 238-251 91-104 (220)
64 PF04228 Zn_peptidase: Putativ 71.3 3.3 7.2E-05 38.3 2.7 39 211-251 139-182 (292)
65 PF01435 Peptidase_M48: Peptid 68.4 2.8 6.1E-05 35.9 1.5 15 237-251 87-101 (226)
66 COG2738 Predicted Zn-dependent 67.1 2.6 5.7E-05 37.0 1.0 16 236-251 89-104 (226)
67 COG2856 Predicted Zn peptidase 65.9 2.7 5.8E-05 37.2 0.8 26 218-251 59-84 (213)
68 COG0501 HtpX Zn-dependent prot 64.1 3.7 8.1E-05 36.9 1.4 15 237-251 155-169 (302)
69 PRK03982 heat shock protein Ht 63.7 3.8 8.3E-05 37.4 1.4 14 238-251 124-137 (288)
70 PF12725 DUF3810: Protein of u 62.8 5.6 0.00012 37.2 2.4 33 214-254 179-211 (318)
71 PRK03001 M48 family peptidase; 61.1 4.5 9.8E-05 36.9 1.5 14 238-251 123-136 (283)
72 PRK01345 heat shock protein Ht 60.2 4.8 0.0001 37.6 1.4 14 238-251 123-136 (317)
73 PRK03072 heat shock protein Ht 59.8 4.9 0.00011 36.9 1.5 15 237-251 125-139 (288)
74 cd06258 Peptidase_M3_like The 59.6 11 0.00024 35.2 3.8 16 236-251 151-166 (365)
75 PF11337 DUF3139: Protein of u 56.6 30 0.00065 25.7 5.0 22 53-74 32-53 (85)
76 PRK04897 heat shock protein Ht 56.0 6.2 0.00014 36.4 1.5 15 237-251 135-149 (298)
77 PF01863 DUF45: Protein of unk 55.8 8.1 0.00018 32.8 2.0 38 215-257 145-182 (205)
78 cd06164 S2P-M50_SpoIVFB_CBS Sp 54.9 6.4 0.00014 34.9 1.3 14 238-251 52-65 (227)
79 PF13485 Peptidase_MA_2: Pepti 54.7 6.8 0.00015 29.7 1.3 16 236-251 22-37 (128)
80 PRK05457 heat shock protein Ht 53.9 7.1 0.00015 35.8 1.4 14 237-250 132-145 (284)
81 PRK02870 heat shock protein Ht 53.7 7.1 0.00015 36.9 1.4 14 237-250 171-184 (336)
82 PRK02391 heat shock protein Ht 53.3 7.3 0.00016 36.0 1.4 14 237-250 131-144 (296)
83 PRK01265 heat shock protein Ht 53.1 7.4 0.00016 36.6 1.4 14 237-250 138-151 (324)
84 PF01432 Peptidase_M3: Peptida 52.3 6.9 0.00015 37.8 1.2 16 236-251 239-254 (458)
85 cd06459 M3B_Oligoendopeptidase 50.7 7.4 0.00016 36.8 1.1 16 236-251 219-234 (427)
86 KOG3658 Tumor necrosis factor- 50.6 5.6 0.00012 40.8 0.2 16 241-256 394-409 (764)
87 TIGR00181 pepF oligoendopeptid 50.0 11 0.00024 37.8 2.2 17 235-251 374-390 (591)
88 cd06162 S2P-M50_PDZ_SREBP Ster 49.3 8.8 0.00019 35.3 1.3 13 239-251 135-147 (277)
89 cd06455 M3A_TOP Peptidase M3 T 49.0 8.3 0.00018 37.7 1.2 15 237-251 261-275 (472)
90 TIGR02289 M3_not_pepF oligoend 48.2 17 0.00036 36.3 3.2 17 235-251 333-349 (549)
91 KOG3538 Disintegrin metallopro 48.1 4.2 9.2E-05 42.8 -1.1 22 235-256 313-334 (845)
92 PLN02791 Nudix hydrolase homol 47.8 9.8 0.00021 39.9 1.5 19 237-255 583-601 (770)
93 cd06159 S2P-M50_PDZ_Arch Uncha 47.3 9.9 0.00021 34.7 1.3 13 239-251 118-130 (263)
94 PF13699 DUF4157: Domain of un 47.3 11 0.00024 27.9 1.3 30 217-251 44-73 (79)
95 cd06457 M3A_MIP Peptidase M3 m 46.7 9.5 0.00021 37.2 1.1 16 236-251 245-260 (458)
96 TIGR02290 M3_fam_3 oligoendope 44.8 10 0.00022 38.0 1.1 15 237-251 373-387 (587)
97 PF01447 Peptidase_M4: Thermol 42.7 11 0.00024 31.4 0.8 12 240-251 136-147 (150)
98 cd06456 M3A_DCP_Oligopeptidase 42.6 13 0.00027 36.1 1.3 16 236-251 205-220 (422)
99 KOG2921 Intramembrane metallop 41.8 12 0.00027 36.2 1.1 13 239-251 131-143 (484)
100 COG1164 Oligoendopeptidase F [ 39.7 24 0.00053 35.8 2.9 16 236-251 377-392 (598)
101 PF03571 Peptidase_M49: Peptid 38.1 16 0.00035 36.8 1.3 16 237-252 291-307 (549)
102 COG4227 Antirestriction protei 36.3 25 0.00054 32.4 2.1 35 215-254 184-218 (316)
103 COG3926 zliS Lysozyme family p 35.4 27 0.00059 31.4 2.1 29 92-120 93-125 (252)
104 CHL00132 psaF photosystem I su 35.1 88 0.0019 27.1 5.1 39 56-104 44-82 (185)
105 KOG2719 Metalloprotease [Gener 34.3 21 0.00045 34.9 1.3 14 237-250 278-291 (428)
106 PF11045 YbjM: Putative inner 34.1 38 0.00082 27.6 2.6 25 1-25 1-25 (125)
107 PF04956 TrbC: TrbC/VIRB2 fami 33.6 73 0.0016 23.9 4.0 25 1-25 1-25 (99)
108 PRK10911 oligopeptidase A; Pro 32.3 22 0.00047 36.8 1.1 15 237-251 461-475 (680)
109 cd06460 M32_Taq Peptidase fami 31.6 23 0.00049 34.3 1.1 14 238-251 158-171 (396)
110 KOG2661 Peptidase family M48 [ 29.9 27 0.00058 33.3 1.2 14 238-251 274-287 (424)
111 PF14891 Peptidase_M91: Effect 29.3 37 0.00081 28.6 1.9 15 237-251 101-115 (174)
112 cd02641 R3H_Smubp-2_like R3H d 28.9 28 0.0006 24.4 0.9 20 238-257 28-47 (60)
113 PLN00081 photosystem I reactio 28.6 17 0.00037 29.5 -0.2 39 216-254 81-127 (141)
114 PF11119 DUF2633: Protein of u 26.8 90 0.002 22.1 3.1 7 1-7 1-7 (59)
115 TIGR02411 leuko_A4_hydro leuko 26.0 66 0.0014 32.7 3.3 14 237-250 277-290 (601)
116 PRK06287 cobalt transport prot 24.8 88 0.0019 24.6 3.2 28 1-28 1-28 (107)
117 PF07172 GRP: Glycine rich pro 24.2 91 0.002 24.0 3.1 17 8-24 6-22 (95)
118 cd02639 R3H_RRM R3H domain of 24.0 37 0.0008 24.0 0.8 20 238-257 28-47 (60)
119 PRK14125 cell division suppres 22.8 1.2E+02 0.0025 23.7 3.5 25 1-25 1-25 (103)
120 PF06167 Peptidase_M90: Glucos 21.2 40 0.00087 30.5 0.6 17 237-253 152-168 (253)
121 PRK10280 dipeptidyl carboxypep 21.1 46 0.001 34.4 1.2 14 238-251 463-476 (681)
122 COG2321 Predicted metalloprote 21.0 44 0.00095 30.9 0.8 40 211-251 136-179 (295)
123 KOG1047 Bifunctional leukotrie 21.0 50 0.0011 33.5 1.3 16 235-250 284-299 (613)
124 KOG3988 Protein-tyrosine sulfo 20.9 2E+02 0.0042 27.1 5.0 36 40-75 297-332 (378)
125 cd02640 R3H_NRF R3H domain of 20.4 52 0.0011 23.2 0.9 21 237-257 27-47 (60)
No 1
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=100.00 E-value=9.9e-44 Score=342.10 Aligned_cols=174 Identities=44% Similarity=0.792 Sum_probs=151.7
Q ss_pred cHHHHHHHHHhcCCCCCCCccccccccCCCcccHHHHHHHHHHHHhcCCCCCCCCChhhhhccCCCccccCCCCCCcccc
Q 043086 57 GINALKKYLHTLGYLSNHNHNHAAAAADGDYFDENLESAVKTYQRNFNLNPTGHMDLKTVSMMGKPRCGVPDVINGTTRM 136 (257)
Q Consensus 57 ~v~~~q~yL~~fGYl~~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~gL~~TG~lD~~T~~~M~~PRCGvpD~~~~~~~~ 136 (257)
++..++.||++|||+...+... . ...+..+++||+.||++++|++||++|.+|++.|.+|||||||.
T Consensus 29 ~~~~~~~yl~~~~y~~~~~~~~----~--~~~~~~~~~al~~~q~~~~l~~tG~lD~~Tl~~m~~prCgvpd~------- 95 (469)
T KOG1565|consen 29 DKVALQDYLECYGYLPPTDLTA----T--RASQNVLEDALKMMQDFFGLPVTGKLDNATLALMNKPRCGVPDG------- 95 (469)
T ss_pred chhHHHHHhhhcccCCCccccc----c--ccCchhhHHHHHhhhhhcCccccCCcchhhhhhccCCCcCCCCC-------
Confidence 6788999999999998874211 1 11578899999999999999999999999999999999999991
Q ss_pred CCCCCccccccccCCCCCCCCCCccceeEeecCCCc-ce----------------eecC------C-CCCccEEEEeecC
Q 043086 137 QGGPAHYHTHYVFYPGRPKWPATKQIISCAFLPGTR-TD----------------VQEP------D-YDAADVKISFQRG 192 (257)
Q Consensus 137 ~~~~~~~~~~~~~~~~~~kW~~~k~~LTy~i~~~~~-~~----------------~v~~------~-~~~aDI~I~f~~~ 192 (257)
+|+++++++||+ |++|||+|.+++. +. .|.+ . ...|||+|+|..+
T Consensus 96 ---------~~~~~~~~~kW~--k~~lT~ri~n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~ 164 (469)
T KOG1565|consen 96 ---------RYRYFPGKPKWN--KEHLTYRIKNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADIRISFFPG 164 (469)
T ss_pred ---------ccccCcccCccc--ccccceeccccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCceeeeeecc
Confidence 577788999999 9999999999853 21 1222 2 3799999999999
Q ss_pred CCCCCCCCCCCCCCcceeecCCC---CCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCCC
Q 043086 193 DHGDGYPFDGPGPYNLLAHSFPP---TDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHRP 257 (257)
Q Consensus 193 ~hgd~~~fdG~g~~~~lahaf~P---~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HSs 257 (257)
.|||++||||+| |+|||||+| ..|++|||++|.|+++ ...++||+.||+|||||+|||+||+
T Consensus 165 ~h~d~~PFDG~~--g~laHAf~Pg~~~~G~~hfD~dE~Wt~~-~~~g~~l~~Va~HEiGH~LGL~HS~ 229 (469)
T KOG1565|consen 165 DHGDGFPFDGPG--GVLAHAFFPGPGIGGDLHFDKDETWTYG-DSNGVDLFLVAAHEIGHALGLGHSS 229 (469)
T ss_pred CCCCCCcccCCC--CceecccCCCCCCCCccccCcccceecc-CCccchhHHHhhhhcccccccCCCC
Confidence 999999999999 999999999 6789999999999996 3469999999999999999999996
No 2
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=99.85 E-value=1.1e-21 Score=163.65 Aligned_cols=99 Identities=48% Similarity=0.924 Sum_probs=82.9
Q ss_pred CCCCCccceeEeecCCCc---ce--------ee------cC-------CCCCccEEEEeecCCCCCCCCCCCCCCCccee
Q 043086 155 KWPATKQIISCAFLPGTR---TD--------VQ------EP-------DYDAADVKISFQRGDHGDGYPFDGPGPYNLLA 210 (257)
Q Consensus 155 kW~~~k~~LTy~i~~~~~---~~--------~v------~~-------~~~~aDI~I~f~~~~hgd~~~fdG~g~~~~la 210 (257)
||+ |+.|||+|.+... .. ++ .. ...+|||+|.|....|+|+++|||++ |++|
T Consensus 1 kW~--~~~itY~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~--g~l~ 76 (157)
T cd04278 1 KWS--KTNLTYRILNYPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPG--GTLA 76 (157)
T ss_pred CCC--CCceeEEEECCCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCc--cccc
Confidence 799 9999999987542 10 11 11 22379999999999999999999999 9999
Q ss_pred ecCCC--CCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCCC
Q 043086 211 HSFPP--TDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHRP 257 (257)
Q Consensus 211 haf~P--~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HSs 257 (257)
||++| ..|+|+||.++.|+......+.+++.|++|||||+|||.|+.
T Consensus 77 ~a~~p~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~ 125 (157)
T cd04278 77 HAFFPGGIGGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHALGLGHSS 125 (157)
T ss_pred cccCCCCcceeEEECCCcccccCCCCccchHHHHHHHHhccccccCCCC
Confidence 99999 789999999999998643467899999999999999999984
No 3
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=99.77 E-value=2.8e-19 Score=147.23 Aligned_cols=97 Identities=43% Similarity=0.852 Sum_probs=80.6
Q ss_pred CCCCCccceeEeecCCCc-ce----------ee------cC------CCCCccEEEEeecCCCCCCCCCCCCCCCcceee
Q 043086 155 KWPATKQIISCAFLPGTR-TD----------VQ------EP------DYDAADVKISFQRGDHGDGYPFDGPGPYNLLAH 211 (257)
Q Consensus 155 kW~~~k~~LTy~i~~~~~-~~----------~v------~~------~~~~aDI~I~f~~~~hgd~~~fdG~g~~~~lah 211 (257)
||+ +++|||+|.+.++ .+ |+ .+ ...++||+|.|....+++...|++.+ +++++
T Consensus 1 ~W~--~~~ity~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~~~adi~i~~~~~~~~~~~~~~~~~--~~~~~ 76 (154)
T PF00413_consen 1 KWP--KKTITYSISNTTPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSDGNADIRISFGSNNHGDGYSFDGSG--GTLAH 76 (154)
T ss_dssp SSS--SSEEEEEESSBCTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESSSSCSEEEEEESSSSSSSS-CSSSS--SESEE
T ss_pred CCC--CCcEEEEEECCCCCCCHHHHHHHHHHHHHHHHhcCCceEEeccCCCcceeeeeeccccCcccccccce--eeeec
Confidence 799 8999999997652 11 11 11 33569999999999999999999998 99999
Q ss_pred cCCCC---CCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCCC
Q 043086 212 SFPPT---DGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHRP 257 (257)
Q Consensus 212 af~P~---~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HSs 257 (257)
++.|. .|+|+|+.++.|+... .+.++..|++|||||+|||.|++
T Consensus 77 ~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~~~v~~HEiGHaLGL~H~~ 123 (154)
T PF00413_consen 77 AYFPNNIVSGDIHFNDDESWTIDD--SGNDLQSVAIHEIGHALGLDHSN 123 (154)
T ss_dssp EEESSSTTTTEEEEETTSHEESSS--SSEEHHHHHHHHHHHHTTBESSS
T ss_pred cccccccccccccccccccchhhh--hhhhhhhhhhhccccccCcCcCC
Confidence 99996 7899999999998752 56799999999999999999984
No 4
>PF01471 PG_binding_1: Putative peptidoglycan binding domain; InterPro: IPR002477 This entry represents peptidoglycan binding domain (PGBD), as well as related domains that share the same structure. PGBD may have a general peptidoglycan binding function, has a core structure consisting of a closed, three-helical bundle with a left-handed twist. It is found at the N or C terminus of a variety of enzymes involved in bacterial cell wall degradation [, , ]. Examples are: Muramoyl-pentapeptide carboxypeptidase (3.4.17.8 from EC) N-acetylmuramoyl-L-alanine amidase cwlA precursor (cell wall hydrolase, autolysin, 3.5.1.28 from EC) Autolytic lysozyme (1,4-beta-N-acetylmuramidase, autolysin, 3.2.1.17 from EC) Membrane-bound lytic murein transglycosylase B Zinc-containing D-alanyl-D-alanine-cleaving carboxypeptidase, VanX []. Many of the proteins having this domain are as yet uncharacterised. However, some are known to belong to MEROPS peptidase family M15 (clan MD), subfamily M15A metallopeptidases. A number of the proteins belonging to subfamily M15A are non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. Eukaryotic enzymes can contain structurally similar PGBD-like domains. Matrix metalloproteinases (MMP), which catalyse extracellular matrix degradation, have N-terminal domains that resemble PGBD. Examples are gelatinase A (MMP-2), which degrades type IV collagen [], stromelysin-1 (MMP-3), which plays a role in arthritis and tumour invasion [, ], and gelatinase B (MMP-9) secreted by neutrophils as part of the innate immune defence mechanism []. Several MMPs are implicated in cancer progression, since degradation of the extracellular matrix is an essential step in the cascade of metastasis [].; GO: 0008152 metabolic process; PDB: 1L6J_A 3BKH_A 3BKV_A 1GXD_A 1EAK_D 1CK7_A 1SLM_A 1LBU_A 1SU3_B.
Probab=99.48 E-value=6.4e-14 Score=97.75 Aligned_cols=56 Identities=32% Similarity=0.489 Sum_probs=49.8
Q ss_pred hhcHHHHHHHHHhcCCCCCCCccccccccCCCcccHHHHHHHHHHHHhcCCCCCCCCChhhhhcc
Q 043086 55 VKGINALKKYLHTLGYLSNHNHNHAAAAADGDYFDENLESAVKTYQRNFNLNPTGHMDLKTVSMM 119 (257)
Q Consensus 55 ~~~v~~~q~yL~~fGYl~~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~gL~~TG~lD~~T~~~M 119 (257)
+++|..+|.+|.++||++... ++.|++.+++||+.||+.+||++||++|.+|+++|
T Consensus 2 ~~~v~~lq~~L~~~gy~~~~~---------~g~~~~~t~~Av~~fQ~~~gL~~tG~~d~~T~~~L 57 (57)
T PF01471_consen 2 GPDVKALQQYLNRLGYYPGPV---------DGIFDPETREAVKAFQKANGLPVTGVVDPETWEAL 57 (57)
T ss_dssp SHHHHHHHHHHHHTTTT-SST---------TSBSHHHHHHHHHHHHHHTTS-SSSSBCHHHHHHH
T ss_pred cHHHHHHHHHHHHcCCCCCCC---------CCCcCHHHHHHHHHHHHHcCcCCCCccCHHHHhcC
Confidence 678999999999999995542 68899999999999999999999999999999886
No 5
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.36 E-value=8.2e-13 Score=109.94 Aligned_cols=68 Identities=37% Similarity=0.463 Sum_probs=50.9
Q ss_pred CCccEEEEeecCCCCCCCCCCCCCCCcceeecCCCCCCCCcccCCCCCcc------CCCCCCchhHHHHHHhhhhhcCCC
Q 043086 181 DAADVKISFQRGDHGDGYPFDGPGPYNLLAHSFPPTDGRFHYDGDENWTV------GAVPGAVDMQTVALHELRHVLGLA 254 (257)
Q Consensus 181 ~~aDI~I~f~~~~hgd~~~fdG~g~~~~lahaf~P~~g~ihfd~~e~wt~------~~~~~~~~l~~va~HEiGH~LGL~ 254 (257)
.+|||+|.|.... +|+|.| +++|||++|..|. |+....|.. .....+.+++.|++|||||+|||.
T Consensus 49 ~~adi~I~f~~~~-----~~~~~g--~~~a~a~~p~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~ 119 (156)
T cd04279 49 NDADIVIFFDRPP-----PVGGAG--GGLARAGFPLISD--GNRKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLW 119 (156)
T ss_pred CCCcEEEEecCCC-----CCCCCC--CceEEecccccCC--CcccccccceEeecCCcCccchHHHHHHHHHhhhhhcCC
Confidence 3799999998543 799999 9999999997543 333333322 111125789999999999999999
Q ss_pred CCC
Q 043086 255 HRP 257 (257)
Q Consensus 255 HSs 257 (257)
|++
T Consensus 120 H~~ 122 (156)
T cd04279 120 HHS 122 (156)
T ss_pred CCC
Confidence 984
No 6
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.27 E-value=6.3e-12 Score=102.51 Aligned_cols=81 Identities=35% Similarity=0.531 Sum_probs=58.6
Q ss_pred CCCCCCccceeEeec-CC-Cc---ce----e------ecC-----CCCCccEEEEeecCCCCCCCCCCCCCCCcceeecC
Q 043086 154 PKWPATKQIISCAFL-PG-TR---TD----V------QEP-----DYDAADVKISFQRGDHGDGYPFDGPGPYNLLAHSF 213 (257)
Q Consensus 154 ~kW~~~k~~LTy~i~-~~-~~---~~----~------v~~-----~~~~aDI~I~f~~~~hgd~~~fdG~g~~~~lahaf 213 (257)
.+|+ +..++|.|. +. +. .. + +.. ...++||.|.|...++ .| ..+||++
T Consensus 3 ~~W~--~~~v~Y~i~~~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~~ad~~I~f~~~~~--------~g--~~~a~~g 70 (140)
T smart00235 3 KKWP--KGTVPYVIDSSSLSPEEVREAIARAFAEWSDVTCLRFVERTSTADIYISFGKGDG--------SG--CTLSHAG 70 (140)
T ss_pred CcCC--CCcEEEEEcCCCCCHHHHHHHHHHHHHHHhcCCeeEEEECCCCCCeEEEEEECCC--------CC--cceeeee
Confidence 5899 889999996 32 21 11 1 111 2238999999998765 12 2368998
Q ss_pred CCCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCC
Q 043086 214 PPTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHR 256 (257)
Q Consensus 214 ~P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HS 256 (257)
.| .|+++++. +.|+.+ ..|++|||||+|||.|+
T Consensus 71 ~~-~g~~~~~~-~~~~~~--------~~~~~HEigHaLGl~H~ 103 (140)
T smart00235 71 RP-GGDQHFSL-GNGCIN--------TGVAAHELGHALGLYHE 103 (140)
T ss_pred cC-CCceEEEc-cCCcCC--------cccHHHHHHHHhcCCcC
Confidence 88 88999987 777653 23999999999999997
No 7
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.04 E-value=5.7e-11 Score=98.85 Aligned_cols=71 Identities=25% Similarity=0.248 Sum_probs=47.0
Q ss_pred CCccEEEEeecCCCCCCCCCCCCCCCcceeecCCCCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCCC
Q 043086 181 DAADVKISFQRGDHGDGYPFDGPGPYNLLAHSFPPTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHRP 257 (257)
Q Consensus 181 ~~aDI~I~f~~~~hgd~~~fdG~g~~~~lahaf~P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HSs 257 (257)
..+||+|.+....+.. +|.. +.+.+...|..|+|+++....|.......+.++..|++|||||+|||.|+.
T Consensus 42 ~~~di~i~~~~~~~~~----~~~~--~~~~~~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~ 112 (165)
T cd04268 42 DPADIRYSVIRWIPYN----DGTW--SYGPSQVDPLTGEILLARVYLYSSFVEYSGARLRNTAEHELGHALGLRHNF 112 (165)
T ss_pred CccCceEEEEEeecCC----CCcc--ccCCccCCCCCccEEeeEEEEchhHHHHHHHHHHHHHHHHHHHHhcccccC
Confidence 4799999998643211 1222 333343456779999988766542101134578999999999999999973
No 8
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=98.91 E-value=1.4e-09 Score=93.02 Aligned_cols=63 Identities=21% Similarity=0.360 Sum_probs=48.6
Q ss_pred CCccEEEEeecCCCCCCCCCCCCCCCcceeecCCCC-------CCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCC
Q 043086 181 DAADVKISFQRGDHGDGYPFDGPGPYNLLAHSFPPT-------DGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGL 253 (257)
Q Consensus 181 ~~aDI~I~f~~~~hgd~~~fdG~g~~~~lahaf~P~-------~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL 253 (257)
.++||+|.+..... + ++.+.|++|. .|+|++|.+..|... ..+.....|++|||||+|||
T Consensus 61 ~~adI~i~~~~~~~---------~--~~~g~a~~p~~~~~~~~~g~i~~~~~~~~~~~--~~g~~~~~t~~HEiGHaLGL 127 (186)
T cd04277 61 SGADIRFGNSSDPD---------G--NTAGYAYYPGSGSGTAYGGDIWFNSSYDTNSD--SPGSYGYQTIIHEIGHALGL 127 (186)
T ss_pred CcceEEEEeccCCC---------C--CccEEEECCCCCccccccceeEEecCcccccC--CCChhhHHHHHHHHHHHhcC
Confidence 47999998864322 3 5678888884 368999998887643 23557899999999999999
Q ss_pred CCC
Q 043086 254 AHR 256 (257)
Q Consensus 254 ~HS 256 (257)
.|+
T Consensus 128 ~H~ 130 (186)
T cd04277 128 EHP 130 (186)
T ss_pred CCC
Confidence 997
No 9
>TIGR02869 spore_SleB spore cortex-lytic enzyme. Members of this protein family are the spore cortex-lytic enzyme SleB from Bacillus subtilis and other Gram-positive, endospore-forming bacterial species. This protein is stored in an inactive form in the spore and activated during germination.
Probab=98.85 E-value=5.9e-09 Score=91.01 Aligned_cols=72 Identities=21% Similarity=0.250 Sum_probs=63.4
Q ss_pred CCCCCCchhcHHHHHHHHHhcCCCCCCCccccccccCCCcccHHHHHHHHHHHHhcCCCCCCCCChhhhhccCCCccccC
Q 043086 48 GTQKGDTVKGINALKKYLHTLGYLSNHNHNHAAAAADGDYFDENLESAVKTYQRNFNLNPTGHMDLKTVSMMGKPRCGVP 127 (257)
Q Consensus 48 ~~~~g~~~~~v~~~q~yL~~fGYl~~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~gL~~TG~lD~~T~~~M~~PRCGvp 127 (257)
.++.|+.+++|..+|+.|...||+.... ++.|+..|.+||+.||+.+||++||++|..|+..|..| +.++
T Consensus 6 ~l~~G~~g~~V~~LQ~~L~~lG~~~g~i---------dG~fg~~T~~AV~~FQ~~~GL~~dG~vg~~T~~~L~~~-~~~~ 75 (201)
T TIGR02869 6 TYQRGSTGSDVIEIQRRLKAWGYYNGKV---------DGVFGWLTYWAVRKFQSKNGLTVDGIVGPKTKAALGIA-TTQQ 75 (201)
T ss_pred cCCCCCCcHHHHHHHHHHHHcCCCCCCC---------CCccCHHHHHHHHHHHHHhCCCCCCCcCHHHHHHhCCc-cccC
Confidence 4788999999999999999999987653 57899999999999999999999999999999999554 5555
Q ss_pred CC
Q 043086 128 DV 129 (257)
Q Consensus 128 D~ 129 (257)
.+
T Consensus 76 ~~ 77 (201)
T TIGR02869 76 SS 77 (201)
T ss_pred cc
Confidence 53
No 10
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=98.55 E-value=1.2e-07 Score=82.24 Aligned_cols=89 Identities=29% Similarity=0.476 Sum_probs=51.9
Q ss_pred CCCCCccceeEeecCCCcc---e----ee-----------cC-CCCCccEEEEeecCCCCCCCCCCCCCCCcceeecCCC
Q 043086 155 KWPATKQIISCAFLPGTRT---D----VQ-----------EP-DYDAADVKISFQRGDHGDGYPFDGPGPYNLLAHSFPP 215 (257)
Q Consensus 155 kW~~~k~~LTy~i~~~~~~---~----~v-----------~~-~~~~aDI~I~f~~~~hgd~~~fdG~g~~~~lahaf~P 215 (257)
+|+. .+.|+|.+...... . ++ .. ...++||+|.|..+. || +.+.|.. +...++
T Consensus 2 ~W~~-~~~~~~~f~~~~~~~~r~~I~~A~~~W~~~t~i~F~~~~~~~adi~I~f~~~~-Gc-~S~vG~~--~~~~~~--- 73 (198)
T cd04327 2 LWRN-GTVLRIAFLGGPDAFLKDKVRAAAREWLPYANLKFKFVTDADADIRISFTPGD-GY-WSYVGTD--ALLIGA--- 73 (198)
T ss_pred CCCC-CCeEEEEeCCCCcHHHHHHHHHHHHHHhhhcCeEEEECCCCCCCEEEEEecCC-CC-CCCcCCc--ccccCC---
Confidence 5874 56788888765321 0 11 11 335799999998743 43 4577764 321111
Q ss_pred CCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCC
Q 043086 216 TDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHR 256 (257)
Q Consensus 216 ~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HS 256 (257)
....+.++....++ .+.++..|++|||||+|||.|.
T Consensus 74 ~~~t~~l~~~~~~~-----~~~~~~~~i~HElgHaLG~~HE 109 (198)
T cd04327 74 DAPTMNLGWFTDDT-----PDPEFSRVVLHEFGHALGFIHE 109 (198)
T ss_pred CCceeeeeeecCCC-----chhhHHHHHHHHHHHHhcCccc
Confidence 12234453211111 2345778999999999999885
No 11
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=98.32 E-value=6.6e-07 Score=74.25 Aligned_cols=59 Identities=25% Similarity=0.305 Sum_probs=40.7
Q ss_pred CCccEEEEeecCCCCCCCCCCCCCCCcceeecCCCCC-----CCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCC
Q 043086 181 DAADVKISFQRGDHGDGYPFDGPGPYNLLAHSFPPTD-----GRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAH 255 (257)
Q Consensus 181 ~~aDI~I~f~~~~hgd~~~fdG~g~~~~lahaf~P~~-----g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~H 255 (257)
..+||.|.+...++ .+ +.+++|+.|.. |.++++....| +..+..|++|||||+|||.|
T Consensus 50 ~~~di~~~~~~~~~--------~~--~~~g~a~~~~~c~~~~~~~~~~~~~~~-------~~~~~~~~~HElGH~LGl~H 112 (167)
T cd00203 50 DKADIAILVTRQDF--------DG--GTGGWAYLGRVCDSLRGVGVLQDNQSG-------TKEGAQTIAHELGHALGFYH 112 (167)
T ss_pred CcCcEEEEeccccC--------CC--CceEEEecCCccCCCCCcEEEecCCcc-------cccchhhHHHHHHHHhCCCc
Confidence 36899998876433 22 45567777643 45555544332 34688999999999999999
Q ss_pred C
Q 043086 256 R 256 (257)
Q Consensus 256 S 256 (257)
.
T Consensus 113 ~ 113 (167)
T cd00203 113 D 113 (167)
T ss_pred c
Confidence 6
No 12
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=98.25 E-value=6.8e-07 Score=77.90 Aligned_cols=91 Identities=19% Similarity=0.122 Sum_probs=52.0
Q ss_pred ccceeEeecCCCcce-----------------------e--ecC-C--CCCccEEEEeecCCCCCCCCCCCCCCCcceee
Q 043086 160 KQIISCAFLPGTRTD-----------------------V--QEP-D--YDAADVKISFQRGDHGDGYPFDGPGPYNLLAH 211 (257)
Q Consensus 160 k~~LTy~i~~~~~~~-----------------------~--v~~-~--~~~aDI~I~f~~~~hgd~~~fdG~g~~~~lah 211 (257)
|+.|||+|.++++.. + |+. + ...+||++..-+--|.+ ++.. +...+
T Consensus 7 k~pItyyI~~~~p~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~~----~~~~--~~gps 80 (197)
T cd04276 7 KEPIVYYLDNTFPEKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSP----NGGW--AYGPS 80 (197)
T ss_pred CCCEEEEecCCCcHHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEecC----CCcc--eeccc
Confidence 889999999876421 1 111 1 12479995544444433 2332 33345
Q ss_pred cCCCCCC-----CCcccCCCCCccCC---CCCCchhHHHHHHhhhhhcCCCCC
Q 043086 212 SFPPTDG-----RFHYDGDENWTVGA---VPGAVDMQTVALHELRHVLGLAHR 256 (257)
Q Consensus 212 af~P~~g-----~ihfd~~e~wt~~~---~~~~~~l~~va~HEiGH~LGL~HS 256 (257)
...|..| +|+++......... ...+..+..+++||+||+|||.|.
T Consensus 81 ~~dPrTGeIl~a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lGl~hn 133 (197)
T cd04276 81 VVDPRTGEILKADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLGLRHN 133 (197)
T ss_pred ccCCCCCCeEEEEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhcCccc
Confidence 5567555 45666543221110 012245788999999999999995
No 13
>PRK10594 murein L,D-transpeptidase; Provisional
Probab=98.21 E-value=2.3e-06 Score=85.29 Aligned_cols=83 Identities=20% Similarity=0.211 Sum_probs=63.1
Q ss_pred hhhhhhccCCCCCCchhcHHHHHHHHHhcCCCCCCCcc----c-------------------------------cccccC
Q 043086 40 LEFFKLFQGTQKGDTVKGINALKKYLHTLGYLSNHNHN----H-------------------------------AAAAAD 84 (257)
Q Consensus 40 ~~~~~~l~~~~~g~~~~~v~~~q~yL~~fGYl~~~~~~----~-------------------------------~~~~~~ 84 (257)
|..+..-..++.|+..++|..+.+.|...|++...... . ......
T Consensus 226 Wp~i~~~~~LrpG~~~~~V~~LR~RL~~~G~l~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (608)
T PRK10594 226 WPQLTGKATLRPGQWSNDVPALREILQRTGMLDGGPKITLPGDDTPTDAVVSPSAVTVETAETKPMDKQTTSRSKPAPAV 305 (608)
T ss_pred CcccCCCCCCCCCCCCchHHHHHHHHHHcCCccccccccccccccccccccccccccccccccccccccccccccccccc
Confidence 44444344688999999999999999999998532100 0 000112
Q ss_pred CCcccHHHHHHHHHHHHhcCCCCCCCCChhhhhccCCC
Q 043086 85 GDYFDENLESAVKTYQRNFNLNPTGHMDLKTVSMMGKP 122 (257)
Q Consensus 85 ~~~~~~~~~~AIk~FQ~~~gL~~TG~lD~~T~~~M~~P 122 (257)
+..|+.++.+|||+||+.+||.++|+++..|++.|+.|
T Consensus 306 ~~~YD~~Lv~AVK~FQ~rhGL~~DGvIG~~T~~~LNvs 343 (608)
T PRK10594 306 RAAYDNELVEAVKRFQAWQGLGADGVIGPRTRDWLNVT 343 (608)
T ss_pred ccccCHHHHHHHHHHHHHcCCCCCcCcCHHHHHHHcCC
Confidence 45699999999999999999999999999999998654
No 14
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=97.88 E-value=1.7e-05 Score=66.67 Aligned_cols=65 Identities=18% Similarity=0.272 Sum_probs=57.3
Q ss_pred CCCCCchhcHHHHHHHHHhcCCCC-CCCccccccccCCCcccHHHHHHHHHHHHhcCC-CCCCCCChhhhhccCCC
Q 043086 49 TQKGDTVKGINALKKYLHTLGYLS-NHNHNHAAAAADGDYFDENLESAVKTYQRNFNL-NPTGHMDLKTVSMMGKP 122 (257)
Q Consensus 49 ~~~g~~~~~v~~~q~yL~~fGYl~-~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~gL-~~TG~lD~~T~~~M~~P 122 (257)
...+..++.|..+|..|...||.. ... +++|++.+..||+.||+.++| +++|+++++|+..+...
T Consensus 39 ~~~~~~~~~v~~lq~~L~~~g~~~~~~~---------dg~~g~~t~~av~~fQ~~~gl~~~dG~~g~~t~~al~~~ 105 (185)
T COG3409 39 LTLGAEGPSVRILQAALNALGYYPDGVI---------DGVYGPETAAAVRAFQQKNGLSPVDGIVGPATRAALPSQ 105 (185)
T ss_pred cccCCCCchHHHHHHHHHhcCCCCCCCc---------cCccCcccHHHHHHHhhhcCcccccccccHHHHHHHHHH
Confidence 445567899999999999999998 543 688999999999999999999 89999999999988653
No 15
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=97.66 E-value=7.3e-05 Score=65.76 Aligned_cols=44 Identities=20% Similarity=0.309 Sum_probs=29.4
Q ss_pred cceeecCCC-CCC----CCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCC
Q 043086 207 NLLAHSFPP-TDG----RFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHR 256 (257)
Q Consensus 207 ~~lahaf~P-~~g----~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HS 256 (257)
|..+-|.|| .+| .|.+.. +.. ..-.-..+|.+|||||++||.|+
T Consensus 102 G~ggsAGFP~s~G~P~~~I~I~~-----~~~-~~~~~~~hvi~HEiGH~IGfRHT 150 (211)
T PF12388_consen 102 GAGGSAGFPTSNGNPYKFIQIYG-----LSN-YSVNVIEHVITHEIGHCIGFRHT 150 (211)
T ss_pred CcceeccCCCCCCCCCceEEEEe-----cCC-CchhHHHHHHHHHhhhhcccccc
Confidence 556788889 545 355511 111 12234778999999999999996
No 16
>COG2989 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=97.47 E-value=0.00019 Score=70.42 Aligned_cols=68 Identities=24% Similarity=0.287 Sum_probs=57.1
Q ss_pred cCCCCCCchhcHHHHHHHHHhcCC-CCCCCccccccccCCCcccHHHHHHHHHHHHhcCCCCCCCCChhhhhccCC
Q 043086 47 QGTQKGDTVKGINALKKYLHTLGY-LSNHNHNHAAAAADGDYFDENLESAVKTYQRNFNLNPTGHMDLKTVSMMGK 121 (257)
Q Consensus 47 ~~~~~g~~~~~v~~~q~yL~~fGY-l~~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~gL~~TG~lD~~T~~~M~~ 121 (257)
..+++|+...+|..+.+.|..-|- +.... .....|+.++.+|||+||+.+||+.+|++++.|++.|+-
T Consensus 230 ~~LrpG~~~~~v~aL~~~L~~~~~d~~~a~-------~~s~~yd~el~~avKrfQ~~~GL~~DGviG~~T~~aLn~ 298 (561)
T COG2989 230 ALLRPGVTSPDVPALRARLARSGMDLPSAA-------GSSPAYDPELVEAVKRFQARHGLPADGVIGPATRAALNV 298 (561)
T ss_pred cccCCCCCchhHHHHHHHHHhcCccchhhc-------cCcccccHHHHHHHHHHHHHhCCCCCCcccHHHHHHhcc
Confidence 457899999999999999999884 22221 123479999999999999999999999999999999964
No 17
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=97.12 E-value=0.00025 Score=59.51 Aligned_cols=21 Identities=33% Similarity=0.495 Sum_probs=15.1
Q ss_pred CchhHHHHHHhhhhhcCCCCC
Q 043086 236 AVDMQTVALHELRHVLGLAHR 256 (257)
Q Consensus 236 ~~~l~~va~HEiGH~LGL~HS 256 (257)
..+...+++||+||-|||.|.
T Consensus 66 ~~~~g~TltHEvGH~LGL~Ht 86 (154)
T PF05572_consen 66 QYNFGKTLTHEVGHWLGLYHT 86 (154)
T ss_dssp TS-SSHHHHHHHHHHTT---T
T ss_pred ccccccchhhhhhhhhccccc
Confidence 457789999999999999996
No 18
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=97.03 E-value=0.00024 Score=56.22 Aligned_cols=18 Identities=39% Similarity=0.495 Sum_probs=12.6
Q ss_pred hHHHHHHhhhhhcCCCCC
Q 043086 239 MQTVALHELRHVLGLAHR 256 (257)
Q Consensus 239 l~~va~HEiGH~LGL~HS 256 (257)
...+.+|||||.|||.|+
T Consensus 107 ~~~~~~HEiGH~lGl~Hd 124 (124)
T PF13582_consen 107 GVDTFAHEIGHNLGLNHD 124 (124)
T ss_dssp STTHHHHHHHHHTT----
T ss_pred cceEeeehhhHhcCCCCC
Confidence 348999999999999995
No 19
>COG3023 ampD N-acetyl-anhydromuramyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]
Probab=96.85 E-value=0.0016 Score=58.55 Aligned_cols=56 Identities=23% Similarity=0.414 Sum_probs=48.1
Q ss_pred CchhcHHHHHHHHHhcCCCCCCCccccccccCCCcccHHHHHHHHHHHHhcCC-CCCCCCChhhhhcc
Q 043086 53 DTVKGINALKKYLHTLGYLSNHNHNHAAAAADGDYFDENLESAVKTYQRNFNL-NPTGHMDLKTVSMM 119 (257)
Q Consensus 53 ~~~~~v~~~q~yL~~fGYl~~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~gL-~~TG~lD~~T~~~M 119 (257)
.+.++|..+|+.|.+|||.... +.|++.++.+|+.||.-|+= ..+|+.|.+|.+.+
T Consensus 195 ~~~~~v~~lq~~L~~YGY~v~~-----------~~~d~~t~~vv~aFQ~hfrp~~~dg~~d~et~a~l 251 (257)
T COG3023 195 LKGEDVAALQEMLARYGYGVEI-----------GVFDQETQQVVRAFQMHFRPGLYDGEADVETIAIL 251 (257)
T ss_pred hccCCHHHHHHHHHHhCcCCCc-----------chhhHHHHHHHHHHHHHhCCCCCCCCCChHHHHHH
Confidence 3448999999999999997653 56899999999999998874 58999999998765
No 20
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=96.67 E-value=0.00073 Score=59.10 Aligned_cols=19 Identities=37% Similarity=0.499 Sum_probs=16.4
Q ss_pred hhHHHHHHhhhhhcCCCCC
Q 043086 238 DMQTVALHELRHVLGLAHR 256 (257)
Q Consensus 238 ~l~~va~HEiGH~LGL~HS 256 (257)
+-..+.+|||||+|||.|.
T Consensus 136 ~~~~~~aHEiGH~lGl~H~ 154 (206)
T PF13583_consen 136 NGYQTFAHEIGHNLGLRHD 154 (206)
T ss_pred ccchHHHHHHHHHhcCCCC
Confidence 3457899999999999996
No 21
>PF08823 PG_binding_2: Putative peptidoglycan binding domain; InterPro: IPR014927 This entry may be a peptidoglycan binding domain.
Probab=96.61 E-value=0.0057 Score=45.28 Aligned_cols=57 Identities=19% Similarity=0.313 Sum_probs=46.8
Q ss_pred CchhcHHHHHHHHHhcCCCCCCCccccccccCCCcccHHHHHHHHHHHHhcCC----CCCCCCChhhhhc
Q 043086 53 DTVKGINALKKYLHTLGYLSNHNHNHAAAAADGDYFDENLESAVKTYQRNFNL----NPTGHMDLKTVSM 118 (257)
Q Consensus 53 ~~~~~v~~~q~yL~~fGYl~~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~gL----~~TG~lD~~T~~~ 118 (257)
..++-+.++|.-|.+.||+.++. ++.+++++.+|++.|+-.-++ ...|++|...++-
T Consensus 13 l~~~~~~evq~~L~~lGyy~g~~---------~g~~d~a~~~Al~~~~g~ENfE~R~~~~~~ID~~VL~~ 73 (74)
T PF08823_consen 13 LDGDVAREVQEALKRLGYYKGEA---------DGVWDEATEDALRAWAGTENFEERIRGDGKIDPVVLAY 73 (74)
T ss_pred CcHHHHHHHHHHHHHcCCccCCC---------CCcccHHHHHHHHHHHHHhhHHhhcCCCCccCHHHHhh
Confidence 44667789999999999998874 577899999999999998886 4566688877654
No 22
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=96.61 E-value=0.0013 Score=57.77 Aligned_cols=23 Identities=35% Similarity=0.475 Sum_probs=20.3
Q ss_pred CCchhHHHHHHhhhhhcCCC-CCC
Q 043086 235 GAVDMQTVALHELRHVLGLA-HRP 257 (257)
Q Consensus 235 ~~~~l~~va~HEiGH~LGL~-HSs 257 (257)
...+|..+|-||+||+||+. ||.
T Consensus 183 ~~e~L~~tarhElGhaLgi~ghsd 206 (236)
T COG5549 183 LRENLNPTARHELGHALGIWGHSD 206 (236)
T ss_pred chhhhhHHHHHhhcchheeccccc
Confidence 45689999999999999998 884
No 23
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=96.57 E-value=0.0012 Score=57.48 Aligned_cols=66 Identities=21% Similarity=0.282 Sum_probs=39.8
Q ss_pred CCccEEEEeecCCCCCCCCCCCCCCCcceeecCCCCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCC
Q 043086 181 DAADVKISFQRGDHGDGYPFDGPGPYNLLAHSFPPTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHR 256 (257)
Q Consensus 181 ~~aDI~I~f~~~~hgd~~~fdG~g~~~~lahaf~P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HS 256 (257)
...|+.+-|...+.... ++++ ++++.|+.- .|. .......+.. ..+.....|++|||||.||+.|-
T Consensus 92 ~~~D~a~llt~~d~~~~---~~~~--~~~G~A~~g---giC-~~~~s~~i~~-~~~~~~a~~~aHElGH~LG~~HD 157 (207)
T cd04273 92 EHHDHAILLTRQDICRS---NGNC--DTLGLAPVG---GMC-SPSRSCSINE-DTGLSSAFTIAHELGHVLGMPHD 157 (207)
T ss_pred cccceEEEEeeeccccc---CCCC--CceEEeccc---cCC-CCCcceEEEc-CCCceeEEeeeeechhhcCCCCC
Confidence 35788887776544322 4667 778888752 222 1111111211 12455667999999999999995
No 24
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=96.49 E-value=0.00051 Score=61.12 Aligned_cols=62 Identities=23% Similarity=0.259 Sum_probs=38.1
Q ss_pred CCccEEEEeecCCCCCCCCCCCCCCCcceeecCCCCCCC--------CcccCCCCCccC-CCCCCchhHHHHHHhhhhhc
Q 043086 181 DAADVKISFQRGDHGDGYPFDGPGPYNLLAHSFPPTDGR--------FHYDGDENWTVG-AVPGAVDMQTVALHELRHVL 251 (257)
Q Consensus 181 ~~aDI~I~f~~~~hgd~~~fdG~g~~~~lahaf~P~~g~--------ihfd~~e~wt~~-~~~~~~~l~~va~HEiGH~L 251 (257)
+...|.|-+.. |. .+ ++++.|++|.... |.++.. +++ .....+++..|++||+||.|
T Consensus 84 ~~~~LNIy~~~--------~~-~~--~~lG~a~fP~~~~~~~~~~dGvvi~~~---~~~~~~~~~~n~g~t~~HEvGH~l 149 (225)
T cd04275 84 GYKYLNIYVAN--------FL-GG--GLLGYATFPDSLVSLAFITDGVVINPS---SLPGGSAAPYNLGDTATHEVGHWL 149 (225)
T ss_pred CcCeEEEEEEC--------CC-CC--CcCEEEECCCcccCCccccceEEEecc---ccCCCCcccccccceeEEecccee
Confidence 35677787653 22 23 6788899984421 111111 111 11124688899999999999
Q ss_pred CCCCC
Q 043086 252 GLAHR 256 (257)
Q Consensus 252 GL~HS 256 (257)
||-|.
T Consensus 150 GL~Ht 154 (225)
T cd04275 150 GLYHT 154 (225)
T ss_pred eeeee
Confidence 99996
No 25
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=96.29 E-value=0.0018 Score=55.32 Aligned_cols=21 Identities=29% Similarity=0.390 Sum_probs=14.9
Q ss_pred CchhHHHHHHhhhhhcCCCCC
Q 043086 236 AVDMQTVALHELRHVLGLAHR 256 (257)
Q Consensus 236 ~~~l~~va~HEiGH~LGL~HS 256 (257)
......|++|||||.||+.|.
T Consensus 139 ~~~~~~~~AHEiGH~lGa~HD 159 (196)
T PF13688_consen 139 TYNGAITFAHEIGHNLGAPHD 159 (196)
T ss_dssp HHHHHHHHHHHHHHHTT----
T ss_pred CCceehhhHHhHHHhcCCCCC
Confidence 456778999999999999994
No 26
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=96.19 E-value=0.011 Score=49.50 Aligned_cols=62 Identities=27% Similarity=0.359 Sum_probs=52.5
Q ss_pred CCCchhcHHHHHHHHHhcCCCCCCCccccccccCCCcccHHHHHHHHHHHHhcCCCCCCCCChhhhhccC
Q 043086 51 KGDTVKGINALKKYLHTLGYLSNHNHNHAAAAADGDYFDENLESAVKTYQRNFNLNPTGHMDLKTVSMMG 120 (257)
Q Consensus 51 ~g~~~~~v~~~q~yL~~fGYl~~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~gL~~TG~lD~~T~~~M~ 120 (257)
.+..+.++..++.-+...+|..... .+++|+..++.||+.||+.++|+++|++...|+..+.
T Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~--------~dg~fg~~t~~~v~~~q~~~~l~~dgi~g~~t~~~l~ 184 (185)
T COG3409 123 LGLGGGDVATLQQPLPLLGYRSGIR--------VDGIFGPQTEAAVKAFQRQYGLTVDGIVGPQTWAALR 184 (185)
T ss_pred ccccccchHHHHHHHHhcccccCCC--------CCCcccHHHHHHHHHHHHHhCCCCCeeecHHHHHHhh
Confidence 3556677778888888888887652 3689999999999999999999999999999998764
No 27
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=96.18 E-value=0.0019 Score=55.34 Aligned_cols=17 Identities=41% Similarity=0.649 Sum_probs=15.4
Q ss_pred HHHHHHhhhhhcCCCCC
Q 043086 240 QTVALHELRHVLGLAHR 256 (257)
Q Consensus 240 ~~va~HEiGH~LGL~HS 256 (257)
..+++|||||+|||.|-
T Consensus 75 ~g~v~HE~~HalG~~HE 91 (180)
T cd04280 75 LGTIVHELMHALGFYHE 91 (180)
T ss_pred CchhHHHHHHHhcCcch
Confidence 57999999999999984
No 28
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=96.18 E-value=0.002 Score=52.36 Aligned_cols=41 Identities=27% Similarity=0.299 Sum_probs=24.3
Q ss_pred cceeecCCCCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCC
Q 043086 207 NLLAHSFPPTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLA 254 (257)
Q Consensus 207 ~~lahaf~P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~ 254 (257)
+-.|.......|.|.+|.. + ..+++-..|++|||||+|||.
T Consensus 52 ~sya~~~g~G~G~I~l~~~--~-----~qgy~~~RIaaHE~GHiLGLP 92 (132)
T PF02031_consen 52 GSYASTDGLGSGYIFLDYQ--Q-----NQGYNSTRIAAHELGHILGLP 92 (132)
T ss_dssp --EEEE-SSS-EEEEEEHH--H-----HHHS-HHHHHHHHHHHHHT--
T ss_pred CcccccCCCCcEEEEechH--H-----hhCCccceeeeehhccccCCC
Confidence 3333333335678888842 2 146788889999999999995
No 29
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=96.15 E-value=0.0021 Score=54.94 Aligned_cols=19 Identities=47% Similarity=0.737 Sum_probs=16.0
Q ss_pred hhHHHHHHhhhhhcCCCCC
Q 043086 238 DMQTVALHELRHVLGLAHR 256 (257)
Q Consensus 238 ~l~~va~HEiGH~LGL~HS 256 (257)
..+.=++||+||+|||.|-
T Consensus 123 Rv~KEv~HElGH~~GL~HC 141 (181)
T COG1913 123 RVVKEVLHELGHLLGLSHC 141 (181)
T ss_pred HHHHHHHHHhhhhcCcccC
Confidence 4555689999999999995
No 30
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=96.09 E-value=0.003 Score=54.33 Aligned_cols=19 Identities=42% Similarity=0.646 Sum_probs=16.1
Q ss_pred hhHHHHHHhhhhhcCCCCC
Q 043086 238 DMQTVALHELRHVLGLAHR 256 (257)
Q Consensus 238 ~l~~va~HEiGH~LGL~HS 256 (257)
.+..+++||+||.|||.|-
T Consensus 124 R~~k~~~HElGH~lGL~HC 142 (179)
T PRK13267 124 RVRKEVTHELGHTLGLEHC 142 (179)
T ss_pred HHHHHHHHHHHHHcCCccC
Confidence 4555699999999999994
No 31
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=95.98 E-value=0.0034 Score=54.72 Aligned_cols=19 Identities=42% Similarity=0.663 Sum_probs=13.6
Q ss_pred hhHHHHHHhhhhhcCCCCC
Q 043086 238 DMQTVALHELRHVLGLAHR 256 (257)
Q Consensus 238 ~l~~va~HEiGH~LGL~HS 256 (257)
.+..+++||+||++||.|-
T Consensus 144 R~~Kea~HElGH~~GL~HC 162 (194)
T PF07998_consen 144 RVCKEAVHELGHLFGLDHC 162 (194)
T ss_dssp HHHHHHHHHHHHHTT----
T ss_pred HHHHHHHHHHHHHcCCcCC
Confidence 5667999999999999993
No 32
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=95.96 E-value=0.003 Score=53.68 Aligned_cols=18 Identities=39% Similarity=0.438 Sum_probs=13.5
Q ss_pred hHHHHHHhhhhhcCCCCC
Q 043086 239 MQTVALHELRHVLGLAHR 256 (257)
Q Consensus 239 l~~va~HEiGH~LGL~HS 256 (257)
...|++|||||.||+.|.
T Consensus 111 ~~~~~aHElGH~lGa~Hd 128 (173)
T PF13574_consen 111 GIDTFAHELGHQLGAPHD 128 (173)
T ss_dssp HHHHHHHHHHHHHT---S
T ss_pred eeeeehhhhHhhcCCCCC
Confidence 667899999999999996
No 33
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=95.94 E-value=0.003 Score=54.54 Aligned_cols=17 Identities=47% Similarity=0.645 Sum_probs=13.4
Q ss_pred hHHHHHHhhhhhcCCCC
Q 043086 239 MQTVALHELRHVLGLAH 255 (257)
Q Consensus 239 l~~va~HEiGH~LGL~H 255 (257)
...|++|||||||||.|
T Consensus 79 ~~~~i~HEl~HaLG~~H 95 (191)
T PF01400_consen 79 SVGTILHELGHALGFWH 95 (191)
T ss_dssp SHHHHHHHHHHHHTB--
T ss_pred CccchHHHHHHHHhhhh
Confidence 35699999999999998
No 34
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=95.86 E-value=0.0034 Score=53.59 Aligned_cols=20 Identities=35% Similarity=0.423 Sum_probs=17.4
Q ss_pred chhHHHHHHhhhhhcCCCCC
Q 043086 237 VDMQTVALHELRHVLGLAHR 256 (257)
Q Consensus 237 ~~l~~va~HEiGH~LGL~HS 256 (257)
.....+++||+||.|||.|-
T Consensus 131 ~~~~~~~aHElGH~lG~~HD 150 (192)
T cd04267 131 LLTALTMAHELGHNLGAEHD 150 (192)
T ss_pred eeehhhhhhhHHhhcCCcCC
Confidence 45667999999999999995
No 35
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=95.85 E-value=0.0036 Score=54.02 Aligned_cols=17 Identities=41% Similarity=0.694 Sum_probs=15.1
Q ss_pred HHHHHHhhhhhcCCCCC
Q 043086 240 QTVALHELRHVLGLAHR 256 (257)
Q Consensus 240 ~~va~HEiGH~LGL~HS 256 (257)
..+++|||||||||-|-
T Consensus 78 ~G~i~HEl~HaLG~~HE 94 (182)
T cd04283 78 KGIIQHELLHALGFYHE 94 (182)
T ss_pred cchHHHHHHHHhCCccc
Confidence 46999999999999983
No 36
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=95.78 E-value=0.005 Score=52.67 Aligned_cols=19 Identities=37% Similarity=0.581 Sum_probs=16.7
Q ss_pred chhHHHHHHhhhhhcCCCC
Q 043086 237 VDMQTVALHELRHVLGLAH 255 (257)
Q Consensus 237 ~~l~~va~HEiGH~LGL~H 255 (257)
..+..+++|||||.||+.|
T Consensus 129 ~~~a~~~AHElGH~lG~~H 147 (194)
T cd04269 129 LLFAVTMAHELGHNLGMEH 147 (194)
T ss_pred HHHHHHHHHHHHhhcCCCc
Confidence 3556799999999999999
No 37
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=95.36 E-value=0.0067 Score=53.08 Aligned_cols=17 Identities=41% Similarity=0.735 Sum_probs=14.9
Q ss_pred HHHHHHhhhhhcCCCCC
Q 043086 240 QTVALHELRHVLGLAHR 256 (257)
Q Consensus 240 ~~va~HEiGH~LGL~HS 256 (257)
..+++|||||+||+-|-
T Consensus 88 ~Gti~HEl~HaLGf~HE 104 (200)
T cd04281 88 FGIVVHELGHVIGFWHE 104 (200)
T ss_pred CchHHHHHHHHhcCcch
Confidence 35999999999999883
No 38
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=95.01 E-value=0.0083 Score=53.37 Aligned_cols=15 Identities=40% Similarity=0.685 Sum_probs=14.4
Q ss_pred HHHHHhhhhhcCCCC
Q 043086 241 TVALHELRHVLGLAH 255 (257)
Q Consensus 241 ~va~HEiGH~LGL~H 255 (257)
.|.+||+||.||+.|
T Consensus 147 ~t~AHElGHnLGm~H 161 (228)
T cd04271 147 QVFAHEIGHTFGAVH 161 (228)
T ss_pred eehhhhhhhhcCCCC
Confidence 699999999999999
No 39
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=94.98 E-value=0.01 Score=51.00 Aligned_cols=20 Identities=35% Similarity=0.690 Sum_probs=14.6
Q ss_pred CchhHHHHHHhhhhhcCCCC
Q 043086 236 AVDMQTVALHELRHVLGLAH 255 (257)
Q Consensus 236 ~~~l~~va~HEiGH~LGL~H 255 (257)
...+-.+++|||||.||+.|
T Consensus 128 ~~~~a~~~AHelGH~lGm~H 147 (199)
T PF01421_consen 128 GLSFAVIIAHELGHNLGMPH 147 (199)
T ss_dssp HHHHHHHHHHHHHHHTT---
T ss_pred hHHHHHHHHHHHHHhcCCCC
Confidence 45667799999999999998
No 40
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=94.73 E-value=0.017 Score=51.84 Aligned_cols=18 Identities=39% Similarity=0.554 Sum_probs=15.7
Q ss_pred hHHHHHHhhhhhcCCCCC
Q 043086 239 MQTVALHELRHVLGLAHR 256 (257)
Q Consensus 239 l~~va~HEiGH~LGL~HS 256 (257)
...|.+||+||.||+.|=
T Consensus 167 ~a~t~AHElGHnlGm~HD 184 (244)
T cd04270 167 SDLVTAHELGHNFGSPHD 184 (244)
T ss_pred HHHHHHHHHHHhcCCCCC
Confidence 447999999999999993
No 41
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=94.59 E-value=0.017 Score=50.63 Aligned_cols=18 Identities=39% Similarity=0.460 Sum_probs=16.4
Q ss_pred hhHHHHHHhhhhhcCCCC
Q 043086 238 DMQTVALHELRHVLGLAH 255 (257)
Q Consensus 238 ~l~~va~HEiGH~LGL~H 255 (257)
....+++||+||.||+.|
T Consensus 144 ~~~~~~AHElGH~lG~~H 161 (220)
T cd04272 144 YGVYTMTHELAHLLGAPH 161 (220)
T ss_pred ccHHHHHHHHHHHhCCCC
Confidence 457899999999999999
No 42
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=94.48 E-value=0.016 Score=51.79 Aligned_cols=17 Identities=35% Similarity=0.622 Sum_probs=14.9
Q ss_pred HHHHHHhhhhhcCCCCC
Q 043086 240 QTVALHELRHVLGLAHR 256 (257)
Q Consensus 240 ~~va~HEiGH~LGL~HS 256 (257)
..+++|||||+||+-|-
T Consensus 121 ~Gti~HEl~HalGf~HE 137 (230)
T cd04282 121 KATVEHEFLHALGFYHE 137 (230)
T ss_pred CchHHHHHHHHhCCccc
Confidence 46899999999999883
No 43
>PF11350 DUF3152: Protein of unknown function (DUF3152); InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=93.50 E-value=0.084 Score=46.22 Aligned_cols=73 Identities=30% Similarity=0.356 Sum_probs=45.0
Q ss_pred CCCCccEEEEeecCCCCCCC--CCCCCCCCcceeecCCCCCCCCcccCCCCCccCCCC----CCchhHHHHHHhhhhhcC
Q 043086 179 DYDAADVKISFQRGDHGDGY--PFDGPGPYNLLAHSFPPTDGRFHYDGDENWTVGAVP----GAVDMQTVALHELRHVLG 252 (257)
Q Consensus 179 ~~~~aDI~I~f~~~~hgd~~--~fdG~g~~~~lahaf~P~~g~ihfd~~e~wt~~~~~----~~~~l~~va~HEiGH~LG 252 (257)
..+.+|++|...+..--|.. +.+-. +.+.. +.|..++|.+|.+ .|..+... -+.+...|+=||.||.||
T Consensus 78 ~~~~~Df~I~Lasp~T~~~lC~g~~~~---~e~SC-~~~~g~~VvIN~~-RWv~Ga~~f~gdl~~YRqYvINHEVGH~LG 152 (203)
T PF11350_consen 78 DSGAPDFRISLASPGTTDRLCAGLDTS---GETSC-RNPAGGRVVINLA-RWVRGAPAFGGDLASYRQYVINHEVGHALG 152 (203)
T ss_pred CCCCCCEEEEECCcchhhhhccCcCcC---ceeEe-ecCCCCeEEEehH-HhhccCcccCCcHHHHHHHhhhhhhhhhcc
Confidence 34458999999876654321 23322 23322 2344567777753 67765422 233456699999999999
Q ss_pred CCCC
Q 043086 253 LAHR 256 (257)
Q Consensus 253 L~HS 256 (257)
-.|-
T Consensus 153 h~H~ 156 (203)
T PF11350_consen 153 HGHE 156 (203)
T ss_pred cCCC
Confidence 9885
No 44
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=93.29 E-value=0.042 Score=51.24 Aligned_cols=20 Identities=40% Similarity=0.484 Sum_probs=17.1
Q ss_pred hhHHHHHHhhhhhcCCCCCC
Q 043086 238 DMQTVALHELRHVLGLAHRP 257 (257)
Q Consensus 238 ~l~~va~HEiGH~LGL~HSs 257 (257)
.-..|.+||+||-|||.||.
T Consensus 149 ~~~~~~~HElgHN~GL~Ha~ 168 (314)
T PF05548_consen 149 QDWATIMHELGHNLGLWHAG 168 (314)
T ss_pred ccHHHHHHHhhhhccccccC
Confidence 34569999999999999983
No 45
>PF10462 Peptidase_M66: Peptidase M66; InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=92.76 E-value=0.061 Score=50.01 Aligned_cols=20 Identities=40% Similarity=0.580 Sum_probs=15.1
Q ss_pred hhHHHHHHhhhhhcCCCCCC
Q 043086 238 DMQTVALHELRHVLGLAHRP 257 (257)
Q Consensus 238 ~l~~va~HEiGH~LGL~HSs 257 (257)
....+.+||+||++||.|.+
T Consensus 192 s~~~~f~HE~GH~~GL~H~p 211 (305)
T PF10462_consen 192 SYGNEFSHELGHNFGLGHYP 211 (305)
T ss_dssp -SHHHHHHHHHHTTT--SST
T ss_pred CccceeehhhhhhcCCCCCC
Confidence 56789999999999999953
No 46
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=92.26 E-value=0.048 Score=52.66 Aligned_cols=17 Identities=41% Similarity=0.602 Sum_probs=15.4
Q ss_pred HHHHHHhhhhhcCCCCC
Q 043086 240 QTVALHELRHVLGLAHR 256 (257)
Q Consensus 240 ~~va~HEiGH~LGL~HS 256 (257)
+-|++||++||||+-|-
T Consensus 160 ~G~i~HEl~HaLGf~He 176 (411)
T KOG3714|consen 160 FGTIVHELMHALGFWHE 176 (411)
T ss_pred CchhHHHHHHHhhhhhc
Confidence 77999999999999873
No 47
>PF09374 PG_binding_3: Predicted Peptidoglycan domain; InterPro: IPR018537 This family contains a potential peptidoglycan binding domain. ; PDB: 2IKB_B 2IS5_A 2NR7_A.
Probab=91.16 E-value=0.15 Score=37.38 Aligned_cols=26 Identities=27% Similarity=0.397 Sum_probs=20.6
Q ss_pred HHHHHHHHhcC----CCCCCCCChhhhhcc
Q 043086 94 SAVKTYQRNFN----LNPTGHMDLKTVSMM 119 (257)
Q Consensus 94 ~AIk~FQ~~~g----L~~TG~lD~~T~~~M 119 (257)
.|||.+|+..| +.++|++++.|+.++
T Consensus 1 rAik~LQ~alg~~~~v~~DG~iGp~Tl~Al 30 (72)
T PF09374_consen 1 RAIKFLQRALGLGADVAVDGIIGPKTLAAL 30 (72)
T ss_dssp HHHHHHHHHHT----S--SS--SHHHHHHH
T ss_pred CHHHHHHHHHccCCccCCCCCcCHHHHHHH
Confidence 48999999999 999999999999998
No 48
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=90.40 E-value=0.075 Score=48.73 Aligned_cols=16 Identities=50% Similarity=0.621 Sum_probs=14.5
Q ss_pred HHHHHHhhhhhcCCCC
Q 043086 240 QTVALHELRHVLGLAH 255 (257)
Q Consensus 240 ~~va~HEiGH~LGL~H 255 (257)
..|.+||+||.|||.+
T Consensus 166 igv~~HE~gH~lGLPD 181 (286)
T TIGR03296 166 VGVIAHELGHDLGLPD 181 (286)
T ss_pred eeeeehhhhcccCCCc
Confidence 6799999999999975
No 49
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=90.23 E-value=0.21 Score=44.46 Aligned_cols=16 Identities=31% Similarity=0.547 Sum_probs=12.5
Q ss_pred CchhHHHHHHhhhhhc
Q 043086 236 AVDMQTVALHELRHVL 251 (257)
Q Consensus 236 ~~~l~~va~HEiGH~L 251 (257)
...=..||+||.||++
T Consensus 86 SiaAvaVAAHEvGHAi 101 (222)
T PF04298_consen 86 SIAAVAVAAHEVGHAI 101 (222)
T ss_pred CHHHHHHHHHHHhHHH
Confidence 3445569999999996
No 50
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=88.76 E-value=0.14 Score=41.41 Aligned_cols=19 Identities=32% Similarity=0.595 Sum_probs=16.0
Q ss_pred CchhHHHHHHhhhhhcCCC
Q 043086 236 AVDMQTVALHELRHVLGLA 254 (257)
Q Consensus 236 ~~~l~~va~HEiGH~LGL~ 254 (257)
+--...|++|||||..||+
T Consensus 106 gd~vthvliHEIgHhFGLs 124 (136)
T COG3824 106 GDQVTHVLIHEIGHHFGLS 124 (136)
T ss_pred hhHhhhhhhhhhhhhcCCC
Confidence 4456779999999999996
No 51
>PF12044 Metallopep: Putative peptidase family; InterPro: IPR021917 This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length.
Probab=86.98 E-value=0.34 Score=47.03 Aligned_cols=17 Identities=35% Similarity=0.645 Sum_probs=14.8
Q ss_pred HHHHHHhhhhhcCCCCC
Q 043086 240 QTVALHELRHVLGLAHR 256 (257)
Q Consensus 240 ~~va~HEiGH~LGL~HS 256 (257)
.-..+|||||++|+.|.
T Consensus 317 iGA~lHEiGH~fg~pH~ 333 (423)
T PF12044_consen 317 IGAFLHEIGHLFGCPHQ 333 (423)
T ss_pred hHHHHHHHHHhcCCCCC
Confidence 34789999999999996
No 52
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=86.05 E-value=0.92 Score=46.03 Aligned_cols=36 Identities=19% Similarity=0.274 Sum_probs=22.8
Q ss_pred CCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCC
Q 043086 215 PTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLA 254 (257)
Q Consensus 215 P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~ 254 (257)
|..|.|.|+.... .. ........|++|||.|+||..
T Consensus 236 Pi~G~iNinp~~i---~s-~~~~~~~rv~~HEi~HALGFS 271 (622)
T PTZ00257 236 PAVGVMNIPAANI---VS-RYDQGTTRTVTHEVAHALGFS 271 (622)
T ss_pred ceEEEEeeCHHHC---CC-ccchHHHHHHHHHHHHHhcCC
Confidence 5556666664321 00 011257899999999999985
No 53
>PTZ00337 surface protease GP63; Provisional
Probab=85.12 E-value=0.56 Score=47.20 Aligned_cols=35 Identities=29% Similarity=0.389 Sum_probs=23.8
Q ss_pred CCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCC
Q 043086 215 PTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAH 255 (257)
Q Consensus 215 P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~H 255 (257)
|..|.|.|+.... . .....+.|++|||.|+||+..
T Consensus 211 Pi~G~in~np~~i---~---~~~~~~~v~~HEi~HALGFs~ 245 (567)
T PTZ00337 211 PFAAAVNFDPRQI---A---VTNGDVRVAAHELGHALGFVR 245 (567)
T ss_pred ceEEEEEECHHHc---c---chhHHHHHHHHHHHHHHccCH
Confidence 4456677775321 1 123578899999999999853
No 54
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=85.11 E-value=0.49 Score=43.18 Aligned_cols=16 Identities=31% Similarity=0.598 Sum_probs=13.7
Q ss_pred hhHHHHHHhhhhhcCC
Q 043086 238 DMQTVALHELRHVLGL 253 (257)
Q Consensus 238 ~l~~va~HEiGH~LGL 253 (257)
.+..|++||+||++|-
T Consensus 215 ~~~~v~vHE~GHsf~~ 230 (264)
T PF09471_consen 215 SFKQVVVHEFGHSFGG 230 (264)
T ss_dssp THHHHHHHHHHHHTT-
T ss_pred cccceeeeeccccccc
Confidence 6889999999998773
No 55
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=84.10 E-value=0.62 Score=36.23 Aligned_cols=17 Identities=24% Similarity=0.567 Sum_probs=13.0
Q ss_pred hhHHHHHHhhhhhcCCC
Q 043086 238 DMQTVALHELRHVLGLA 254 (257)
Q Consensus 238 ~l~~va~HEiGH~LGL~ 254 (257)
-...|++|||||.+|++
T Consensus 72 ~I~~tlvhEiah~fG~~ 88 (97)
T PF06262_consen 72 LIRDTLVHEIAHHFGIS 88 (97)
T ss_dssp HHHHHHHHHHHHHTT--
T ss_pred HHHHHHHHHHHHHcCCC
Confidence 35569999999999985
No 56
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=83.24 E-value=0.62 Score=48.14 Aligned_cols=20 Identities=40% Similarity=0.644 Sum_probs=17.3
Q ss_pred chhHHHHHHhhhhhcCCCCC
Q 043086 237 VDMQTVALHELRHVLGLAHR 256 (257)
Q Consensus 237 ~~l~~va~HEiGH~LGL~HS 256 (257)
..+..|.+||+||.||+.|-
T Consensus 321 ~~~a~v~AhelgH~lGm~hD 340 (716)
T KOG3607|consen 321 LAFAVVLAHELGHNLGMIHD 340 (716)
T ss_pred hhHHHHHHHHHHhhcCcccc
Confidence 35677999999999999983
No 57
>PF05547 Peptidase_M6: Immune inhibitor A peptidase M6; InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=81.46 E-value=0.28 Score=50.05 Aligned_cols=15 Identities=47% Similarity=0.567 Sum_probs=13.5
Q ss_pred HHHHHHhhhhhcCCC
Q 043086 240 QTVALHELRHVLGLA 254 (257)
Q Consensus 240 ~~va~HEiGH~LGL~ 254 (257)
.-|.+||+||.|||.
T Consensus 222 iGVfaHEfGH~LGLP 236 (645)
T PF05547_consen 222 IGVFAHEFGHDLGLP 236 (645)
T ss_pred eEEEEeeccccCCCC
Confidence 569999999999995
No 58
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=80.65 E-value=0.91 Score=45.14 Aligned_cols=36 Identities=19% Similarity=0.391 Sum_probs=20.2
Q ss_pred CCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCC
Q 043086 215 PTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLA 254 (257)
Q Consensus 215 P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~ 254 (257)
|..|.|.|+... +... ....+..|++|||.|+||+.
T Consensus 190 P~~G~in~~p~~---i~~~-~~~~~~~~~~HEi~HaLGFs 225 (521)
T PF01457_consen 190 PIAGVININPSY---IPSF-YFQEFFRTVIHEIAHALGFS 225 (521)
T ss_dssp EEEEEEE--GGG------S---HHHHHHHHHHHHHHTT-S
T ss_pred CeeEEEEEchhH---ccch-hhhcccceeeeeeeeeeeec
Confidence 344667776531 1110 23467889999999999985
No 59
>KOG4525 consensus Jacalin-like lectin domain-containing protein [General function prediction only]
Probab=78.20 E-value=1.3 Score=43.15 Aligned_cols=18 Identities=39% Similarity=0.591 Sum_probs=15.3
Q ss_pred HHHHHHhhhhhcCCCCCC
Q 043086 240 QTVALHELRHVLGLAHRP 257 (257)
Q Consensus 240 ~~va~HEiGH~LGL~HSs 257 (257)
.-..+|||||.||+.|-+
T Consensus 302 lGA~~HElGH~lgcpHq~ 319 (614)
T KOG4525|consen 302 LGAVCHELGHCLGCPHQS 319 (614)
T ss_pred HHHHHHHhhhccCCCCCC
Confidence 447899999999999964
No 60
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=75.57 E-value=1.4 Score=43.21 Aligned_cols=17 Identities=29% Similarity=0.456 Sum_probs=14.8
Q ss_pred CchhHHHHHHhhhhhcC
Q 043086 236 AVDMQTVALHELRHVLG 252 (257)
Q Consensus 236 ~~~l~~va~HEiGH~LG 252 (257)
...|-.|++|||||+.|
T Consensus 127 esElagViAHEigHv~q 143 (484)
T COG4783 127 ESELAGVIAHEIGHVAQ 143 (484)
T ss_pred HHHHHHHHHHHHHHHhh
Confidence 34899999999999876
No 61
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=73.51 E-value=2.3 Score=32.94 Aligned_cols=9 Identities=44% Similarity=0.574 Sum_probs=3.4
Q ss_pred HHHHHHHHH
Q 043086 9 HAFISFFLL 17 (257)
Q Consensus 9 ~~~~~~~~~ 17 (257)
|+||++.|+
T Consensus 4 K~~llL~l~ 12 (95)
T PF07172_consen 4 KAFLLLGLL 12 (95)
T ss_pred hHHHHHHHH
Confidence 334333333
No 62
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=72.56 E-value=1.7 Score=32.94 Aligned_cols=27 Identities=19% Similarity=0.261 Sum_probs=18.2
Q ss_pred CCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhc
Q 043086 217 DGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVL 251 (257)
Q Consensus 217 ~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~L 251 (257)
...|..+.+ ........+++|||||.+
T Consensus 28 ~~~I~in~~--------~~~~~~~f~laHELgH~~ 54 (122)
T PF06114_consen 28 NPIIFINSN--------LSPERQRFTLAHELGHIL 54 (122)
T ss_dssp TTEEEEESS--------S-HHHHHHHHHHHHHHHH
T ss_pred CCEEEECCC--------CCHHHHHHHHHHHHHHHH
Confidence 445666653 133466779999999975
No 63
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=72.48 E-value=1.9 Score=38.34 Aligned_cols=14 Identities=36% Similarity=0.513 Sum_probs=12.8
Q ss_pred hhHHHHHHhhhhhc
Q 043086 238 DMQTVALHELRHVL 251 (257)
Q Consensus 238 ~l~~va~HEiGH~L 251 (257)
++..|+.||+||+|
T Consensus 91 ~~~~~l~HE~GHAl 104 (220)
T PF14247_consen 91 NVLFTLYHELGHAL 104 (220)
T ss_pred HHHHHHHHHHHHHH
Confidence 68889999999986
No 64
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=71.33 E-value=3.3 Score=38.33 Aligned_cols=39 Identities=26% Similarity=0.318 Sum_probs=24.1
Q ss_pred ecCCCCCCCCcccCCCCCc-----cCCCCCCchhHHHHHHhhhhhc
Q 043086 211 HSFPPTDGRFHYDGDENWT-----VGAVPGAVDMQTVALHELRHVL 251 (257)
Q Consensus 211 haf~P~~g~ihfd~~e~wt-----~~~~~~~~~l~~va~HEiGH~L 251 (257)
.-|-|.+..|++|-.. |. ++. ....-...|++||+||++
T Consensus 139 pFYCp~D~tIYlD~~f-~~~L~~~~ga-~G~~a~ayVlAHEyGHHV 182 (292)
T PF04228_consen 139 PFYCPADQTIYLDLSF-FDELQQRFGA-SGDFAQAYVLAHEYGHHV 182 (292)
T ss_pred CEeCCCCCEEEechHH-HHHHHHHhCC-ccHHHHHHHHHHHHHHHH
Confidence 3456778889988641 21 111 123445669999999973
No 65
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=68.42 E-value=2.8 Score=35.86 Aligned_cols=15 Identities=33% Similarity=0.525 Sum_probs=13.4
Q ss_pred chhHHHHHHhhhhhc
Q 043086 237 VDMQTVALHELRHVL 251 (257)
Q Consensus 237 ~~l~~va~HEiGH~L 251 (257)
..+..|++||+||..
T Consensus 87 ~el~aVlaHElgH~~ 101 (226)
T PF01435_consen 87 DELAAVLAHELGHIK 101 (226)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHH
Confidence 479999999999986
No 66
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=67.13 E-value=2.6 Score=36.98 Aligned_cols=16 Identities=31% Similarity=0.538 Sum_probs=13.2
Q ss_pred CchhHHHHHHhhhhhc
Q 043086 236 AVDMQTVALHELRHVL 251 (257)
Q Consensus 236 ~~~l~~va~HEiGH~L 251 (257)
.+.-..||+||.||++
T Consensus 89 Sia~~aVAAHEVGHAi 104 (226)
T COG2738 89 SIAAIAVAAHEVGHAI 104 (226)
T ss_pred cHHHHHHHHHHhhHHH
Confidence 3566779999999997
No 67
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=65.92 E-value=2.7 Score=37.23 Aligned_cols=26 Identities=19% Similarity=0.201 Sum_probs=17.0
Q ss_pred CCCcccCCCCCccCCCCCCchhHHHHHHhhhhhc
Q 043086 218 GRFHYDGDENWTVGAVPGAVDMQTVALHELRHVL 251 (257)
Q Consensus 218 g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~L 251 (257)
..|+++.+- ....-.-|++||+||+|
T Consensus 59 ~~I~iN~n~--------~~~r~rFtlAHELGH~l 84 (213)
T COG2856 59 PVIYINANN--------SLERKRFTLAHELGHAL 84 (213)
T ss_pred ceEEEeCCC--------CHHHHHHHHHHHHhHHH
Confidence 356776541 22344559999999996
No 68
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=64.09 E-value=3.7 Score=36.92 Aligned_cols=15 Identities=33% Similarity=0.567 Sum_probs=13.2
Q ss_pred chhHHHHHHhhhhhc
Q 043086 237 VDMQTVALHELRHVL 251 (257)
Q Consensus 237 ~~l~~va~HEiGH~L 251 (257)
..+..|++||+||..
T Consensus 155 dEl~aVlaHElgHi~ 169 (302)
T COG0501 155 DELEAVLAHELGHIK 169 (302)
T ss_pred HHHHHHHHHHHHHHh
Confidence 379999999999974
No 69
>PRK03982 heat shock protein HtpX; Provisional
Probab=63.65 E-value=3.8 Score=37.44 Aligned_cols=14 Identities=36% Similarity=0.665 Sum_probs=12.6
Q ss_pred hhHHHHHHhhhhhc
Q 043086 238 DMQTVALHELRHVL 251 (257)
Q Consensus 238 ~l~~va~HEiGH~L 251 (257)
.+..|++||+||.-
T Consensus 124 El~AVlAHElgHi~ 137 (288)
T PRK03982 124 ELEGVIAHELTHIK 137 (288)
T ss_pred HHHHHHHHHHHHHH
Confidence 69999999999974
No 70
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=62.76 E-value=5.6 Score=37.18 Aligned_cols=33 Identities=30% Similarity=0.415 Sum_probs=24.0
Q ss_pred CCCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCC
Q 043086 214 PPTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLA 254 (257)
Q Consensus 214 ~P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~ 254 (257)
.|=.|+.+.|.+- -..++..|++||++|-+|..
T Consensus 179 ~PFT~EA~vN~~~--------p~~~~P~T~~HElAHq~G~a 211 (318)
T PF12725_consen 179 NPFTGEANVNTDL--------PPYSLPFTICHELAHQLGFA 211 (318)
T ss_pred cCCcceeecCCCC--------CcccccHHHHHHHHHHhCCC
Confidence 4544566666542 23578999999999999975
No 71
>PRK03001 M48 family peptidase; Provisional
Probab=61.14 E-value=4.5 Score=36.86 Aligned_cols=14 Identities=43% Similarity=0.653 Sum_probs=12.6
Q ss_pred hhHHHHHHhhhhhc
Q 043086 238 DMQTVALHELRHVL 251 (257)
Q Consensus 238 ~l~~va~HEiGH~L 251 (257)
.+..|++||+||.-
T Consensus 123 El~aVlAHElgHi~ 136 (283)
T PRK03001 123 EIRGVMAHELAHVK 136 (283)
T ss_pred HHHHHHHHHHHHHh
Confidence 69999999999974
No 72
>PRK01345 heat shock protein HtpX; Provisional
Probab=60.19 E-value=4.8 Score=37.56 Aligned_cols=14 Identities=43% Similarity=0.591 Sum_probs=12.9
Q ss_pred hhHHHHHHhhhhhc
Q 043086 238 DMQTVALHELRHVL 251 (257)
Q Consensus 238 ~l~~va~HEiGH~L 251 (257)
.+..|++|||||.-
T Consensus 123 EL~aVlAHElgHi~ 136 (317)
T PRK01345 123 EVAGVMAHELAHVK 136 (317)
T ss_pred HHHHHHHHHHHHHH
Confidence 79999999999975
No 73
>PRK03072 heat shock protein HtpX; Provisional
Probab=59.79 E-value=4.9 Score=36.87 Aligned_cols=15 Identities=40% Similarity=0.545 Sum_probs=13.1
Q ss_pred chhHHHHHHhhhhhc
Q 043086 237 VDMQTVALHELRHVL 251 (257)
Q Consensus 237 ~~l~~va~HEiGH~L 251 (257)
..+..|++||+||.-
T Consensus 125 ~El~aVlAHElgHi~ 139 (288)
T PRK03072 125 RELRGVLGHELSHVY 139 (288)
T ss_pred HHHHHHHHHHHHHHh
Confidence 379999999999974
No 74
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=59.56 E-value=11 Score=35.17 Aligned_cols=16 Identities=25% Similarity=0.378 Sum_probs=13.6
Q ss_pred CchhHHHHHHhhhhhc
Q 043086 236 AVDMQTVALHELRHVL 251 (257)
Q Consensus 236 ~~~l~~va~HEiGH~L 251 (257)
..+-..+++||+||++
T Consensus 151 ~~~~v~tl~HE~GHa~ 166 (365)
T cd06258 151 GHDDINTLFHEFGHAV 166 (365)
T ss_pred CHHHHHHHHHHHhHHH
Confidence 5667789999999997
No 75
>PF11337 DUF3139: Protein of unknown function (DUF3139); InterPro: IPR021486 This family of proteins with unknown function appears to be restricted to Firmicutes.
Probab=56.57 E-value=30 Score=25.74 Aligned_cols=22 Identities=27% Similarity=0.427 Sum_probs=15.9
Q ss_pred CchhcHHHHHHHHHhcCCCCCC
Q 043086 53 DTVKGINALKKYLHTLGYLSNH 74 (257)
Q Consensus 53 ~~~~~v~~~q~yL~~fGYl~~~ 74 (257)
++....+.+..||++-||-+..
T Consensus 32 ~k~~~~k~i~~yL~e~gy~e~~ 53 (85)
T PF11337_consen 32 QKHKAEKAIDWYLQEQGYKESD 53 (85)
T ss_pred hHHHHHHHHHHHHHHcCCcHHH
Confidence 3445567788999999987443
No 76
>PRK04897 heat shock protein HtpX; Provisional
Probab=56.02 E-value=6.2 Score=36.36 Aligned_cols=15 Identities=27% Similarity=0.479 Sum_probs=13.0
Q ss_pred chhHHHHHHhhhhhc
Q 043086 237 VDMQTVALHELRHVL 251 (257)
Q Consensus 237 ~~l~~va~HEiGH~L 251 (257)
..+..|++||+||.-
T Consensus 135 ~El~aVlAHElgHi~ 149 (298)
T PRK04897 135 EELEGVIGHEISHIR 149 (298)
T ss_pred HHHHHHHHHHHHHHh
Confidence 379999999999964
No 77
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=55.75 E-value=8.1 Score=32.82 Aligned_cols=38 Identities=21% Similarity=0.378 Sum_probs=26.7
Q ss_pred CCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCCC
Q 043086 215 PTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHRP 257 (257)
Q Consensus 215 P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HSs 257 (257)
+..|.|.|+ |.+-.. -..-+..|++||+-|..-..||.
T Consensus 145 ~~~~~I~ln----~~L~~~-P~~~idYVvvHEL~Hl~~~nHs~ 182 (205)
T PF01863_consen 145 SSKGNITLN----WRLVMA-PPEVIDYVVVHELCHLRHPNHSK 182 (205)
T ss_pred CCCCcEEee----cccccC-CccHHHHHHHHHHHHhccCCCCH
Confidence 445677777 433111 12368889999999999999984
No 78
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=54.91 E-value=6.4 Score=34.93 Aligned_cols=14 Identities=50% Similarity=0.636 Sum_probs=11.7
Q ss_pred hhHHHHHHhhhhhc
Q 043086 238 DMQTVALHELRHVL 251 (257)
Q Consensus 238 ~l~~va~HEiGH~L 251 (257)
=+..|.+||+||++
T Consensus 52 l~~~v~iHElgH~~ 65 (227)
T cd06164 52 LFASVLLHELGHSL 65 (227)
T ss_pred HHHHHHHHHHHHHH
Confidence 46679999999985
No 79
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=54.73 E-value=6.8 Score=29.73 Aligned_cols=16 Identities=31% Similarity=0.277 Sum_probs=13.2
Q ss_pred CchhHHHHHHhhhhhc
Q 043086 236 AVDMQTVALHELRHVL 251 (257)
Q Consensus 236 ~~~l~~va~HEiGH~L 251 (257)
...+..|++||+.|.+
T Consensus 22 ~~~~~~~l~HE~~H~~ 37 (128)
T PF13485_consen 22 EDWLDRVLAHELAHQW 37 (128)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3467899999999974
No 80
>PRK05457 heat shock protein HtpX; Provisional
Probab=53.91 E-value=7.1 Score=35.83 Aligned_cols=14 Identities=29% Similarity=0.619 Sum_probs=12.5
Q ss_pred chhHHHHHHhhhhh
Q 043086 237 VDMQTVALHELRHV 250 (257)
Q Consensus 237 ~~l~~va~HEiGH~ 250 (257)
..+..|++||+||.
T Consensus 132 ~El~aVlAHElgHi 145 (284)
T PRK05457 132 DEVEAVLAHEISHI 145 (284)
T ss_pred HHHHHHHHHHHHHH
Confidence 37999999999996
No 81
>PRK02870 heat shock protein HtpX; Provisional
Probab=53.70 E-value=7.1 Score=36.85 Aligned_cols=14 Identities=43% Similarity=0.712 Sum_probs=12.7
Q ss_pred chhHHHHHHhhhhh
Q 043086 237 VDMQTVALHELRHV 250 (257)
Q Consensus 237 ~~l~~va~HEiGH~ 250 (257)
..+..|++||+||.
T Consensus 171 dEL~aVlAHELgHi 184 (336)
T PRK02870 171 DELQAVMAHELSHI 184 (336)
T ss_pred HHHHHHHHHHHHHH
Confidence 37999999999997
No 82
>PRK02391 heat shock protein HtpX; Provisional
Probab=53.35 E-value=7.3 Score=35.98 Aligned_cols=14 Identities=43% Similarity=0.684 Sum_probs=12.5
Q ss_pred chhHHHHHHhhhhh
Q 043086 237 VDMQTVALHELRHV 250 (257)
Q Consensus 237 ~~l~~va~HEiGH~ 250 (257)
..+..|++||+||.
T Consensus 131 ~El~aVlaHElgHi 144 (296)
T PRK02391 131 DELEAVLAHELSHV 144 (296)
T ss_pred HHHHHHHHHHHHHH
Confidence 37999999999996
No 83
>PRK01265 heat shock protein HtpX; Provisional
Probab=53.09 E-value=7.4 Score=36.57 Aligned_cols=14 Identities=43% Similarity=0.624 Sum_probs=12.5
Q ss_pred chhHHHHHHhhhhh
Q 043086 237 VDMQTVALHELRHV 250 (257)
Q Consensus 237 ~~l~~va~HEiGH~ 250 (257)
..+..|++||+||.
T Consensus 138 ~El~aVlAHElgHi 151 (324)
T PRK01265 138 DEIKAVAGHELGHL 151 (324)
T ss_pred HHHHHHHHHHHHHH
Confidence 37999999999995
No 84
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=52.33 E-value=6.9 Score=37.79 Aligned_cols=16 Identities=25% Similarity=0.407 Sum_probs=12.7
Q ss_pred CchhHHHHHHhhhhhc
Q 043086 236 AVDMQTVALHELRHVL 251 (257)
Q Consensus 236 ~~~l~~va~HEiGH~L 251 (257)
..+=..+++||+||++
T Consensus 239 ~~~~v~tLfHE~GHa~ 254 (458)
T PF01432_consen 239 SHDDVETLFHEFGHAM 254 (458)
T ss_dssp SHHHHHHHHHHHHHHH
T ss_pred ChhhHHHHHHHHhHHH
Confidence 3455679999999986
No 85
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=50.68 E-value=7.4 Score=36.80 Aligned_cols=16 Identities=31% Similarity=0.391 Sum_probs=13.0
Q ss_pred CchhHHHHHHhhhhhc
Q 043086 236 AVDMQTVALHELRHVL 251 (257)
Q Consensus 236 ~~~l~~va~HEiGH~L 251 (257)
..+-..+++||+||++
T Consensus 219 ~~~~v~tl~HE~GHa~ 234 (427)
T cd06459 219 TLDDVFTLAHELGHAF 234 (427)
T ss_pred ChhhHHHHHHHhhHHH
Confidence 3566789999999985
No 86
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=50.57 E-value=5.6 Score=40.76 Aligned_cols=16 Identities=31% Similarity=0.501 Sum_probs=14.3
Q ss_pred HHHHHhhhhhcCCCCC
Q 043086 241 TVALHELRHVLGLAHR 256 (257)
Q Consensus 241 ~va~HEiGH~LGL~HS 256 (257)
.|++|||||.-|-.|-
T Consensus 394 lt~AHEiGHNfGSpHD 409 (764)
T KOG3658|consen 394 LTLAHEIGHNFGSPHD 409 (764)
T ss_pred eeehhhhccccCCCCC
Confidence 4899999999999984
No 87
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=49.99 E-value=11 Score=37.76 Aligned_cols=17 Identities=29% Similarity=0.474 Sum_probs=14.0
Q ss_pred CCchhHHHHHHhhhhhc
Q 043086 235 GAVDMQTVALHELRHVL 251 (257)
Q Consensus 235 ~~~~l~~va~HEiGH~L 251 (257)
+..+-..|++||+||++
T Consensus 374 ~~~~dv~TLaHElGHa~ 390 (591)
T TIGR00181 374 GTLNSVFTLAHELGHSM 390 (591)
T ss_pred CCcchHHHHHHHhhhHH
Confidence 34577889999999985
No 88
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=49.29 E-value=8.8 Score=35.34 Aligned_cols=13 Identities=23% Similarity=0.434 Sum_probs=11.1
Q ss_pred hHHHHHHhhhhhc
Q 043086 239 MQTVALHELRHVL 251 (257)
Q Consensus 239 l~~va~HEiGH~L 251 (257)
+..|++||+||++
T Consensus 135 ~isvvvHElgHal 147 (277)
T cd06162 135 LISGVVHEMGHGV 147 (277)
T ss_pred HHHHHHHHHHHHH
Confidence 5579999999985
No 89
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=48.96 E-value=8.3 Score=37.73 Aligned_cols=15 Identities=33% Similarity=0.481 Sum_probs=12.5
Q ss_pred chhHHHHHHhhhhhc
Q 043086 237 VDMQTVALHELRHVL 251 (257)
Q Consensus 237 ~~l~~va~HEiGH~L 251 (257)
.+=..+++||+||+|
T Consensus 261 ~~~V~TLfHEfGHal 275 (472)
T cd06455 261 HDEVETFFHEFGHVI 275 (472)
T ss_pred HHHHHHHHHHHHHHH
Confidence 456678999999997
No 90
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=48.19 E-value=17 Score=36.31 Aligned_cols=17 Identities=29% Similarity=0.350 Sum_probs=14.2
Q ss_pred CCchhHHHHHHhhhhhc
Q 043086 235 GAVDMQTVALHELRHVL 251 (257)
Q Consensus 235 ~~~~l~~va~HEiGH~L 251 (257)
+..+-..|++||+||++
T Consensus 333 ~t~~dv~TL~HElGHa~ 349 (549)
T TIGR02289 333 GTSGDIDVLTHEAGHAF 349 (549)
T ss_pred CChhHHHHHHHHhhHHH
Confidence 44677889999999986
No 91
>KOG3538 consensus Disintegrin metalloproteinases with thrombospondin repeats [Posttranslational modification, protein turnover, chaperones]
Probab=48.08 E-value=4.2 Score=42.84 Aligned_cols=22 Identities=27% Similarity=0.428 Sum_probs=18.4
Q ss_pred CCchhHHHHHHhhhhhcCCCCC
Q 043086 235 GAVDMQTVALHELRHVLGLAHR 256 (257)
Q Consensus 235 ~~~~l~~va~HEiGH~LGL~HS 256 (257)
.+.....+++||+||.+|+.|-
T Consensus 313 ~gl~~aftiahE~GH~~gm~hd 334 (845)
T KOG3538|consen 313 TGLPSAFTIAHELGHNFGMEHD 334 (845)
T ss_pred CCCccceeeeeccccccCcccC
Confidence 4555778999999999999993
No 92
>PLN02791 Nudix hydrolase homolog
Probab=47.81 E-value=9.8 Score=39.87 Aligned_cols=19 Identities=21% Similarity=0.144 Sum_probs=15.8
Q ss_pred chhHHHHHHhhhhhcCCCC
Q 043086 237 VDMQTVALHELRHVLGLAH 255 (257)
Q Consensus 237 ~~l~~va~HEiGH~LGL~H 255 (257)
-.+.+|++||+||-+|=.+
T Consensus 583 af~~~v~lHElgHGsG~~~ 601 (770)
T PLN02791 583 SFFTHTICHECCHGIGPHT 601 (770)
T ss_pred cHHHHHHHHHhhccccccc
Confidence 3678999999999999643
No 93
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=47.30 E-value=9.9 Score=34.65 Aligned_cols=13 Identities=38% Similarity=0.595 Sum_probs=10.7
Q ss_pred hHHHHHHhhhhhc
Q 043086 239 MQTVALHELRHVL 251 (257)
Q Consensus 239 l~~va~HEiGH~L 251 (257)
+..|.+||+||++
T Consensus 118 ~isv~iHElgHa~ 130 (263)
T cd06159 118 VVGVVVHELSHGI 130 (263)
T ss_pred HHHHHHHHHHHHH
Confidence 4568999999985
No 94
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=47.27 E-value=11 Score=27.93 Aligned_cols=30 Identities=20% Similarity=0.339 Sum_probs=20.2
Q ss_pred CCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhc
Q 043086 217 DGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVL 251 (257)
Q Consensus 217 ~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~L 251 (257)
+.+|+|...+.= .+...-..+++||+.|++
T Consensus 44 G~~I~f~~g~~~-----~~s~~~~~llaHEl~Hv~ 73 (79)
T PF13699_consen 44 GNDIYFAPGKYN-----PDSPEGRALLAHELAHVV 73 (79)
T ss_pred CCEEEEcCCCcC-----CCCCCcchhHhHHHHHHH
Confidence 568999654321 123456789999999985
No 95
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=46.70 E-value=9.5 Score=37.24 Aligned_cols=16 Identities=19% Similarity=0.360 Sum_probs=12.8
Q ss_pred CchhHHHHHHhhhhhc
Q 043086 236 AVDMQTVALHELRHVL 251 (257)
Q Consensus 236 ~~~l~~va~HEiGH~L 251 (257)
..+=..++.||+||+|
T Consensus 245 ~~~~v~TLfHEfGHal 260 (458)
T cd06457 245 SPHEVETLFHEMGHAM 260 (458)
T ss_pred CHHHHHHHHHHHhHHH
Confidence 3456679999999986
No 96
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=44.80 E-value=10 Score=38.05 Aligned_cols=15 Identities=27% Similarity=0.246 Sum_probs=12.7
Q ss_pred chhHHHHHHhhhhhc
Q 043086 237 VDMQTVALHELRHVL 251 (257)
Q Consensus 237 ~~l~~va~HEiGH~L 251 (257)
.+=..+++||+||++
T Consensus 373 ~~~v~TL~HE~GHa~ 387 (587)
T TIGR02290 373 RRDVSTLAHELGHAY 387 (587)
T ss_pred chhHHHHHHHhhHHH
Confidence 466778999999997
No 97
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=42.69 E-value=11 Score=31.36 Aligned_cols=12 Identities=42% Similarity=0.517 Sum_probs=10.0
Q ss_pred HHHHHHhhhhhc
Q 043086 240 QTVALHELRHVL 251 (257)
Q Consensus 240 ~~va~HEiGH~L 251 (257)
.-|++|||+|..
T Consensus 136 lDVvaHEltHGV 147 (150)
T PF01447_consen 136 LDVVAHELTHGV 147 (150)
T ss_dssp HHHHHHHHHHHH
T ss_pred cceeeecccccc
Confidence 459999999964
No 98
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=42.55 E-value=13 Score=36.10 Aligned_cols=16 Identities=38% Similarity=0.497 Sum_probs=13.2
Q ss_pred CchhHHHHHHhhhhhc
Q 043086 236 AVDMQTVALHELRHVL 251 (257)
Q Consensus 236 ~~~l~~va~HEiGH~L 251 (257)
..+-..++.||+||+|
T Consensus 205 ~~~~v~tLfHEfGHal 220 (422)
T cd06456 205 THDEVTTLFHEFGHAL 220 (422)
T ss_pred CHHHHHHHHHHHHHHH
Confidence 4566778999999997
No 99
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=41.81 E-value=12 Score=36.17 Aligned_cols=13 Identities=54% Similarity=0.774 Sum_probs=10.7
Q ss_pred hHHHHHHhhhhhc
Q 043086 239 MQTVALHELRHVL 251 (257)
Q Consensus 239 l~~va~HEiGH~L 251 (257)
+..+.+||+||+|
T Consensus 131 vi~~vvHElGHal 143 (484)
T KOG2921|consen 131 VITVVVHELGHAL 143 (484)
T ss_pred HHHHHHHHhhHHH
Confidence 4457899999997
No 100
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=39.74 E-value=24 Score=35.83 Aligned_cols=16 Identities=25% Similarity=0.260 Sum_probs=13.3
Q ss_pred CchhHHHHHHhhhhhc
Q 043086 236 AVDMQTVALHELRHVL 251 (257)
Q Consensus 236 ~~~l~~va~HEiGH~L 251 (257)
..+-..|++||+||+.
T Consensus 377 t~~dV~TLaHElGHs~ 392 (598)
T COG1164 377 TLRDVFTLAHELGHSV 392 (598)
T ss_pred chhHHHHHHHHccHHH
Confidence 4577889999999984
No 101
>PF03571 Peptidase_M49: Peptidase family M49; InterPro: IPR005317 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M49 (dipeptidyl-peptidase III family, clan M-). The predicted active site residues occur in the motif HEXXXH which is unlike that in any other family. The dipeptidyl peptidase III aminopeptidases cleave dipeptides from the N-terminal of peptides consisting of four or more amino acids and have a broad specificity.; GO: 0008239 dipeptidyl-peptidase activity, 0006508 proteolysis, 0005737 cytoplasm; PDB: 3T6B_A 3FVY_A 3T6J_A 3CSK_A.
Probab=38.10 E-value=16 Score=36.79 Aligned_cols=16 Identities=44% Similarity=0.488 Sum_probs=12.7
Q ss_pred chhHHHHHHhh-hhhcC
Q 043086 237 VDMQTVALHEL-RHVLG 252 (257)
Q Consensus 237 ~~l~~va~HEi-GH~LG 252 (257)
....+|++||+ ||-.|
T Consensus 291 af~~~v~lHEllGHGsG 307 (549)
T PF03571_consen 291 AFFVQVGLHELLGHGSG 307 (549)
T ss_dssp HHHHHHHHHHHCCCCS-
T ss_pred hHhhhhhHHhhccCcCc
Confidence 35678999997 99888
No 102
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=36.35 E-value=25 Score=32.36 Aligned_cols=35 Identities=26% Similarity=0.376 Sum_probs=23.8
Q ss_pred CCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCC
Q 043086 215 PTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLA 254 (257)
Q Consensus 215 P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~ 254 (257)
|..-.|++-.-+.+- +..+...+++||+||--|=+
T Consensus 184 Pa~D~Iq~p~~~aF~-----da~~yyaTl~HElghwtgh~ 218 (316)
T COG4227 184 PASDSIQLPPFEAFR-----DAINYYATLLHELGHWTGHE 218 (316)
T ss_pred CccCeeccCChhhcc-----chHhHHHHHHHHhccccCch
Confidence 433346666554442 45689999999999987743
No 103
>COG3926 zliS Lysozyme family protein [General function prediction only]
Probab=35.39 E-value=27 Score=31.36 Aligned_cols=29 Identities=17% Similarity=0.239 Sum_probs=26.3
Q ss_pred HHHHHHHHHHhcC----CCCCCCCChhhhhccC
Q 043086 92 LESAVKTYQRNFN----LNPTGHMDLKTVSMMG 120 (257)
Q Consensus 92 ~~~AIk~FQ~~~g----L~~TG~lD~~T~~~M~ 120 (257)
-..|++-+||..+ .+++|++...|+..+.
T Consensus 93 p~rAa~~LQkal~~~~~v~~DGvIG~~TLaAl~ 125 (252)
T COG3926 93 PGRAAKWLQKALGPAYTVRVDGVIGAQTLAALK 125 (252)
T ss_pred cchHHHHHHHHhccCCcccccCcccHHHHHHHH
Confidence 3579999999999 9999999999998875
No 104
>CHL00132 psaF photosystem I subunit III; Validated
Probab=35.05 E-value=88 Score=27.07 Aligned_cols=39 Identities=8% Similarity=0.107 Sum_probs=24.2
Q ss_pred hcHHHHHHHHHhcCCCCCCCccccccccCCCcccHHHHHHHHHHHHhcC
Q 043086 56 KGINALKKYLHTLGYLSNHNHNHAAAAADGDYFDENLESAVKTYQRNFN 104 (257)
Q Consensus 56 ~~v~~~q~yL~~fGYl~~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~g 104 (257)
..|+.+++.|++|-=-.++ .....+....+.++|.++..
T Consensus 44 ~~~k~Le~rlk~Y~~~s~p----------~lal~~qi~~tk~RFe~Y~~ 82 (185)
T CHL00132 44 NSVKKLENRLAKYEANSPP----------ALALQQQIDKTKARFDKYGR 82 (185)
T ss_pred HHHHHHHhhhhccCCCCCh----------HHHHHHHHHHHHHHHHHhcc
Confidence 3566788888877432221 11245677788888888753
No 105
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=34.30 E-value=21 Score=34.87 Aligned_cols=14 Identities=36% Similarity=0.470 Sum_probs=12.4
Q ss_pred chhHHHHHHhhhhh
Q 043086 237 VDMQTVALHELRHV 250 (257)
Q Consensus 237 ~~l~~va~HEiGH~ 250 (257)
..+..|++||+||-
T Consensus 278 eel~AVl~HELGHW 291 (428)
T KOG2719|consen 278 EELVAVLAHELGHW 291 (428)
T ss_pred HHHHHHHHHHhhHH
Confidence 47999999999994
No 106
>PF11045 YbjM: Putative inner membrane protein of Enterobacteriaceae; InterPro: IPR020368 This entry contains membrane proteins with no known function.; GO: 0016021 integral to membrane
Probab=34.11 E-value=38 Score=27.59 Aligned_cols=25 Identities=24% Similarity=0.284 Sum_probs=19.4
Q ss_pred CcchhhhHHHHHHHHHHHHHHHHHh
Q 043086 1 MKHKRNALHAFISFFLLQFFLLHAL 25 (257)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~l~~~~ 25 (257)
||+||+....+.||+|++...++-.
T Consensus 1 M~~~r~W~g~i~cfvLf~vVfl~~~ 25 (125)
T PF11045_consen 1 MANKRRWAGVICCFVLFIVVFLFLK 25 (125)
T ss_pred CCCCccchHHHHHHHHHHHHHHHhh
Confidence 8999999999988877766554443
No 107
>PF04956 TrbC: TrbC/VIRB2 family; InterPro: IPR007039 Conjugal transfer protein, TrbC has been identified as a subunit of the pilus precursor in bacteria. The protein undergoes three processing steps before gaining its mature cyclic structure[]. This family also contains several VirB2 type IV secretion proteins. The virB2 gene encodes a putative type IV secretion system and is known to be a pathogenicity factor in Bartonella species [].
Probab=33.63 E-value=73 Score=23.87 Aligned_cols=25 Identities=24% Similarity=0.230 Sum_probs=11.8
Q ss_pred CcchhhhHHHHHHHHHHHHHHHHHh
Q 043086 1 MKHKRNALHAFISFFLLQFFLLHAL 25 (257)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~l~~~~ 25 (257)
||+..+.....+..+++.+++++..
T Consensus 1 mk~~~~~~~~~~~~~~~~~~~~~~~ 25 (99)
T PF04956_consen 1 MKKVQSKKRRKFLLLLLALALLLLA 25 (99)
T ss_pred CchHHHHHHHHHHHHHHHHHHHHhC
Confidence 7755544444444444444444443
No 108
>PRK10911 oligopeptidase A; Provisional
Probab=32.26 E-value=22 Score=36.76 Aligned_cols=15 Identities=33% Similarity=0.372 Sum_probs=12.4
Q ss_pred chhHHHHHHhhhhhc
Q 043086 237 VDMQTVALHELRHVL 251 (257)
Q Consensus 237 ~~l~~va~HEiGH~L 251 (257)
.+=..++.|||||+|
T Consensus 461 ~~~v~tlfHEfGHal 475 (680)
T PRK10911 461 HDEVITLFHEFGHGL 475 (680)
T ss_pred HHHHHHHHHHHhHHH
Confidence 356679999999997
No 109
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=31.62 E-value=23 Score=34.26 Aligned_cols=14 Identities=29% Similarity=0.377 Sum_probs=10.9
Q ss_pred hhHHHHHHhhhhhc
Q 043086 238 DMQTVALHELRHVL 251 (257)
Q Consensus 238 ~l~~va~HEiGH~L 251 (257)
+-...++||+||+|
T Consensus 158 ~~l~t~iHE~GHal 171 (396)
T cd06460 158 SALFSTIHETGHAL 171 (396)
T ss_pred HHHHHHHHHhhHHH
Confidence 34457899999986
No 110
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=29.93 E-value=27 Score=33.29 Aligned_cols=14 Identities=36% Similarity=0.541 Sum_probs=12.5
Q ss_pred hhHHHHHHhhhhhc
Q 043086 238 DMQTVALHELRHVL 251 (257)
Q Consensus 238 ~l~~va~HEiGH~L 251 (257)
.+..|+.|||+|++
T Consensus 274 glAtvLgHE~aHaV 287 (424)
T KOG2661|consen 274 GLATVLGHEIAHAV 287 (424)
T ss_pred HHHHHHHHHHHHHH
Confidence 58889999999986
No 111
>PF14891 Peptidase_M91: Effector protein
Probab=29.32 E-value=37 Score=28.60 Aligned_cols=15 Identities=33% Similarity=0.264 Sum_probs=12.6
Q ss_pred chhHHHHHHhhhhhc
Q 043086 237 VDMQTVALHELRHVL 251 (257)
Q Consensus 237 ~~l~~va~HEiGH~L 251 (257)
.+-+.|+.||+.|+.
T Consensus 101 ~~p~v~L~HEL~HA~ 115 (174)
T PF14891_consen 101 RPPFVVLYHELIHAY 115 (174)
T ss_pred HHHHHHHHHHHHHHH
Confidence 467789999999984
No 112
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=28.88 E-value=28 Score=24.42 Aligned_cols=20 Identities=35% Similarity=0.577 Sum_probs=16.8
Q ss_pred hhHHHHHHhhhhhcCCCCCC
Q 043086 238 DMQTVALHELRHVLGLAHRP 257 (257)
Q Consensus 238 ~l~~va~HEiGH~LGL~HSs 257 (257)
...--.+||+.+.+||.|-|
T Consensus 28 ~~eR~~vH~lA~~~gL~s~S 47 (60)
T cd02641 28 SHDRLLVHELAEELGLRHES 47 (60)
T ss_pred HHHHHHHHHHHHHcCCceEe
Confidence 56668899999999999854
No 113
>PLN00081 photosystem I reaction center subunit V (PsaG); Provisional
Probab=28.62 E-value=17 Score=29.54 Aligned_cols=39 Identities=21% Similarity=0.362 Sum_probs=28.2
Q ss_pred CCCCCcccCC-----CCCcc--CCCCCCchhHHHHHHh-hhhhcCCC
Q 043086 216 TDGRFHYDGD-----ENWTV--GAVPGAVDMQTVALHE-LRHVLGLA 254 (257)
Q Consensus 216 ~~g~ihfd~~-----e~wt~--~~~~~~~~l~~va~HE-iGH~LGL~ 254 (257)
.+|..||++. |.|.+ ..++.|+++..|++-- |||+.|..
T Consensus 81 QNG~ThF~agd~ra~e~~~~l~tnDPaGF~ivDvlawGsiGH~vg~~ 127 (141)
T PLN00081 81 QNGKTHFDAGDERAKEFAALLKSNDPVGFNLVDVLAWGSLGHIVAYY 127 (141)
T ss_pred cCCcchhhhHHHHHHHHHHhhccCCCCccchhhhhhhhhhhhhhhhe
Confidence 5799999873 34443 2245789999988875 99998853
No 114
>PF11119 DUF2633: Protein of unknown function (DUF2633); InterPro: IPR022576 This family is conserved largely in Proteobacteria. Several members are named as YfgG. The function is not known.
Probab=26.77 E-value=90 Score=22.12 Aligned_cols=7 Identities=43% Similarity=0.876 Sum_probs=4.2
Q ss_pred Ccchhhh
Q 043086 1 MKHKRNA 7 (257)
Q Consensus 1 ~~~~~~~ 7 (257)
||+|.|.
T Consensus 1 ~r~k~~~ 7 (59)
T PF11119_consen 1 MRRKKNS 7 (59)
T ss_pred CCCcccc
Confidence 5666664
No 115
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=26.02 E-value=66 Score=32.71 Aligned_cols=14 Identities=36% Similarity=0.340 Sum_probs=11.3
Q ss_pred chhHHHHHHhhhhh
Q 043086 237 VDMQTVALHELRHV 250 (257)
Q Consensus 237 ~~l~~va~HEiGH~ 250 (257)
..+..|++|||.|.
T Consensus 277 ~s~~~viaHElAHq 290 (601)
T TIGR02411 277 RSNVDVIAHELAHS 290 (601)
T ss_pred hhhhhhHHHHHHhh
Confidence 34678999999994
No 116
>PRK06287 cobalt transport protein CbiN; Validated
Probab=24.83 E-value=88 Score=24.58 Aligned_cols=28 Identities=32% Similarity=0.197 Sum_probs=14.9
Q ss_pred CcchhhhHHHHHHHHHHHHHHHHHhhhh
Q 043086 1 MKHKRNALHAFISFFLLQFFLLHALASA 28 (257)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 28 (257)
|.++|..+..++++++++..+++..|++
T Consensus 1 ~~~~~~~~~~~~~~all~a~~~s~~AS~ 28 (107)
T PRK06287 1 MMDNKKFLIAGLIVALLIAILAPFLASG 28 (107)
T ss_pred CCcchhhHHHHHHHHHHHHHHHHhhhcC
Confidence 3455555555555555555555544443
No 117
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=24.19 E-value=91 Score=24.00 Aligned_cols=17 Identities=0% Similarity=-0.021 Sum_probs=8.6
Q ss_pred HHHHHHHHHHHHHHHHH
Q 043086 8 LHAFISFFLLQFFLLHA 24 (257)
Q Consensus 8 ~~~~~~~~~~~~~l~~~ 24 (257)
+.+|.++|.+++|+...
T Consensus 6 ~llL~l~LA~lLlisSe 22 (95)
T PF07172_consen 6 FLLLGLLLAALLLISSE 22 (95)
T ss_pred HHHHHHHHHHHHHHHhh
Confidence 44555555555555443
No 118
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=24.01 E-value=37 Score=23.98 Aligned_cols=20 Identities=30% Similarity=0.371 Sum_probs=16.3
Q ss_pred hhHHHHHHhhhhhcCCCCCC
Q 043086 238 DMQTVALHELRHVLGLAHRP 257 (257)
Q Consensus 238 ~l~~va~HEiGH~LGL~HSs 257 (257)
.-.--.+|.+-|-|||.|.|
T Consensus 28 ~~eRriih~la~~lGL~~~s 47 (60)
T cd02639 28 PAERRIVHLLASRLGLNHVS 47 (60)
T ss_pred HHHHHHHHHHHHHcCCceEE
Confidence 45557899999999999864
No 119
>PRK14125 cell division suppressor protein YneA; Provisional
Probab=22.85 E-value=1.2e+02 Score=23.68 Aligned_cols=25 Identities=32% Similarity=0.434 Sum_probs=14.0
Q ss_pred CcchhhhHHHHHHHHHHHHHHHHHh
Q 043086 1 MKHKRNALHAFISFFLLQFFLLHAL 25 (257)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~l~~~~ 25 (257)
||-+-|-++..+++++|++++.+..
T Consensus 1 ~~~~~~~~~~~ii~~~l~~~~~~~~ 25 (103)
T PRK14125 1 LKLKESKIHVSIFFVLTALVLLIFV 25 (103)
T ss_pred CchHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555566666665555555554443
No 120
>PF06167 Peptidase_M90: Glucose-regulated metallo-peptidase M90; InterPro: IPR010384 MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=21.17 E-value=40 Score=30.55 Aligned_cols=17 Identities=35% Similarity=0.573 Sum_probs=12.3
Q ss_pred chhHHHHHHhhhhhcCC
Q 043086 237 VDMQTVALHELRHVLGL 253 (257)
Q Consensus 237 ~~l~~va~HEiGH~LGL 253 (257)
.|-..|++|||.|.|=+
T Consensus 152 ~dg~NVviHEfAH~LD~ 168 (253)
T PF06167_consen 152 NDGHNVVIHEFAHKLDM 168 (253)
T ss_dssp SSS--HHHHHHHHHHHC
T ss_pred CCCcchHHHHHHHHHHh
Confidence 46678999999999843
No 121
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=21.14 E-value=46 Score=34.43 Aligned_cols=14 Identities=36% Similarity=0.501 Sum_probs=11.3
Q ss_pred hhHHHHHHhhhhhc
Q 043086 238 DMQTVALHELRHVL 251 (257)
Q Consensus 238 ~l~~va~HEiGH~L 251 (257)
+=..++-|||||+|
T Consensus 463 ~eV~TlFHEfGHal 476 (681)
T PRK10280 463 DDVITLFHEFGHTL 476 (681)
T ss_pred HHHHHHHHHHHHHH
Confidence 44558999999997
No 122
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=21.01 E-value=44 Score=30.93 Aligned_cols=40 Identities=25% Similarity=0.246 Sum_probs=23.0
Q ss_pred ecCCCCCCCCcccCCCCC----ccCCCCCCchhHHHHHHhhhhhc
Q 043086 211 HSFPPTDGRFHYDGDENW----TVGAVPGAVDMQTVALHELRHVL 251 (257)
Q Consensus 211 haf~P~~g~ihfd~~e~w----t~~~~~~~~~l~~va~HEiGH~L 251 (257)
+-|-|.+..++||....= -++. .+.+--..|++||.||.+
T Consensus 136 PFYCP~D~kvYlDlsFf~~m~~~fga-~GdfAqaYViAHEVGHHV 179 (295)
T COG2321 136 PFYCPADTKVYLDLSFFDEMKTKFGA-SGDFAQAYVIAHEVGHHV 179 (295)
T ss_pred CeecCCCceEEEehhHHHHHHHHhcC-CccHHHHHHHHhhhhHHH
Confidence 335677777777753100 0111 133455669999999974
No 123
>KOG1047 consensus Bifunctional leukotriene A4 hydrolase/aminopeptidase LTA4H [Lipid transport and metabolism; Posttranslational modification, protein turnover, chaperones; Defense mechanisms; Amino acid transport and metabolism]
Probab=20.99 E-value=50 Score=33.48 Aligned_cols=16 Identities=31% Similarity=0.507 Sum_probs=14.3
Q ss_pred CCchhHHHHHHhhhhh
Q 043086 235 GAVDMQTVALHELRHV 250 (257)
Q Consensus 235 ~~~~l~~va~HEiGH~ 250 (257)
++..|..|++|||.|+
T Consensus 284 GDrsl~~vIaHEIAHS 299 (613)
T KOG1047|consen 284 GDRSLVDVIAHEIAHS 299 (613)
T ss_pred CCcchhhHHHHHhhhh
Confidence 5678999999999997
No 124
>KOG3988 consensus Protein-tyrosine sulfotransferase TPST1/TPST2 [Posttranslational modification, protein turnover, chaperones]
Probab=20.90 E-value=2e+02 Score=27.15 Aligned_cols=36 Identities=28% Similarity=0.405 Sum_probs=28.4
Q ss_pred hhhhhhccCCCCCCchhcHHHHHHHHHhcCCCCCCC
Q 043086 40 LEFFKLFQGTQKGDTVKGINALKKYLHTLGYLSNHN 75 (257)
Q Consensus 40 ~~~~~~l~~~~~g~~~~~v~~~q~yL~~fGYl~~~~ 75 (257)
..++.+-.|+-.++...+++.+--+|..+||.+..+
T Consensus 297 l~AlskWvg~ip~dvvrdma~iAPmL~~LGYDp~an 332 (378)
T KOG3988|consen 297 LEALSKWVGCIPEDVVRDMADIAPMLAILGYDPYAN 332 (378)
T ss_pred HHHHHHHhccCCHHHHHHHHHHHHHHHHhCCCCCCC
Confidence 344555557778888889999999999999998764
No 125
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=20.41 E-value=52 Score=23.17 Aligned_cols=21 Identities=19% Similarity=0.401 Sum_probs=17.4
Q ss_pred chhHHHHHHhhhhhcCCCCCC
Q 043086 237 VDMQTVALHELRHVLGLAHRP 257 (257)
Q Consensus 237 ~~l~~va~HEiGH~LGL~HSs 257 (257)
....--++|++-+.+||.|-|
T Consensus 27 t~~eR~~vH~~a~~~gL~s~S 47 (60)
T cd02640 27 SKEERALIHQIAQKYGLKSRS 47 (60)
T ss_pred CHHHHHHHHHHHHHcCCceee
Confidence 356778999999999998754
Done!