Query 020771
Match_columns 321
No_of_seqs 234 out of 1572
Neff 4.6
Searched_HMMs 46136
Date Fri Mar 29 05:01:52 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/020771.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/020771hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2661 Peptidase family M48 [ 100.0 5.4E-56 1.2E-60 425.5 7.5 244 1-318 44-293 (424)
2 COG4783 Putative Zn-dependent 99.8 6.6E-20 1.4E-24 183.7 10.8 116 121-317 32-147 (484)
3 COG4784 Putative Zn-dependent 99.8 7.6E-19 1.6E-23 170.8 9.0 121 117-318 20-142 (479)
4 COG0501 HtpX Zn-dependent prot 99.5 4.7E-14 1E-18 131.2 5.5 57 261-318 115-175 (302)
5 PF01435 Peptidase_M48: Peptid 99.5 8.9E-14 1.9E-18 123.5 6.8 78 235-319 28-108 (226)
6 PRK03982 heat shock protein Ht 99.3 2.7E-12 5.9E-17 121.8 7.8 72 237-317 67-142 (288)
7 PRK02870 heat shock protein Ht 99.3 3.9E-12 8.4E-17 124.2 8.0 74 236-317 113-190 (336)
8 PRK01345 heat shock protein Ht 99.3 4E-12 8.6E-17 122.9 7.6 73 237-318 66-142 (317)
9 PRK03001 M48 family peptidase; 99.2 1.3E-11 2.9E-16 116.8 7.2 53 264-317 85-141 (283)
10 PRK03072 heat shock protein Ht 99.2 1.3E-11 2.7E-16 117.8 6.7 73 236-317 68-144 (288)
11 PRK04897 heat shock protein Ht 99.2 2.1E-11 4.5E-16 116.7 7.5 73 236-317 78-154 (298)
12 PRK05457 heat shock protein Ht 99.2 4.2E-11 9.2E-16 114.1 7.6 61 254-317 87-151 (284)
13 PRK02391 heat shock protein Ht 99.2 4.6E-11 1E-15 114.5 7.1 73 236-317 74-150 (296)
14 PRK01265 heat shock protein Ht 99.1 7.9E-11 1.7E-15 114.6 7.7 72 237-317 82-157 (324)
15 KOG2719 Metalloprotease [Gener 97.1 0.00041 9E-09 70.2 4.4 54 265-318 236-298 (428)
16 PF13203 DUF2201_N: Putative m 96.5 0.0043 9.3E-08 58.8 5.2 51 266-317 27-77 (292)
17 PF05569 Peptidase_M56: BlaR1 96.1 0.007 1.5E-07 57.4 4.4 51 266-317 160-213 (299)
18 PF04228 Zn_peptidase: Putativ 95.2 0.039 8.5E-07 53.6 5.9 43 273-315 136-185 (292)
19 PF06114 DUF955: Domain of unk 94.5 0.079 1.7E-06 41.7 5.0 48 265-317 7-59 (122)
20 PF12388 Peptidase_M57: Dual-a 92.7 0.09 1.9E-06 49.1 2.9 34 280-316 116-150 (211)
21 PF13699 DUF4157: Domain of un 92.4 0.086 1.9E-06 41.7 2.0 39 273-314 37-75 (79)
22 PF10263 SprT-like: SprT-like 91.2 0.56 1.2E-05 40.1 6.0 31 283-314 44-74 (157)
23 smart00731 SprT SprT homologue 90.3 0.34 7.4E-06 41.9 3.8 32 281-313 41-72 (146)
24 PRK04351 hypothetical protein; 89.9 0.45 9.8E-06 42.0 4.3 33 279-312 41-73 (149)
25 PF00413 Peptidase_M10: Matrix 86.8 0.44 9.6E-06 40.1 2.1 22 295-316 100-122 (154)
26 COG1451 Predicted metal-depend 86.2 0.55 1.2E-05 43.9 2.6 48 270-319 147-195 (223)
27 PF10026 DUF2268: Predicted Zn 85.5 1.4 3.1E-05 39.8 4.9 33 282-316 49-81 (195)
28 COG4219 MecR1 Antirepressor re 84.8 1.3 2.8E-05 44.1 4.5 50 266-316 154-206 (337)
29 cd04279 ZnMc_MMP_like_1 Zinc-d 84.6 0.63 1.4E-05 40.0 2.1 20 297-316 101-121 (156)
30 PF13485 Peptidase_MA_2: Pepti 84.4 1.1 2.5E-05 35.4 3.3 22 295-316 20-41 (128)
31 PF01863 DUF45: Protein of unk 84.4 0.64 1.4E-05 41.3 2.1 54 265-320 131-185 (205)
32 COG2321 Predicted metalloprote 83.2 1.1 2.3E-05 43.8 3.2 42 273-314 133-181 (295)
33 cd04276 ZnMc_MMP_like_2 Zinc-d 82.6 1 2.2E-05 41.5 2.7 36 282-317 92-134 (197)
34 cd04268 ZnMc_MMP_like Zinc-dep 82.6 0.99 2.2E-05 38.5 2.5 21 297-317 91-112 (165)
35 PF04450 BSP: Peptidase of pla 82.5 3.2 7E-05 38.4 5.9 45 268-313 57-109 (205)
36 COG3590 PepO Predicted metallo 81.4 0.43 9.4E-06 50.6 -0.2 46 273-318 450-505 (654)
37 PF14247 DUF4344: Domain of un 80.3 11 0.00024 35.5 8.7 50 263-312 38-104 (220)
38 PRK04860 hypothetical protein; 79.1 2.1 4.5E-05 38.4 3.3 32 282-314 46-77 (160)
39 PF10463 Peptidase_U49: Peptid 78.1 1.4 3E-05 41.1 2.0 21 298-318 99-119 (206)
40 COG2856 Predicted Zn peptidase 76.6 2.3 5.1E-05 39.7 3.1 30 282-316 59-88 (213)
41 cd04278 ZnMc_MMP Zinc-dependen 74.5 1.7 3.7E-05 37.4 1.5 21 297-317 104-125 (157)
42 cd04277 ZnMc_serralysin_like Z 74.4 2.3 4.9E-05 37.6 2.3 36 282-317 94-131 (186)
43 cd00203 ZnMc Zinc-dependent me 73.8 1.5 3.3E-05 37.2 1.0 20 297-316 93-113 (167)
44 cd04269 ZnMc_adamalysin_II_lik 73.0 2.6 5.6E-05 37.4 2.3 19 297-315 128-147 (194)
45 cd04270 ZnMc_TACE_like Zinc-de 73.0 2.3 5.1E-05 39.8 2.1 32 284-315 146-183 (244)
46 PF13582 Reprolysin_3: Metallo 72.7 1.9 4.1E-05 35.1 1.3 14 302-315 109-123 (124)
47 PF05572 Peptidase_M43: Pregna 72.1 2.1 4.6E-05 37.7 1.5 23 296-318 65-88 (154)
48 PF13688 Reprolysin_5: Metallo 72.0 2.2 4.8E-05 37.6 1.6 20 296-315 138-158 (196)
49 KOG3624 M13 family peptidase [ 71.3 3.2 7E-05 44.2 2.9 47 273-319 481-537 (687)
50 PF01421 Reprolysin: Reprolysi 70.7 3.4 7.3E-05 36.8 2.5 21 295-315 126-147 (199)
51 PF13583 Reprolysin_4: Metallo 69.5 2.5 5.3E-05 38.7 1.4 16 302-317 139-155 (206)
52 cd04267 ZnMc_ADAM_like Zinc-de 68.2 2.1 4.6E-05 37.8 0.7 20 297-316 130-150 (192)
53 PF01447 Peptidase_M4: Thermol 67.5 3.6 7.7E-05 36.3 1.9 41 271-315 107-150 (150)
54 smart00235 ZnMc Zinc-dependent 66.7 3.2 7E-05 34.7 1.5 35 276-316 67-103 (140)
55 PF14891 Peptidase_M91: Effect 66.4 3.3 7.2E-05 36.8 1.5 17 296-312 99-115 (174)
56 PF13574 Reprolysin_2: Metallo 65.8 3.3 7.2E-05 36.7 1.4 17 300-316 111-128 (173)
57 cd04272 ZnMc_salivary_gland_MP 63.5 4 8.7E-05 37.2 1.5 14 299-312 144-157 (220)
58 COG3091 SprT Zn-dependent meta 61.5 8.3 0.00018 34.7 3.1 29 283-312 45-73 (156)
59 PF01433 Peptidase_M1: Peptida 60.0 12 0.00025 36.2 4.1 51 264-314 254-309 (390)
60 PRK09672 phage exclusion prote 59.8 6 0.00013 39.1 2.1 20 298-317 163-182 (305)
61 PF08325 WLM: WLM domain; Int 59.3 4 8.8E-05 37.3 0.8 25 296-320 78-103 (186)
62 TIGR02411 leuko_A4_hydro leuko 58.7 8.1 0.00018 41.0 3.0 32 276-311 259-290 (601)
63 PF02031 Peptidase_M7: Strepto 54.0 6.8 0.00015 34.4 1.3 47 262-312 40-89 (132)
64 KOG1047 Bifunctional leukotrie 53.8 6.3 0.00014 42.0 1.2 27 283-312 274-300 (613)
65 cd04273 ZnMc_ADAMTS_like Zinc- 53.7 2.1 4.5E-05 38.7 -2.1 16 300-315 140-156 (207)
66 PF04298 Zn_peptidase_2: Putat 52.7 7.6 0.00017 36.8 1.5 16 301-316 90-105 (222)
67 COG2738 Predicted Zn-dependent 51.5 7.9 0.00017 36.4 1.3 16 301-316 93-108 (226)
68 cd04271 ZnMc_ADAM_fungal Zinc- 51.0 6 0.00013 36.9 0.5 14 302-315 147-161 (228)
69 PF01457 Peptidase_M8: Leishma 50.8 17 0.00036 38.0 3.7 54 258-312 156-222 (521)
70 PF14521 Aspzincin_M35: Lysine 50.5 20 0.00043 31.2 3.6 40 272-311 64-107 (148)
71 PF13402 M60-like: Peptidase M 47.6 23 0.00049 33.5 3.8 51 264-314 178-233 (307)
72 cd04327 ZnMc_MMP_like_3 Zinc-d 47.6 13 0.00028 33.5 2.1 15 298-312 90-104 (198)
73 cd06459 M3B_Oligoendopeptidase 46.3 12 0.00026 36.7 1.8 39 271-313 194-235 (427)
74 TIGR00181 pepF oligoendopeptid 40.9 9.5 0.00021 39.8 0.2 45 266-314 346-392 (591)
75 TIGR02412 pepN_strep_liv amino 40.5 29 0.00063 38.3 3.8 15 81-95 120-134 (831)
76 PTZ00337 surface protease GP63 40.1 56 0.0012 35.0 5.6 52 258-312 178-241 (567)
77 TIGR02414 pepN_proteo aminopep 38.8 16 0.00035 40.7 1.5 24 72-95 99-122 (863)
78 COG0308 PepN Aminopeptidase N 38.7 23 0.0005 39.3 2.7 51 262-312 262-319 (859)
79 PF06861 BALF1: BALF1 protein; 38.3 21 0.00045 32.8 1.9 25 292-316 124-148 (182)
80 PF12315 DUF3633: Protein of u 38.1 19 0.0004 34.0 1.6 26 284-314 82-107 (212)
81 PF09715 Plasmod_dom_1: Plasmo 37.1 15 0.00033 28.8 0.7 31 2-32 1-31 (67)
82 PRK13267 archaemetzincin-like 36.6 39 0.00085 30.7 3.4 17 299-315 124-141 (179)
83 PRK14015 pepN aminopeptidase N 34.3 26 0.00056 39.2 2.2 24 72-95 112-135 (875)
84 KOG1565 Gelatinase A and relat 33.8 18 0.0004 37.6 0.9 18 299-316 210-228 (469)
85 TIGR02289 M3_not_pepF oligoend 33.2 21 0.00046 37.1 1.3 41 270-314 308-351 (549)
86 COG3824 Predicted Zn-dependent 33.0 31 0.00068 30.2 2.0 23 132-154 11-33 (136)
87 PF10023 DUF2265: Predicted am 32.9 25 0.00055 35.2 1.7 17 296-312 161-177 (337)
88 cd04275 ZnMc_pappalysin_like Z 31.7 8 0.00017 36.3 -1.9 20 299-318 136-156 (225)
89 TIGR01519 plasmod_dom_1 Plasmo 31.6 22 0.00047 28.1 0.8 31 2-32 1-31 (70)
90 KOG3658 Tumor necrosis factor- 30.8 13 0.00029 40.5 -0.7 32 281-312 368-404 (764)
91 PF09768 Peptidase_M76: Peptid 30.6 39 0.00085 30.7 2.4 29 281-312 55-83 (173)
92 PF12725 DUF3810: Protein of u 29.9 27 0.00059 34.3 1.4 15 298-312 194-208 (318)
93 PTZ00257 Glycoprotein GP63 (le 29.9 99 0.0021 33.6 5.5 55 258-313 202-269 (622)
94 PF02571 CbiJ: Precorrin-6x re 29.6 50 0.0011 31.4 3.0 37 142-178 54-91 (249)
95 KOG3607 Meltrins, fertilins an 27.4 35 0.00075 37.5 1.7 21 296-316 319-340 (716)
96 KOG1046 Puromycin-sensitive am 26.2 94 0.002 34.8 4.8 47 266-312 283-336 (882)
97 cd06164 S2P-M50_SpoIVFB_CBS Sp 26.2 40 0.00088 31.4 1.7 14 299-312 52-65 (227)
98 PF07225 NDUF_B4: NADH-ubiquin 26.0 38 0.00082 29.4 1.4 47 62-116 46-99 (125)
99 PF01432 Peptidase_M3: Peptida 25.5 38 0.00081 34.1 1.5 17 296-313 239-255 (458)
100 COG4324 Predicted aminopeptida 25.2 50 0.0011 32.8 2.2 45 261-311 162-208 (376)
101 cd06160 S2P-M50_like_2 Unchara 24.0 48 0.001 30.1 1.7 16 297-312 38-53 (183)
102 COG3864 Uncharacterized protei 24.0 62 0.0013 32.7 2.6 35 281-316 51-85 (396)
103 TIGR02290 M3_fam_3 oligoendope 23.9 39 0.00084 35.5 1.3 44 266-313 342-388 (587)
104 PF01400 Astacin: Astacin (Pep 23.8 44 0.00095 30.1 1.4 12 301-312 80-91 (191)
105 PF08014 DUF1704: Domain of un 23.0 1.1E+02 0.0023 30.8 4.1 47 263-313 132-179 (349)
106 COG1164 Oligoendopeptidase F [ 23.0 38 0.00083 36.2 1.0 48 263-314 343-394 (598)
107 cd06455 M3A_TOP Peptidase M3 T 22.6 45 0.00097 34.2 1.4 17 297-314 261-277 (472)
108 cd04281 ZnMc_BMP1_TLD Zinc-dep 22.5 40 0.00086 31.1 0.9 12 301-312 88-99 (200)
109 cd04283 ZnMc_hatching_enzyme Z 21.9 42 0.00091 30.6 0.9 12 301-312 78-89 (182)
110 PF02102 Peptidase_M35: Deuter 21.5 1.3E+02 0.0028 30.5 4.4 42 272-313 262-310 (359)
111 TIGR03296 M6dom_TIGR03296 M6 f 21.1 24 0.00051 33.9 -0.9 12 301-312 166-177 (286)
112 COG4900 Predicted metallopepti 20.9 85 0.0018 27.3 2.5 26 286-312 67-92 (133)
113 cd06159 S2P-M50_PDZ_Arch Uncha 20.3 62 0.0013 31.2 1.7 33 274-312 96-130 (263)
No 1
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=5.4e-56 Score=425.48 Aligned_cols=244 Identities=27% Similarity=0.417 Sum_probs=224.1
Q ss_pred ChhhhhhHHHHHHHhh-cccCCCCCCCcccccc-ccccCC----CCCcccccCCcccceeeccccccccCCCCCCCCcce
Q 020771 1 MAYYRRAKAALDAFRN-LSSKAVPKSPVQESCS-RVYSNG----SANSAKFSSGFYSYSCISQRLRNSYCNPNFNTAKRY 74 (321)
Q Consensus 1 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ 74 (321)
|.|||+.|+.+|.+++ ++++|.|++....+.+ .+.++. .+...+.+.|++++.+.-+.++..-+|||.+.+.+|
T Consensus 44 ~~~~~~~~~~~~~l~~~~~~~i~~~t~~~t~~n~~~~~~~~~~~~t~v~S~~v~l~~~~~s~Rn~Ng~~y~rFqsnp~~y 123 (424)
T KOG2661|consen 44 HFYSTFNNKRTGGLSSTKSKEIWRITSKCTVWNDAFSRQLLIKEVTAVPSLSVLLPLSPASIRNINGHTYPRFQSNPVPY 123 (424)
T ss_pred eeeeeecchhcccccccCCccceecccccceehhhhhhccchhhhhcccccccccccchhhhhccCccccchhhcCCccH
Confidence 7899999999999999 9999999999877776 333322 445578899999999999999999999999999999
Q ss_pred eEeecceeeeeccCCCCccccCCchhHHHHHHHHHHHHHHHhhccCccCCccchhccCCCHHHHHHHHHHHHHHHHHHhc
Q 020771 75 YYVDRYHVQHFRPRGPRKWLQNPRTVFIVVVIGSGAFITLYLGNLETVPYTKRTHFVLLSKAVERQLGESQFQQMKAAFK 154 (321)
Q Consensus 75 y~~~~~~~~hf~~rg~r~w~~~~r~~~~~v~vg~g~~~~~y~~~lEtvP~TgR~rfillS~~~E~~Lgeq~y~qil~e~~ 154 (321)
|+.+++.|+||+|++.-+|++|+|++++++++|+.++++|||+|+|++|+|||+||+++|+++|..||+.+|+++++|++
T Consensus 124 s~~~r~~~~sfkpn~~dk~f~n~rk~f~~lf~G~~~~~vfyfthlE~~Pit~Rsrfillskp~ekllge~~ye~im~eyq 203 (424)
T KOG2661|consen 124 SLMIRKWWQSFKPNKKDKVFENIRKNFWKLFLGLLLFVVFYFTHLEVSPITGRSRFILLSKPQEKLLGELEYEAIMEEYQ 203 (424)
T ss_pred HHHHHHHHHhcCCCCccchhhhHHHHHHHHHhhccceEEEEEeeeeeeccCCcceeeeccCHHHHHhhHHHHHHHHHHhc
Confidence 99999999999999999999999999999999876677899999999999999999999999999999999999999999
Q ss_pred CCCCCCCChhHHHHHHHHHHHHHHHHhcccccccccccCCCCccccccccchhhhhHHhhhhcccccccccCcccccccc
Q 020771 155 GKILPAIHPDSVRVRLIAKDIIEALQRGLKHETVWSDMGYASTETDFVNEGRAARDTLRALSENSERGKTEGKWHQEDEI 234 (321)
Q Consensus 155 ~kiLP~~hp~s~rVr~Ia~rIi~al~~~l~~~~~~~~~gy~~~~~~~~~~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~l 234 (321)
++|||+.||++.+|++|+.+||++++.
T Consensus 204 ~~mLp~~hp~sl~V~~vlk~iIea~~~----------------------------------------------------- 230 (424)
T KOG2661|consen 204 NDMLPEKHPRSLAVKEVLKHIIEANKD----------------------------------------------------- 230 (424)
T ss_pred CccCCcCCchhhHHHHHHHHHHHHhcc-----------------------------------------------------
Confidence 999999999999999999999998520
Q ss_pred cchHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcc
Q 020771 235 LDDKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRI 314 (321)
Q Consensus 235 l~D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~r 314 (321)
...+++..|+++||+||.+||||+|||+|||+||+|..|++||+||+|||||+||.++|
T Consensus 231 ---------------------~~slsgIkWeihVVndPipNAFvLPgGKvfVFtgiLn~ck~ddglAtvLgHE~aHaVar 289 (424)
T KOG2661|consen 231 ---------------------VPSLSGIKWEIHVVNDPIPNAFVLPGGKVFVFTGILNSCKDDDGLATVLGHEIAHAVAR 289 (424)
T ss_pred ---------------------CCcccCceeEEEEecCCCCceeeccCCeEEEEechhhcccChHHHHHHHHHHHHHHHHH
Confidence 01256789999999999999999999999999999999999999999999999999999
Q ss_pred cCCC
Q 020771 315 CGLP 318 (321)
Q Consensus 315 H~l~ 318 (321)
|+.+
T Consensus 290 H~AE 293 (424)
T KOG2661|consen 290 HAAE 293 (424)
T ss_pred HHHH
Confidence 9853
No 2
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=99.81 E-value=6.6e-20 Score=183.66 Aligned_cols=116 Identities=27% Similarity=0.313 Sum_probs=94.1
Q ss_pred ccCCccchhccCCCHHHHHHHHHHHHHHHHHHhcCCCCCCCChhHHHHHHHHHHHHHHHHhcccccccccccCCCCcccc
Q 020771 121 TVPYTKRTHFVLLSKAVERQLGESQFQQMKAAFKGKILPAIHPDSVRVRLIAKDIIEALQRGLKHETVWSDMGYASTETD 200 (321)
Q Consensus 121 tvP~TgR~rfillS~~~E~~Lgeq~y~qil~e~~~kiLP~~hp~s~rVr~Ia~rIi~al~~~l~~~~~~~~~gy~~~~~~ 200 (321)
..|.+|....-.+|+++|..+|...+.|+..+ .
T Consensus 32 ~lp~ig~~~~s~ls~~qev~~g~~~~~Qlr~~--~--------------------------------------------- 64 (484)
T COG4783 32 LLPDIGVSAGSTLSPAQEVALGDPANAQLRGS--V--------------------------------------------- 64 (484)
T ss_pred hcchhhhhhhhcCCHHHHHHhhHHHHHHhccC--C---------------------------------------------
Confidence 46888999999999999999999877766543 1
Q ss_pred ccccchhhhhHHhhhhcccccccccCcccccccccchHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeC
Q 020771 201 FVNEGRAARDTLRALSENSERGKTEGKWHQEDEILDDKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLP 280 (321)
Q Consensus 201 ~~~~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~ll~D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalP 280 (321)
+++.||++++|++. +..++...+++++.+++|++|+|+.+||||+|
T Consensus 65 --------------------------------~~i~D~el~~yv~~--~g~rL~~~a~~~~~~f~f~lV~d~~iNAFA~~ 110 (484)
T COG4783 65 --------------------------------PLIRDPELEEYVNS--LGQRLAAAADLVKTPFTFFLVNDDSINAFATP 110 (484)
T ss_pred --------------------------------CCcCCHHHHHHHHH--HHHHHHHhcCCCCCCeEEEEecCCccchhhcC
Confidence 23444445555542 12244444556788999999999999999999
Q ss_pred CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccCC
Q 020771 281 GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICGL 317 (321)
Q Consensus 281 gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~l 317 (321)
||.|+|+||||-..+||+|||+||||||||+..+|..
T Consensus 111 Gg~v~vntGLll~ae~esElagViAHEigHv~qrH~a 147 (484)
T COG4783 111 GGYVVVNTGLLLTAENESELAGVIAHEIGHVAQRHLA 147 (484)
T ss_pred CceEEEehHHHHhcCCHHHHHHHHHHHHHHHhhhhHH
Confidence 9999999999999999999999999999999999964
No 3
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=99.77 E-value=7.6e-19 Score=170.81 Aligned_cols=121 Identities=24% Similarity=0.350 Sum_probs=100.2
Q ss_pred hccCccCCccchhccCCCHHHHHH-HHHHHHHHHHHHhcCCCCCCCChhH-HHHHHHHHHHHHHHHhcccccccccccCC
Q 020771 117 GNLETVPYTKRTHFVLLSKAVERQ-LGESQFQQMKAAFKGKILPAIHPDS-VRVRLIAKDIIEALQRGLKHETVWSDMGY 194 (321)
Q Consensus 117 ~~lEtvP~TgR~rfillS~~~E~~-Lgeq~y~qil~e~~~kiLP~~hp~s-~rVr~Ia~rIi~al~~~l~~~~~~~~~gy 194 (321)
..|...|-|+-.-+..+++.++.. ||.++-..|+++|++.. +||-. +.|.+|+.++.
T Consensus 20 ~~~~~~Ps~~p~~v~~v~~n~~laqmgar~hprIla~yggeY---~D~Kler~Vari~g~lt------------------ 78 (479)
T COG4784 20 YESGVQPSDNPQTVDNVQKNDPLAQMGARQHPRILATYGGEY---RDPKLERMVARIVGALT------------------ 78 (479)
T ss_pred cccccCCCCCcchHHHhhccChHhHhhhhhChhHHHhcCCcc---CCHHHHHHHHHHHhHhh------------------
Confidence 367778999999999999988766 99999999999998765 24443 23444443332
Q ss_pred CCccccccccchhhhhHHhhhhcccccccccCcccccccccchHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCC
Q 020771 195 ASTETDFVNEGRAARDTLRALSENSERGKTEGKWHQEDEILDDKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVI 274 (321)
Q Consensus 195 ~~~~~~~~~~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~ll~D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~ 274 (321)
..++-+...+.|.|+|+|.+
T Consensus 79 ------------------------------------------------------------~~S~~p~q~YriTilnSP~I 98 (479)
T COG4784 79 ------------------------------------------------------------AVSENPQQTYRITILNSPNI 98 (479)
T ss_pred ------------------------------------------------------------hhccCCCceEEEEEecCCCc
Confidence 22344677899999999999
Q ss_pred ceEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccCCC
Q 020771 275 NAFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICGLP 318 (321)
Q Consensus 275 NAfalPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~l~ 318 (321)
|||++|||.|+|+.|||..+.++.|+|+||+||||||..+|+++
T Consensus 99 NAFALPGGYlYitRGLlAland~sEvAAVl~HEmgHVtAnHgi~ 142 (479)
T COG4784 99 NAFALPGGYLYITRGLLALANDSSEVAAVLAHEMGHVTANHGIQ 142 (479)
T ss_pred cccccCCceEEEehhHHHHcCCHHHHHHHHHhhhhheecchhHH
Confidence 99999999999999999999999999999999999999999975
No 4
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=99.46 E-value=4.7e-14 Score=131.21 Aligned_cols=57 Identities=35% Similarity=0.529 Sum_probs=53.1
Q ss_pred CCCeEEEEEeCCCCceEEeCC----CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccCCC
Q 020771 261 GLNWEVLVVNEPVINAFCLPG----GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICGLP 318 (321)
Q Consensus 261 ~~~w~v~Vv~~~~~NAfalPg----G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~l~ 318 (321)
...|+++|++++.+||||+|+ |.|+|++|||+.+ |+|||++|||||+||+.++|++.
T Consensus 115 ~~~~~v~i~~~~~~NAFa~g~~~~~~~V~vt~gLl~~l-~~dEl~aVlaHElgHi~~rd~~~ 175 (302)
T COG0501 115 PHMPEVYILETPQPNAFALGGGPKNGRVVVTTGLLDLL-NDDELEAVLAHELGHIKNRHTLV 175 (302)
T ss_pred CCCCeeEEecCCCccceecCCCCCCeeEEecHHHHhhC-CHHHHHHHHHHHHHHHhcccHHH
Confidence 457999999999999999998 7999999999977 99999999999999999999863
No 5
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=99.46 E-value=8.9e-14 Score=123.55 Aligned_cols=78 Identities=29% Similarity=0.394 Sum_probs=59.2
Q ss_pred cchHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeCCCe---EEEcHhHHhhcCCHHHHHHHHHHHHHhh
Q 020771 235 LDDKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLPGGK---IVVFTGLLEHFRTDAEIATIIGHEPRGL 311 (321)
Q Consensus 235 l~D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalPgG~---I~V~tGLL~~~~~edeLAaVLaHEiaHv 311 (321)
.+++++++.++. +++..+....+++++|++++.+||||+|+|. |+|++||+..+ |+|||++|||||+||+
T Consensus 28 ~~~~~L~~~v~~------l~~~~~~~~~~~~v~v~~~~~~NA~~~g~~~~~~I~v~~~ll~~~-~~~el~aVlaHElgH~ 100 (226)
T PF01435_consen 28 LEDPELRRIVEE------LARRAGLGIPPPRVYVIDSPSPNAFATGGGPRKRIVVTSGLLESL-SEDELAAVLAHELGHI 100 (226)
T ss_dssp -HHHHHHHHHHH------HHHHHHCTSS--EEEEE--SSEEEEEETTTC--EEEEEHHHHHHS-SHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHH------HHHHhcCCCCCCeEEEEcCCCCcEEEEccCCCcEEEEeChhhhcc-cHHHHHHHHHHHHHHH
Confidence 345666666653 2233333445689999999999999999999 99999999665 9999999999999999
Q ss_pred hcccCCCC
Q 020771 312 QRICGLPF 319 (321)
Q Consensus 312 ~~rH~l~~ 319 (321)
..+|....
T Consensus 101 ~~~h~~~~ 108 (226)
T PF01435_consen 101 KHRHILKS 108 (226)
T ss_dssp HTTHCCCC
T ss_pred HcCCcchH
Confidence 99998764
No 6
>PRK03982 heat shock protein HtpX; Provisional
Probab=99.33 E-value=2.7e-12 Score=121.77 Aligned_cols=72 Identities=18% Similarity=0.244 Sum_probs=58.1
Q ss_pred hHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeC----CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771 237 DKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLP----GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 237 D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalP----gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~ 312 (321)
+|++++.++. ++...+++ ..+++|++++.+|||+++ +|.|+|++||++.+ |+|||++|||||+||+.
T Consensus 67 ~p~L~~~v~~------la~~~g~~--~p~v~v~~~~~~NAfa~G~~~~~~~V~vt~gLl~~l-~~~El~AVlAHElgHi~ 137 (288)
T PRK03982 67 APELYRIVER------LAERANIP--KPKVAIVPTQTPNAFATGRDPKHAVVAVTEGILNLL-NEDELEGVIAHELTHIK 137 (288)
T ss_pred hHHHHHHHHH------HHHHcCCC--CCeEEEEeCCCcceEEeccCCCCeEEEeehHHHhhC-CHHHHHHHHHHHHHHHH
Confidence 5666666553 23344443 348999999999999994 46789999999999 89999999999999999
Q ss_pred cccCC
Q 020771 313 RICGL 317 (321)
Q Consensus 313 ~rH~l 317 (321)
++|.+
T Consensus 138 ~~h~~ 142 (288)
T PRK03982 138 NRDTL 142 (288)
T ss_pred cCCHH
Confidence 99974
No 7
>PRK02870 heat shock protein HtpX; Provisional
Probab=99.31 E-value=3.9e-12 Score=124.19 Aligned_cols=74 Identities=19% Similarity=0.305 Sum_probs=59.2
Q ss_pred chHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEe----CCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhh
Q 020771 236 DDKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCL----PGGKIVVFTGLLEHFRTDAEIATIIGHEPRGL 311 (321)
Q Consensus 236 ~D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfal----PgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv 311 (321)
+++++.+.++. ++...+++ ..++++|++++.+||||+ |++.|+|++|||+.+ |+|||++|||||+||+
T Consensus 113 ~~~~L~~~ve~------La~~ag~p-~~p~V~vi~~~~~NAFA~G~~~~~~~Ivvt~GLL~~L-~~dEL~aVlAHELgHi 184 (336)
T PRK02870 113 QERQLYNVVEE------LLVAAGLR-FMPKVYIIDAPYMNAFASGYSEKSAMVAITTGLLEKL-DRDELQAVMAHELSHI 184 (336)
T ss_pred hhHHHHHHHHH------HHHHcCCC-CCCeEEEEcCCCCceEEecCCCCCcEEEEehHHhhhC-CHHHHHHHHHHHHHHH
Confidence 45566665543 23333432 347999999999999999 478999999999999 8999999999999999
Q ss_pred hcccCC
Q 020771 312 QRICGL 317 (321)
Q Consensus 312 ~~rH~l 317 (321)
.++|..
T Consensus 185 k~~di~ 190 (336)
T PRK02870 185 RHGDIR 190 (336)
T ss_pred HcccHH
Confidence 999863
No 8
>PRK01345 heat shock protein HtpX; Provisional
Probab=99.30 E-value=4e-12 Score=122.89 Aligned_cols=73 Identities=21% Similarity=0.283 Sum_probs=58.5
Q ss_pred hHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeCC----CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771 237 DKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLPG----GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 237 D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalPg----G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~ 312 (321)
+|++.+.++. ++...+++ ..+++|++++.+|||+++. +.|+|++|||+.+ |+|||++||||||||+.
T Consensus 66 ~p~L~~~v~~------La~~agi~--~p~v~vid~~~~NAFa~G~~~~~~~V~vt~gLL~~L-~~dEL~aVlAHElgHi~ 136 (317)
T PRK01345 66 APELYRMVRD------LARRAGLP--MPKVYIIDNPQPNAFATGRNPENAAVAATTGLLQRL-SPEEVAGVMAHELAHVK 136 (317)
T ss_pred CHHHHHHHHH------HHHHcCCC--CCcEEEEcCCCcceEEecCCCCCeEEEechHHHhhC-CHHHHHHHHHHHHHHHH
Confidence 4556665543 34445554 3589999999999999953 4799999999998 79999999999999999
Q ss_pred cccCCC
Q 020771 313 RICGLP 318 (321)
Q Consensus 313 ~rH~l~ 318 (321)
++|.+.
T Consensus 137 ~~d~~~ 142 (317)
T PRK01345 137 NRDTLT 142 (317)
T ss_pred cCCHHH
Confidence 999753
No 9
>PRK03001 M48 family peptidase; Provisional
Probab=99.24 E-value=1.3e-11 Score=116.83 Aligned_cols=53 Identities=26% Similarity=0.335 Sum_probs=48.7
Q ss_pred eEEEEEeCCCCceEEeCC----CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccCC
Q 020771 264 WEVLVVNEPVINAFCLPG----GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICGL 317 (321)
Q Consensus 264 w~v~Vv~~~~~NAfalPg----G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~l 317 (321)
.+++|++++.+|||+++. +.|+|++|||+.+ |+|||++|||||+||+.++|..
T Consensus 85 p~v~v~~~~~~NAfa~G~~~~~~~Ivvt~gLl~~l-~~~El~aVlAHElgHi~~~h~~ 141 (283)
T PRK03001 85 PKVYLINEDQPNAFATGRNPEHAAVAATTGILRVL-SEREIRGVMAHELAHVKHRDIL 141 (283)
T ss_pred CeEEEecCCCcceEEecCCCCCeEEEecHHHHhhC-CHHHHHHHHHHHHHHHhCCChH
Confidence 489999999999999974 3599999999999 8999999999999999999975
No 10
>PRK03072 heat shock protein HtpX; Provisional
Probab=99.23 E-value=1.3e-11 Score=117.77 Aligned_cols=73 Identities=15% Similarity=0.177 Sum_probs=59.4
Q ss_pred chHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeCC----CeEEEcHhHHhhcCCHHHHHHHHHHHHHhh
Q 020771 236 DDKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLPG----GKIVVFTGLLEHFRTDAEIATIIGHEPRGL 311 (321)
Q Consensus 236 ~D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalPg----G~I~V~tGLL~~~~~edeLAaVLaHEiaHv 311 (321)
+||++.+.++. ++...+++ ..+++|++++.+|||+++. +.|+|++||++.+ |+|||++|||||+||+
T Consensus 68 ~~p~L~~~v~~------la~~~g~p--~p~vyv~~~~~~NAFa~G~~~~~~~v~vt~gLl~~l-~~~El~aVlAHElgHi 138 (288)
T PRK03072 68 QAPAMYRIVRE------LSTAARQP--MPRLYISPTAAPNAFATGRNPRNAAVCCTEGILQIL-NERELRGVLGHELSHV 138 (288)
T ss_pred hhHHHHHHHHH------HHHHcCCC--CCCEEEecCCCCceEEecCCCCCcEEEecHHHHHhC-CHHHHHHHHHHHHHHH
Confidence 56777776653 34445554 4589999999999999952 2499999999999 8999999999999999
Q ss_pred hcccCC
Q 020771 312 QRICGL 317 (321)
Q Consensus 312 ~~rH~l 317 (321)
.++|.+
T Consensus 139 ~~~d~~ 144 (288)
T PRK03072 139 YNRDIL 144 (288)
T ss_pred hcCCHH
Confidence 999975
No 11
>PRK04897 heat shock protein HtpX; Provisional
Probab=99.22 E-value=2.1e-11 Score=116.66 Aligned_cols=73 Identities=21% Similarity=0.294 Sum_probs=59.2
Q ss_pred chHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeC----CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhh
Q 020771 236 DDKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLP----GGKIVVFTGLLEHFRTDAEIATIIGHEPRGL 311 (321)
Q Consensus 236 ~D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalP----gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv 311 (321)
++|++.+.++. ++...+++ ..++++++++.+|||+++ ++.|+|++|||+.+ ++|||++|||||+||+
T Consensus 78 ~~p~L~~~v~~------la~~~gip--~p~v~v~~~~~~NAfa~G~~~~~~~v~vt~gLl~~l-~~~El~aVlAHElgHi 148 (298)
T PRK04897 78 EAPELWHIVED------MAMVAQIP--MPRVFIIDDPSPNAFATGSSPKNAAVAVTTGLLAIM-NREELEGVIGHEISHI 148 (298)
T ss_pred hhHHHHHHHHH------HHHHcCCC--CCcEEEecCCCCceEEeccCCCCcEEEeehHHHhhC-CHHHHHHHHHHHHHHH
Confidence 45666666653 34445554 358999999999999997 35799999999999 8999999999999999
Q ss_pred hcccCC
Q 020771 312 QRICGL 317 (321)
Q Consensus 312 ~~rH~l 317 (321)
.++|..
T Consensus 149 ~~~d~~ 154 (298)
T PRK04897 149 RNYDIR 154 (298)
T ss_pred hcCCHH
Confidence 999964
No 12
>PRK05457 heat shock protein HtpX; Provisional
Probab=99.18 E-value=4.2e-11 Score=114.13 Aligned_cols=61 Identities=25% Similarity=0.347 Sum_probs=51.9
Q ss_pred ccccCCCCCCeEEEEEeCCCCceEEeCC----CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccCC
Q 020771 254 SATSHLDGLNWEVLVVNEPVINAFCLPG----GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICGL 317 (321)
Q Consensus 254 ~~~s~l~~~~w~v~Vv~~~~~NAfalPg----G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~l 317 (321)
+...+++ ..+++|++++.+|||+++. +.|+|++|||+.+ |+|||++|||||+||+.++|..
T Consensus 87 a~~~g~p--~p~v~v~~~~~~NAfa~G~~~~~~~V~vt~gLl~~L-~~~El~aVlAHElgHi~~~d~~ 151 (284)
T PRK05457 87 ARQAGIG--MPEVAIYHSPEINAFATGASKNNSLVAVSTGLLQNM-SRDEVEAVLAHEISHIANGDMV 151 (284)
T ss_pred HHhCCCC--CCCEEEEeCCCceEEEecCCCCCeEEEeehHHhhhC-CHHHHHHHHHHHHHHHHcCCHH
Confidence 3444544 4579999999999999973 3589999999999 8999999999999999999964
No 13
>PRK02391 heat shock protein HtpX; Provisional
Probab=99.17 E-value=4.6e-11 Score=114.54 Aligned_cols=73 Identities=21% Similarity=0.232 Sum_probs=58.1
Q ss_pred chHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeCC----CeEEEcHhHHhhcCCHHHHHHHHHHHHHhh
Q 020771 236 DDKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLPG----GKIVVFTGLLEHFRTDAEIATIIGHEPRGL 311 (321)
Q Consensus 236 ~D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalPg----G~I~V~tGLL~~~~~edeLAaVLaHEiaHv 311 (321)
++|++.+.++. ++...+.+ ..+++|++++.+|||+++. +.|+|++||++.+ |+|||.+|||||+||+
T Consensus 74 ~~p~L~~~v~~------la~~~~~~--~p~v~v~~~~~~NAfa~G~~~~~~~V~vt~gLl~~L-~~~El~aVlaHElgHi 144 (296)
T PRK02391 74 EYPELHAMVER------LCALADLP--KPRVAVADSDVPNAFATGRSPKNAVVCVTTGLMRRL-DPDELEAVLAHELSHV 144 (296)
T ss_pred hCHHHHHHHHH------HHHHcCCC--CCcEEEEeCCCCceEEecCCCCCcEEEecHHHHhhC-CHHHHHHHHHHHHHHH
Confidence 35666666543 33444544 2489999999999999963 3589999999999 7999999999999999
Q ss_pred hcccCC
Q 020771 312 QRICGL 317 (321)
Q Consensus 312 ~~rH~l 317 (321)
.++|.+
T Consensus 145 ~~~di~ 150 (296)
T PRK02391 145 KNRDVA 150 (296)
T ss_pred HcCCHH
Confidence 999974
No 14
>PRK01265 heat shock protein HtpX; Provisional
Probab=99.14 E-value=7.9e-11 Score=114.61 Aligned_cols=72 Identities=22% Similarity=0.280 Sum_probs=58.1
Q ss_pred hHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeC----CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771 237 DKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLP----GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 237 D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalP----gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~ 312 (321)
++++.+-++. ++...+++ ..+++|++++.+||||.+ ++.|+|++|||+.+ |+|||.+|||||+||+.
T Consensus 82 ~~~L~~~v~~------la~~~g~~--~p~vyv~~~~~~NAfa~G~~~~~~~Ivvt~gLl~~l-~~~El~aVlAHElgHik 152 (324)
T PRK01265 82 YGWLYSIVAE------VAKYNGIR--VPKVYIADVPFPNAFAYGSPIAGKRIAITLPLLKIL-NRDEIKAVAGHELGHLK 152 (324)
T ss_pred cHHHHHHHHH------HHHHcCCC--CCeEEEecCCCCCeEEeccCCCCCEEEEehHHHhhC-CHHHHHHHHHHHHHHHH
Confidence 4556655543 33444554 458999999999999973 56899999999999 89999999999999999
Q ss_pred cccCC
Q 020771 313 RICGL 317 (321)
Q Consensus 313 ~rH~l 317 (321)
++|..
T Consensus 153 ~~d~~ 157 (324)
T PRK01265 153 HRDVE 157 (324)
T ss_pred cccHH
Confidence 99974
No 15
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=97.15 E-value=0.00041 Score=70.24 Aligned_cols=54 Identities=20% Similarity=0.237 Sum_probs=43.1
Q ss_pred EEEEEeC----CCCceEEeCC---CeEEEcHhHHh-hc-CCHHHHHHHHHHHHHhhhcccCCC
Q 020771 265 EVLVVNE----PVINAFCLPG---GKIVVFTGLLE-HF-RTDAEIATIIGHEPRGLQRICGLP 318 (321)
Q Consensus 265 ~v~Vv~~----~~~NAfalPg---G~I~V~tGLL~-~~-~~edeLAaVLaHEiaHv~~rH~l~ 318 (321)
.++|++. ...||+-.+= -.|++|.-|+. .. -|+||++||+|||+||-.++|.++
T Consensus 236 k~~vi~~s~rs~hsNAyfyG~~~~KRIvIyDtLl~~~~~~~~eel~AVl~HELGHW~~~H~~K 298 (428)
T KOG2719|consen 236 KYRVIDGSKRSSHSNAYFYGLCKNKRIVIYDTLLLEEEHLNNEELVAVLAHELGHWKLNHVLK 298 (428)
T ss_pred EEEEEecCCCCCCCCeeeeeccccceEEEehhhhhhhhccccHHHHHHHHHHhhHHHHhhHHH
Confidence 5677772 3589998864 36999999993 33 389999999999999999999753
No 16
>PF13203 DUF2201_N: Putative metallopeptidase domain
Probab=96.46 E-value=0.0043 Score=58.78 Aligned_cols=51 Identities=16% Similarity=0.082 Sum_probs=45.5
Q ss_pred EEEEeCCCCceEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccCC
Q 020771 266 VLVVNEPVINAFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICGL 317 (321)
Q Consensus 266 v~Vv~~~~~NAfalPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~l 317 (321)
+.++.++.+..+++-|..|+++-..+..+ +.+|+.+||.||+-|++.+|..
T Consensus 27 l~~~~~~~~~t~~tDg~~l~~nP~~~~~l-~~~~~~~~l~HevlH~~~~H~~ 77 (292)
T PF13203_consen 27 LPPVEDDGIPTAATDGRRLYYNPEFLESL-SPEERVGLLLHEVLHCLLRHPW 77 (292)
T ss_pred CeeeeCCCCceeeEcCcEEEECcHHHhcC-CHHHHHHHHHHHHHHHHccchh
Confidence 44555888899999999999999999888 7999999999999999999974
No 17
>PF05569 Peptidase_M56: BlaR1 peptidase M56; InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription.
Probab=96.07 E-value=0.007 Score=57.37 Aligned_cols=51 Identities=24% Similarity=0.240 Sum_probs=40.7
Q ss_pred EEEEeCCC-CceEEeCC--CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccCC
Q 020771 266 VLVVNEPV-INAFCLPG--GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICGL 317 (321)
Q Consensus 266 v~Vv~~~~-~NAfalPg--G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~l 317 (321)
+.++.++. ..+|+++- .+|++..++++.+ +++|+..||.||++|+.++|.+
T Consensus 160 ~~i~~s~~i~sP~~~G~~~p~I~lP~~~~~~~-~~~el~~il~HEl~Hikr~D~~ 213 (299)
T PF05569_consen 160 IRIRVSSGISSPFVFGFLRPVIVLPESLLEDL-SEEELRAILLHELAHIKRRDLL 213 (299)
T ss_pred eEEEEcCCCCCCeeecCcceEEEecCcccccc-CHHHHHHHHHHHHHHHHCCChH
Confidence 44444443 46777742 5899999999888 8999999999999999999874
No 18
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=95.19 E-value=0.039 Score=53.65 Aligned_cols=43 Identities=16% Similarity=0.207 Sum_probs=32.3
Q ss_pred CCceEEeCC-CeEEEcHhHHhhcC------CHHHHHHHHHHHHHhhhccc
Q 020771 273 VINAFCLPG-GKIVVFTGLLEHFR------TDAEIATIIGHEPRGLQRIC 315 (321)
Q Consensus 273 ~~NAfalPg-G~I~V~tGLL~~~~------~edeLAaVLaHEiaHv~~rH 315 (321)
..|||-+|. .+|++-+.+++.++ .+-..+.|||||.||.+...
T Consensus 136 ~~gpFYCp~D~tIYlD~~f~~~L~~~~ga~G~~a~ayVlAHEyGHHVQ~l 185 (292)
T PF04228_consen 136 ATGPFYCPADQTIYLDLSFFDELQQRFGASGDFAQAYVLAHEYGHHVQNL 185 (292)
T ss_pred CCCCEeCCCCCEEEechHHHHHHHHHhCCccHHHHHHHHHHHHHHHHHHH
Confidence 357787775 68999998876653 24456789999999987654
No 19
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=94.48 E-value=0.079 Score=41.74 Aligned_cols=48 Identities=15% Similarity=0.187 Sum_probs=30.6
Q ss_pred EEEEEeCCC--CceEEeC---CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccCC
Q 020771 265 EVLVVNEPV--INAFCLP---GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICGL 317 (321)
Q Consensus 265 ~v~Vv~~~~--~NAfalP---gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~l 317 (321)
+|...+... ...++.+ .+.|+|.+. .++.+..++|+||+||+..+|..
T Consensus 7 ~v~~~~~~~~~~~~~~~~~~~~~~I~in~~-----~~~~~~~f~laHELgH~~~~~~~ 59 (122)
T PF06114_consen 7 DVVYTDLGPKDLGGFSIPKENNPIIFINSN-----LSPERQRFTLAHELGHILLHHGD 59 (122)
T ss_dssp EEEEE--TT--EEEETT----TTEEEEESS-----S-HHHHHHHHHHHHHHHHHHH-H
T ss_pred EEEEEcCCCCCcCEEEEecCcCCEEEECCC-----CCHHHHHHHHHHHHHHHHhhhcc
Confidence 444444433 3444444 567887766 36888999999999999987753
No 20
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=92.71 E-value=0.09 Score=49.14 Aligned_cols=34 Identities=24% Similarity=0.098 Sum_probs=26.0
Q ss_pred CCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh-cccC
Q 020771 280 PGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ-RICG 316 (321)
Q Consensus 280 PgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~-~rH~ 316 (321)
|++.|.| .|+-. .+-+.++.||.|||||.+ .||+
T Consensus 116 P~~~I~I-~~~~~--~~~~~~~hvi~HEiGH~IGfRHT 150 (211)
T PF12388_consen 116 PYKFIQI-YGLSN--YSVNVIEHVITHEIGHCIGFRHT 150 (211)
T ss_pred CCceEEE-EecCC--CchhHHHHHHHHHhhhhcccccc
Confidence 4567888 46422 267889999999999987 8885
No 21
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=92.37 E-value=0.086 Score=41.75 Aligned_cols=39 Identities=21% Similarity=0.264 Sum_probs=27.9
Q ss_pred CCceEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcc
Q 020771 273 VINAFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRI 314 (321)
Q Consensus 273 ~~NAfalPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~r 314 (321)
...||++ |+.|++-.|-... +..+=..+|+||++|+...
T Consensus 37 ~A~A~T~-G~~I~f~~g~~~~--~s~~~~~llaHEl~Hv~Qq 75 (79)
T PF13699_consen 37 GARAFTV-GNDIYFAPGKYNP--DSPEGRALLAHELAHVVQQ 75 (79)
T ss_pred CCeEEEE-CCEEEEcCCCcCC--CCCCcchhHhHHHHHHHhh
Confidence 4678888 7899997775432 2333457899999999753
No 22
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=91.22 E-value=0.56 Score=40.13 Aligned_cols=31 Identities=19% Similarity=0.233 Sum_probs=27.8
Q ss_pred eEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcc
Q 020771 283 KIVVFTGLLEHFRTDAEIATIIGHEPRGLQRI 314 (321)
Q Consensus 283 ~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~r 314 (321)
.|.++.-+++. .++++|..+|.|||+|+...
T Consensus 44 ~I~ls~~~~~~-~~~~~~~~tL~HEm~H~~~~ 74 (157)
T PF10263_consen 44 EIRLSPKLLDR-NPEEELIDTLLHEMAHAAAY 74 (157)
T ss_pred EEEECHHHHHh-hHHHHHHHHHHHHHHHHHhh
Confidence 69999999998 47999999999999999763
No 23
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=90.29 E-value=0.34 Score=41.85 Aligned_cols=32 Identities=16% Similarity=0.062 Sum_probs=28.5
Q ss_pred CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhc
Q 020771 281 GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQR 313 (321)
Q Consensus 281 gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~ 313 (321)
.+.|.++.-|++.. .++.|..||.|||+|+..
T Consensus 41 ~~~I~ln~~l~~~~-~~~~l~~~l~HEm~H~~~ 72 (146)
T smart00731 41 SAEIRLNPKLLTEN-GRDRLRETLLHELCHAAL 72 (146)
T ss_pred CCEEEeCHHHHhhc-cHHHHHhhHHHHHHHHHH
Confidence 68899999988866 689999999999999986
No 24
>PRK04351 hypothetical protein; Provisional
Probab=89.87 E-value=0.45 Score=42.05 Aligned_cols=33 Identities=27% Similarity=0.283 Sum_probs=28.8
Q ss_pred eCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771 279 LPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 279 lPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~ 312 (321)
+....|-++.-+++.. +++.|..||.||++|+.
T Consensus 41 l~~~~I~lnp~ll~~~-~~~~l~~vv~HElcH~~ 73 (149)
T PRK04351 41 LKDHHIEFNPKMLEEY-GLEELIGIIKHELCHYH 73 (149)
T ss_pred cCCCeEEeCHHHHhhc-cHHHHHhhHHHHHHHHH
Confidence 4557899999999876 79999999999999984
No 25
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=86.76 E-value=0.44 Score=40.10 Aligned_cols=22 Identities=9% Similarity=0.010 Sum_probs=18.0
Q ss_pred CCHHHHHHHHHHHHHhhh-cccC
Q 020771 295 RTDAEIATIIGHEPRGLQ-RICG 316 (321)
Q Consensus 295 ~~edeLAaVLaHEiaHv~-~rH~ 316 (321)
.+...+..|+.|||||++ +.|.
T Consensus 100 ~~~~~~~~v~~HEiGHaLGL~H~ 122 (154)
T PF00413_consen 100 DSGNDLQSVAIHEIGHALGLDHS 122 (154)
T ss_dssp SSSEEHHHHHHHHHHHHTTBESS
T ss_pred hhhhhhhhhhhhccccccCcCcC
Confidence 355678999999999997 7775
No 26
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=86.21 E-value=0.55 Score=43.92 Aligned_cols=48 Identities=21% Similarity=0.165 Sum_probs=35.8
Q ss_pred eCCCCceEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh-cccCCCC
Q 020771 270 NEPVINAFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ-RICGLPF 319 (321)
Q Consensus 270 ~~~~~NAfalPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~-~rH~l~~ 319 (321)
+-...=+-|.+.|.|.+.+-|.. + .++.+..|+.||++|+. ..|+-.|
T Consensus 147 ~~k~~WGScs~~~~i~~~~~l~~-~-p~~~i~YVvvHELaHLke~nHs~~F 195 (223)
T COG1451 147 NMKRRWGSCSKAGEIRFNWRLVM-A-PEEVIDYVVVHELAHLKEKNHSKRF 195 (223)
T ss_pred eccceeeeecCCCcEEeehhhhc-C-CHHHHHHHHHHHHHHHhhhhccHHH
Confidence 34455677888887777776655 3 69999999999999997 5555444
No 27
>PF10026 DUF2268: Predicted Zn-dependent protease (DUF2268); InterPro: IPR018728 This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function.
Probab=85.48 E-value=1.4 Score=39.83 Aligned_cols=33 Identities=27% Similarity=0.273 Sum_probs=25.9
Q ss_pred CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccC
Q 020771 282 GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICG 316 (321)
Q Consensus 282 G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~ 316 (321)
|+|+++- +..-.+.++|.++||||.-|+.+-..
T Consensus 49 ~~i~l~~--~~~~~~~~~l~~~iaHE~hH~~r~~~ 81 (195)
T PF10026_consen 49 GYIFLFL--LPNDYSLEELPALIAHEYHHNCRYEQ 81 (195)
T ss_pred CEEEEEe--cCCcccHHHHHHHHHHHHHHHHHHhc
Confidence 5788775 45555899999999999999976443
No 28
>COG4219 MecR1 Antirepressor regulating drug resistance, predicted signal transduction N-terminal membrane component [Transcription / Signal transduction mechanisms]
Probab=84.83 E-value=1.3 Score=44.14 Aligned_cols=50 Identities=30% Similarity=0.282 Sum_probs=38.8
Q ss_pred EEEEeCC---CCceEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccC
Q 020771 266 VLVVNEP---VINAFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICG 316 (321)
Q Consensus 266 v~Vv~~~---~~NAfalPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~ 316 (321)
+.+..++ .+-.|.++--.|++-+.+.+.+ +++|+.-|++||.+|+.++..
T Consensus 154 i~ir~s~~i~~P~v~gl~kp~IvlP~d~~~r~-~~ee~~yIilHEl~Hlk~gD~ 206 (337)
T COG4219 154 ILIRKSKAIDGPMVFGLVKPCIVLPADFVERL-TDEELKYIILHELSHLKRGDA 206 (337)
T ss_pred eeEeecccCCCceeeccCcceEEccHHHHhhc-CHHhhhhhHhHHHhhhhcccH
Confidence 4444443 3445555667899999999999 899999999999999987654
No 29
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=84.63 E-value=0.63 Score=40.01 Aligned_cols=20 Identities=15% Similarity=-0.031 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHHHhhh-cccC
Q 020771 297 DAEIATIIGHEPRGLQ-RICG 316 (321)
Q Consensus 297 edeLAaVLaHEiaHv~-~rH~ 316 (321)
...+..|+.|||||++ +.|.
T Consensus 101 ~~~~~~~~~HEiGHaLGL~H~ 121 (156)
T cd04279 101 AENLQAIALHELGHALGLWHH 121 (156)
T ss_pred chHHHHHHHHHhhhhhcCCCC
Confidence 5679999999999997 6665
No 30
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=84.39 E-value=1.1 Score=35.37 Aligned_cols=22 Identities=9% Similarity=-0.086 Sum_probs=18.3
Q ss_pred CCHHHHHHHHHHHHHhhhcccC
Q 020771 295 RTDAEIATIIGHEPRGLQRICG 316 (321)
Q Consensus 295 ~~edeLAaVLaHEiaHv~~rH~ 316 (321)
.+++.+..||+||++|+.....
T Consensus 20 ~~~~~~~~~l~HE~~H~~~~~~ 41 (128)
T PF13485_consen 20 SDEDWLDRVLAHELAHQWFGNY 41 (128)
T ss_pred CCHHHHHHHHHHHHHHHHHHHH
Confidence 5788899999999999975543
No 31
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=84.37 E-value=0.64 Score=41.26 Aligned_cols=54 Identities=17% Similarity=0.161 Sum_probs=40.5
Q ss_pred EEEEEeCCCCceEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh-cccCCCCC
Q 020771 265 EVLVVNEPVINAFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ-RICGLPFY 320 (321)
Q Consensus 265 ~v~Vv~~~~~NAfalPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~-~rH~l~~~ 320 (321)
++.|-+...-=.-|...|.|-++.-|+.. .++-+..|+.||++|+. .+|+-.||
T Consensus 131 ~i~ir~~ksrWGsc~~~~~I~ln~~L~~~--P~~~idYVvvHEL~Hl~~~nHs~~Fw 185 (205)
T PF01863_consen 131 KIKIRDMKSRWGSCSSKGNITLNWRLVMA--PPEVIDYVVVHELCHLRHPNHSKRFW 185 (205)
T ss_pred eEEEeehhhccccCCCCCcEEeecccccC--CccHHHHHHHHHHHHhccCCCCHHHH
Confidence 45555555555557557999999988773 68899999999999995 66765554
No 32
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=83.24 E-value=1.1 Score=43.81 Aligned_cols=42 Identities=19% Similarity=0.257 Sum_probs=32.9
Q ss_pred CCceEEeCC-CeEEEcHhHHhhcC------CHHHHHHHHHHHHHhhhcc
Q 020771 273 VINAFCLPG-GKIVVFTGLLEHFR------TDAEIATIIGHEPRGLQRI 314 (321)
Q Consensus 273 ~~NAfalPg-G~I~V~tGLL~~~~------~edeLAaVLaHEiaHv~~r 314 (321)
..--|-+|+ .++++...+.+.|+ .|=.-|.|+|||++|.+.+
T Consensus 133 asGPFYCP~D~kvYlDlsFf~~m~~~fga~GdfAqaYViAHEVGHHVQn 181 (295)
T COG2321 133 ASGPFYCPADTKVYLDLSFFDEMKTKFGASGDFAQAYVIAHEVGHHVQN 181 (295)
T ss_pred CCCCeecCCCceEEEehhHHHHHHHHhcCCccHHHHHHHHhhhhHHHHH
Confidence 344567775 68999999999885 4666789999999998753
No 33
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=82.60 E-value=1 Score=41.45 Aligned_cols=36 Identities=14% Similarity=0.309 Sum_probs=27.7
Q ss_pred CeEEEcHhHHhhcCC------HHHHHHHHHHHHHhhh-cccCC
Q 020771 282 GKIVVFTGLLEHFRT------DAEIATIIGHEPRGLQ-RICGL 317 (321)
Q Consensus 282 G~I~V~tGLL~~~~~------edeLAaVLaHEiaHv~-~rH~l 317 (321)
+.|.++++.+..... .+-|..+++||+||.+ +.|..
T Consensus 92 a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lGl~hn~ 134 (197)
T cd04276 92 ADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLGLRHNF 134 (197)
T ss_pred EEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhcCcccc
Confidence 678899998866532 3568899999999987 77753
No 34
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=82.59 E-value=0.99 Score=38.48 Aligned_cols=21 Identities=14% Similarity=0.034 Sum_probs=17.2
Q ss_pred HHHHHHHHHHHHHhhh-cccCC
Q 020771 297 DAEIATIIGHEPRGLQ-RICGL 317 (321)
Q Consensus 297 edeLAaVLaHEiaHv~-~rH~l 317 (321)
.+.+-+|+.|||||++ +.|..
T Consensus 91 ~~~~~~~~~HEiGHaLGL~H~~ 112 (165)
T cd04268 91 GARLRNTAEHELGHALGLRHNF 112 (165)
T ss_pred HHHHHHHHHHHHHHHhcccccC
Confidence 4678999999999998 66654
No 35
>PF04450 BSP: Peptidase of plants and bacteria; InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=82.46 E-value=3.2 Score=38.43 Aligned_cols=45 Identities=16% Similarity=0.184 Sum_probs=33.4
Q ss_pred EEeCCCCceEEeCC----CeEEEcHhHHhhcCC----HHHHHHHHHHHHHhhhc
Q 020771 268 VVNEPVINAFCLPG----GKIVVFTGLLEHFRT----DAEIATIIGHEPRGLQR 313 (321)
Q Consensus 268 Vv~~~~~NAfalPg----G~I~V~tGLL~~~~~----edeLAaVLaHEiaHv~~ 313 (321)
++++-.-=|++. | ..|.+++..|....+ .+|+..||-|||.|+.+
T Consensus 57 ~~~~~~gVA~t~-gd~~~~~I~~S~~~i~~~~~~~~~~~Ei~Gvl~HE~~H~~Q 109 (205)
T PF04450_consen 57 ILDDMDGVAYTS-GDDDHKEIHFSARYIAKYPADGDVRDEIIGVLYHEMVHCWQ 109 (205)
T ss_pred EEECCCeeEEEe-cCCCccEEEEeHHHHhhcccccchHHHHHHHHHHHHHHHhh
Confidence 445433334444 3 689999999998853 46999999999999965
No 36
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=81.43 E-value=0.43 Score=50.56 Aligned_cols=46 Identities=28% Similarity=0.358 Sum_probs=32.3
Q ss_pred CCceEEeCCCe-EEEcHhHHh--hcC-------CHHHHHHHHHHHHHhhhcccCCC
Q 020771 273 VINAFCLPGGK-IVVFTGLLE--HFR-------TDAEIATIIGHEPRGLQRICGLP 318 (321)
Q Consensus 273 ~~NAfalPgG~-I~V~tGLL~--~~~-------~edeLAaVLaHEiaHv~~rH~l~ 318 (321)
.+|||.-|+-+ ||.-.+||. .++ |=-.+-+||||||+|.--.-+.+
T Consensus 450 ~VNAYYnp~~N~IVFPAAILQ~PFfd~ea~~a~NYGgIGaVIgHEI~HgFDdqGak 505 (654)
T COG3590 450 TVNAYYNPQKNEIVFPAAILQAPFFDPEADSAANYGGIGAVIGHEIGHGFDDQGAK 505 (654)
T ss_pred HhhhhcCCCCceEeeeHHhcCCCCCCCCcchhhcccCccceehhhhcccccCCccc
Confidence 58999999854 666666664 221 33468899999999986554444
No 37
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=80.31 E-value=11 Score=35.51 Aligned_cols=50 Identities=14% Similarity=0.100 Sum_probs=36.0
Q ss_pred CeEEEEEeCCCCceEEeCC-CeEEEcHhHHhhcC--------C--------HHHHHHHHHHHHHhhh
Q 020771 263 NWEVLVVNEPVINAFCLPG-GKIVVFTGLLEHFR--------T--------DAEIATIIGHEPRGLQ 312 (321)
Q Consensus 263 ~w~v~Vv~~~~~NAfalPg-G~I~V~tGLL~~~~--------~--------edeLAaVLaHEiaHv~ 312 (321)
+..+.+-.-.+.|||--|. ..|.+...+++.+. + .+-+..+|-||+||+.
T Consensus 38 ~l~i~~~~CGe~nA~ydPe~~~I~iCYEf~~~~~~~f~~~~~~~~~~~~~~~~~~~~~l~HE~GHAl 104 (220)
T PF14247_consen 38 DLTIRFAECGEDNAFYDPENRSITICYEFVDEILDRFAKANDPDEEYGQAAIGNVLFTLYHELGHAL 104 (220)
T ss_pred CeEEEEeecCCCCCccCCCCCEEEECHHHHHHHHHHHHhCCcCcchHHHHHHHHHHHHHHHHHHHHH
Confidence 3445444567899999995 67998888776442 1 2346678999999985
No 38
>PRK04860 hypothetical protein; Provisional
Probab=79.14 E-value=2.1 Score=38.37 Aligned_cols=32 Identities=25% Similarity=0.150 Sum_probs=26.0
Q ss_pred CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcc
Q 020771 282 GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRI 314 (321)
Q Consensus 282 G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~r 314 (321)
+.|-++--+|... ++++|..||.||++|+...
T Consensus 46 ~~I~~Np~ll~~~-~~~~l~~~v~HEl~H~~~~ 77 (160)
T PRK04860 46 NEIRLNPVLLLEN-QQAFIDEVVPHELAHLLVY 77 (160)
T ss_pred CCeeeCHHHHhhC-cHHHHHhHHHHHHHHHHHH
Confidence 5677777777766 7999999999999998643
No 39
>PF10463 Peptidase_U49: Peptidase U49; InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death.
Probab=78.06 E-value=1.4 Score=41.13 Aligned_cols=21 Identities=24% Similarity=-0.016 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHhhhcccCCC
Q 020771 298 AEIATIIGHEPRGLQRICGLP 318 (321)
Q Consensus 298 deLAaVLaHEiaHv~~rH~l~ 318 (321)
-.++.+|.||++|+..+|...
T Consensus 99 ~A~~fil~HE~~Hv~~~h~~~ 119 (206)
T PF10463_consen 99 CAIAFILLHELAHVVLGHEGD 119 (206)
T ss_pred HHHHHHHHHHHHHHHHcCccc
Confidence 357899999999999999754
No 40
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=76.61 E-value=2.3 Score=39.71 Aligned_cols=30 Identities=10% Similarity=-0.065 Sum_probs=22.9
Q ss_pred CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccC
Q 020771 282 GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICG 316 (321)
Q Consensus 282 G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~ 316 (321)
+.|++++- .+++.--++||||+||++++-.
T Consensus 59 ~~I~iN~n-----~~~~r~rFtlAHELGH~llH~~ 88 (213)
T COG2856 59 PVIYINAN-----NSLERKRFTLAHELGHALLHTD 88 (213)
T ss_pred ceEEEeCC-----CCHHHHHHHHHHHHhHHHhccc
Confidence 46776544 3788888999999999986543
No 41
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=74.53 E-value=1.7 Score=37.45 Aligned_cols=21 Identities=10% Similarity=0.007 Sum_probs=16.8
Q ss_pred HHHHHHHHHHHHHhhh-cccCC
Q 020771 297 DAEIATIIGHEPRGLQ-RICGL 317 (321)
Q Consensus 297 edeLAaVLaHEiaHv~-~rH~l 317 (321)
...+..|+.|||||++ +.|.-
T Consensus 104 ~~~~~~~~~HEiGHaLGL~H~~ 125 (157)
T cd04278 104 GTDLFSVAAHEIGHALGLGHSS 125 (157)
T ss_pred cchHHHHHHHHhccccccCCCC
Confidence 3458899999999997 67753
No 42
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=74.39 E-value=2.3 Score=37.64 Aligned_cols=36 Identities=14% Similarity=-0.164 Sum_probs=23.2
Q ss_pred CeEEEcHhHHhhc-CCHHHHHHHHHHHHHhhh-cccCC
Q 020771 282 GKIVVFTGLLEHF-RTDAEIATIIGHEPRGLQ-RICGL 317 (321)
Q Consensus 282 G~I~V~tGLL~~~-~~edeLAaVLaHEiaHv~-~rH~l 317 (321)
|.|++........ ......-.++.|||||++ +.|.-
T Consensus 94 g~i~~~~~~~~~~~~~g~~~~~t~~HEiGHaLGL~H~~ 131 (186)
T cd04277 94 GDIWFNSSYDTNSDSPGSYGYQTIIHEIGHALGLEHPG 131 (186)
T ss_pred ceeEEecCcccccCCCChhhHHHHHHHHHHHhcCCCCC
Confidence 5666665433321 123556789999999998 67753
No 43
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=73.80 E-value=1.5 Score=37.23 Aligned_cols=20 Identities=25% Similarity=0.036 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHHhhh-cccC
Q 020771 297 DAEIATIIGHEPRGLQ-RICG 316 (321)
Q Consensus 297 edeLAaVLaHEiaHv~-~rH~ 316 (321)
...++.+++|||||++ +.|.
T Consensus 93 ~~~~~~~~~HElGH~LGl~H~ 113 (167)
T cd00203 93 TKEGAQTIAHELGHALGFYHD 113 (167)
T ss_pred cccchhhHHHHHHHHhCCCcc
Confidence 3578999999999997 5554
No 44
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=72.97 E-value=2.6 Score=37.38 Aligned_cols=19 Identities=16% Similarity=-0.130 Sum_probs=15.5
Q ss_pred HHHHHHHHHHHHHhhh-ccc
Q 020771 297 DAEIATIIGHEPRGLQ-RIC 315 (321)
Q Consensus 297 edeLAaVLaHEiaHv~-~rH 315 (321)
....|.++||||||.. ..|
T Consensus 128 ~~~~a~~~AHElGH~lG~~H 147 (194)
T cd04269 128 LLLFAVTMAHELGHNLGMEH 147 (194)
T ss_pred hHHHHHHHHHHHHhhcCCCc
Confidence 5678999999999987 444
No 45
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=72.96 E-value=2.3 Score=39.85 Aligned_cols=32 Identities=19% Similarity=0.059 Sum_probs=21.1
Q ss_pred EEEcHhHHhhcC-----CHHHHHHHHHHHHHhhh-ccc
Q 020771 284 IVVFTGLLEHFR-----TDAEIATIIGHEPRGLQ-RIC 315 (321)
Q Consensus 284 I~V~tGLL~~~~-----~edeLAaVLaHEiaHv~-~rH 315 (321)
++..+||..... .....|.+|||||||.. ..|
T Consensus 146 ~~~n~gl~t~~~~~~~~~~~~~a~t~AHElGHnlGm~H 183 (244)
T cd04270 146 KYLNTGLTTTVNYGKRVPTKESDLVTAHELGHNFGSPH 183 (244)
T ss_pred eeeecceEeeeccCCccchhHHHHHHHHHHHHhcCCCC
Confidence 455667664221 24567899999999986 344
No 46
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=72.66 E-value=1.9 Score=35.13 Aligned_cols=14 Identities=14% Similarity=-0.126 Sum_probs=10.3
Q ss_pred HHHHHHHHhhh-ccc
Q 020771 302 TIIGHEPRGLQ-RIC 315 (321)
Q Consensus 302 aVLaHEiaHv~-~rH 315 (321)
.+++|||||.. +.|
T Consensus 109 ~~~~HEiGH~lGl~H 123 (124)
T PF13582_consen 109 DTFAHEIGHNLGLNH 123 (124)
T ss_dssp THHHHHHHHHTT---
T ss_pred eEeeehhhHhcCCCC
Confidence 89999999986 555
No 47
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=72.12 E-value=2.1 Score=37.69 Aligned_cols=23 Identities=9% Similarity=0.031 Sum_probs=14.8
Q ss_pred CHHHHHHHHHHHHHhhh-cccCCC
Q 020771 296 TDAEIATIIGHEPRGLQ-RICGLP 318 (321)
Q Consensus 296 ~edeLAaVLaHEiaHv~-~rH~l~ 318 (321)
+.....-+|.||+||.+ +.|+.+
T Consensus 65 ~~~~~g~TltHEvGH~LGL~HtF~ 88 (154)
T PF05572_consen 65 SQYNFGKTLTHEVGHWLGLYHTFG 88 (154)
T ss_dssp TTS-SSHHHHHHHHHHTT---TT-
T ss_pred Cccccccchhhhhhhhhccccccc
Confidence 45566899999999987 888753
No 48
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=71.98 E-value=2.2 Score=37.62 Aligned_cols=20 Identities=15% Similarity=-0.122 Sum_probs=14.0
Q ss_pred CHHHHHHHHHHHHHhhh-ccc
Q 020771 296 TDAEIATIIGHEPRGLQ-RIC 315 (321)
Q Consensus 296 ~edeLAaVLaHEiaHv~-~rH 315 (321)
....-+-|+||||||.. +.|
T Consensus 138 ~~~~~~~~~AHEiGH~lGa~H 158 (196)
T PF13688_consen 138 PTYNGAITFAHEIGHNLGAPH 158 (196)
T ss_dssp -HHHHHHHHHHHHHHHTT---
T ss_pred CCCceehhhHHhHHHhcCCCC
Confidence 45777889999999986 444
No 49
>KOG3624 consensus M13 family peptidase [Amino acid transport and metabolism]
Probab=71.27 E-value=3.2 Score=44.17 Aligned_cols=47 Identities=32% Similarity=0.390 Sum_probs=36.7
Q ss_pred CCceEEeCC-CeEEEcHhHHhh----c-----CCHHHHHHHHHHHHHhhhcccCCCC
Q 020771 273 VINAFCLPG-GKIVVFTGLLEH----F-----RTDAEIATIIGHEPRGLQRICGLPF 319 (321)
Q Consensus 273 ~~NAfalPg-G~I~V~tGLL~~----~-----~~edeLAaVLaHEiaHv~~rH~l~~ 319 (321)
.+|||-.|. ..|+|.-|+|.. . -+=.-+.+||||||+|.--.++.++
T Consensus 481 ~~na~Y~~~~N~i~~pa~ilq~P~f~~~~P~~~nyg~iG~vigHEl~H~FD~~G~~~ 537 (687)
T KOG3624|consen 481 QVNAFYSPEKNEIVFPAGLLQPPFFDLSYPDYLNYGGIGFVIGHELTHGFDDQGRQY 537 (687)
T ss_pred eeeccccCCCceEEEehhcccCCCCCcccchhhhhHHHHHHHHHHHhhccccccccc
Confidence 589988775 578999999885 1 1567789999999999887776654
No 50
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=70.70 E-value=3.4 Score=36.85 Aligned_cols=21 Identities=29% Similarity=0.191 Sum_probs=15.7
Q ss_pred CCHHHHHHHHHHHHHhhh-ccc
Q 020771 295 RTDAEIATIIGHEPRGLQ-RIC 315 (321)
Q Consensus 295 ~~edeLAaVLaHEiaHv~-~rH 315 (321)
.+...+|.+|||||||.. ..|
T Consensus 126 ~~~~~~a~~~AHelGH~lGm~H 147 (199)
T PF01421_consen 126 RSGLSFAVIIAHELGHNLGMPH 147 (199)
T ss_dssp SSHHHHHHHHHHHHHHHTT---
T ss_pred chhHHHHHHHHHHHHHhcCCCC
Confidence 467889999999999986 444
No 51
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=69.51 E-value=2.5 Score=38.70 Aligned_cols=16 Identities=13% Similarity=-0.058 Sum_probs=13.1
Q ss_pred HHHHHHHHhhh-cccCC
Q 020771 302 TIIGHEPRGLQ-RICGL 317 (321)
Q Consensus 302 aVLaHEiaHv~-~rH~l 317 (321)
-+++|||||.. ++|..
T Consensus 139 ~~~aHEiGH~lGl~H~~ 155 (206)
T PF13583_consen 139 QTFAHEIGHNLGLRHDF 155 (206)
T ss_pred hHHHHHHHHHhcCCCCc
Confidence 45999999997 77764
No 52
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=68.17 E-value=2.1 Score=37.77 Aligned_cols=20 Identities=15% Similarity=-0.085 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHHhhh-cccC
Q 020771 297 DAEIATIIGHEPRGLQ-RICG 316 (321)
Q Consensus 297 edeLAaVLaHEiaHv~-~rH~ 316 (321)
.-..+.++|||+||.. ..|.
T Consensus 130 ~~~~~~~~aHElGH~lG~~HD 150 (192)
T cd04267 130 TLLTALTMAHELGHNLGAEHD 150 (192)
T ss_pred ceeehhhhhhhHHhhcCCcCC
Confidence 3567889999999987 5665
No 53
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=67.50 E-value=3.6 Score=36.33 Aligned_cols=41 Identities=24% Similarity=0.234 Sum_probs=23.7
Q ss_pred CCCCceEEeCCCeEEEcHhHHhhcCCHHHHHH---HHHHHHHhhhccc
Q 020771 271 EPVINAFCLPGGKIVVFTGLLEHFRTDAEIAT---IIGHEPRGLQRIC 315 (321)
Q Consensus 271 ~~~~NAfalPgG~I~V~tGLL~~~~~edeLAa---VLaHEiaHv~~rH 315 (321)
..-.|||.- |..|+.-.|--..+. .+++ |+||||+|.+-.|
T Consensus 107 ~~~~NAfW~-g~~m~yGdG~~~~f~---~~~~~lDVvaHEltHGVte~ 150 (150)
T PF01447_consen 107 KNYNNAFWN-GSQMVYGDGDGQIFK---PFASSLDVVAHELTHGVTEY 150 (150)
T ss_dssp SSTT-EEE--SSSEEEE---SSSBS----GGG-HHHHHHHHHHHHHHT
T ss_pred CCccCcccc-CCEEEEECCCCcccc---cCccccceeeeccccccccC
Confidence 346899996 567777766442332 3333 9999999987554
No 54
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=66.69 E-value=3.2 Score=34.73 Aligned_cols=35 Identities=17% Similarity=0.118 Sum_probs=20.8
Q ss_pred eEE-eCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh-cccC
Q 020771 276 AFC-LPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ-RICG 316 (321)
Q Consensus 276 Afa-lPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~-~rH~ 316 (321)
|++ .|+|.+-+..+ .-+. -..|+.|||||++ +.|.
T Consensus 67 a~~g~~~g~~~~~~~--~~~~----~~~~~~HEigHaLGl~H~ 103 (140)
T smart00235 67 SHAGRPGGDQHFSLG--NGCI----NTGVAAHELGHALGLYHE 103 (140)
T ss_pred eeeecCCCceEEEcc--CCcC----CcccHHHHHHHHhcCCcC
Confidence 444 35776666431 1121 2359999999997 5554
No 55
>PF14891 Peptidase_M91: Effector protein
Probab=66.42 E-value=3.3 Score=36.83 Aligned_cols=17 Identities=12% Similarity=-0.050 Sum_probs=13.6
Q ss_pred CHHHHHHHHHHHHHhhh
Q 020771 296 TDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 296 ~edeLAaVLaHEiaHv~ 312 (321)
.+..=+.||+|||+|+.
T Consensus 99 ~~~~p~v~L~HEL~HA~ 115 (174)
T PF14891_consen 99 SPRPPFVVLYHELIHAY 115 (174)
T ss_pred hhHHHHHHHHHHHHHHH
Confidence 34556789999999985
No 56
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=65.81 E-value=3.3 Score=36.74 Aligned_cols=17 Identities=12% Similarity=-0.136 Sum_probs=11.2
Q ss_pred HHHHHHHHHHhhh-cccC
Q 020771 300 IATIIGHEPRGLQ-RICG 316 (321)
Q Consensus 300 LAaVLaHEiaHv~-~rH~ 316 (321)
-.-++||||||.. ..|.
T Consensus 111 ~~~~~aHElGH~lGa~Hd 128 (173)
T PF13574_consen 111 GIDTFAHELGHQLGAPHD 128 (173)
T ss_dssp HHHHHHHHHHHHHT---S
T ss_pred eeeeehhhhHhhcCCCCC
Confidence 3456999999987 5664
No 57
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=63.54 E-value=4 Score=37.22 Aligned_cols=14 Identities=21% Similarity=-0.009 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHhhh
Q 020771 299 EIATIIGHEPRGLQ 312 (321)
Q Consensus 299 eLAaVLaHEiaHv~ 312 (321)
..+.++|||+||..
T Consensus 144 ~~~~~~AHElGH~l 157 (220)
T cd04272 144 YGVYTMTHELAHLL 157 (220)
T ss_pred ccHHHHHHHHHHHh
Confidence 45899999999987
No 58
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=61.52 E-value=8.3 Score=34.70 Aligned_cols=29 Identities=28% Similarity=0.304 Sum_probs=23.9
Q ss_pred eEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771 283 KIVVFTGLLEHFRTDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 283 ~I~V~tGLL~~~~~edeLAaVLaHEiaHv~ 312 (321)
.|=++-=||... .+|.|..|+.||++|..
T Consensus 45 ~I~lNP~ll~en-~~~f~~~vV~HELaHl~ 73 (156)
T COG3091 45 EIRLNPKLLEEN-GEDFIEQVVPHELAHLH 73 (156)
T ss_pred ccccCHHHHHHc-cHHHHHHHHHHHHHHHH
Confidence 566666677766 79999999999999985
No 59
>PF01433 Peptidase_M1: Peptidase family M1 This is family M1 in the peptidase classification.; InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=59.99 E-value=12 Score=36.19 Aligned_cols=51 Identities=20% Similarity=0.055 Sum_probs=28.0
Q ss_pred eEEEEEeCCCCceEEeCCCeEEEcHhHHhh--cC---CHHHHHHHHHHHHHhhhcc
Q 020771 264 WEVLVVNEPVINAFCLPGGKIVVFTGLLEH--FR---TDAEIATIIGHEPRGLQRI 314 (321)
Q Consensus 264 w~v~Vv~~~~~NAfalPgG~I~V~tGLL~~--~~---~edeLAaVLaHEiaHv~~r 314 (321)
..+.++.+-..+++.-+|-.++--..++-. .. ...+++.|||||+||.--+
T Consensus 254 ~~~v~~p~~~~~~me~~g~i~~~~~~l~~~~~~~~~~~~~~~~~~iahElahqWfG 309 (390)
T PF01433_consen 254 LDIVAVPDFPFGGMENWGLITYRESYLLYDPDISTIGDKQEIASLIAHELAHQWFG 309 (390)
T ss_dssp EEEEEEST-SSSEE--TTEEEEEGGGTS-STTTS-HHHHHHHHHHHHHHHHTTTBT
T ss_pred eeEEEEeccccccccccccccccccccccCcccccchhhhhhHHHHHHHHHHHHhc
Confidence 445555433356776666444444444422 11 2457899999999997544
No 60
>PRK09672 phage exclusion protein Lit; Provisional
Probab=59.78 E-value=6 Score=39.08 Aligned_cols=20 Identities=20% Similarity=0.086 Sum_probs=17.9
Q ss_pred HHHHHHHHHHHHhhhcccCC
Q 020771 298 AEIATIIGHEPRGLQRICGL 317 (321)
Q Consensus 298 deLAaVLaHEiaHv~~rH~l 317 (321)
..+|.|+-||++|+...|..
T Consensus 163 ~A~a~i~~HEiaHv~~~h~~ 182 (305)
T PRK09672 163 CALAWILLHEIAHVEFQHSS 182 (305)
T ss_pred HHHHHHHHHHHHHHHhcccc
Confidence 35899999999999999986
No 61
>PF08325 WLM: WLM domain; InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=59.30 E-value=4 Score=37.26 Aligned_cols=25 Identities=16% Similarity=-0.012 Sum_probs=20.0
Q ss_pred CHHHHHHHHHHHHHhhh-cccCCCCC
Q 020771 296 TDAEIATIIGHEPRGLQ-RICGLPFY 320 (321)
Q Consensus 296 ~edeLAaVLaHEiaHv~-~rH~l~~~ 320 (321)
+-+.|..||-||++|.. ..|...||
T Consensus 78 ~~~~i~~t~lHELaH~~~~~H~~~F~ 103 (186)
T PF08325_consen 78 PYETILGTMLHELAHNVHGPHDDKFW 103 (186)
T ss_pred eHHHHHHHHHHHHHhcccCCccHHHH
Confidence 56799999999999987 45666555
No 62
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=58.72 E-value=8.1 Score=41.01 Aligned_cols=32 Identities=19% Similarity=0.174 Sum_probs=21.8
Q ss_pred eEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhh
Q 020771 276 AFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGL 311 (321)
Q Consensus 276 AfalPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv 311 (321)
|+--|| .++....+|. .+.+++.|++|||||-
T Consensus 259 gMEN~~-ltf~~~~ll~---~d~s~~~viaHElAHq 290 (601)
T TIGR02411 259 GMENPN-LTFATPTLIA---GDRSNVDVIAHELAHS 290 (601)
T ss_pred cccccc-ceeecccccc---CChhhhhhHHHHHHhh
Confidence 444454 6666666653 3456789999999996
No 63
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=53.96 E-value=6.8 Score=34.36 Aligned_cols=47 Identities=17% Similarity=0.028 Sum_probs=23.3
Q ss_pred CCeEEEEEeCCC-CceEEeCCC--eEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771 262 LNWEVLVVNEPV-INAFCLPGG--KIVVFTGLLEHFRTDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 262 ~~w~v~Vv~~~~-~NAfalPgG--~I~V~tGLL~~~~~edeLAaVLaHEiaHv~ 312 (321)
.+..++...++. .=|-..+.| .|++-+.--..- +.+. |.+||+||++
T Consensus 40 a~~~~~~~~~~~~sya~~~g~G~G~I~l~~~~~qgy---~~~R-IaaHE~GHiL 89 (132)
T PF02031_consen 40 ADIRYYEGNDPRGSYASTDGLGSGYIFLDYQQNQGY---NSTR-IAAHELGHIL 89 (132)
T ss_dssp -SEEEEEE--TT--EEEE-SSS-EEEEEEHHHHHHS----HHH-HHHHHHHHHH
T ss_pred CcEEEEEecCCCCcccccCCCCcEEEEechHHhhCC---ccce-eeeehhcccc
Confidence 455555544433 335555554 566554432222 2222 8999999986
No 64
>KOG1047 consensus Bifunctional leukotriene A4 hydrolase/aminopeptidase LTA4H [Lipid transport and metabolism; Posttranslational modification, protein turnover, chaperones; Defense mechanisms; Amino acid transport and metabolism]
Probab=53.79 E-value=6.3 Score=41.99 Aligned_cols=27 Identities=26% Similarity=0.242 Sum_probs=21.2
Q ss_pred eEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771 283 KIVVFTGLLEHFRTDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 283 ~I~V~tGLL~~~~~edeLAaVLaHEiaHv~ 312 (321)
..||+--||. .|.-|+.||||||||.-
T Consensus 274 ltF~TpTlla---GDrsl~~vIaHEIAHSW 300 (613)
T KOG1047|consen 274 LTFVTPTLLA---GDRSLVDVIAHEIAHSW 300 (613)
T ss_pred eeeecchhhc---CCcchhhHHHHHhhhhh
Confidence 5666666665 67888999999999963
No 65
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=53.72 E-value=2.1 Score=38.74 Aligned_cols=16 Identities=25% Similarity=0.040 Sum_probs=13.0
Q ss_pred HHHHHHHHHHhhh-ccc
Q 020771 300 IATIIGHEPRGLQ-RIC 315 (321)
Q Consensus 300 LAaVLaHEiaHv~-~rH 315 (321)
.|.++||||||.. ..|
T Consensus 140 ~a~~~aHElGH~LG~~H 156 (207)
T cd04273 140 SAFTIAHELGHVLGMPH 156 (207)
T ss_pred eEEeeeeechhhcCCCC
Confidence 5789999999987 444
No 66
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=52.75 E-value=7.6 Score=36.75 Aligned_cols=16 Identities=13% Similarity=0.102 Sum_probs=13.2
Q ss_pred HHHHHHHHHhhhcccC
Q 020771 301 ATIIGHEPRGLQRICG 316 (321)
Q Consensus 301 AaVLaHEiaHv~~rH~ 316 (321)
.+|=|||+||+.+++.
T Consensus 90 vaVAAHEvGHAiQ~a~ 105 (222)
T PF04298_consen 90 VAVAAHEVGHAIQHAE 105 (222)
T ss_pred HHHHHHHHhHHHhccc
Confidence 4689999999987763
No 67
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=51.51 E-value=7.9 Score=36.42 Aligned_cols=16 Identities=13% Similarity=-0.010 Sum_probs=13.0
Q ss_pred HHHHHHHHHhhhcccC
Q 020771 301 ATIIGHEPRGLQRICG 316 (321)
Q Consensus 301 AaVLaHEiaHv~~rH~ 316 (321)
.+|-|||+||+.+++.
T Consensus 93 ~aVAAHEVGHAiQd~~ 108 (226)
T COG2738 93 IAVAAHEVGHAIQDQE 108 (226)
T ss_pred HHHHHHHhhHHHhhhc
Confidence 4789999999987653
No 68
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=50.96 E-value=6 Score=36.90 Aligned_cols=14 Identities=14% Similarity=0.157 Sum_probs=11.4
Q ss_pred HHHHHHHHhhh-ccc
Q 020771 302 TIIGHEPRGLQ-RIC 315 (321)
Q Consensus 302 aVLaHEiaHv~-~rH 315 (321)
.+|||||||.. ..|
T Consensus 147 ~t~AHElGHnLGm~H 161 (228)
T cd04271 147 QVFAHEIGHTFGAVH 161 (228)
T ss_pred eehhhhhhhhcCCCC
Confidence 69999999987 344
No 69
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=50.78 E-value=17 Score=37.97 Aligned_cols=54 Identities=19% Similarity=0.227 Sum_probs=29.6
Q ss_pred CCCCCCeEEEEEeCCC---CceEEeC----------CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771 258 HLDGLNWEVLVVNEPV---INAFCLP----------GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 258 ~l~~~~w~v~Vv~~~~---~NAfalP----------gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~ 312 (321)
++.+.|.-++|...|. .-|+|.| -|.|.+....|... ..+.+.-|+.|||+|++
T Consensus 156 gv~~~D~vlyV~~~~~~~~~~A~A~~C~~~~~~RP~~G~in~~p~~i~~~-~~~~~~~~~~HEi~HaL 222 (521)
T PF01457_consen 156 GVPNADFVLYVTARPSSSSTLAWAAPCQQDSDGRPIAGVININPSYIPSF-YFQEFFRTVIHEIAHAL 222 (521)
T ss_dssp -EES-SEEEEEE----STT---EEEEEEE-TTS-EEEEEEE--GGG---S---HHHHHHHHHHHHHHT
T ss_pred CcCCccEEEEEEEeeccCCceEEEeehhhccCCCCeeEEEEEchhHccch-hhhcccceeeeeeeeee
Confidence 3456778888876554 3477764 26888887777655 57778889999999996
No 70
>PF14521 Aspzincin_M35: Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=50.45 E-value=20 Score=31.23 Aligned_cols=40 Identities=13% Similarity=0.113 Sum_probs=28.1
Q ss_pred CCCceEEeCCC---eEEEcHhHHhhc-CCHHHHHHHHHHHHHhh
Q 020771 272 PVINAFCLPGG---KIVVFTGLLEHF-RTDAEIATIIGHEPRGL 311 (321)
Q Consensus 272 ~~~NAfalPgG---~I~V~tGLL~~~-~~edeLAaVLaHEiaHv 311 (321)
+..-|+|.|+. .|+|.......= ...+--+.+|-||++|.
T Consensus 64 ~~~~a~~~~~~~~~~IyLc~~F~~~p~~g~~Sk~~TLiHE~SHf 107 (148)
T PF14521_consen 64 NNTYAYVYPDSPTYTIYLCPAFFSAPTTGKDSKEGTLIHEWSHF 107 (148)
T ss_dssp SSSSEEE-TTST-TEEEE-HHHHHS-SSSTT-HHHHHHHHHHHS
T ss_pred cceEEEEECCCCceEEEEChhhcCCCCCCCCchHHHHHHhhhhh
Confidence 35678899985 599988888732 34567799999999993
No 71
>PF13402 M60-like: Peptidase M60-like family; PDB: 4FCA_A.
Probab=47.64 E-value=23 Score=33.46 Aligned_cols=51 Identities=12% Similarity=-0.046 Sum_probs=29.1
Q ss_pred eEEEEEeCCCCc-eEEe-CCCeEEEcHhHHhhcCCHHHH---HHHHHHHHHhhhcc
Q 020771 264 WEVLVVNEPVIN-AFCL-PGGKIVVFTGLLEHFRTDAEI---ATIIGHEPRGLQRI 314 (321)
Q Consensus 264 w~v~Vv~~~~~N-Afal-PgG~I~V~tGLL~~~~~edeL---AaVLaHEiaHv~~r 314 (321)
....++.+...+ +++. .|+.|....+..+.+-+.+.+ .=-+.||+||....
T Consensus 178 ~~~r~v~~v~~~~g~m~a~g~~i~~~~~~~~~~l~~~~~~~~~WG~~HE~GH~~Q~ 233 (307)
T PF13402_consen 178 PNNRFVFDVQISAGYMHASGYPIGFPPNWMNELLNPNPLRKGGWGPWHELGHNHQQ 233 (307)
T ss_dssp --EEEEEETT----SEEEETTEEEEETT--HHHH-HHHHHHH-HHHHHHHHHHH-B
T ss_pred ccceEEEeccccccceeecCCcEEeeCcHHhcccCHhHcCCCCeeehhhhhhhcCc
Confidence 344777777655 5665 577777777766655466655 33589999998654
No 72
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=47.58 E-value=13 Score=33.51 Aligned_cols=15 Identities=20% Similarity=0.184 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHHhhh
Q 020771 298 AEIATIIGHEPRGLQ 312 (321)
Q Consensus 298 deLAaVLaHEiaHv~ 312 (321)
....+++-|||||++
T Consensus 90 ~~~~~~i~HElgHaL 104 (198)
T cd04327 90 PEFSRVVLHEFGHAL 104 (198)
T ss_pred hhHHHHHHHHHHHHh
Confidence 345578899999997
No 73
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=46.26 E-value=12 Score=36.65 Aligned_cols=39 Identities=15% Similarity=0.181 Sum_probs=23.7
Q ss_pred CCCCceEEeC---CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhc
Q 020771 271 EPVINAFCLP---GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQR 313 (321)
Q Consensus 271 ~~~~NAfalP---gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~ 313 (321)
.+..-|||.+ ++..+|++.. -++-++ ...|+||+||+.+
T Consensus 194 gK~~gaf~~~~~~~~~p~i~~n~---~~~~~~-v~tl~HE~GHa~h 235 (427)
T cd06459 194 GKRSGAYCTGLPPGKHPFILMNF---NGTLDD-VFTLAHELGHAFH 235 (427)
T ss_pred CCCCCeecCCCCCCCCCeEEecC---CCChhh-HHHHHHHhhHHHH
Confidence 3345688875 3455654332 234444 4568999999764
No 74
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=40.90 E-value=9.5 Score=39.83 Aligned_cols=45 Identities=16% Similarity=0.036 Sum_probs=24.9
Q ss_pred EEEEeCCCCceEEeCCC--eEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcc
Q 020771 266 VLVVNEPVINAFCLPGG--KIVVFTGLLEHFRTDAEIATIIGHEPRGLQRI 314 (321)
Q Consensus 266 v~Vv~~~~~NAfalPgG--~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~r 314 (321)
++--+....-|||.+.+ .-+| |+..-.+-++ ..+|+||+||..+.
T Consensus 346 ~~~r~gK~~Ga~~~~~~~~~p~i---l~N~~~~~~d-v~TLaHElGHa~H~ 392 (591)
T TIGR00181 346 YAENKGKRSGAYSIGGYKVKPYI---LMNWDGTLNS-VFTLAHELGHSMHS 392 (591)
T ss_pred ecCCCCCCCCcccCCCCCCCCeE---EEecCCCcch-HHHHHHHhhhHHHH
Confidence 33334455679998542 2222 1222223333 35799999999744
No 75
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=40.53 E-value=29 Score=38.27 Aligned_cols=15 Identities=20% Similarity=0.437 Sum_probs=13.4
Q ss_pred eeeeeccCCCCcccc
Q 020771 81 HVQHFRPRGPRKWLQ 95 (321)
Q Consensus 81 ~~~hf~~rg~r~w~~ 95 (321)
-+-||.|.+-|+||.
T Consensus 120 ~~Tq~ep~~Ar~~fP 134 (831)
T TIGR02412 120 LYTQFEPADARRVFA 134 (831)
T ss_pred EEECCCCcCceeeEe
Confidence 378999999999997
No 76
>PTZ00337 surface protease GP63; Provisional
Probab=40.09 E-value=56 Score=34.96 Aligned_cols=52 Identities=12% Similarity=0.088 Sum_probs=35.6
Q ss_pred CCCCCCeEEEEEeCCC--CceEEeCC----------CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771 258 HLDGLNWEVLVVNEPV--INAFCLPG----------GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 258 ~l~~~~w~v~Vv~~~~--~NAfalPg----------G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~ 312 (321)
++.+.|..++|.-.|. .=|+|.|- |.|.+....|. +.++..-|+.|||+|++
T Consensus 178 Gv~naD~vLyV~a~p~~~~~A~A~~C~~~~~~RPi~G~in~np~~i~---~~~~~~~v~~HEi~HAL 241 (567)
T PTZ00337 178 GVAGADMILYANIFPTSGPAAWAIPCFLLDDGRPFAAAVNFDPRQIA---VTNGDVRVAAHELGHAL 241 (567)
T ss_pred CccccceEEEEEeccCCCcceEEEEcccccCCCceEEEEEECHHHcc---chhHHHHHHHHHHHHHH
Confidence 4456677777765443 33777642 67777765543 46777889999999997
No 77
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=38.83 E-value=16 Score=40.68 Aligned_cols=24 Identities=13% Similarity=0.133 Sum_probs=16.4
Q ss_pred cceeEeecceeeeeccCCCCcccc
Q 020771 72 KRYYYVDRYHVQHFRPRGPRKWLQ 95 (321)
Q Consensus 72 ~r~y~~~~~~~~hf~~rg~r~w~~ 95 (321)
.-+|.....-+-+|.|.|-|+|+.
T Consensus 99 ~GlY~s~~~~~TQ~Ep~gaR~ifp 122 (863)
T TIGR02414 99 EGLYKSGGNFCTQCEAEGFRRITY 122 (863)
T ss_pred eEEEEeCCeEEEEecCCCCCcCCC
Confidence 345555555678999999996554
No 78
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=38.70 E-value=23 Score=39.29 Aligned_cols=51 Identities=16% Similarity=0.148 Sum_probs=32.9
Q ss_pred CCeEEEEEeCCCCceEEeCC-CeEEEcHhH-Hhhc---C--CHHHHHHHHHHHHHhhh
Q 020771 262 LNWEVLVVNEPVINAFCLPG-GKIVVFTGL-LEHF---R--TDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 262 ~~w~v~Vv~~~~~NAfalPg-G~I~V~tGL-L~~~---~--~edeLAaVLaHEiaHv~ 312 (321)
.+|...+|.-|..|+-+|=| |.|.+.+.. |..- . +-...+.|++||++|--
T Consensus 262 y~l~~~~V~v~~f~~GaMEN~Gl~tf~~~~ll~~~~~at~~~~~~~~~viaHElaHqW 319 (859)
T COG0308 262 YALPIDIVAVPDFSAGAMENWGLVTFREKYLLADPETATDSDYENVEEVIAHELAHQW 319 (859)
T ss_pred CCCcccEEeccCCCCccccccceeEEeeeEEeeCcccchhHHHHHHHHHHHHHHhhhc
Confidence 34444677778888888865 566666663 2210 0 12567779999999963
No 79
>PF06861 BALF1: BALF1 protein; InterPro: IPR010677 Epstein-Barr virus (strain GD1) (HHV-4), a human tumour DNA virus and a prominent member of gamma-herpesviruses, encodes homologues of cellular antiapoptotic viral Bcl-2 proteins BALF1 and BHRF1. They protect the virus from apoptosis in its host cell during virus synthesis [, ]. The virus infects B lymphocytes to establish a latent infection and yield proliferating, growth-transformed B cells in vitro. Bcl-2 genes are essential for the initial evasion of apoptosis which allows it to establish a latent infection or cause cellular transformation, or both []. Bcl-2 family proteins can inhibit or induce programmed cell death in part by counteracting the activity of other BCL-2 family members. BALF1, inhibits the antiapoptotic activity of EBV BHRF1 and of KSBcl-2 in several transfected cell lines. BALF1 fails, however, to inhibit the cellular BCL-2 family member, BCL-x(L). Thus, BALF1 acts as a negative regulator of the survival function of BHRF1, similar to the counterbalance observed between cellular BCL-2 family members [].
Probab=38.29 E-value=21 Score=32.84 Aligned_cols=25 Identities=16% Similarity=0.171 Sum_probs=21.7
Q ss_pred hhcCCHHHHHHHHHHHHHhhhcccC
Q 020771 292 EHFRTDAEIATIIGHEPRGLQRICG 316 (321)
Q Consensus 292 ~~~~~edeLAaVLaHEiaHv~~rH~ 316 (321)
..++|+.|-++|++|++||...+|-
T Consensus 124 ~~l~~d~e~~s~v~~~lA~Fy~~~r 148 (182)
T PF06861_consen 124 RNLLNDHENASLVSHALAHFYLRYR 148 (182)
T ss_pred HHhcCchHHHHHHHHHHHHHHHHHH
Confidence 4567999999999999999988873
No 80
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=38.05 E-value=19 Score=34.03 Aligned_cols=26 Identities=27% Similarity=0.290 Sum_probs=17.3
Q ss_pred EEEcHhHHhhcCCHHHHHHHHHHHHHhhhcc
Q 020771 284 IVVFTGLLEHFRTDAEIATIIGHEPRGLQRI 314 (321)
Q Consensus 284 I~V~tGLL~~~~~edeLAaVLaHEiaHv~~r 314 (321)
|.|-.||=..+ ..+|||||+.|+-.+
T Consensus 82 Ilvl~GLPrll-----~gsiLAHE~mHa~Lr 107 (212)
T PF12315_consen 82 ILVLYGLPRLL-----TGSILAHELMHAWLR 107 (212)
T ss_pred EEEECCCCHHH-----HhhHHHHHHHHHHhc
Confidence 44555543332 478999999998654
No 81
>PF09715 Plasmod_dom_1: Plasmodium protein of unknown function (Plasmod_dom_1); InterPro: IPR006410 These sequences represent an uncharacterised family consisting of a small number of hypothetical proteins of the malaria parasite Plasmodium falciparum (isolate 3D7).
Probab=37.10 E-value=15 Score=28.79 Aligned_cols=31 Identities=23% Similarity=0.490 Sum_probs=23.4
Q ss_pred hhhhhhHHHHHHHhhcccCCCCCCCcccccc
Q 020771 2 AYYRRAKAALDAFRNLSSKAVPKSPVQESCS 32 (321)
Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 32 (321)
++++|+|+.+|+|-++-+.-+=.++++...|
T Consensus 1 sFFKk~klvm~~fDnifidklId~~i~nk~s 31 (67)
T PF09715_consen 1 SFFKKAKLVMDIFDNIFIDKLIDNNIQNKSS 31 (67)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHccccccCCC
Confidence 4799999999999996554444667766655
No 82
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=36.55 E-value=39 Score=30.69 Aligned_cols=17 Identities=12% Similarity=-0.035 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHhhh-ccc
Q 020771 299 EIATIIGHEPRGLQ-RIC 315 (321)
Q Consensus 299 eLAaVLaHEiaHv~-~rH 315 (321)
-++-++.||+||.. ..|
T Consensus 124 R~~k~~~HElGH~lGL~H 141 (179)
T PRK13267 124 RVRKEVTHELGHTLGLEH 141 (179)
T ss_pred HHHHHHHHHHHHHcCCcc
Confidence 44555899999995 455
No 83
>PRK14015 pepN aminopeptidase N; Provisional
Probab=34.31 E-value=26 Score=39.19 Aligned_cols=24 Identities=13% Similarity=0.153 Sum_probs=18.7
Q ss_pred cceeEeecceeeeeccCCCCcccc
Q 020771 72 KRYYYVDRYHVQHFRPRGPRKWLQ 95 (321)
Q Consensus 72 ~r~y~~~~~~~~hf~~rg~r~w~~ 95 (321)
.-+|..+.+-+-+|.|.|-|+|+.
T Consensus 112 ~Gly~s~~~~~TQ~Ep~gAR~~fP 135 (875)
T PRK14015 112 EGLYRSGGMFCTQCEAEGFRRITY 135 (875)
T ss_pred eeeEEECCEEEEeccccCcCCccc
Confidence 346666666678999999999886
No 84
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=33.75 E-value=18 Score=37.56 Aligned_cols=18 Identities=11% Similarity=-0.018 Sum_probs=15.4
Q ss_pred HHHHHHHHHHHhhh-cccC
Q 020771 299 EIATIIGHEPRGLQ-RICG 316 (321)
Q Consensus 299 eLAaVLaHEiaHv~-~rH~ 316 (321)
.|..|.+|||||++ +.|.
T Consensus 210 ~l~~Va~HEiGH~LGL~HS 228 (469)
T KOG1565|consen 210 DLFLVAAHEIGHALGLGHS 228 (469)
T ss_pred hhHHHhhhhcccccccCCC
Confidence 58899999999987 6665
No 85
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=33.21 E-value=21 Score=37.14 Aligned_cols=41 Identities=15% Similarity=0.276 Sum_probs=24.7
Q ss_pred eCCCCceEEeC--C-CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcc
Q 020771 270 NEPVINAFCLP--G-GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRI 314 (321)
Q Consensus 270 ~~~~~NAfalP--g-G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~r 314 (321)
+.....|||.+ . +.-+|+ +..-.+-+.+ .+|+||+||+.+.
T Consensus 308 ~gK~~Gayc~~~~~~~~P~I~---~Nf~~t~~dv-~TL~HElGHa~H~ 351 (549)
T TIGR02289 308 KGKAAGGYCTYLPKYKAPFIF---SNFNGTSGDI-DVLTHEAGHAFHV 351 (549)
T ss_pred CCCCCCcccCCCCCCCCcEEE---EeCCCChhHH-HHHHHHhhHHHHH
Confidence 34567899985 2 334442 3333344444 5699999999744
No 86
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=32.97 E-value=31 Score=30.24 Aligned_cols=23 Identities=13% Similarity=0.199 Sum_probs=18.5
Q ss_pred CCCHHHHHHHHHHHHHHHHHHhc
Q 020771 132 LLSKAVERQLGESQFQQMKAAFK 154 (321)
Q Consensus 132 llS~~~E~~Lgeq~y~qil~e~~ 154 (321)
-.|.+...+|+.++|.++=++|.
T Consensus 11 aps~~~fe~La~~A~d~lP~efr 33 (136)
T COG3824 11 APSLERFEELASDALDHLPQEFR 33 (136)
T ss_pred CCCHHHHHHHHHHHHHhCcHHHH
Confidence 57888888999999987777764
No 87
>PF10023 DUF2265: Predicted aminopeptidase (DUF2265); InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=32.93 E-value=25 Score=35.24 Aligned_cols=17 Identities=35% Similarity=0.378 Sum_probs=15.8
Q ss_pred CHHHHHHHHHHHHHhhh
Q 020771 296 TDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 296 ~edeLAaVLaHEiaHv~ 312 (321)
+|.+||.+|=||++|.+
T Consensus 161 ~~~~LA~LIfHELaHq~ 177 (337)
T PF10023_consen 161 PDGELARLIFHELAHQT 177 (337)
T ss_pred CchHHHHHHHHHHhhce
Confidence 79999999999999974
No 88
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=31.70 E-value=8 Score=36.25 Aligned_cols=20 Identities=10% Similarity=0.024 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHhhh-cccCCC
Q 020771 299 EIATIIGHEPRGLQ-RICGLP 318 (321)
Q Consensus 299 eLAaVLaHEiaHv~-~rH~l~ 318 (321)
-+..+|.||+||.+ +.|+.+
T Consensus 136 n~g~t~~HEvGH~lGL~HtF~ 156 (225)
T cd04275 136 NLGDTATHEVGHWLGLYHTFQ 156 (225)
T ss_pred cccceeEEeccceeeeeeeec
Confidence 46678999999987 888754
No 89
>TIGR01519 plasmod_dom_1 Plasmodium falciparum uncharacterized domain. This model represents an uncharacterized domain present in roughly eight hypothetical proteins of the malaria parasite Plasmodium falciparum.
Probab=31.59 E-value=22 Score=28.12 Aligned_cols=31 Identities=19% Similarity=0.379 Sum_probs=23.9
Q ss_pred hhhhhhHHHHHHHhhcccCCCCCCCcccccc
Q 020771 2 AYYRRAKAALDAFRNLSSKAVPKSPVQESCS 32 (321)
Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 32 (321)
++++|+|+.+|+|-++-+.-+=.++++...|
T Consensus 1 sFFKkaklvm~~fDnifidklid~~v~nk~S 31 (70)
T TIGR01519 1 SFFKRALLLMNIFDNIFIDKLIDNNVHNKGS 31 (70)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHccccccCCC
Confidence 4799999999999996555555677776655
No 90
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=30.78 E-value=13 Score=40.45 Aligned_cols=32 Identities=19% Similarity=0.106 Sum_probs=25.0
Q ss_pred CCeEEEcHhHHhhcC-----CHHHHHHHHHHHHHhhh
Q 020771 281 GGKIVVFTGLLEHFR-----TDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 281 gG~I~V~tGLL~~~~-----~edeLAaVLaHEiaHv~ 312 (321)
|-+..++|||+.... =..+---|+||||||.-
T Consensus 368 G~~~sLNtGi~T~~NYg~~Vp~kvs~lt~AHEiGHNf 404 (764)
T KOG3658|consen 368 GKKRSLNTGISTSVNYGKRVPTKVSDLTLAHEIGHNF 404 (764)
T ss_pred cceEEeecceeeeeecCCccCcchhheeehhhhcccc
Confidence 557999999998764 24556678999999974
No 91
>PF09768 Peptidase_M76: Peptidase M76 family; InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=30.64 E-value=39 Score=30.72 Aligned_cols=29 Identities=17% Similarity=0.227 Sum_probs=22.6
Q ss_pred CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771 281 GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 281 gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~ 312 (321)
...|+|-.. .+.+..+|.-+|+||+-|+-
T Consensus 55 ~~~I~lC~N---~~~~~~~l~~~l~HELIHay 83 (173)
T PF09768_consen 55 KKGIVLCQN---RIRSQGHLEDTLTHELIHAY 83 (173)
T ss_pred CCCEEEeeC---CCCCHHHHHHHHHHHHHHHH
Confidence 556666543 45799999999999999984
No 92
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=29.90 E-value=27 Score=34.29 Aligned_cols=15 Identities=20% Similarity=0.056 Sum_probs=12.0
Q ss_pred HHHHHHHHHHHHhhh
Q 020771 298 AEIATIIGHEPRGLQ 312 (321)
Q Consensus 298 deLAaVLaHEiaHv~ 312 (321)
-.+-++++||+||..
T Consensus 194 ~~~P~T~~HElAHq~ 208 (318)
T PF12725_consen 194 YSLPFTICHELAHQL 208 (318)
T ss_pred ccccHHHHHHHHHHh
Confidence 346689999999974
No 93
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=29.88 E-value=99 Score=33.59 Aligned_cols=55 Identities=15% Similarity=0.184 Sum_probs=35.8
Q ss_pred CCCCCCeEEEEEeCCC---CceEEeC----------CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhc
Q 020771 258 HLDGLNWEVLVVNEPV---INAFCLP----------GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQR 313 (321)
Q Consensus 258 ~l~~~~w~v~Vv~~~~---~NAfalP----------gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~ 313 (321)
+..+.|.-++|.-.|. .=|+|.+ -|.|.+.---|.. ..+.+..-|+.|||+|++-
T Consensus 202 Gv~naD~vLYVta~P~~~~~lA~A~~C~~~~d~RPi~G~iNinp~~i~s-~~~~~~~rv~~HEi~HALG 269 (622)
T PTZ00257 202 GVSNADFVLYVASVPSEPGVLAWATTCQVFSDDHPAVGVMNIPAANIVS-RYDQGTTRTVTHEVAHALG 269 (622)
T ss_pred CccccCEEEEEEEecCCCCeeEEEEEeccccCCCceEEEEeeCHHHCCC-ccchHHHHHHHHHHHHHhc
Confidence 3456778888876553 2377764 1567766544432 2345678999999999973
No 94
>PF02571 CbiJ: Precorrin-6x reductase CbiJ/CobK; InterPro: IPR003723 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase []. In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase []. There are at least two distinct cobalamin biosynthetic pathways in bacteria []: Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway []; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis []; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii subsp. shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) []. There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobK and CbiJ precorrin-6x reductase (1.3.1.54 from EC). In the aerobic pathway, CobK catalyses the reduction of the macrocycle of precorrin-6X to produce precorrin-6Y; while in the anaerobic pathway CbiJ catalyses the reduction of the macrocycle of cobalt-precorrin-6X into cobalt-precorrin-6Y [, ].; GO: 0016994 precorrin-6A reductase activity, 0009236 cobalamin biosynthetic process, 0055114 oxidation-reduction process
Probab=29.58 E-value=50 Score=31.36 Aligned_cols=37 Identities=14% Similarity=0.127 Sum_probs=26.5
Q ss_pred HHHHHHHHHHHhcC-CCCCCCChhHHHHHHHHHHHHHH
Q 020771 142 GESQFQQMKAAFKG-KILPAIHPDSVRVRLIAKDIIEA 178 (321)
Q Consensus 142 geq~y~qil~e~~~-kiLP~~hp~s~rVr~Ia~rIi~a 178 (321)
++.+.++++++.+- .++.++||....|..-+.+..+.
T Consensus 54 ~~~~l~~~l~~~~i~~vIDATHPfA~~is~na~~a~~~ 91 (249)
T PF02571_consen 54 DEEGLAEFLRENGIDAVIDATHPFAAEISQNAIEACRE 91 (249)
T ss_pred CHHHHHHHHHhCCCcEEEECCCchHHHHHHHHHHHHhh
Confidence 44556667776663 48899999999988766666554
No 95
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=27.44 E-value=35 Score=37.46 Aligned_cols=21 Identities=19% Similarity=0.088 Sum_probs=16.6
Q ss_pred CHHHHHHHHHHHHHhhh-cccC
Q 020771 296 TDAEIATIIGHEPRGLQ-RICG 316 (321)
Q Consensus 296 ~edeLAaVLaHEiaHv~-~rH~ 316 (321)
+....|.||||||||.+ ..|.
T Consensus 319 ~~~~~a~v~AhelgH~lGm~hD 340 (716)
T KOG3607|consen 319 ILLAFAVVLAHELGHNLGMIHD 340 (716)
T ss_pred cchhHHHHHHHHHHhhcCcccc
Confidence 56778999999999986 4443
No 96
>KOG1046 consensus Puromycin-sensitive aminopeptidase and related aminopeptidases [Amino acid transport and metabolism; Posttranslational modification, protein turnover, chaperones]
Probab=26.24 E-value=94 Score=34.77 Aligned_cols=47 Identities=23% Similarity=0.222 Sum_probs=26.5
Q ss_pred EEEEeCCCCceEEeCC-CeEEEc-HhHHhhc-----CCHHHHHHHHHHHHHhhh
Q 020771 266 VLVVNEPVINAFCLPG-GKIVVF-TGLLEHF-----RTDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 266 v~Vv~~~~~NAfalPg-G~I~V~-tGLL~~~-----~~edeLAaVLaHEiaHv~ 312 (321)
+-+|--|+..+-||=+ |.|..- +-||-.- .+...+|.|+|||+||--
T Consensus 283 ~D~iavPdf~~GAMENwGLvtyre~~lL~~~~~ss~~~k~~va~vIaHElAHQW 336 (882)
T KOG1046|consen 283 LDLVAVPDFSAGAMENWGLVTYRETALLYDPQTSSSSNKQRVAEVIAHELAHQW 336 (882)
T ss_pred ccEEecCCccccchhcCcceeeeehhhccCCCcCcHHHHHHHHHHHHHHHHHHH
Confidence 3344444555444433 444443 3333221 246779999999999963
No 97
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=26.24 E-value=40 Score=31.42 Aligned_cols=14 Identities=21% Similarity=0.116 Sum_probs=11.7
Q ss_pred HHHHHHHHHHHhhh
Q 020771 299 EIATIIGHEPRGLQ 312 (321)
Q Consensus 299 eLAaVLaHEiaHv~ 312 (321)
=+.+|+-||+||..
T Consensus 52 l~~~v~iHElgH~~ 65 (227)
T cd06164 52 LFASVLLHELGHSL 65 (227)
T ss_pred HHHHHHHHHHHHHH
Confidence 45789999999974
No 98
>PF07225 NDUF_B4: NADH-ubiquinone oxidoreductase B15 subunit (NDUFB4); InterPro: IPR009866 NADH:ubiquinone oxidoreductase (complex I) (1.6.5.3 from EC) is a respiratory-chain enzyme that catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane (NADH + ubiquinone = NAD+ + ubiquinol) []. Complex I is a major source of reactive oxygen species (ROS) that are predominantly formed by electron transfer from FMNH(2). Complex I is found in bacteria, cyanobacteria (as a NADH-plastoquinone oxidoreductase), archaea [], mitochondira, and in the hydrogenosome, a mitochondria-derived organelle. In general, the bacterial complex consists of 14 different subunits, while the mitochondrial complex contains homologues to these subunits in addition to approximately 31 additional proteins []. Mitochondrial complex I, which is located in the inner mitochondrial membrane, is the largest multimeric respiratory enzyme in the mitochondria, consisting of more than 40 subunits, one FMN co-factor and eight FeS clusters []. The assembly of mitochondrial complex I is an intricate process that requires the cooperation of the nuclear and mitochondrial genomes [, ]. Mitochondrial complex I can cycle between active and deactive forms that can be distinguished by the reactivity towards divalent cations and thiol-reactive agents. All redox prosthetic groups reside in the peripheral arm of the L-shaped structure. The NADH oxidation domain harbouring the FMN cofactor is connected via a chain of iron-sulphur clusters to the ubiquinone reduction site that is located in a large pocket formed by the PSST and 49kDa subunits of complex I []. This family contains human NADH-ubiquinone oxidoreductase subunit NDUFB4 and related sequences.; GO: 0008137 NADH dehydrogenase (ubiquinone) activity, 0005739 mitochondrion
Probab=26.03 E-value=38 Score=29.43 Aligned_cols=47 Identities=30% Similarity=0.507 Sum_probs=27.9
Q ss_pred ccCCCCCC-----CC-cce-eEeecceeeeeccCCCCccccCCchhHHHHHHHHHHHHHHHh
Q 020771 62 SYCNPNFN-----TA-KRY-YYVDRYHVQHFRPRGPRKWLQNPRTVFIVVVIGSGAFITLYL 116 (321)
Q Consensus 62 ~~~~~~~~-----~~-~r~-y~~~~~~~~hf~~rg~r~w~~~~r~~~~~v~vg~g~~~~~y~ 116 (321)
...||+-. ++ -|+ |.-.-..|.||.|. ||+.++.++.+++.++++|+
T Consensus 46 q~~nP~~~gli~DPAL~Rw~~a~~~~~y~~FRpT--------Pktsllg~~~~v~P~i~~~~ 99 (125)
T PF07225_consen 46 QYNNPHRKGLIFDPALQRWAYARAVNIYEYFRPT--------PKTSLLGLGFGVVPLIFYYY 99 (125)
T ss_pred HcCCCCCCCccCChHHHHHHHHHHhCcccccccC--------chHHHHHHHHHHHHHHHHHh
Confidence 45666631 22 467 55444789999887 47777666665554444433
No 99
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=25.54 E-value=38 Score=34.09 Aligned_cols=17 Identities=12% Similarity=0.122 Sum_probs=13.3
Q ss_pred CHHHHHHHHHHHHHhhhc
Q 020771 296 TDAEIATIIGHEPRGLQR 313 (321)
Q Consensus 296 ~edeLAaVLaHEiaHv~~ 313 (321)
+-+++ ..|+||+||+.+
T Consensus 239 ~~~~v-~tLfHE~GHa~H 255 (458)
T PF01432_consen 239 SHDDV-ETLFHEFGHAMH 255 (458)
T ss_dssp SHHHH-HHHHHHHHHHHH
T ss_pred ChhhH-HHHHHHHhHHHH
Confidence 55666 679999999864
No 100
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=25.21 E-value=50 Score=32.80 Aligned_cols=45 Identities=24% Similarity=0.261 Sum_probs=27.9
Q ss_pred CCCeEEEEEeCCCCceEEeCCCeEEEcHhHHhhc--CCHHHHHHHHHHHHHhh
Q 020771 261 GLNWEVLVVNEPVINAFCLPGGKIVVFTGLLEHF--RTDAEIATIIGHEPRGL 311 (321)
Q Consensus 261 ~~~w~v~Vv~~~~~NAfalPgG~I~V~tGLL~~~--~~edeLAaVLaHEiaHv 311 (321)
..-|+|+|- .+-|+++-| ....-||..+ .+|.+||.+|=||++|.
T Consensus 162 aqG~DVyV~---Gv~AYSTLG---wFdDPlLstmlr~dd~~lA~LIFHELAHQ 208 (376)
T COG4324 162 AQGLDVYVS---GVTAYSTLG---WFDDPLLSTMLRQDDTYLASLIFHELAHQ 208 (376)
T ss_pred hcCCceeee---cccceeccc---ccccHHHHHHhcCChHHHHHHHHHHHhhh
Confidence 344666663 345555533 2223344322 38999999999999996
No 101
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=24.05 E-value=48 Score=30.09 Aligned_cols=16 Identities=25% Similarity=-0.056 Sum_probs=12.6
Q ss_pred HHHHHHHHHHHHHhhh
Q 020771 297 DAEIATIIGHEPRGLQ 312 (321)
Q Consensus 297 edeLAaVLaHEiaHv~ 312 (321)
-.=+..++.||+||..
T Consensus 38 l~l~~~l~iHElgH~~ 53 (183)
T cd06160 38 LALLAILGIHEMGHYL 53 (183)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3456788999999974
No 102
>COG3864 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.96 E-value=62 Score=32.74 Aligned_cols=35 Identities=9% Similarity=-0.047 Sum_probs=29.4
Q ss_pred CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccC
Q 020771 281 GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICG 316 (321)
Q Consensus 281 gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~ 316 (321)
+=.++.+--++..| ..+++-+.|-||+-|+...|.
T Consensus 51 ~~~~y~NPei~~~~-p~~~~~aLl~HEV~Hi~l~Hi 85 (396)
T COG3864 51 YFTMYFNPEIFLNC-PISEMKALLKHEVYHIMLNHI 85 (396)
T ss_pred ceEEEeCHHHHccC-CHHHHHHHHHHHHHHHHHHHH
Confidence 33577777777777 799999999999999999995
No 103
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=23.92 E-value=39 Score=35.46 Aligned_cols=44 Identities=16% Similarity=0.091 Sum_probs=23.5
Q ss_pred EEEEeCCCCceEEeC--CCe-EEEcHhHHhhcCCHHHHHHHHHHHHHhhhc
Q 020771 266 VLVVNEPVINAFCLP--GGK-IVVFTGLLEHFRTDAEIATIIGHEPRGLQR 313 (321)
Q Consensus 266 v~Vv~~~~~NAfalP--gG~-I~V~tGLL~~~~~edeLAaVLaHEiaHv~~ 313 (321)
++--+.....|||.+ ++. -+|.. ..-.+-+++ .+|+||+||+.+
T Consensus 342 ~~~r~gK~~Ga~~~~~~~~~~p~i~~---N~~~~~~~v-~TL~HE~GHa~H 388 (587)
T TIGR02290 342 AEPRPGKRGGAFCTGFPPSKEPRVLM---NYDGSRRDV-STLAHELGHAYH 388 (587)
T ss_pred cCCCCCCCCCcccCCCCCCCCCEEEE---ecCCCchhH-HHHHHHhhHHHH
Confidence 333344556788864 222 23221 111233333 569999999975
No 104
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=23.80 E-value=44 Score=30.14 Aligned_cols=12 Identities=25% Similarity=0.005 Sum_probs=10.4
Q ss_pred HHHHHHHHHhhh
Q 020771 301 ATIIGHEPRGLQ 312 (321)
Q Consensus 301 AaVLaHEiaHv~ 312 (321)
.+++.||++|++
T Consensus 80 ~~~i~HEl~HaL 91 (191)
T PF01400_consen 80 VGTILHELGHAL 91 (191)
T ss_dssp HHHHHHHHHHHH
T ss_pred ccchHHHHHHHH
Confidence 468999999997
No 105
>PF08014 DUF1704: Domain of unknown function (DUF1704); InterPro: IPR012548 This family contains many hypothetical proteins.
Probab=23.01 E-value=1.1e+02 Score=30.81 Aligned_cols=47 Identities=15% Similarity=0.345 Sum_probs=31.7
Q ss_pred CeEEEEEeCCCCceEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHH-hhhc
Q 020771 263 NWEVLVVNEPVINAFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPR-GLQR 313 (321)
Q Consensus 263 ~w~v~Vv~~~~~NAfalPgG~I~V~tGLL~~~~~edeLAaVLaHEia-Hv~~ 313 (321)
++.|.+-++=..+|. +.++.|.|.++ ..+ ++.++.+++-|||| |+..
T Consensus 132 ~~~V~~sddl~a~A~-v~~~~l~I~~~--~~f-s~~~l~~L~~HEigvH~lt 179 (349)
T PF08014_consen 132 EVKVELSDDLLARAM-VSGDRLKINKN--AMF-SERDLEALLHHEIGVHLLT 179 (349)
T ss_pred eEEEEEcCCcchhhc-ccCCeeEEcCC--CCc-CHHHHHHHHHHhhhhhhcc
Confidence 444444444345664 55778888876 223 79999999999995 6653
No 106
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=22.99 E-value=38 Score=36.15 Aligned_cols=48 Identities=13% Similarity=0.097 Sum_probs=28.9
Q ss_pred Ce-EEEEEeCCCCceEEeCC---CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcc
Q 020771 263 NW-EVLVVNEPVINAFCLPG---GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRI 314 (321)
Q Consensus 263 ~w-~v~Vv~~~~~NAfalPg---G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~r 314 (321)
.| ++..-+.+..-|||.+. ++.+|.- ....+... ..+|+||+||..+.
T Consensus 343 ~WiD~~~~~gKrsGaYs~~~~~~~~p~Ilm---N~~gt~~d-V~TLaHElGHs~Hs 394 (598)
T COG1164 343 RWIDVYPRKGKRSGAYSIGFYKGDHPFILM---NYDGTLRD-VFTLAHELGHSVHS 394 (598)
T ss_pred CCeeccCCCCCCCCcccCCCCCCCCCeEEE---eCCCchhH-HHHHHHHccHHHHH
Confidence 44 33333455688999863 5566643 33333333 36799999998653
No 107
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=22.62 E-value=45 Score=34.15 Aligned_cols=17 Identities=18% Similarity=0.073 Sum_probs=12.6
Q ss_pred HHHHHHHHHHHHHhhhcc
Q 020771 297 DAEIATIIGHEPRGLQRI 314 (321)
Q Consensus 297 edeLAaVLaHEiaHv~~r 314 (321)
-+++ ..|+||+||+.+.
T Consensus 261 ~~~V-~TLfHEfGHalH~ 277 (472)
T cd06455 261 HDEV-ETFFHEFGHVIHH 277 (472)
T ss_pred HHHH-HHHHHHHHHHHHH
Confidence 3554 5689999999753
No 108
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=22.49 E-value=40 Score=31.14 Aligned_cols=12 Identities=25% Similarity=0.146 Sum_probs=10.5
Q ss_pred HHHHHHHHHhhh
Q 020771 301 ATIIGHEPRGLQ 312 (321)
Q Consensus 301 AaVLaHEiaHv~ 312 (321)
.+++.|||+|++
T Consensus 88 ~Gti~HEl~HaL 99 (200)
T cd04281 88 FGIVVHELGHVI 99 (200)
T ss_pred CchHHHHHHHHh
Confidence 368999999997
No 109
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=21.90 E-value=42 Score=30.59 Aligned_cols=12 Identities=33% Similarity=0.180 Sum_probs=10.5
Q ss_pred HHHHHHHHHhhh
Q 020771 301 ATIIGHEPRGLQ 312 (321)
Q Consensus 301 AaVLaHEiaHv~ 312 (321)
.+++.||++|++
T Consensus 78 ~G~i~HEl~HaL 89 (182)
T cd04283 78 KGIIQHELLHAL 89 (182)
T ss_pred cchHHHHHHHHh
Confidence 378999999997
No 110
>PF02102 Peptidase_M35: Deuterolysin metalloprotease (M35) family; InterPro: IPR001384 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M35 (deuterolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Deuterolysin is a microbial zinc-containing metalloprotease that shows some similarity to thermolysin []. The protein is expressed with a possible 19-residue signal sequence, a 155-residue propeptide, and an active peptide of 177 residues []. The latter contains an HEXXH motif towards the C terminus, but the other zinc ligands are as yet undetermined [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 1EB6_A.
Probab=21.54 E-value=1.3e+02 Score=30.49 Aligned_cols=42 Identities=21% Similarity=0.214 Sum_probs=22.9
Q ss_pred CCCceEEeCC-CeEEEcHhHHhhcC------CHHHHHHHHHHHHHhhhc
Q 020771 272 PVINAFCLPG-GKIVVFTGLLEHFR------TDAEIATIIGHEPRGLQR 313 (321)
Q Consensus 272 ~~~NAfalPg-G~I~V~tGLL~~~~------~edeLAaVLaHEiaHv~~ 313 (321)
+.+-|+++|. +.|+..--.+..+. ....-++++-||+.|...
T Consensus 262 ~~vlAYT~p~~~~I~~Cp~ff~~lp~~~~~C~~qDqatt~LHE~TH~~~ 310 (359)
T PF02102_consen 262 SGVLAYTLPSQNQIVNCPIFFSDLPALSNRCHAQDQATTTLHEMTHAPA 310 (359)
T ss_dssp TT--EEEEGGGTEEEE-HHHHHH--SS--STT---HHHHHHHHHHT-TT
T ss_pred CCeEEEEEcCCCeEEECchhhccCCCccccccCCCccchhhhhhhcccc
Confidence 4577999986 45555544443221 345578999999999853
No 111
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=21.13 E-value=24 Score=33.93 Aligned_cols=12 Identities=25% Similarity=0.166 Sum_probs=10.5
Q ss_pred HHHHHHHHHhhh
Q 020771 301 ATIIGHEPRGLQ 312 (321)
Q Consensus 301 AaVLaHEiaHv~ 312 (321)
..|++||++|..
T Consensus 166 igv~~HE~gH~l 177 (286)
T TIGR03296 166 VGVIAHELGHDL 177 (286)
T ss_pred eeeeehhhhccc
Confidence 589999999964
No 112
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=20.87 E-value=85 Score=27.33 Aligned_cols=26 Identities=19% Similarity=0.252 Sum_probs=18.3
Q ss_pred EcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771 286 VFTGLLEHFRTDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 286 V~tGLL~~~~~edeLAaVLaHEiaHv~ 312 (321)
+-+-=++.+ +-+|-.-||.||+.|+-
T Consensus 67 l~sekF~rL-s~~ekvKviiHEllHIP 92 (133)
T COG4900 67 LLSEKFKRL-SCAEKVKVIIHELLHIP 92 (133)
T ss_pred eehhhcCCC-ChHHHHHHHHHHHhcCc
Confidence 333344555 56777789999999984
No 113
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=20.27 E-value=62 Score=31.15 Aligned_cols=33 Identities=24% Similarity=0.362 Sum_probs=20.5
Q ss_pred CceEEeCCCeEEE--cHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771 274 INAFCLPGGKIVV--FTGLLEHFRTDAEIATIIGHEPRGLQ 312 (321)
Q Consensus 274 ~NAfalPgG~I~V--~tGLL~~~~~edeLAaVLaHEiaHv~ 312 (321)
.|.-.+||..+++ ..+++ .=+.+|+-||+||..
T Consensus 96 ~~~~~ipGv~~~i~~~~~~i------aL~isv~iHElgHa~ 130 (263)
T cd06159 96 RNVLVIPGVNIFIPLPYGII------ALVVGVVVHELSHGI 130 (263)
T ss_pred ceeeccCCcchHHHHHHHHH------HHHHHHHHHHHHHHH
Confidence 3444678766553 22222 245689999999975
Done!