Query psy768
Match_columns 475
No_of_seqs 355 out of 1928
Neff 6.6
Searched_HMMs 46136
Date Sat Aug 17 00:39:39 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy768.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/768hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2661|consensus 100.0 5.3E-77 1.2E-81 581.7 22.4 386 15-459 22-417 (424)
2 COG4783 Putative Zn-dependent 100.0 1.1E-36 2.3E-41 313.7 24.3 227 181-440 31-263 (484)
3 COG4784 Putative Zn-dependent 100.0 3.5E-32 7.6E-37 268.1 20.2 228 179-439 21-253 (479)
4 PRK01345 heat shock protein Ht 99.9 6E-26 1.3E-30 230.5 22.8 193 218-439 61-281 (317)
5 PRK03982 heat shock protein Ht 99.9 1.3E-25 2.7E-30 225.7 22.8 190 228-439 69-283 (288)
6 PRK03001 M48 family peptidase; 99.9 2E-25 4.3E-30 223.8 21.9 186 228-439 68-280 (283)
7 PRK02870 heat shock protein Ht 99.9 3.1E-25 6.8E-30 226.0 20.8 187 227-435 115-333 (336)
8 PF01435 Peptidase_M48: Peptid 99.9 4E-26 8.6E-31 219.4 13.1 185 228-432 32-226 (226)
9 PRK03072 heat shock protein Ht 99.9 3.4E-25 7.4E-30 222.4 19.0 193 218-439 64-285 (288)
10 COG0501 HtpX Zn-dependent prot 99.9 1.3E-24 2.9E-29 218.2 18.9 188 227-440 99-300 (302)
11 PRK04897 heat shock protein Ht 99.9 1.1E-23 2.3E-28 212.6 25.2 186 228-435 81-297 (298)
12 PRK02391 heat shock protein Ht 99.9 5.6E-24 1.2E-28 214.2 19.0 198 217-439 69-292 (296)
13 PRK05457 heat shock protein Ht 99.9 2.8E-23 6.2E-28 208.1 20.5 185 228-435 78-282 (284)
14 PRK01265 heat shock protein Ht 99.9 5.8E-23 1.3E-27 208.3 20.1 196 217-436 74-321 (324)
15 KOG2719|consensus 99.7 8.4E-17 1.8E-21 165.9 16.4 198 215-439 208-426 (428)
16 PF05569 Peptidase_M56: BlaR1 98.2 3.1E-05 6.6E-10 78.4 14.7 136 253-433 160-299 (299)
17 PF10463 Peptidase_U49: Peptid 97.7 9.8E-05 2.1E-09 70.8 6.9 60 369-433 123-189 (206)
18 PRK09672 phage exclusion prote 97.7 0.00013 2.8E-09 72.9 8.0 60 369-433 187-253 (305)
19 PF04228 Zn_peptidase: Putativ 97.2 0.0015 3.2E-08 66.1 9.0 131 260-437 136-281 (292)
20 PF06114 DUF955: Domain of unk 97.1 0.0031 6.8E-08 53.6 8.5 50 251-305 6-60 (122)
21 COG4219 MecR1 Antirepressor re 96.8 0.038 8.1E-07 56.1 15.2 53 253-306 154-209 (337)
22 PF13203 DUF2201_N: Putative m 96.1 0.009 2E-07 60.3 5.9 56 252-308 26-81 (292)
23 COG2321 Predicted metalloprote 94.3 0.24 5.3E-06 49.3 8.9 39 263-301 136-181 (295)
24 PRK04351 hypothetical protein; 93.8 0.22 4.8E-06 45.6 7.3 38 260-298 35-72 (149)
25 PF10263 SprT-like: SprT-like 92.3 0.34 7.3E-06 44.0 6.2 41 262-303 33-76 (157)
26 PF14247 DUF4344: Domain of un 92.1 5.1 0.00011 39.1 14.3 50 249-298 37-103 (220)
27 PF13699 DUF4157: Domain of un 92.1 0.12 2.6E-06 42.2 2.7 39 260-301 37-75 (79)
28 smart00731 SprT SprT homologue 89.7 0.37 8E-06 43.7 3.7 33 268-301 41-73 (146)
29 PF04298 Zn_peptidase_2: Putat 88.8 3.6 7.8E-05 40.1 9.9 52 250-304 54-106 (222)
30 PF01863 DUF45: Protein of unk 87.4 3 6.4E-05 39.4 8.4 51 252-304 131-181 (205)
31 COG1451 Predicted metal-depend 87.0 2.2 4.8E-05 41.6 7.5 44 260-305 150-193 (223)
32 PF12388 Peptidase_M57: Dual-a 86.2 0.61 1.3E-05 45.1 3.0 34 268-304 117-151 (211)
33 PF10026 DUF2268: Predicted Zn 86.0 1.9 4.1E-05 41.1 6.3 33 269-303 49-81 (195)
34 PF04450 BSP: Peptidase of pla 85.2 4.7 0.0001 38.9 8.6 49 255-303 57-112 (205)
35 COG2856 Predicted Zn peptidase 82.0 1.4 3E-05 42.7 3.5 29 269-302 59-87 (213)
36 PF02031 Peptidase_M7: Strepto 82.0 2.7 5.9E-05 37.4 5.0 47 249-299 40-89 (132)
37 COG2738 Predicted Zn-dependent 80.6 5.1 0.00011 38.2 6.5 20 286-305 91-110 (226)
38 COG3590 PepO Predicted metallo 80.1 0.43 9.4E-06 51.9 -0.8 46 259-304 449-504 (654)
39 PF08325 WLM: WLM domain; Int 79.5 2.4 5.1E-05 40.3 4.1 73 226-305 19-100 (186)
40 PF13485 Peptidase_MA_2: Pepti 79.3 2.5 5.5E-05 35.8 3.9 22 282-303 20-41 (128)
41 PF12725 DUF3810: Protein of u 79.1 16 0.00034 37.7 10.2 15 285-299 194-208 (318)
42 cd04268 ZnMc_MMP_like Zinc-dep 78.5 2.3 5E-05 38.6 3.6 51 250-300 45-107 (165)
43 PF13402 M60-like: Peptidase M 75.5 6.7 0.00014 39.5 6.3 49 254-302 181-234 (307)
44 PRK04860 hypothetical protein; 75.5 2.7 5.9E-05 38.9 3.2 33 268-301 45-77 (160)
45 PF05572 Peptidase_M43: Pregna 72.6 1.5 3.3E-05 40.3 0.7 57 249-305 16-88 (154)
46 PF00413 Peptidase_M10: Matrix 72.2 2.3 5.1E-05 38.0 1.9 35 269-303 86-122 (154)
47 COG4324 Predicted aminopeptida 71.5 8.3 0.00018 38.6 5.5 37 272-308 180-221 (376)
48 cd04279 ZnMc_MMP_like_1 Zinc-d 69.8 2.8 6E-05 38.1 1.8 16 284-299 101-116 (156)
49 cd04270 ZnMc_TACE_like Zinc-de 68.0 3.2 6.9E-05 41.0 1.9 31 271-301 146-181 (244)
50 smart00235 ZnMc Zinc-dependent 65.8 6.9 0.00015 34.6 3.5 44 250-299 51-98 (140)
51 PF01431 Peptidase_M13: Peptid 65.6 6.9 0.00015 37.1 3.7 44 262-305 1-54 (206)
52 COG1164 Oligoendopeptidase F [ 63.6 13 0.00028 41.6 5.9 49 252-304 346-397 (598)
53 COG3091 SprT Zn-dependent meta 63.0 6.9 0.00015 35.9 2.9 30 270-300 45-74 (156)
54 PF10023 DUF2265: Predicted am 62.2 6.1 0.00013 40.9 2.7 48 260-310 139-191 (337)
55 cd04276 ZnMc_MMP_like_2 Zinc-d 62.1 5.7 0.00012 38.1 2.4 47 253-299 65-128 (197)
56 cd04269 ZnMc_adamalysin_II_lik 62.1 4.7 0.0001 37.8 1.8 16 284-299 128-143 (194)
57 PF01447 Peptidase_M4: Thermol 60.9 13 0.00027 34.1 4.3 49 249-301 97-149 (150)
58 PF13688 Reprolysin_5: Metallo 58.9 5.4 0.00012 37.4 1.6 17 284-300 139-155 (196)
59 PRK06926 flagellar motor prote 58.5 2.1E+02 0.0045 28.9 13.3 35 272-306 112-148 (271)
60 PF01421 Reprolysin: Reprolysi 58.4 7.7 0.00017 36.6 2.6 17 283-299 127-143 (199)
61 PF14891 Peptidase_M91: Effect 57.9 5.7 0.00012 37.1 1.6 15 285-299 101-115 (174)
62 PF13582 Reprolysin_3: Metallo 57.5 5.4 0.00012 34.4 1.2 12 288-299 108-119 (124)
63 cd04267 ZnMc_ADAM_like Zinc-de 56.9 4.3 9.3E-05 38.0 0.5 15 285-299 131-145 (192)
64 COG1291 MotA Flagellar motor c 55.6 2.1E+02 0.0047 28.8 12.2 110 283-403 121-245 (266)
65 cd04278 ZnMc_MMP Zinc-dependen 55.2 6 0.00013 35.9 1.2 32 268-299 85-119 (157)
66 PF09768 Peptidase_M76: Peptid 54.6 38 0.00082 31.9 6.4 36 261-299 48-83 (173)
67 cd00203 ZnMc Zinc-dependent me 54.3 5.7 0.00012 35.9 0.9 15 285-299 94-108 (167)
68 cd04277 ZnMc_serralysin_like Z 53.9 9.9 0.00021 35.5 2.5 36 269-304 94-131 (186)
69 PF13574 Reprolysin_2: Metallo 53.8 6.6 0.00014 36.6 1.2 15 287-301 111-125 (173)
70 cd04272 ZnMc_salivary_gland_MP 53.1 7.6 0.00016 37.4 1.6 14 286-299 144-157 (220)
71 TIGR02411 leuko_A4_hydro leuko 52.0 12 0.00027 41.8 3.2 37 263-303 259-295 (601)
72 KOG3624|consensus 50.4 12 0.00025 42.6 2.7 46 260-305 481-536 (687)
73 KOG1047|consensus 48.8 11 0.00024 41.3 2.1 32 270-304 274-305 (613)
74 cd04271 ZnMc_ADAM_fungal Zinc- 48.5 6.1 0.00013 38.6 0.1 12 289-300 147-158 (228)
75 TIGR00181 pepF oligoendopeptid 48.4 7.5 0.00016 43.2 0.8 49 252-304 345-395 (591)
76 cd06459 M3B_Oligoendopeptidase 48.3 11 0.00024 39.6 2.0 49 252-304 188-239 (427)
77 PF01433 Peptidase_M1: Peptida 45.9 27 0.00059 36.1 4.5 52 251-303 254-311 (390)
78 COG3824 Predicted Zn-dependent 45.8 13 0.00029 32.7 1.8 30 270-299 86-121 (136)
79 PF13583 Reprolysin_4: Metallo 44.5 11 0.00023 36.2 1.1 16 289-304 139-155 (206)
80 PF01457 Peptidase_M8: Leishma 43.7 25 0.00054 38.7 4.0 54 246-300 157-223 (521)
81 PF12652 CotJB: CotJB protein; 43.6 57 0.0012 26.6 5.0 39 417-455 19-59 (78)
82 PF06262 DUF1025: Possibl zinc 42.5 37 0.0008 28.8 3.9 33 269-301 50-87 (97)
83 PF13150 DUF3989: Protein of u 41.5 48 0.001 27.5 4.3 18 133-150 13-30 (85)
84 PF06861 BALF1: BALF1 protein; 41.2 18 0.0004 33.8 2.0 28 279-306 124-151 (182)
85 PF12315 DUF3633: Protein of u 41.2 14 0.00031 35.6 1.4 26 271-301 82-107 (212)
86 COG3864 Uncharacterized protei 40.6 30 0.00064 35.6 3.5 37 268-305 51-87 (396)
87 cd04327 ZnMc_MMP_like_3 Zinc-d 39.5 22 0.00048 33.7 2.5 17 285-301 90-106 (198)
88 cd04273 ZnMc_ADAMTS_like Zinc- 39.4 4.4 9.6E-05 38.7 -2.4 14 286-299 139-152 (207)
89 PF02163 Peptidase_M50: Peptid 36.3 22 0.00047 33.2 1.8 15 287-301 7-21 (192)
90 PF14521 Aspzincin_M35: Lysine 35.8 47 0.001 30.2 3.8 39 260-298 65-107 (148)
91 TIGR03818 MotA1 flagellar moto 35.4 4.9E+02 0.011 26.4 13.0 34 272-305 125-161 (282)
92 TIGR02289 M3_not_pepF oligoend 35.3 19 0.0004 39.8 1.3 49 252-304 303-354 (549)
93 PRK12482 flagellar motor prote 34.3 5.2E+02 0.011 26.3 14.3 25 282-306 138-162 (287)
94 cd06161 S2P-M50_SpoIVFB SpoIVF 33.5 25 0.00055 33.7 1.8 15 286-300 37-51 (208)
95 COG2976 Uncharacterized protei 31.4 61 0.0013 31.2 3.9 22 157-178 22-43 (207)
96 TIGR02412 pepN_strep_liv amino 30.8 52 0.0011 38.4 4.0 36 269-304 265-304 (831)
97 PTZ00337 surface protease GP63 30.5 78 0.0017 35.3 5.1 53 245-300 178-242 (567)
98 PRK08990 flagellar motor prote 30.1 5.6E+02 0.012 25.5 13.0 24 282-305 116-139 (254)
99 COG0308 PepN Aminopeptidase N 29.7 44 0.00096 39.1 3.2 56 249-304 262-324 (859)
100 TIGR03296 M6dom_TIGR03296 M6 f 29.5 13 0.00029 37.4 -0.9 11 288-298 166-176 (286)
101 PF13398 Peptidase_M50B: Pepti 29.2 33 0.00072 32.7 1.8 16 286-301 21-36 (200)
102 cd05709 S2P-M50 Site-2 proteas 29.1 34 0.00073 31.6 1.8 13 287-299 8-20 (180)
103 PRK08124 flagellar motor prote 28.7 6E+02 0.013 25.3 12.4 24 282-305 120-143 (263)
104 KOG3658|consensus 27.7 21 0.00045 40.1 0.1 34 268-301 368-406 (764)
105 KOG1046|consensus 27.2 1.1E+02 0.0024 36.0 6.0 52 253-304 283-341 (882)
106 PF01432 Peptidase_M3: Peptida 26.9 30 0.00065 37.0 1.2 18 283-301 239-256 (458)
107 cd06455 M3A_TOP Peptidase M3 T 26.9 33 0.00071 37.2 1.5 18 285-303 262-279 (472)
108 cd06164 S2P-M50_SpoIVFB_CBS Sp 26.7 39 0.00084 33.0 1.8 15 286-300 52-66 (227)
109 COG3426 Butyrate kinase [Energ 26.6 60 0.0013 33.1 3.1 40 259-298 67-120 (358)
110 PF12911 OppC_N: N-terminal TM 26.5 1.3E+02 0.0028 22.1 4.3 17 149-165 7-23 (56)
111 TIGR02290 M3_fam_3 oligoendope 25.7 34 0.00074 38.0 1.4 48 252-303 341-391 (587)
112 TIGR00035 asp_race aspartate r 24.9 1.5E+02 0.0032 28.7 5.5 60 390-449 9-76 (229)
113 cd06160 S2P-M50_like_2 Unchara 24.6 45 0.00098 31.5 1.8 15 286-300 40-54 (183)
114 PRK08456 flagellar motor prote 24.5 7E+02 0.015 24.7 12.0 23 282-304 119-141 (257)
115 TIGR02414 pepN_proteo aminopep 23.1 62 0.0013 38.0 2.8 19 286-304 282-300 (863)
116 PTZ00257 Glycoprotein GP63 (le 23.0 1.3E+02 0.0029 33.8 5.2 55 246-301 203-270 (622)
117 cd06163 S2P-M50_PDZ_RseP-like 22.6 53 0.0011 31.0 1.8 13 288-300 10-22 (182)
118 PRK13267 archaemetzincin-like 22.6 1E+02 0.0023 29.0 3.8 17 286-302 124-141 (179)
119 KOG3607|consensus 22.0 50 0.0011 37.8 1.8 17 284-300 320-336 (716)
120 cd06258 Peptidase_M3_like The 21.9 41 0.00089 34.7 1.0 21 283-304 151-171 (365)
121 KOG2975|consensus 21.8 1.2E+02 0.0026 30.5 4.1 33 375-407 172-204 (288)
122 PF04612 T2SM: Type II secreti 21.6 31 0.00067 31.2 0.0 14 134-147 3-16 (160)
123 cd04281 ZnMc_BMP1_TLD Zinc-dep 21.5 43 0.00093 32.2 1.0 15 288-302 88-102 (200)
124 cd04283 ZnMc_hatching_enzyme Z 21.4 44 0.00095 31.6 1.0 14 288-301 78-91 (182)
125 cd06159 S2P-M50_PDZ_Arch Uncha 21.1 57 0.0012 32.7 1.8 15 286-300 117-131 (263)
126 PF05960 DUF885: Bacterial pro 21.0 4.2E+02 0.0091 28.9 8.7 74 227-301 300-388 (549)
127 PF01400 Astacin: Astacin (Pep 20.5 55 0.0012 30.9 1.5 14 288-301 80-93 (191)
128 KOG3314|consensus 20.5 2.5E+02 0.0054 26.4 5.6 32 271-305 78-109 (194)
No 1
>KOG2661|consensus
Probab=100.00 E-value=5.3e-77 Score=581.72 Aligned_cols=386 Identities=35% Similarity=0.586 Sum_probs=353.7
Q ss_pred HHHhccCCCccccCCCCCCCCCCCCCCCCChhhhhhhhhhhccccccccceeeeeecCcccccchh-------hcccccc
Q psy768 15 LLRYIGHTSPECLETLDTPRNIPASRSSINTELEQDTIGCILSCDKKYSTCLNFQITGHQDKHLTR-------VLFHQSA 87 (475)
Q Consensus 15 ~~~~~g~~~~ec~~~~~~~~n~p~~~p~~~~~~e~~~~~~il~~~~~e~~~v~~~~~~~~~s~~~q-------~~p~~~~ 87 (475)
.+.|+|.++++|++..++|+|++|+.+++|+ +.+-+.+.+++|.|.++++|+.|||.+.+| ++|+.++
T Consensus 22 ~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~-----~~~~l~~~~~~~i~~~t~~~t~~n~~~~~~~~~~~~t~v~S~~v 96 (424)
T KOG2661|consen 22 VNKYQGLGVNQCDRWSFLPGNFHFYSTFNNK-----RTGGLSSTKSKEIWRITSKCTVWNDAFSRQLLIKEVTAVPSLSV 96 (424)
T ss_pred HHHHhccCcchhhhHhhCCCcceeeeeecch-----hcccccccCCccceecccccceehhhhhhccchhhhhccccccc
Confidence 5789999999999999999999999999998 999999999999999999999999999999 8999999
Q ss_pred cccccccccccccccccCCC-CCCcchHHHHhhccceeeeeeeecchhHHHHHHhcCchhhHHHHHHHHhcccch-HHHH
Q psy768 88 NHQLQVFNIRVLRCFHTSQP-KNALPGFIYLIFKPILRVVGAIVIARISRKWWSKLSPDQKKIILSQIKKHQDKI-AGIT 165 (475)
Q Consensus 88 ~~~~~~~~~~~~r~f~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~l~~~~r~~~~~~~~~~~~~~-~~~~ 165 (475)
.+|++.-..|.. .|||.+| +..|.|...+.+ +||+.+.|+++....+.++++.|++ +|+
T Consensus 97 ~l~~~~~s~Rn~-Ng~~y~rFqsnp~~ys~~~r-----------------~~~~sfkpn~~dk~f~n~rk~f~~lf~G~- 157 (424)
T KOG2661|consen 97 LLPLSPASIRNI-NGHTYPRFQSNPVPYSLMIR-----------------KWWQSFKPNKKDKVFENIRKNFWKLFLGL- 157 (424)
T ss_pred ccccchhhhhcc-CccccchhhcCCccHHHHHH-----------------HHHHhcCCCCccchhhhHHHHHHHHHhhc-
Confidence 999988888888 9999999 888877776544 3999999999999999888888877 443
Q ss_pred HHHHHHHHHHHHhhccccCCCCccceeccCHHHHHHHHHHHHHHHHHHhcCCCCCCCCHHHHHHHHHHHHHHHhcccccc
Q psy768 166 LTSIGLAYIYYFLHLETCPITGRQKFIIVKPNQLNDVTQIAYDNFIEEHGNQVLPLGHPAYKRVGAVVKRLIDANKVYME 245 (475)
Q Consensus 166 ~~~~~~~~~~~~~~~~~~P~tgr~~~~~ls~~~e~~lg~~~~~~~~~~~~~~~lp~~dp~~~~v~~i~~rL~~~~~~~p~ 245 (475)
+++++|||+|+|.+|+|||++|+++|+++|+.||+.+|++++++++++++|..||++-+|++|+.++++++++.|+
T Consensus 158 ----~~~~vfyfthlE~~Pit~Rsrfillskp~ekllge~~ye~im~eyq~~mLp~~hp~sl~V~~vlk~iIea~~~~~s 233 (424)
T KOG2661|consen 158 ----LLFVVFYFTHLEVSPITGRSRFILLSKPQEKLLGELEYEAIMEEYQNDMLPEKHPRSLAVKEVLKHIIEANKDVPS 233 (424)
T ss_pred ----cceEEEEEeeeeeeccCCcceeeeccCHHHHHhhHHHHHHHHHHhcCccCCcCCchhhHHHHHHHHHHHHhccCCc
Confidence 4467799999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCCcceeEEEEecCCcceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHH
Q psy768 246 HNNFKYPITIIDDPLINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAEKLSNKTFLEILYIVPLMIIW 325 (475)
Q Consensus 246 ~~~~~~~v~Vv~~~~~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~ 325 (475)
+++..|+|+||++|.+|||++|||+|||++|+|..++|||+||+|||||+||.++||.+|.+++..+++++++++.+++|
T Consensus 234 lsgIkWeihVVndPipNAFvLPgGKvfVFtgiLn~ck~ddglAtvLgHE~aHaVarH~AEki~k~~~~siLgLvlyt~~~ 313 (424)
T KOG2661|consen 234 LSGIKWEIHVVNDPIPNAFVLPGGKVFVFTGILNSCKDDDGLATVLGHEIAHAVARHAAEKIGKVHLLSILGLVLYTMIW 313 (424)
T ss_pred ccCceeEEEEecCCCCceeeccCCeEEEEechhhcccChHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHh
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999888888
Q ss_pred HhccchHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHh
Q psy768 326 FLLPDLGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMAL 405 (475)
Q Consensus 326 ~~~~~~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~ 405 (475)
+..+. ..+.+++ +.++|||++|.|||.+|+.+|++|||||+++..+||+|+.
T Consensus 314 a~~~n----------~~Ll~~f------------------lrlPfSRKMEtEADyIGLlLma~Acfdpras~tvwErM~~ 365 (424)
T KOG2661|consen 314 AICPN----------DKLLEYF------------------LRLPFSRKMETEADYIGLLLMAKACFDPRASSTVWERMEF 365 (424)
T ss_pred hccch----------HHHHHHH------------------hcCcchhhhhhhhhHHHHHHHHHhhcCcccchHHHHHHHH
Confidence 76541 1233444 7899999999999999999999999999999999999998
Q ss_pred hccccccCC-CcccccCCCCChHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCCCC
Q psy768 406 KETQDQVGP-KMEEYLSTHPSHENRANNLESKMKEALDIRKECNCLPLGPLFIPR 459 (475)
Q Consensus 406 ~~~~~~~g~-~~~~~lsTHP~~~~RI~~L~~~i~~a~~~~~~~~c~~~~~~~~~~ 459 (475)
.+ ...|. ..++|+||||+..+||+++++++++|..+||.|+|++.... ++|
T Consensus 366 ~e--gqmg~~~~~eflSTHPSskkRie~~~~lLpqAnvirE~sdcy~e~~p-~~~ 417 (424)
T KOG2661|consen 366 VE--GQMGQPKMPEFLSTHPSSKKRIEYLDRLLPQANVIREMSDCYPESNP-DPR 417 (424)
T ss_pred hh--hhcCCCCCchhhhcCCCccchhHHHHHhchHHHHHHHHhcCChhcCc-chh
Confidence 87 33333 35999999999999999999999999999999999987776 444
No 2
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=100.00 E-value=1.1e-36 Score=313.73 Aligned_cols=227 Identities=26% Similarity=0.376 Sum_probs=178.2
Q ss_pred cccCCCCccceeccCHHHHHHHHHHHHHHHHHHhcCCCCCCCCHHHH-HHHHHHHHHHHhccccccCCCcceeEEEEecC
Q psy768 181 ETCPITGRQKFIIVKPNQLNDVTQIAYDNFIEEHGNQVLPLGHPAYK-RVGAVVKRLIDANKVYMEHNNFKYPITIIDDP 259 (475)
Q Consensus 181 ~~~P~tgr~~~~~ls~~~e~~lg~~~~~~~~~~~~~~~lp~~dp~~~-~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~ 259 (475)
...|.+|......+|+++|..+|+.. +.+..+.+....||+++ +|++++++|++++. ..+.+|+|++|+++
T Consensus 31 ~~lp~ig~~~~s~ls~~qev~~g~~~----~~Qlr~~~~~i~D~el~~yv~~~g~rL~~~a~----~~~~~f~f~lV~d~ 102 (484)
T COG4783 31 DLLPDIGVSAGSTLSPAQEVALGDPA----NAQLRGSVPLIRDPELEEYVNSLGQRLAAAAD----LVKTPFTFFLVNDD 102 (484)
T ss_pred hhcchhhhhhhhcCCHHHHHHhhHHH----HHHhccCCCCcCCHHHHHHHHHHHHHHHHhcC----CCCCCeEEEEecCC
Confidence 35788999999999999999999854 44555566778999985 99999999998752 34689999999999
Q ss_pred CcceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHHHHHHHH---HHHHHHHHHHHHHHHh-ccch-HHH
Q psy768 260 LINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAEKLSNKT---FLEILYIVPLMIIWFL-LPDL-GAI 334 (475)
Q Consensus 260 ~~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~~~~~~~---l~~~l~~~~~~~i~~~-~~~~-~~~ 334 (475)
.+||||+|||+|+|++|||...+||+||++|||||+||+..||..+.+.... ...++++.++ ++.+. .++. .+.
T Consensus 103 ~iNAFA~~Gg~v~vntGLll~ae~esElagViAHEigHv~qrH~aR~~e~~~r~~~~~i~~ml~g-i~aa~a~~~ag~a~ 181 (484)
T COG4783 103 SINAFATPGGYVVVNTGLLLTAENESELAGVIAHEIGHVAQRHLARSMEQQQRAAPMAIAGMLLG-ILAALAGADAGMAG 181 (484)
T ss_pred ccchhhcCCceEEEehHHHHhcCCHHHHHHHHHHHHHHHhhhhHHHHHHHHhhhchhHHHHHHHH-HHHHHhCccccHHH
Confidence 9999999999999999999999999999999999999999999998776633 2222222222 22222 1221 111
Q ss_pred HHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhccccccCC
Q psy768 335 VTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQDQVGP 414 (475)
Q Consensus 335 l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~~~g~ 414 (475)
++..... ...-.+.|||++|.|||++|+.++.++||||++|++||+|+.... ..++
T Consensus 182 iag~~a~---------------------~~~g~L~~sR~~E~eADr~Gi~~L~raGydp~gM~~ff~rl~~~~---~~~~ 237 (484)
T COG4783 182 IAGALAG---------------------AAQGQLNFSRQNEQEADRIGITTLVRAGYDPQGMPEFFERLADQL---RYGG 237 (484)
T ss_pred HHHHHHH---------------------hhhhhhhcchhhHHHHHHHHHHHHHHcCCCchhHHHHHHHHHHHH---hcCC
Confidence 1111110 001357899999999999999999999999999999999999754 2366
Q ss_pred CcccccCCCCChHHHHHHHHHHHHHH
Q psy768 415 KMEEYLSTHPSHENRANNLESKMKEA 440 (475)
Q Consensus 415 ~~~~~lsTHP~~~~RI~~L~~~i~~a 440 (475)
..|+|++|||-+++||..+++.+.+.
T Consensus 238 ~~p~yl~THPlp~~RIa~lr~ra~q~ 263 (484)
T COG4783 238 QPPEYLLTHPLPEERIADLRNRAEQS 263 (484)
T ss_pred CCChHHhcCCCchhHHHHHHHHHHhC
Confidence 77999999999999999999877653
No 3
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=100.00 E-value=3.5e-32 Score=268.07 Aligned_cols=228 Identities=24% Similarity=0.341 Sum_probs=174.4
Q ss_pred hccccCCCCccceeccCHHHHH-HHHHHHHHHHHHHhcCCCCCCCCHHHH-HHHHHHHHHHHhccccccCCCcceeEEEE
Q psy768 179 HLETCPITGRQKFIIVKPNQLN-DVTQIAYDNFIEEHGNQVLPLGHPAYK-RVGAVVKRLIDANKVYMEHNNFKYPITII 256 (475)
Q Consensus 179 ~~~~~P~tgr~~~~~ls~~~e~-~lg~~~~~~~~~~~~~~~lp~~dp~~~-~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv 256 (475)
.+...|-++...+..+++.++. +||..+.++++.+|++.. +||.++ +|.+|+.+|...+. .| +..|+|.|+
T Consensus 21 ~~~~~Ps~~p~~v~~v~~n~~laqmgar~hprIla~yggeY---~D~Kler~Vari~g~lt~~S~-~p---~q~YriTil 93 (479)
T COG4784 21 ESGVQPSDNPQTVDNVQKNDPLAQMGARQHPRILATYGGEY---RDPKLERMVARIVGALTAVSE-NP---QQTYRITIL 93 (479)
T ss_pred ccccCCCCCcchHHHhhccChHhHhhhhhChhHHHhcCCcc---CCHHHHHHHHHHHhHhhhhcc-CC---CceEEEEEe
Confidence 3456788888778888887654 599988999999998654 677775 79999999987652 23 578999999
Q ss_pred ecCCcceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHhccchHHHHH
Q psy768 257 DDPLINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWFLLPDLGAIVT 336 (475)
Q Consensus 257 ~~~~~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~~~~~~~~~~l~ 336 (475)
|+|.+||||+|||++||+.|||..++|+.|||+||+|||||+..+|..++.++.....+.+-++...+ ..+..+-..
T Consensus 94 nSP~INAFALPGGYlYitRGLlAland~sEvAAVl~HEmgHVtAnHgi~rQ~~e~a~~ia~rvva~vl---~~~~agk~A 170 (479)
T COG4784 94 NSPNINAFALPGGYLYITRGLLALANDSSEVAAVLAHEMGHVTANHGIQRQQREAAEVIASRVVAEVL---GSDAAGKQA 170 (479)
T ss_pred cCCCccccccCCceEEEehhHHHHcCCHHHHHHHHHhhhhheecchhHHHHHHHHHHHHHHHHHHHHh---CCcchhhHH
Confidence 99999999999999999999999999999999999999999999999998877655443332222111 001111111
Q ss_pred HHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhcccccc-CC-
Q psy768 337 QVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQDQV-GP- 414 (475)
Q Consensus 337 ~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~~~-g~- 414 (475)
...+ .+....|||+||.|||.+|++++.+|||||.++..|++.|+........ |.
T Consensus 171 ~~rG-----------------------klrla~fsRnqELqAD~iG~~~lgeAGYDP~A~~rfl~sm~ay~~F~s~~g~a 227 (479)
T COG4784 171 LIRG-----------------------KLRLAQFSRNQELQADAIGIKMLGEAGYDPYAAARFLQSMAAYTDFRSVSGAA 227 (479)
T ss_pred Hhhh-----------------------hHHHhhhccchhhhhhhhhHHHHHhcCCChHHHHHHHHHHHhhhhhcccCcch
Confidence 1110 0145689999999999999999999999999999999999977732221 21
Q ss_pred -CcccccCCCCChHHHHHHHHHHHHH
Q psy768 415 -KMEEYLSTHPSHENRANNLESKMKE 439 (475)
Q Consensus 415 -~~~~~lsTHP~~~~RI~~L~~~i~~ 439 (475)
..-+||+|||.+.+||+..+.-..+
T Consensus 228 dqsldfl~sHPntpqRiqla~~hARq 253 (479)
T COG4784 228 DQSLDFLASHPNTPQRIQLARRHARQ 253 (479)
T ss_pred hhhcchhhcCCCChHHHHHHHHHHHh
Confidence 1268999999999999877665433
No 4
>PRK01345 heat shock protein HtpX; Provisional
Probab=99.94 E-value=6e-26 Score=230.53 Aligned_cols=193 Identities=22% Similarity=0.280 Sum_probs=124.8
Q ss_pred CCCCCCHHHHHHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeCC----CeEEEehhhhhhcCCHHHHHHHHHH
Q psy768 218 VLPLGHPAYKRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPD----GRIFMFTGMFQLCQTDDELATVLSH 293 (475)
Q Consensus 218 ~lp~~dp~~~~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalpg----G~IvV~~GLL~~l~~edELaaVLAH 293 (475)
+.|.++|+ +.+++++|+... .+| ..+|+++|++.+|||++|. +.|+|++||++.+ |+|||++||||
T Consensus 61 v~~~~~p~---L~~~v~~La~~a-gi~-----~p~v~vid~~~~NAFa~G~~~~~~~V~vt~gLL~~L-~~dEL~aVlAH 130 (317)
T PRK01345 61 VDERSAPE---LYRMVRDLARRA-GLP-----MPKVYIIDNPQPNAFATGRNPENAAVAATTGLLQRL-SPEEVAGVMAH 130 (317)
T ss_pred CCcccCHH---HHHHHHHHHHHc-CCC-----CCcEEEEcCCCcceEEecCCCCCeEEEechHHHhhC-CHHHHHHHHHH
Confidence 44555554 555666665543 122 2479999999999999963 3699999999998 57999999999
Q ss_pred HHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHhc-cc----hHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhcc
Q psy768 294 ELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWFLL-PD----LGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFEL 368 (475)
Q Consensus 294 ElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~~~~-~~----~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~ 368 (475)
||||++++|..+......+..++..+.....+... .+ ....++..+ ..++ .|+ ...++.+
T Consensus 131 ElgHi~~~d~~~~~l~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~i~~~~----~~~~--~~~---------~~~l~~~ 195 (317)
T PRK01345 131 ELAHVKNRDTLTMTITATLAGAISMLANFAFFFGGNRENNNGPLGLVGTLA----AMIV--APL---------AAMLVQM 195 (317)
T ss_pred HHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccchHHHHHHHH----HHHH--HHH---------HHHHHHH
Confidence 99999999998765443332222111111111110 00 001111110 0000 010 1113677
Q ss_pred ccchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhccc----cc---------------cCCCcccccCCCCChHHH
Q psy768 369 PFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQ----DQ---------------VGPKMEEYLSTHPSHENR 429 (475)
Q Consensus 369 ~~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~----~~---------------~g~~~~~~lsTHP~~~~R 429 (475)
.|||.+|++||++|++++ .||++++++|+||...... .. .+....+||+|||++++|
T Consensus 196 ~~SR~rE~~AD~~A~~lt----g~p~~L~~AL~KL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lfsTHP~~~eR 271 (317)
T PRK01345 196 AISRTREYAADRRGAEIC----GNPLWLASALGKIERGAHGVPNEEAERNPATAHMFIINPLSGEGMDNLFSTHPATENR 271 (317)
T ss_pred HHHHHHHHHHHHHHHHHh----CCHHHHHHHHHHHHhhhccCcccccccChHHHHHHhcCCccccchhHHhcCCcChHHH
Confidence 899999999999999875 3999999999999965310 00 011124799999999999
Q ss_pred HHHHHHHHHH
Q psy768 430 ANNLESKMKE 439 (475)
Q Consensus 430 I~~L~~~i~~ 439 (475)
|++|+++..+
T Consensus 272 I~~L~~~~~~ 281 (317)
T PRK01345 272 IAALQRMAGE 281 (317)
T ss_pred HHHHHHHHHh
Confidence 9999998766
No 5
>PRK03982 heat shock protein HtpX; Provisional
Probab=99.94 E-value=1.3e-25 Score=225.74 Aligned_cols=190 Identities=23% Similarity=0.270 Sum_probs=120.2
Q ss_pred HHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeC----CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccH
Q psy768 228 RVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFP----DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHV 303 (475)
Q Consensus 228 ~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalp----gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~ 303 (475)
.+++.++++++..+ +| ..++++++++.+|||++| +|.|+|++||++.+ |+|||++|||||+||++++|.
T Consensus 69 ~L~~~v~~la~~~g-~~-----~p~v~v~~~~~~NAfa~G~~~~~~~V~vt~gLl~~l-~~~El~AVlAHElgHi~~~h~ 141 (288)
T PRK03982 69 ELYRIVERLAERAN-IP-----KPKVAIVPTQTPNAFATGRDPKHAVVAVTEGILNLL-NEDELEGVIAHELTHIKNRDT 141 (288)
T ss_pred HHHHHHHHHHHHcC-CC-----CCeEEEEeCCCcceEEeccCCCCeEEEeehHHHhhC-CHHHHHHHHHHHHHHHHcCCH
Confidence 45555566655431 22 238999999999999995 45688999999998 689999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh-ccchH---HHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHH
Q psy768 304 AEKLSNKTFLEILYIVPLMIIWFL-LPDLG---AIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEAD 379 (475)
Q Consensus 304 ~~~~~~~~l~~~l~~~~~~~i~~~-~~~~~---~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD 379 (475)
.+......+...+..+.....+.. ..+.. ...+..++..+..++ .++ ...++.+.|||++|+|||
T Consensus 142 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~---------~~~l~~~~~SR~~E~~AD 210 (288)
T PRK03982 142 LIQTIAATLAGAIMYLAQWLSWGLWFGGGGRDDRNGGNPIGSLLLIIL--API---------AATLIQFAISRQREFSAD 210 (288)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchHHHHHHHHHH--HHH---------HHHHHHHHHhHHHHHHHh
Confidence 877655433221111111111110 00100 000000000000010 000 012366789999999999
Q ss_pred HHHHHHHHHcCCCchhHHHHHHHHHhhcccc-c----------------cCCCcccccCCCCChHHHHHHHHHHHHH
Q psy768 380 EVGLKLMARACYDVRVAPLFWQKMALKETQD-Q----------------VGPKMEEYLSTHPSHENRANNLESKMKE 439 (475)
Q Consensus 380 ~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~-~----------------~g~~~~~~lsTHP~~~~RI~~L~~~i~~ 439 (475)
++|+++ +.+|.+++++++|++...+.. . .+....+|++|||++++||+++++..++
T Consensus 211 ~~A~~~----~~~p~~l~~aL~kL~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~l~sTHP~~~eRI~~l~~~~~~ 283 (288)
T PRK03982 211 EGGARL----TGNPLALANALQKLEKGVRYIPLKNGNPATAHMFIINPFRGQFLANLFSTHPPTEERIERLLEMAQE 283 (288)
T ss_pred HHHHHH----hCCHHHHHHHHHHHHhhhccCCCCCCCHHHHhHhhcCCCCCchhhHHhCCCcCHHHHHHHHHHHHHh
Confidence 999886 468999999999998653110 0 0111257999999999999999987644
No 6
>PRK03001 M48 family peptidase; Provisional
Probab=99.94 E-value=2e-25 Score=223.78 Aligned_cols=186 Identities=19% Similarity=0.270 Sum_probs=120.2
Q ss_pred HHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeCCC----eEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccH
Q psy768 228 RVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPDG----RIFMFTGMFQLCQTDDELATVLSHELSHTLLKHV 303 (475)
Q Consensus 228 ~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalpgG----~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~ 303 (475)
.+.+++++++...+ +| .-+++++|++.+|||++|.+ +|+|++||++.+ |+|||++||||||||++++|.
T Consensus 68 ~L~~~v~~la~~~g-~~-----~p~v~v~~~~~~NAfa~G~~~~~~~Ivvt~gLl~~l-~~~El~aVlAHElgHi~~~h~ 140 (283)
T PRK03001 68 QFYRMVRELAQRAG-LP-----MPKVYLINEDQPNAFATGRNPEHAAVAATTGILRVL-SEREIRGVMAHELAHVKHRDI 140 (283)
T ss_pred HHHHHHHHHHHHcC-CC-----CCeEEEecCCCcceEEecCCCCCeEEEecHHHHhhC-CHHHHHHHHHHHHHHHhCCCh
Confidence 45566666655431 22 23899999999999999753 599999999998 689999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccch--H--HHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHH
Q psy768 304 AEKLSNKTFLEILYIVPLMIIWFLLPDL--G--AIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEAD 379 (475)
Q Consensus 304 ~~~~~~~~l~~~l~~~~~~~i~~~~~~~--~--~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD 379 (475)
........+..++..+.....+....+. . ...... +..++ .++ ...++.+.|||.+|++||
T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~--~~~---------~~~~~~~~~SR~~E~~AD 205 (283)
T PRK03001 141 LISTISATMAGAISALANFAMFFGGRDENGRPVNPIAGI----AVAIL--APL---------AASLIQMAISRAREFEAD 205 (283)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCccccccchHHHH----HHHHH--HHH---------HHHHHHHHHhHHHHHHHh
Confidence 8765443222222211111111110000 0 000000 00000 000 011366789999999999
Q ss_pred HHHHHHHHHcCCCchhHHHHHHHHHhhccccc-------------------cCCCcccccCCCCChHHHHHHHHHHHHH
Q psy768 380 EVGLKLMARACYDVRVAPLFWQKMALKETQDQ-------------------VGPKMEEYLSTHPSHENRANNLESKMKE 439 (475)
Q Consensus 380 ~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~~-------------------~g~~~~~~lsTHP~~~~RI~~L~~~i~~ 439 (475)
++|+++ ++||++++++++|+...+.... .++....|++|||++++||++++++.++
T Consensus 206 ~~a~~l----~~~p~~l~~AL~Kl~~~~~~~p~~~~~~~~~~~~~~~~~p~~~~~~~~l~~THP~~~eRI~~l~~~~~~ 280 (283)
T PRK03001 206 RGGARI----SGDPQALASALDKIHRYASGIPFQAAEAHPATAQMMIINPLSGGGLANLFSTHPSTEERIARLMAMART 280 (283)
T ss_pred HHHHHH----hCCHHHHHHHHHHHHhhhccCCcccccCCHHHHHHHhcCCCCcchHHHHHcCCcCHHHHHHHHHHHHHh
Confidence 999886 4699999999999997652100 0011246899999999999999987644
No 7
>PRK02870 heat shock protein HtpX; Provisional
Probab=99.93 E-value=3.1e-25 Score=225.99 Aligned_cols=187 Identities=27% Similarity=0.366 Sum_probs=122.4
Q ss_pred HHHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeC----CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhccc
Q psy768 227 KRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFP----DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKH 302 (475)
Q Consensus 227 ~~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalp----gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH 302 (475)
.++.++++++....+ .| ..++|++++++.+||||+| +++|+||+||++.+ |+|||++||||||||+.++|
T Consensus 115 ~~L~~~ve~La~~ag-~p----~~p~V~vi~~~~~NAFA~G~~~~~~~Ivvt~GLL~~L-~~dEL~aVlAHELgHik~~d 188 (336)
T PRK02870 115 RQLYNVVEELLVAAG-LR----FMPKVYIIDAPYMNAFASGYSEKSAMVAITTGLLEKL-DRDELQAVMAHELSHIRHGD 188 (336)
T ss_pred HHHHHHHHHHHHHcC-CC----CCCeEEEEcCCCCceEEecCCCCCcEEEEehHHhhhC-CHHHHHHHHHHHHHHHHccc
Confidence 478888888877542 22 2469999999999999994 67899999999998 68999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccc---hHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHH
Q psy768 303 VAEKLSNKTFLEILYIVPLMIIWFLLPD---LGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEAD 379 (475)
Q Consensus 303 ~~~~~~~~~l~~~l~~~~~~~i~~~~~~---~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD 379 (475)
.........+..++..+....++...++ ...... .+...+..++ +|+ ...++.+.+||.+|++||
T Consensus 189 i~~~~~~~~l~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~l~l~~~--~~~---------~~~ll~~~iSR~rEy~AD 256 (336)
T PRK02870 189 IRLTLCVGVLSNIMLIVADFLFYSFMGNRRNSGANRA-RMIILILRYV--LPI---------LTVLLMLFLSRTREYMAD 256 (336)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccchh-HHHHHHHHHH--HHH---------HHHHHHHHHHHHHHHHHh
Confidence 8665433332222111111111111111 000000 0000000000 111 112367889999999999
Q ss_pred HHHHHHHHHcCCCchhHHHHHHHHHhhccccc-------------------cC------CCcccccCCCCChHHHHHHHH
Q psy768 380 EVGLKLMARACYDVRVAPLFWQKMALKETQDQ-------------------VG------PKMEEYLSTHPSHENRANNLE 434 (475)
Q Consensus 380 ~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~~-------------------~g------~~~~~~lsTHP~~~~RI~~L~ 434 (475)
++|++++. ||.+++++++|+........ .. ....++|||||++++||++|+
T Consensus 257 ~~Aa~ltg----~p~aLasAL~KL~~~~~~~~~~~~~~~~~~~~~~a~~~i~~p~~~~~~~~~~LfsTHPp~e~RI~rL~ 332 (336)
T PRK02870 257 AGAVELMR----DNEPMARALQKISNDHAQNDEQYAYKHTDHESTRRAAYLFDPAGISPGSLSDAFSTHPSIENRLAALG 332 (336)
T ss_pred HHHHHHhC----CHHHHHHHHHHHHhccccCccccccccccCChhhhhhhccCCcccccccHhHHHcCCCCHHHHHHHHh
Confidence 99988864 89999999999986442110 00 012468999999999999997
Q ss_pred H
Q psy768 435 S 435 (475)
Q Consensus 435 ~ 435 (475)
.
T Consensus 333 ~ 333 (336)
T PRK02870 333 G 333 (336)
T ss_pred h
Confidence 5
No 8
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=99.93 E-value=4e-26 Score=219.44 Aligned_cols=185 Identities=28% Similarity=0.319 Sum_probs=100.4
Q ss_pred HHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeCCCe---EEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHH
Q psy768 228 RVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPDGR---IFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA 304 (475)
Q Consensus 228 ~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalpgG~---IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~ 304 (475)
++++++++++...+ ....+++++|++++.+|||++|+|. |+|++||++.+ |+|||++|||||+||+.++|..
T Consensus 32 ~L~~~v~~l~~~~~----~~~~~~~v~v~~~~~~NA~~~g~~~~~~I~v~~~ll~~~-~~~el~aVlaHElgH~~~~h~~ 106 (226)
T PF01435_consen 32 ELRRIVEELARRAG----LGIPPPRVYVIDSPSPNAFATGGGPRKRIVVTSGLLESL-SEDELAAVLAHELGHIKHRHIL 106 (226)
T ss_dssp HHHHHHHHHHHHHH----CTSS--EEEEE--SSEEEEEETTTC--EEEEEHHHHHHS-SHHHHHHHHHHHHHHHHTTHCC
T ss_pred HHHHHHHHHHHHhc----CCCCCCeEEEEcCCCCcEEEEccCCCcEEEEeChhhhcc-cHHHHHHHHHHHHHHHHcCCcc
Confidence 55555666555431 1124579999999999999999998 99999999665 7899999999999999999998
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHHHHHHH
Q psy768 305 EKLSNKTFLEILYIVPLMIIWFLLPDLGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEADEVGLK 384 (475)
Q Consensus 305 ~~~~~~~l~~~l~~~~~~~i~~~~~~~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD~~gl~ 384 (475)
+.+....+..++..+....+... .......... ......+ . ....++...+||.+|++||++|++
T Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~-~~~~~~~----------~--~~~~~~~~~~sr~~E~~AD~~a~~ 171 (226)
T PF01435_consen 107 KSLLISLLLSILFFALLALLIGS--MSLFSAFGFI-DILGILI----------A--FLFQLLTNAFSRRQEYEADRYAAR 171 (226)
T ss_dssp CCCCHHH-HHHHHHHHHT-------HHHHHHHH-----------------------HHSTT------HHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHhhh--hhhhhhhhhc-ccchhhH----------H--HHHHHhhcchhHHHHHHHHHHHHH
Confidence 77544443333222211111000 0000000000 0000000 0 111237789999999999999999
Q ss_pred H------HHHcCCCchhHHHHHHHHHhhcccc-ccCCCcccccCCCCChHHHHHH
Q psy768 385 L------MARACYDVRVAPLFWQKMALKETQD-QVGPKMEEYLSTHPSHENRANN 432 (475)
Q Consensus 385 l------l~~AGydp~a~~~~l~kl~~~~~~~-~~g~~~~~~lsTHP~~~~RI~~ 432 (475)
+ |.++|+++..+.++|+++....... ..+.....+++|||++.+||++
T Consensus 172 ~~~~~~~l~~a~~~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~tHP~~~~Ri~~ 226 (226)
T PF01435_consen 172 LGGDPALLARALYKPAAAISALEKLAEANSMRPDSDWRYSSLFSTHPSTEERIAA 226 (226)
T ss_dssp H------HHHTTS-TTHHHHHHHHHT-----------------------HHHHHH
T ss_pred hcCCcHHHHHhCCCHHHHHHHHHHHHHHhccccCCccccchhcCCCcCHHHHhCc
Confidence 9 9999999999999999998733111 1122337889999999999975
No 9
>PRK03072 heat shock protein HtpX; Provisional
Probab=99.93 E-value=3.4e-25 Score=222.43 Aligned_cols=193 Identities=22% Similarity=0.265 Sum_probs=122.7
Q ss_pred CCCCCCHHHHHHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeCC----CeEEEehhhhhhcCCHHHHHHHHHH
Q psy768 218 VLPLGHPAYKRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPD----GRIFMFTGMFQLCQTDDELATVLSH 293 (475)
Q Consensus 218 ~lp~~dp~~~~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalpg----G~IvV~~GLL~~l~~edELaaVLAH 293 (475)
+.|.++|+ +.+++++|....+ +| .-+++++|++.+|||+++. +.|+|++||++.+ |+|||++||||
T Consensus 64 v~~~~~p~---L~~~v~~la~~~g-~p-----~p~vyv~~~~~~NAFa~G~~~~~~~v~vt~gLl~~l-~~~El~aVlAH 133 (288)
T PRK03072 64 VSEVQAPA---MYRIVRELSTAAR-QP-----MPRLYISPTAAPNAFATGRNPRNAAVCCTEGILQIL-NERELRGVLGH 133 (288)
T ss_pred CChhhhHH---HHHHHHHHHHHcC-CC-----CCCEEEecCCCCceEEecCCCCCcEEEecHHHHHhC-CHHHHHHHHHH
Confidence 45555554 4444455544321 22 2379999999999999963 2499999999998 67999999999
Q ss_pred HHHhhhcccHHHHHHHHHHHHHHHHHHHHHHH-Hhcc---c--hHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhc
Q psy768 294 ELSHTLLKHVAEKLSNKTFLEILYIVPLMIIW-FLLP---D--LGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFE 367 (475)
Q Consensus 294 ElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~-~~~~---~--~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~ 367 (475)
|+||+.++|.........+..++..+.....+ .... + ....++..+ ..++ .|+ ...++.
T Consensus 134 ElgHi~~~d~~~~~~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~----~~l~--~~~---------~~~~~~ 198 (288)
T PRK03072 134 ELSHVYNRDILISSVAGALASVITYLANMAMFAGMFGGRRDNDGPNPLALLL----VSLL--GPI---------AATVIQ 198 (288)
T ss_pred HHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccchHHHHHH----HHHH--HHH---------HHHHHH
Confidence 99999999999765443332222211111111 1100 0 001111100 0011 010 112366
Q ss_pred cccchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhccccc-c------------------CCCcccccCCCCChHH
Q psy768 368 LPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQDQ-V------------------GPKMEEYLSTHPSHEN 428 (475)
Q Consensus 368 ~~~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~~-~------------------g~~~~~~lsTHP~~~~ 428 (475)
+.|||.+|++||++|+++. .||++++++|+||........ . +.....|++|||++++
T Consensus 199 ~~~SR~rE~~AD~~A~~l~----~~p~~La~AL~KL~~~~~~~~~~~~~~~~~~~~~~i~~p~~~~~~~~l~sTHP~~~e 274 (288)
T PRK03072 199 LAISRSREYQADESGAELT----GDPLALASALRKISGGVQAAPLPPEPQLASQAHLMIANPFRAGGIGRLFSTHPPMAD 274 (288)
T ss_pred HHHHhHHHHHHhHHHHHHh----CCHHHHHHHHHHHHhccccCCCCccccchhhhhhhhcCcccchHHHHHHcCCcCHHH
Confidence 7899999999999998874 599999999999986441100 0 0112578999999999
Q ss_pred HHHHHHHHHHH
Q psy768 429 RANNLESKMKE 439 (475)
Q Consensus 429 RI~~L~~~i~~ 439 (475)
||++|++...+
T Consensus 275 RI~~L~~~~~~ 285 (288)
T PRK03072 275 RIARLEQMAGR 285 (288)
T ss_pred HHHHHHHHhhh
Confidence 99999977543
No 10
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=99.92 E-value=1.3e-24 Score=218.24 Aligned_cols=188 Identities=26% Similarity=0.300 Sum_probs=123.1
Q ss_pred HHHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeCC----CeEEEehhhhhhcCCHHHHHHHHHHHHHhhhccc
Q psy768 227 KRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPD----GRIFMFTGMFQLCQTDDELATVLSHELSHTLLKH 302 (475)
Q Consensus 227 ~~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalpg----G~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH 302 (475)
......+.++...+. .+..++++|++++.+|||++|+ |+|+||+||++.+ |+|||++|||||+||+++||
T Consensus 99 ~~~~~~v~~~a~~~~-----~~~~~~v~i~~~~~~NAFa~g~~~~~~~V~vt~gLl~~l-~~dEl~aVlaHElgHi~~rd 172 (302)
T COG0501 99 AVLLLKVAELARQAG-----IPHMPEVYILETPQPNAFALGGGPKNGRVVVTTGLLDLL-NDDELEAVLAHELGHIKNRH 172 (302)
T ss_pred HHHHHHHHHHHHHCC-----CCCCCeeEEecCCCccceecCCCCCCeeEEecHHHHhhC-CHHHHHHHHHHHHHHHhccc
Confidence 344445555544321 1347899999999999999998 6899999999976 68999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHhccchHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHHHH
Q psy768 303 VAEKLSNKTFLEILYIVPLM-IIWFLLPDLGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEADEV 381 (475)
Q Consensus 303 ~~~~~~~~~l~~~l~~~~~~-~i~~~~~~~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD~~ 381 (475)
...++............+.. +.+.............+...+... ....+.+.|||.+|++||++
T Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~SR~~E~~AD~~ 237 (302)
T COG0501 173 TLVRLTLRGLLASAFVLLATLALAAGLLGEAALALLLLLLLLALF---------------LATLLVLAFSRKREYEADRF 237 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHH---------------HHHHHHHHHHHHhHHhhhhc
Confidence 99865544433222222111 111110000000011100000000 01127889999999999999
Q ss_pred HHHHHHHcCCCchhHHHHHHHHHhhcc----ccccCCC-----cccccCCCCChHHHHHHHHHHHHHH
Q psy768 382 GLKLMARACYDVRVAPLFWQKMALKET----QDQVGPK-----MEEYLSTHPSHENRANNLESKMKEA 440 (475)
Q Consensus 382 gl~ll~~AGydp~a~~~~l~kl~~~~~----~~~~g~~-----~~~~lsTHP~~~~RI~~L~~~i~~a 440 (475)
|++++. |+.+...++|+....+ .....+. ...+++|||++.+||++|+++.+..
T Consensus 238 a~~l~~-----~~~l~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~~~~stHP~~~~Ri~~L~~~~~~~ 300 (302)
T COG0501 238 AAKLTG-----PEKLASALQKLARLSGRANSKAFIASGFSGGRLQALFSTHPPLAERIAALRQLALTV 300 (302)
T ss_pred hhhccC-----hHHHHHHHHHHHhhhcccchhhhhcCcccccchHHHHhcCCCHHHHHHHHHHHHHhh
Confidence 988874 7888999999887431 1111111 1479999999999999999987653
No 11
>PRK04897 heat shock protein HtpX; Provisional
Probab=99.92 E-value=1.1e-23 Score=212.64 Aligned_cols=186 Identities=19% Similarity=0.265 Sum_probs=118.1
Q ss_pred HHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeCC----CeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccH
Q psy768 228 RVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPD----GRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHV 303 (475)
Q Consensus 228 ~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalpg----G~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~ 303 (475)
++.+++++++.... +| .-++++++++.+|||++|. +.|+|++||++.+ |+|||++|||||+||++++|.
T Consensus 81 ~L~~~v~~la~~~g-ip-----~p~v~v~~~~~~NAfa~G~~~~~~~v~vt~gLl~~l-~~~El~aVlAHElgHi~~~d~ 153 (298)
T PRK04897 81 ELWHIVEDMAMVAQ-IP-----MPRVFIIDDPSPNAFATGSSPKNAAVAVTTGLLAIM-NREELEGVIGHEISHIRNYDI 153 (298)
T ss_pred HHHHHHHHHHHHcC-CC-----CCcEEEecCCCCceEEeccCCCCcEEEeehHHHhhC-CHHHHHHHHHHHHHHHhcCCH
Confidence 46666666665432 22 2389999999999999973 3699999999999 679999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHH----HHHhc------cc-hHH-HHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccc
Q psy768 304 AEKLSNKTFLEILYIVPLMI----IWFLL------PD-LGA-IVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFE 371 (475)
Q Consensus 304 ~~~~~~~~l~~~l~~~~~~~----i~~~~------~~-~~~-~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~S 371 (475)
........+..++..+.... .+... .+ ..+ .+...+...+..++ .|+ ...++.+.+|
T Consensus 154 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~--~~~---------~~~ll~~~~S 222 (298)
T PRK04897 154 RLSTIAVALASAITLLSDIAGRMMWWGGGSRRRDDDRDGGGLQIILLIVSLLLLIL--APL---------AATLIQLAIS 222 (298)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccchhhHHHHHHHHHHHHH--HHH---------HHHHHHHhhh
Confidence 87654433222221111100 11110 00 000 00000000000000 011 1123567899
Q ss_pred hhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhcccccc----------CCC-----cccccCCCCChHHHHHHHHH
Q psy768 372 REMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQDQV----------GPK-----MEEYLSTHPSHENRANNLES 435 (475)
Q Consensus 372 R~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~~~----------g~~-----~~~~lsTHP~~~~RI~~L~~ 435 (475)
|.+|++||++|+++. +||++++++++|+...+..... .++ ...|++|||++++||++|++
T Consensus 223 R~rE~~AD~~A~~lt----~~p~~La~AL~KL~~~~~~~~~~~~~~~~~~i~~p~~~~~~~~lfsTHP~~~eRI~~L~~ 297 (298)
T PRK04897 223 RQREYLADASSVELT----RNPQGLISALEKISNSQPMKHPVDDASAALYISDPLKKKGLSKLFDTHPPIEERIERLKN 297 (298)
T ss_pred HHHHHHHHHHHHHHh----CCHHHHHHHHHHHHhccccccccChHHHHhccCCCcccchHHHHHcCCcCHHHHHHHHHc
Confidence 999999999988875 6899999999999865421100 011 24689999999999999975
No 12
>PRK02391 heat shock protein HtpX; Provisional
Probab=99.92 E-value=5.6e-24 Score=214.21 Aligned_cols=198 Identities=24% Similarity=0.222 Sum_probs=121.7
Q ss_pred CCCCCCCHHHHHHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeCCC----eEEEehhhhhhcCCHHHHHHHHH
Q psy768 217 QVLPLGHPAYKRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPDG----RIFMFTGMFQLCQTDDELATVLS 292 (475)
Q Consensus 217 ~~lp~~dp~~~~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalpgG----~IvV~~GLL~~l~~edELaaVLA 292 (475)
++.|.++|+ +.+++++++...+ .| .-++++++++.+|||++|.+ .|+|++||++.+ |+||+++|||
T Consensus 69 ~~~~~~~p~---L~~~v~~la~~~~-~~-----~p~v~v~~~~~~NAfa~G~~~~~~~V~vt~gLl~~L-~~~El~aVla 138 (296)
T PRK02391 69 IVSEDEYPE---LHAMVERLCALAD-LP-----KPRVAVADSDVPNAFATGRSPKNAVVCVTTGLMRRL-DPDELEAVLA 138 (296)
T ss_pred ECChhhCHH---HHHHHHHHHHHcC-CC-----CCcEEEEeCCCCceEEecCCCCCcEEEecHHHHhhC-CHHHHHHHHH
Confidence 345556664 4455555554321 22 23789999999999999743 489999999998 5799999999
Q ss_pred HHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHhc-c-c-hHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccc
Q psy768 293 HELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWFLL-P-D-LGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFELP 369 (475)
Q Consensus 293 HElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~~~~-~-~-~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~ 369 (475)
||+||+.++|.........+..+...+....++... . + .....+.++...+...+ . .....++.+.
T Consensus 139 HElgHi~~~di~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~--~~~~~~l~~~ 207 (296)
T PRK02391 139 HELSHVKNRDVAVMTIASFLSTIAFLIVRWGFYFGGFGGRGGGGGGGGILVVILVSLV---------V--WAISFLLIRA 207 (296)
T ss_pred HHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCcccchHHHHHHHHHHH---------H--HHHHHHHHHH
Confidence 999999999998754433222222111111111110 0 0 00000000000000000 0 0011125678
Q ss_pred cchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhccc----c-----------c----cCCCcccccCCCCChHHHH
Q psy768 370 FEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQ----D-----------Q----VGPKMEEYLSTHPSHENRA 430 (475)
Q Consensus 370 ~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~----~-----------~----~g~~~~~~lsTHP~~~~RI 430 (475)
+||.+|++||++|+++. .||.++++++.|+....+. + . .+.....|++|||++++||
T Consensus 208 ~SR~rE~~AD~~Aa~lt----g~p~~LasAL~KL~~~~~~~~~~~~~~~~~~~~~~i~p~~~~~~~~~l~sTHP~~~eRI 283 (296)
T PRK02391 208 LSRYREFAADRGAAIIT----GRPSALASALMKISGRMDRVPTEDLREAEGMNAFFIIPALSGGSLGRLFSTHPPLEKRI 283 (296)
T ss_pred HHHHHHHHHhHHHHHHh----CCHHHHHHHHHHHHcccccCCchhhcccHHHhhhcccCCCCcchHHHHhcCCcCHHHHH
Confidence 99999999999988765 4899999999999864210 0 0 0112257899999999999
Q ss_pred HHHHHHHHH
Q psy768 431 NNLESKMKE 439 (475)
Q Consensus 431 ~~L~~~i~~ 439 (475)
++|++..++
T Consensus 284 ~~L~~~~~~ 292 (296)
T PRK02391 284 AQLEKLERE 292 (296)
T ss_pred HHHHHHHHh
Confidence 999976543
No 13
>PRK05457 heat shock protein HtpX; Provisional
Probab=99.91 E-value=2.8e-23 Score=208.05 Aligned_cols=185 Identities=22% Similarity=0.230 Sum_probs=116.8
Q ss_pred HHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeCCC----eEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccH
Q psy768 228 RVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPDG----RIFMFTGMFQLCQTDDELATVLSHELSHTLLKHV 303 (475)
Q Consensus 228 ~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalpgG----~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~ 303 (475)
++.+.+++++...+ .| .-++++++++.+|||++|.+ .|+|++||++.+ |+||+++|+|||+||+.++|.
T Consensus 78 ~L~~~v~~la~~~g-~p-----~p~v~v~~~~~~NAfa~G~~~~~~~V~vt~gLl~~L-~~~El~aVlAHElgHi~~~d~ 150 (284)
T PRK05457 78 WLVETVARQARQAG-IG-----MPEVAIYHSPEINAFATGASKNNSLVAVSTGLLQNM-SRDEVEAVLAHEISHIANGDM 150 (284)
T ss_pred HHHHHHHHHHHhCC-CC-----CCCEEEEeCCCceEEEecCCCCCeEEEeehHHhhhC-CHHHHHHHHHHHHHHHHcCCH
Confidence 57777777766532 22 34789999999999999843 489999999999 679999999999999999998
Q ss_pred HHHHHHHHHHHHHHHHHHHH----HHHhcc-c-hHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHH
Q psy768 304 AEKLSNKTFLEILYIVPLMI----IWFLLP-D-LGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETE 377 (475)
Q Consensus 304 ~~~~~~~~l~~~l~~~~~~~----i~~~~~-~-~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~E 377 (475)
........+...+...+... ...... + ....++..+...+..++ +++ ...++.+.+||.+|++
T Consensus 151 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~--~~~---------~~~ll~~~~SR~rEy~ 219 (284)
T PRK05457 151 VTMTLIQGVVNTFVIFLSRIIAQIVDRFVSGNEEGNGIGYFIVSIVLEIV--FGI---------LASIIVMWFSRHREFR 219 (284)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccHHHHHHHHHHHHH--HHH---------HHHHHHHHHHhHHHHH
Confidence 86533222221111111100 000000 0 00001100000000000 000 1113567899999999
Q ss_pred HHHHHHHHHHHcCCCchhHHHHHHHHHhhcccccc---------CC-CcccccCCCCChHHHHHHHHH
Q psy768 378 ADEVGLKLMARACYDVRVAPLFWQKMALKETQDQV---------GP-KMEEYLSTHPSHENRANNLES 435 (475)
Q Consensus 378 AD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~~~---------g~-~~~~~lsTHP~~~~RI~~L~~ 435 (475)
||++|.++. | |++++++++|+.+....... +. ....|++|||++++||++|++
T Consensus 220 AD~~Aa~lt---g--p~~L~~AL~KL~~~~~~~~~~~~~~~~i~~~~~~~~lfsTHP~~~eRI~~L~~ 282 (284)
T PRK05457 220 ADAGGAKLA---G--REKMIAALQRLKTSYEPQLPGSMAAFGINGKSGLSELFMSHPPLEKRIAALRS 282 (284)
T ss_pred HHHHHHHhh---C--HHHHHHHHHHHHhhCccCCChHHHHhhccCchhHHHHHcCCcCHHHHHHHHHh
Confidence 999999885 3 89999999999876521000 10 135789999999999999976
No 14
>PRK01265 heat shock protein HtpX; Provisional
Probab=99.90 E-value=5.8e-23 Score=208.27 Aligned_cols=196 Identities=20% Similarity=0.223 Sum_probs=121.4
Q ss_pred CCCCCCCHHHHHHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeC----CCeEEEehhhhhhcCCHHHHHHHHH
Q psy768 217 QVLPLGHPAYKRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFP----DGRIFMFTGMFQLCQTDDELATVLS 292 (475)
Q Consensus 217 ~~lp~~dp~~~~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalp----gG~IvV~~GLL~~l~~edELaaVLA 292 (475)
++.|.+ |...++.+++++++.... .| ..+++++|++.+|||+++ +++|+|++||++.+ |+|||.+|||
T Consensus 74 ~~~p~~-~~~~~L~~~v~~la~~~g-~~-----~p~vyv~~~~~~NAfa~G~~~~~~~Ivvt~gLl~~l-~~~El~aVlA 145 (324)
T PRK01265 74 EVTPTD-PVYGWLYSIVAEVAKYNG-IR-----VPKVYIADVPFPNAFAYGSPIAGKRIAITLPLLKIL-NRDEIKAVAG 145 (324)
T ss_pred eCCCCC-cccHHHHHHHHHHHHHcC-CC-----CCeEEEecCCCCCeEEeccCCCCCEEEEehHHHhhC-CHHHHHHHHH
Confidence 444443 222345666666665431 22 348999999999999975 45799999999999 6799999999
Q ss_pred HHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHhc-------cchHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhh
Q psy768 293 HELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWFLL-------PDLGAIVTQVLLDDLKSIIFELPFEREMETEADESII 365 (475)
Q Consensus 293 HElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~~~~-------~~~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~ 365 (475)
||+||++++|....+....+..++..+.....|... .+.... +..+...+. .++ ..+ ..+
T Consensus 146 HElgHik~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~l~l~~ll~-----~~~-----~~i--~~~ 212 (324)
T PRK01265 146 HELGHLKHRDVELLMAIGLIPTLIYYLGYSLFWGGMFGGGGGGRGNNGG-LLFLIGIAL-----MAV-----SFV--FNL 212 (324)
T ss_pred HHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCCCccch-HHHHHHHHH-----HHH-----HHH--HHH
Confidence 999999999998765443332222221111122110 000000 000000000 000 001 112
Q ss_pred hccccchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhccc--------c---------------ccC---------
Q psy768 366 FELPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQ--------D---------------QVG--------- 413 (475)
Q Consensus 366 ~~~~~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~--------~---------------~~g--------- 413 (475)
+.+.+||.+|++||++|.+.+ +-+|.++.++++|+...... . ..+
T Consensus 213 l~~aiSR~rEy~AD~~aa~~~---tg~p~~LasAL~KL~~~~~~~~~~~~~~~~~~~~~~~~l~i~~~~~~~~~~~~~~~ 289 (324)
T PRK01265 213 LVLSINRMREAYADVNSALTV---PGGAENLQTALAKITLSMDPGALERFKKKSTTNQMASMLFFSNAIEEVPTWDAREL 289 (324)
T ss_pred HHHHHHHHHHHHHhHHHHHhh---cCChHHHHHHHHHHHhcccccccccccccccCChhhhhheeeCcccccccchhhhh
Confidence 667899999999999987643 23789999999999853210 0 001
Q ss_pred ---------CCcccccCCCCChHHHHHHHHHH
Q psy768 414 ---------PKMEEYLSTHPSHENRANNLESK 436 (475)
Q Consensus 414 ---------~~~~~~lsTHP~~~~RI~~L~~~ 436 (475)
+...++|||||++++||++|++.
T Consensus 290 ~~~~~~~~~~~~~~lFsTHPp~e~Ri~rL~~~ 321 (324)
T PRK01265 290 VEYWKTTKVPWYADIFSDHPHPAKRIQLLEKL 321 (324)
T ss_pred hhhhcccchhHHHHHhCCCcChHHHHHHHHHh
Confidence 01246799999999999999985
No 15
>KOG2719|consensus
Probab=99.72 E-value=8.4e-17 Score=165.88 Aligned_cols=198 Identities=17% Similarity=0.244 Sum_probs=127.4
Q ss_pred cCCCCCCCCHHH-HHHHHHHHHHHHhccccccCCCcce-eEEEEec----CCcceeEeCCC---eEEEehhhhh--hcCC
Q psy768 215 GNQVLPLGHPAY-KRVGAVVKRLIDANKVYMEHNNFKY-PITIIDD----PLINAFVFPDG---RIFMFTGMFQ--LCQT 283 (475)
Q Consensus 215 ~~~~lp~~dp~~-~~v~~i~~rL~~~~~~~p~~~~~~~-~v~Vv~~----~~~NAFalpgG---~IvV~~GLL~--~l~~ 283 (475)
-++..|..+... +.++++++++. .|- +++|++. ...|||.+|-+ +|+++|.|+. ...|
T Consensus 208 f~K~~PLe~g~l~~~Ie~la~s~g-----------fp~~k~~vi~~s~rs~hsNAyfyG~~~~KRIvIyDtLl~~~~~~~ 276 (428)
T KOG2719|consen 208 FGKFTPLEEGDLKEKIERLADSVG-----------FPLSKYRVIDGSKRSSHSNAYFYGLCKNKRIVIYDTLLLEEEHLN 276 (428)
T ss_pred hcCCCCCCCCchHHHHHHHHHhcC-----------CCceEEEEEecCCCCCCCCeeeeeccccceEEEehhhhhhhhccc
Confidence 456667666544 46777766653 222 4667772 27999999855 6999999993 3336
Q ss_pred HHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHH------hcc--c-hHHHHHHHHHHHHHHHHhhcchhh
Q psy768 284 DDELATVLSHELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWF------LLP--D-LGAIVTQVLLDDLKSIIFELPFER 354 (475)
Q Consensus 284 edELaaVLAHElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~~------~~~--~-~~~~l~~~~~~~l~~~l~~lp~~~ 354 (475)
+||+++|+|||+||+.++|..++....++..++.+.+.++++. .++ + ....+|-++ +..+. ..|...
T Consensus 277 ~eel~AVl~HELGHW~~~H~~K~~ii~~~~l~l~~llF~~~~~~~~ly~a~Gf~~~~P~~ig~li---vf~~~-l~py~~ 352 (428)
T KOG2719|consen 277 NEELVAVLAHELGHWKLNHVLKNIIIMQIHLFLEFLLFGFLYRNPKLYAAFGFIDEQPSLIGFLI---VFQFV-LAPYRA 352 (428)
T ss_pred cHHHHHHHHHHhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHcCcchheeecCCCCCcchhHHHH---HHHHH-HHhHHH
Confidence 6899999999999999999999988877766665554444432 111 1 001122110 00110 011111
Q ss_pred hhhhhhhhhhhhccccchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhccccccCCCc-ccccCCCCChHHHHHHH
Q psy768 355 EMETEADESIIFELPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQDQVGPKM-EEYLSTHPSHENRANNL 433 (475)
Q Consensus 355 ~l~~~~~~~~~~~~~~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~~~g~~~-~~~lsTHP~~~~RI~~L 433 (475)
. .....+-+||..|++||++| .+-||.- .+..++-||...+-.-...++. ..|-+|||+.-+|++++
T Consensus 353 l-------~~~~~n~~sR~fEyqAD~fA----~klGYg~-~L~~AL~KL~~dnlsf~~~D~LYs~~~~~HPtvleRl~~l 420 (428)
T KOG2719|consen 353 L-------LNFLMNLISRRFEYQADAFA----KKLGYGK-DLRQALIKLFVDNLSFPVSDPLYSAWHHLHPTVLERLDAL 420 (428)
T ss_pred H-------HHHHHHHHHHHHHHHHHHHH----HHcCCch-hHHHHHHHHhhhhcCCCCCcHHHHHHHhcCchHHHHHHHH
Confidence 1 11267889999999999994 6668864 5677777887655111112222 56789999999999999
Q ss_pred HHHHHH
Q psy768 434 ESKMKE 439 (475)
Q Consensus 434 ~~~i~~ 439 (475)
+....+
T Consensus 421 ~~~~~k 426 (428)
T KOG2719|consen 421 DYLSEK 426 (428)
T ss_pred HHHHhh
Confidence 987654
No 16
>PF05569 Peptidase_M56: BlaR1 peptidase M56; InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription.
Probab=98.20 E-value=3.1e-05 Score=78.35 Aligned_cols=136 Identities=21% Similarity=0.284 Sum_probs=83.3
Q ss_pred EEEEecC-CcceeEeCC--CeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHhcc
Q psy768 253 ITIIDDP-LINAFVFPD--GRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWFLLP 329 (475)
Q Consensus 253 v~Vv~~~-~~NAFalpg--G~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~~~~~ 329 (475)
+.++.++ ....|++|- -+|++..++.+.+ |++|+..||.||++|+.++|....+-... +..+.|. .|
T Consensus 160 ~~i~~s~~i~sP~~~G~~~p~I~lP~~~~~~~-~~~el~~il~HEl~Hikr~D~~~~~l~~l--------~~~l~Wf-nP 229 (299)
T PF05569_consen 160 IRIRVSSGISSPFVFGFLRPVIVLPESLLEDL-SEEELRAILLHELAHIKRRDLLWKLLAEL--------LCALHWF-NP 229 (299)
T ss_pred eEEEEcCCCCCCeeecCcceEEEecCcccccc-CHHHHHHHHHHHHHHHHCCChHHHHHHHH--------HHHHHHh-hH
Confidence 3344333 456777753 3799999999888 57999999999999999999986542221 1122221 11
Q ss_pred chHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhccc
Q psy768 330 DLGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQ 409 (475)
Q Consensus 330 ~~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~ 409 (475)
+.-+ +...+.+.+|+.||+..++.+. +-+.......+-++......
T Consensus 230 -------------~~~~-------------------~~~~~~~~~E~~cD~~vl~~l~--~~~~~~Y~~~Ll~~~~~~~~ 275 (299)
T PF05569_consen 230 -------------LVWL-------------------LRRRIRRDRELACDEAVLRNLG--KEERKAYAETLLKVAKRSQQ 275 (299)
T ss_pred -------------HHHH-------------------HHHHHHHHHHHhhhHHHHHhcC--chhHHHHHHHHHHHHHhhcC
Confidence 1111 4567889999999999877642 23455666666566544411
Q ss_pred cccCCCc-ccccCCCCChHHHHHHH
Q psy768 410 DQVGPKM-EEYLSTHPSHENRANNL 433 (475)
Q Consensus 410 ~~~g~~~-~~~lsTHP~~~~RI~~L 433 (475)
...... ..+..+.-..++||..|
T Consensus 276 -~~~~~~~~~~~~~~~~lk~RI~~I 299 (299)
T PF05569_consen 276 -FKRPPLASSFAFSKSQLKRRIKMI 299 (299)
T ss_pred -CCcchhhhhccCChHHHHHHHHhC
Confidence 001111 12234666778887653
No 17
>PF10463 Peptidase_U49: Peptidase U49; InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death.
Probab=97.67 E-value=9.8e-05 Score=70.79 Aligned_cols=60 Identities=18% Similarity=0.119 Sum_probs=34.0
Q ss_pred ccchhhhHHHHHHHHHHHHHcCCC-----chhHHHHHHHHH--hhccccccCCCcccccCCCCChHHHHHHH
Q psy768 369 PFEREMETEADEVGLKLMARACYD-----VRVAPLFWQKMA--LKETQDQVGPKMEEYLSTHPSHENRANNL 433 (475)
Q Consensus 369 ~~SR~~E~EAD~~gl~ll~~AGyd-----p~a~~~~l~kl~--~~~~~~~~g~~~~~~lsTHP~~~~RI~~L 433 (475)
..+++.|.+||.+|..++-....+ -+.+.-....+. ... -......-.|||+..+||.++
T Consensus 123 ~~~~~eE~~AD~~A~~~il~~~~~~~~~~~r~~gIa~al~~i~~l~-----~~~~~~~~~tHP~~~~Ri~~~ 189 (206)
T PF10463_consen 123 SQSIQEEKEADSYATEMILSDVGNYAEERKRNLGIAIALFFISLLF-----PENSLNGTTTHPDIDDRIFNI 189 (206)
T ss_pred chhHHHHHhhhHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHh-----cccccCCCCCCcCHHHHHHHH
Confidence 456778999999999998766555 111111111111 111 011122235999999999665
No 18
>PRK09672 phage exclusion protein Lit; Provisional
Probab=97.67 E-value=0.00013 Score=72.91 Aligned_cols=60 Identities=15% Similarity=-0.004 Sum_probs=38.0
Q ss_pred ccchhhhHHHHHHHHHHHHHcCCC--chh-----HHHHHHHHHhhccccccCCCcccccCCCCChHHHHHHH
Q psy768 369 PFEREMETEADEVGLKLMARACYD--VRV-----APLFWQKMALKETQDQVGPKMEEYLSTHPSHENRANNL 433 (475)
Q Consensus 369 ~~SR~~E~EAD~~gl~ll~~AGyd--p~a-----~~~~l~kl~~~~~~~~~g~~~~~~lsTHP~~~~RI~~L 433 (475)
..|++.|.+||.+|..++-...++ +.. ..-++--+.... ........+|||+..+||.+.
T Consensus 187 e~s~~eE~eaDs~At~~il~~~~~~a~s~Kr~vgIaia~l~l~~l~-----~~~~~~g~~tHP~~~~RI~a~ 253 (305)
T PRK09672 187 EDSIQEEKEADSYATNWLLSKSEKYAPSKKRSVGIAIALLFLQELE-----LENKSCGKGTHPDADQRIFAN 253 (305)
T ss_pred hHHHHHHHHHHHHHHHHHHhccccccHHHhhhHHHHHHHHHHHHHh-----ccccCCCCCCCCCHHHHHHHh
Confidence 456788999999999999987777 322 111111222222 111122246999999999987
No 19
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=97.20 E-value=0.0015 Score=66.14 Aligned_cols=131 Identities=21% Similarity=0.171 Sum_probs=71.2
Q ss_pred CcceeEeC-CCeEEEehhhhhhcC------CHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHhccchH
Q psy768 260 LINAFVFP-DGRIFMFTGMFQLCQ------TDDELATVLSHELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWFLLPDLG 332 (475)
Q Consensus 260 ~~NAFalp-gG~IvV~~GLL~~l~------~edELaaVLAHElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~~~~~~~~ 332 (475)
..|||=.| +.+||+....++.+. .+-..+.|||||.||.+.+..- ++.-
T Consensus 136 ~~gpFYCp~D~tIYlD~~f~~~L~~~~ga~G~~a~ayVlAHEyGHHVQ~l~G----------il~~-------------- 191 (292)
T PF04228_consen 136 ATGPFYCPADQTIYLDLSFFDELQQRFGASGDFAQAYVLAHEYGHHVQNLLG----------ILDA-------------- 191 (292)
T ss_pred CCCCEeCCCCCEEEechHHHHHHHHHhCCccHHHHHHHHHHHHHHHHHHHhh----------hHHH--------------
Confidence 35667666 568999988766552 2345678999999998654221 0000
Q ss_pred HHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHHHHHHHHHHHcC----CCchhHHHHHHHHHhhcc
Q psy768 333 AIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEADEVGLKLMARAC----YDVRVAPLFWQKMALKET 408 (475)
Q Consensus 333 ~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD~~gl~ll~~AG----ydp~a~~~~l~kl~~~~~ 408 (475)
+.... ..... --.+..||..|.+||.||=-++..+- +++....+.+.......+
T Consensus 192 --~~~~~-~~~~~-------------------~~~~~~svr~ELQADC~AGvw~~~~~~~~~l~~~di~~al~aa~aiGD 249 (292)
T PF04228_consen 192 --VRQAQ-QGRSP-------------------AEANELSVRLELQADCFAGVWAGHAAEDGSLTPGDIEEALNAASAIGD 249 (292)
T ss_pred --HHHHh-hccCc-------------------cccchHHHHHHHHHHHHHHHHhhhccccCCcCHHHHHHHHHHHHHhcC
Confidence 00000 00000 02356799999999999988876553 233333333333222211
Q ss_pred ---ccc-cCCCcccccCCCCChHHHHHHHHHHH
Q psy768 409 ---QDQ-VGPKMEEYLSTHPSHENRANNLESKM 437 (475)
Q Consensus 409 ---~~~-~g~~~~~~lsTHP~~~~RI~~L~~~i 437 (475)
++. .+...|+ -.||=+.++|+.-.++=.
T Consensus 250 D~iq~~~~g~v~pd-s~tHGSs~qR~~Wf~~G~ 281 (292)
T PF04228_consen 250 DRIQKRAGGRVVPD-SFTHGSSEQRVRWFQRGY 281 (292)
T ss_pred hHhhhhcCCCCCCC-CcCcCCHHHHHHHHHHHh
Confidence 011 1111122 139999999987665543
No 20
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=97.06 E-value=0.0031 Score=53.65 Aligned_cols=50 Identities=28% Similarity=0.439 Sum_probs=31.2
Q ss_pred eeEEEEecC--CcceeEeC---CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHH
Q psy768 251 YPITIIDDP--LINAFVFP---DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAE 305 (475)
Q Consensus 251 ~~v~Vv~~~--~~NAFalp---gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~ 305 (475)
.+|...+.. ....+..+ .+.|+|+..+ ++.+-.+++|||+||+..+|...
T Consensus 6 i~v~~~~~~~~~~~~~~~~~~~~~~I~in~~~-----~~~~~~f~laHELgH~~~~~~~~ 60 (122)
T PF06114_consen 6 IDVVYTDLGPKDLGGFSIPKENNPIIFINSNL-----SPERQRFTLAHELGHILLHHGDE 60 (122)
T ss_dssp -EEEEE--TT--EEEETT----TTEEEEESSS------HHHHHHHHHHHHHHHHHHH-HH
T ss_pred EEEEEEcCCCCCcCEEEEecCcCCEEEECCCC-----CHHHHHHHHHHHHHHHHhhhccc
Confidence 344444443 33334333 5578888873 57888999999999999888764
No 21
>COG4219 MecR1 Antirepressor regulating drug resistance, predicted signal transduction N-terminal membrane component [Transcription / Signal transduction mechanisms]
Probab=96.85 E-value=0.038 Score=56.14 Aligned_cols=53 Identities=25% Similarity=0.281 Sum_probs=40.3
Q ss_pred EEEEecC-CcceeEeC--CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHHH
Q psy768 253 ITIIDDP-LINAFVFP--DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAEK 306 (475)
Q Consensus 253 v~Vv~~~-~~NAFalp--gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~~ 306 (475)
+.+..+. ..|-.+++ --.|++-+.+.+.++ ++|+..|++||++|++++|....
T Consensus 154 i~ir~s~~i~~P~v~gl~kp~IvlP~d~~~r~~-~ee~~yIilHEl~Hlk~gD~i~n 209 (337)
T COG4219 154 ILIRKSKAIDGPMVFGLVKPCIVLPADFVERLT-DEELKYIILHELSHLKRGDAIIN 209 (337)
T ss_pred eeEeecccCCCceeeccCcceEEccHHHHhhcC-HHhhhhhHhHHHhhhhcccHHHH
Confidence 4555543 33444433 446999999999995 68999999999999999998764
No 22
>PF13203 DUF2201_N: Putative metallopeptidase domain
Probab=96.13 E-value=0.009 Score=60.27 Aligned_cols=56 Identities=20% Similarity=0.281 Sum_probs=47.7
Q ss_pred eEEEEecCCcceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHHHHH
Q psy768 252 PITIIDDPLINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAEKLS 308 (475)
Q Consensus 252 ~v~Vv~~~~~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~~~~ 308 (475)
.+.++.++.+..+++.|.+|+++-..+..+ +.+|+.+||+||+-|++.+|....-.
T Consensus 26 ~l~~~~~~~~~t~~tDg~~l~~nP~~~~~l-~~~~~~~~l~HevlH~~~~H~~r~~~ 81 (292)
T PF13203_consen 26 SLPPVEDDGIPTAATDGRRLYYNPEFLESL-SPEERVGLLLHEVLHCLLRHPWRRGG 81 (292)
T ss_pred hCeeeeCCCCceeeEcCcEEEECcHHHhcC-CHHHHHHHHHHHHHHHHccchhhhcc
Confidence 344455777888999999999999999888 57899999999999999999987655
No 23
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=94.27 E-value=0.24 Score=49.28 Aligned_cols=39 Identities=26% Similarity=0.479 Sum_probs=29.5
Q ss_pred eeEeC-CCeEEEehhhhhhcC------CHHHHHHHHHHHHHhhhcc
Q psy768 263 AFVFP-DGRIFMFTGMFQLCQ------TDDELATVLSHELSHTLLK 301 (475)
Q Consensus 263 AFalp-gG~IvV~~GLL~~l~------~edELaaVLAHElgHv~~r 301 (475)
-|-.| +.++|+.....+.|+ .+=.-+.|+|||.||++.+
T Consensus 136 PFYCP~D~kvYlDlsFf~~m~~~fga~GdfAqaYViAHEVGHHVQn 181 (295)
T COG2321 136 PFYCPADTKVYLDLSFFDEMKTKFGASGDFAQAYVIAHEVGHHVQN 181 (295)
T ss_pred CeecCCCceEEEehhHHHHHHHHhcCCccHHHHHHHHhhhhHHHHH
Confidence 45556 468999999988874 3345689999999998653
No 24
>PRK04351 hypothetical protein; Provisional
Probab=93.85 E-value=0.22 Score=45.60 Aligned_cols=38 Identities=24% Similarity=0.314 Sum_probs=31.3
Q ss_pred CcceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhh
Q psy768 260 LINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHT 298 (475)
Q Consensus 260 ~~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv 298 (475)
..-++-+....|-++.-+++.. +++.|..||+|||+|+
T Consensus 35 tgG~~~l~~~~I~lnp~ll~~~-~~~~l~~vv~HElcH~ 72 (149)
T PRK04351 35 TGGRYLLKDHHIEFNPKMLEEY-GLEELIGIIKHELCHY 72 (149)
T ss_pred hhheeecCCCeEEeCHHHHhhc-cHHHHHhhHHHHHHHH
Confidence 3445556678899999998875 5799999999999997
No 25
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=92.33 E-value=0.34 Score=44.02 Aligned_cols=41 Identities=17% Similarity=0.266 Sum_probs=32.1
Q ss_pred ceeEeCCC---eEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccH
Q psy768 262 NAFVFPDG---RIFMFTGMFQLCQTDDELATVLSHELSHTLLKHV 303 (475)
Q Consensus 262 NAFalpgG---~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~ 303 (475)
-.+...++ .|.++..+++. .+++++..+|.|||+|+...+.
T Consensus 33 G~~~~~~~~~~~I~ls~~~~~~-~~~~~~~~tL~HEm~H~~~~~~ 76 (157)
T PF10263_consen 33 GRCRYKRRSPCEIRLSPKLLDR-NPEEELIDTLLHEMAHAAAYVF 76 (157)
T ss_pred EEEEECCCCceEEEECHHHHHh-hHHHHHHHHHHHHHHHHHhhhc
Confidence 33445555 69999999988 4688999999999999976433
No 26
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=92.12 E-value=5.1 Score=39.10 Aligned_cols=50 Identities=22% Similarity=0.183 Sum_probs=33.8
Q ss_pred cceeEEEEecCCcceeEeCCC-eEEEehhhhhhcC----C------------HHHHHHHHHHHHHhh
Q psy768 249 FKYPITIIDDPLINAFVFPDG-RIFMFTGMFQLCQ----T------------DDELATVLSHELSHT 298 (475)
Q Consensus 249 ~~~~v~Vv~~~~~NAFalpgG-~IvV~~GLL~~l~----~------------edELaaVLAHElgHv 298 (475)
.+..|.+-...+.|||=-|.. .|.|--.+++... + .+-+.+++-||+||.
T Consensus 37 ~~l~i~~~~CGe~nA~ydPe~~~I~iCYEf~~~~~~~f~~~~~~~~~~~~~~~~~~~~~l~HE~GHA 103 (220)
T PF14247_consen 37 RDLTIRFAECGEDNAFYDPENRSITICYEFVDEILDRFAKANDPDEEYGQAAIGNVLFTLYHELGHA 103 (220)
T ss_pred CCeEEEEeecCCCCCccCCCCCEEEECHHHHHHHHHHHHhCCcCcchHHHHHHHHHHHHHHHHHHHH
Confidence 345555555779999999954 6888777765531 0 234667788888876
No 27
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=92.12 E-value=0.12 Score=42.16 Aligned_cols=39 Identities=26% Similarity=0.408 Sum_probs=28.0
Q ss_pred CcceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcc
Q psy768 260 LINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLK 301 (475)
Q Consensus 260 ~~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~~r 301 (475)
...||+. |+.|++..|-.+.-.. +=..+||||++|+...
T Consensus 37 ~A~A~T~-G~~I~f~~g~~~~~s~--~~~~llaHEl~Hv~Qq 75 (79)
T PF13699_consen 37 GARAFTV-GNDIYFAPGKYNPDSP--EGRALLAHELAHVVQQ 75 (79)
T ss_pred CCeEEEE-CCEEEEcCCCcCCCCC--CcchhHhHHHHHHHhh
Confidence 5677776 7789997776544322 3467999999999753
No 28
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=89.67 E-value=0.37 Score=43.73 Aligned_cols=33 Identities=27% Similarity=0.315 Sum_probs=28.4
Q ss_pred CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcc
Q psy768 268 DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLK 301 (475)
Q Consensus 268 gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~r 301 (475)
.+.|.++..|++.. .++.+..||.|||+|+...
T Consensus 41 ~~~I~ln~~l~~~~-~~~~l~~~l~HEm~H~~~~ 73 (146)
T smart00731 41 SAEIRLNPKLLTEN-GRDRLRETLLHELCHAALY 73 (146)
T ss_pred CCEEEeCHHHHhhc-cHHHHHhhHHHHHHHHHHH
Confidence 67899999998865 4689999999999999873
No 29
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=88.77 E-value=3.6 Score=40.11 Aligned_cols=52 Identities=12% Similarity=0.213 Sum_probs=29.6
Q ss_pred ceeEEEEecCCcceeEeCCC-eEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHH
Q psy768 250 KYPITIIDDPLINAFVFPDG-RIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA 304 (475)
Q Consensus 250 ~~~v~Vv~~~~~NAFalpgG-~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~ 304 (475)
+.+|..++..--+=|- |.. .|-++++-.+.- |- .-.+|-|||.||..+++..
T Consensus 54 ~V~Ve~~~G~LtDHYd-P~~k~vrLS~~vy~~~-Si-aAvaVAAHEvGHAiQ~a~~ 106 (222)
T PF04298_consen 54 DVRVERVPGELTDHYD-PRNKVVRLSEDVYNGR-SI-AAVAVAAHEVGHAIQHAEG 106 (222)
T ss_pred CeeEEEeCCCCCCCcC-CCCCEEEeCCccCCCC-CH-HHHHHHHHHHhHHHhcccc
Confidence 3455555432122222 433 455666655442 32 4467999999999987753
No 30
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=87.36 E-value=3 Score=39.41 Aligned_cols=51 Identities=20% Similarity=0.117 Sum_probs=35.9
Q ss_pred eEEEEecCCcceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHH
Q psy768 252 PITIIDDPLINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA 304 (475)
Q Consensus 252 ~v~Vv~~~~~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~ 304 (475)
+|.|.+...-=.-+...|.|.++.-|+.. .++-+..|+.||++|..+.++.
T Consensus 131 ~i~ir~~ksrWGsc~~~~~I~ln~~L~~~--P~~~idYVvvHEL~Hl~~~nHs 181 (205)
T PF01863_consen 131 KIKIRDMKSRWGSCSSKGNITLNWRLVMA--PPEVIDYVVVHELCHLRHPNHS 181 (205)
T ss_pred eEEEeehhhccccCCCCCcEEeecccccC--CccHHHHHHHHHHHHhccCCCC
Confidence 45555544333334457899999988764 4578999999999999665443
No 31
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=87.04 E-value=2.2 Score=41.64 Aligned_cols=44 Identities=18% Similarity=0.123 Sum_probs=31.4
Q ss_pred CcceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHH
Q psy768 260 LINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAE 305 (475)
Q Consensus 260 ~~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~ 305 (475)
..=+-+...|.|.++.-|+.. .++.+..|++||++|....++..
T Consensus 150 ~~WGScs~~~~i~~~~~l~~~--p~~~i~YVvvHELaHLke~nHs~ 193 (223)
T COG1451 150 RRWGSCSKAGEIRFNWRLVMA--PEEVIDYVVVHELAHLKEKNHSK 193 (223)
T ss_pred ceeeeecCCCcEEeehhhhcC--CHHHHHHHHHHHHHHHhhhhccH
Confidence 333444566767776655443 57899999999999998877544
No 32
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=86.23 E-value=0.61 Score=45.05 Aligned_cols=34 Identities=21% Similarity=0.409 Sum_probs=24.3
Q ss_pred CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhh-cccHH
Q psy768 268 DGRIFMFTGMFQLCQTDDELATVLSHELSHTL-LKHVA 304 (475)
Q Consensus 268 gG~IvV~~GLL~~l~~edELaaVLAHElgHv~-~rH~~ 304 (475)
++.|.| .++ ... +-+.++.||+|||||.+ .||.-
T Consensus 117 ~~~I~I-~~~-~~~-~~~~~~hvi~HEiGH~IGfRHTD 151 (211)
T PF12388_consen 117 YKFIQI-YGL-SNY-SVNVIEHVITHEIGHCIGFRHTD 151 (211)
T ss_pred CceEEE-Eec-CCC-chhHHHHHHHHHhhhhccccccC
Confidence 345888 354 332 45789999999999987 57865
No 33
>PF10026 DUF2268: Predicted Zn-dependent protease (DUF2268); InterPro: IPR018728 This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function.
Probab=86.02 E-value=1.9 Score=41.06 Aligned_cols=33 Identities=27% Similarity=0.421 Sum_probs=25.0
Q ss_pred CeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccH
Q psy768 269 GRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHV 303 (475)
Q Consensus 269 G~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~ 303 (475)
|+|+++- +..-.+.++|.+++|||+-|+..-..
T Consensus 49 ~~i~l~~--~~~~~~~~~l~~~iaHE~hH~~r~~~ 81 (195)
T PF10026_consen 49 GYIFLFL--LPNDYSLEELPALIAHEYHHNCRYEQ 81 (195)
T ss_pred CEEEEEe--cCCcccHHHHHHHHHHHHHHHHHHhc
Confidence 4677775 44445678999999999999976443
No 34
>PF04450 BSP: Peptidase of plants and bacteria; InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=85.16 E-value=4.7 Score=38.87 Aligned_cols=49 Identities=24% Similarity=0.201 Sum_probs=35.1
Q ss_pred EEecCCcceeEeC---CCeEEEehhhhhhcCC----HHHHHHHHHHHHHhhhcccH
Q psy768 255 IIDDPLINAFVFP---DGRIFMFTGMFQLCQT----DDELATVLSHELSHTLLKHV 303 (475)
Q Consensus 255 Vv~~~~~NAFalp---gG~IvV~~GLL~~l~~----edELaaVLAHElgHv~~rH~ 303 (475)
++++..-=|++.+ ...|.++...|....+ .+|+.+||-|||.|+.+...
T Consensus 57 ~~~~~~gVA~t~gd~~~~~I~~S~~~i~~~~~~~~~~~Ei~Gvl~HE~~H~~Q~~~ 112 (205)
T PF04450_consen 57 ILDDMDGVAYTSGDDDHKEIHFSARYIAKYPADGDVRDEIIGVLYHEMVHCWQWDG 112 (205)
T ss_pred EEECCCeeEEEecCCCccEEEEeHHHHhhcccccchHHHHHHHHHHHHHHHhhcCC
Confidence 3444333344433 1469999999998853 35999999999999977654
No 35
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=82.03 E-value=1.4 Score=42.75 Aligned_cols=29 Identities=28% Similarity=0.385 Sum_probs=22.4
Q ss_pred CeEEEehhhhhhcCCHHHHHHHHHHHHHhhhccc
Q psy768 269 GRIFMFTGMFQLCQTDDELATVLSHELSHTLLKH 302 (475)
Q Consensus 269 G~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH 302 (475)
+.|+++.-. +++.=.++||||+||++++-
T Consensus 59 ~~I~iN~n~-----~~~r~rFtlAHELGH~llH~ 87 (213)
T COG2856 59 PVIYINANN-----SLERKRFTLAHELGHALLHT 87 (213)
T ss_pred ceEEEeCCC-----CHHHHHHHHHHHHhHHHhcc
Confidence 367776644 56788999999999998753
No 36
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=82.03 E-value=2.7 Score=37.40 Aligned_cols=47 Identities=26% Similarity=0.108 Sum_probs=23.8
Q ss_pred cceeEEEEecC-CcceeEeCCC--eEEEehhhhhhcCCHHHHHHHHHHHHHhhh
Q psy768 249 FKYPITIIDDP-LINAFVFPDG--RIFMFTGMFQLCQTDDELATVLSHELSHTL 299 (475)
Q Consensus 249 ~~~~v~Vv~~~-~~NAFalpgG--~IvV~~GLL~~l~~edELaaVLAHElgHv~ 299 (475)
-++.++..+++ ..=|...+.| +|++-..--+.. +.+. |.+||+||+.
T Consensus 40 a~~~~~~~~~~~~sya~~~g~G~G~I~l~~~~~qgy---~~~R-IaaHE~GHiL 89 (132)
T PF02031_consen 40 ADIRYYEGNDPRGSYASTDGLGSGYIFLDYQQNQGY---NSTR-IAAHELGHIL 89 (132)
T ss_dssp -SEEEEEE--TT--EEEE-SSS-EEEEEEHHHHHHS----HHH-HHHHHHHHHH
T ss_pred CcEEEEEecCCCCcccccCCCCcEEEEechHHhhCC---ccce-eeeehhcccc
Confidence 34555544333 3445555544 577765432222 2344 9999999984
No 37
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=80.62 E-value=5.1 Score=38.24 Aligned_cols=20 Identities=20% Similarity=0.307 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHhhhcccHHH
Q psy768 286 ELATVLSHELSHTLLKHVAE 305 (475)
Q Consensus 286 ELaaVLAHElgHv~~rH~~~ 305 (475)
.=.+|-|||.||....+...
T Consensus 91 a~~aVAAHEVGHAiQd~~~Y 110 (226)
T COG2738 91 AAIAVAAHEVGHAIQDQEDY 110 (226)
T ss_pred HHHHHHHHHhhHHHhhhccc
Confidence 34679999999999877653
No 38
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=80.10 E-value=0.43 Score=51.91 Aligned_cols=46 Identities=22% Similarity=0.372 Sum_probs=32.0
Q ss_pred CCcceeEeCCCeEEEe-hhhhhhc---------CCHHHHHHHHHHHHHhhhcccHH
Q psy768 259 PLINAFVFPDGRIFMF-TGMFQLC---------QTDDELATVLSHELSHTLLKHVA 304 (475)
Q Consensus 259 ~~~NAFalpgG~IvV~-~GLL~~l---------~~edELaaVLAHElgHv~~rH~~ 304 (475)
-.+|||--|+...+|+ -++|..- .|--.+-+||||||||.--....
T Consensus 449 q~VNAYYnp~~N~IVFPAAILQ~PFfd~ea~~a~NYGgIGaVIgHEI~HgFDdqGa 504 (654)
T COG3590 449 QTVNAYYNPQKNEIVFPAAILQAPFFDPEADSAANYGGIGAVIGHEIGHGFDDQGA 504 (654)
T ss_pred HHhhhhcCCCCceEeeeHHhcCCCCCCCCcchhhcccCccceehhhhcccccCCcc
Confidence 3799999998764444 5555421 13346889999999998766554
No 39
>PF08325 WLM: WLM domain; InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=79.47 E-value=2.4 Score=40.33 Aligned_cols=73 Identities=8% Similarity=0.084 Sum_probs=42.1
Q ss_pred HHHHHHHHHHHHHhccccccCCCcceeEEEEecCCccee-EeC-----CCeEEE--e-hhhhhhcCCHHHHHHHHHHHHH
Q psy768 226 YKRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAF-VFP-----DGRIFM--F-TGMFQLCQTDDELATVLSHELS 296 (475)
Q Consensus 226 ~~~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAF-alp-----gG~IvV--~-~GLL~~l~~edELaaVLAHElg 296 (475)
...|++++..+.. -+....|+|-.+..-.+|.- .++ |..|.| - +.. ...-+-+.|..|+-|||+
T Consensus 19 ~~lL~rlA~~v~p------IM~~~~~~V~~L~E~~P~~~~llG~N~N~G~~I~lrLR~~~~-~~fl~~~~i~~t~lHELa 91 (186)
T PF08325_consen 19 LELLERLAADVKP------IMRKHGWRVGSLEEFYPNGERLLGLNVNKGEKICLRLRTPDD-GGFLPYETILGTMLHELA 91 (186)
T ss_pred HHHHHHHHHHHHH------HHHHcCcccCeeeccCCCCCCCcceecCCCcEEEEEeCCCCC-CCEeeHHHHHHHHHHHHH
Confidence 3466666666433 23346787777665332221 111 223333 2 222 444456899999999999
Q ss_pred hhhcccHHH
Q psy768 297 HTLLKHVAE 305 (475)
Q Consensus 297 Hv~~rH~~~ 305 (475)
|++++.+-.
T Consensus 92 H~~~~~H~~ 100 (186)
T PF08325_consen 92 HNVHGPHDD 100 (186)
T ss_pred hcccCCccH
Confidence 999775543
No 40
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=79.28 E-value=2.5 Score=35.76 Aligned_cols=22 Identities=36% Similarity=0.551 Sum_probs=18.1
Q ss_pred CCHHHHHHHHHHHHHhhhcccH
Q psy768 282 QTDDELATVLSHELSHTLLKHV 303 (475)
Q Consensus 282 ~~edELaaVLAHElgHv~~rH~ 303 (475)
.+++.+..||+||++|......
T Consensus 20 ~~~~~~~~~l~HE~~H~~~~~~ 41 (128)
T PF13485_consen 20 SDEDWLDRVLAHELAHQWFGNY 41 (128)
T ss_pred CCHHHHHHHHHHHHHHHHHHHH
Confidence 4677889999999999976554
No 41
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=79.05 E-value=16 Score=37.70 Aligned_cols=15 Identities=40% Similarity=0.603 Sum_probs=12.9
Q ss_pred HHHHHHHHHHHHhhh
Q psy768 285 DELATVLSHELSHTL 299 (475)
Q Consensus 285 dELaaVLAHElgHv~ 299 (475)
-.+-+++||||||..
T Consensus 194 ~~~P~T~~HElAHq~ 208 (318)
T PF12725_consen 194 YSLPFTICHELAHQL 208 (318)
T ss_pred ccccHHHHHHHHHHh
Confidence 467899999999984
No 42
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=78.45 E-value=2.3 Score=38.56 Aligned_cols=51 Identities=20% Similarity=0.245 Sum_probs=28.9
Q ss_pred ceeEEEEe----cCCcceeEe----C-CCeEEEehhhhhhc---CCHHHHHHHHHHHHHhhhc
Q psy768 250 KYPITIID----DPLINAFVF----P-DGRIFMFTGMFQLC---QTDDELATVLSHELSHTLL 300 (475)
Q Consensus 250 ~~~v~Vv~----~~~~NAFal----p-gG~IvV~~GLL~~l---~~edELaaVLAHElgHv~~ 300 (475)
+..+..++ +...+++.- | .|.|.+.+.-+..- .+.+.+.+|+.|||||.+.
T Consensus 45 di~i~~~~~~~~~~~~~~~~~~~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLG 107 (165)
T cd04268 45 DIRYSVIRWIPYNDGTWSYGPSQVDPLTGEILLARVYLYSSFVEYSGARLRNTAEHELGHALG 107 (165)
T ss_pred CceEEEEEeecCCCCccccCCccCCCCCccEEeeEEEEchhHHHHHHHHHHHHHHHHHHHHhc
Confidence 34555554 234455542 2 35566655443221 0235789999999999864
No 43
>PF13402 M60-like: Peptidase M60-like family; PDB: 4FCA_A.
Probab=75.47 E-value=6.7 Score=39.53 Aligned_cols=49 Identities=16% Similarity=0.080 Sum_probs=28.7
Q ss_pred EEEecCCcc-eeEe-CCCeEEEehhhhhhcCCHHHH---HHHHHHHHHhhhccc
Q psy768 254 TIIDDPLIN-AFVF-PDGRIFMFTGMFQLCQTDDEL---ATVLSHELSHTLLKH 302 (475)
Q Consensus 254 ~Vv~~~~~N-AFal-pgG~IvV~~GLL~~l~~edEL---aaVLAHElgHv~~rH 302 (475)
.++.+...+ +|+. .|+.|....+-.+.+-+.+.+ .--+.||+||....=
T Consensus 181 r~v~~v~~~~g~m~a~g~~i~~~~~~~~~~l~~~~~~~~~WG~~HE~GH~~Q~~ 234 (307)
T PF13402_consen 181 RFVFDVQISAGYMHASGYPIGFPPNWMNELLNPNPLRKGGWGPWHELGHNHQQG 234 (307)
T ss_dssp EEEEETT----SEEEETTEEEEETT--HHHH-HHHHHHH-HHHHHHHHHHH-BG
T ss_pred eEEEeccccccceeecCCcEEeeCcHHhcccCHhHcCCCCeeehhhhhhhcCcc
Confidence 555555444 5555 566777777766666566666 668999999986643
No 44
>PRK04860 hypothetical protein; Provisional
Probab=75.47 E-value=2.7 Score=38.95 Aligned_cols=33 Identities=21% Similarity=0.254 Sum_probs=27.0
Q ss_pred CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcc
Q psy768 268 DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLK 301 (475)
Q Consensus 268 gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~r 301 (475)
.+.|-++..++..- ++++|..||.|||+|+...
T Consensus 45 ~~~I~~Np~ll~~~-~~~~l~~~v~HEl~H~~~~ 77 (160)
T PRK04860 45 SNEIRLNPVLLLEN-QQAFIDEVVPHELAHLLVY 77 (160)
T ss_pred cCCeeeCHHHHhhC-cHHHHHhHHHHHHHHHHHH
Confidence 45688888887765 6789999999999999553
No 45
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=72.62 E-value=1.5 Score=40.29 Aligned_cols=57 Identities=21% Similarity=0.265 Sum_probs=28.0
Q ss_pred cceeEEEEecC------CcceeE-eCCC--------eEEEehhhhhhcCCHHHHHHHHHHHHHhhh-cccHHH
Q psy768 249 FKYPITIIDDP------LINAFV-FPDG--------RIFMFTGMFQLCQTDDELATVLSHELSHTL-LKHVAE 305 (475)
Q Consensus 249 ~~~~v~Vv~~~------~~NAFa-lpgG--------~IvV~~GLL~~l~~edELaaVLAHElgHv~-~rH~~~ 305 (475)
.-..|+|++.. ...-+| +|.. .|++....+....+.....-+|.||+||+. +.|...
T Consensus 16 ~ylNIwv~~~~~~~g~~~~~G~A~~P~~~~~~~~~~~vv~~~~~l~~~~~~~~~g~TltHEvGH~LGL~HtF~ 88 (154)
T PF05572_consen 16 KYLNIWVVNDLYDGGGTSILGYAYFPWSGMSDNGTDGVVINYRYLGGNNSQYNFGKTLTHEVGHWLGLYHTFG 88 (154)
T ss_dssp T-EEEEEES-EE-TTS-EESEEE--TTS-GGG-SEEEEGGGSSSSTT--TTS-SSHHHHHHHHHHTT---TT-
T ss_pred CEEEEEEecccccCCCCCCCeEEeCCCCCCCCCCCCEEEEcCcccCCCCCccccccchhhhhhhhhccccccc
Confidence 34578988732 223344 3644 233332333322233456889999999997 456654
No 46
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=72.18 E-value=2.3 Score=38.00 Aligned_cols=35 Identities=29% Similarity=0.429 Sum_probs=21.2
Q ss_pred CeEEEehhhhhhc-CCHHHHHHHHHHHHHhhh-cccH
Q psy768 269 GRIFMFTGMFQLC-QTDDELATVLSHELSHTL-LKHV 303 (475)
Q Consensus 269 G~IvV~~GLL~~l-~~edELaaVLAHElgHv~-~rH~ 303 (475)
|.|.+...---.. .+...+..|+.|||||.. ++|.
T Consensus 86 ~~i~~~~~~~~~~~~~~~~~~~v~~HEiGHaLGL~H~ 122 (154)
T PF00413_consen 86 GDIHFNDDESWTIDDSGNDLQSVAIHEIGHALGLDHS 122 (154)
T ss_dssp TEEEEETTSHEESSSSSEEHHHHHHHHHHHHTTBESS
T ss_pred ccccccccccchhhhhhhhhhhhhhhccccccCcCcC
Confidence 4555554331111 234578999999999984 4454
No 47
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=71.51 E-value=8.3 Score=38.58 Aligned_cols=37 Identities=27% Similarity=0.288 Sum_probs=25.5
Q ss_pred EEehhhhhhc--CCHHHHHHHHHHHHHhhh---cccHHHHHH
Q psy768 272 FMFTGMFQLC--QTDDELATVLSHELSHTL---LKHVAEKLS 308 (475)
Q Consensus 272 vV~~GLL~~l--~~edELaaVLAHElgHv~---~rH~~~~~~ 308 (475)
+..|.||..+ .+|.+|+..|-||++|.+ .++.+-+-+
T Consensus 180 wFdDPlLstmlr~dd~~lA~LIFHELAHQk~Y~~~DtAFNEs 221 (376)
T COG4324 180 WFDDPLLSTMLRQDDTYLASLIFHELAHQKIYVNNDTAFNES 221 (376)
T ss_pred ccccHHHHHHhcCChHHHHHHHHHHHhhheEeecCcchHhHH
Confidence 3455666655 467899999999999964 556654433
No 48
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=69.84 E-value=2.8 Score=38.05 Aligned_cols=16 Identities=38% Similarity=0.669 Sum_probs=14.0
Q ss_pred HHHHHHHHHHHHHhhh
Q psy768 284 DDELATVLSHELSHTL 299 (475)
Q Consensus 284 edELaaVLAHElgHv~ 299 (475)
...+..|+.|||||++
T Consensus 101 ~~~~~~~~~HEiGHaL 116 (156)
T cd04279 101 AENLQAIALHELGHAL 116 (156)
T ss_pred chHHHHHHHHHhhhhh
Confidence 3579999999999986
No 49
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=68.02 E-value=3.2 Score=40.99 Aligned_cols=31 Identities=26% Similarity=0.364 Sum_probs=20.3
Q ss_pred EEEehhhhhhcC-----CHHHHHHHHHHHHHhhhcc
Q psy768 271 IFMFTGMFQLCQ-----TDDELATVLSHELSHTLLK 301 (475)
Q Consensus 271 IvV~~GLL~~l~-----~edELaaVLAHElgHv~~r 301 (475)
.+..+||....+ .....+.++||||||...=
T Consensus 146 ~~~n~gl~t~~~~~~~~~~~~~a~t~AHElGHnlGm 181 (244)
T cd04270 146 KYLNTGLTTTVNYGKRVPTKESDLVTAHELGHNFGS 181 (244)
T ss_pred eeeecceEeeeccCCccchhHHHHHHHHHHHHhcCC
Confidence 455566654211 1345789999999998754
No 50
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=65.80 E-value=6.9 Score=34.63 Aligned_cols=44 Identities=25% Similarity=0.262 Sum_probs=25.7
Q ss_pred ceeEEEEecC--Cc-ceeE-eCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhh
Q psy768 250 KYPITIIDDP--LI-NAFV-FPDGRIFMFTGMFQLCQTDDELATVLSHELSHTL 299 (475)
Q Consensus 250 ~~~v~Vv~~~--~~-NAFa-lpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~ 299 (475)
+..|.+.+.+ .. -|++ .|+|.+-++.+ ..+. -.+|+.|||||.+
T Consensus 51 d~~I~f~~~~~~g~~~a~~g~~~g~~~~~~~--~~~~----~~~~~~HEigHaL 98 (140)
T smart00235 51 DIYISFGKGDGSGCTLSHAGRPGGDQHFSLG--NGCI----NTGVAAHELGHAL 98 (140)
T ss_pred CeEEEEEECCCCCcceeeeecCCCceEEEcc--CCcC----CcccHHHHHHHHh
Confidence 3455555554 33 2665 35777666541 1122 2459999999986
No 51
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=65.58 E-value=6.9 Score=37.06 Aligned_cols=44 Identities=27% Similarity=0.369 Sum_probs=30.0
Q ss_pred ceeEeCC-CeEEEehhhhhhc--C-------CHHHHHHHHHHHHHhhhcccHHH
Q psy768 262 NAFVFPD-GRIFMFTGMFQLC--Q-------TDDELATVLSHELSHTLLKHVAE 305 (475)
Q Consensus 262 NAFalpg-G~IvV~~GLL~~l--~-------~edELaaVLAHElgHv~~rH~~~ 305 (475)
|||=.|. ..|+|..|+|..- . +-..|-+||||||+|..-.+...
T Consensus 1 na~Y~~~~N~i~ip~~~l~~P~f~~~~p~~~~yg~lG~ilahel~hafd~~g~~ 54 (206)
T PF01431_consen 1 NAYYSPRFNSIVIPAGILQPPFFDPNYPPALNYGGLGFILAHELMHAFDPEGIN 54 (206)
T ss_dssp --EEETTTTEEEEEGGGSSTTT--TTS-HHHHHHTHHHHHHHHHHHCTSTTGGG
T ss_pred CCCCCcccCEEEecHHHhCCccCCCCCCHHHHHHHHHHHHHHHHHHHHHHhHhh
Confidence 6666654 5688888887642 1 23568899999999998776653
No 52
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=63.58 E-value=13 Score=41.64 Aligned_cols=49 Identities=14% Similarity=0.164 Sum_probs=33.4
Q ss_pred eEEEEecCCcceeEeC---CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHH
Q psy768 252 PITIIDDPLINAFVFP---DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA 304 (475)
Q Consensus 252 ~v~Vv~~~~~NAFalp---gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~ 304 (475)
.++-.+....-|||.+ ++..+|.-.....+.| ..+|+||+||..|....
T Consensus 346 D~~~~~gKrsGaYs~~~~~~~~p~IlmN~~gt~~d----V~TLaHElGHs~Hs~~s 397 (598)
T COG1164 346 DVYPRKGKRSGAYSIGFYKGDHPFILMNYDGTLRD----VFTLAHELGHSVHSYFS 397 (598)
T ss_pred eccCCCCCCCCcccCCCCCCCCCeEEEeCCCchhH----HHHHHHHccHHHHHHHH
Confidence 4555556688899975 3556665544444433 57999999999887554
No 53
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=62.97 E-value=6.9 Score=35.86 Aligned_cols=30 Identities=27% Similarity=0.380 Sum_probs=24.8
Q ss_pred eEEEehhhhhhcCCHHHHHHHHHHHHHhhhc
Q psy768 270 RIFMFTGMFQLCQTDDELATVLSHELSHTLL 300 (475)
Q Consensus 270 ~IvV~~GLL~~l~~edELaaVLAHElgHv~~ 300 (475)
.|=++--|+..- .+|.+..|+.||++|+..
T Consensus 45 ~I~lNP~ll~en-~~~f~~~vV~HELaHl~l 74 (156)
T COG3091 45 EIRLNPKLLEEN-GEDFIEQVVPHELAHLHL 74 (156)
T ss_pred ccccCHHHHHHc-cHHHHHHHHHHHHHHHHH
Confidence 677778887775 579999999999999843
No 54
>PF10023 DUF2265: Predicted aminopeptidase (DUF2265); InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=62.16 E-value=6.1 Score=40.89 Aligned_cols=48 Identities=23% Similarity=0.199 Sum_probs=33.0
Q ss_pred CcceeEeCCCeEEEehhhhhhc--CCHHHHHHHHHHHHHhhh---cccHHHHHHHH
Q psy768 260 LINAFVFPDGRIFMFTGMFQLC--QTDDELATVLSHELSHTL---LKHVAEKLSNK 310 (475)
Q Consensus 260 ~~NAFalpgG~IvV~~GLL~~l--~~edELaaVLAHElgHv~---~rH~~~~~~~~ 310 (475)
.+-||++-| +..|.|+..+ -++.+||.+|-|||+|-. .+|+.-+-+..
T Consensus 139 gV~AYSTLG---wF~DPlLSt~l~~~~~~LA~LIfHELaHq~~Yv~~dt~FNEsfA 191 (337)
T PF10023_consen 139 GVPAYSTLG---WFDDPLLSTMLRYPDGELARLIFHELAHQTLYVKGDTAFNESFA 191 (337)
T ss_pred ccccccccc---ccCCcccccccCCCchHHHHHHHHHHhhceeecCCCchhhHHHH
Confidence 566777654 4467777665 367899999999999964 56666444333
No 55
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=62.10 E-value=5.7 Score=38.09 Aligned_cols=47 Identities=30% Similarity=0.426 Sum_probs=31.5
Q ss_pred EEEEecC-CcceeEe----C------CCeEEEehhhhhhcCCH------HHHHHHHHHHHHhhh
Q psy768 253 ITIIDDP-LINAFVF----P------DGRIFMFTGMFQLCQTD------DELATVLSHELSHTL 299 (475)
Q Consensus 253 v~Vv~~~-~~NAFal----p------gG~IvV~~GLL~~l~~e------dELaaVLAHElgHv~ 299 (475)
|..++++ ..+|+.- | +|.|.++.+.+....+. +-+..+++||+||..
T Consensus 65 ir~~~~~~~~~~~gps~~dPrTGeIl~a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~l 128 (197)
T cd04276 65 IRWIHSPNGGWAYGPSVVDPRTGEILKADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTL 128 (197)
T ss_pred EEEEecCCCcceecccccCCCCCCeEEEEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHh
Confidence 4455665 5566642 2 23578988887664322 458899999999985
No 56
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=62.07 E-value=4.7 Score=37.84 Aligned_cols=16 Identities=38% Similarity=0.459 Sum_probs=14.0
Q ss_pred HHHHHHHHHHHHHhhh
Q psy768 284 DDELATVLSHELSHTL 299 (475)
Q Consensus 284 edELaaVLAHElgHv~ 299 (475)
....+.++||||||..
T Consensus 128 ~~~~a~~~AHElGH~l 143 (194)
T cd04269 128 LLLFAVTMAHELGHNL 143 (194)
T ss_pred hHHHHHHHHHHHHhhc
Confidence 4578999999999986
No 57
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=60.87 E-value=13 Score=34.13 Aligned_cols=49 Identities=22% Similarity=0.261 Sum_probs=26.3
Q ss_pred cceeEEEE-ecCCcceeEeCCCeEEEehhhhhhcCCHHHHHH---HHHHHHHhhhcc
Q psy768 249 FKYPITII-DDPLINAFVFPDGRIFMFTGMFQLCQTDDELAT---VLSHELSHTLLK 301 (475)
Q Consensus 249 ~~~~v~Vv-~~~~~NAFalpgG~IvV~~GLL~~l~~edELaa---VLAHElgHv~~r 301 (475)
.++...|- ...-.|||=- |..++.-+|-=..+. .+++ |+||||+|-+-.
T Consensus 97 ~~~~~~Vhyg~~~~NAfW~-g~~m~yGdG~~~~f~---~~~~~lDVvaHEltHGVte 149 (150)
T PF01447_consen 97 MPIISRVHYGKNYNNAFWN-GSQMVYGDGDGQIFK---PFASSLDVVAHELTHGVTE 149 (150)
T ss_dssp S-EEEEESESSSTT-EEE--SSSEEEE---SSSBS----GGG-HHHHHHHHHHHHHH
T ss_pred cEEEEEEeCCCCccCcccc-CCEEEEECCCCcccc---cCccccceeeecccccccc
Confidence 44444442 3447899984 457888877643332 2333 999999997643
No 58
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=58.94 E-value=5.4 Score=37.40 Aligned_cols=17 Identities=29% Similarity=0.344 Sum_probs=14.2
Q ss_pred HHHHHHHHHHHHHhhhc
Q psy768 284 DDELATVLSHELSHTLL 300 (475)
Q Consensus 284 edELaaVLAHElgHv~~ 300 (475)
...-.-|+||||||...
T Consensus 139 ~~~~~~~~AHEiGH~lG 155 (196)
T PF13688_consen 139 TYNGAITFAHEIGHNLG 155 (196)
T ss_dssp HHHHHHHHHHHHHHHTT
T ss_pred CCceehhhHHhHHHhcC
Confidence 46778899999999864
No 59
>PRK06926 flagellar motor protein MotP; Reviewed
Probab=58.47 E-value=2.1e+02 Score=28.87 Aligned_cols=35 Identities=11% Similarity=0.264 Sum_probs=26.7
Q ss_pred EEehhhhhhc--CCHHHHHHHHHHHHHhhhcccHHHH
Q psy768 272 FMFTGMFQLC--QTDDELATVLSHELSHTLLKHVAEK 306 (475)
Q Consensus 272 vV~~GLL~~l--~~edELaaVLAHElgHv~~rH~~~~ 306 (475)
|+.+|+--.. .+++++..+|-+|+.....+|....
T Consensus 112 Fl~~gl~lvvDG~~~~~i~~iLe~ei~~~~~r~~~~~ 148 (271)
T PRK06926 112 FIKKGLLLAIDGWEPETIRDIMMAEIAAMEERHRKGR 148 (271)
T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHhHH
Confidence 5666663332 4789999999999999999987643
No 60
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=58.41 E-value=7.7 Score=36.59 Aligned_cols=17 Identities=35% Similarity=0.661 Sum_probs=14.4
Q ss_pred CHHHHHHHHHHHHHhhh
Q psy768 283 TDDELATVLSHELSHTL 299 (475)
Q Consensus 283 ~edELaaVLAHElgHv~ 299 (475)
+...++.++||||||..
T Consensus 127 ~~~~~a~~~AHelGH~l 143 (199)
T PF01421_consen 127 SGLSFAVIIAHELGHNL 143 (199)
T ss_dssp SHHHHHHHHHHHHHHHT
T ss_pred hhHHHHHHHHHHHHHhc
Confidence 45688999999999974
No 61
>PF14891 Peptidase_M91: Effector protein
Probab=57.94 E-value=5.7 Score=37.07 Aligned_cols=15 Identities=40% Similarity=0.412 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHHhhh
Q psy768 285 DELATVLSHELSHTL 299 (475)
Q Consensus 285 dELaaVLAHElgHv~ 299 (475)
..=..||+|||+|..
T Consensus 101 ~~p~v~L~HEL~HA~ 115 (174)
T PF14891_consen 101 RPPFVVLYHELIHAY 115 (174)
T ss_pred HHHHHHHHHHHHHHH
Confidence 455789999999983
No 62
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=57.47 E-value=5.4 Score=34.37 Aligned_cols=12 Identities=33% Similarity=0.736 Sum_probs=10.4
Q ss_pred HHHHHHHHHhhh
Q psy768 288 ATVLSHELSHTL 299 (475)
Q Consensus 288 aaVLAHElgHv~ 299 (475)
..+++|||||..
T Consensus 108 ~~~~~HEiGH~l 119 (124)
T PF13582_consen 108 VDTFAHEIGHNL 119 (124)
T ss_dssp TTHHHHHHHHHT
T ss_pred ceEeeehhhHhc
Confidence 389999999975
No 63
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=56.93 E-value=4.3 Score=38.04 Aligned_cols=15 Identities=40% Similarity=0.476 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHHhhh
Q psy768 285 DELATVLSHELSHTL 299 (475)
Q Consensus 285 dELaaVLAHElgHv~ 299 (475)
-..+.++|||+||..
T Consensus 131 ~~~~~~~aHElGH~l 145 (192)
T cd04267 131 LLTALTMAHELGHNL 145 (192)
T ss_pred eeehhhhhhhHHhhc
Confidence 467889999999986
No 64
>COG1291 MotA Flagellar motor component [Cell motility and secretion]
Probab=55.57 E-value=2.1e+02 Score=28.76 Aligned_cols=110 Identities=15% Similarity=0.071 Sum_probs=55.9
Q ss_pred CHHHHHHHHHHHHHhhhcccHHHHHHHHHHH------HHHHHHHHHHHH-HhccchHHHHHHHHHHHHHHHHhhcchhhh
Q psy768 283 TDDELATVLSHELSHTLLKHVAEKLSNKTFL------EILYIVPLMIIW-FLLPDLGAIVTQVLLDDLKSIIFELPFERE 355 (475)
Q Consensus 283 ~edELaaVLAHElgHv~~rH~~~~~~~~~l~------~~l~~~~~~~i~-~~~~~~~~~l~~~~~~~l~~~l~~lp~~~~ 355 (475)
+.+++.+++=-|+.+...+|.........+. .+++.+++.+.. ....+ -..+|..+...+...+.
T Consensus 121 ~~~~I~~~me~Ei~~~ee~~~~~a~~~~~~g~~aPa~GivgaV~GlI~~l~~l~~-p~~LG~~iA~Alv~T~~------- 192 (266)
T COG1291 121 DPEEIEALMEEEIETMEERHEKPAHAFTTAGDYAPAFGIVGAVMGLIHALGNLDD-PAELGALIAAALVGTLY------- 192 (266)
T ss_pred CHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhCchhhHHHHHHHHHHHHHcCCC-HHHHHHHHHHHHHHHHH-------
Confidence 5899999999999999999877544333322 223332222111 11112 23344443333322210
Q ss_pred hhhhhhhhhhhccccchhhhHHHHHHHH--------HHHHHcCCCchhHHHHHHHH
Q psy768 356 METEADESIIFELPFEREMETEADEVGL--------KLMARACYDVRVAPLFWQKM 403 (475)
Q Consensus 356 l~~~~~~~~~~~~~~SR~~E~EAD~~gl--------~ll~~AGydp~a~~~~l~kl 403 (475)
+...+..+..|+.-+.+...|.--. -+....|.+|+-+.+.+.++
T Consensus 193 ---Gi~~ay~~~~P~a~kLk~~~~~e~~~~~~i~e~ll~i~~G~nP~i~~e~l~~~ 245 (266)
T COG1291 193 ---GIFLAYGLFGPLANKLKQKSDEEVKLKEIIIEGLLAIQNGENPRILEEKLRKF 245 (266)
T ss_pred ---HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcHHHHHHHHhh
Confidence 1111112445555555555554211 13556789998777766544
No 65
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=55.23 E-value=6 Score=35.93 Aligned_cols=32 Identities=25% Similarity=0.293 Sum_probs=20.4
Q ss_pred CCeEEEehhhhhhcC---CHHHHHHHHHHHHHhhh
Q psy768 268 DGRIFMFTGMFQLCQ---TDDELATVLSHELSHTL 299 (475)
Q Consensus 268 gG~IvV~~GLL~~l~---~edELaaVLAHElgHv~ 299 (475)
.|.|.+.+.-.-... ....+..|+.|||||..
T Consensus 85 ~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaL 119 (157)
T cd04278 85 GGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHAL 119 (157)
T ss_pred ceeEEECCCcccccCCCCccchHHHHHHHHhcccc
Confidence 566776654321111 22469999999999984
No 66
>PF09768 Peptidase_M76: Peptidase M76 family; InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=54.61 E-value=38 Score=31.87 Aligned_cols=36 Identities=22% Similarity=0.316 Sum_probs=24.6
Q ss_pred cceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhh
Q psy768 261 INAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTL 299 (475)
Q Consensus 261 ~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~ 299 (475)
.-.|.-+...|+|-. +.+.+.+++.-+|+|||-|..
T Consensus 48 ~Ggf~p~~~~I~lC~---N~~~~~~~l~~~l~HELIHay 83 (173)
T PF09768_consen 48 SGGFDPSKKGIVLCQ---NRIRSQGHLEDTLTHELIHAY 83 (173)
T ss_pred cCCccCCCCCEEEee---CCCCCHHHHHHHHHHHHHHHH
Confidence 334432244566654 334788999999999999984
No 67
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=54.29 E-value=5.7 Score=35.94 Aligned_cols=15 Identities=47% Similarity=0.680 Sum_probs=13.4
Q ss_pred HHHHHHHHHHHHhhh
Q psy768 285 DELATVLSHELSHTL 299 (475)
Q Consensus 285 dELaaVLAHElgHv~ 299 (475)
.....+++|||||..
T Consensus 94 ~~~~~~~~HElGH~L 108 (167)
T cd00203 94 KEGAQTIAHELGHAL 108 (167)
T ss_pred ccchhhHHHHHHHHh
Confidence 478999999999986
No 68
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=53.95 E-value=9.9 Score=35.49 Aligned_cols=36 Identities=19% Similarity=0.226 Sum_probs=23.0
Q ss_pred CeEEEehhhhhhc-CCHHHHHHHHHHHHHhhhc-ccHH
Q psy768 269 GRIFMFTGMFQLC-QTDDELATVLSHELSHTLL-KHVA 304 (475)
Q Consensus 269 G~IvV~~GLL~~l-~~edELaaVLAHElgHv~~-rH~~ 304 (475)
|.|++........ ..-...-.|+.|||||.+- +|.-
T Consensus 94 g~i~~~~~~~~~~~~~g~~~~~t~~HEiGHaLGL~H~~ 131 (186)
T cd04277 94 GDIWFNSSYDTNSDSPGSYGYQTIIHEIGHALGLEHPG 131 (186)
T ss_pred ceeEEecCcccccCCCChhhHHHHHHHHHHHhcCCCCC
Confidence 4677776654432 1124678999999999863 4443
No 69
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=53.77 E-value=6.6 Score=36.57 Aligned_cols=15 Identities=33% Similarity=0.381 Sum_probs=11.2
Q ss_pred HHHHHHHHHHhhhcc
Q psy768 287 LATVLSHELSHTLLK 301 (475)
Q Consensus 287 LaaVLAHElgHv~~r 301 (475)
-.-++||||||..--
T Consensus 111 ~~~~~aHElGH~lGa 125 (173)
T PF13574_consen 111 GIDTFAHELGHQLGA 125 (173)
T ss_dssp HHHHHHHHHHHHHT-
T ss_pred eeeeehhhhHhhcCC
Confidence 355799999998654
No 70
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=53.11 E-value=7.6 Score=37.42 Aligned_cols=14 Identities=36% Similarity=0.449 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHhhh
Q psy768 286 ELATVLSHELSHTL 299 (475)
Q Consensus 286 ELaaVLAHElgHv~ 299 (475)
..+.++||||||..
T Consensus 144 ~~~~~~AHElGH~l 157 (220)
T cd04272 144 YGVYTMTHELAHLL 157 (220)
T ss_pred ccHHHHHHHHHHHh
Confidence 45899999999987
No 71
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=51.95 E-value=12 Score=41.82 Aligned_cols=37 Identities=22% Similarity=0.332 Sum_probs=25.1
Q ss_pred eeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccH
Q psy768 263 AFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHV 303 (475)
Q Consensus 263 AFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~ 303 (475)
|.--|| .+|.+..++.. +.+++.|+||||+|---+..
T Consensus 259 gMEN~~-ltf~~~~ll~~---d~s~~~viaHElAHqWfGNl 295 (601)
T TIGR02411 259 GMENPN-LTFATPTLIAG---DRSNVDVIAHELAHSWSGNL 295 (601)
T ss_pred cccccc-ceeeccccccC---ChhhhhhHHHHHHhhccCce
Confidence 333343 56777777632 34577899999999877744
No 72
>KOG3624|consensus
Probab=50.41 E-value=12 Score=42.58 Aligned_cols=46 Identities=26% Similarity=0.402 Sum_probs=34.5
Q ss_pred CcceeEeCC-CeEEEehhhhhh----c-----CCHHHHHHHHHHHHHhhhcccHHH
Q psy768 260 LINAFVFPD-GRIFMFTGMFQL----C-----QTDDELATVLSHELSHTLLKHVAE 305 (475)
Q Consensus 260 ~~NAFalpg-G~IvV~~GLL~~----l-----~~edELaaVLAHElgHv~~rH~~~ 305 (475)
.+|||-.|. ..|+|--|++.. . -+-.-+.+||||||+|.--.++..
T Consensus 481 ~~na~Y~~~~N~i~~pa~ilq~P~f~~~~P~~~nyg~iG~vigHEl~H~FD~~G~~ 536 (687)
T KOG3624|consen 481 QVNAFYSPEKNEIVFPAGLLQPPFFDLSYPDYLNYGGIGFVIGHELTHGFDDQGRQ 536 (687)
T ss_pred eeeccccCCCceEEEehhcccCCCCCcccchhhhhHHHHHHHHHHHhhcccccccc
Confidence 789988775 468888888875 1 134568899999999987666543
No 73
>KOG1047|consensus
Probab=48.79 E-value=11 Score=41.31 Aligned_cols=32 Identities=22% Similarity=0.416 Sum_probs=24.9
Q ss_pred eEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHH
Q psy768 270 RIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA 304 (475)
Q Consensus 270 ~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~ 304 (475)
..||+..||.. +.-|+-||||||+|---+...
T Consensus 274 ltF~TpTllaG---Drsl~~vIaHEIAHSWtGNlV 305 (613)
T KOG1047|consen 274 LTFVTPTLLAG---DRSLVDVIAHEIAHSWTGNLV 305 (613)
T ss_pred eeeecchhhcC---CcchhhHHHHHhhhhhccccc
Confidence 38899999755 356899999999997666443
No 74
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=48.47 E-value=6.1 Score=38.63 Aligned_cols=12 Identities=42% Similarity=0.706 Sum_probs=10.6
Q ss_pred HHHHHHHHhhhc
Q psy768 289 TVLSHELSHTLL 300 (475)
Q Consensus 289 aVLAHElgHv~~ 300 (475)
.++||||||...
T Consensus 147 ~t~AHElGHnLG 158 (228)
T cd04271 147 QVFAHEIGHTFG 158 (228)
T ss_pred eehhhhhhhhcC
Confidence 699999999974
No 75
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=48.42 E-value=7.5 Score=43.20 Aligned_cols=49 Identities=14% Similarity=0.175 Sum_probs=28.4
Q ss_pred eEEEEecCCcceeEeCCC--eEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHH
Q psy768 252 PITIIDDPLINAFVFPDG--RIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA 304 (475)
Q Consensus 252 ~v~Vv~~~~~NAFalpgG--~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~ 304 (475)
.++-.++....||+.+.+ .-+| |++.-++-++ ..+|+||+||..|....
T Consensus 345 D~~~r~gK~~Ga~~~~~~~~~p~i---l~N~~~~~~d-v~TLaHElGHa~H~~~~ 395 (591)
T TIGR00181 345 DYAENKGKRSGAYSIGGYKVKPYI---LMNWDGTLNS-VFTLAHELGHSMHSYFS 395 (591)
T ss_pred eecCCCCCCCCcccCCCCCCCCeE---EEecCCCcch-HHHHHHHhhhHHHHHHH
Confidence 444445557789997532 2222 1122222222 56899999999877544
No 76
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=48.26 E-value=11 Score=39.63 Aligned_cols=49 Identities=18% Similarity=0.230 Sum_probs=28.8
Q ss_pred eEEEEecCCcceeEeC---CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHH
Q psy768 252 PITIIDDPLINAFVFP---DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA 304 (475)
Q Consensus 252 ~v~Vv~~~~~NAFalp---gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~ 304 (475)
.++..++...-||+.+ ++..+|....-. +-++ ...|+||+||..|....
T Consensus 188 D~~~r~gK~~gaf~~~~~~~~~p~i~~n~~~---~~~~-v~tl~HE~GHa~h~~~~ 239 (427)
T cd06459 188 DVEPRKGKRSGAYCTGLPPGKHPFILMNFNG---TLDD-VFTLAHELGHAFHSYLS 239 (427)
T ss_pred eccCCCCCCCCeecCCCCCCCCCeEEecCCC---Chhh-HHHHHHHhhHHHHHHHH
Confidence 3443344456788865 334555433322 3334 56799999998776544
No 77
>PF01433 Peptidase_M1: Peptidase family M1 This is family M1 in the peptidase classification.; InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=45.94 E-value=27 Score=36.07 Aligned_cols=52 Identities=19% Similarity=0.236 Sum_probs=28.7
Q ss_pred eeEEEEecCCcceeEeCCCeEEEehhh-hhh--cC---CHHHHHHHHHHHHHhhhcccH
Q psy768 251 YPITIIDDPLINAFVFPDGRIFMFTGM-FQL--CQ---TDDELATVLSHELSHTLLKHV 303 (475)
Q Consensus 251 ~~v~Vv~~~~~NAFalpgG~IvV~~GL-L~~--l~---~edELaaVLAHElgHv~~rH~ 303 (475)
+.+.++.+-..++..-+ |.|++.... +-. .. +..++..++||||+|.--++.
T Consensus 254 ~~~v~~p~~~~~~me~~-g~i~~~~~~l~~~~~~~~~~~~~~~~~~iahElahqWfGn~ 311 (390)
T PF01433_consen 254 LDIVAVPDFPFGGMENW-GLITYRESYLLYDPDISTIGDKQEIASLIAHELAHQWFGNL 311 (390)
T ss_dssp EEEEEEST-SSSEE--T-TEEEEEGGGTS-STTTS-HHHHHHHHHHHHHHHHTTTBTTT
T ss_pred eeEEEEecccccccccc-ccccccccccccCcccccchhhhhhHHHHHHHHHHHHhccC
Confidence 34444443234566555 445555443 322 11 235789999999999977764
No 78
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=45.78 E-value=13 Score=32.72 Aligned_cols=30 Identities=27% Similarity=0.610 Sum_probs=21.5
Q ss_pred eEEEehh-hhhhcCC-H----HHHHHHHHHHHHhhh
Q psy768 270 RIFMFTG-MFQLCQT-D----DELATVLSHELSHTL 299 (475)
Q Consensus 270 ~IvV~~G-LL~~l~~-e----dELaaVLAHElgHv~ 299 (475)
+|.+|.+ |++.+.+ + |++.-||-|||||.-
T Consensus 86 rItlYRrailDywae~eetlgd~vthvliHEIgHhF 121 (136)
T COG3824 86 RITLYRRALLDYWAENEETLGDQVTHVLIHEIGHHF 121 (136)
T ss_pred eeeeeHHHHHHHHhhhhhhHhhHhhhhhhhhhhhhc
Confidence 6888876 5666643 3 467888999999863
No 79
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=44.47 E-value=11 Score=36.22 Aligned_cols=16 Identities=31% Similarity=0.532 Sum_probs=11.5
Q ss_pred HHHHHHHHhhhc-ccHH
Q psy768 289 TVLSHELSHTLL-KHVA 304 (475)
Q Consensus 289 aVLAHElgHv~~-rH~~ 304 (475)
-+++|||||... +|..
T Consensus 139 ~~~aHEiGH~lGl~H~~ 155 (206)
T PF13583_consen 139 QTFAHEIGHNLGLRHDF 155 (206)
T ss_pred hHHHHHHHHHhcCCCCc
Confidence 459999999864 3443
No 80
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=43.74 E-value=25 Score=38.67 Aligned_cols=54 Identities=22% Similarity=0.271 Sum_probs=28.4
Q ss_pred CCCcceeEEEEecCC---cceeEeC-----CC-----eEEEehhhhhhcCCHHHHHHHHHHHHHhhhc
Q psy768 246 HNNFKYPITIIDDPL---INAFVFP-----DG-----RIFMFTGMFQLCQTDDELATVLSHELSHTLL 300 (475)
Q Consensus 246 ~~~~~~~v~Vv~~~~---~NAFalp-----gG-----~IvV~~GLL~~l~~edELaaVLAHElgHv~~ 300 (475)
+.+.++-|||...+. .=|+|.+ +| .|.+....+... ..+...-|+.|||+|+..
T Consensus 157 v~~~D~vlyV~~~~~~~~~~A~A~~C~~~~~~RP~~G~in~~p~~i~~~-~~~~~~~~~~HEi~HaLG 223 (521)
T PF01457_consen 157 VPNADFVLYVTARPSSSSTLAWAAPCQQDSDGRPIAGVININPSYIPSF-YFQEFFRTVIHEIAHALG 223 (521)
T ss_dssp EES-SEEEEEE----STT---EEEEEEE-TTS-EEEEEEE--GGG---S---HHHHHHHHHHHHHHTT
T ss_pred cCCccEEEEEEEeeccCCceEEEeehhhccCCCCeeEEEEEchhHccch-hhhcccceeeeeeeeeee
Confidence 345678888876552 2366643 33 477777666654 357788899999999975
No 81
>PF12652 CotJB: CotJB protein; InterPro: IPR024207 The cotJ operon proteins affect spore coat composition, and is controlled by sigma E. The genes, which include CotJB, are either required for the normal formation of the inner layers of the coat or are themselves structural components of the coat []. CotJB has been identified as a spore coat protein [].
Probab=43.58 E-value=57 Score=26.61 Aligned_cols=39 Identities=26% Similarity=0.385 Sum_probs=26.5
Q ss_pred ccccCCCCChHHHHHHHHHHHHHHHHHHHh--cCCCCCCCC
Q psy768 417 EEYLSTHPSHENRANNLESKMKEALDIRKE--CNCLPLGPL 455 (475)
Q Consensus 417 ~~~lsTHP~~~~RI~~L~~~i~~a~~~~~~--~~c~~~~~~ 455 (475)
.-|+.|||...+=++..++...+-.++.+. ..+++|--.
T Consensus 19 ~LyLDTHP~d~~Al~~y~~~~~~~~~l~~~Ye~~yGPLt~~ 59 (78)
T PF12652_consen 19 NLYLDTHPDDQEALEYYNEYSKQRKQLKKEYEKRYGPLTNF 59 (78)
T ss_pred HHHhcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCC
Confidence 457899999999999888776655444442 255555433
No 82
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=42.52 E-value=37 Score=28.84 Aligned_cols=33 Identities=18% Similarity=0.380 Sum_probs=21.7
Q ss_pred CeEEEehhhh-hhcCCHH----HHHHHHHHHHHhhhcc
Q psy768 269 GRIFMFTGMF-QLCQTDD----ELATVLSHELSHTLLK 301 (475)
Q Consensus 269 G~IvV~~GLL-~~l~~ed----ELaaVLAHElgHv~~r 301 (475)
-+|+||..=| ..+.+.+ ++.-|+-||+||+..-
T Consensus 50 ~rI~lyR~pl~~~~~~~~eL~~~I~~tlvhEiah~fG~ 87 (97)
T PF06262_consen 50 DRIVLYRRPLERRARSREELAELIRDTLVHEIAHHFGI 87 (97)
T ss_dssp EEEEEEHHHHHHT-SSHHHHHHHHHHHHHHHHHHHTT-
T ss_pred CEEEEehHHHHHHhCCHHHHHHHHHHHHHHHHHHHcCC
Confidence 3688888654 4566655 4566888999988653
No 83
>PF13150 DUF3989: Protein of unknown function (DUF3989)
Probab=41.51 E-value=48 Score=27.51 Aligned_cols=18 Identities=28% Similarity=0.503 Sum_probs=13.8
Q ss_pred hhHHHHHHhcCchhhHHH
Q psy768 133 RISRKWWSKLSPDQKKII 150 (475)
Q Consensus 133 ~~~~~w~~~l~~~~r~~~ 150 (475)
..+|.+|++|||.+|...
T Consensus 13 ~~Lr~~c~~Lsp~~R~~v 30 (85)
T PF13150_consen 13 DRLRRYCGRLSPKQRLRV 30 (85)
T ss_pred HHHHHHHhcCCHHHHHHH
Confidence 457888899999887554
No 84
>PF06861 BALF1: BALF1 protein; InterPro: IPR010677 Epstein-Barr virus (strain GD1) (HHV-4), a human tumour DNA virus and a prominent member of gamma-herpesviruses, encodes homologues of cellular antiapoptotic viral Bcl-2 proteins BALF1 and BHRF1. They protect the virus from apoptosis in its host cell during virus synthesis [, ]. The virus infects B lymphocytes to establish a latent infection and yield proliferating, growth-transformed B cells in vitro. Bcl-2 genes are essential for the initial evasion of apoptosis which allows it to establish a latent infection or cause cellular transformation, or both []. Bcl-2 family proteins can inhibit or induce programmed cell death in part by counteracting the activity of other BCL-2 family members. BALF1, inhibits the antiapoptotic activity of EBV BHRF1 and of KSBcl-2 in several transfected cell lines. BALF1 fails, however, to inhibit the cellular BCL-2 family member, BCL-x(L). Thus, BALF1 acts as a negative regulator of the survival function of BHRF1, similar to the counterbalance observed between cellular BCL-2 family members [].
Probab=41.15 E-value=18 Score=33.77 Aligned_cols=28 Identities=29% Similarity=0.294 Sum_probs=23.2
Q ss_pred hhcCCHHHHHHHHHHHHHhhhcccHHHH
Q psy768 279 QLCQTDDELATVLSHELSHTLLKHVAEK 306 (475)
Q Consensus 279 ~~l~~edELaaVLAHElgHv~~rH~~~~ 306 (475)
..+.|+.|=++|++|++||...+|-.--
T Consensus 124 ~~l~~d~e~~s~v~~~lA~Fy~~~r~~W 151 (182)
T PF06861_consen 124 RNLLNDHENASLVSHALAHFYLRYRRAW 151 (182)
T ss_pred HHhcCchHHHHHHHHHHHHHHHHHHHHH
Confidence 3456789999999999999999987643
No 85
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=41.15 E-value=14 Score=35.60 Aligned_cols=26 Identities=31% Similarity=0.535 Sum_probs=18.7
Q ss_pred EEEehhhhhhcCCHHHHHHHHHHHHHhhhcc
Q psy768 271 IFMFTGMFQLCQTDDELATVLSHELSHTLLK 301 (475)
Q Consensus 271 IvV~~GLL~~l~~edELaaVLAHElgHv~~r 301 (475)
|.|-.||=. .-..+|||||+.|..++
T Consensus 82 Ilvl~GLPr-----ll~gsiLAHE~mHa~Lr 107 (212)
T PF12315_consen 82 ILVLYGLPR-----LLTGSILAHELMHAWLR 107 (212)
T ss_pred EEEECCCCH-----HHHhhHHHHHHHHHHhc
Confidence 566666632 33478999999999875
No 86
>COG3864 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.65 E-value=30 Score=35.58 Aligned_cols=37 Identities=24% Similarity=0.492 Sum_probs=32.2
Q ss_pred CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHH
Q psy768 268 DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAE 305 (475)
Q Consensus 268 gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~ 305 (475)
+-+++.+-..+..+ ..+++.+.|-||+=|+.++|+-.
T Consensus 51 ~~~~y~NPei~~~~-p~~~~~aLl~HEV~Hi~l~Hi~r 87 (396)
T COG3864 51 YFTMYFNPEIFLNC-PISEMKALLKHEVYHIMLNHIKR 87 (396)
T ss_pred ceEEEeCHHHHccC-CHHHHHHHHHHHHHHHHHHHHHh
Confidence 33688888888888 57999999999999999999974
No 87
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=39.50 E-value=22 Score=33.67 Aligned_cols=17 Identities=35% Similarity=0.468 Sum_probs=13.5
Q ss_pred HHHHHHHHHHHHhhhcc
Q psy768 285 DELATVLSHELSHTLLK 301 (475)
Q Consensus 285 dELaaVLAHElgHv~~r 301 (475)
....+|+.|||||.+.=
T Consensus 90 ~~~~~~i~HElgHaLG~ 106 (198)
T cd04327 90 PEFSRVVLHEFGHALGF 106 (198)
T ss_pred hhHHHHHHHHHHHHhcC
Confidence 35668999999998754
No 88
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=39.44 E-value=4.4 Score=38.66 Aligned_cols=14 Identities=43% Similarity=0.624 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHhhh
Q psy768 286 ELATVLSHELSHTL 299 (475)
Q Consensus 286 ELaaVLAHElgHv~ 299 (475)
..+.++||||||..
T Consensus 139 ~~a~~~aHElGH~L 152 (207)
T cd04273 139 SSAFTIAHELGHVL 152 (207)
T ss_pred eeEEeeeeechhhc
Confidence 35889999999986
No 89
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=36.33 E-value=22 Score=33.15 Aligned_cols=15 Identities=40% Similarity=0.678 Sum_probs=12.2
Q ss_pred HHHHHHHHHHhhhcc
Q psy768 287 LATVLSHELSHTLLK 301 (475)
Q Consensus 287 LaaVLAHElgHv~~r 301 (475)
+.+++.||+||...-
T Consensus 7 ~i~i~~HE~gH~~~a 21 (192)
T PF02163_consen 7 LISIVLHELGHALAA 21 (192)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred ccccccccccccccc
Confidence 467899999998654
No 90
>PF14521 Aspzincin_M35: Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=35.80 E-value=47 Score=30.16 Aligned_cols=39 Identities=31% Similarity=0.496 Sum_probs=26.2
Q ss_pred CcceeEeCCC---eEEEehhhhhhc-CCHHHHHHHHHHHHHhh
Q psy768 260 LINAFVFPDG---RIFMFTGMFQLC-QTDDELATVLSHELSHT 298 (475)
Q Consensus 260 ~~NAFalpgG---~IvV~~GLL~~l-~~edELaaVLAHElgHv 298 (475)
..-|++.|+. .|+|-......- ...+-=+++|.||+.|.
T Consensus 65 ~~~a~~~~~~~~~~IyLc~~F~~~p~~g~~Sk~~TLiHE~SHf 107 (148)
T PF14521_consen 65 NTYAYVYPDSPTYTIYLCPAFFSAPTTGKDSKEGTLIHEWSHF 107 (148)
T ss_dssp SSSEEE-TTST-TEEEE-HHHHHS-SSSTT-HHHHHHHHHHHS
T ss_pred ceEEEEECCCCceEEEEChhhcCCCCCCCCchHHHHHHhhhhh
Confidence 5567778876 499988887742 23355689999999993
No 91
>TIGR03818 MotA1 flagellar motor stator protein MotA. This model represents one family of MotA proteins which are often not identified by the "transporter, MotA/TolQ/ExbB proton channel family" model, pfam01618.
Probab=35.44 E-value=4.9e+02 Score=26.37 Aligned_cols=34 Identities=12% Similarity=0.153 Sum_probs=23.7
Q ss_pred EEehhh---hhhcCCHHHHHHHHHHHHHhhhcccHHH
Q psy768 272 FMFTGM---FQLCQTDDELATVLSHELSHTLLKHVAE 305 (475)
Q Consensus 272 vV~~GL---L~~l~~edELaaVLAHElgHv~~rH~~~ 305 (475)
|+-+|+ ++...+++++..+|--|+-....+|...
T Consensus 125 Fl~~gl~l~vdG~~~~~~i~~iLe~ei~~~~~~~~~~ 161 (282)
T TIGR03818 125 FICDYLRLMVMGNMNPHELEALMEEEIETHHHELLKP 161 (282)
T ss_pred HHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHHHHH
Confidence 444554 2333378999999999998887777653
No 92
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=35.32 E-value=19 Score=39.84 Aligned_cols=49 Identities=16% Similarity=0.213 Sum_probs=28.7
Q ss_pred eEEEEecCCcceeEeC--C-CeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHH
Q psy768 252 PITIIDDPLINAFVFP--D-GRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA 304 (475)
Q Consensus 252 ~v~Vv~~~~~NAFalp--g-G~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~ 304 (475)
.++-.++....|||.+ . +.-+|. ++.-.+-+. ..+|+||+||..|....
T Consensus 303 D~~~r~gK~~Gayc~~~~~~~~P~I~---~Nf~~t~~d-v~TL~HElGHa~H~~~s 354 (549)
T TIGR02289 303 DLESRKGKAAGGYCTYLPKYKAPFIF---SNFNGTSGD-IDVLTHEAGHAFHVYES 354 (549)
T ss_pred ccCCCCCCCCCcccCCCCCCCCcEEE---EeCCCChhH-HHHHHHHhhHHHHHHHh
Confidence 3444445577899865 2 333442 222233334 45899999999877544
No 93
>PRK12482 flagellar motor protein MotA; Provisional
Probab=34.29 E-value=5.2e+02 Score=26.33 Aligned_cols=25 Identities=20% Similarity=0.234 Sum_probs=21.4
Q ss_pred CCHHHHHHHHHHHHHhhhcccHHHH
Q psy768 282 QTDDELATVLSHELSHTLLKHVAEK 306 (475)
Q Consensus 282 ~~edELaaVLAHElgHv~~rH~~~~ 306 (475)
.+++++..+|-.|+.....+|....
T Consensus 138 ~d~~~i~~iLe~eie~~~~r~~~~a 162 (287)
T PRK12482 138 INQHELEGILDQELDAVEEELLQPS 162 (287)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHhHH
Confidence 4789999999999999998887643
No 94
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=33.54 E-value=25 Score=33.70 Aligned_cols=15 Identities=47% Similarity=0.693 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHhhhc
Q psy768 286 ELATVLSHELSHTLL 300 (475)
Q Consensus 286 ELaaVLAHElgHv~~ 300 (475)
=+.+|+.||+||...
T Consensus 37 l~~~v~iHElgH~~~ 51 (208)
T cd06161 37 LFLSVLLHELGHALV 51 (208)
T ss_pred HHHHHHHHHHHHHHH
Confidence 467899999999864
No 95
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=31.40 E-value=61 Score=31.24 Aligned_cols=22 Identities=27% Similarity=0.270 Sum_probs=14.1
Q ss_pred cccchHHHHHHHHHHHHHHHHh
Q psy768 157 HQDKIAGITLTSIGLAYIYYFL 178 (475)
Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~ 178 (475)
.+|.++|++++++++|+--|+.
T Consensus 22 Gk~li~gviLg~~~lfGW~ywq 43 (207)
T COG2976 22 GKALIVGVILGLGGLFGWRYWQ 43 (207)
T ss_pred CchhHHHHHHHHHHHHHHHHHH
Confidence 3677777777777776654443
No 96
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=30.75 E-value=52 Score=38.39 Aligned_cols=36 Identities=19% Similarity=0.362 Sum_probs=23.1
Q ss_pred CeEEEehhhhhhc-CC---HHHHHHHHHHHHHhhhcccHH
Q psy768 269 GRIFMFTGMFQLC-QT---DDELATVLSHELSHTLLKHVA 304 (475)
Q Consensus 269 G~IvV~~GLL~~l-~~---edELaaVLAHElgHv~~rH~~ 304 (475)
|.|.+...++..- .+ ...++.|++|||+|---++..
T Consensus 265 Glit~~e~~l~~~~~~~~~~~~~~~viaHElAHqWFGnlV 304 (831)
T TIGR02412 265 GCVTFAENFLHRAEATRAEKENRAGVILHEMAHMWFGDLV 304 (831)
T ss_pred ceeeechhhccCCcCCHHHHHHHHHHHHHHHHHHHhCCEe
Confidence 4566666554321 11 235789999999998777543
No 97
>PTZ00337 surface protease GP63; Provisional
Probab=30.46 E-value=78 Score=35.31 Aligned_cols=53 Identities=19% Similarity=0.135 Sum_probs=33.9
Q ss_pred cCCCcceeEEEEecCC--cceeEeC-----C-----CeEEEehhhhhhcCCHHHHHHHHHHHHHhhhc
Q psy768 245 EHNNFKYPITIIDDPL--INAFVFP-----D-----GRIFMFTGMFQLCQTDDELATVLSHELSHTLL 300 (475)
Q Consensus 245 ~~~~~~~~v~Vv~~~~--~NAFalp-----g-----G~IvV~~GLL~~l~~edELaaVLAHElgHv~~ 300 (475)
++.+.++.++|.-.|. .=|+|.+ . |.|.++...+ .+.++.-.|+.|||+|+..
T Consensus 178 Gv~naD~vLyV~a~p~~~~~A~A~~C~~~~~~RPi~G~in~np~~i---~~~~~~~~v~~HEi~HALG 242 (567)
T PTZ00337 178 GVAGADMILYANIFPTSGPAAWAIPCFLLDDGRPFAAAVNFDPRQI---AVTNGDVRVAAHELGHALG 242 (567)
T ss_pred CccccceEEEEEeccCCCcceEEEEcccccCCCceEEEEEECHHHc---cchhHHHHHHHHHHHHHHc
Confidence 4456678888865442 2366654 2 3466665444 3346778899999999974
No 98
>PRK08990 flagellar motor protein PomA; Reviewed
Probab=30.13 E-value=5.6e+02 Score=25.48 Aligned_cols=24 Identities=17% Similarity=0.266 Sum_probs=20.2
Q ss_pred CCHHHHHHHHHHHHHhhhcccHHH
Q psy768 282 QTDDELATVLSHELSHTLLKHVAE 305 (475)
Q Consensus 282 ~~edELaaVLAHElgHv~~rH~~~ 305 (475)
.+++++..+|-.|+.....+|...
T Consensus 116 ~~~~~i~~~l~~~~~~~~~~~~~~ 139 (254)
T PRK08990 116 HDGDVVRAALEKDIALTEERHETG 139 (254)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHhH
Confidence 478999999999999988887653
No 99
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=29.72 E-value=44 Score=39.13 Aligned_cols=56 Identities=20% Similarity=0.363 Sum_probs=32.9
Q ss_pred cceeEEEEecCCcceeEeCC-CeEEEehhh-hhh---cCC--HHHHHHHHHHHHHhhhcccHH
Q psy768 249 FKYPITIIDDPLINAFVFPD-GRIFMFTGM-FQL---CQT--DDELATVLSHELSHTLLKHVA 304 (475)
Q Consensus 249 ~~~~v~Vv~~~~~NAFalpg-G~IvV~~GL-L~~---l~~--edELaaVLAHElgHv~~rH~~ 304 (475)
++|...+|.-+..|+=++=+ |.|.+++.. +.. ..+ -...+.|++||++|---+.-.
T Consensus 262 y~l~~~~V~v~~f~~GaMEN~Gl~tf~~~~ll~~~~~at~~~~~~~~~viaHElaHqWfGnlV 324 (859)
T COG0308 262 YALPIDIVAVPDFSAGAMENWGLVTFREKYLLADPETATDSDYENVEEVIAHELAHQWFGNLV 324 (859)
T ss_pred CCCcccEEeccCCCCccccccceeEEeeeEEeeCcccchhHHHHHHHHHHHHHHhhhccccee
Confidence 33443556666677655533 455555553 221 111 245777999999998777654
No 100
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=29.49 E-value=13 Score=37.43 Aligned_cols=11 Identities=45% Similarity=0.797 Sum_probs=10.0
Q ss_pred HHHHHHHHHhh
Q psy768 288 ATVLSHELSHT 298 (475)
Q Consensus 288 aaVLAHElgHv 298 (475)
.+|++||+||.
T Consensus 166 igv~~HE~gH~ 176 (286)
T TIGR03296 166 VGVIAHELGHD 176 (286)
T ss_pred eeeeehhhhcc
Confidence 59999999995
No 101
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=29.16 E-value=33 Score=32.72 Aligned_cols=16 Identities=38% Similarity=0.567 Sum_probs=12.9
Q ss_pred HHHHHHHHHHHhhhcc
Q psy768 286 ELATVLSHELSHTLLK 301 (475)
Q Consensus 286 ELaaVLAHElgHv~~r 301 (475)
.+..++.||+||....
T Consensus 21 ~~l~t~~HE~gHal~a 36 (200)
T PF13398_consen 21 RLLVTFVHELGHALAA 36 (200)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4667999999998653
No 102
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=29.08 E-value=34 Score=31.64 Aligned_cols=13 Identities=54% Similarity=0.779 Sum_probs=11.1
Q ss_pred HHHHHHHHHHhhh
Q psy768 287 LATVLSHELSHTL 299 (475)
Q Consensus 287 LaaVLAHElgHv~ 299 (475)
+.+|+.||+||..
T Consensus 8 ~i~i~iHE~gH~~ 20 (180)
T cd05709 8 LISVTVHELGHAL 20 (180)
T ss_pred HHHHHHHHHHHHH
Confidence 4578999999986
No 103
>PRK08124 flagellar motor protein MotA; Validated
Probab=28.72 E-value=6e+02 Score=25.35 Aligned_cols=24 Identities=25% Similarity=0.314 Sum_probs=19.4
Q ss_pred CCHHHHHHHHHHHHHhhhcccHHH
Q psy768 282 QTDDELATVLSHELSHTLLKHVAE 305 (475)
Q Consensus 282 ~~edELaaVLAHElgHv~~rH~~~ 305 (475)
.+++++..+|-.|+.....+|...
T Consensus 120 ~~~~~i~~~le~~i~~~~~~~~~~ 143 (263)
T PRK08124 120 QSPEFIRDVLEEEIEAMEERHAAG 143 (263)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHhh
Confidence 378899999999999888776553
No 104
>KOG3658|consensus
Probab=27.67 E-value=21 Score=40.08 Aligned_cols=34 Identities=18% Similarity=0.205 Sum_probs=25.4
Q ss_pred CCeEEEehhhhhhcC-----CHHHHHHHHHHHHHhhhcc
Q psy768 268 DGRIFMFTGMFQLCQ-----TDDELATVLSHELSHTLLK 301 (475)
Q Consensus 268 gG~IvV~~GLL~~l~-----~edELaaVLAHElgHv~~r 301 (475)
|-+.++++||+...+ -..+-.-|+|||+||--..
T Consensus 368 G~~~sLNtGi~T~~NYg~~Vp~kvs~lt~AHEiGHNfGS 406 (764)
T KOG3658|consen 368 GKKRSLNTGISTSVNYGKRVPTKVSDLTLAHEIGHNFGS 406 (764)
T ss_pred cceEEeecceeeeeecCCccCcchhheeehhhhccccCC
Confidence 346899999987763 1357778999999997543
No 105
>KOG1046|consensus
Probab=27.19 E-value=1.1e+02 Score=35.97 Aligned_cols=52 Identities=17% Similarity=0.275 Sum_probs=30.1
Q ss_pred EEEEecCCcceeEeC-CCeEEEehhhhhh------cCCHHHHHHHHHHHHHhhhcccHH
Q psy768 253 ITIIDDPLINAFVFP-DGRIFMFTGMFQL------CQTDDELATVLSHELSHTLLKHVA 304 (475)
Q Consensus 253 v~Vv~~~~~NAFalp-gG~IvV~~GLL~~------l~~edELaaVLAHElgHv~~rH~~ 304 (475)
+-+|--|+..+=|+= .|.|..-...|.. ..+...++.|+|||+||.--+...
T Consensus 283 ~D~iavPdf~~GAMENwGLvtyre~~lL~~~~~ss~~~k~~va~vIaHElAHQWFGNLV 341 (882)
T KOG1046|consen 283 LDLVAVPDFSAGAMENWGLVTYRETALLYDPQTSSSSNKQRVAEVIAHELAHQWFGNLV 341 (882)
T ss_pred ccEEecCCccccchhcCcceeeeehhhccCCCcCcHHHHHHHHHHHHHHHHHHHhcCcc
Confidence 444444544443332 3555544443322 123567999999999998777554
No 106
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=26.93 E-value=30 Score=37.02 Aligned_cols=18 Identities=28% Similarity=0.589 Sum_probs=14.1
Q ss_pred CHHHHHHHHHHHHHhhhcc
Q psy768 283 TDDELATVLSHELSHTLLK 301 (475)
Q Consensus 283 ~edELaaVLAHElgHv~~r 301 (475)
+-+++ ..|+||+||..|.
T Consensus 239 ~~~~v-~tLfHE~GHa~H~ 256 (458)
T PF01432_consen 239 SHDDV-ETLFHEFGHAMHS 256 (458)
T ss_dssp SHHHH-HHHHHHHHHHHHH
T ss_pred ChhhH-HHHHHHHhHHHHH
Confidence 44666 7899999999764
No 107
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=26.90 E-value=33 Score=37.16 Aligned_cols=18 Identities=28% Similarity=0.530 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHhhhcccH
Q psy768 285 DELATVLSHELSHTLLKHV 303 (475)
Q Consensus 285 dELaaVLAHElgHv~~rH~ 303 (475)
+++ ..|+||+||..|...
T Consensus 262 ~~V-~TLfHEfGHalH~~l 279 (472)
T cd06455 262 DEV-ETFFHEFGHVIHHLL 279 (472)
T ss_pred HHH-HHHHHHHHHHHHHHh
Confidence 554 579999999987543
No 108
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=26.72 E-value=39 Score=33.02 Aligned_cols=15 Identities=53% Similarity=0.793 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHhhhc
Q psy768 286 ELATVLSHELSHTLL 300 (475)
Q Consensus 286 ELaaVLAHElgHv~~ 300 (475)
=..+|+-||+||...
T Consensus 52 l~~~v~iHElgH~~~ 66 (227)
T cd06164 52 LFASVLLHELGHSLV 66 (227)
T ss_pred HHHHHHHHHHHHHHH
Confidence 457899999999864
No 109
>COG3426 Butyrate kinase [Energy production and conversion]
Probab=26.57 E-value=60 Score=33.11 Aligned_cols=40 Identities=15% Similarity=0.363 Sum_probs=31.4
Q ss_pred CCcceeE--------eCCCeEEEehhhhhhcCC------HHHHHHHHHHHHHhh
Q psy768 259 PLINAFV--------FPDGRIFMFTGMFQLCQT------DDELATVLSHELSHT 298 (475)
Q Consensus 259 ~~~NAFa--------lpgG~IvV~~GLL~~l~~------edELaaVLAHElgHv 298 (475)
+.++|++ .|||+.+|+.-|++.+.+ ...|.++|||+++--
T Consensus 67 ~~~dAvvgRGGLL~pi~gGTY~Vn~~M~~~lk~~~~G~haSnLGaiiA~~ia~~ 120 (358)
T COG3426 67 SKFDAVVGRGGLLRPIPGGTYVVNEKMLKDLKNGVQGEHASNLGAIIANRIAKA 120 (358)
T ss_pred CCccceeecCccccccCCceeEeCHHHHHHHHcCCCCcchhhhhHHHHHHHhhh
Confidence 3567776 478999999999988742 456999999999854
No 110
>PF12911 OppC_N: N-terminal TM domain of oligopeptide transport permease C
Probab=26.51 E-value=1.3e+02 Score=22.08 Aligned_cols=17 Identities=6% Similarity=0.233 Sum_probs=13.0
Q ss_pred HHHHHHHhcccchHHHH
Q psy768 149 IILSQIKKHQDKIAGIT 165 (475)
Q Consensus 149 ~~~~~~~~~~~~~~~~~ 165 (475)
.+.+++++||..++|++
T Consensus 7 ~~~~~f~~nk~a~~gl~ 23 (56)
T PF12911_consen 7 DAWRRFRRNKLAVIGLI 23 (56)
T ss_pred HHHHHHHhCchHHHHHH
Confidence 57778889988886663
No 111
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=25.69 E-value=34 Score=38.02 Aligned_cols=48 Identities=15% Similarity=0.105 Sum_probs=26.8
Q ss_pred eEEEEecCCcceeEeC--CC-eEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccH
Q psy768 252 PITIIDDPLINAFVFP--DG-RIFMFTGMFQLCQTDDELATVLSHELSHTLLKHV 303 (475)
Q Consensus 252 ~v~Vv~~~~~NAFalp--gG-~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~ 303 (475)
.++-.+.....||+.+ ++ .-+|....- ++-++ ...|+||+||..|...
T Consensus 341 D~~~r~gK~~Ga~~~~~~~~~~p~i~~N~~---~~~~~-v~TL~HE~GHa~H~~l 391 (587)
T TIGR02290 341 DAEPRPGKRGGAFCTGFPPSKEPRVLMNYD---GSRRD-VSTLAHELGHAYHSEL 391 (587)
T ss_pred ccCCCCCCCCCcccCCCCCCCCCEEEEecC---CCchh-HHHHHHHhhHHHHHHH
Confidence 4454555577888854 22 223322221 12223 4579999999988543
No 112
>TIGR00035 asp_race aspartate racemase.
Probab=24.93 E-value=1.5e+02 Score=28.69 Aligned_cols=60 Identities=8% Similarity=0.075 Sum_probs=39.9
Q ss_pred CCCchhHHHHHHHHHhhccccccCCCcccccCCCCChHHHHHHHH--------HHHHHHHHHHHhcCC
Q psy768 390 CYDVRVAPLFWQKMALKETQDQVGPKMEEYLSTHPSHENRANNLE--------SKMKEALDIRKECNC 449 (475)
Q Consensus 390 Gydp~a~~~~l~kl~~~~~~~~~g~~~~~~lsTHP~~~~RI~~L~--------~~i~~a~~~~~~~~c 449 (475)
|..|.+..++++++.+.......++..+...-.||...+|.+.+. ..+.++.+.-++++|
T Consensus 9 Gmgp~at~~~~~~i~~~~~a~~d~~~~~~i~~~~~~~~dr~~~~~~~~~~~~~~~l~~~~~~L~~~g~ 76 (229)
T TIGR00035 9 GMGPLATAELFRRINEKTKAKRDQEHPAEVLFNNPNIPDRTAYILGRGEDRPRPILIDIAVKLENAGA 76 (229)
T ss_pred CcCHHHHHHHHHHHHHHhHHhcCCCCCceeeeeCCCHHHHHHHHhcCCcchHHHHHHHHHHHHHHcCC
Confidence 778899999999988665212222334677889999999988754 244445444455554
No 113
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=24.63 E-value=45 Score=31.48 Aligned_cols=15 Identities=40% Similarity=0.408 Sum_probs=11.8
Q ss_pred HHHHHHHHHHHhhhc
Q psy768 286 ELATVLSHELSHTLL 300 (475)
Q Consensus 286 ELaaVLAHElgHv~~ 300 (475)
=+..+..||+||...
T Consensus 40 l~~~l~iHElgH~~~ 54 (183)
T cd06160 40 LLAILGIHEMGHYLA 54 (183)
T ss_pred HHHHHHHHHHHHHHH
Confidence 456788999999854
No 114
>PRK08456 flagellar motor protein MotA; Validated
Probab=24.55 E-value=7e+02 Score=24.74 Aligned_cols=23 Identities=4% Similarity=-0.012 Sum_probs=19.2
Q ss_pred CCHHHHHHHHHHHHHhhhcccHH
Q psy768 282 QTDDELATVLSHELSHTLLKHVA 304 (475)
Q Consensus 282 ~~edELaaVLAHElgHv~~rH~~ 304 (475)
.+++++..+|-.|+.....+|..
T Consensus 119 ~~~~~i~~~le~ei~~~~~~~~~ 141 (257)
T PRK08456 119 KDLEEIKESMEIQIEEMEEYYHG 141 (257)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHh
Confidence 47899999999999988887764
No 115
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=23.10 E-value=62 Score=38.00 Aligned_cols=19 Identities=21% Similarity=0.378 Sum_probs=15.4
Q ss_pred HHHHHHHHHHHhhhcccHH
Q psy768 286 ELATVLSHELSHTLLKHVA 304 (475)
Q Consensus 286 ELaaVLAHElgHv~~rH~~ 304 (475)
.++.|+|||++|---++..
T Consensus 282 ~i~~VIaHElaHqWfGNlV 300 (863)
T TIGR02414 282 RIESVIAHEYFHNWTGNRV 300 (863)
T ss_pred HHHHHHHHHHHHHHhccee
Confidence 4889999999998776543
No 116
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=22.98 E-value=1.3e+02 Score=33.84 Aligned_cols=55 Identities=18% Similarity=0.233 Sum_probs=33.6
Q ss_pred CCCcceeEEEEecCC-cc--eeEeC-----C-----CeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcc
Q psy768 246 HNNFKYPITIIDDPL-IN--AFVFP-----D-----GRIFMFTGMFQLCQTDDELATVLSHELSHTLLK 301 (475)
Q Consensus 246 ~~~~~~~v~Vv~~~~-~N--AFalp-----g-----G~IvV~~GLL~~l~~edELaaVLAHElgHv~~r 301 (475)
+.+.++-+||.-.|. .+ |+|.+ + |.|.+.-.-+.. ..+++...|+.|||.|...=
T Consensus 203 v~naD~vLYVta~P~~~~~lA~A~~C~~~~d~RPi~G~iNinp~~i~s-~~~~~~~rv~~HEi~HALGF 270 (622)
T PTZ00257 203 VSNADFVLYVASVPSEPGVLAWATTCQVFSDDHPAVGVMNIPAANIVS-RYDQGTTRTVTHEVAHALGF 270 (622)
T ss_pred ccccCEEEEEEEecCCCCeeEEEEEeccccCCCceEEEEeeCHHHCCC-ccchHHHHHHHHHHHHHhcC
Confidence 445677788775552 23 66653 2 346665544432 12356789999999999753
No 117
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=22.61 E-value=53 Score=31.03 Aligned_cols=13 Identities=38% Similarity=0.657 Sum_probs=10.7
Q ss_pred HHHHHHHHHhhhc
Q psy768 288 ATVLSHELSHTLL 300 (475)
Q Consensus 288 aaVLAHElgHv~~ 300 (475)
..|+-||+||...
T Consensus 10 ~~v~iHElGH~~~ 22 (182)
T cd06163 10 ILIFVHELGHFLV 22 (182)
T ss_pred HHHHHHHHHHHHH
Confidence 4689999999864
No 118
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=22.59 E-value=1e+02 Score=29.01 Aligned_cols=17 Identities=47% Similarity=0.800 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHhhh-ccc
Q psy768 286 ELATVLSHELSHTL-LKH 302 (475)
Q Consensus 286 ELaaVLAHElgHv~-~rH 302 (475)
-+.-++.||+||.. +.|
T Consensus 124 R~~k~~~HElGH~lGL~H 141 (179)
T PRK13267 124 RVRKEVTHELGHTLGLEH 141 (179)
T ss_pred HHHHHHHHHHHHHcCCcc
Confidence 45556899999993 344
No 119
>KOG3607|consensus
Probab=22.05 E-value=50 Score=37.83 Aligned_cols=17 Identities=47% Similarity=0.491 Sum_probs=14.2
Q ss_pred HHHHHHHHHHHHHhhhc
Q psy768 284 DDELATVLSHELSHTLL 300 (475)
Q Consensus 284 edELaaVLAHElgHv~~ 300 (475)
....|.|+||||||..-
T Consensus 320 ~~~~a~v~AhelgH~lG 336 (716)
T KOG3607|consen 320 LLAFAVVLAHELGHNLG 336 (716)
T ss_pred chhHHHHHHHHHHhhcC
Confidence 45789999999999854
No 120
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=21.91 E-value=41 Score=34.72 Aligned_cols=21 Identities=24% Similarity=0.393 Sum_probs=14.8
Q ss_pred CHHHHHHHHHHHHHhhhcccHH
Q psy768 283 TDDELATVLSHELSHTLLKHVA 304 (475)
Q Consensus 283 ~edELaaVLAHElgHv~~rH~~ 304 (475)
+-+++. .|.||+||..+.-..
T Consensus 151 ~~~~v~-tl~HE~GHa~h~~l~ 171 (365)
T cd06258 151 GHDDIN-TLFHEFGHAVHFLLI 171 (365)
T ss_pred CHHHHH-HHHHHHhHHHHHHHh
Confidence 445554 689999999876433
No 121
>KOG2975|consensus
Probab=21.84 E-value=1.2e+02 Score=30.47 Aligned_cols=33 Identities=18% Similarity=0.283 Sum_probs=27.7
Q ss_pred hHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhc
Q psy768 375 ETEADEVGLKLMARACYDVRVAPLFWQKMALKE 407 (475)
Q Consensus 375 E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~ 407 (475)
-++|+++|++++.+.+++|.....++..|++.+
T Consensus 172 ~~~~ervgl~li~kt~~sp~r~~~l~~dLqQv~ 204 (288)
T KOG2975|consen 172 YYDAERVGLDLIEKTSFSPSRVAGLSSDLQQVE 204 (288)
T ss_pred ecchhhhHHHHHHHhccChhhhccccchHHHHH
Confidence 468999999999999999998877776666655
No 122
>PF04612 T2SM: Type II secretion system (T2SS), protein M; InterPro: IPR007690 General secretion pathway (GSP) protein M is a membrane protein involved in the export of proteins in bacteria. It consists of a short cytosolic N-terminal domain, a transmembrane domain, and a C-terminal periplasmic domain. The precise function of this protein is unknown, though in Vibrio cholerae, the EpsM protein interacts with the EpsL protein, and also forms homodimers [],; GO: 0006858 extracellular transport; PDB: 1UV7_A.
Probab=21.59 E-value=31 Score=31.20 Aligned_cols=14 Identities=36% Similarity=1.113 Sum_probs=0.0
Q ss_pred hHHHHHHhcCchhh
Q psy768 134 ISRKWWSKLSPDQK 147 (475)
Q Consensus 134 ~~~~w~~~l~~~~r 147 (475)
.++.||..|++..|
T Consensus 3 ~l~~~w~~ls~REr 16 (160)
T PF04612_consen 3 QLKQWWQSLSPRER 16 (160)
T ss_dssp --------------
T ss_pred HHHHHHHhCCHHHH
Confidence 46789999888654
No 123
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=21.52 E-value=43 Score=32.16 Aligned_cols=15 Identities=27% Similarity=0.330 Sum_probs=12.1
Q ss_pred HHHHHHHHHhhhccc
Q psy768 288 ATVLSHELSHTLLKH 302 (475)
Q Consensus 288 aaVLAHElgHv~~rH 302 (475)
.+++.|||+|.+.=+
T Consensus 88 ~Gti~HEl~HaLGf~ 102 (200)
T cd04281 88 FGIVVHELGHVIGFW 102 (200)
T ss_pred CchHHHHHHHHhcCc
Confidence 479999999987543
No 124
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=21.42 E-value=44 Score=31.63 Aligned_cols=14 Identities=36% Similarity=0.453 Sum_probs=11.6
Q ss_pred HHHHHHHHHhhhcc
Q psy768 288 ATVLSHELSHTLLK 301 (475)
Q Consensus 288 aaVLAHElgHv~~r 301 (475)
.+++.|||+|.+.=
T Consensus 78 ~G~i~HEl~HaLG~ 91 (182)
T cd04283 78 KGIIQHELLHALGF 91 (182)
T ss_pred cchHHHHHHHHhCC
Confidence 48999999998753
No 125
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=21.14 E-value=57 Score=32.70 Aligned_cols=15 Identities=47% Similarity=0.660 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHhhhc
Q psy768 286 ELATVLSHELSHTLL 300 (475)
Q Consensus 286 ELaaVLAHElgHv~~ 300 (475)
=+.+|+.||+||...
T Consensus 117 L~isv~iHElgHa~~ 131 (263)
T cd06159 117 LVVGVVVHELSHGIL 131 (263)
T ss_pred HHHHHHHHHHHHHHH
Confidence 457799999999854
No 126
>PF05960 DUF885: Bacterial protein of unknown function (DUF885); InterPro: IPR010281 This family consists of hypothetical bacterial proteins.; PDB: 3O0Y_B 3U24_A 3IUK_A.
Probab=20.98 E-value=4.2e+02 Score=28.89 Aligned_cols=74 Identities=18% Similarity=0.209 Sum_probs=39.3
Q ss_pred HHHHHHHHHHHHhcc-ccccCCCcceeEEEEecC----CcceeEeCC--------CeEEEehhhhhhcCCHHHHHHHHHH
Q psy768 227 KRVGAVVKRLIDANK-VYMEHNNFKYPITIIDDP----LINAFVFPD--------GRIFMFTGMFQLCQTDDELATVLSH 293 (475)
Q Consensus 227 ~~v~~i~~rL~~~~~-~~p~~~~~~~~v~Vv~~~----~~NAFalpg--------G~IvV~~GLL~~l~~edELaaVLAH 293 (475)
...+++.+++....+ ..+.+...++.|.-+... ..-||..|+ |.++|+..-.... +.-++..+++|
T Consensus 300 ~~~~~~~~~~~~~~~~~f~~~p~~~~~v~~~p~~~~~~~~~a~y~~~~~~d~~~~~~~~in~~~~~~~-~~~~~~~~~~H 378 (549)
T PF05960_consen 300 DDYRELLERLKAKLPDLFPTIPDQPYEVRPVPPFLEPSAPPAYYQPPPPIDGSRPGRFYINPPDLRSR-NKYSLYTTAAH 378 (549)
T ss_dssp HHHHHHHHHHHHHHCTCS---TCS-EEEEEEECCCHTTS-SEEEEE--TTTSSS-EEEEE--STHHCC-ECTCHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHhcccCCCCCCceeecCccccCcCCCCcccCCCCCCCCCCcEEEEeCCCcccc-ChhhHHHHHHh
Confidence 455566666543322 223344566777766653 234565553 5688888554444 33589999999
Q ss_pred HH--Hhhhcc
Q psy768 294 EL--SHTLLK 301 (475)
Q Consensus 294 El--gHv~~r 301 (475)
|. ||+.+-
T Consensus 379 E~~PGHh~Q~ 388 (549)
T PF05960_consen 379 EAYPGHHLQI 388 (549)
T ss_dssp HSTTTHHHHH
T ss_pred hcCCcHHHHH
Confidence 98 566543
No 127
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=20.54 E-value=55 Score=30.91 Aligned_cols=14 Identities=36% Similarity=0.396 Sum_probs=11.5
Q ss_pred HHHHHHHHHhhhcc
Q psy768 288 ATVLSHELSHTLLK 301 (475)
Q Consensus 288 aaVLAHElgHv~~r 301 (475)
.+++.|||+|.+.=
T Consensus 80 ~~~i~HEl~HaLG~ 93 (191)
T PF01400_consen 80 VGTILHELGHALGF 93 (191)
T ss_dssp HHHHHHHHHHHHTB
T ss_pred ccchHHHHHHHHhh
Confidence 56999999998654
No 128
>KOG3314|consensus
Probab=20.53 E-value=2.5e+02 Score=26.38 Aligned_cols=32 Identities=25% Similarity=0.320 Sum_probs=24.5
Q ss_pred EEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHH
Q psy768 271 IFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAE 305 (475)
Q Consensus 271 IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~ 305 (475)
|++-. +.+.+.+++.-|++|||-|..-.|-++
T Consensus 78 IvlCq---N~l~~q~h~n~vv~HElIH~fDd~r~~ 109 (194)
T KOG3314|consen 78 IVLCQ---NRLTIQDHVNQVVIHELIHAFDDCRAK 109 (194)
T ss_pred eEEec---cccchHHHHHHHHHHHHHHHHHhhhhh
Confidence 76665 345667899999999999997766554
Done!