Query psy6925
Match_columns 131
No_of_seqs 153 out of 1053
Neff 9.7
Searched_HMMs 46136
Date Fri Aug 16 21:57:19 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6925.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6925hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1046|consensus 100.0 3.9E-42 8.4E-47 270.8 10.2 131 1-131 219-349 (882)
2 TIGR02414 pepN_proteo aminopep 100.0 1.3E-38 2.7E-43 249.6 11.4 131 1-131 174-308 (863)
3 PRK14015 pepN aminopeptidase N 100.0 3.7E-38 7.9E-43 247.4 12.5 131 1-131 187-321 (875)
4 PF01433 Peptidase_M1: Peptida 100.0 2.2E-38 4.7E-43 231.8 10.0 131 1-131 190-320 (390)
5 TIGR02412 pepN_strep_liv amino 100.0 6.3E-38 1.4E-42 246.6 12.8 128 1-131 183-312 (831)
6 COG0308 PepN Aminopeptidase N 100.0 2E-35 4.3E-40 232.8 9.0 130 1-131 200-332 (859)
7 TIGR02411 leuko_A4_hydro leuko 100.0 2.3E-33 5E-38 214.3 8.1 117 1-131 186-304 (601)
8 KOG1047|consensus 99.9 9.1E-28 2E-32 176.8 4.7 117 1-131 195-313 (613)
9 KOG1932|consensus 99.8 1.4E-18 3.1E-23 136.3 7.6 126 1-131 241-366 (1180)
10 COG3975 Predicted protease wit 98.1 3.7E-05 8.1E-10 58.1 9.3 100 23-127 167-268 (558)
11 PF13485 Peptidase_MA_2: Pepti 96.6 0.0015 3.2E-08 40.2 2.2 21 104-124 23-43 (128)
12 PRK04860 hypothetical protein; 96.3 0.009 1.9E-07 39.1 4.4 72 41-119 5-76 (160)
13 smart00731 SprT SprT homologue 95.3 0.026 5.5E-07 36.3 3.5 17 104-120 57-73 (146)
14 PF01863 DUF45: Protein of unk 94.9 0.09 1.9E-06 35.4 5.4 68 41-119 110-177 (205)
15 PF12725 DUF3810: Protein of u 94.2 0.025 5.5E-07 41.1 1.5 10 108-117 198-207 (318)
16 COG4783 Putative Zn-dependent 94.2 0.056 1.2E-06 41.0 3.2 51 61-117 90-141 (484)
17 PRK03001 M48 family peptidase; 92.8 0.21 4.6E-06 35.7 4.2 63 44-117 68-135 (283)
18 PRK03982 heat shock protein Ht 92.5 0.25 5.4E-06 35.4 4.3 64 44-118 69-137 (288)
19 PF06114 DUF955: Domain of unk 92.3 0.17 3.6E-06 30.6 2.9 30 89-122 29-58 (122)
20 PF01435 Peptidase_M48: Peptid 92.3 0.13 2.9E-06 34.9 2.6 71 44-123 32-106 (226)
21 PRK04351 hypothetical protein; 92.0 0.12 2.5E-06 33.5 2.0 16 103-118 58-73 (149)
22 PF01447 Peptidase_M4: Thermol 91.3 0.88 1.9E-05 29.5 5.5 73 37-117 68-146 (150)
23 PRK01345 heat shock protein Ht 91.3 0.4 8.6E-06 34.9 4.2 64 44-118 68-136 (317)
24 PF13699 DUF4157: Domain of un 91.2 0.18 4E-06 29.0 2.0 21 48-70 6-26 (79)
25 cd04269 ZnMc_adamalysin_II_lik 91.2 1.5 3.3E-05 29.2 6.8 14 105-118 130-143 (194)
26 COG1451 Predicted metal-depend 90.9 0.59 1.3E-05 32.3 4.6 71 40-121 120-190 (223)
27 cd04272 ZnMc_salivary_gland_MP 90.6 1.2 2.6E-05 30.5 5.9 12 107-118 146-157 (220)
28 PF10026 DUF2268: Predicted Zn 90.5 1.7 3.7E-05 29.3 6.5 18 104-121 63-80 (195)
29 PRK04897 heat shock protein Ht 90.4 0.5 1.1E-05 34.1 4.0 64 44-118 81-149 (298)
30 PRK02391 heat shock protein Ht 90.3 0.51 1.1E-05 34.0 4.0 67 44-121 77-148 (296)
31 PHA02456 zinc metallopeptidase 90.2 0.26 5.6E-06 30.2 2.1 15 105-119 78-92 (141)
32 cd04279 ZnMc_MMP_like_1 Zinc-d 89.7 2.9 6.3E-05 26.9 6.9 37 24-60 2-41 (156)
33 PF10263 SprT-like: SprT-like 89.5 0.41 8.9E-06 30.8 2.8 18 103-120 57-74 (157)
34 PF01421 Reprolysin: Reprolysi 88.9 1.2 2.5E-05 29.9 4.8 14 104-117 129-142 (199)
35 PF13574 Reprolysin_2: Metallo 88.8 0.33 7.2E-06 32.1 2.0 12 107-118 112-123 (173)
36 PRK01265 heat shock protein Ht 88.6 0.83 1.8E-05 33.4 4.1 63 44-117 84-151 (324)
37 PRK05457 heat shock protein Ht 88.1 1 2.2E-05 32.3 4.2 68 45-121 79-149 (284)
38 PF10460 Peptidase_M30: Peptid 86.6 5.8 0.00013 29.6 7.4 15 104-118 137-151 (366)
39 PF08325 WLM: WLM domain; Int 86.4 0.61 1.3E-05 31.3 2.2 21 101-121 77-97 (186)
40 COG3091 SprT Zn-dependent meta 86.3 1.1 2.5E-05 28.9 3.3 14 103-116 58-71 (156)
41 PRK02870 heat shock protein Ht 85.9 1.7 3.6E-05 32.0 4.4 64 44-117 116-184 (336)
42 PRK03072 heat shock protein Ht 85.2 1.7 3.7E-05 31.2 4.1 63 44-117 71-138 (288)
43 KOG2661|consensus 84.6 0.64 1.4E-05 34.1 1.7 18 103-120 272-289 (424)
44 KOG2719|consensus 84.4 1.5 3.3E-05 33.2 3.6 68 46-120 220-294 (428)
45 KOG3607|consensus 83.3 5.5 0.00012 32.4 6.5 88 25-117 242-334 (716)
46 PF13688 Reprolysin_5: Metallo 81.4 1.3 2.8E-05 29.5 2.2 15 104-118 140-154 (196)
47 COG0501 HtpX Zn-dependent prot 80.7 1.2 2.6E-05 31.7 1.9 20 104-123 155-174 (302)
48 COG4324 Predicted aminopeptida 80.7 1.2 2.6E-05 31.7 1.8 17 104-120 195-211 (376)
49 PF05569 Peptidase_M56: BlaR1 77.6 1.8 4E-05 30.9 2.1 14 104-117 194-207 (299)
50 PF10023 DUF2265: Predicted am 77.6 1.6 3.4E-05 32.1 1.7 29 29-57 39-67 (337)
51 cd04281 ZnMc_BMP1_TLD Zinc-dep 77.1 9.4 0.0002 26.0 5.2 69 42-117 28-98 (200)
52 PF04450 BSP: Peptidase of pla 76.2 10 0.00022 26.0 5.2 19 103-121 93-111 (205)
53 PF09768 Peptidase_M76: Peptid 74.8 12 0.00025 25.0 5.1 63 44-118 17-83 (173)
54 PF01431 Peptidase_M13: Peptid 72.1 2.5 5.4E-05 28.5 1.5 25 96-120 26-50 (206)
55 COG2856 Predicted Zn peptidase 71.8 6.7 0.00015 27.0 3.5 15 108-122 74-88 (213)
56 PF01400 Astacin: Astacin (Pep 71.3 17 0.00037 24.4 5.4 68 41-117 20-90 (191)
57 PF13582 Reprolysin_3: Metallo 71.3 2.8 6.1E-05 25.6 1.5 10 108-117 109-118 (124)
58 PF13265 DUF4056: Protein of u 70.4 2.2 4.7E-05 30.1 0.9 27 28-54 73-99 (270)
59 PF14891 Peptidase_M91: Effect 70.3 3.6 7.8E-05 27.2 1.9 13 107-119 104-116 (174)
60 cd04270 ZnMc_TACE_like Zinc-de 67.2 4.1 9E-05 28.5 1.8 12 107-118 168-179 (244)
61 KOG3314|consensus 66.2 36 0.00078 22.6 5.8 15 104-118 89-103 (194)
62 PF08219 TOM13: Outer membrane 65.6 5.2 0.00011 22.7 1.7 12 107-118 53-64 (77)
63 KOG3714|consensus 64.9 17 0.00036 27.7 4.7 70 39-117 97-170 (411)
64 cd04267 ZnMc_ADAM_like Zinc-de 63.2 4 8.7E-05 27.1 1.1 12 106-117 133-144 (192)
65 PF15641 Tox-MPTase5: Metallop 63.0 13 0.00028 21.8 3.0 54 59-121 24-80 (109)
66 COG4900 Predicted metallopepti 62.7 4.7 0.0001 24.8 1.2 10 108-117 82-91 (133)
67 COG4784 Putative Zn-dependent 60.3 7.8 0.00017 28.9 2.2 52 59-116 82-134 (479)
68 PF00413 Peptidase_M10: Matrix 58.9 7.9 0.00017 24.5 1.9 34 25-58 6-40 (154)
69 cd04268 ZnMc_MMP_like Zinc-dep 58.4 11 0.00023 24.2 2.4 21 40-60 15-35 (165)
70 COG3590 PepO Predicted metallo 58.2 2 4.3E-05 33.8 -1.1 26 93-118 474-499 (654)
71 cd04280 ZnMc_astacin_like Zinc 57.4 39 0.00085 22.4 5.1 68 40-117 15-85 (180)
72 PF14521 Aspzincin_M35: Lysine 57.1 8.2 0.00018 24.8 1.7 10 107-116 97-106 (148)
73 PF06262 DUF1025: Possibl zinc 55.1 14 0.00031 22.1 2.4 35 81-117 50-84 (97)
74 PF09471 Peptidase_M64: IgA Pe 54.3 9.8 0.00021 27.1 1.8 14 107-120 217-230 (264)
75 PF02031 Peptidase_M7: Strepto 54.2 12 0.00027 23.6 2.0 14 104-117 75-88 (132)
76 PF11950 DUF3467: Protein of u 53.9 31 0.00068 20.2 3.7 32 23-57 57-88 (92)
77 cd04271 ZnMc_ADAM_fungal Zinc- 53.6 5.1 0.00011 27.8 0.3 10 108-117 147-156 (228)
78 PF05572 Peptidase_M43: Pregna 52.9 9.7 0.00021 24.7 1.5 14 107-121 70-83 (154)
79 COG3227 LasB Zinc metalloprote 51.8 6.5 0.00014 30.3 0.7 26 37-63 270-295 (507)
80 PF13583 Reprolysin_4: Metallo 51.8 9.2 0.0002 26.0 1.4 11 108-118 139-149 (206)
81 COG4219 MecR1 Antirepressor re 51.1 4.9 0.00011 29.4 -0.0 29 89-117 173-201 (337)
82 PF04228 Zn_peptidase: Putativ 50.6 13 0.00027 27.0 1.9 12 106-117 170-181 (292)
83 PF12315 DUF3633: Protein of u 49.8 14 0.00031 25.3 2.0 13 107-119 94-106 (212)
84 cd00203 ZnMc Zinc-dependent me 49.6 9.8 0.00021 24.4 1.2 23 38-60 20-42 (167)
85 COG4758 Predicted membrane pro 48.7 7.3 0.00016 27.1 0.5 19 113-131 115-136 (235)
86 PF07998 Peptidase_M54: Peptid 48.6 20 0.00043 24.4 2.5 19 100-118 139-157 (194)
87 PF13402 M60-like: Peptidase M 48.6 33 0.00071 24.5 3.8 25 37-61 144-168 (307)
88 PF10186 Atg14: UV radiation r 47.6 31 0.00066 24.4 3.6 27 37-63 196-222 (302)
89 PF12388 Peptidase_M57: Dual-a 47.5 16 0.00035 25.2 2.0 36 22-58 37-73 (211)
90 PF04036 DUF372: Domain of unk 47.0 12 0.00027 18.2 1.0 16 112-127 12-27 (38)
91 PF13203 DUF2201_N: Putative m 44.6 32 0.00069 24.5 3.3 22 105-126 59-80 (292)
92 PRK13267 archaemetzincin-like 43.0 33 0.00071 23.0 2.9 16 102-117 121-136 (179)
93 PF13374 TPR_10: Tetratricopep 42.4 38 0.00082 15.6 2.6 23 37-59 17-39 (42)
94 PF06167 Peptidase_M90: Glucos 40.6 19 0.0004 25.5 1.5 11 108-118 156-166 (253)
95 COG3824 Predicted Zn-dependent 39.9 17 0.00036 22.8 1.0 14 104-117 107-120 (136)
96 cd04273 ZnMc_ADAMTS_like Zinc- 39.9 4.6 0.0001 27.3 -1.5 11 107-117 141-151 (207)
97 cd04278 ZnMc_MMP Zinc-dependen 38.0 18 0.0004 23.2 1.1 23 38-60 20-42 (157)
98 cd04277 ZnMc_serralysin_like Z 37.9 21 0.00046 23.6 1.4 21 40-60 34-54 (186)
99 COG3228 Uncharacterized protei 37.1 16 0.00035 25.7 0.7 10 108-117 149-158 (266)
100 cd04276 ZnMc_MMP_like_2 Zinc-d 36.6 32 0.00069 23.4 2.1 30 25-57 9-38 (197)
101 smart00235 ZnMc Zinc-dependent 35.8 21 0.00045 22.3 1.1 20 41-60 24-43 (140)
102 cd06007 R3H_DEXH_helicase R3H 35.7 41 0.00088 18.1 2.0 11 107-117 29-39 (59)
103 PF01457 Peptidase_M8: Leishma 35.3 32 0.0007 27.0 2.2 15 103-117 207-221 (521)
104 PRK15410 DgsA anti-repressor M 34.8 20 0.00044 25.5 1.0 11 108-118 144-154 (260)
105 cd06160 S2P-M50_like_2 Unchara 33.0 40 0.00086 22.6 2.1 13 107-119 42-54 (183)
106 PRK15245 type III effector pho 32.8 1.1E+02 0.0023 21.3 4.0 60 22-89 151-215 (241)
107 PF10463 Peptidase_U49: Peptid 32.5 37 0.0008 23.3 1.9 18 107-124 102-119 (206)
108 PHA00527 hypothetical protein 32.2 45 0.00097 20.3 2.0 14 107-120 74-87 (129)
109 cd02640 R3H_NRF R3H domain of 31.9 50 0.0011 17.8 2.0 11 107-117 30-40 (60)
110 cd06159 S2P-M50_PDZ_Arch Uncha 31.3 42 0.00091 23.9 2.1 13 107-119 119-131 (263)
111 COG4101 Predicted mannose-6-ph 31.2 9.5 0.00021 23.9 -1.0 9 115-123 78-86 (142)
112 cd04327 ZnMc_MMP_like_3 Zinc-d 30.9 39 0.00085 22.7 1.8 36 22-60 5-40 (198)
113 PF13398 Peptidase_M50B: Pepti 30.8 42 0.00092 22.7 2.0 16 104-119 20-35 (200)
114 cd06164 S2P-M50_SpoIVFB_CBS Sp 30.4 40 0.00086 23.4 1.8 13 107-119 54-66 (227)
115 cd06162 S2P-M50_PDZ_SREBP Ster 29.7 41 0.00089 24.2 1.8 13 107-119 136-148 (277)
116 PF12652 CotJB: CotJB protein; 29.6 96 0.0021 17.7 3.0 21 39-60 36-56 (78)
117 PF08334 T2SG: Type II secreti 29.6 87 0.0019 18.7 3.1 25 37-62 13-37 (108)
118 PF09865 DUF2092: Predicted pe 29.5 70 0.0015 22.1 2.9 43 22-64 69-111 (214)
119 cd06161 S2P-M50_SpoIVFB SpoIVF 28.9 44 0.00095 22.7 1.8 13 107-119 39-51 (208)
120 KOG3658|consensus 28.8 14 0.0003 29.9 -0.7 11 108-118 394-404 (764)
121 PF03929 PepSY_TM: PepSY-assoc 28.3 10 0.00023 17.0 -0.9 9 116-124 5-13 (27)
122 PF13174 TPR_6: Tetratricopept 26.4 69 0.0015 13.7 2.2 20 42-62 13-32 (33)
123 COG5504 Predicted Zn-dependent 26.3 2.1E+02 0.0045 20.6 4.7 14 104-117 138-151 (280)
124 PF14475 Mso1_Sec1_bdg: Sec1-b 25.4 88 0.0019 15.6 2.1 17 44-61 22-38 (41)
125 PF03536 VRP3: Salmonella viru 25.3 1.5E+02 0.0033 20.6 3.8 61 22-89 150-214 (240)
126 PTZ00337 surface protease GP63 24.9 59 0.0013 26.0 2.1 15 43-57 133-147 (567)
127 cd04283 ZnMc_hatching_enzyme Z 24.8 45 0.00097 22.4 1.3 20 41-60 18-37 (182)
128 COG4865 Glutamate mutase epsil 24.2 41 0.0009 25.2 1.1 30 43-73 195-227 (485)
129 cd02641 R3H_Smubp-2_like R3H d 24.1 70 0.0015 17.1 1.7 13 107-120 30-42 (60)
130 PF00903 Glyoxalase: Glyoxalas 24.0 63 0.0014 18.8 1.8 14 45-58 12-25 (128)
131 PTZ00257 Glycoprotein GP63 (le 23.5 64 0.0014 26.1 2.0 13 105-117 255-267 (622)
132 COG1164 Oligoendopeptidase F [ 23.5 74 0.0016 25.6 2.4 11 107-117 381-391 (598)
133 COG2764 PhnB Uncharacterized p 23.4 68 0.0015 20.5 1.8 14 44-57 11-24 (136)
134 COG2321 Predicted metalloprote 23.3 51 0.0011 23.8 1.4 11 107-117 168-178 (295)
135 PF10994 DUF2817: Protein of u 23.2 2.3E+02 0.0051 21.1 4.8 15 111-125 237-251 (341)
136 TIGR00388 glyQ glycyl-tRNA syn 22.7 1.2E+02 0.0026 21.9 3.0 54 55-119 227-280 (293)
137 PF09349 OHCU_decarbox: OHCU d 22.6 1.3E+02 0.0027 19.6 3.0 22 39-60 93-114 (159)
138 cd06588 PhnB_like Escherichia 22.3 75 0.0016 19.3 1.9 14 44-57 10-23 (128)
139 KOG3624|consensus 21.8 89 0.0019 25.4 2.6 19 100-118 512-530 (687)
140 PF02633 Creatininase: Creatin 21.6 2.1E+02 0.0045 19.7 4.1 59 45-123 42-104 (237)
141 cd02639 R3H_RRM R3H domain of 21.5 85 0.0018 16.9 1.7 12 107-118 30-41 (60)
142 PRK10148 hypothetical protein; 20.6 84 0.0018 20.0 1.9 14 44-57 12-25 (147)
143 cd04282 ZnMc_meprin Zinc-depen 20.2 59 0.0013 22.7 1.1 69 40-117 61-131 (230)
No 1
>KOG1046|consensus
Probab=100.00 E-value=3.9e-42 Score=270.84 Aligned_cols=131 Identities=68% Similarity=1.141 Sum_probs=127.9
Q ss_pred CCceeeeeEEecceEeeeeecCCeEEEEEecCCccchHHHHHHHHHHHHHHHHHhhccccCCCcccEEEecCCCCcchhh
Q psy6925 1 MSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGAMEN 80 (131)
Q Consensus 1 ms~y~~~~~vg~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~me~ 80 (131)
||||++||+||+|...+..+.+|+++++|++|+...+.+++++.+++.+++|+++||+|||.+|+|+|++|++..|+|||
T Consensus 219 MstYLvAf~V~~f~~~e~~~~~~v~vrv~a~p~~~~~~~~al~~~~~~L~~~e~~f~i~yPLpK~D~iavPdf~~GAMEN 298 (882)
T KOG1046|consen 219 MSTYLVAFAVGDFVYVETITKSGVPVRVYARPEKINQGQFALEVATKVLEFYEDYFGIPYPLPKLDLVAVPDFSAGAMEN 298 (882)
T ss_pred CchhhheeeeeccccceeecCCCceEEEEeChHHhhHHHHHHHHHHHHHHHHHHHhCCCCCCccccEEecCCccccchhc
Confidence 89999999999999998888889999999999999999999999999999999999999999999999999999999999
Q ss_pred hchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHhhcCccCccccCCC
Q psy6925 81 WGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHL 131 (131)
Q Consensus 81 ~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqWfG~~vt~~~W~d~ 131 (131)
+|+++|++..+|+++..++..++++++.+||||+|||||||+||++||+||
T Consensus 299 wGLvtyre~~lL~~~~~ss~~~k~~va~vIaHElAHQWFGNLVTm~wW~dL 349 (882)
T KOG1046|consen 299 WGLVTYRETALLYDPQTSSSSNKQRVAEVIAHELAHQWFGNLVTMKWWNDL 349 (882)
T ss_pred CcceeeeehhhccCCCcCcHHHHHHHHHHHHHHHHHHHhcCcccHhhhhhh
Confidence 999999999999999999999999999999999999999999999999986
No 2
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=100.00 E-value=1.3e-38 Score=249.56 Aligned_cols=131 Identities=34% Similarity=0.594 Sum_probs=120.3
Q ss_pred CCceeeeeEEecceEeeee----ecCCeEEEEEecCCccchHHHHHHHHHHHHHHHHHhhccccCCCcccEEEecCCCCc
Q psy6925 1 MSTYLVAVVVGEFDYVEET----SSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCG 76 (131)
Q Consensus 1 ms~y~~~~~vg~~~~~~~~----~~~~~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~ 76 (131)
||+|+|||++|+|+++++. .++++++++|++|+..+..+++++.+++++++||++||.|||++|+++|++|++..|
T Consensus 174 mptYLfA~vaGdf~~~~~~~~t~sg~~v~l~iy~~p~~~~~~~~al~~~~~~L~~~E~~fG~pYPl~k~diVavpdf~~G 253 (863)
T TIGR02414 174 KPSYLFALVAGDLDVLEDTFTTKSGREVALRVYVEEGNKDKCDHAMESLKKAMKWDEEVFGLEYDLDIFMIVAVDDFNMG 253 (863)
T ss_pred cChhHheEEEeCCEEEEEEeeccCCCceEEEEEEccCcHHHHHHHHHHHHHHHHHHHHHhCCCCChhhccEEecCCCCCc
Confidence 7999999999999998753 234689999999998888899999999999999999999999999999999999999
Q ss_pred chhhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHhhcCccCccccCCC
Q psy6925 77 AMENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHL 131 (131)
Q Consensus 77 ~me~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqWfG~~vt~~~W~d~ 131 (131)
||||+|+++|++..++.++...+...+..+..+++||+|||||||+||++||+|+
T Consensus 254 aMEN~GLi~f~e~~lL~~~~~~td~~~~~i~~VIaHElaHqWfGNlVT~~~W~~L 308 (863)
T TIGR02414 254 AMENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWFQL 308 (863)
T ss_pred cccccceeccccceEEeCCCCCCHHHHHHHHHHHHHHHHHHHhcceeeecchhhh
Confidence 9999999999999998888776777788888999999999999999999999875
No 3
>PRK14015 pepN aminopeptidase N; Provisional
Probab=100.00 E-value=3.7e-38 Score=247.42 Aligned_cols=131 Identities=33% Similarity=0.565 Sum_probs=119.3
Q ss_pred CCceeeeeEEecceEeeee----ecCCeEEEEEecCCccchHHHHHHHHHHHHHHHHHhhccccCCCcccEEEecCCCCc
Q psy6925 1 MSTYLVAVVVGEFDYVEET----SSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCG 76 (131)
Q Consensus 1 ms~y~~~~~vg~~~~~~~~----~~~~~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~ 76 (131)
||+||++|++|+|+++++. .++++++++|++|+..+.++.+++.++++|++||++||.|||++++++|++|++..|
T Consensus 187 mpsYL~Al~aGdf~~~~d~~~~~~g~~vpl~iy~~p~~~~~~~~al~~~~~~L~~~E~~FG~pYP~~k~diVavp~f~~G 266 (875)
T PRK14015 187 KPSYLFALVAGDLDVLEDTFTTRSGREVALEIYVEPGNLDKCDHAMDSLKKSMKWDEERFGLEYDLDIFMIVAVDDFNMG 266 (875)
T ss_pred cccceEEEEEeCCEEEEEEeeccCCCeEEEEEEEeCCcHHHHHHHHHHHHHHHHHHHHHhCCCCChhhhCEEeCCCCCCc
Confidence 7999999999999998753 224599999999999888899999999999999999998899999999999999999
Q ss_pred chhhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHhhcCccCccccCCC
Q psy6925 77 AMENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHL 131 (131)
Q Consensus 77 ~me~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqWfG~~vt~~~W~d~ 131 (131)
||||+|+++|++..++.+++..+..++..+..+|+||+|||||||+||++||+|+
T Consensus 267 aMEN~Gl~~f~~~~lL~~~~~~t~~~~~~i~~vIaHElaHqWFGNlVT~~~W~dL 321 (875)
T PRK14015 267 AMENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWFQL 321 (875)
T ss_pred ccccccccccccceEecCcccCCHHHHHHHHHHHHHHHHHHHHhCcceecchhhh
Confidence 9999999999999888887766666777788999999999999999999999885
No 4
>PF01433 Peptidase_M1: Peptidase family M1 This is family M1 in the peptidase classification.; InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=100.00 E-value=2.2e-38 Score=231.82 Aligned_cols=131 Identities=56% Similarity=1.009 Sum_probs=115.2
Q ss_pred CCceeeeeEEecceEeeeeecCCeEEEEEecCCccchHHHHHHHHHHHHHHHHHhhccccCCCcccEEEecCCCCcchhh
Q psy6925 1 MSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGAMEN 80 (131)
Q Consensus 1 ms~y~~~~~vg~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~me~ 80 (131)
||+|++||++|+|...+....+++++++|++|+..+..+.+++.+++++++|+++||+|||++|+++|++|++..+|||+
T Consensus 190 ~~~yl~a~~vg~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~yp~~k~~~v~~p~~~~~~me~ 269 (390)
T PF01433_consen 190 MPTYLFAFAVGDFESVEVTTKSGVPVRVYARPGDEEQLQFALDIAPKALEYYEEYFGIPYPFKKLDIVAVPDFPFGGMEN 269 (390)
T ss_dssp EEGGG--EEEESEEEEEEETTTEEEEEEEEECTCGGGHHHHHHHHHHHHHHHHHHHTS--SSSEEEEEEEST-SSSEE--
T ss_pred cCchhhhhhcCcccccccccccccchheeehhhhHHHHHHHHHhhHHHHHHHHhhccccceecceeEEEEeccccccccc
Confidence 78999999999999987766556999999999999999999999999999999999999999999999999999999999
Q ss_pred hchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHhhcCccCccccCCC
Q psy6925 81 WGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHL 131 (131)
Q Consensus 81 ~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqWfG~~vt~~~W~d~ 131 (131)
+|+++++++.++++++..+...+..+..+||||+|||||||+||+++|+|+
T Consensus 270 ~g~i~~~~~~l~~~~~~~~~~~~~~~~~~iahElahqWfGn~vt~~~w~d~ 320 (390)
T PF01433_consen 270 WGLITYRESYLLYDPDISTIGDKQEIASLIAHELAHQWFGNLVTPKWWSDL 320 (390)
T ss_dssp TTEEEEEGGGTS-STTTS-HHHHHHHHHHHHHHHHTTTBTTTEEESSGGGH
T ss_pred cccccccccccccCcccccchhhhhhHHHHHHHHHHHHhccCCccccchhh
Confidence 999999999999999888888889999999999999999999999999984
No 5
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=100.00 E-value=6.3e-38 Score=246.55 Aligned_cols=128 Identities=37% Similarity=0.675 Sum_probs=113.3
Q ss_pred CCceeeeeEEecceEeeeeecCCeEEEEEecCCccch--HHHHHHHHHHHHHHHHHhhccccCCCcccEEEecCCCCcch
Q psy6925 1 MSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQ--GQFALHVASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGAM 78 (131)
Q Consensus 1 ms~y~~~~~vg~~~~~~~~~~~~~~i~~~~~~~~~~~--~~~~~~~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~m 78 (131)
||||+++|++|+|+.++.. .+|+++++|++|+..+. .+.+++.+++++++|+++||+||||+|+++|++|++..|||
T Consensus 183 mstYL~a~~vG~f~~~~~~-~~gvpi~v~~~~~~~~~~~~~~al~~~~~~l~~~e~~fg~pYP~~k~d~V~vP~f~~GaM 261 (831)
T TIGR02412 183 LSTYLTAVAAGPYHSVQDE-SRSYPLGIYARRSLAQYLDADAIFTITRQGLAFFHRKFGYPYPFKKYDQIFVPEFNAGAM 261 (831)
T ss_pred cccceEEEEEeceEEEeec-CCCEEEEEEECcchhhhhhHHHHHHHHHHHHHHHHHHhCCCCCcccCCEEEcCCCCCCcc
Confidence 8999999999999998764 47899999999987654 56789999999999999999999999999999999999999
Q ss_pred hhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHhhcCccCccccCCC
Q psy6925 79 ENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHL 131 (131)
Q Consensus 79 e~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqWfG~~vt~~~W~d~ 131 (131)
||+|++++++.. +.++. .+...++.+..+++||+|||||||+||++||+|+
T Consensus 262 En~Glit~~e~~-l~~~~-~~~~~~~~~~~viaHElAHqWFGnlVT~~wW~dl 312 (831)
T TIGR02412 262 ENAGCVTFAENF-LHRAE-ATRAEKENRAGVILHEMAHMWFGDLVTMRWWNDL 312 (831)
T ss_pred cccceeeechhh-ccCCc-CCHHHHHHHHHHHHHHHHHHHhCCEeccccccch
Confidence 999999999984 44433 3355667788999999999999999999999985
No 6
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=100.00 E-value=2e-35 Score=232.80 Aligned_cols=130 Identities=48% Similarity=0.950 Sum_probs=121.2
Q ss_pred CCceeeeeEEecceEeeeeec---CCeEEEEEecCCccchHHHHHHHHHHHHHHHHHhhccccCCCcccEEEecCCCCcc
Q psy6925 1 MSTYLVAVVVGEFDYVEETSS---DGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGA 77 (131)
Q Consensus 1 ms~y~~~~~vg~~~~~~~~~~---~~~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~ 77 (131)
||||||++++|+|++.++... +++++++|++++....+++.++.+++++++++++||+|||+++ ++|.+|++..|+
T Consensus 200 mptYL~al~~G~~~~~~~~~~~~~~~v~l~iy~~~g~~~~a~~~~~~~~~~~~~~e~~fg~~y~l~~-~~V~v~~f~~Ga 278 (859)
T COG0308 200 MPTYLFALVAGDLEVFRDKFDTRSRDVPLEIYVPPGVLDRAKYALDETKRSIEFYEEYFGLPYALPI-DIVAVPDFSAGA 278 (859)
T ss_pred cchHhhheeeecceeeeeeeccCCCCeeEEEEecCcchhhhhhhHHHHHHHhhhHHHhcCCCCCCcc-cEEeccCCCCcc
Confidence 899999999999998877642 4899999999987888999999999999999999999999999 899999999999
Q ss_pred hhhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHhhcCccCccccCCC
Q psy6925 78 MENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHL 131 (131)
Q Consensus 78 me~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqWfG~~vt~~~W~d~ 131 (131)
|||+|++++++..++.++...+...++++.++++||+|||||||+||++||+|+
T Consensus 279 MEN~Gl~tf~~~~ll~~~~~at~~~~~~~~~viaHElaHqWfGnlVT~~~W~~l 332 (859)
T COG0308 279 MENWGLVTFREKYLLADPETATDSDYENVEEVIAHELAHQWFGNLVTMKWWDDL 332 (859)
T ss_pred ccccceeEEeeeEEeeCcccchhHHHHHHHHHHHHHHhhhcccceeeccCHHHH
Confidence 999999999999988888788888889999999999999999999999999985
No 7
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=100.00 E-value=2.3e-33 Score=214.26 Aligned_cols=117 Identities=30% Similarity=0.514 Sum_probs=98.2
Q ss_pred CCceeeeeEEecceEeeeeecCCeEEEEEecCCccchHHHHHH-HHHHHHHHHHHhhccccCCCcccEEEe-cCCCCcch
Q psy6925 1 MSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALH-VASKVLPFYKDYFNIAYPLPKIDLVAV-PDFSCGAM 78 (131)
Q Consensus 1 ms~y~~~~~vg~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~-~~~~~l~~~~~~~g~~yP~~~~~~v~~-p~~~~~~m 78 (131)
||+||+||+||+|+..+ .|.++++|++|+..+.++..+. .++++++++|+++| ||||+|+|+|++ |+++.|||
T Consensus 186 mptYLia~avG~~~~~~----~g~~~~v~~~p~~~~~~~~~~~~~~~~~l~~~e~~~~-pYp~~k~d~vvlpp~f~~GgM 260 (601)
T TIGR02411 186 IPAYLIALASGDLASAP----IGPRSSVYSEPEQLEKCQYEFEHDTENFIKTAEDLIF-PYEWGQYDLLVLPPSFPYGGM 260 (601)
T ss_pred cchhhheeeeccceecc----cCCceEEEccchhHHHHHHHHHHhHHHHHHHHHHhCC-CCcCccceEEEecCccccccc
Confidence 89999999999998753 4667999999988777777777 89999999999887 999999999887 68999999
Q ss_pred hhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHhhcCccCccccCCC
Q psy6925 79 ENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHL 131 (131)
Q Consensus 79 e~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqWfG~~vt~~~W~d~ 131 (131)
||+|++ +.+..++.+. . . ...++|||+|||||||+||++||+|+
T Consensus 261 EN~~lt-f~~~~ll~~d-~------s-~~~viaHElAHqWfGNlVT~~~W~d~ 304 (601)
T TIGR02411 261 ENPNLT-FATPTLIAGD-R------S-NVDVIAHELAHSWSGNLVTNCSWEHF 304 (601)
T ss_pred ccccce-eeccccccCC-h------h-hhhhHHHHHHhhccCceeecCCchHH
Confidence 999965 4555455432 1 1 24699999999999999999999984
No 8
>KOG1047|consensus
Probab=99.94 E-value=9.1e-28 Score=176.76 Aligned_cols=117 Identities=31% Similarity=0.507 Sum_probs=100.4
Q ss_pred CCceeeeeEEecceEeeeeecCCeEEEEEecCCccchHHHHHH-HHHHHHHHHHHhhccccCCCcccEEEec-CCCCcch
Q psy6925 1 MSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALH-VASKVLPFYKDYFNIAYPLPKIDLVAVP-DFSCGAM 78 (131)
Q Consensus 1 ms~y~~~~~vg~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~-~~~~~l~~~~~~~g~~yP~~~~~~v~~p-~~~~~~m 78 (131)
+|+|++||++|+....+ -|.+.+||+.|...+.++.-+. .+.++|..-|+.+| ||+|++||++++| +|++|||
T Consensus 195 IP~YLiai~~G~L~s~e----IgpRs~VwaEp~~~~a~~~ef~~~~e~~L~~Ae~l~G-pY~WgryDllvlPpSFP~gGM 269 (613)
T KOG1047|consen 195 IPSYLIAIAVGDLESRE----IGPRSRVWAEPCLLDACQEEFAGETEDFLKAAEKLFG-PYVWGRYDLLVLPPSFPFGGM 269 (613)
T ss_pred chhhhHHHhhccccccc----cCCccceecchhhhHHHHHHHHhhhHHHHHHHHHHcC-CcccccceEEEecCCCCcccc
Confidence 58999999999997653 3667899999999888877666 89999999999999 9999999999885 8999999
Q ss_pred hhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHhhcCccCccccCCC
Q psy6925 79 ENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHL 131 (131)
Q Consensus 79 e~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqWfG~~vt~~~W~d~ 131 (131)
|||.+++...+. |.. +. ....+|||||||.||||+||.++|++.
T Consensus 270 ENPcltF~TpTl-laG-Dr-------sl~~vIaHEIAHSWtGNlVTN~sWehf 313 (613)
T KOG1047|consen 270 ENPCLTFVTPTL-LAG-DR-------SLVDVIAHEIAHSWTGNLVTNASWEHF 313 (613)
T ss_pred cCcceeeecchh-hcC-Cc-------chhhHHHHHhhhhhcccccccCccchh
Confidence 999999988873 332 22 224699999999999999999999873
No 9
>KOG1932|consensus
Probab=99.76 E-value=1.4e-18 Score=136.28 Aligned_cols=126 Identities=17% Similarity=0.242 Sum_probs=106.7
Q ss_pred CCceeeeeEEecceEeeeeecCCeEEEEEecCCccchHHHHHHHHHHHHHHHHHhhccccCCCcccEEEecCCCCcchhh
Q psy6925 1 MSTYLVAVVVGEFDYVEETSSDGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGAMEN 80 (131)
Q Consensus 1 ms~y~~~~~vg~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~me~ 80 (131)
|++..++||||+|+.... +.+++|..+|.|+.+..++...-.+.++++|||+.+|+.|||+.|++|++|.....-+.+
T Consensus 241 vA~~~I~~AiG~F~~~~~--P~~~~i~~f~LP~~~~~v~nt~~~l~k~iefye~~ls~rYPF~~~k~VFvd~~~~~i~~~ 318 (1180)
T KOG1932|consen 241 VAPSNIGFAIGPFKSYVE--PSMIDITHFCLPGLEPLVKNTTVYLHKAIEFYEEELSSRYPFSCYKTVFVDEAAVEISSY 318 (1180)
T ss_pred CCccccceeeccccccCC--CccCcceeEecCcchHHhhhHHHHHHHHHHHHHHHhccCCCcceeeEEEecCCcceeeec
Confidence 467789999999998854 458899999999999999999999999999999999988999999999999766666777
Q ss_pred hchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHhhcCccCccccCCC
Q psy6925 81 WGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGNLVTMEWWTHL 131 (131)
Q Consensus 81 ~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqWfG~~vt~~~W~d~ 131 (131)
.++.+++.+ +|++.+..++ .......++-.+|.||||.++||.+|+|.
T Consensus 319 asl~I~st~-lLy~~~iIDq--~~~tr~~La~aLA~Q~fg~yIsp~~wsD~ 366 (1180)
T KOG1932|consen 319 ASLSIFSTS-LLYSKNIIDQ--TFLTRRKLAWALASQWFGVYISPVDWSDF 366 (1180)
T ss_pred ceeeeeecc-ccchHhhhhH--HHHHHHHHHHHHHHhhhEEEeeccchhhh
Confidence 767666655 7887665443 44455799999999999999999999983
No 10
>COG3975 Predicted protease with the C-terminal PDZ domain [General function prediction only]
Probab=98.10 E-value=3.7e-05 Score=58.09 Aligned_cols=100 Identities=14% Similarity=0.055 Sum_probs=67.9
Q ss_pred CeEEEEEecCCc-cchHHHHHHHHHHHHHHHHHhhccccCCCcccEEEe-cCCCCcchhhhchhhhhccccccCCCCCCH
Q psy6925 23 GVLVRVYTPVGK-REQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAV-PDFSCGAMENWGLVTYREVCLLVDSQNTSA 100 (131)
Q Consensus 23 ~~~i~~~~~~~~-~~~~~~~~~~~~~~l~~~~~~~g~~yP~~~~~~v~~-p~~~~~~me~~g~i~~~~~~~l~~~~~~~~ 100 (131)
|.+.++-..... ..+.+...+.++++++.--+.|| +-||.+|.+++. .+-..||+|+-....+..+... .++.
T Consensus 167 g~ph~~~~~g~~p~~d~~~~~~~~k~ii~~~~~vFg-~~~~~~Y~Fl~~~s~q~~GGlEH~~St~l~~~r~~----~~~~ 241 (558)
T COG3975 167 GAPHTIALRGELPNFDKERLASDTKKIIEAEIKVFG-SAPFDKYVFLLHLSDQIYGGLEHRRSTALIYDRFG----FTDQ 241 (558)
T ss_pred CCceeEEEeeccccccHHHHHHHHHHHHHHHHHHhc-CCCccceEEEEEecCCCCCCceecccccccccccc----ccch
Confidence 344443333332 34566777889999999999999 899999988764 3446679998754444333211 1222
Q ss_pred HHHHHHHHHHHHHHHHHhhcCccCccc
Q psy6925 101 ITRQNIALVVGHELAHQWFGNLVTMEW 127 (131)
Q Consensus 101 ~~~~~~~~~iahEiaHqWfG~~vt~~~ 127 (131)
...+....+++||..|-|=+-.+.|+.
T Consensus 242 ~ky~~~l~llsHEyfH~WNvKrIrpa~ 268 (558)
T COG3975 242 DKYQDLLGLLSHEYFHAWNVKRIRPAA 268 (558)
T ss_pred hHHHHHHHHHHHHHHHhccceeccccc
Confidence 334666789999999999988887764
No 11
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=96.63 E-value=0.0015 Score=40.22 Aligned_cols=21 Identities=52% Similarity=0.930 Sum_probs=17.6
Q ss_pred HHHHHHHHHHHHHHhhcCccC
Q psy6925 104 QNIALVVGHELAHQWFGNLVT 124 (131)
Q Consensus 104 ~~~~~~iahEiaHqWfG~~vt 124 (131)
.....+++||++|+|+++.+.
T Consensus 23 ~~~~~~l~HE~~H~~~~~~~~ 43 (128)
T PF13485_consen 23 DWLDRVLAHELAHQWFGNYFG 43 (128)
T ss_pred HHHHHHHHHHHHHHHHHHHcC
Confidence 344579999999999998885
No 12
>PRK04860 hypothetical protein; Provisional
Probab=96.30 E-value=0.009 Score=39.10 Aligned_cols=72 Identities=19% Similarity=0.199 Sum_probs=39.5
Q ss_pred HHHHHHHHHHHHHHhhccccCCCcccEEEecCCCCcchhhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHhh
Q psy6925 41 ALHVASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGAMENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWF 119 (131)
Q Consensus 41 ~~~~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~me~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqWf 119 (131)
+...+...+..-+++||.|++-++..+-. ....+|+... ....+-++|..-.......+..+|.||+||-|-
T Consensus 5 ~~~~~~~~~~~a~~~f~~~f~~p~~~f~~--R~rtaG~~~l-----~~~~I~~Np~ll~~~~~~~l~~~v~HEl~H~~~ 76 (160)
T PRK04860 5 VMRRLRECLAQANLYFKRTFPEPKVSYTQ--RGTSAGTAWL-----QSNEIRLNPVLLLENQQAFIDEVVPHELAHLLV 76 (160)
T ss_pred HHHHHHHHHHHHHHHhCCCCCCCEEEEee--cchhhcchhH-----hcCCeeeCHHHHhhCcHHHHHhHHHHHHHHHHH
Confidence 44556667777778888888776544322 1222444322 112122222221122345667899999999875
No 13
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=95.33 E-value=0.026 Score=36.32 Aligned_cols=17 Identities=24% Similarity=0.243 Sum_probs=14.4
Q ss_pred HHHHHHHHHHHHHHhhc
Q psy6925 104 QNIALVVGHELAHQWFG 120 (131)
Q Consensus 104 ~~~~~~iahEiaHqWfG 120 (131)
..+..+|.||++|.+..
T Consensus 57 ~~l~~~l~HEm~H~~~~ 73 (146)
T smart00731 57 DRLRETLLHELCHAALY 73 (146)
T ss_pred HHHHhhHHHHHHHHHHH
Confidence 45677999999999985
No 14
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=94.94 E-value=0.09 Score=35.40 Aligned_cols=68 Identities=25% Similarity=0.302 Sum_probs=41.3
Q ss_pred HHHHHHHHHHHHHHhhccccCCCcccEEEecCCCCcchhhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHhh
Q psy6925 41 ALHVASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGAMENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWF 119 (131)
Q Consensus 41 ~~~~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~me~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqWf 119 (131)
+...+...++.+++.+|. +.+++.+=-.- ...|.-..-|.|.++-..+..++ .-+..+++||++|--.
T Consensus 110 ~~~~l~~~~~~~~~~~~~--~~~~i~ir~~k-srWGsc~~~~~I~ln~~L~~~P~--------~~idYVvvHEL~Hl~~ 177 (205)
T PF01863_consen 110 AKEYLPERLKKYAKKLGL--PPPKIKIRDMK-SRWGSCSSKGNITLNWRLVMAPP--------EVIDYVVVHELCHLRH 177 (205)
T ss_pred HHHHHHHHHHHHHHHcCC--CcceEEEeehh-hccccCCCCCcEEeecccccCCc--------cHHHHHHHHHHHHhcc
Confidence 345566777778888885 33443332222 23454455667777766554443 3456799999999743
No 15
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=94.23 E-value=0.025 Score=41.07 Aligned_cols=10 Identities=60% Similarity=0.926 Sum_probs=9.3
Q ss_pred HHHHHHHHHH
Q psy6925 108 LVVGHELAHQ 117 (131)
Q Consensus 108 ~~iahEiaHq 117 (131)
-+++||+|||
T Consensus 198 ~T~~HElAHq 207 (318)
T PF12725_consen 198 FTICHELAHQ 207 (318)
T ss_pred HHHHHHHHHH
Confidence 5999999999
No 16
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=94.19 E-value=0.056 Score=41.04 Aligned_cols=51 Identities=25% Similarity=0.268 Sum_probs=32.8
Q ss_pred CCCcccEEEecCCCCcchhhhc-hhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHH
Q psy6925 61 PLPKIDLVAVPDFSCGAMENWG-LVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQ 117 (131)
Q Consensus 61 P~~~~~~v~~p~~~~~~me~~g-~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHq 117 (131)
|-..++++.|.+-..++...+| .|.++...++..++ ...+..|||||++|-
T Consensus 90 ~~~~f~f~lV~d~~iNAFA~~Gg~v~vntGLll~ae~------esElagViAHEigHv 141 (484)
T COG4783 90 VKTPFTFFLVNDDSINAFATPGGYVVVNTGLLLTAEN------ESELAGVIAHEIGHV 141 (484)
T ss_pred CCCCeEEEEecCCccchhhcCCceEEEehHHHHhcCC------HHHHHHHHHHHHHHH
Confidence 3445678888766677777765 44445444443322 245678999999994
No 17
>PRK03001 M48 family peptidase; Provisional
Probab=92.78 E-value=0.21 Score=35.65 Aligned_cols=63 Identities=19% Similarity=0.202 Sum_probs=33.4
Q ss_pred HHHHHHHHHHHhhccccCCCcccEEEecCCCCcchhh-----hchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHH
Q psy6925 44 VASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGAMEN-----WGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQ 117 (131)
Q Consensus 44 ~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~me~-----~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHq 117 (131)
.+.+.++-+.+..|.|. +++ .++++...++... .+.|.+.+. ++ +. . +.+.+..++|||++|-
T Consensus 68 ~L~~~v~~la~~~g~~~--p~v--~v~~~~~~NAfa~G~~~~~~~Ivvt~g-Ll-~~-l----~~~El~aVlAHElgHi 135 (283)
T PRK03001 68 QFYRMVRELAQRAGLPM--PKV--YLINEDQPNAFATGRNPEHAAVAATTG-IL-RV-L----SEREIRGVMAHELAHV 135 (283)
T ss_pred HHHHHHHHHHHHcCCCC--CeE--EEecCCCcceEEecCCCCCeEEEecHH-HH-hh-C----CHHHHHHHHHHHHHHH
Confidence 45677788888888654 443 3334322222221 122333333 11 11 1 2356778999999994
No 18
>PRK03982 heat shock protein HtpX; Provisional
Probab=92.54 E-value=0.25 Score=35.37 Aligned_cols=64 Identities=23% Similarity=0.308 Sum_probs=33.9
Q ss_pred HHHHHHHHHHHhhccccCCCcccEEEecCCCCcchhh-----hchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHh
Q psy6925 44 VASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGAMEN-----WGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQW 118 (131)
Q Consensus 44 ~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~me~-----~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqW 118 (131)
.+.+.++-+.+..|.|- +++- ++++...+++.. -+.+.+.+. ++ +. . +.+.+..++|||++|-=
T Consensus 69 ~L~~~v~~la~~~g~~~--p~v~--v~~~~~~NAfa~G~~~~~~~V~vt~g-Ll-~~-l----~~~El~AVlAHElgHi~ 137 (288)
T PRK03982 69 ELYRIVERLAERANIPK--PKVA--IVPTQTPNAFATGRDPKHAVVAVTEG-IL-NL-L----NEDELEGVIAHELTHIK 137 (288)
T ss_pred HHHHHHHHHHHHcCCCC--CeEE--EEeCCCcceEEeccCCCCeEEEeehH-HH-hh-C----CHHHHHHHHHHHHHHHH
Confidence 45677777778887553 4433 334322233322 223333333 21 11 1 23567789999999953
No 19
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=92.32 E-value=0.17 Score=30.56 Aligned_cols=30 Identities=17% Similarity=0.261 Sum_probs=18.2
Q ss_pred cccccCCCCCCHHHHHHHHHHHHHHHHHHhhcCc
Q psy6925 89 VCLLVDSQNTSAITRQNIALVVGHELAHQWFGNL 122 (131)
Q Consensus 89 ~~~l~~~~~~~~~~~~~~~~~iahEiaHqWfG~~ 122 (131)
..++++++.+. . +..-+++||++|.+++..
T Consensus 29 ~~I~in~~~~~-~---~~~f~laHELgH~~~~~~ 58 (122)
T PF06114_consen 29 PIIFINSNLSP-E---RQRFTLAHELGHILLHHG 58 (122)
T ss_dssp TEEEEESSS-H-H---HHHHHHHHHHHHHHHHH-
T ss_pred CEEEECCCCCH-H---HHHHHHHHHHHHHHhhhc
Confidence 33555555322 2 223599999999998754
No 20
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=92.29 E-value=0.13 Score=34.87 Aligned_cols=71 Identities=18% Similarity=0.278 Sum_probs=33.9
Q ss_pred HHHHHHHHHHHhhccccCCCcccEEEecCCCCcchhhhc----hhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHhh
Q psy6925 44 VASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGAMENWG----LVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWF 119 (131)
Q Consensus 44 ~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~me~~g----~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqWf 119 (131)
.+.+.++-+.+..|.+.|-++ +.++++...++...++ .|.+... ++.. .....+..+++||++|---
T Consensus 32 ~L~~~v~~l~~~~~~~~~~~~--v~v~~~~~~NA~~~g~~~~~~I~v~~~-ll~~------~~~~el~aVlaHElgH~~~ 102 (226)
T PF01435_consen 32 ELRRIVEELARRAGLGIPPPR--VYVIDSPSPNAFATGGGPRKRIVVTSG-LLES------LSEDELAAVLAHELGHIKH 102 (226)
T ss_dssp HHHHHHHHHHHHHHCTSS--E--EEEE--SSEEEEEETTTC--EEEEEHH-HHHH------SSHHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHhcCCCCCCe--EEEEcCCCCcEEEEccCCCcEEEEeCh-hhhc------ccHHHHHHHHHHHHHHHHc
Confidence 344555555555554555544 4444432233332222 1333333 2211 1124567899999999876
Q ss_pred cCcc
Q psy6925 120 GNLV 123 (131)
Q Consensus 120 G~~v 123 (131)
++..
T Consensus 103 ~h~~ 106 (226)
T PF01435_consen 103 RHIL 106 (226)
T ss_dssp THCC
T ss_pred CCcc
Confidence 6543
No 21
>PRK04351 hypothetical protein; Provisional
Probab=92.03 E-value=0.12 Score=33.50 Aligned_cols=16 Identities=25% Similarity=0.432 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHHHHHh
Q psy6925 103 RQNIALVVGHELAHQW 118 (131)
Q Consensus 103 ~~~~~~~iahEiaHqW 118 (131)
...+..+|.||++|-.
T Consensus 58 ~~~l~~vv~HElcH~~ 73 (149)
T PRK04351 58 LEELIGIIKHELCHYH 73 (149)
T ss_pred HHHHHhhHHHHHHHHH
Confidence 4667789999999954
No 22
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=91.35 E-value=0.88 Score=29.47 Aligned_cols=73 Identities=22% Similarity=0.239 Sum_probs=37.6
Q ss_pred hHHHHHHHHHHHHHHHHHhhccccCCCc--ccEEEecCCC---Ccchhhhchhhhhcccc-ccCCCCCCHHHHHHHHHHH
Q psy6925 37 QGQFALHVASKVLPFYKDYFNIAYPLPK--IDLVAVPDFS---CGAMENWGLVTYREVCL-LVDSQNTSAITRQNIALVV 110 (131)
Q Consensus 37 ~~~~~~~~~~~~l~~~~~~~g~~yP~~~--~~~v~~p~~~---~~~me~~g~i~~~~~~~-l~~~~~~~~~~~~~~~~~i 110 (131)
.+..+...+.++.++|.+.|| .=++.. ..++..-.++ ..+.-+-..+.|++..- ...+ ......++
T Consensus 68 ~~vdA~~~~~~v~d~y~~~~g-r~siD~~G~~~~~~Vhyg~~~~NAfW~g~~m~yGdG~~~~f~~-------~~~~lDVv 139 (150)
T PF01447_consen 68 AAVDAHYNAGKVYDYYKNVFG-RNSIDGNGMPIISRVHYGKNYNNAFWNGSQMVYGDGDGQIFKP-------FASSLDVV 139 (150)
T ss_dssp HHHHHHHHHHHHHHHHHHHHS-S-STTSS-S-EEEEESESSSTT-EEE-SSSEEEE---SSSBS--------GGG-HHHH
T ss_pred HHHHHHHhHHHHHHHHHHHHC-CCCcCCCCcEEEEEEeCCCCccCccccCCEEEEECCCCccccc-------Ccccccee
Confidence 345667788999999999999 777762 2333221111 12333333444443321 1111 11224699
Q ss_pred HHHHHHH
Q psy6925 111 GHELAHQ 117 (131)
Q Consensus 111 ahEiaHq 117 (131)
|||++|-
T Consensus 140 aHEltHG 146 (150)
T PF01447_consen 140 AHELTHG 146 (150)
T ss_dssp HHHHHHH
T ss_pred eeccccc
Confidence 9999995
No 23
>PRK01345 heat shock protein HtpX; Provisional
Probab=91.28 E-value=0.4 Score=34.90 Aligned_cols=64 Identities=20% Similarity=0.193 Sum_probs=34.2
Q ss_pred HHHHHHHHHHHhhccccCCCcccEEEecCCCCcchhh-----hchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHh
Q psy6925 44 VASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGAMEN-----WGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQW 118 (131)
Q Consensus 44 ~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~me~-----~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqW 118 (131)
.+.+.++-+.+..|.| -+++- ++++...++... -+.|.+.+. ++ +. . +.+.+..++|||++|.=
T Consensus 68 ~L~~~v~~La~~agi~--~p~v~--vid~~~~NAFa~G~~~~~~~V~vt~g-LL-~~-L----~~dEL~aVlAHElgHi~ 136 (317)
T PRK01345 68 ELYRMVRDLARRAGLP--MPKVY--IIDNPQPNAFATGRNPENAAVAATTG-LL-QR-L----SPEEVAGVMAHELAHVK 136 (317)
T ss_pred HHHHHHHHHHHHcCCC--CCcEE--EEcCCCcceEEecCCCCCeEEEechH-HH-hh-C----CHHHHHHHHHHHHHHHH
Confidence 3567778888888865 34543 344322222222 123433333 22 11 1 22567789999999963
No 24
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=91.22 E-value=0.18 Score=28.96 Aligned_cols=21 Identities=10% Similarity=0.127 Sum_probs=14.0
Q ss_pred HHHHHHHhhccccCCCcccEEEe
Q psy6925 48 VLPFYKDYFNIAYPLPKIDLVAV 70 (131)
Q Consensus 48 ~l~~~~~~~g~~yP~~~~~~v~~ 70 (131)
+...+|..||.+ ++..++-.-
T Consensus 6 ~r~~~e~~~G~d--l~~Vrvh~~ 26 (79)
T PF13699_consen 6 IRSRLERAFGAD--LSDVRVHTG 26 (79)
T ss_pred HHHHHHHHhCCC--ccceEEEeC
Confidence 456788899844 676666543
No 25
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=91.16 E-value=1.5 Score=29.21 Aligned_cols=14 Identities=36% Similarity=0.582 Sum_probs=11.4
Q ss_pred HHHHHHHHHHHHHh
Q psy6925 105 NIALVVGHELAHQW 118 (131)
Q Consensus 105 ~~~~~iahEiaHqW 118 (131)
....++|||++|+.
T Consensus 130 ~~a~~~AHElGH~l 143 (194)
T cd04269 130 LFAVTMAHELGHNL 143 (194)
T ss_pred HHHHHHHHHHHhhc
Confidence 44579999999984
No 26
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=90.91 E-value=0.59 Score=32.31 Aligned_cols=71 Identities=20% Similarity=0.183 Sum_probs=38.5
Q ss_pred HHHHHHHHHHHHHHHhhccccCCCcccEEEecCCCCcchhhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHhh
Q psy6925 40 FALHVASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGAMENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWF 119 (131)
Q Consensus 40 ~~~~~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~me~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqWf 119 (131)
.+.+.....++.+.+.+|.++.--++. .+ .-..|+-...|.+.++......+ ...+..+++||+||-=.
T Consensus 120 ~~~~~l~~~~~~~~~~l~~~~~~~~ik--~~-k~~WGScs~~~~i~~~~~l~~~p--------~~~i~YVvvHELaHLke 188 (223)
T COG1451 120 ILREILEIRLKEYAKKLGVPPRAIKLK--NM-KRRWGSCSKAGEIRFNWRLVMAP--------EEVIDYVVVHELAHLKE 188 (223)
T ss_pred HHHHHHHHHHHHHHHHhCCCccceeee--ec-cceeeeecCCCcEEeehhhhcCC--------HHHHHHHHHHHHHHHhh
Confidence 344566777778888888665432222 11 22334444444444443322111 24556799999999755
Q ss_pred cC
Q psy6925 120 GN 121 (131)
Q Consensus 120 G~ 121 (131)
.|
T Consensus 189 ~n 190 (223)
T COG1451 189 KN 190 (223)
T ss_pred hh
Confidence 54
No 27
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=90.60 E-value=1.2 Score=30.49 Aligned_cols=12 Identities=42% Similarity=0.565 Sum_probs=10.3
Q ss_pred HHHHHHHHHHHh
Q psy6925 107 ALVVGHELAHQW 118 (131)
Q Consensus 107 ~~~iahEiaHqW 118 (131)
..++|||++|..
T Consensus 146 ~~~~AHElGH~l 157 (220)
T cd04272 146 VYTMTHELAHLL 157 (220)
T ss_pred HHHHHHHHHHHh
Confidence 479999999974
No 28
>PF10026 DUF2268: Predicted Zn-dependent protease (DUF2268); InterPro: IPR018728 This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function.
Probab=90.52 E-value=1.7 Score=29.30 Aligned_cols=18 Identities=17% Similarity=0.336 Sum_probs=14.3
Q ss_pred HHHHHHHHHHHHHHhhcC
Q psy6925 104 QNIALVVGHELAHQWFGN 121 (131)
Q Consensus 104 ~~~~~~iahEiaHqWfG~ 121 (131)
..+..++|||+-|.+--.
T Consensus 63 ~~l~~~iaHE~hH~~r~~ 80 (195)
T PF10026_consen 63 EELPALIAHEYHHNCRYE 80 (195)
T ss_pred HHHHHHHHHHHHHHHHHh
Confidence 467789999999996543
No 29
>PRK04897 heat shock protein HtpX; Provisional
Probab=90.37 E-value=0.5 Score=34.06 Aligned_cols=64 Identities=22% Similarity=0.282 Sum_probs=33.5
Q ss_pred HHHHHHHHHHHhhccccCCCcccEEEecCCCCcchh-----hhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHh
Q psy6925 44 VASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGAME-----NWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQW 118 (131)
Q Consensus 44 ~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~me-----~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqW 118 (131)
...+.++-+.+..|.| -+++-++ ++...++.. .-+.+.+.+. ++ + .-..+.+..++|||++|-=
T Consensus 81 ~L~~~v~~la~~~gip--~p~v~v~--~~~~~NAfa~G~~~~~~~v~vt~g-Ll-~-----~l~~~El~aVlAHElgHi~ 149 (298)
T PRK04897 81 ELWHIVEDMAMVAQIP--MPRVFII--DDPSPNAFATGSSPKNAAVAVTTG-LL-A-----IMNREELEGVIGHEISHIR 149 (298)
T ss_pred HHHHHHHHHHHHcCCC--CCcEEEe--cCCCCceEEeccCCCCcEEEeehH-HH-h-----hCCHHHHHHHHHHHHHHHh
Confidence 4667788888888754 4555443 322222221 1123333322 11 1 0113567789999999953
No 30
>PRK02391 heat shock protein HtpX; Provisional
Probab=90.30 E-value=0.51 Score=34.02 Aligned_cols=67 Identities=13% Similarity=0.147 Sum_probs=33.7
Q ss_pred HHHHHHHHHHHhhccccCCCcccEEEecCCCCcchh-----hhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHh
Q psy6925 44 VASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGAME-----NWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQW 118 (131)
Q Consensus 44 ~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~me-----~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqW 118 (131)
...+.++-+.+..|.|- +++-++ ++...++.. .-+.+.+.+. ++ + .. +...+..++|||++|--
T Consensus 77 ~L~~~v~~la~~~~~~~--p~v~v~--~~~~~NAfa~G~~~~~~~V~vt~g-Ll-~-~L----~~~El~aVlaHElgHi~ 145 (296)
T PRK02391 77 ELHAMVERLCALADLPK--PRVAVA--DSDVPNAFATGRSPKNAVVCVTTG-LM-R-RL----DPDELEAVLAHELSHVK 145 (296)
T ss_pred HHHHHHHHHHHHcCCCC--CcEEEE--eCCCCceEEecCCCCCcEEEecHH-HH-h-hC----CHHHHHHHHHHHHHHHH
Confidence 45667777778888553 454433 321111111 1123333332 22 1 11 13556789999999954
Q ss_pred hcC
Q psy6925 119 FGN 121 (131)
Q Consensus 119 fG~ 121 (131)
=++
T Consensus 146 ~~d 148 (296)
T PRK02391 146 NRD 148 (296)
T ss_pred cCC
Confidence 433
No 31
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=90.22 E-value=0.26 Score=30.20 Aligned_cols=15 Identities=33% Similarity=0.616 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHHHhh
Q psy6925 105 NIALVVGHELAHQWF 119 (131)
Q Consensus 105 ~~~~~iahEiaHqWf 119 (131)
.+..+++||++|-|=
T Consensus 78 GC~~TL~HEL~H~WQ 92 (141)
T PHA02456 78 GCRDTLAHELNHAWQ 92 (141)
T ss_pred chHHHHHHHHHHHHh
Confidence 345699999999994
No 32
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=89.71 E-value=2.9 Score=26.88 Aligned_cols=37 Identities=14% Similarity=0.172 Sum_probs=25.2
Q ss_pred eEEEEEecCCcc---chHHHHHHHHHHHHHHHHHhhcccc
Q psy6925 24 VLVRVYTPVGKR---EQGQFALHVASKVLPFYKDYFNIAY 60 (131)
Q Consensus 24 ~~i~~~~~~~~~---~~~~~~~~~~~~~l~~~~~~~g~~y 60 (131)
.+|++|..+... .......+.+.+++...++..++.+
T Consensus 2 ~~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F 41 (156)
T cd04279 2 SPIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKF 41 (156)
T ss_pred CCeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEE
Confidence 467777776532 2345567888999999988766443
No 33
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=89.53 E-value=0.41 Score=30.82 Aligned_cols=18 Identities=22% Similarity=0.170 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHHHHhhc
Q psy6925 103 RQNIALVVGHELAHQWFG 120 (131)
Q Consensus 103 ~~~~~~~iahEiaHqWfG 120 (131)
...+..+|.||++|.|..
T Consensus 57 ~~~~~~tL~HEm~H~~~~ 74 (157)
T PF10263_consen 57 EEELIDTLLHEMAHAAAY 74 (157)
T ss_pred HHHHHHHHHHHHHHHHhh
Confidence 456778999999999983
No 34
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=88.91 E-value=1.2 Score=29.94 Aligned_cols=14 Identities=36% Similarity=0.762 Sum_probs=11.0
Q ss_pred HHHHHHHHHHHHHH
Q psy6925 104 QNIALVVGHELAHQ 117 (131)
Q Consensus 104 ~~~~~~iahEiaHq 117 (131)
.....++|||++|.
T Consensus 129 ~~~a~~~AHelGH~ 142 (199)
T PF01421_consen 129 LSFAVIIAHELGHN 142 (199)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHh
Confidence 34557999999995
No 35
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=88.80 E-value=0.33 Score=32.08 Aligned_cols=12 Identities=42% Similarity=0.498 Sum_probs=10.3
Q ss_pred HHHHHHHHHHHh
Q psy6925 107 ALVVGHELAHQW 118 (131)
Q Consensus 107 ~~~iahEiaHqW 118 (131)
..++|||++||+
T Consensus 112 ~~~~aHElGH~l 123 (173)
T PF13574_consen 112 IDTFAHELGHQL 123 (173)
T ss_dssp HHHHHHHHHHHH
T ss_pred eeeehhhhHhhc
Confidence 468999999995
No 36
>PRK01265 heat shock protein HtpX; Provisional
Probab=88.59 E-value=0.83 Score=33.40 Aligned_cols=63 Identities=24% Similarity=0.251 Sum_probs=32.0
Q ss_pred HHHHHHHHHHHhhccccCCCcccEEEecCCCCcchhh-----hchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHH
Q psy6925 44 VASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGAMEN-----WGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQ 117 (131)
Q Consensus 44 ~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~me~-----~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHq 117 (131)
...+.++-+.+..|.| -+++-++ ++...++..+ ...+.+.+. ++ +. -..+.+..++|||++|-
T Consensus 84 ~L~~~v~~la~~~g~~--~p~vyv~--~~~~~NAfa~G~~~~~~~Ivvt~g-Ll-~~-----l~~~El~aVlAHElgHi 151 (324)
T PRK01265 84 WLYSIVAEVAKYNGIR--VPKVYIA--DVPFPNAFAYGSPIAGKRIAITLP-LL-KI-----LNRDEIKAVAGHELGHL 151 (324)
T ss_pred HHHHHHHHHHHHcCCC--CCeEEEe--cCCCCCeEEeccCCCCCEEEEehH-HH-hh-----CCHHHHHHHHHHHHHHH
Confidence 3566777777888754 4554433 3211111111 122333333 22 11 12356778999999994
No 37
>PRK05457 heat shock protein HtpX; Provisional
Probab=88.10 E-value=1 Score=32.28 Aligned_cols=68 Identities=24% Similarity=0.391 Sum_probs=35.0
Q ss_pred HHHHHHHHHHhhccccCCCcccEEEecCC---CCcchhhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHhhcC
Q psy6925 45 ASKVLPFYKDYFNIAYPLPKIDLVAVPDF---SCGAMENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFGN 121 (131)
Q Consensus 45 ~~~~l~~~~~~~g~~yP~~~~~~v~~p~~---~~~~me~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqWfG~ 121 (131)
..+.++-+.+..|. |-+++.++-.+.. ..|.-...+.+.+.+. ++ + .-+.+.+..++|||++|.==|+
T Consensus 79 L~~~v~~la~~~g~--p~p~v~v~~~~~~NAfa~G~~~~~~~V~vt~g-Ll-~-----~L~~~El~aVlAHElgHi~~~d 149 (284)
T PRK05457 79 LVETVARQARQAGI--GMPEVAIYHSPEINAFATGASKNNSLVAVSTG-LL-Q-----NMSRDEVEAVLAHEISHIANGD 149 (284)
T ss_pred HHHHHHHHHHhCCC--CCCCEEEEeCCCceEEEecCCCCCeEEEeehH-Hh-h-----hCCHHHHHHHHHHHHHHHHcCC
Confidence 55677777888874 4565554432210 1111112223333332 11 1 1123567789999999975544
No 38
>PF10460 Peptidase_M30: Peptidase M30; InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue.
Probab=86.62 E-value=5.8 Score=29.62 Aligned_cols=15 Identities=20% Similarity=0.381 Sum_probs=11.9
Q ss_pred HHHHHHHHHHHHHHh
Q psy6925 104 QNIALVVGHELAHQW 118 (131)
Q Consensus 104 ~~~~~~iahEiaHqW 118 (131)
.....++|||+-|.=
T Consensus 137 ~~~~sTlAHEfQHmI 151 (366)
T PF10460_consen 137 DTVYSTLAHEFQHMI 151 (366)
T ss_pred HHHHHHHHHHHHHHH
Confidence 446789999999963
No 39
>PF08325 WLM: WLM domain; InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=86.42 E-value=0.61 Score=31.33 Aligned_cols=21 Identities=33% Similarity=0.392 Sum_probs=17.4
Q ss_pred HHHHHHHHHHHHHHHHHhhcC
Q psy6925 101 ITRQNIALVVGHELAHQWFGN 121 (131)
Q Consensus 101 ~~~~~~~~~iahEiaHqWfG~ 121 (131)
-....+..++.||+||.++|+
T Consensus 77 l~~~~i~~t~lHELaH~~~~~ 97 (186)
T PF08325_consen 77 LPYETILGTMLHELAHNVHGP 97 (186)
T ss_pred eeHHHHHHHHHHHHHhcccCC
Confidence 345677889999999999886
No 40
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=86.32 E-value=1.1 Score=28.94 Aligned_cols=14 Identities=57% Similarity=0.662 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHH
Q psy6925 103 RQNIALVVGHELAH 116 (131)
Q Consensus 103 ~~~~~~~iahEiaH 116 (131)
...+..+|.||+||
T Consensus 58 ~~f~~~vV~HELaH 71 (156)
T COG3091 58 EDFIEQVVPHELAH 71 (156)
T ss_pred HHHHHHHHHHHHHH
Confidence 34567899999987
No 41
>PRK02870 heat shock protein HtpX; Provisional
Probab=85.89 E-value=1.7 Score=32.01 Aligned_cols=64 Identities=20% Similarity=0.282 Sum_probs=32.3
Q ss_pred HHHHHHHHHHHhhccccCCCcccEEEecCCCCcchhh-----hchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHH
Q psy6925 44 VASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGAMEN-----WGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQ 117 (131)
Q Consensus 44 ~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~me~-----~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHq 117 (131)
.+.++++-+.+..|.|+ -+++.+ +++...++... -+.|.+.+. ++ + .. ..+.+..++|||++|-
T Consensus 116 ~L~~~ve~La~~ag~p~-~p~V~v--i~~~~~NAFA~G~~~~~~~Ivvt~G-LL-~-~L----~~dEL~aVlAHELgHi 184 (336)
T PRK02870 116 QLYNVVEELLVAAGLRF-MPKVYI--IDAPYMNAFASGYSEKSAMVAITTG-LL-E-KL----DRDELQAVMAHELSHI 184 (336)
T ss_pred HHHHHHHHHHHHcCCCC-CCeEEE--EcCCCCceEEecCCCCCcEEEEehH-Hh-h-hC----CHHHHHHHHHHHHHHH
Confidence 34566677777777543 234333 34322222221 233443333 22 1 11 2356778999999995
No 42
>PRK03072 heat shock protein HtpX; Provisional
Probab=85.24 E-value=1.7 Score=31.20 Aligned_cols=63 Identities=19% Similarity=0.299 Sum_probs=33.0
Q ss_pred HHHHHHHHHHHhhccccCCCcccEEEecCCCCcchhh---h--chhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHH
Q psy6925 44 VASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCGAMEN---W--GLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQ 117 (131)
Q Consensus 44 ~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~~me~---~--g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHq 117 (131)
...+.++-+.+..|.| -+++-+ +++...++... + ..+...+. ++ + ....+.+..++|||++|-
T Consensus 71 ~L~~~v~~la~~~g~p--~p~vyv--~~~~~~NAFa~G~~~~~~~v~vt~g-Ll-~-----~l~~~El~aVlAHElgHi 138 (288)
T PRK03072 71 AMYRIVRELSTAARQP--MPRLYI--SPTAAPNAFATGRNPRNAAVCCTEG-IL-Q-----ILNERELRGVLGHELSHV 138 (288)
T ss_pred HHHHHHHHHHHHcCCC--CCCEEE--ecCCCCceEEecCCCCCcEEEecHH-HH-H-----hCCHHHHHHHHHHHHHHH
Confidence 4667888888888854 455433 33322222221 1 12222222 22 1 112356778999999994
No 43
>KOG2661|consensus
Probab=84.62 E-value=0.64 Score=34.08 Aligned_cols=18 Identities=39% Similarity=0.582 Sum_probs=14.2
Q ss_pred HHHHHHHHHHHHHHHhhc
Q psy6925 103 RQNIALVVGHELAHQWFG 120 (131)
Q Consensus 103 ~~~~~~~iahEiaHqWfG 120 (131)
...++.+++||+|||=-+
T Consensus 272 ddglAtvLgHE~aHaVar 289 (424)
T KOG2661|consen 272 DDGLATVLGHEIAHAVAR 289 (424)
T ss_pred hHHHHHHHHHHHHHHHHH
Confidence 356778999999999554
No 44
>KOG2719|consensus
Probab=84.45 E-value=1.5 Score=33.16 Aligned_cols=68 Identities=21% Similarity=0.381 Sum_probs=34.2
Q ss_pred HHHHHHHHHhhccccCCCcccEEEe--cCCCCc----chh-hhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHh
Q psy6925 46 SKVLPFYKDYFNIAYPLPKIDLVAV--PDFSCG----AME-NWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQW 118 (131)
Q Consensus 46 ~~~l~~~~~~~g~~yP~~~~~~v~~--p~~~~~----~me-~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqW 118 (131)
+.-++.+.+-.| +|.+++.++-. ++...+ |.. +-.+++|+ + ++.++.. ...+.+..+++||++|-=
T Consensus 220 ~~~Ie~la~s~g--fp~~k~~vi~~s~rs~hsNAyfyG~~~~KRIvIyD-t-Ll~~~~~---~~~eel~AVl~HELGHW~ 292 (428)
T KOG2719|consen 220 KEKIERLADSVG--FPLSKYRVIDGSKRSSHSNAYFYGLCKNKRIVIYD-T-LLLEEEH---LNNEELVAVLAHELGHWK 292 (428)
T ss_pred HHHHHHHHHhcC--CCceEEEEEecCCCCCCCCeeeeeccccceEEEeh-h-hhhhhhc---cccHHHHHHHHHHhhHHH
Confidence 455666666665 88888888764 211111 111 11122222 1 1110000 123567789999999954
Q ss_pred hc
Q psy6925 119 FG 120 (131)
Q Consensus 119 fG 120 (131)
-|
T Consensus 293 ~~ 294 (428)
T KOG2719|consen 293 LN 294 (428)
T ss_pred Hh
Confidence 33
No 45
>KOG3607|consensus
Probab=83.32 E-value=5.5 Score=32.45 Aligned_cols=88 Identities=17% Similarity=0.216 Sum_probs=41.6
Q ss_pred EEEEEecCCccchHHHHHHHHHHHHHHHHHhhccccCCCcccEEEecCC--CCcchhhhchhh---hhccccccCCCCCC
Q psy6925 25 LVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAVPDF--SCGAMENWGLVT---YREVCLLVDSQNTS 99 (131)
Q Consensus 25 ~i~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~--~~~~me~~g~i~---~~~~~~l~~~~~~~ 99 (131)
.+++|..++.....+.+.+.+...++|=..++...-|.+..-++..-.+ ...|+++.+-+- ++.. + ......
T Consensus 242 ~lE~Wt~~dki~~~~~~~~tL~~F~~wr~~~l~~r~~hD~a~L~~~~~~~~~~~G~a~~~~mCs~~~s~g-v--~~~~~~ 318 (716)
T KOG3607|consen 242 GLEIWTDGNKIDVSEDLRETLHNFLKWRKSYLTTRLPHDAAHLLSGILFYGKYVGLAYFGGMCSPGHSGG-V--NKFHSD 318 (716)
T ss_pred EEEecCCCCeecccccHHHHHHHHHHHHHhhccccCCCCceEEEEeeeccCceeceeecccccCcccccc-e--eecCcc
Confidence 3567777766554444545555555554554543335444333332111 122444444221 1111 1 111111
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy6925 100 AITRQNIALVVGHELAHQ 117 (131)
Q Consensus 100 ~~~~~~~~~~iahEiaHq 117 (131)
.....+.++||||+|.
T Consensus 319 --~~~~~a~v~AhelgH~ 334 (716)
T KOG3607|consen 319 --ILLAFAVVLAHELGHN 334 (716)
T ss_pred --cchhHHHHHHHHHHhh
Confidence 1244567999999995
No 46
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=81.44 E-value=1.3 Score=29.53 Aligned_cols=15 Identities=33% Similarity=0.496 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHHh
Q psy6925 104 QNIALVVGHELAHQW 118 (131)
Q Consensus 104 ~~~~~~iahEiaHqW 118 (131)
.....++||||+|.+
T Consensus 140 ~~~~~~~AHEiGH~l 154 (196)
T PF13688_consen 140 YNGAITFAHEIGHNL 154 (196)
T ss_dssp HHHHHHHHHHHHHHT
T ss_pred CceehhhHHhHHHhc
Confidence 334579999999984
No 47
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=80.71 E-value=1.2 Score=31.67 Aligned_cols=20 Identities=25% Similarity=0.358 Sum_probs=15.4
Q ss_pred HHHHHHHHHHHHHHhhcCcc
Q psy6925 104 QNIALVVGHELAHQWFGNLV 123 (131)
Q Consensus 104 ~~~~~~iahEiaHqWfG~~v 123 (131)
+.+..+++||++|.=-++.+
T Consensus 155 dEl~aVlaHElgHi~~rd~~ 174 (302)
T COG0501 155 DELEAVLAHELGHIKNRHTL 174 (302)
T ss_pred HHHHHHHHHHHHHHhcccHH
Confidence 56778999999997655544
No 48
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=80.68 E-value=1.2 Score=31.74 Aligned_cols=17 Identities=41% Similarity=0.386 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHHHhhc
Q psy6925 104 QNIALVVGHELAHQWFG 120 (131)
Q Consensus 104 ~~~~~~iahEiaHqWfG 120 (131)
..+..+|-||+|||=+.
T Consensus 195 ~~lA~LIFHELAHQk~Y 211 (376)
T COG4324 195 TYLASLIFHELAHQKIY 211 (376)
T ss_pred HHHHHHHHHHHhhheEe
Confidence 46678999999999653
No 49
>PF05569 Peptidase_M56: BlaR1 peptidase M56; InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription.
Probab=77.62 E-value=1.8 Score=30.93 Aligned_cols=14 Identities=36% Similarity=0.577 Sum_probs=11.0
Q ss_pred HHHHHHHHHHHHHH
Q psy6925 104 QNIALVVGHELAHQ 117 (131)
Q Consensus 104 ~~~~~~iahEiaHq 117 (131)
+....++.||++|-
T Consensus 194 ~el~~il~HEl~Hi 207 (299)
T PF05569_consen 194 EELRAILLHELAHI 207 (299)
T ss_pred HHHHHHHHHHHHHH
Confidence 44557999999994
No 50
>PF10023 DUF2265: Predicted aminopeptidase (DUF2265); InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=77.57 E-value=1.6 Score=32.10 Aligned_cols=29 Identities=10% Similarity=0.107 Sum_probs=18.4
Q ss_pred EecCCccchHHHHHHHHHHHHHHHHHhhc
Q psy6925 29 YTPVGKREQGQFALHVASKVLPFYKDYFN 57 (131)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~g 57 (131)
...|...+..+.-+..+.++-+|-.+.+|
T Consensus 39 l~dp~~~~~lr~rL~~~~~iR~FA~~~L~ 67 (337)
T PF10023_consen 39 LADPATPPALRARLRLAQQIRRFASEELG 67 (337)
T ss_pred hhCCCCCHHHHHHHHHHHHHHHHHHHhcC
Confidence 34455555555556677777777777776
No 51
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=77.08 E-value=9.4 Score=26.00 Aligned_cols=69 Identities=13% Similarity=0.233 Sum_probs=34.2
Q ss_pred HHHHHHHHHHHHHhhcccc-CC-CcccEEEecCCCCcchhhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHH
Q psy6925 42 LHVASKVLPFYKDYFNIAY-PL-PKIDLVAVPDFSCGAMENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQ 117 (131)
Q Consensus 42 ~~~~~~~l~~~~~~~g~~y-P~-~~~~~v~~p~~~~~~me~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHq 117 (131)
.+.+.+|++.+++...+.| |. +.-+.|.+-....|.-++.|....+...+-+.+... ...+++||+.|-
T Consensus 28 ~~~I~~Am~~~~~~TCirF~~r~~~~~yi~f~~~~~GC~S~vG~~~~g~q~isl~~~C~-------~~Gti~HEl~Ha 98 (200)
T cd04281 28 RAMFKQAMRHWENFTCVTFVERTPEENYIVFTYRPCGCCSYVGRRGNGPQAISIGKNCD-------KFGIVVHELGHV 98 (200)
T ss_pred HHHHHHHHHHHHhCCceEEEECCCCCCEEEEEECCCCeeEcCCCcCCCceeeecCCCcC-------cCchHHHHHHHH
Confidence 4578899999999877555 22 122333322112233333333221222222222211 125899999984
No 52
>PF04450 BSP: Peptidase of plants and bacteria; InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=76.21 E-value=10 Score=25.95 Aligned_cols=19 Identities=37% Similarity=0.569 Sum_probs=14.8
Q ss_pred HHHHHHHHHHHHHHHhhcC
Q psy6925 103 RQNIALVVGHELAHQWFGN 121 (131)
Q Consensus 103 ~~~~~~~iahEiaHqWfG~ 121 (131)
+..+..+|.||++|-|=.+
T Consensus 93 ~~Ei~Gvl~HE~~H~~Q~~ 111 (205)
T PF04450_consen 93 RDEIIGVLYHEMVHCWQWD 111 (205)
T ss_pred HHHHHHHHHHHHHHHhhcC
Confidence 4667789999999976543
No 53
>PF09768 Peptidase_M76: Peptidase M76 family; InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=74.83 E-value=12 Score=24.98 Aligned_cols=63 Identities=17% Similarity=0.332 Sum_probs=32.1
Q ss_pred HHHHHHHHHHHhhccccCCCcccEEE--ecCCCCcchh--hhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHh
Q psy6925 44 VASKVLPFYKDYFNIAYPLPKIDLVA--VPDFSCGAME--NWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQW 118 (131)
Q Consensus 44 ~~~~~l~~~~~~~g~~yP~~~~~~v~--~p~~~~~~me--~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqW 118 (131)
.++-+++.+++ +|.+ |.+.-.+.. .++...||.. ..|+++ -.....+ +..+..+|+||+.|-|
T Consensus 17 ~v~fl~~~~~~-~gc~-~~~~~~i~c~~C~~~~~Ggf~p~~~~I~l-------C~N~~~~---~~~l~~~l~HELIHay 83 (173)
T PF09768_consen 17 TVRFLMEALKK-LGCP-PVPPRHIKCEPCDSSVSGGFDPSKKGIVL-------CQNRIRS---QGHLEDTLTHELIHAY 83 (173)
T ss_pred HHHHHHHHHHH-cCCC-CCCCCCeEEEECcCCCcCCccCCCCCEEE-------eeCCCCC---HHHHHHHHHHHHHHHH
Confidence 34555555555 4545 344333433 3443345543 233433 2222333 3455689999999975
No 54
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=72.15 E-value=2.5 Score=28.47 Aligned_cols=25 Identities=20% Similarity=0.401 Sum_probs=16.4
Q ss_pred CCCCHHHHHHHHHHHHHHHHHHhhc
Q psy6925 96 QNTSAITRQNIALVVGHELAHQWFG 120 (131)
Q Consensus 96 ~~~~~~~~~~~~~~iahEiaHqWfG 120 (131)
+.......-.+..+|||||.|---.
T Consensus 26 ~~p~~~~yg~lG~ilahel~hafd~ 50 (206)
T PF01431_consen 26 NYPPALNYGGLGFILAHELMHAFDP 50 (206)
T ss_dssp TS-HHHHHHTHHHHHHHHHHHCTST
T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333445555668999999997643
No 55
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=71.83 E-value=6.7 Score=26.99 Aligned_cols=15 Identities=27% Similarity=0.459 Sum_probs=12.6
Q ss_pred HHHHHHHHHHhhcCc
Q psy6925 108 LVVGHELAHQWFGNL 122 (131)
Q Consensus 108 ~~iahEiaHqWfG~~ 122 (131)
-++|||+.|-++..-
T Consensus 74 FtlAHELGH~llH~~ 88 (213)
T COG2856 74 FTLAHELGHALLHTD 88 (213)
T ss_pred HHHHHHHhHHHhccc
Confidence 389999999998644
No 56
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=71.34 E-value=17 Score=24.38 Aligned_cols=68 Identities=16% Similarity=0.123 Sum_probs=36.3
Q ss_pred HHHHHHHHHHHHHHhhccccCCCc---ccEEEecCCCCcchhhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHH
Q psy6925 41 ALHVASKVLPFYKDYFNIAYPLPK---IDLVAVPDFSCGAMENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQ 117 (131)
Q Consensus 41 ~~~~~~~~l~~~~~~~g~~yP~~~---~~~v~~p~~~~~~me~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHq 117 (131)
..+.+.++++.+++..++.|-... -+.|.+.. ..|.....|.. -++..+.+.+.... ..++.||+.|-
T Consensus 20 ~~~~I~~Am~~~e~~TcI~F~~~~~~~~~~i~~~~-~~gC~S~vG~~-~g~q~i~l~~~c~~-------~~~i~HEl~Ha 90 (191)
T PF01400_consen 20 QRQRIRKAMDEWEKNTCIRFVERTENEDDYISFSN-GSGCWSYVGRQ-GGEQTINLGDGCFS-------VGTILHELGHA 90 (191)
T ss_dssp HHHHHHHHHHHHHHHSSEEEEE-SSSSSSEEEEES-SSSEEEESS---SSEEEEEE-TTC-S-------HHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhCCCEEEEECCCCCceEEEeec-Cccccchhhhc-CcceeEEecceeCC-------ccchHHHHHHH
Confidence 335789999999999986553222 22333322 33444555554 23333444433222 25899999984
No 57
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=71.33 E-value=2.8 Score=25.63 Aligned_cols=10 Identities=30% Similarity=0.541 Sum_probs=9.1
Q ss_pred HHHHHHHHHH
Q psy6925 108 LVVGHELAHQ 117 (131)
Q Consensus 108 ~~iahEiaHq 117 (131)
.+++||+.|+
T Consensus 109 ~~~~HEiGH~ 118 (124)
T PF13582_consen 109 DTFAHEIGHN 118 (124)
T ss_dssp THHHHHHHHH
T ss_pred eEeeehhhHh
Confidence 5999999997
No 58
>PF13265 DUF4056: Protein of unknown function (DUF4056)
Probab=70.42 E-value=2.2 Score=30.15 Aligned_cols=27 Identities=15% Similarity=0.127 Sum_probs=13.7
Q ss_pred EEecCCccchHHHHHHHHHHHHHHHHH
Q psy6925 28 VYTPVGKREQGQFALHVASKVLPFYKD 54 (131)
Q Consensus 28 ~~~~~~~~~~~~~~~~~~~~~l~~~~~ 54 (131)
+|...+.--+...+-+.+..++-.|++
T Consensus 73 iYT~~GGFIDiAHVRDTAD~T~yLF~~ 99 (270)
T PF13265_consen 73 IYTHRGGFIDIAHVRDTADYTYYLFSQ 99 (270)
T ss_pred EEeccCCceeehhcccchHHHHHHHHH
Confidence 555555444444555555555555544
No 59
>PF14891 Peptidase_M91: Effector protein
Probab=70.27 E-value=3.6 Score=27.17 Aligned_cols=13 Identities=38% Similarity=0.485 Sum_probs=11.2
Q ss_pred HHHHHHHHHHHhh
Q psy6925 107 ALVVGHELAHQWF 119 (131)
Q Consensus 107 ~~~iahEiaHqWf 119 (131)
..+|+||++|-|=
T Consensus 104 ~v~L~HEL~HA~~ 116 (174)
T PF14891_consen 104 FVVLYHELIHAYD 116 (174)
T ss_pred HHHHHHHHHHHHH
Confidence 4699999999885
No 60
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=67.20 E-value=4.1 Score=28.47 Aligned_cols=12 Identities=50% Similarity=0.888 Sum_probs=10.2
Q ss_pred HHHHHHHHHHHh
Q psy6925 107 ALVVGHELAHQW 118 (131)
Q Consensus 107 ~~~iahEiaHqW 118 (131)
..++|||++|.+
T Consensus 168 a~t~AHElGHnl 179 (244)
T cd04270 168 DLVTAHELGHNF 179 (244)
T ss_pred HHHHHHHHHHhc
Confidence 469999999984
No 61
>KOG3314|consensus
Probab=66.22 E-value=36 Score=22.61 Aligned_cols=15 Identities=40% Similarity=0.618 Sum_probs=11.8
Q ss_pred HHHHHHHHHHHHHHh
Q psy6925 104 QNIALVVGHELAHQW 118 (131)
Q Consensus 104 ~~~~~~iahEiaHqW 118 (131)
.-+..+++||+.|..
T Consensus 89 ~h~n~vv~HElIH~f 103 (194)
T KOG3314|consen 89 DHVNQVVIHELIHAF 103 (194)
T ss_pred HHHHHHHHHHHHHHH
Confidence 345679999999974
No 62
>PF08219 TOM13: Outer membrane protein TOM13; InterPro: IPR013262 The TOM13 family of proteins are mitochondrial outer membrane proteins that mediate the assembly of beta-barrel proteins [].; GO: 0005741 mitochondrial outer membrane
Probab=65.62 E-value=5.2 Score=22.74 Aligned_cols=12 Identities=25% Similarity=0.434 Sum_probs=9.8
Q ss_pred HHHHHHHHHHHh
Q psy6925 107 ALVVGHELAHQW 118 (131)
Q Consensus 107 ~~~iahEiaHqW 118 (131)
.+++|||++..|
T Consensus 53 GEl~AhE~~fr~ 64 (77)
T PF08219_consen 53 GELFAHEIAFRL 64 (77)
T ss_pred HHHHHHHHHHHh
Confidence 368999999875
No 63
>KOG3714|consensus
Probab=64.85 E-value=17 Score=27.66 Aligned_cols=70 Identities=16% Similarity=0.080 Sum_probs=37.1
Q ss_pred HHHHHHHHHHHHHHHHhhccccCC---CcccEEEe-cCCCCcchhhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHH
Q psy6925 39 QFALHVASKVLPFYKDYFNIAYPL---PKIDLVAV-PDFSCGAMENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHEL 114 (131)
Q Consensus 39 ~~~~~~~~~~l~~~~~~~g~~yP~---~~~~~v~~-p~~~~~~me~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEi 114 (131)
......+.+||+++++..-+.|-- +.-+.+.+ ... |--.+.|..--++..+.+++.-.. ..+++||+
T Consensus 97 ~~~r~~i~~A~~~~~~~TCi~F~~~~~~~~~~~~~~~~~--gC~S~VGr~gg~~q~~sl~~~C~~-------~G~i~HEl 167 (411)
T KOG3714|consen 97 SSQRALIRQAMREIENHTCIRFVERTTPDKDYLIVFTGG--GCYSYVGRRGGGQQLLSLGDGCDR-------FGTIVHEL 167 (411)
T ss_pred HHHHHHHHHHHHHHhhCcceEEEeCCCCCcceEEEeCCC--cceeeeCccCCCccceecCCCcCc-------CchhHHHH
Confidence 344567899999999988765532 11233332 322 333333333322222333333222 35899999
Q ss_pred HHH
Q psy6925 115 AHQ 117 (131)
Q Consensus 115 aHq 117 (131)
.|-
T Consensus 168 ~Ha 170 (411)
T KOG3714|consen 168 MHA 170 (411)
T ss_pred HHH
Confidence 994
No 64
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=63.20 E-value=4 Score=27.12 Aligned_cols=12 Identities=50% Similarity=0.797 Sum_probs=9.9
Q ss_pred HHHHHHHHHHHH
Q psy6925 106 IALVVGHELAHQ 117 (131)
Q Consensus 106 ~~~~iahEiaHq 117 (131)
...++|||++|.
T Consensus 133 ~~~~~aHElGH~ 144 (192)
T cd04267 133 TALTMAHELGHN 144 (192)
T ss_pred ehhhhhhhHHhh
Confidence 356999999995
No 65
>PF15641 Tox-MPTase5: Metallopeptidase toxin 5
Probab=62.99 E-value=13 Score=21.84 Aligned_cols=54 Identities=19% Similarity=0.305 Sum_probs=25.5
Q ss_pred ccCCCcccEEEecCCC-Ccchhhhchhh-hhccccccCCCCCCHHHHHHHHHHHHH-HHHHHhhcC
Q psy6925 59 AYPLPKIDLVAVPDFS-CGAMENWGLVT-YREVCLLVDSQNTSAITRQNIALVVGH-ELAHQWFGN 121 (131)
Q Consensus 59 ~yP~~~~~~v~~p~~~-~~~me~~g~i~-~~~~~~l~~~~~~~~~~~~~~~~~iah-EiaHqWfG~ 121 (131)
+||+-++..+-+-++. ..||..++.-+ ++-. .-.....+..++.| |+-|.||--
T Consensus 24 df~~l~lt~~p~y~~~~~sg~ak~~~gt~~g~~---------~f~sra~lr~~iiheelhhrw~~r 80 (109)
T PF15641_consen 24 DFPYLNLTHIPQYNYFMNSGMAKPNEGTQFGFN---------SFSSRAELRNTIIHEELHHRWWKR 80 (109)
T ss_pred hcccccceeccccChhhhcccccCCcccchhhh---------hhhHHHHHHHHHHHHHHHHHHHHh
Confidence 6666666655332222 22677665321 1111 11222333445555 577889953
No 66
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=62.75 E-value=4.7 Score=24.82 Aligned_cols=10 Identities=50% Similarity=0.547 Sum_probs=8.8
Q ss_pred HHHHHHHHHH
Q psy6925 108 LVVGHELAHQ 117 (131)
Q Consensus 108 ~~iahEiaHq 117 (131)
.+++||++|-
T Consensus 82 KviiHEllHI 91 (133)
T COG4900 82 KVIIHELLHI 91 (133)
T ss_pred HHHHHHHhcC
Confidence 6999999994
No 67
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=60.27 E-value=7.8 Score=28.87 Aligned_cols=52 Identities=21% Similarity=0.238 Sum_probs=27.6
Q ss_pred ccCCCcccEEEecCCCCcchhhhchhhhhccccc-cCCCCCCHHHHHHHHHHHHHHHHH
Q psy6925 59 AYPLPKIDLVAVPDFSCGAMENWGLVTYREVCLL-VDSQNTSAITRQNIALVVGHELAH 116 (131)
Q Consensus 59 ~yP~~~~~~v~~p~~~~~~me~~g~i~~~~~~~l-~~~~~~~~~~~~~~~~~iahEiaH 116 (131)
.-|-..|.+-++.+...++..-||=-+|-..-+| +- .+...+..+|+||+.|
T Consensus 82 ~~p~q~YriTilnSP~INAFALPGGYlYitRGLlAla------nd~sEvAAVl~HEmgH 134 (479)
T COG4784 82 ENPQQTYRITILNSPNINAFALPGGYLYITRGLLALA------NDSSEVAAVLAHEMGH 134 (479)
T ss_pred cCCCceEEEEEecCCCccccccCCceEEEehhHHHHc------CCHHHHHHHHHhhhhh
Confidence 3455566766665444445555553332222121 11 1124567899999998
No 68
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=58.91 E-value=7.9 Score=24.49 Aligned_cols=34 Identities=3% Similarity=-0.063 Sum_probs=20.3
Q ss_pred EEEEEecCCcc-chHHHHHHHHHHHHHHHHHhhcc
Q psy6925 25 LVRVYTPVGKR-EQGQFALHVASKVLPFYKDYFNI 58 (131)
Q Consensus 25 ~i~~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~g~ 58 (131)
+|+....+... ...+...+.+.+|++..++..+.
T Consensus 6 ~ity~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~~ 40 (154)
T PF00413_consen 6 TITYSISNTTPQLSQSEQRDAIRQAFQAWNDVAPL 40 (154)
T ss_dssp EEEEEESSBCTTS-HHHHHHHHHHHHHHHHTTSSE
T ss_pred cEEEEEECCCCCCCHHHHHHHHHHHHHHHHhcCCc
Confidence 34444433321 22345667889999999987653
No 69
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=58.44 E-value=11 Score=24.24 Aligned_cols=21 Identities=10% Similarity=0.157 Sum_probs=15.6
Q ss_pred HHHHHHHHHHHHHHHhhcccc
Q psy6925 40 FALHVASKVLPFYKDYFNIAY 60 (131)
Q Consensus 40 ~~~~~~~~~l~~~~~~~g~~y 60 (131)
...+.++++++..++..++.+
T Consensus 15 ~~r~~i~~A~~~W~~~~~i~F 35 (165)
T cd04268 15 KLRAAILDAIEAWNKAFAIGF 35 (165)
T ss_pred HHHHHHHHHHHHHHHHhCcCc
Confidence 445678888999988886554
No 70
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=58.25 E-value=2 Score=33.81 Aligned_cols=26 Identities=35% Similarity=0.595 Sum_probs=20.3
Q ss_pred cCCCCCCHHHHHHHHHHHHHHHHHHh
Q psy6925 93 VDSQNTSAITRQNIALVVGHELAHQW 118 (131)
Q Consensus 93 ~~~~~~~~~~~~~~~~~iahEiaHqW 118 (131)
++++.++...+-.+..+|+|||.|..
T Consensus 474 fd~ea~~a~NYGgIGaVIgHEI~HgF 499 (654)
T COG3590 474 FDPEADSAANYGGIGAVIGHEIGHGF 499 (654)
T ss_pred CCCCcchhhcccCccceehhhhcccc
Confidence 56666666667777889999999974
No 71
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=57.38 E-value=39 Score=22.37 Aligned_cols=68 Identities=16% Similarity=0.066 Sum_probs=35.0
Q ss_pred HHHHHHHHHHHHHHHhhcccc---CCCcccEEEecCCCCcchhhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHH
Q psy6925 40 FALHVASKVLPFYKDYFNIAY---PLPKIDLVAVPDFSCGAMENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAH 116 (131)
Q Consensus 40 ~~~~~~~~~l~~~~~~~g~~y---P~~~~~~v~~p~~~~~~me~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaH 116 (131)
...+.+++|++.+++..++.+ +-.+-.+.+..+ .|.-+..|... +...+.+..... ...++.||+.|
T Consensus 15 ~~~~~I~~A~~~w~~~TcIrF~~~~~~~~~I~f~~~--~Gc~S~vG~~~-~~q~i~l~~~c~-------~~g~v~HE~~H 84 (180)
T cd04280 15 SDRSLILRAMREIESNTCIRFVPRTTEKDYIRIVKG--SGCWSYVGRVG-GRQVVSLGSGCF-------SLGTIVHELMH 84 (180)
T ss_pred HHHHHHHHHHHHHHhCCcceEEECCCCCcEEEEEcC--CCcceecCccC-CceeEEeCCCcC-------cCchhHHHHHH
Confidence 445678999999999776544 222222333333 23333333331 222232222111 12589999998
Q ss_pred H
Q psy6925 117 Q 117 (131)
Q Consensus 117 q 117 (131)
-
T Consensus 85 a 85 (180)
T cd04280 85 A 85 (180)
T ss_pred H
Confidence 5
No 72
>PF14521 Aspzincin_M35: Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=57.12 E-value=8.2 Score=24.82 Aligned_cols=10 Identities=30% Similarity=0.298 Sum_probs=9.0
Q ss_pred HHHHHHHHHH
Q psy6925 107 ALVVGHELAH 116 (131)
Q Consensus 107 ~~~iahEiaH 116 (131)
..+|.||++|
T Consensus 97 ~~TLiHE~SH 106 (148)
T PF14521_consen 97 EGTLIHEWSH 106 (148)
T ss_dssp HHHHHHHHHH
T ss_pred HHHHHHhhhh
Confidence 4699999999
No 73
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=55.15 E-value=14 Score=22.06 Aligned_cols=35 Identities=23% Similarity=0.196 Sum_probs=20.2
Q ss_pred hchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHH
Q psy6925 81 WGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQ 117 (131)
Q Consensus 81 ~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHq 117 (131)
..+++|+... . ..-.+...-...+..++.||+||.
T Consensus 50 ~rI~lyR~pl-~-~~~~~~~eL~~~I~~tlvhEiah~ 84 (97)
T PF06262_consen 50 DRIVLYRRPL-E-RRARSREELAELIRDTLVHEIAHH 84 (97)
T ss_dssp EEEEEEHHHH-H-HT-SSHHHHHHHHHHHHHHHHHHH
T ss_pred CEEEEehHHH-H-HHhCCHHHHHHHHHHHHHHHHHHH
Confidence 3466666552 2 212233334556678999999996
No 74
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=54.30 E-value=9.8 Score=27.09 Aligned_cols=14 Identities=43% Similarity=0.591 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHhhc
Q psy6925 107 ALVVGHELAHQWFG 120 (131)
Q Consensus 107 ~~~iahEiaHqWfG 120 (131)
..++.||+.|...|
T Consensus 217 ~~v~vHE~GHsf~~ 230 (264)
T PF09471_consen 217 KQVVVHEFGHSFGG 230 (264)
T ss_dssp HHHHHHHHHHHTT-
T ss_pred cceeeeeccccccc
Confidence 46999999998654
No 75
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=54.25 E-value=12 Score=23.63 Aligned_cols=14 Identities=36% Similarity=0.491 Sum_probs=10.3
Q ss_pred HHHHHHHHHHHHHH
Q psy6925 104 QNIALVVGHELAHQ 117 (131)
Q Consensus 104 ~~~~~~iahEiaHq 117 (131)
....++.+||+.|-
T Consensus 75 y~~~RIaaHE~GHi 88 (132)
T PF02031_consen 75 YNSTRIAAHELGHI 88 (132)
T ss_dssp S-HHHHHHHHHHHH
T ss_pred Cccceeeeehhccc
Confidence 33457999999995
No 76
>PF11950 DUF3467: Protein of unknown function (DUF3467); InterPro: IPR021857 This entry is represented by Bacteriophage 92, 0rf53. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family of proteins are functionally uncharacterised. This protein is found in bacteria, archaea and viruses. Proteins in this family are typically between 101 to 118 amino acids in length.
Probab=53.87 E-value=31 Score=20.25 Aligned_cols=32 Identities=22% Similarity=0.218 Sum_probs=24.4
Q ss_pred CeEEEEEecCCccchHHHHHHHHHHHHHHHHHhhc
Q psy6925 23 GVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFN 57 (131)
Q Consensus 23 ~~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g 57 (131)
.+.-|++-.|.+ ++..+..+.+.++-||+.||
T Consensus 57 ~v~sRVimsP~~---AKrL~~aL~~~l~~YE~~fG 88 (92)
T PF11950_consen 57 KVSSRVIMSPQH---AKRLLKALQQNLQKYEQRFG 88 (92)
T ss_pred ceEEEEEeCHHH---HHHHHHHHHHHHHHHHHHcC
Confidence 455566666654 56777788999999999999
No 77
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=53.65 E-value=5.1 Score=27.78 Aligned_cols=10 Identities=40% Similarity=0.710 Sum_probs=8.8
Q ss_pred HHHHHHHHHH
Q psy6925 108 LVVGHELAHQ 117 (131)
Q Consensus 108 ~~iahEiaHq 117 (131)
.++|||++|.
T Consensus 147 ~t~AHElGHn 156 (228)
T cd04271 147 QVFAHEIGHT 156 (228)
T ss_pred eehhhhhhhh
Confidence 4899999997
No 78
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=52.94 E-value=9.7 Score=24.72 Aligned_cols=14 Identities=36% Similarity=0.876 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHhhcC
Q psy6925 107 ALVVGHELAHQWFGN 121 (131)
Q Consensus 107 ~~~iahEiaHqWfG~ 121 (131)
..+++||+.| |+|-
T Consensus 70 g~TltHEvGH-~LGL 83 (154)
T PF05572_consen 70 GKTLTHEVGH-WLGL 83 (154)
T ss_dssp SHHHHHHHHH-HTT-
T ss_pred ccchhhhhhh-hhcc
Confidence 3699999998 6664
No 79
>COG3227 LasB Zinc metalloprotease (elastase) [Amino acid transport and metabolism]
Probab=51.85 E-value=6.5 Score=30.34 Aligned_cols=26 Identities=27% Similarity=0.266 Sum_probs=19.4
Q ss_pred hHHHHHHHHHHHHHHHHHhhccccCCC
Q psy6925 37 QGQFALHVASKVLPFYKDYFNIAYPLP 63 (131)
Q Consensus 37 ~~~~~~~~~~~~l~~~~~~~g~~yP~~ 63 (131)
.+..+...+.+..++|.++|| +=-++
T Consensus 270 a~~dAh~~~g~vyD~yk~~fg-r~S~D 295 (507)
T COG3227 270 AAVDAHYNAGKVYDYYKNTFG-RNSYD 295 (507)
T ss_pred hhHHHHhhcchHHHHHHHHhc-ccCcC
Confidence 345566778899999999999 44444
No 80
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=51.79 E-value=9.2 Score=26.01 Aligned_cols=11 Identities=27% Similarity=0.543 Sum_probs=9.4
Q ss_pred HHHHHHHHHHh
Q psy6925 108 LVVGHELAHQW 118 (131)
Q Consensus 108 ~~iahEiaHqW 118 (131)
.+++|||.|..
T Consensus 139 ~~~aHEiGH~l 149 (206)
T PF13583_consen 139 QTFAHEIGHNL 149 (206)
T ss_pred hHHHHHHHHHh
Confidence 57999999974
No 81
>COG4219 MecR1 Antirepressor regulating drug resistance, predicted signal transduction N-terminal membrane component [Transcription / Signal transduction mechanisms]
Probab=51.08 E-value=4.9 Score=29.41 Aligned_cols=29 Identities=21% Similarity=0.516 Sum_probs=17.2
Q ss_pred cccccCCCCCCHHHHHHHHHHHHHHHHHH
Q psy6925 89 VCLLVDSQNTSAITRQNIALVVGHELAHQ 117 (131)
Q Consensus 89 ~~~l~~~~~~~~~~~~~~~~~iahEiaHq 117 (131)
..++.+.+....-..++...+++||.+|-
T Consensus 173 p~IvlP~d~~~r~~~ee~~yIilHEl~Hl 201 (337)
T COG4219 173 PCIVLPADFVERLTDEELKYIILHELSHL 201 (337)
T ss_pred ceEEccHHHHhhcCHHhhhhhHhHHHhhh
Confidence 33444433333334456667999999984
No 82
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=50.59 E-value=13 Score=26.98 Aligned_cols=12 Identities=42% Similarity=0.653 Sum_probs=9.8
Q ss_pred HHHHHHHHHHHH
Q psy6925 106 IALVVGHELAHQ 117 (131)
Q Consensus 106 ~~~~iahEiaHq 117 (131)
...+||||+.|.
T Consensus 170 ~ayVlAHEyGHH 181 (292)
T PF04228_consen 170 QAYVLAHEYGHH 181 (292)
T ss_pred HHHHHHHHHHHH
Confidence 346899999996
No 83
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=49.78 E-value=14 Score=25.33 Aligned_cols=13 Identities=38% Similarity=0.991 Sum_probs=11.7
Q ss_pred HHHHHHHHHHHhh
Q psy6925 107 ALVVGHELAHQWF 119 (131)
Q Consensus 107 ~~~iahEiaHqWf 119 (131)
..++|||+.|-|.
T Consensus 94 gsiLAHE~mHa~L 106 (212)
T PF12315_consen 94 GSILAHELMHAWL 106 (212)
T ss_pred hhHHHHHHHHHHh
Confidence 4699999999997
No 84
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=49.62 E-value=9.8 Score=24.39 Aligned_cols=23 Identities=17% Similarity=0.423 Sum_probs=17.3
Q ss_pred HHHHHHHHHHHHHHHHHhhcccc
Q psy6925 38 GQFALHVASKVLPFYKDYFNIAY 60 (131)
Q Consensus 38 ~~~~~~~~~~~l~~~~~~~g~~y 60 (131)
.+...+.+..+++.+++..++.+
T Consensus 20 ~~~~~~~v~~a~~~w~~~~~i~f 42 (167)
T cd00203 20 SAQIQSLILIAMQIWRDYLNIRF 42 (167)
T ss_pred HHHHHHHHHHHHHHHHhhhCceE
Confidence 35566788899999998776554
No 85
>COG4758 Predicted membrane protein [Function unknown]
Probab=48.70 E-value=7.3 Score=27.11 Aligned_cols=19 Identities=32% Similarity=0.805 Sum_probs=13.5
Q ss_pred HHHHHhhcCccCcc---ccCCC
Q psy6925 113 ELAHQWFGNLVTME---WWTHL 131 (131)
Q Consensus 113 EiaHqWfG~~vt~~---~W~d~ 131 (131)
.+=||||||.=++. .|+|+
T Consensus 115 ~~r~~w~Gn~~~~~~~y~~dDi 136 (235)
T COG4758 115 DFRNQWFGNQRYYYDVYQWDDI 136 (235)
T ss_pred HHHhhccccceeeeccccccch
Confidence 34799999987544 67763
No 86
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=48.61 E-value=20 Score=24.38 Aligned_cols=19 Identities=21% Similarity=0.333 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHHHHHHHh
Q psy6925 100 AITRQNIALVVGHELAHQW 118 (131)
Q Consensus 100 ~~~~~~~~~~iahEiaHqW 118 (131)
..-..++...+.||+.|.|
T Consensus 139 ~l~~~R~~Kea~HElGH~~ 157 (194)
T PF07998_consen 139 ELFLERVCKEAVHELGHLF 157 (194)
T ss_dssp HHHHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHHHHHc
Confidence 4456777889999999985
No 87
>PF13402 M60-like: Peptidase M60-like family; PDB: 4FCA_A.
Probab=48.56 E-value=33 Score=24.55 Aligned_cols=25 Identities=4% Similarity=0.001 Sum_probs=19.2
Q ss_pred hHHHHHHHHHHHHHHHHHhhccccC
Q psy6925 37 QGQFALHVASKVLPFYKDYFNIAYP 61 (131)
Q Consensus 37 ~~~~~~~~~~~~l~~~~~~~g~~yP 61 (131)
+.+..++...++++...+..|.+..
T Consensus 144 d~~~ll~~~D~ii~~~~el~Gl~~~ 168 (307)
T PF13402_consen 144 DPEELLRFWDRIIDAEYELAGLDKS 168 (307)
T ss_dssp SSHHHHHHHHHHHHHHHHHTT-BSS
T ss_pred hHHHHHHHHHHHHHHHHHHhCCCcc
Confidence 4566778888999999999997663
No 88
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 []. The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=47.61 E-value=31 Score=24.41 Aligned_cols=27 Identities=19% Similarity=0.438 Sum_probs=23.9
Q ss_pred hHHHHHHHHHHHHHHHHHhhccccCCC
Q psy6925 37 QGQFALHVASKVLPFYKDYFNIAYPLP 63 (131)
Q Consensus 37 ~~~~~~~~~~~~l~~~~~~~g~~yP~~ 63 (131)
....++..+.+.+..++.++|+|.||+
T Consensus 196 ~isaALgyvahlv~lls~yL~v~Lpy~ 222 (302)
T PF10186_consen 196 EISAALGYVAHLVSLLSRYLGVPLPYP 222 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHhCCCCCCC
Confidence 456788999999999999999999986
No 89
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=47.52 E-value=16 Score=25.15 Aligned_cols=36 Identities=6% Similarity=0.024 Sum_probs=21.7
Q ss_pred CCeEEEEEecCCccchH-HHHHHHHHHHHHHHHHhhcc
Q psy6925 22 DGVLVRVYTPVGKREQG-QFALHVASKVLPFYKDYFNI 58 (131)
Q Consensus 22 ~~~~i~~~~~~~~~~~~-~~~~~~~~~~l~~~~~~~g~ 58 (131)
.+..|+++..+...... ......+.+|+..|... |.
T Consensus 37 ~~~~I~i~~~~~~~~~l~s~~~~al~~AI~~yNal-gl 73 (211)
T PF12388_consen 37 SPRTITIIGYTGGSQSLNSAWRTALDEAINNYNAL-GL 73 (211)
T ss_pred CCCEEEEEeCCCccccccHHHHHHHHHHHHHHHhh-CC
Confidence 34567777766332222 34556778888888775 53
No 90
>PF04036 DUF372: Domain of unknown function (DUF372); InterPro: IPR007179 This is a group of proteins of unknown function. It is found N-terminal to another domain of unknown function (IPR007181 from INTERPRO).; PDB: 2I52_B 2IEC_D 2OGF_C.
Probab=46.97 E-value=12 Score=18.20 Aligned_cols=16 Identities=38% Similarity=0.422 Sum_probs=11.6
Q ss_pred HHHHHHhhcCccCccc
Q psy6925 112 HELAHQWFGNLVTMEW 127 (131)
Q Consensus 112 hEiaHqWfG~~vt~~~ 127 (131)
-.+-|||-|-=|+++.
T Consensus 12 GaLyHQF~GtPvs~~~ 27 (38)
T PF04036_consen 12 GALYHQFVGTPVSPET 27 (38)
T ss_dssp HHHHHHHTT-EESTTT
T ss_pred HHHHHHhcCCcCCcch
Confidence 3578999998888753
No 91
>PF13203 DUF2201_N: Putative metallopeptidase domain
Probab=44.62 E-value=32 Score=24.54 Aligned_cols=22 Identities=14% Similarity=0.019 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHHhhcCccCcc
Q psy6925 105 NIALVVGHELAHQWFGNLVTME 126 (131)
Q Consensus 105 ~~~~~iahEiaHqWfG~~vt~~ 126 (131)
....+++||+-|--|++....+
T Consensus 59 ~~~~~l~HevlH~~~~H~~r~~ 80 (292)
T PF13203_consen 59 ERVGLLLHEVLHCLLRHPWRRG 80 (292)
T ss_pred HHHHHHHHHHHHHHccchhhhc
Confidence 3457899999999998875443
No 92
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=43.03 E-value=33 Score=22.95 Aligned_cols=16 Identities=31% Similarity=0.441 Sum_probs=12.5
Q ss_pred HHHHHHHHHHHHHHHH
Q psy6925 102 TRQNIALVVGHELAHQ 117 (131)
Q Consensus 102 ~~~~~~~~iahEiaHq 117 (131)
-..++...++||+.|.
T Consensus 121 ~~~R~~k~~~HElGH~ 136 (179)
T PRK13267 121 FEERVRKEVTHELGHT 136 (179)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4456667799999998
No 93
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=42.39 E-value=38 Score=15.56 Aligned_cols=23 Identities=13% Similarity=0.108 Sum_probs=12.0
Q ss_pred hHHHHHHHHHHHHHHHHHhhccc
Q psy6925 37 QGQFALHVASKVLPFYKDYFNIA 59 (131)
Q Consensus 37 ~~~~~~~~~~~~l~~~~~~~g~~ 59 (131)
+.+.+.....++++..++.+|-.
T Consensus 17 ~~~~A~~~~~~al~~~~~~~G~~ 39 (42)
T PF13374_consen 17 RYEEALELLEEALEIRERLLGPD 39 (42)
T ss_dssp -HHHHHHHHHHHHHHH-------
T ss_pred hcchhhHHHHHHHHHHHHHhccc
Confidence 34567778889999999999833
No 94
>PF06167 Peptidase_M90: Glucose-regulated metallo-peptidase M90; InterPro: IPR010384 MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=40.65 E-value=19 Score=25.53 Aligned_cols=11 Identities=55% Similarity=0.610 Sum_probs=9.1
Q ss_pred HHHHHHHHHHh
Q psy6925 108 LVVGHELAHQW 118 (131)
Q Consensus 108 ~~iahEiaHqW 118 (131)
.++.||+||+=
T Consensus 156 NVviHEfAH~L 166 (253)
T PF06167_consen 156 NVVIHEFAHKL 166 (253)
T ss_dssp -HHHHHHHHHH
T ss_pred chHHHHHHHHH
Confidence 59999999984
No 95
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=39.91 E-value=17 Score=22.82 Aligned_cols=14 Identities=29% Similarity=0.577 Sum_probs=11.0
Q ss_pred HHHHHHHHHHHHHH
Q psy6925 104 QNIALVVGHELAHQ 117 (131)
Q Consensus 104 ~~~~~~iahEiaHq 117 (131)
..+..++.|||+|.
T Consensus 107 d~vthvliHEIgHh 120 (136)
T COG3824 107 DQVTHVLIHEIGHH 120 (136)
T ss_pred hHhhhhhhhhhhhh
Confidence 34556899999997
No 96
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=39.87 E-value=4.6 Score=27.31 Aligned_cols=11 Identities=45% Similarity=0.806 Sum_probs=9.5
Q ss_pred HHHHHHHHHHH
Q psy6925 107 ALVVGHELAHQ 117 (131)
Q Consensus 107 ~~~iahEiaHq 117 (131)
..++|||++|.
T Consensus 141 a~~~aHElGH~ 151 (207)
T cd04273 141 AFTIAHELGHV 151 (207)
T ss_pred EEeeeeechhh
Confidence 46999999996
No 97
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=37.97 E-value=18 Score=23.16 Aligned_cols=23 Identities=4% Similarity=0.169 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHhhcccc
Q psy6925 38 GQFALHVASKVLPFYKDYFNIAY 60 (131)
Q Consensus 38 ~~~~~~~~~~~l~~~~~~~g~~y 60 (131)
.....+.+.++++..++..++.+
T Consensus 20 ~~~~~~~i~~A~~~W~~~~~l~F 42 (157)
T cd04278 20 RDDVRRAIARAFRVWSDVTPLTF 42 (157)
T ss_pred HHHHHHHHHHHHHHHHhhcCcee
Confidence 34556778889999988776544
No 98
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=37.92 E-value=21 Score=23.57 Aligned_cols=21 Identities=19% Similarity=0.268 Sum_probs=15.1
Q ss_pred HHHHHHHHHHHHHHHhhcccc
Q psy6925 40 FALHVASKVLPFYKDYFNIAY 60 (131)
Q Consensus 40 ~~~~~~~~~l~~~~~~~g~~y 60 (131)
...+.++++++..++..++.+
T Consensus 34 ~~~~~i~~A~~~w~~~~~l~F 54 (186)
T cd04277 34 AQQAAARDALEAWEDVADIDF 54 (186)
T ss_pred HHHHHHHHHHHHHHhhcCcee
Confidence 444667888888888887555
No 99
>COG3228 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=37.10 E-value=16 Score=25.67 Aligned_cols=10 Identities=50% Similarity=0.640 Sum_probs=9.2
Q ss_pred HHHHHHHHHH
Q psy6925 108 LVVGHELAHQ 117 (131)
Q Consensus 108 ~~iahEiaHq 117 (131)
.++.||+||+
T Consensus 149 NlViHEfAHk 158 (266)
T COG3228 149 NLVIHEFAHK 158 (266)
T ss_pred eehHhHHhhh
Confidence 5999999998
No 100
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=36.60 E-value=32 Score=23.38 Aligned_cols=30 Identities=7% Similarity=0.037 Sum_probs=18.1
Q ss_pred EEEEEecCCccchHHHHHHHHHHHHHHHHHhhc
Q psy6925 25 LVRVYTPVGKREQGQFALHVASKVLPFYKDYFN 57 (131)
Q Consensus 25 ~i~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g 57 (131)
+|.++...... +...+.++++++.-++.|+
T Consensus 9 pItyyI~~~~p---~~~r~aI~~A~~~Wn~~fe 38 (197)
T cd04276 9 PIVYYLDNTFP---EKYRDAIREGVLYWNKAFE 38 (197)
T ss_pred CEEEEecCCCc---HHHHHHHHHHHHHHHHHHH
Confidence 45555544322 4456677777777777765
No 101
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=35.78 E-value=21 Score=22.31 Aligned_cols=20 Identities=5% Similarity=0.185 Sum_probs=14.6
Q ss_pred HHHHHHHHHHHHHHhhcccc
Q psy6925 41 ALHVASKVLPFYKDYFNIAY 60 (131)
Q Consensus 41 ~~~~~~~~l~~~~~~~g~~y 60 (131)
..+.+.++++..++..++.+
T Consensus 24 ~~~~i~~A~~~w~~~t~i~F 43 (140)
T smart00235 24 VREAIARAFAEWSDVTCLRF 43 (140)
T ss_pred HHHHHHHHHHHHhcCCeeEE
Confidence 35678888888888776544
No 102
>cd06007 R3H_DEXH_helicase R3H domain of a group of proteins which also contain a DEXH-box helicase domain, and may function as ATP-dependent DNA or RNA helicases. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=35.68 E-value=41 Score=18.10 Aligned_cols=11 Identities=27% Similarity=0.298 Sum_probs=9.5
Q ss_pred HHHHHHHHHHH
Q psy6925 107 ALVVGHELAHQ 117 (131)
Q Consensus 107 ~~~iahEiaHq 117 (131)
.+.+.|++|++
T Consensus 29 eR~~vH~~a~~ 39 (59)
T cd06007 29 ERAVIHRLCRK 39 (59)
T ss_pred HHHHHHHHHHH
Confidence 36889999999
No 103
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=35.27 E-value=32 Score=26.95 Aligned_cols=15 Identities=40% Similarity=0.366 Sum_probs=11.4
Q ss_pred HHHHHHHHHHHHHHH
Q psy6925 103 RQNIALVVGHELAHQ 117 (131)
Q Consensus 103 ~~~~~~~iahEiaHq 117 (131)
.+....+++||++|-
T Consensus 207 ~~~~~~~~~HEi~Ha 221 (521)
T PF01457_consen 207 FQEFFRTVIHEIAHA 221 (521)
T ss_dssp -HHHHHHHHHHHHHH
T ss_pred hhcccceeeeeeeee
Confidence 345568999999995
No 104
>PRK15410 DgsA anti-repressor MtfA; Provisional
Probab=34.82 E-value=20 Score=25.52 Aligned_cols=11 Identities=36% Similarity=0.546 Sum_probs=9.6
Q ss_pred HHHHHHHHHHh
Q psy6925 108 LVVGHELAHQW 118 (131)
Q Consensus 108 ~~iahEiaHqW 118 (131)
.++.||+||+-
T Consensus 144 NVvIHEFAH~L 154 (260)
T PRK15410 144 NLIIHEVAHKL 154 (260)
T ss_pred chhHhHHHhHh
Confidence 59999999983
No 105
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=32.98 E-value=40 Score=22.58 Aligned_cols=13 Identities=31% Similarity=0.345 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHhh
Q psy6925 107 ALVVGHELAHQWF 119 (131)
Q Consensus 107 ~~~iahEiaHqWf 119 (131)
..++.||++|-|-
T Consensus 42 ~~l~iHElgH~~~ 54 (183)
T cd06160 42 AILGIHEMGHYLA 54 (183)
T ss_pred HHHHHHHHHHHHH
Confidence 3588999999875
No 106
>PRK15245 type III effector phosphothreonine lyase; Provisional
Probab=32.76 E-value=1.1e+02 Score=21.28 Aligned_cols=60 Identities=17% Similarity=0.270 Sum_probs=40.2
Q ss_pred CCeEEEEEecCCccc--hHHHHHHHHHHHHHHHHHhhccccCCCcccEEEecCC---CCcchhhhchhhhhcc
Q psy6925 22 DGVLVRVYTPVGKRE--QGQFALHVASKVLPFYKDYFNIAYPLPKIDLVAVPDF---SCGAMENWGLVTYREV 89 (131)
Q Consensus 22 ~~~~i~~~~~~~~~~--~~~~~~~~~~~~l~~~~~~~g~~yP~~~~~~v~~p~~---~~~~me~~g~i~~~~~ 89 (131)
-|-.+++|..|+.++ .....+..+++.++.+|..++ .= ++ .|+- ..-.-|+|..+.|+..
T Consensus 151 ~GAQfTLYvKpd~edsqYsa~~l~k~r~Fie~lE~~L~-~~-----gi--~pG~~P~SDV~pe~W~y~SYRNE 215 (241)
T PRK15245 151 LGAQFTLYVKPDQENSQYSASFLHKTRQFIECLESRLS-EN-----GV--ISGQCPESDVHPENWKYLSYRNE 215 (241)
T ss_pred ccceEEEEecCccccccCCHHHHHHHHHHHHHHHHHHH-Hc-----CC--CCCCCCccccCccccceeeehhh
Confidence 477899999998764 345667889999999999987 11 11 2211 1124567777777654
No 107
>PF10463 Peptidase_U49: Peptidase U49; InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death.
Probab=32.54 E-value=37 Score=23.32 Aligned_cols=18 Identities=39% Similarity=0.492 Sum_probs=15.1
Q ss_pred HHHHHHHHHHHhhcCccC
Q psy6925 107 ALVVGHELAHQWFGNLVT 124 (131)
Q Consensus 107 ~~~iahEiaHqWfG~~vt 124 (131)
.-++.||++|-=+|+...
T Consensus 102 ~fil~HE~~Hv~~~h~~~ 119 (206)
T PF10463_consen 102 AFILLHELAHVVLGHEGD 119 (206)
T ss_pred HHHHHHHHHHHHHcCccc
Confidence 358999999999998765
No 108
>PHA00527 hypothetical protein
Probab=32.22 E-value=45 Score=20.32 Aligned_cols=14 Identities=43% Similarity=0.337 Sum_probs=11.7
Q ss_pred HHHHHHHHHHHhhc
Q psy6925 107 ALVVGHELAHQWFG 120 (131)
Q Consensus 107 ~~~iahEiaHqWfG 120 (131)
..+|+||-||.=|.
T Consensus 74 ~~T~~HECAH~AF~ 87 (129)
T PHA00527 74 AATLVHECAHVAFY 87 (129)
T ss_pred HHHHHHHHHHHHHH
Confidence 46999999998773
No 109
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=31.94 E-value=50 Score=17.79 Aligned_cols=11 Identities=18% Similarity=0.329 Sum_probs=9.5
Q ss_pred HHHHHHHHHHH
Q psy6925 107 ALVVGHELAHQ 117 (131)
Q Consensus 107 ~~~iahEiaHq 117 (131)
.+.+.|++|++
T Consensus 30 eR~~vH~~a~~ 40 (60)
T cd02640 30 ERALIHQIAQK 40 (60)
T ss_pred HHHHHHHHHHH
Confidence 46889999999
No 110
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=31.29 E-value=42 Score=23.91 Aligned_cols=13 Identities=46% Similarity=0.491 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHhh
Q psy6925 107 ALVVGHELAHQWF 119 (131)
Q Consensus 107 ~~~iahEiaHqWf 119 (131)
..++.||++|-+.
T Consensus 119 isv~iHElgHa~~ 131 (263)
T cd06159 119 VGVVVHELSHGIL 131 (263)
T ss_pred HHHHHHHHHHHHH
Confidence 4689999999875
No 111
>COG4101 Predicted mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=31.25 E-value=9.5 Score=23.85 Aligned_cols=9 Identities=56% Similarity=1.394 Sum_probs=7.6
Q ss_pred HHHhhcCcc
Q psy6925 115 AHQWFGNLV 123 (131)
Q Consensus 115 aHqWfG~~v 123 (131)
||.||||-+
T Consensus 78 ah~w~G~rL 86 (142)
T COG4101 78 AHTWYGNRL 86 (142)
T ss_pred eeeeeccce
Confidence 899999864
No 112
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=30.88 E-value=39 Score=22.67 Aligned_cols=36 Identities=17% Similarity=0.281 Sum_probs=21.1
Q ss_pred CCeEEEEEecCCccchHHHHHHHHHHHHHHHHHhhcccc
Q psy6925 22 DGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAY 60 (131)
Q Consensus 22 ~~~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~y 60 (131)
+|..+.+....+. .+...+.+.++++..++..++.+
T Consensus 5 ~~~~~~~~f~~~~---~~~~r~~I~~A~~~W~~~t~i~F 40 (198)
T cd04327 5 NGTVLRIAFLGGP---DAFLKDKVRAAAREWLPYANLKF 40 (198)
T ss_pred CCCeEEEEeCCCC---cHHHHHHHHHHHHHHhhhcCeEE
Confidence 4545555554442 12334567888888888766544
No 113
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=30.83 E-value=42 Score=22.67 Aligned_cols=16 Identities=25% Similarity=0.310 Sum_probs=11.8
Q ss_pred HHHHHHHHHHHHHHhh
Q psy6925 104 QNIALVVGHELAHQWF 119 (131)
Q Consensus 104 ~~~~~~iahEiaHqWf 119 (131)
.+...++.||+.|-=.
T Consensus 20 ~~~l~t~~HE~gHal~ 35 (200)
T PF13398_consen 20 FRLLVTFVHELGHALA 35 (200)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3455699999999644
No 114
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=30.43 E-value=40 Score=23.37 Aligned_cols=13 Identities=46% Similarity=0.534 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHhh
Q psy6925 107 ALVVGHELAHQWF 119 (131)
Q Consensus 107 ~~~iahEiaHqWf 119 (131)
..++.||++|-..
T Consensus 54 ~~v~iHElgH~~~ 66 (227)
T cd06164 54 ASVLLHELGHSLV 66 (227)
T ss_pred HHHHHHHHHHHHH
Confidence 3688999999764
No 115
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=29.74 E-value=41 Score=24.23 Aligned_cols=13 Identities=31% Similarity=0.179 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHhh
Q psy6925 107 ALVVGHELAHQWF 119 (131)
Q Consensus 107 ~~~iahEiaHqWf 119 (131)
..++.||++|-+-
T Consensus 136 isvvvHElgHal~ 148 (277)
T cd06162 136 ISGVVHEMGHGVA 148 (277)
T ss_pred HHHHHHHHHHHHH
Confidence 4689999999763
No 116
>PF12652 CotJB: CotJB protein; InterPro: IPR024207 The cotJ operon proteins affect spore coat composition, and is controlled by sigma E. The genes, which include CotJB, are either required for the normal formation of the inner layers of the coat or are themselves structural components of the coat []. CotJB has been identified as a spore coat protein [].
Probab=29.61 E-value=96 Score=17.75 Aligned_cols=21 Identities=5% Similarity=0.023 Sum_probs=16.5
Q ss_pred HHHHHHHHHHHHHHHHhhcccc
Q psy6925 39 QFALHVASKVLPFYKDYFNIAY 60 (131)
Q Consensus 39 ~~~~~~~~~~l~~~~~~~g~~y 60 (131)
+.......++.+-|++.|| |.
T Consensus 36 ~~~~~~~~~l~~~Ye~~yG-PL 56 (78)
T PF12652_consen 36 NEYSKQRKQLKKEYEKRYG-PL 56 (78)
T ss_pred HHHHHHHHHHHHHHHHHhC-CC
Confidence 3456677899999999999 54
No 117
>PF08334 T2SG: Type II secretion system (T2SS), protein G; InterPro: IPR013545 The general secretion pathway, or type II pullulanase-like machinery, is responsible for the transport of proteins from the periplasm across the outer membrane in Gram-negative bacteria [, ]. This entry includes protein G (e.g. P45773 from SWISSPROT, Q00514 from SWISSPROT) involved in this pathway. The PulG protein (P15746 from SWISSPROT) is thought to be anchored in the inner membrane with its C terminus directed towards the periplasm []. Together with other members of the secretion machinery, it is thought to assemble into a pilus-like structure that may function as a dynamic mechanism to push secreted proteins out of the cell. The polypeptide is organised into a long N-terminal alpha-helix followed by a loop region that separates it from a C-terminal anti-parallel beta-sheet []. ; PDB: 1T92_A 3G20_B 3GN9_B 2KEP_A 3FU1_B.
Probab=29.59 E-value=87 Score=18.67 Aligned_cols=25 Identities=20% Similarity=0.255 Sum_probs=17.6
Q ss_pred hHHHHHHHHHHHHHHHHHhhccccCC
Q psy6925 37 QGQFALHVASKVLPFYKDYFNIAYPL 62 (131)
Q Consensus 37 ~~~~~~~~~~~~l~~~~~~~g~~yP~ 62 (131)
.++.-+..+..+|+.|-.-.| .||-
T Consensus 13 ~a~~~l~~i~~Ale~Y~~d~G-~yP~ 37 (108)
T PF08334_consen 13 RAKADLRTIKTALEMYYLDNG-RYPS 37 (108)
T ss_dssp HHHHHHHHHHHHHHHHHHHHS-S---
T ss_pred HHHHHHHHHHHHHHHHHHhhC-CCcC
Confidence 344567788999999998899 9997
No 118
>PF09865 DUF2092: Predicted periplasmic protein (DUF2092); InterPro: IPR019207 This entry represents various hypothetical prokaryotic proteins of unknown function.
Probab=29.54 E-value=70 Score=22.07 Aligned_cols=43 Identities=21% Similarity=0.349 Sum_probs=27.8
Q ss_pred CCeEEEEEecCCccchHHHHHHHHHHHHHHHHHhhccccCCCc
Q psy6925 22 DGVLVRVYTPVGKREQGQFALHVASKVLPFYKDYFNIAYPLPK 64 (131)
Q Consensus 22 ~~~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~yP~~~ 64 (131)
+|-.+.+|.+....-....+-..+.++++.+.+.+|++.|...
T Consensus 69 DGkt~Tl~~~~~n~Ya~~~aP~tid~~i~~l~~~~gi~~P~aD 111 (214)
T PF09865_consen 69 DGKTFTLYDPNQNVYAQADAPGTIDAAIDYLRDKYGIELPLAD 111 (214)
T ss_pred CCCEEEEEcCcCCeEEeccCCCCHHHHHHHHHHhhCCCccHHH
Confidence 4556666655444322223334667889999999999888753
No 119
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=28.91 E-value=44 Score=22.71 Aligned_cols=13 Identities=38% Similarity=0.432 Sum_probs=10.2
Q ss_pred HHHHHHHHHHHhh
Q psy6925 107 ALVVGHELAHQWF 119 (131)
Q Consensus 107 ~~~iahEiaHqWf 119 (131)
..++.||++|-.-
T Consensus 39 ~~v~iHElgH~~~ 51 (208)
T cd06161 39 LSVLLHELGHALV 51 (208)
T ss_pred HHHHHHHHHHHHH
Confidence 3689999999753
No 120
>KOG3658|consensus
Probab=28.83 E-value=14 Score=29.89 Aligned_cols=11 Identities=36% Similarity=0.893 Sum_probs=9.9
Q ss_pred HHHHHHHHHHh
Q psy6925 108 LVVGHELAHQW 118 (131)
Q Consensus 108 ~~iahEiaHqW 118 (131)
-++|||+.|.|
T Consensus 394 lt~AHEiGHNf 404 (764)
T KOG3658|consen 394 LTLAHEIGHNF 404 (764)
T ss_pred eeehhhhcccc
Confidence 48999999997
No 121
>PF03929 PepSY_TM: PepSY-associated TM helix; InterPro: IPR005625 This domain represents a conserved transmembrane (TM) helix that is found in bacterial proteins. Coil residues are significantly more conserved than other residues and are frequently found within channels and transporters, where they introduce the flexibility and polarity required for transport across the membrane []. This TM helix associates with PepSY (peptidase (M4) and YpeB of subtilis). PepSY is a repeated region first identified in Thermoanaerobacter tengcongensis. The PepSY domain functions in the control of M4 peptidases through their propeptide and in the germination of spores. It may also play a part in regulating protease activity [].
Probab=28.33 E-value=10 Score=16.97 Aligned_cols=9 Identities=44% Similarity=1.516 Sum_probs=5.9
Q ss_pred HHhhcCccC
Q psy6925 116 HQWFGNLVT 124 (131)
Q Consensus 116 HqWfG~~vt 124 (131)
|+|++...+
T Consensus 5 H~w~~~i~a 13 (27)
T PF03929_consen 5 HKWFGDIFA 13 (27)
T ss_pred HHHHHHHHH
Confidence 668876543
No 122
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=26.38 E-value=69 Score=13.72 Aligned_cols=20 Identities=15% Similarity=0.428 Sum_probs=15.1
Q ss_pred HHHHHHHHHHHHHhhccccCC
Q psy6925 42 LHVASKVLPFYKDYFNIAYPL 62 (131)
Q Consensus 42 ~~~~~~~l~~~~~~~g~~yP~ 62 (131)
.....++++.|++... .||-
T Consensus 13 ~g~~~~A~~~~~~~~~-~~P~ 32 (33)
T PF13174_consen 13 LGDYDEAIEYFQRLIK-RYPD 32 (33)
T ss_dssp HCHHHHHHHHHHHHHH-HSTT
T ss_pred ccCHHHHHHHHHHHHH-HCcC
Confidence 3466788888888887 7773
No 123
>COG5504 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=26.31 E-value=2.1e+02 Score=20.61 Aligned_cols=14 Identities=21% Similarity=0.572 Sum_probs=10.9
Q ss_pred HHHHHHHHHHHHHH
Q psy6925 104 QNIALVVGHELAHQ 117 (131)
Q Consensus 104 ~~~~~~iahEiaHq 117 (131)
.++-.+|+||+=|.
T Consensus 138 ~~v~aliaHE~HH~ 151 (280)
T COG5504 138 TSVPALIAHEYHHN 151 (280)
T ss_pred cchHHHHHHHHHhh
Confidence 34567999999885
No 124
>PF14475 Mso1_Sec1_bdg: Sec1-binding region of Mso1
Probab=25.41 E-value=88 Score=15.57 Aligned_cols=17 Identities=18% Similarity=0.421 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHhhccccC
Q psy6925 44 VASKVLPFYKDYFNIAYP 61 (131)
Q Consensus 44 ~~~~~l~~~~~~~g~~yP 61 (131)
..+....||.+.+. |||
T Consensus 22 v~r~l~~yY~~k~~-~~P 38 (41)
T PF14475_consen 22 VHRVLRKYYTEKGR-PFP 38 (41)
T ss_pred HHHHHHHHHHHcCC-CCC
Confidence 45666778888865 887
No 125
>PF03536 VRP3: Salmonella virulence-associated 28kDa protein; InterPro: IPR003519 Salmonella typhimurium contains a 90kb plasmid that is associated with virulence. This plasmid encodes at least 6 genes needed by the bacterium for invading host macrophages during infection. These include the 70kDa mkaA protein [], a recognised virulence factor. Deletion studies into the virulence plasmid have shown that an open reading frame encoding a 28kDa protein was needed for successful invasion of the host. This protein, designated mkfA [], VRP4 [] or VirA [] by different groups, is utilised by the microbe upon entry into macrophages, although the exact mechanism is unclear.; PDB: 2Z8P_A 2Z8O_B 2Z8M_B 2Z8N_A 2P1W_A 2Q8Y_A 3I0U_B 3BO6_B.
Probab=25.33 E-value=1.5e+02 Score=20.56 Aligned_cols=61 Identities=16% Similarity=0.312 Sum_probs=34.9
Q ss_pred CCeEEEEEecCCccc--hHHHHHHHHHHHHHHHHHhhccccCCCcccEE--EecCCCCcchhhhchhhhhcc
Q psy6925 22 DGVLVRVYTPVGKRE--QGQFALHVASKVLPFYKDYFNIAYPLPKIDLV--AVPDFSCGAMENWGLVTYREV 89 (131)
Q Consensus 22 ~~~~i~~~~~~~~~~--~~~~~~~~~~~~l~~~~~~~g~~yP~~~~~~v--~~p~~~~~~me~~g~i~~~~~ 89 (131)
-|-.+++|..|+.++ .....+..+++.++.+|..++ .= ++. ..|. ..-.-++|..+.|+..
T Consensus 150 ~GAQfTLY~kpd~eds~Ys~~~l~k~r~fi~~iE~~L~-~a-----gi~pg~~P~-SDV~p~~W~yaSYRNE 214 (240)
T PF03536_consen 150 QGAQFTLYVKPDREDSQYSAEYLHKTRQFIESIESELS-EA-----GIQPGERPE-SDVSPEHWSYASYRNE 214 (240)
T ss_dssp SS-SEEEEEE-SSTTS---HHHHHHHHHHHHHHHHHHH-HT-----T--B----T-T-B--TT-SSEEEEET
T ss_pred ccceEEEEecCccccccCCHHHHHHHHHHHHHHHHHHH-Hc-----CCCCCCCCc-cccCccccceeeehhh
Confidence 477899999998773 355667889999999999886 11 221 0121 1125567777777654
No 126
>PTZ00337 surface protease GP63; Provisional
Probab=24.88 E-value=59 Score=25.99 Aligned_cols=15 Identities=0% Similarity=0.133 Sum_probs=12.9
Q ss_pred HHHHHHHHHHHHhhc
Q psy6925 43 HVASKVLPFYKDYFN 57 (131)
Q Consensus 43 ~~~~~~l~~~~~~~g 57 (131)
..++.+++++++.+-
T Consensus 133 ~~lp~a~~~~~~~L~ 147 (567)
T PTZ00337 133 QTLPAAIQLHAERLS 147 (567)
T ss_pred HHHHHHHHHHHHheE
Confidence 567999999999987
No 127
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=24.83 E-value=45 Score=22.36 Aligned_cols=20 Identities=10% Similarity=0.197 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHHHhhcccc
Q psy6925 41 ALHVASKVLPFYKDYFNIAY 60 (131)
Q Consensus 41 ~~~~~~~~l~~~~~~~g~~y 60 (131)
..+.+.+|++.+++...+.|
T Consensus 18 ~~~~I~~A~~~~~~~TCirF 37 (182)
T cd04283 18 ERAVIEKAMQEFETLTCVRF 37 (182)
T ss_pred HHHHHHHHHHHHHhCCceee
Confidence 44678899999998776554
No 128
>COG4865 Glutamate mutase epsilon subunit [Amino acid transport and metabolism]
Probab=24.24 E-value=41 Score=25.18 Aligned_cols=30 Identities=20% Similarity=0.455 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHHh---hccccCCCcccEEEecCC
Q psy6925 43 HVASKVLPFYKDY---FNIAYPLPKIDLVAVPDF 73 (131)
Q Consensus 43 ~~~~~~l~~~~~~---~g~~yP~~~~~~v~~p~~ 73 (131)
+.+...+-.|++. .+ +=||+-+.--.+|.+
T Consensus 195 qYVDRL~G~YeE~Gi~IN-REpFgPLTgTLVPP~ 227 (485)
T COG4865 195 QYVDRLMGMYEEHGIRIN-REPFGPLTGTLVPPF 227 (485)
T ss_pred HHHHHHHhHHHhcCeeec-cccCCCCcccccChH
Confidence 4556666777653 33 567877776666643
No 129
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=24.12 E-value=70 Score=17.14 Aligned_cols=13 Identities=38% Similarity=0.629 Sum_probs=10.3
Q ss_pred HHHHHHHHHHHhhc
Q psy6925 107 ALVVGHELAHQWFG 120 (131)
Q Consensus 107 ~~~iahEiaHqWfG 120 (131)
.+.+.|++|+. +|
T Consensus 30 eR~~vH~lA~~-~g 42 (60)
T cd02641 30 DRLLVHELAEE-LG 42 (60)
T ss_pred HHHHHHHHHHH-cC
Confidence 36889999998 44
No 130
>PF00903 Glyoxalase: Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily This Prosite is specific to glyoxalases This Prosite is specific to Extradiol ring-cleavage dioxygenases This prints entry is specific to bleomycin resistance protein.; InterPro: IPR004360 Glyoxalase I (4.4.1.5 from EC) (lactoylglutathione lyase) catalyzes the first step of the glyoxal pathway. S-lactoylglutathione is then converted by glyoxalase II to lactic acid []. Glyoxalase I is an ubiquitous enzyme which binds one mole of zinc per subunit. The bacterial and yeast enzymes are monomeric while the mammalian one is homodimeric. The sequence of glyoxalase I is well conserved. The domain represented by this entry is found in glyoxalase I and in other related proteins, including fosfomycin resistance proteins FosB [], FosA [], FosX [] and dioxygenases (eg. 4-hydroxyphenylpyruvate dioxygenase).; PDB: 1CJX_A 1NPB_E 3OJT_C 3OJN_A 2IG9_B 3OJJ_B 3OJK_D 1Q0C_D 1F1X_C 3BZA_B ....
Probab=24.03 E-value=63 Score=18.84 Aligned_cols=14 Identities=29% Similarity=0.627 Sum_probs=12.2
Q ss_pred HHHHHHHHHHhhcc
Q psy6925 45 ASKVLPFYKDYFNI 58 (131)
Q Consensus 45 ~~~~l~~~~~~~g~ 58 (131)
++++++||++.||.
T Consensus 12 ~~~~~~FY~~~lG~ 25 (128)
T PF00903_consen 12 LEKAIDFYTDVLGF 25 (128)
T ss_dssp HHHHHHHHHHTTTS
T ss_pred HHHHHHHHHHHhCC
Confidence 57899999999994
No 131
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=23.47 E-value=64 Score=26.10 Aligned_cols=13 Identities=38% Similarity=0.516 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHHH
Q psy6925 105 NIALVVGHELAHQ 117 (131)
Q Consensus 105 ~~~~~iahEiaHq 117 (131)
...++++||++|-
T Consensus 255 ~~~rv~~HEi~HA 267 (622)
T PTZ00257 255 GTTRTVTHEVAHA 267 (622)
T ss_pred HHHHHHHHHHHHH
Confidence 3458999999995
No 132
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=23.45 E-value=74 Score=25.56 Aligned_cols=11 Identities=36% Similarity=0.748 Sum_probs=8.7
Q ss_pred HHHHHHHHHHH
Q psy6925 107 ALVVGHELAHQ 117 (131)
Q Consensus 107 ~~~iahEiaHq 117 (131)
..+++||+.|.
T Consensus 381 V~TLaHElGHs 391 (598)
T COG1164 381 VFTLAHELGHS 391 (598)
T ss_pred HHHHHHHccHH
Confidence 36999999863
No 133
>COG2764 PhnB Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.45 E-value=68 Score=20.46 Aligned_cols=14 Identities=43% Similarity=0.776 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHhhc
Q psy6925 44 VASKVLPFYKDYFN 57 (131)
Q Consensus 44 ~~~~~l~~~~~~~g 57 (131)
.++++|+||++-||
T Consensus 11 n~~~Al~fY~~vFg 24 (136)
T COG2764 11 NAREALAFYKEVFG 24 (136)
T ss_pred CHHHHHHHHHHHhC
Confidence 46789999999999
No 134
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=23.31 E-value=51 Score=23.78 Aligned_cols=11 Identities=45% Similarity=0.863 Sum_probs=9.4
Q ss_pred HHHHHHHHHHH
Q psy6925 107 ALVVGHELAHQ 117 (131)
Q Consensus 107 ~~~iahEiaHq 117 (131)
..+||||+.|.
T Consensus 168 aYViAHEVGHH 178 (295)
T COG2321 168 AYVIAHEVGHH 178 (295)
T ss_pred HHHHHhhhhHH
Confidence 46999999996
No 135
>PF10994 DUF2817: Protein of unknown function (DUF2817); InterPro: IPR021259 This family of proteins has no known function.
Probab=23.25 E-value=2.3e+02 Score=21.09 Aligned_cols=15 Identities=40% Similarity=0.634 Sum_probs=12.4
Q ss_pred HHHHHHHhhcCccCc
Q psy6925 111 GHELAHQWFGNLVTM 125 (131)
Q Consensus 111 ahEiaHqWfG~~vt~ 125 (131)
+++-|.+|||..|+.
T Consensus 237 ~~~~a~~~~g~~v~~ 251 (341)
T PF10994_consen 237 ALERARRWWGPDVTS 251 (341)
T ss_pred HHHHHHHHhCcceee
Confidence 678899999988764
No 136
>TIGR00388 glyQ glycyl-tRNA synthetase, tetrameric type, alpha subunit. This tetrameric form of glycyl-tRNA synthetase (2 alpha, 2 beta) is found in the majority of completed eubacterial genomes, with the two genes fused in a few species. A substantially different homodimeric form (not recognized by this model) replaces this form in the Archaea, animals, yeasts, and some eubacteria.
Probab=22.66 E-value=1.2e+02 Score=21.92 Aligned_cols=54 Identities=13% Similarity=0.141 Sum_probs=34.4
Q ss_pred hhccccCCCcccEEEecCCCCcchhhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHhh
Q psy6925 55 YFNIAYPLPKIDLVAVPDFSCGAMENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWF 119 (131)
Q Consensus 55 ~~g~~yP~~~~~~v~~p~~~~~~me~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqWf 119 (131)
.+....|++-|+.+.-.+..++-+..-|.|...|.. . .....+.+|.++|..|.
T Consensus 227 ll~~~L~lPAYd~~lK~SH~FNlLDARgaIsvtER~----------~-yI~riR~la~~~a~~y~ 280 (293)
T TIGR00388 227 LLENGLPLPAYEYVLKCSHSFNLLDARKAISVTERQ----------R-YILRIRNLAKGVAEAYY 280 (293)
T ss_pred HHHcCCCCCcHHHHHHHHHHHhhHhhccCccHHHHH----------H-HHHHHHHHHHHHHHHHH
Confidence 444577888888876544455666777776655442 1 11223688999999886
No 137
>PF09349 OHCU_decarbox: OHCU decarboxylase; InterPro: IPR018020 The proteins in this entry are OHCU decarboxylase, an enzyme of the purine catabolism that catalyses the conversion of OHCU into S(+)-allantoin []; it is the third step of the conversion of uric acid (a purine derivative) to allantoin. Step one is catalysed by urate oxidase (IPR002042 from INTERPRO) and step two is catalysed by hydroxyisourate hydrolase (IPR000895 from INTERPRO). ; PDB: 3O7I_B 3O7H_B 3O7J_A 3O7K_A 2Q37_A 2O70_B 2O73_C 2O74_C 2O8I_A.
Probab=22.61 E-value=1.3e+02 Score=19.59 Aligned_cols=22 Identities=14% Similarity=0.279 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHHHHhhcccc
Q psy6925 39 QFALHVASKVLPFYKDYFNIAY 60 (131)
Q Consensus 39 ~~~~~~~~~~l~~~~~~~g~~y 60 (131)
...+......-+.|+++||.||
T Consensus 93 ~~~~~~L~~lN~~Y~~kFGf~F 114 (159)
T PF09349_consen 93 EEELAELAALNQAYEEKFGFPF 114 (159)
T ss_dssp HHHHHHHHHHHHHHHHHHSS--
T ss_pred HHHHHHHHHHHHHHHHHcCCce
Confidence 4456677888899999999555
No 138
>cd06588 PhnB_like Escherichia coli PhnB and similar proteins; the E. coli phnB gene is found next to an operon involved in the cleavage of carbon-phosphorus bonds in unactivated alkylphosphonates. The Escherichia coli phnB gene is found next to an operon of fourteen genes (phnC-to-phnP) related to the cleavage of carbon-phosphorus (C-P) bonds in unactivated alkylphosphonates, supporting bacterial growth on alkylphosphonates as the sole phosphorus source. It was originally considered part of that operon. PhnB appears to play no direct catalytic role in the usage of alkylphosphonate. Although many of the proteins in this family have been annotated as 3-demethylubiquinone-9 3-methyltransferase enzymes by automatic annotation programs, the experimental evidence for this assignment is lacking. In Escherichia coli, the gene coding 3-demethylubiquinone-9 3-methyltransferase enzyme is ubiG, which belongs to the AdoMet-MTase protein family. PhnB-like proteins adopt a structural fold similar to
Probab=22.28 E-value=75 Score=19.30 Aligned_cols=14 Identities=36% Similarity=0.674 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHhhc
Q psy6925 44 VASKVLPFYKDYFN 57 (131)
Q Consensus 44 ~~~~~l~~~~~~~g 57 (131)
.++++++||++.||
T Consensus 10 ~~~eAi~FY~~~fg 23 (128)
T cd06588 10 NAEEALEFYQSVFG 23 (128)
T ss_pred CHHHHHHHHHHHhC
Confidence 46889999999998
No 139
>KOG3624|consensus
Probab=21.79 E-value=89 Score=25.38 Aligned_cols=19 Identities=37% Similarity=0.721 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHHHHHHHh
Q psy6925 100 AITRQNIALVVGHELAHQW 118 (131)
Q Consensus 100 ~~~~~~~~~~iahEiaHqW 118 (131)
......+..+|+||+.|-.
T Consensus 512 ~~nyg~iG~vigHEl~H~F 530 (687)
T KOG3624|consen 512 YLNYGGIGFVIGHELTHGF 530 (687)
T ss_pred hhhhHHHHHHHHHHHhhcc
Confidence 3444556679999999964
No 140
>PF02633 Creatininase: Creatinine amidohydrolase; InterPro: IPR003785 This family includes the enzymes creatininase and 2-amino-5-formylamino-6-ribosylaminopyrimidin-4(3H)-one 5'-monophosphate deformylase, also known as formamide hydrolase. Creatinase or creatinine amidohydrolase (3.5.2.10 from EC) catalyses the hydrolysis of creatinine to creatine, which can then be metabolised to urea and sarcosine by creatinase (3.5.3.3 from EC). Creatininase is a member of the urease-related amidohydrolase superfamily []. Formamide hydrolase catalyzes the hydrolysis of the formamide of 2-amino-5-formylamino-6-ribosylamino-4(3H)-pyrimidinone 5'-monophosphate (FAPy) to form 2,5-diamino-6-ribosylamino-4(3H)-pyrimidinone 5'-phosphate (APy) (3.5.1.102 from EC). ; PDB: 3A6K_F 3A6F_A 3A6D_B 1J2U_B 3A6J_C 1J2T_A 3A6G_C 3A6H_F 1Q3K_E 3A6L_C ....
Probab=21.60 E-value=2.1e+02 Score=19.74 Aligned_cols=59 Identities=17% Similarity=0.160 Sum_probs=28.3
Q ss_pred HHHHHHHHHHhhccccCCCcccEEEecCCCCc----chhhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHHhhc
Q psy6925 45 ASKVLPFYKDYFNIAYPLPKIDLVAVPDFSCG----AMENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQWFG 120 (131)
Q Consensus 45 ~~~~l~~~~~~~g~~yP~~~~~~v~~p~~~~~----~me~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHqWfG 120 (131)
+..+.+-..+.+| . .++.|..++| -+..||.+.++... ...+..-+...++++-|.
T Consensus 42 a~~~a~~~a~~~~-~-------~lv~P~i~yG~s~~h~~fpGTisl~~~t------------~~~~l~di~~sl~~~Gf~ 101 (237)
T PF02633_consen 42 AEAVAERAAERLG-E-------ALVLPPIPYGCSPHHMGFPGTISLSPET------------LIALLRDILRSLARHGFR 101 (237)
T ss_dssp HHHHHHHHHHHHT-H-------EEE---B--BB-GCCTTSTT-BBB-HHH------------HHHHHHHHHHHHHHHT--
T ss_pred HHHHHHHHHHHCC-c-------EEEeCCCccccCcccCCCCCeEEeCHHH------------HHHHHHHHHHHHHHcCCC
Confidence 3444555556666 2 4555655544 34456666655442 234456777788888886
Q ss_pred Ccc
Q psy6925 121 NLV 123 (131)
Q Consensus 121 ~~v 123 (131)
++|
T Consensus 102 ~iv 104 (237)
T PF02633_consen 102 RIV 104 (237)
T ss_dssp EEE
T ss_pred EEE
Confidence 654
No 141
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=21.53 E-value=85 Score=16.93 Aligned_cols=12 Identities=25% Similarity=0.163 Sum_probs=9.2
Q ss_pred HHHHHHHHHHHh
Q psy6925 107 ALVVGHELAHQW 118 (131)
Q Consensus 107 ~~~iahEiaHqW 118 (131)
.+.+.|.+||++
T Consensus 30 eRriih~la~~l 41 (60)
T cd02639 30 ERRIVHLLASRL 41 (60)
T ss_pred HHHHHHHHHHHc
Confidence 357889999875
No 142
>PRK10148 hypothetical protein; Provisional
Probab=20.58 E-value=84 Score=20.05 Aligned_cols=14 Identities=7% Similarity=0.451 Sum_probs=11.7
Q ss_pred HHHHHHHHHHHhhc
Q psy6925 44 VASKVLPFYKDYFN 57 (131)
Q Consensus 44 ~~~~~l~~~~~~~g 57 (131)
.++++++||++.||
T Consensus 12 ~a~eAi~FY~~~Fg 25 (147)
T PRK10148 12 NCADAIAYYQQTLG 25 (147)
T ss_pred CHHHHHHHHHHHhC
Confidence 36789999999997
No 143
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=20.25 E-value=59 Score=22.73 Aligned_cols=69 Identities=16% Similarity=0.124 Sum_probs=34.7
Q ss_pred HHHHHHHHHHHHHHHhhccccC-CC-cccEEEecCCCCcchhhhchhhhhccccccCCCCCCHHHHHHHHHHHHHHHHHH
Q psy6925 40 FALHVASKVLPFYKDYFNIAYP-LP-KIDLVAVPDFSCGAMENWGLVTYREVCLLVDSQNTSAITRQNIALVVGHELAHQ 117 (131)
Q Consensus 40 ~~~~~~~~~l~~~~~~~g~~yP-~~-~~~~v~~p~~~~~~me~~g~i~~~~~~~l~~~~~~~~~~~~~~~~~iahEiaHq 117 (131)
...+.+.+|++.+++...+.|- .. .-+.|.+-. ..|.-++.|..- +...+-+.+.... ..+++||+.|-
T Consensus 61 ~~~~~I~~Am~~~~~~TCirF~~rt~e~~yi~i~~-~~GC~S~vG~~g-g~q~isl~~~C~~-------~Gti~HEl~Ha 131 (230)
T cd04282 61 NAKGVILKAFEMYRLKSCVDFKPYEGESNYIFFFK-GSGCWSMVGDQQ-GGQNLSIGAGCDY-------KATVEHEFLHA 131 (230)
T ss_pred HHHHHHHHHHHHHHhCCCeeEEECCCCCcEEEEEc-CCCeeeccCccC-CeEEEEECCCcCC-------CchHHHHHHHH
Confidence 3456788999999998775542 11 123333221 223333333321 1222222322211 25899999994
Done!