Query psy1606
Match_columns 285
No_of_seqs 223 out of 1278
Neff 8.5
Searched_HMMs 46136
Date Fri Aug 16 19:48:35 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1606.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1606hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3607|consensus 100.0 1.1E-55 2.4E-60 427.7 24.3 281 1-282 93-391 (716)
2 PF01421 Reprolysin: Reprolysi 100.0 1.5E-53 3.4E-58 361.3 17.5 197 85-281 1-199 (199)
3 cd04269 ZnMc_adamalysin_II_lik 100.0 2.2E-51 4.8E-56 346.7 21.3 194 85-279 1-194 (194)
4 cd04273 ZnMc_ADAMTS_like Zinc- 100.0 1.4E-47 3E-52 326.5 15.4 189 85-278 1-207 (207)
5 cd04272 ZnMc_salivary_gland_MP 100.0 5.7E-45 1.2E-49 313.2 18.2 189 86-278 2-220 (220)
6 cd04267 ZnMc_ADAM_like Zinc-de 100.0 7.3E-43 1.6E-47 294.3 18.9 182 85-270 1-192 (192)
7 cd04271 ZnMc_ADAM_fungal Zinc- 100.0 9.1E-43 2E-47 299.6 17.1 186 86-278 2-228 (228)
8 cd04270 ZnMc_TACE_like Zinc-de 100.0 2.2E-41 4.8E-46 294.4 14.5 193 86-280 2-240 (244)
9 KOG3538|consensus 100.0 8E-30 1.7E-34 256.3 17.5 273 2-281 82-384 (845)
10 KOG3658|consensus 100.0 9.4E-30 2E-34 237.4 16.3 273 5-279 107-463 (764)
11 PF13688 Reprolysin_5: Metallo 100.0 6.4E-29 1.4E-33 209.9 13.5 171 84-258 2-196 (196)
12 PF13583 Reprolysin_4: Metallo 99.9 1.4E-23 3E-28 178.3 16.4 186 84-276 2-205 (206)
13 cd00203 ZnMc Zinc-dependent me 99.9 1.1E-23 2.4E-28 173.3 11.8 148 85-268 1-166 (167)
14 PF13574 Reprolysin_2: Metallo 99.9 7.2E-25 1.6E-29 181.3 4.5 154 107-270 2-172 (173)
15 PF13582 Reprolysin_3: Metallo 99.8 1.3E-19 2.8E-24 141.9 5.8 122 110-232 2-124 (124)
16 PRK13267 archaemetzincin-like 98.9 5.6E-09 1.2E-13 86.4 9.7 73 170-250 69-154 (179)
17 PF07998 Peptidase_M54: Peptid 98.4 5.7E-07 1.2E-11 75.0 5.9 67 175-250 98-173 (194)
18 PF01562 Pep_M12B_propep: Repr 98.0 2.6E-06 5.7E-11 67.1 2.3 36 2-38 77-120 (131)
19 COG1913 Predicted Zn-dependent 97.9 4.1E-06 9E-11 67.7 1.7 50 215-270 124-177 (181)
20 cd04275 ZnMc_pappalysin_like Z 97.7 9.1E-05 2E-09 63.7 6.8 117 110-234 25-156 (225)
21 PF00413 Peptidase_M10: Matrix 97.7 3.7E-05 8.1E-10 61.9 4.2 48 215-268 105-152 (154)
22 cd04268 ZnMc_MMP_like Zinc-dep 97.5 0.00068 1.5E-08 55.2 9.3 35 216-250 95-139 (165)
23 cd04279 ZnMc_MMP_like_1 Zinc-d 97.3 0.00026 5.6E-09 57.4 3.6 49 215-268 104-154 (156)
24 PF12388 Peptidase_M57: Dual-a 97.2 0.00066 1.4E-08 57.3 5.8 23 211-233 129-151 (211)
25 cd04278 ZnMc_MMP Zinc-dependen 97.2 0.00032 6.9E-09 56.9 3.6 46 216-267 108-154 (157)
26 PF05548 Peptidase_M11: Gameto 96.9 0.0013 2.9E-08 59.3 4.8 70 172-248 114-184 (314)
27 PF02031 Peptidase_M7: Strepto 96.7 0.0011 2.3E-08 51.1 2.1 31 216-250 78-108 (132)
28 smart00235 ZnMc Zinc-dependent 96.6 0.0011 2.3E-08 52.7 1.6 18 217-234 88-105 (140)
29 PF05572 Peptidase_M43: Pregna 96.3 0.0016 3.5E-08 52.7 1.0 23 215-237 69-92 (154)
30 cd04277 ZnMc_serralysin_like Z 96.1 0.0033 7.3E-08 52.4 2.0 35 216-250 114-158 (186)
31 cd04276 ZnMc_MMP_like_2 Zinc-d 95.8 0.0085 1.9E-07 50.5 3.2 20 215-234 116-135 (197)
32 cd04327 ZnMc_MMP_like_3 Zinc-d 94.8 0.014 3E-07 49.3 1.4 18 217-234 94-111 (198)
33 PF10462 Peptidase_M66: Peptid 94.4 0.022 4.8E-07 51.0 1.9 16 216-231 194-209 (305)
34 cd04283 ZnMc_hatching_enzyme Z 94.1 0.024 5.2E-07 47.2 1.4 17 217-233 79-95 (182)
35 cd04281 ZnMc_BMP1_TLD Zinc-dep 93.7 0.03 6.4E-07 47.3 1.3 17 217-233 89-105 (200)
36 cd04280 ZnMc_astacin_like Zinc 93.7 0.029 6.4E-07 46.6 1.2 17 217-233 76-92 (180)
37 PF01400 Astacin: Astacin (Pep 93.3 0.033 7.2E-07 46.7 1.0 18 216-233 80-97 (191)
38 KOG1565|consensus 93.3 0.06 1.3E-06 51.3 2.7 30 215-250 211-240 (469)
39 cd04282 ZnMc_meprin Zinc-depen 92.6 0.056 1.2E-06 46.6 1.3 17 217-233 122-138 (230)
40 TIGR03296 M6dom_TIGR03296 M6 f 91.9 0.044 9.5E-07 49.0 -0.2 33 216-248 166-203 (286)
41 smart00608 ACR ADAM Cysteine-R 89.6 0.43 9.4E-06 37.8 3.6 60 1-66 72-131 (137)
42 COG3824 Predicted Zn-dependent 88.9 0.19 4.1E-06 38.4 1.0 20 214-233 108-127 (136)
43 KOG3714|consensus 88.8 0.16 3.4E-06 47.8 0.7 17 217-233 161-177 (411)
44 PF01431 Peptidase_M13: Peptid 88.5 0.24 5.2E-06 41.8 1.5 16 212-227 33-48 (206)
45 PF12044 Metallopep: Putative 88.5 0.27 5.9E-06 46.2 1.9 18 218-235 319-336 (423)
46 PF09471 Peptidase_M64: IgA Pe 87.3 0.4 8.7E-06 42.3 2.2 20 215-234 216-235 (264)
47 KOG3624|consensus 84.2 0.97 2.1E-05 45.5 3.5 17 211-227 514-530 (687)
48 PF06262 DUF1025: Possibl zinc 84.2 0.76 1.7E-05 34.1 2.0 19 214-232 72-90 (97)
49 PF12725 DUF3810: Protein of u 83.6 0.56 1.2E-05 42.6 1.4 19 215-233 196-214 (318)
50 PF11350 DUF3152: Protein of u 83.6 0.91 2E-05 38.1 2.5 42 215-265 139-183 (203)
51 PF06114 DUF955: Domain of unk 83.5 1.1 2.3E-05 33.6 2.7 18 210-227 37-54 (122)
52 KOG4525|consensus 82.9 0.75 1.6E-05 42.9 1.9 19 217-235 303-321 (614)
53 COG2856 Predicted Zn peptidase 81.5 0.69 1.5E-05 39.4 1.1 13 215-227 72-84 (213)
54 PF13485 Peptidase_MA_2: Pepti 80.3 1.2 2.6E-05 33.6 2.0 15 213-227 23-37 (128)
55 COG4783 Putative Zn-dependent 79.0 1.1 2.4E-05 42.4 1.6 17 212-228 127-143 (484)
56 PF01863 DUF45: Protein of unk 78.5 2.2 4.7E-05 35.8 3.2 22 213-234 162-183 (205)
57 COG3590 PepO Predicted metallo 78.5 0.68 1.5E-05 44.7 0.1 25 211-235 483-510 (654)
58 PF05547 Peptidase_M6: Immune 75.4 0.52 1.1E-05 46.7 -1.7 34 216-249 222-260 (645)
59 PF01435 Peptidase_M48: Peptid 72.9 2.1 4.6E-05 36.1 1.7 16 213-228 87-102 (226)
60 COG5549 Predicted Zn-dependent 67.3 3.5 7.6E-05 34.8 1.7 21 213-233 185-206 (236)
61 TIGR02290 M3_fam_3 oligoendope 66.3 5.2 0.00011 39.5 3.0 12 216-227 376-387 (587)
62 PRK03001 M48 family peptidase; 65.9 3.5 7.6E-05 36.7 1.6 15 213-227 122-136 (283)
63 TIGR02289 M3_not_pepF oligoend 65.8 5.5 0.00012 39.0 3.0 12 216-227 338-349 (549)
64 PRK03982 heat shock protein Ht 65.1 3.7 8E-05 36.7 1.6 15 213-227 123-137 (288)
65 PRK01345 heat shock protein Ht 63.7 4.1 9E-05 37.0 1.7 15 213-227 122-136 (317)
66 PRK03072 heat shock protein Ht 63.2 4.2 9.2E-05 36.4 1.6 16 213-228 125-140 (288)
67 PF04228 Zn_peptidase: Putativ 61.7 5.1 0.00011 35.9 1.8 13 214-226 169-181 (292)
68 COG0501 HtpX Zn-dependent prot 61.5 4.8 0.0001 35.8 1.7 14 214-227 156-169 (302)
69 PF01433 Peptidase_M1: Peptida 60.4 17 0.00036 33.5 5.1 14 214-227 294-307 (390)
70 PRK05457 heat shock protein Ht 59.9 5.3 0.00011 35.7 1.6 15 213-227 132-146 (284)
71 PRK04897 heat shock protein Ht 59.7 5.3 0.00011 35.9 1.6 15 213-227 135-149 (298)
72 COG1164 Oligoendopeptidase F [ 59.2 9.1 0.0002 38.0 3.2 12 216-227 381-392 (598)
73 TIGR00181 pepF oligoendopeptid 58.7 4.5 9.9E-05 39.9 1.1 13 215-227 378-390 (591)
74 PF01447 Peptidase_M4: Thermol 57.9 5.3 0.00011 32.1 1.1 12 216-227 136-147 (150)
75 KOG2719|consensus 57.5 5.6 0.00012 37.4 1.4 39 186-226 247-291 (428)
76 PRK01265 heat shock protein Ht 56.8 6.4 0.00014 35.9 1.6 14 214-227 139-152 (324)
77 PF04298 Zn_peptidase_2: Putat 56.8 7.9 0.00017 33.2 2.1 12 216-227 90-101 (222)
78 PRK02870 heat shock protein Ht 56.6 6.4 0.00014 36.0 1.6 14 213-226 171-184 (336)
79 PRK02391 heat shock protein Ht 56.6 6.5 0.00014 35.3 1.6 15 213-227 131-145 (296)
80 PF13699 DUF4157: Domain of un 55.2 5.8 0.00013 28.1 0.9 11 217-227 63-73 (79)
81 COG1451 Predicted metal-depend 54.5 9.1 0.0002 32.9 2.1 20 214-233 174-193 (223)
82 cd02639 R3H_RRM R3H domain of 54.0 8 0.00017 25.9 1.3 18 217-234 31-48 (60)
83 PF08325 WLM: WLM domain; Int 53.0 11 0.00024 31.4 2.4 23 212-234 79-101 (186)
84 PF01457 Peptidase_M8: Leishma 52.5 7.8 0.00017 37.7 1.5 59 170-231 159-226 (521)
85 cd06455 M3A_TOP Peptidase M3 T 52.3 7.2 0.00016 37.4 1.3 13 215-227 263-275 (472)
86 PF01432 Peptidase_M3: Peptida 51.7 7.5 0.00016 37.0 1.3 12 216-227 243-254 (458)
87 TIGR02414 pepN_proteo aminopep 51.7 27 0.00058 36.3 5.3 14 214-227 282-295 (863)
88 PHA02456 zinc metallopeptidase 51.6 13 0.00027 28.2 2.2 35 193-228 58-92 (141)
89 cd06459 M3B_Oligoendopeptidase 51.3 7.3 0.00016 36.5 1.1 12 216-227 223-234 (427)
90 PF13398 Peptidase_M50B: Pepti 50.6 10 0.00022 31.9 1.8 16 215-230 22-37 (200)
91 PRK14015 pepN aminopeptidase N 50.0 34 0.00075 35.6 5.8 14 214-227 295-308 (875)
92 PF08219 TOM13: Outer membrane 49.9 13 0.00029 26.0 1.9 18 213-230 50-67 (77)
93 PTZ00257 Glycoprotein GP63 (le 49.5 24 0.00052 35.0 4.3 58 170-230 205-271 (622)
94 PF12315 DUF3633: Protein of u 47.8 17 0.00037 30.8 2.6 26 203-228 81-106 (212)
95 COG4784 Putative Zn-dependent 46.9 12 0.00026 34.2 1.7 18 213-231 122-139 (479)
96 COG2321 Predicted metalloprote 46.8 12 0.00025 33.1 1.5 14 213-226 165-178 (295)
97 cd06461 M2_ACE Peptidase famil 46.2 21 0.00045 34.4 3.3 12 217-228 249-260 (477)
98 PF13402 M60-like: Peptidase M 44.5 9.7 0.00021 34.0 0.8 14 217-230 221-234 (307)
99 cd06457 M3A_MIP Peptidase M3 m 44.1 12 0.00025 35.9 1.3 13 215-227 248-260 (458)
100 COG2738 Predicted Zn-dependent 42.1 17 0.00037 30.5 1.8 18 216-233 93-110 (226)
101 PF02163 Peptidase_M50: Peptid 39.5 19 0.00042 29.6 1.8 19 215-233 7-25 (192)
102 cd02641 R3H_Smubp-2_like R3H d 39.3 18 0.00038 24.2 1.2 20 216-235 30-49 (60)
103 PTZ00337 surface protease GP63 38.9 16 0.00034 36.0 1.3 17 215-231 229-245 (567)
104 cd06258 Peptidase_M3_like The 37.8 16 0.00035 33.5 1.2 12 216-227 155-166 (365)
105 KOG2661|consensus 37.4 19 0.00042 32.7 1.5 15 213-227 273-287 (424)
106 cd06460 M32_Taq Peptidase fami 37.0 22 0.00049 33.3 2.0 20 211-230 155-177 (396)
107 cd06163 S2P-M50_PDZ_RseP-like 36.0 24 0.00052 29.3 1.8 14 215-228 9-22 (182)
108 KOG1047|consensus 34.5 24 0.00052 34.3 1.7 12 216-227 289-300 (613)
109 cd05709 S2P-M50 Site-2 proteas 34.4 26 0.00056 28.6 1.8 17 215-231 8-24 (180)
110 PF14891 Peptidase_M91: Effect 34.2 18 0.0004 29.6 0.8 21 214-234 102-122 (174)
111 PF00480 ROK: ROK family; Int 33.1 19 0.00041 29.1 0.7 28 217-245 144-171 (179)
112 cd06164 S2P-M50_SpoIVFB_CBS Sp 33.1 28 0.0006 30.0 1.8 19 214-232 52-70 (227)
113 cd06456 M3A_DCP_Oligopeptidase 32.8 23 0.00049 33.6 1.3 13 215-227 208-220 (422)
114 cd06161 S2P-M50_SpoIVFB SpoIVF 32.7 29 0.00062 29.4 1.8 19 214-232 37-55 (208)
115 KOG2414|consensus 31.7 18 0.00039 33.9 0.4 14 221-234 402-416 (488)
116 PF04450 BSP: Peptidase of pla 30.3 99 0.0021 26.2 4.6 81 148-235 24-114 (205)
117 cd06159 S2P-M50_PDZ_Arch Uncha 29.8 33 0.00073 30.2 1.8 19 215-233 118-136 (263)
118 smart00731 SprT SprT homologue 29.6 29 0.00062 27.5 1.2 16 214-229 58-73 (146)
119 cd06160 S2P-M50_like_2 Unchara 29.5 35 0.00076 28.3 1.8 20 214-233 40-59 (183)
120 PF14247 DUF4344: Domain of un 29.3 35 0.00077 29.2 1.8 13 215-227 92-104 (220)
121 TIGR02412 pepN_strep_liv amino 29.0 94 0.002 32.2 5.1 14 214-227 286-299 (831)
122 PF02130 UPF0054: Uncharacteri 28.9 51 0.0011 26.2 2.5 19 215-233 108-126 (145)
123 KOG1046|consensus 28.6 92 0.002 32.6 4.9 15 213-227 322-336 (882)
124 PRK00016 metal-binding heat sh 28.0 48 0.001 26.9 2.2 20 215-234 113-132 (159)
125 TIGR00043 metalloprotein, YbeY 26.7 58 0.0013 24.6 2.4 21 214-234 72-92 (110)
126 PF10463 Peptidase_U49: Peptid 26.3 46 0.00099 28.3 1.9 19 215-234 101-119 (206)
127 PRK04860 hypothetical protein; 26.2 32 0.00069 28.0 0.9 13 215-227 63-75 (160)
128 PRK04351 hypothetical protein; 26.0 38 0.00083 27.1 1.3 13 214-226 60-72 (149)
129 TIGR02411 leuko_A4_hydro leuko 25.7 35 0.00076 33.9 1.3 12 216-227 280-291 (601)
130 cd02640 R3H_NRF R3H domain of 25.7 44 0.00096 22.3 1.4 20 216-235 30-49 (60)
131 PF10026 DUF2268: Predicted Zn 25.3 1.3E+02 0.0028 25.0 4.5 16 213-228 63-78 (195)
132 cd06007 R3H_DEXH_helicase R3H 25.3 47 0.001 22.1 1.4 20 216-235 29-48 (59)
133 COG0339 Dcp Zn-dependent oligo 24.6 28 0.0006 34.8 0.4 12 216-227 468-479 (683)
134 PRK10911 oligopeptidase A; Pro 24.5 37 0.0008 34.3 1.2 13 215-227 463-475 (680)
135 PF10263 SprT-like: SprT-like 23.9 47 0.001 26.3 1.5 16 214-229 59-74 (157)
136 TIGR00744 ROK_glcA_fam ROK fam 23.4 28 0.0006 31.1 0.1 29 217-246 150-179 (318)
137 PLN02791 Nudix hydrolase homol 23.2 47 0.001 34.0 1.7 17 215-231 585-601 (770)
138 cd06162 S2P-M50_PDZ_SREBP Ster 22.3 56 0.0012 29.1 1.8 18 215-232 135-152 (277)
139 COG1940 NagC Transcriptional r 21.9 32 0.0007 30.7 0.2 32 217-250 159-190 (314)
140 PRK05082 N-acetylmannosamine k 20.6 38 0.00083 29.9 0.4 31 217-248 148-178 (291)
141 TIGR00054 RIP metalloprotease 20.2 60 0.0013 30.6 1.7 13 215-227 14-26 (420)
142 COG4823 AbiF Abortive infectio 20.2 54 0.0012 28.8 1.2 14 216-229 93-106 (299)
No 1
>KOG3607|consensus
Probab=100.00 E-value=1.1e-55 Score=427.75 Aligned_cols=281 Identities=33% Similarity=0.499 Sum_probs=245.0
Q ss_pred CcccccccccccCCcceEEEeecccccCC--------ceEEeccCcc-ccccccccCCCCCC-CCCCCCCCCccccc---
Q psy1606 1 MLHCQHLSEKLEYGIETVAILSHSFLNPG--------GFIIPCRSAI-VDLGLNQVDPGLSP-DGARCGDGKTKRDT--- 67 (285)
Q Consensus 1 ~~~c~y~g~~~~~~~~~v~~~~~~~~~~~--------~~i~p~~~~~-~~~~l~~~~~~~~~-~~~~Cg~~~~~~~~--- 67 (285)
+.||||||++++++.|.||||||+|+ +| |.|||++.+. +.+.+++.++.... ....||..+..+..
T Consensus 93 ~~~C~YqG~v~g~~~S~vslStC~Gl-~g~~~~~~~~Y~Iepl~~s~~f~h~iy~~~~~~~~~~~~~~g~~~~~~~~~~~ 171 (716)
T KOG3607|consen 93 QDHCYYQGYIEGEPDSFVSLSTCSGL-RGLFLFENISYSIEPLEGSDSFEHLIYKLLSLETDNSALVCGHQEAHIITKQM 171 (716)
T ss_pred cccceeeeEeccccCceEEEEeccCc-ceeEEECceeEEEEECCCCCCcceeeeccccccccccccccccchhhcccccc
Confidence 57999999999999999999999999 44 9999999664 44455555554433 35789976443321
Q ss_pred ----ccccccccCCCCCcccceEEEEEEEEehHHHHHhcCChHHHHHHHHHHHHHHHhhcccccEEEEEEEEEEecCCCc
Q psy1606 68 ----VSKHDEIRGPYNANIKSRYLELVIVVDNRLYNLFNKNSKLVHRHCKDISNVINALYEKLNIFIALVGVVVWTEYDE 143 (285)
Q Consensus 68 ----~~~~~~~~~~~~~~~~~~~vel~vv~D~~~~~~~~~n~~~~~~~v~~~~n~v~~iy~~~~i~v~l~~l~iw~~~~~ 143 (285)
....+..+.+..++.++||||++||+|+..|++++.+..++.++++.++|++|++|++++++|+|+++|+|+++|+
T Consensus 172 ~~~~~~~~~~~~~~~~~~~~~kyvEl~vV~D~~~~~~~~~~~~~v~~~~~~vvn~~d~~y~~lni~i~lv~lE~Wt~~dk 251 (716)
T KOG3607|consen 172 ELPLRSKRQERREKRTVIEHTKYVELYVVVDNDLYKKYGSNLSKVRSFAKEVVNLVDSIYKQLNIRIVLVGLEIWTDGNK 251 (716)
T ss_pred cccCcccccccccccccccccceEEEEEEEehHHHHHccccHHHHHHHHHHhhhHHhhhHhhccceEEEEEEEecCCCCe
Confidence 1112333335788999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccccCCHHHHHHHHHHhhhhccCCCCCCCceeeeeeeecCCCceeccccCCCCcccccCceeecCCCchhHHHHHHHHHh
Q psy1606 144 ITLNVNGDITLTNFLSYRKDRLVLSHPNDNAQLLTGMTFSDGVVGKALKGPICTYEFSGGVNVDHKNVVGLVATTVAHEM 223 (285)
Q Consensus 144 ~~~~~~~~~~l~~f~~w~~~~~~~~~~~D~a~llt~~~~~~~~~G~A~~g~~C~~~~~~~v~~~~~~~~~~~a~~~AHEl 223 (285)
+.+++++..+|.+|..|++..+..+.+||.|+|+++..+.+.++|+||+++||++.+++||+.+++..+...|.|+||||
T Consensus 252 i~~~~~~~~tL~~F~~wr~~~l~~r~~hD~a~L~~~~~~~~~~~G~a~~~~mCs~~~s~gv~~~~~~~~~~~a~v~Ahel 331 (716)
T KOG3607|consen 252 IDVSEDLRETLHNFLKWRKSYLTTRLPHDAAHLLSGILFYGKYVGLAYFGGMCSPGHSGGVNKFHSDILLAFAVVLAHEL 331 (716)
T ss_pred ecccccHHHHHHHHHHHHHhhccccCCCCceEEEEeeeccCceeceeecccccCcccccceeecCcccchhHHHHHHHHH
Confidence 99989999999999999998888888999999999988889999999999999999999999999888888999999999
Q ss_pred hhhcCCCCCCCCCCCC-CCCcccCCCCCCCCCCccCccCHHHHHhHhccCccccccccCc
Q psy1606 224 GHNLGMEHDTTECTCP-SDRCIMAPSSSSVSPTEWSSCSLEYLALSFDHGMDYCMRNKPK 282 (285)
Q Consensus 224 GH~lG~~HD~~~c~C~-~~~cIM~~~~~~~~~~~fS~CS~~~~~~~l~~~~~~CL~~~p~ 282 (285)
||+|||.||...|.|+ ...|||........+..||+||+++|+.++..+++.||+|.|.
T Consensus 332 gH~lGm~hD~~~c~C~~~~~CiM~~~~~~~~~~~FS~CS~~~~~~~~~~~~~~cl~n~p~ 391 (716)
T KOG3607|consen 332 GHNLGMIHDESSCSCPSSGLCIMPEETGFYIPKKFSNCSYQDFESFLLKGGGSCLLNLPS 391 (716)
T ss_pred HhhcCcccccccccCCCCCccccccccCcCcccccccchHHHHHHHHhcCCcceEecCCC
Confidence 9999999999999996 7889994444446789999999999999999999999999654
No 2
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=100.00 E-value=1.5e-53 Score=361.27 Aligned_cols=197 Identities=39% Similarity=0.677 Sum_probs=170.3
Q ss_pred eEEEEEEEEehHHHHHhcCChHHHHHHHHHHHHHHHhhcccccEEEEEEEEEEecCCCcccccCCHHHHHHHHHHhhhhc
Q psy1606 85 RYLELVIVVDNRLYNLFNKNSKLVHRHCKDISNVINALYEKLNIFIALVGVVVWTEYDEITLNVNGDITLTNFLSYRKDR 164 (285)
Q Consensus 85 ~~vel~vv~D~~~~~~~~~n~~~~~~~v~~~~n~v~~iy~~~~i~v~l~~l~iw~~~~~~~~~~~~~~~l~~f~~w~~~~ 164 (285)
||||++||+|+.+|++|+.|.+++++++++++|+||.+|++++++|+|++|+||+++|++.+..++..+|.+|++|++..
T Consensus 1 kyiEl~vvvD~~~~~~~~~n~~~~~~~~~~i~n~v~~~y~~l~i~v~l~~leiw~~~d~i~~~~~~~~~L~~F~~w~~~~ 80 (199)
T PF01421_consen 1 KYIELLVVVDNSMYQYHGSNVTKVIQYVLTIVNIVDSIYQQLNIRVVLVGLEIWTEEDKINISNDADSTLENFCNWQKSE 80 (199)
T ss_dssp EEEEEEEEE-HHHHHHTTTGHHHHHHHHHHHHHHHHHHHGGGTEEEEEEEEEEESSSTSS---SSHHHHHHHHHHHHHHT
T ss_pred CeEEEEEEEeHHHHHHhcccHHHHHHHHHHHHHHHhhhcccCCeEEEEEEEEEcccCCceeeecchHHHHHHHHHHHHhh
Confidence 79999999999999999999999999999999999999999999999999999999999999889999999999999987
Q ss_pred cCCCCCCCceeeeeeeecCCCceeccccCCCCcccccCceeecCCCchhHHHHHHHHHhhhhcCCCCCCC-CCCC-CCCC
Q psy1606 165 LVLSHPNDNAQLLTGMTFSDGVVGKALKGPICTYEFSGGVNVDHKNVVGLVATTVAHEMGHNLGMEHDTT-ECTC-PSDR 242 (285)
Q Consensus 165 ~~~~~~~D~a~llt~~~~~~~~~G~A~~g~~C~~~~~~~v~~~~~~~~~~~a~~~AHElGH~lG~~HD~~-~c~C-~~~~ 242 (285)
+.++.+||+|+|||+.++..++.|+|++|++|++.++++|+++.+..+..+|.++||||||+|||.||+. .|.| ++.+
T Consensus 81 ~~~~~~~D~a~Llt~~~~~~~~~G~a~~~~~C~~~~s~~i~~~~~~~~~~~a~~~AHelGH~lGm~HD~~~~C~C~~~~~ 160 (199)
T PF01421_consen 81 LPPRIHHDHAHLLTGKDFPDSTVGLAYVGGMCSPSRSCGIVEDHSRSGLSFAVIIAHELGHNLGMPHDGDNGCKCPRRDG 160 (199)
T ss_dssp HHHHS--SEEEEEESS-CSCCBSEEE-TT-TTSTTTSEEEEE-SSSSHHHHHHHHHHHHHHHTT---TTTCTHCSSSSSS
T ss_pred cccccccceeEEeeecccccceeeeEeCCCCCCcCCCCcEeeeccchhHHHHHHHHHHHHHhcCCCCCCCCcCCcCCCCC
Confidence 7656789999999999997679999999999999999999999988888899999999999999999999 9999 6799
Q ss_pred cccCCCCCCCCCCccCccCHHHHHhHhccCccccccccC
Q psy1606 243 CIMAPSSSSVSPTEWSSCSLEYLALSFDHGMDYCMRNKP 281 (285)
Q Consensus 243 cIM~~~~~~~~~~~fS~CS~~~~~~~l~~~~~~CL~~~p 281 (285)
|||++......+..||+||+++|.+++.++.+.||+|.|
T Consensus 161 cIM~~~~~~~~~~~fS~CS~~~~~~~l~~~~~~CL~n~P 199 (199)
T PF01421_consen 161 CIMSPSIGPSSSDKFSNCSRRQFEEFLSSGKSSCLLNKP 199 (199)
T ss_dssp STTSSSSSSSSTSSS-HHHHHHHHHHHHHHTTGGGSS--
T ss_pred ccccccccCCCCCCcCHHHHHHHHHHHccCCCCccCCCC
Confidence 999997765456799999999999999988899999998
No 3
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=100.00 E-value=2.2e-51 Score=346.74 Aligned_cols=194 Identities=43% Similarity=0.764 Sum_probs=182.9
Q ss_pred eEEEEEEEEehHHHHHhcCChHHHHHHHHHHHHHHHhhcccccEEEEEEEEEEecCCCcccccCCHHHHHHHHHHhhhhc
Q psy1606 85 RYLELVIVVDNRLYNLFNKNSKLVHRHCKDISNVINALYEKLNIFIALVGVVVWTEYDEITLNVNGDITLTNFLSYRKDR 164 (285)
Q Consensus 85 ~~vel~vv~D~~~~~~~~~n~~~~~~~v~~~~n~v~~iy~~~~i~v~l~~l~iw~~~~~~~~~~~~~~~l~~f~~w~~~~ 164 (285)
||||++|++|+.+|+++++|.++++++++.++|+||.+|++++|+|+|++|+||+++|++.+..++..+|..|.+|++..
T Consensus 1 ~~iE~~vv~D~~~~~~~~~n~~~~~~~v~~~~n~vn~~y~~~~i~v~l~~i~i~~~~~~~~~~~~~~~~L~~f~~w~~~~ 80 (194)
T cd04269 1 KYVELVVVVDNSLYKKYGSNLSKVRQRVIEIVNIVDSIYRPLNIRVVLVGLEIWTDKDKISVSGDAGETLNRFLDWKRSN 80 (194)
T ss_pred CeEEEEEEEeHHHHHHhCCCHHHHHHHHHHHHHHHHHHHcCCCeEEEEEEEEEeecCCcccccCCHHHHHHHHHHHHHHh
Confidence 58999999999999999999999999999999999999999999999999999999999988788899999999999987
Q ss_pred cCCCCCCCceeeeeeeecCCCceeccccCCCCcccccCceeecCCCchhHHHHHHHHHhhhhcCCCCCCCCCCCCCCCcc
Q psy1606 165 LVLSHPNDNAQLLTGMTFSDGVVGKALKGPICTYEFSGGVNVDHKNVVGLVATTVAHEMGHNLGMEHDTTECTCPSDRCI 244 (285)
Q Consensus 165 ~~~~~~~D~a~llt~~~~~~~~~G~A~~g~~C~~~~~~~v~~~~~~~~~~~a~~~AHElGH~lG~~HD~~~c~C~~~~cI 244 (285)
+..+.++|+++|||+++++++..|+||+|++|++.++++++++.+.++...|.++||||||+|||+||+..|.|++++||
T Consensus 81 ~~~~~~~D~a~Llt~~~~~~~~~G~A~~g~iC~~~~~~~v~~~~~~~~~~~a~~~AHElGH~lG~~HD~~~c~C~~~~cI 160 (194)
T cd04269 81 LLPRKPHDNAQLLTGRDFDGNTVGLAYVGGMCSPKYSGGVVQDHSRNLLLFAVTMAHELGHNLGMEHDDGGCTCGRSTCI 160 (194)
T ss_pred hccccCCCceEEEEeeecCCCceeeeecCCccCCCcceEEEEeCCcchHHHHHHHHHHHHhhcCCCcCCCCCCCCCCCeE
Confidence 76667899999999999988999999999999988899999999877778899999999999999999999999999999
Q ss_pred cCCCCCCCCCCccCccCHHHHHhHhccCccccccc
Q psy1606 245 MAPSSSSVSPTEWSSCSLEYLALSFDHGMDYCMRN 279 (285)
Q Consensus 245 M~~~~~~~~~~~fS~CS~~~~~~~l~~~~~~CL~~ 279 (285)
|++.... +++.||+||+++|++++..+.+.||++
T Consensus 161 M~~~~~~-~~~~fS~CS~~~~~~~l~~~~~~CL~~ 194 (194)
T cd04269 161 MAPSPSS-LTDAFSNCSYEDYQKFLSRGGGQCLLN 194 (194)
T ss_pred ecCCCCC-CCCCCChhhHHHHHHHHhcCCCCccCC
Confidence 9999875 689999999999999999988999975
No 4
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=100.00 E-value=1.4e-47 Score=326.53 Aligned_cols=189 Identities=29% Similarity=0.477 Sum_probs=169.1
Q ss_pred eEEEEEEEEehHHHHHhcCChHHHHHHHHHHHHHHHhhccc----ccEEEEEEEEEEecCCCc-ccccCCHHHHHHHHHH
Q psy1606 85 RYLELVIVVDNRLYNLFNKNSKLVHRHCKDISNVINALYEK----LNIFIALVGVVVWTEYDE-ITLNVNGDITLTNFLS 159 (285)
Q Consensus 85 ~~vel~vv~D~~~~~~~~~n~~~~~~~v~~~~n~v~~iy~~----~~i~v~l~~l~iw~~~~~-~~~~~~~~~~l~~f~~ 159 (285)
||||++||+|+.+|++++. +++++|++.++|+||.+|++ ++|+|+|++|+||+++++ +.+..++..+|.+|++
T Consensus 1 ~~vEl~vv~D~~~~~~~~~--~~~~~y~~~~~n~v~~~y~~~s~~~~i~i~l~~i~i~~~~~~~~~~~~~~~~~L~~F~~ 78 (207)
T cd04273 1 RYVETLVVADSKMVEFHHG--EDLEHYILTLMNIVASLYKDPSLGNSINIVVVRLIVLEDEESGLLISGNAQKSLKSFCR 78 (207)
T ss_pred CeEEEEEEECHHHHHHcCh--HHHHHHHHHHHHHHHHHhcCcccCCCeEEEEEEEEEecCCCCcCcccCCHHHHHHHHHH
Confidence 5899999999999998753 78999999999999999996 799999999999998876 5566788999999999
Q ss_pred hhhhccCCC----CCCCceeeeeeeecC-----CCceeccccCCCCcccccCceeecCCCchhHHHHHHHHHhhhhcCCC
Q psy1606 160 YRKDRLVLS----HPNDNAQLLTGMTFS-----DGVVGKALKGPICTYEFSGGVNVDHKNVVGLVATTVAHEMGHNLGME 230 (285)
Q Consensus 160 w~~~~~~~~----~~~D~a~llt~~~~~-----~~~~G~A~~g~~C~~~~~~~v~~~~~~~~~~~a~~~AHElGH~lG~~ 230 (285)
|++..+... .++|+++|||+.++. +++.|+||+||+|++.++++++++.+ ...|.++||||||+|||.
T Consensus 79 w~~~~~~~~~~~~~~~D~a~llt~~d~~~~~~~~~~~G~A~~ggiC~~~~s~~i~~~~~---~~~a~~~aHElGH~LG~~ 155 (207)
T cd04273 79 WQKKLNPPNDSDPEHHDHAILLTRQDICRSNGNCDTLGLAPVGGMCSPSRSCSINEDTG---LSSAFTIAHELGHVLGMP 155 (207)
T ss_pred HHHHcCCcccccccccceEEEEeeecccccCCCCCceEEeccccCCCCCcceEEEcCCC---ceeEEeeeeechhhcCCC
Confidence 998766553 479999999999985 58999999999999999999998864 345789999999999999
Q ss_pred CCCCCCCCCCC---CcccCCCCCCC-CCCccCccCHHHHHhHhccCcccccc
Q psy1606 231 HDTTECTCPSD---RCIMAPSSSSV-SPTEWSSCSLEYLALSFDHGMDYCMR 278 (285)
Q Consensus 231 HD~~~c~C~~~---~cIM~~~~~~~-~~~~fS~CS~~~~~~~l~~~~~~CL~ 278 (285)
||+..|.|+.. +|||+|..... ++..||+||+++|+++++++.+.||+
T Consensus 156 HD~~~~~C~~~~~~~~IM~~~~~~~~~~~~fS~CS~~~~~~~l~~~~~~CL~ 207 (207)
T cd04273 156 HDGDGNSCGPEGKDGHIMSPTLGANTGPFTWSKCSRRYLTSFLDTGDGNCLL 207 (207)
T ss_pred CCCCCCCCCCCCCCceeecccccCCCCCCCcCHHHHHHHHHHHccCCCcccC
Confidence 99999999875 89999998763 68899999999999999998899995
No 5
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=100.00 E-value=5.7e-45 Score=313.23 Aligned_cols=189 Identities=28% Similarity=0.341 Sum_probs=164.0
Q ss_pred EEEEEEEEehHHHHHhcCChHHHHHHHHHHHHHHHhhcccc---cEEEEEEEEEEecCCCcccccC-------CHHHHHH
Q psy1606 86 YLELVIVVDNRLYNLFNKNSKLVHRHCKDISNVINALYEKL---NIFIALVGVVVWTEYDEITLNV-------NGDITLT 155 (285)
Q Consensus 86 ~vel~vv~D~~~~~~~~~n~~~~~~~v~~~~n~v~~iy~~~---~i~v~l~~l~iw~~~~~~~~~~-------~~~~~l~ 155 (285)
|||++||+|+.+|+++ ++.+++++|++.++|+||.+|+++ +|+|+|++|+||++++++.+.. +...+|.
T Consensus 2 yvEl~iV~D~~~~~~~-~~~~~~~~yv~~~~n~vn~~Y~~~~~p~I~i~lv~i~i~~~~~~~~~~~~~~~~~~~~~~tL~ 80 (220)
T cd04272 2 YPELFVVVDYDHQSEF-FSNEQLIRYLAVMVNAANLRYRDLKSPRIRLLLVGITISKDPDFEPYIHPINYGYIDAAETLE 80 (220)
T ss_pred cceEEEEecHHHHHhc-CCHHHHHHHHHHHHHHHHHHhhhcCCCcEEEEEEEEEeccCcccceeeccCCcccccHHHHHH
Confidence 7999999999999996 578999999999999999999988 8999999999999998876532 5678999
Q ss_pred HHHHhhhhccCCCCCCCceeeeeeeec--------CCCceeccccCCCCcccccCceeecCCCchhHHHHHHHHHhhhhc
Q psy1606 156 NFLSYRKDRLVLSHPNDNAQLLTGMTF--------SDGVVGKALKGPICTYEFSGGVNVDHKNVVGLVATTVAHEMGHNL 227 (285)
Q Consensus 156 ~f~~w~~~~~~~~~~~D~a~llt~~~~--------~~~~~G~A~~g~~C~~~~~~~v~~~~~~~~~~~a~~~AHElGH~l 227 (285)
+|++|++..+.. .++|+++|+|+.++ ..++.|+||+|++|++ ++++++++.+..+ ..|.++||||||+|
T Consensus 81 ~F~~~~~~~~~~-~~~D~~~LlT~~~~~~~~~g~~~~~~~G~A~~g~~C~~-~~~~~~~d~~~~~-~~~~~~AHElGH~l 157 (220)
T cd04272 81 NFNEYVKKKRDY-FNPDVVFLVTGLDMSTYSGGSLQTGTGGYAYVGGACTE-NRVAMGEDTPGSY-YGVYTMTHELAHLL 157 (220)
T ss_pred HHHHHHhccCCC-CcccEEEEEeccceeeccCcccccCccceEeecCccCC-CceeEeecCCCCc-ccHHHHHHHHHHHh
Confidence 999999876544 48999999999987 2467899999999995 5788888876533 34799999999999
Q ss_pred CCCCCCCC-C----------CCCC-CCcccCCCCCCCCCCccCccCHHHHHhHhccCcccccc
Q psy1606 228 GMEHDTTE-C----------TCPS-DRCIMAPSSSSVSPTEWSSCSLEYLALSFDHGMDYCMR 278 (285)
Q Consensus 228 G~~HD~~~-c----------~C~~-~~cIM~~~~~~~~~~~fS~CS~~~~~~~l~~~~~~CL~ 278 (285)
||+||+.+ | .|+. .+|||++.....++..||+||+++|+++|..+.+.||.
T Consensus 158 G~~HD~~~~~~~~~~~~~~~~C~~~~~~IM~~~~~~~~~~~fS~CS~~~~~~~l~~~~~~CL~ 220 (220)
T cd04272 158 GAPHDGSPPPSWVKGHPGSLDCPWDDGYIMSYVVNGERQYRFSQCSQRQIRNVFRRLGASCLH 220 (220)
T ss_pred CCCCCCCCCCccCCCCCccCcCCCCCCeEEeeccCCCCCCccCcchHHHHHHHHcCCCccccC
Confidence 99999865 4 5875 68999998776678899999999999999998899984
No 6
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=100.00 E-value=7.3e-43 Score=294.29 Aligned_cols=182 Identities=33% Similarity=0.511 Sum_probs=161.4
Q ss_pred eEEEEEEEEehHHHHHhcCChHHHHHHHHHHHHHHHhhccc----ccEEEEEEEEEEecCCCcccc-cCCHHHHHHHHHH
Q psy1606 85 RYLELVIVVDNRLYNLFNKNSKLVHRHCKDISNVINALYEK----LNIFIALVGVVVWTEYDEITL-NVNGDITLTNFLS 159 (285)
Q Consensus 85 ~~vel~vv~D~~~~~~~~~n~~~~~~~v~~~~n~v~~iy~~----~~i~v~l~~l~iw~~~~~~~~-~~~~~~~l~~f~~ 159 (285)
||||++||+|+.+|++|+++.+.++++++.++|+||.+|++ ++|++.|++|+||++.+++.. ..+...+|..|.+
T Consensus 1 ~~vE~~vv~D~~~~~~~~~~~~~~~~~i~~~in~vn~~y~~~~~~~~i~~~l~~i~i~~~~~~~~~~~~~~~~~l~~f~~ 80 (192)
T cd04267 1 REIELVVVADHRMVSYFNSDENILQAYITELINIANSIYRSTNLRLGIRISLEGLQILKGEQFAPPIDSDASNTLNSFSF 80 (192)
T ss_pred CEEEEEEEEcHHHHHHhcCCHHHHHHHHHHHHHHHHHHHhcCccCCCeEEEEEEEEEecCCCccCccCccHHHHHHHHHH
Confidence 68999999999999999989999999999999999999998 899999999999999988765 4567789999999
Q ss_pred hhhhccCCCCCCCceeeeeeeec-CCCceeccccCCCCcccccCceeecCCCchhHHHHHHHHHhhhhcCCCCCCCCC--
Q psy1606 160 YRKDRLVLSHPNDNAQLLTGMTF-SDGVVGKALKGPICTYEFSGGVNVDHKNVVGLVATTVAHEMGHNLGMEHDTTEC-- 236 (285)
Q Consensus 160 w~~~~~~~~~~~D~a~llt~~~~-~~~~~G~A~~g~~C~~~~~~~v~~~~~~~~~~~a~~~AHElGH~lG~~HD~~~c-- 236 (285)
|++.. ..++|+++|+|+.++ .+++.|+||++++|++.+++|++++.+. +...|.++||||||+|||.||+..|
T Consensus 81 ~~~~~---~~~~d~~~l~t~~~~~~~~~~G~a~~~~~C~~~~~~~v~~~~~~-~~~~~~~~aHElGH~lG~~HD~~~~~~ 156 (192)
T cd04267 81 WRAEG---PIRHDNAVLLTAQDFIEGDILGLAYVGSMCNPYSSVGVVEDTGF-TLLTALTMAHELGHNLGAEHDGGDELA 156 (192)
T ss_pred HHhcC---CCCCceEEEEeeeccCCCCeeeeecccCCCCCCCCeEEEecCCc-ceeehhhhhhhHHhhcCCcCCCCCCcc
Confidence 98854 357999999999998 6789999999999999889999988765 4456899999999999999999754
Q ss_pred -CCC-CCCcccCCCCCCCCCCccCccCHHHHHhHhc
Q psy1606 237 -TCP-SDRCIMAPSSSSVSPTEWSSCSLEYLALSFD 270 (285)
Q Consensus 237 -~C~-~~~cIM~~~~~~~~~~~fS~CS~~~~~~~l~ 270 (285)
.|+ +++|||++.....++..||+||+++|+++|.
T Consensus 157 ~~c~~~~~~IM~~~~~~~~~~~FS~CS~~~i~~~l~ 192 (192)
T cd04267 157 FECDGGGNYIMAPVDSGLNSYRFSQCSIGSIREFLD 192 (192)
T ss_pred ccCCCCCCeEEcccccCCCCCccChhhHHHHHHHhC
Confidence 555 5789999987755678999999999999873
No 7
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=100.00 E-value=9.1e-43 Score=299.64 Aligned_cols=186 Identities=27% Similarity=0.390 Sum_probs=153.0
Q ss_pred EEEEEEEEehHHHHHhcCChHHHHHHHHHHHHHHHhhcc-cccEEEEEEEEEEecCCCccccc------------CCHHH
Q psy1606 86 YLELVIVVDNRLYNLFNKNSKLVHRHCKDISNVINALYE-KLNIFIALVGVVVWTEYDEITLN------------VNGDI 152 (285)
Q Consensus 86 ~vel~vv~D~~~~~~~~~n~~~~~~~v~~~~n~v~~iy~-~~~i~v~l~~l~iw~~~~~~~~~------------~~~~~ 152 (285)
.+=+.||+|+.+|++|+ +.+++++|+++++|+||.+|+ ++||+|.|++|+||+++|++.+. .++..
T Consensus 2 ~~~~~vvaD~~~~~~~~-~~~~~~~~v~~ivN~vn~~Y~~~~nI~v~Lvglei~~~~d~~~~~~~~~~~~~~~~~~~~~~ 80 (228)
T cd04271 2 VALIGVAADCSYTKSFG-SVEEARRNILNNVNSASQLYESSFNISLGLRNLTISDASCPSTAVDSAPWNLPCNSRIDIDD 80 (228)
T ss_pred eEEEEEEechHHHHHcC-CHHHHHHHHHHHHHHHHHHhhCcceEEEEEEEEEEecCCCCCcccccccccccccccCCHHH
Confidence 34578999999999997 589999999999999999999 79999999999999998875432 35789
Q ss_pred HHHHHHHhhhhccCCCCCCCce--eeeeeeecCCCceeccccCCCCcccccCceeecCCCchhHH-----HHHHHHHhhh
Q psy1606 153 TLTNFLSYRKDRLVLSHPNDNA--QLLTGMTFSDGVVGKALKGPICTYEFSGGVNVDHKNVVGLV-----ATTVAHEMGH 225 (285)
Q Consensus 153 ~l~~f~~w~~~~~~~~~~~D~a--~llt~~~~~~~~~G~A~~g~~C~~~~~~~v~~~~~~~~~~~-----a~~~AHElGH 225 (285)
+|++|++|++... .+|.+ +|+++... ++++|+||+|++|++..++++.++........ +.||||||||
T Consensus 81 tL~~F~~Wr~~~~----~~d~a~~~l~t~~~~-~~~lGlA~vg~mC~~~~s~~~~~~~~~~~~~~~~~~~~~t~AHElGH 155 (228)
T cd04271 81 RLSIFSQWRGQQP----DDGNAFWTLMTACPS-GSEVGVAWLGQLCRTGASDQGNETVAGTNVVVRTSNEWQVFAHEIGH 155 (228)
T ss_pred HHHHHHHHHhcCC----CCCcEEeEeeccccC-CCceeEeeccCCCCCCcCCCccccccccceeecccccceehhhhhhh
Confidence 9999999988542 46666 99998764 78999999999999988878776543211111 2599999999
Q ss_pred hcCCCCCCCC--CC---------CC--------CCCcccCCCCCCCCCCccCccCHHHHHhHhccCc--ccccc
Q psy1606 226 NLGMEHDTTE--CT---------CP--------SDRCIMAPSSSSVSPTEWSSCSLEYLALSFDHGM--DYCMR 278 (285)
Q Consensus 226 ~lG~~HD~~~--c~---------C~--------~~~cIM~~~~~~~~~~~fS~CS~~~~~~~l~~~~--~~CL~ 278 (285)
||||.||+.. |. || +++|||+|..+. ++..||+||+++|++||..+. +.||.
T Consensus 156 nLGm~HD~~~~~C~~~~~~~~~~Cp~~~~~C~~~~~~IM~~~~~~-~~~~FS~CS~~~~~~~l~~~~~~~~CL~ 228 (228)
T cd04271 156 TFGAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPSSSS-GITEFSPCTIGNICSLLGRNPVRTSCLS 228 (228)
T ss_pred hcCCCCCCCCCccccccccccccCCCcccccCCCCCEEeecCcCC-CCCccChhhHHHHHHHHhcCCCCCcccC
Confidence 9999999864 52 33 267999999876 789999999999999998754 88984
No 8
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=100.00 E-value=2.2e-41 Score=294.39 Aligned_cols=193 Identities=20% Similarity=0.343 Sum_probs=148.1
Q ss_pred EEEEEEEEehHHHHHhcCC-hHHHHHHHHHHHHHHHhhcccccEE--------EEEEEEEEecCCCcccccCCHHHHHHH
Q psy1606 86 YLELVIVVDNRLYNLFNKN-SKLVHRHCKDISNVINALYEKLNIF--------IALVGVVVWTEYDEITLNVNGDITLTN 156 (285)
Q Consensus 86 ~vel~vv~D~~~~~~~~~n-~~~~~~~v~~~~n~v~~iy~~~~i~--------v~l~~l~iw~~~~~~~~~~~~~~~l~~ 156 (285)
++.|+||+|+.+|++++++ .+.+++++++++|+||.+|++++++ |.+.+++||++.+.......... ..
T Consensus 2 ~C~l~vVaD~~~~~~~~~~~~~~v~~~~~~~vn~vn~iY~~~n~~~~~l~~i~i~~~~i~I~~~~~~~~~~~~~~~--~~ 79 (244)
T cd04270 2 TCKLLLVADHRFYKYMGRGEEETTINYLISHIDRVDDIYRNTDWDGGGFKGIGFQIKRIRIHTTPDEVDPGNKFYN--KS 79 (244)
T ss_pred ccEEEEEEcHHHHHHhCCChHHHHHHHHHHHHHHHHHHhcccccCCCcccceEEEEEEEEEEcCcccccCCCcccc--cC
Confidence 4789999999999999987 7999999999999999999988765 66777777877665332111111 35
Q ss_pred HHHhhhhccC-----CCCCCC--ceeeeeeeecCCCceeccccC--------CCCcccccC----------ceeec--CC
Q psy1606 157 FLSYRKDRLV-----LSHPND--NAQLLTGMTFSDGVVGKALKG--------PICTYEFSG----------GVNVD--HK 209 (285)
Q Consensus 157 f~~w~~~~~~-----~~~~~D--~a~llt~~~~~~~~~G~A~~g--------~~C~~~~~~----------~v~~~--~~ 209 (285)
|..|+...+. .+.++| +|+|||+++|+++++|+||+| |+|++..++ |++.. .+
T Consensus 80 f~~w~~~~~~~~~~~~~~~d~~c~A~LlT~~df~~~tlGlA~vg~~~~~~~gGIC~~~~s~~~~~~~~~n~gl~t~~~~~ 159 (244)
T cd04270 80 FPNWGVEKFLVKLLLEQFSDDVCLAHLFTYRDFDMGTLGLAYVGSPRDNSAGGICEKAYYYSNGKKKYLNTGLTTTVNYG 159 (244)
T ss_pred ccchhHHHHhhhhhhhcCCccceEEEEEeccccCCCceeeeeecccccCCCCcccCCcccccCCcceeeecceEeeeccC
Confidence 7666653332 233444 799999999988899999996 588886554 34322 22
Q ss_pred C--chhHHHHHHHHHhhhhcCCCCCCC--CCCCCC---CCcccCCCCC---CCCCCccCccCHHHHHhHhccCccccccc
Q psy1606 210 N--VVGLVATTVAHEMGHNLGMEHDTT--ECTCPS---DRCIMAPSSS---SVSPTEWSSCSLEYLALSFDHGMDYCMRN 279 (285)
Q Consensus 210 ~--~~~~~a~~~AHElGH~lG~~HD~~--~c~C~~---~~cIM~~~~~---~~~~~~fS~CS~~~~~~~l~~~~~~CL~~ 279 (285)
. .+..+|.||||||||||||+||+. .|.|+. +.|||+|... ..++..||+||+++|+++|..+.+.||.+
T Consensus 160 ~~~~~~~~a~t~AHElGHnlGm~HD~~~~~C~~~~~~~~~cIM~~~~~~~~~~~~~~FS~CS~~~~~~~L~~~~~~CL~~ 239 (244)
T cd04270 160 KRVPTKESDLVTAHELGHNFGSPHDPDIAECAPGESQGGNYIMYARATSGDKENNKKFSPCSKKSISKVLEVKSNSCFVE 239 (244)
T ss_pred CccchhHHHHHHHHHHHHhcCCCCCCCCCCCCCCCCCCCceEecCccccCCCCCCCCCCHhHHHHHHHHHccCCCCCcCC
Confidence 1 123367999999999999999998 577764 4899999863 25678999999999999999999999986
Q ss_pred c
Q psy1606 280 K 280 (285)
Q Consensus 280 ~ 280 (285)
.
T Consensus 240 ~ 240 (244)
T cd04270 240 R 240 (244)
T ss_pred C
Confidence 4
No 9
>KOG3538|consensus
Probab=99.97 E-value=8e-30 Score=256.30 Aligned_cols=273 Identities=25% Similarity=0.336 Sum_probs=201.3
Q ss_pred cccccccccccCCcce-EEEeeccccc------CCceEEeccCccccc-----cccccCCCC---CCCCCCCCCCCcccc
Q psy1606 2 LHCQHLSEKLEYGIET-VAILSHSFLN------PGGFIIPCRSAIVDL-----GLNQVDPGL---SPDGARCGDGKTKRD 66 (285)
Q Consensus 2 ~~c~y~g~~~~~~~~~-v~~~~~~~~~------~~~~i~p~~~~~~~~-----~l~~~~~~~---~~~~~~Cg~~~~~~~ 66 (285)
.+|+|+|.+.+.+.++ +|+++|.|.+ .-|+|+|+....-.. ++.+..... .....-++..+....
T Consensus 82 ~~c~~~g~v~~~~~~~~~a~s~c~G~g~~~~~~~~~~iepl~~~~~~~~~~~~h~~~~r~~~~~~~~~~~~~~~~~~~~~ 161 (845)
T KOG3538|consen 82 RGCFYLGDVLDCPPSAGAALSLCSGHGSIQLQGHDYFIEPLPRGPAAPVSGRAHLVYRRLDVELASEVLDGTRDQKCDPS 161 (845)
T ss_pred CCCcccCcccCCCCccceeeeccCCccceeccCcceeEeeccCCCcccccCCCcccccchhhcccccCcccccccccccC
Confidence 5899999999988888 9999999931 229999998654111 111111000 000000000000011
Q ss_pred cccccccccCC-CCCcccceEEEEEEEEehHHHHHhcCChHHHHHHHHHHHHHHHhhcccc----cEEEEEEEEEEecCC
Q psy1606 67 TVSKHDEIRGP-YNANIKSRYLELVIVVDNRLYNLFNKNSKLVHRHCKDISNVINALYEKL----NIFIALVGVVVWTEY 141 (285)
Q Consensus 67 ~~~~~~~~~~~-~~~~~~~~~vel~vv~D~~~~~~~~~n~~~~~~~v~~~~n~v~~iy~~~----~i~v~l~~l~iw~~~ 141 (285)
........+-. +......+ ||.+|++|..+|.+++ ..+..|++.++|++..+|+.- .|+|+++.++|+...
T Consensus 162 ~~~~~~~~~~~~~r~~~~~~-vetlvvaD~~~~~~~~---~~~~~~~lt~~n~v~~l~~~~sl~~~i~i~vvr~~~l~~~ 237 (845)
T KOG3538|consen 162 PGDGAGRGRCRGRRSVSRHR-VETLVVADKCMVAFHG---ESVERYVLTLMNIVASLYKDPSLGNHINIVVVRLIILLKE 237 (845)
T ss_pred Chhhccccccccccccccce-eeEEEEecHHHHHhcc---hhhHHHHHHHHHHHHHHhcCCCCCCceEEEEEEEEEEecC
Confidence 10000000000 12333444 9999999999999997 689999999999999999843 589999999999877
Q ss_pred Cc-ccccCCHHHHHHHHHHhhhhccCCC--CCCCceeeeeeeecCC------CceeccccCCCCcccccCceeecCCCch
Q psy1606 142 DE-ITLNVNGDITLTNFLSYRKDRLVLS--HPNDNAQLLTGMTFSD------GVVGKALKGPICTYEFSGGVNVDHKNVV 212 (285)
Q Consensus 142 ~~-~~~~~~~~~~l~~f~~w~~~~~~~~--~~~D~a~llt~~~~~~------~~~G~A~~g~~C~~~~~~~v~~~~~~~~ 212 (285)
+. ..+..++..+|++|++|+....... ..+|.++|||+.++.. .+.|+|.++++|.+.+++.++++.+..
T Consensus 238 ~~~~~i~~~~~~sl~~fC~wq~~~n~~~d~~~~D~ailltr~d~~~~~~~~~~~~G~A~v~gmC~~~~sC~i~e~~gl~- 316 (845)
T KOG3538|consen 238 PSALNIEGNAGKSLDSFCKWQTSLNSEEDTEHHDHAILLTRQDFCEQQNGPCDTLGVAPVGGMCSPSRSCSINEDTGLP- 316 (845)
T ss_pred CCCceeccCcccchhhhcccccccCCCccccccceeEEeeeeecccccCCCCccccceeecceeCCccceeeecCCCCc-
Confidence 65 5566778889999999988543332 3599999999998753 588999999999999999999998853
Q ss_pred hHHHHHHHHHhhhhcCCCCCCCCCCCCCCCcccCCCCCC-CCCCccCccCHHHHHhHhccCccccccccC
Q psy1606 213 GLVATTVAHEMGHNLGMEHDTTECTCPSDRCIMAPSSSS-VSPTEWSSCSLEYLALSFDHGMDYCMRNKP 281 (285)
Q Consensus 213 ~~~a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~~-~~~~~fS~CS~~~~~~~l~~~~~~CL~~~p 281 (285)
.|+++|||+||+|||.||+....|.....||++.... .....||.||.+++.+||.++.+.||.+.|
T Consensus 317 --~aftiahE~GH~~gm~hd~~~~~C~~~~~im~~~~~~~~~~~~WS~CS~~~~t~fl~~~~~~Cl~d~p 384 (845)
T KOG3538|consen 317 --SAFTIAHELGHNFGMEHDGRGNSCGPTFHVMSSDTCGGNHELTWSECSREYLTRFLDDGEGNCLSDQP 384 (845)
T ss_pred --cceeeeeccccccCcccCCcCCCCCCcccccCcccccCCCCccCCcchHHHHHHHhccCccceeecCC
Confidence 5799999999999999999876687766999999763 345579999999999999998889999887
No 10
>KOG3658|consensus
Probab=99.97 E-value=9.4e-30 Score=237.45 Aligned_cols=273 Identities=21% Similarity=0.284 Sum_probs=187.1
Q ss_pred ccccccccCCcceEEEeecccccCC--------ceEEeccCcccccc-------ccccCCC----CCC---CCCCCCCCC
Q psy1606 5 QHLSEKLEYGIETVAILSHSFLNPG--------GFIIPCRSAIVDLG-------LNQVDPG----LSP---DGARCGDGK 62 (285)
Q Consensus 5 ~y~g~~~~~~~~~v~~~~~~~~~~~--------~~i~p~~~~~~~~~-------l~~~~~~----~~~---~~~~Cg~~~ 62 (285)
+|-|++.+.+.|.|--+.-.|+..| |-|||...-..+.. +++.+.. +.+ -...||..+
T Consensus 107 ~YtG~v~ge~~s~V~~~i~dg~~~g~I~t~~~~y~IEPa~~y~~d~~~p~~~viYke~di~~~~~~~~~~~~p~~~g~~k 186 (764)
T KOG3658|consen 107 FYTGHVVGEPASHVLASIEDGVFEGSIQTEGDTYYIEPAWRYFEDDKLPFHSVIYKESDIKNVHRLVATAGSPKVCGYSK 186 (764)
T ss_pred eeeeEEecCcccceeEEEecCeEEEEEEecCceEEeccccccccCCCCccceEEEehhhhcchhhcccccCCCcccchhh
Confidence 5899999999999987777776333 88998775443321 2222211 112 135788776
Q ss_pred cccccccccccccC-C--------------CCC-cccceEEEEEEEEehHHHHHhc-CChHHHHHHHHHHHHHHHhhccc
Q psy1606 63 TKRDTVSKHDEIRG-P--------------YNA-NIKSRYLELVIVVDNRLYNLFN-KNSKLVHRHCKDISNVINALYEK 125 (285)
Q Consensus 63 ~~~~~~~~~~~~~~-~--------------~~~-~~~~~~vel~vv~D~~~~~~~~-~n~~~~~~~v~~~~n~v~~iy~~ 125 (285)
..+.++....+... | ... ...+.++.|+|++||.+|+++| .+.+.++.+++..+..||.||++
T Consensus 187 ~~~~el~~k~~~~~~p~n~~~~~~r~~R~~d~~~~~~kntC~LylqADh~fy~~mg~~t~e~~~~~l~~hI~rVn~IY~~ 266 (764)
T KOG3658|consen 187 RRRRELLKKGEVDRIPPNELTANGRSKRRKDLLSDPMKNTCSLYLQADHRFYRYMGRGTRETTTNYLINHIDRVNEIYRN 266 (764)
T ss_pred hhhhhcccccccccCChhhhhhhhhhhhhcccccccccceeeEEEEecHHHHHHHcCCchHHHHHHHHHHHHHHHHHhcC
Confidence 54433322111111 1 011 3446689999999999999998 35688899999999999999985
Q ss_pred c---------cEEEEEEEEEEecCCCccccc-------CCHHHHHHHHHHhhhhccCCCCCCCceeeeeeeecCCCceec
Q psy1606 126 L---------NIFIALVGVVVWTEYDEITLN-------VNGDITLTNFLSYRKDRLVLSHPNDNAQLLTGMTFSDGVVGK 189 (285)
Q Consensus 126 ~---------~i~v~l~~l~iw~~~~~~~~~-------~~~~~~l~~f~~w~~~~~~~~~~~D~a~llt~~~~~~~~~G~ 189 (285)
. ||.++|..|.|.+..+..+-. .....-+.+|....... +....-+|+|||.++|.++++||
T Consensus 267 T~f~~~~g~kni~F~IkrIkI~~~~~~~~~~~~~~N~f~~e~~~v~~fl~l~s~~--~~sdfCLAylfT~rdFe~GtLGL 344 (764)
T KOG3658|consen 267 TNFDDRGGFKNIGFVIKRIKILDSPACVRPGEDHYNMFCEEVKDVLEFLSLNSEE--KHSDFCLAYLFTYRDFEGGTLGL 344 (764)
T ss_pred CccccCCCccceEEEEEEEEecCchhhcccCcCccccchhhHHHHHHHhhhhhhc--cccchhheeeeeecccccceeee
Confidence 4 489999999999876532211 11233344444332211 12245689999999999999999
Q ss_pred cccC--------CCCcccc--c--------CceeecCC--Cc--hhHHHHHHHHHhhhhcCCCCCCC-CCCCCC---CCc
Q psy1606 190 ALKG--------PICTYEF--S--------GGVNVDHK--NV--VGLVATTVAHEMGHNLGMEHDTT-ECTCPS---DRC 243 (285)
Q Consensus 190 A~~g--------~~C~~~~--~--------~~v~~~~~--~~--~~~~a~~~AHElGH~lG~~HD~~-~c~C~~---~~c 243 (285)
|||| |||...+ . .||+.... .. +-..-.|+|||||||||++||.. +|..+. +.|
T Consensus 345 AwVgsp~~~saGGIC~k~~~y~~G~~~sLNtGi~T~~NYg~~Vp~kvs~lt~AHEiGHNfGSpHDp~~ECsPg~~~~Gny 424 (764)
T KOG3658|consen 345 AWVGSPRSNSAGGICEKAYTYNEGKKRSLNTGISTSVNYGKRVPTKVSDLTLAHEIGHNFGSPHDPDIECSPGESQGGNY 424 (764)
T ss_pred EEeccCccCCCCcccccceeccCcceEEeecceeeeeecCCccCcchhheeehhhhccccCCCCCCCCccCCCCCCCCcE
Confidence 9986 6888642 2 23333221 11 11234799999999999999987 788754 679
Q ss_pred ccCCCCCC---CCCCccCccCHHHHHhHhccCccccccc
Q psy1606 244 IMAPSSSS---VSPTEWSSCSLEYLALSFDHGMDYCMRN 279 (285)
Q Consensus 244 IM~~~~~~---~~~~~fS~CS~~~~~~~l~~~~~~CL~~ 279 (285)
||++++.. .+..+||+||++.|..+|..++..|+..
T Consensus 425 iM~a~AtSGd~~NN~kFSpCS~ksI~~VL~~k~~~CFqE 463 (764)
T KOG3658|consen 425 IMYAYATSGDKPNNKKFSPCSLKSISKVLESKKRNCFQE 463 (764)
T ss_pred EEEEecccCcCccCcccCcchHHHHHHHHHHhhhhhccC
Confidence 99999863 6788999999999999999988899974
No 11
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=99.96 E-value=6.4e-29 Score=209.93 Aligned_cols=171 Identities=32% Similarity=0.456 Sum_probs=106.2
Q ss_pred ceEEEEEEEEehHHHHHhcCChHHHHHHHHHHHHHH-HhhcccccEEEEEEEEEEecCCCccc-----ccCCHHHHHHHH
Q psy1606 84 SRYLELVIVVDNRLYNLFNKNSKLVHRHCKDISNVI-NALYEKLNIFIALVGVVVWTEYDEIT-----LNVNGDITLTNF 157 (285)
Q Consensus 84 ~~~vel~vv~D~~~~~~~~~n~~~~~~~v~~~~n~v-~~iy~~~~i~v~l~~l~iw~~~~~~~-----~~~~~~~~l~~f 157 (285)
.++|||+|++|+.+++.|+++ .+++.++..+++.| +.+|++.+|++.|++|+||+..++.. ...+...+|+.|
T Consensus 2 ~~t~~l~v~~D~~f~~~~~~~-~~~~~~i~~~~~~v~n~~~~~~~I~~~i~~i~i~~~~~~~~~~~~~~~~~~~~~L~~f 80 (196)
T PF13688_consen 2 KTTCELLVVADYEFYQAFGSD-QAVISYIASLFNQVSNLYYRTSNIQFQISGITIWTSSDPYTNNVSNSNIDAEDTLQDF 80 (196)
T ss_dssp SEEEEEEEEE-HHHHHHTST--HHHHHHHHHHHHHHHHHHHT----EEEEEEEEE--S-B---TT-TSSS--HHHHHHHH
T ss_pred ceEEEEEEEEcHHHHHHhCCc-HHHHHHHHHHHHHHHHHHhhccCceEEEEEEEEecCCCCCcccccccccCHHHHHHHH
Confidence 478999999999999999873 38999999999999 88888999999999999999887643 235678999999
Q ss_pred HHhhhhccCCCCCCCceeeeeeeecCCCceeccccCCCCcccccCceeecCC------CchhHHHHHHHHHhhhhcCCCC
Q psy1606 158 LSYRKDRLVLSHPNDNAQLLTGMTFSDGVVGKALKGPICTYEFSGGVNVDHK------NVVGLVATTVAHEMGHNLGMEH 231 (285)
Q Consensus 158 ~~w~~~~~~~~~~~D~a~llt~~~~~~~~~G~A~~g~~C~~~~~~~v~~~~~------~~~~~~a~~~AHElGH~lG~~H 231 (285)
..+..... .....|+++|||++++.++ +|+||+|++|......++..... ......+.++||||||+||+.|
T Consensus 81 ~~~~~~~~-~~~~~dl~~l~t~~d~~~g-lGlA~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~AHEiGH~lGa~H 158 (196)
T PF13688_consen 81 NNDFSSWR-DSSYYDLAHLFTGRDFSGG-LGLAYVGSSCSNTSGGICSSSVSGSYGQVPPTYNGAITFAHEIGHNLGAPH 158 (196)
T ss_dssp --HHHTHH-HHTTSSEEEEEE----GGG-EEEESSSSSSTT-SSSSS--E-EEETTEE--HHHHHHHHHHHHHHHTT---
T ss_pred hhhhhhcc-CCCCCCEEEEEecccCCCC-ceeEeccccCcCCccceeccCcccccceECCCCceehhhHHhHHHhcCCCC
Confidence 53333211 1235799999999999666 99999999999775555554411 1234567999999999999999
Q ss_pred CCCC------CC-----CC-CCCcccCCCCCCCCCCccC
Q psy1606 232 DTTE------CT-----CP-SDRCIMAPSSSSVSPTEWS 258 (285)
Q Consensus 232 D~~~------c~-----C~-~~~cIM~~~~~~~~~~~fS 258 (285)
|+.. |. |+ .++|||+|... ++..+||
T Consensus 159 D~~~~~~~~~c~~~~~~~~~~~~~IM~y~~~-~n~~~FS 196 (196)
T PF13688_consen 159 DGDYSSTSSSCPPSSSTCPAGGGYIMSYCAS-PNNNKFS 196 (196)
T ss_dssp --SSS---TTTS-----GGGT---TTSSS---GGGGS--
T ss_pred CCCCCCCCCCCCCCccccCCCCCeEEcccCC-CCcccCC
Confidence 9863 32 33 36899999988 6678887
No 12
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=99.91 E-value=1.4e-23 Score=178.26 Aligned_cols=186 Identities=28% Similarity=0.364 Sum_probs=133.6
Q ss_pred ceEEEEEEEEehHHHHHh-cCChHHHHHHHHHHHHHHHhhcc-cccEEEEEEE--EEEecCC--Cccccc----CCHHHH
Q psy1606 84 SRYLELVIVVDNRLYNLF-NKNSKLVHRHCKDISNVINALYE-KLNIFIALVG--VVVWTEY--DEITLN----VNGDIT 153 (285)
Q Consensus 84 ~~~vel~vv~D~~~~~~~-~~n~~~~~~~v~~~~n~v~~iy~-~~~i~v~l~~--l~iw~~~--~~~~~~----~~~~~~ 153 (285)
.|.++|+|++|+.+++.| +++.+.++.++..+++.+|++|. .++|++.|+. ..+.++. +.+.-. .-.+..
T Consensus 2 ~~~~~l~v~~d~~~~~~~~~~s~~~v~~~i~~~v~~~N~~Y~s~v~i~~~Lv~~~~~~~t~s~~~~~~~~~~~~~~~~~~ 81 (206)
T PF13583_consen 2 LRVYRLAVVYDYSFYSWFFGGSIEDVKAFIASMVNFANEAYNSDVGIRFRLVHDEQLIYTESSSDDYNDLTSARYTLNNA 81 (206)
T ss_pred ceEEEEEEEECHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHhcCCCCEEEEeccccceecccccccccccchhhhhhhHH
Confidence 468999999999999999 57899999999999999999995 6778877764 3344443 222110 112356
Q ss_pred HHHHHHhhhhccCCCCCCCceeeeeeeecCCCceeccccCCCCcccccCceeecCCCchhHHHHHHHHHhhhhcCCCCCC
Q psy1606 154 LTNFLSYRKDRLVLSHPNDNAQLLTGMTFSDGVVGKALKGPICTYEFSGGVNVDHKNVVGLVATTVAHEMGHNLGMEHDT 233 (285)
Q Consensus 154 l~~f~~w~~~~~~~~~~~D~a~llt~~~~~~~~~G~A~~g~~C~~~~~~~v~~~~~~~~~~~a~~~AHElGH~lG~~HD~ 233 (285)
+..|..|+. ...+|++++++......+..|+||+++.|..+..+++..+...... ...++||||||+||+.||.
T Consensus 82 ~~~~~~~~~-----~~~~D~v~l~~~~~~~~~~~GlA~~~~~~~~~~~~~~~~~~~~~~~-~~~~~aHEiGH~lGl~H~~ 155 (206)
T PF13583_consen 82 LATFNSWRG-----SKNADLVILLTDTNKDSGTCGLAYVGSVCWDNAKAGAYSDPDLSAN-GYQTFAHEIGHNLGLRHDF 155 (206)
T ss_pred HHHHHHHhc-----ccCCCEEEEEeccccCCCCCCeeeeccccccccceEEEEecccccc-cchHHHHHHHHHhcCCCCc
Confidence 667777765 2478999999977544788999999999955544455444311111 1267999999999999998
Q ss_pred CC--------CCCCCCCcccCCCCCCCCCCccCccCHHHHHhHhccCcccc
Q psy1606 234 TE--------CTCPSDRCIMAPSSSSVSPTEWSSCSLEYLALSFDHGMDYC 276 (285)
Q Consensus 234 ~~--------c~C~~~~cIM~~~~~~~~~~~fS~CS~~~~~~~l~~~~~~C 276 (285)
.. +.|.++..||++..... ...||+||+.++++++..+...|
T Consensus 156 ~~~~~~~~~~~~~~~~~tIMsY~~~~~-~~~FS~~~i~~i~~~~~~~~~~~ 205 (206)
T PF13583_consen 156 DYSGDPYSYAYECDSGQTIMSYASRPS-RPYFSPPSIEYIREVVICGDASY 205 (206)
T ss_pred ccCCCcccccccCCCCCEEeCCCCCCC-CCCCCchhHHHHhCCCcCccccc
Confidence 42 34445679999754433 34599999999999887766555
No 13
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.90 E-value=1.1e-23 Score=173.26 Aligned_cols=148 Identities=26% Similarity=0.374 Sum_probs=121.7
Q ss_pred eEEEEEEEEehHHHHHhcCChHHHHHHHHHHHHHHHhhccc-ccEEEEEEEEEEecCCCcccccCCHHHHHHHHHHhhhh
Q psy1606 85 RYLELVIVVDNRLYNLFNKNSKLVHRHCKDISNVINALYEK-LNIFIALVGVVVWTEYDEITLNVNGDITLTNFLSYRKD 163 (285)
Q Consensus 85 ~~vel~vv~D~~~~~~~~~n~~~~~~~v~~~~n~v~~iy~~-~~i~v~l~~l~iw~~~~~~~~~~~~~~~l~~f~~w~~~ 163 (285)
|+||+.|++|+.+| +.+.+..++..+++.++.+|++ .+|++.++..++
T Consensus 1 ~~i~~~i~~d~~~~-----~~~~~~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~-------------------------- 49 (167)
T cd00203 1 KVIPYVVVADDRDV-----EEENLSAQIQSLILIAMQIWRDYLNIRFVLVGVEI-------------------------- 49 (167)
T ss_pred CEEEEEEEeccccc-----chhhhHHHHHHHHHHHHHHHHhhhCceEEEeccCC--------------------------
Confidence 58999999999998 4578888999999999999987 789888776543
Q ss_pred ccCCCCCCCceeeeeeeecCCCceeccccCCCCcccccCceeecCCCchhHHHHHHHHHhhhhcCCCCCCCCCC------
Q psy1606 164 RLVLSHPNDNAQLLTGMTFSDGVVGKALKGPICTYEFSGGVNVDHKNVVGLVATTVAHEMGHNLGMEHDTTECT------ 237 (285)
Q Consensus 164 ~~~~~~~~D~a~llt~~~~~~~~~G~A~~g~~C~~~~~~~v~~~~~~~~~~~a~~~AHElGH~lG~~HD~~~c~------ 237 (285)
.++|+++++++.++.++..|.|++++.|++..+.++...........+.++||||||.|||.||...|.
T Consensus 50 -----~~~di~~~~~~~~~~~~~~g~a~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~ 124 (167)
T cd00203 50 -----DKADIAILVTRQDFDGGTGGWAYLGRVCDSLRGVGVLQDNQSGTKEGAQTIAHELGHALGFYHDHDRKDRDDYPT 124 (167)
T ss_pred -----CcCcEEEEeccccCCCCceEEEecCCccCCCCCcEEEecCCcccccchhhHHHHHHHHhCCCccCcCCCCCCCcc
Confidence 368999999988876778999999999998777777655443233457899999999999999997654
Q ss_pred --------CCCCCcccCCCCCC---CCCCccCccCHHHHHhH
Q psy1606 238 --------CPSDRCIMAPSSSS---VSPTEWSSCSLEYLALS 268 (285)
Q Consensus 238 --------C~~~~cIM~~~~~~---~~~~~fS~CS~~~~~~~ 268 (285)
|.+..|||++.... .....||+||+++|+++
T Consensus 125 ~~~~~~~~~~~~~siM~y~~~~~~~~~~~~fS~~d~~~i~~~ 166 (167)
T cd00203 125 IDDTLNAEDDDYYSVMSYTKGSFSDGQRKDFSQCDIDQINKL 166 (167)
T ss_pred ccccccCCCCCCCeEeccCccccCcccCCCcCHHHHHHHHhh
Confidence 33478999998864 36789999999999875
No 14
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=99.90 E-value=7.2e-25 Score=181.33 Aligned_cols=154 Identities=30% Similarity=0.462 Sum_probs=50.8
Q ss_pred HHHHHHHHHHHHHHhhcc-cccEEEEEEE--EEEecCC--CcccccCCH----HHHHHHHHHhhhhccCCCCCCCceeee
Q psy1606 107 LVHRHCKDISNVINALYE-KLNIFIALVG--VVVWTEY--DEITLNVNG----DITLTNFLSYRKDRLVLSHPNDNAQLL 177 (285)
Q Consensus 107 ~~~~~v~~~~n~v~~iy~-~~~i~v~l~~--l~iw~~~--~~~~~~~~~----~~~l~~f~~w~~~~~~~~~~~D~a~ll 177 (285)
+++.++..++|.||.+|+ ++++++.|+. .+|.++. +++....+. ...+..|..|.. ...+|+++++
T Consensus 2 ~~~a~i~~~vn~vN~iY~~d~~i~~~lv~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~Di~~l~ 76 (173)
T PF13574_consen 2 NALAAIVTTVNRVNQIYERDINIRLELVNNDRIIFTDPSTDPYNNNCNGGNLISENQANFDSWIG-----SSNYDIGHLF 76 (173)
T ss_dssp -----------------------------------------------------------------------TCSS-CCEE
T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCC-----CCCCCEEEEe
Confidence 567889999999999997 6777776653 2333332 222221111 122334444433 2379999999
Q ss_pred eeeecCCCceeccccCCCCcccccC-ceeecCCCchhHHHHHHHHHhhhhcCCCCCCCCC------CCCC-CCcccCCCC
Q psy1606 178 TGMTFSDGVVGKALKGPICTYEFSG-GVNVDHKNVVGLVATTVAHEMGHNLGMEHDTTEC------TCPS-DRCIMAPSS 249 (285)
Q Consensus 178 t~~~~~~~~~G~A~~g~~C~~~~~~-~v~~~~~~~~~~~a~~~AHElGH~lG~~HD~~~c------~C~~-~~cIM~~~~ 249 (285)
. .+.++..|+||+|++|+..+.. |+....+..+ ...++||||||+||++||...+ .|+. +++||++.
T Consensus 77 ~--~~~~g~~Gla~~g~~c~~~~~~~g~~~~~~~~~--~~~~~aHElGH~lGa~Hd~~~~~~~~~~~~~~sg~~IMsy~- 151 (173)
T PF13574_consen 77 G--TFSGGGGGLAYVGGVCSKGYKATGLSSPYGDFF--GIDTFAHELGHQLGAPHDFDNGCNPGSPTCPGSGGYIMSYA- 151 (173)
T ss_dssp E--EESSGCCSSEEE-CCCCTCCCE-CCCS--TTSH--HHHHHHHHHHHHHT---SSSS----SS--T-STTT-TT-SS-
T ss_pred e--ccCCCCccEEEEeEeCcCCCEEEEEEcCCCcee--eeeeehhhhHhhcCCCCCCCCCCCCCCCcCCCCCCEEEeeC-
Confidence 3 3357899999999999876543 4443333223 4678999999999999998642 4554 78999977
Q ss_pred CCCCCCccCccCHHHHHhHhc
Q psy1606 250 SSVSPTEWSSCSLEYLALSFD 270 (285)
Q Consensus 250 ~~~~~~~fS~CS~~~~~~~l~ 270 (285)
+..+.+.||+||+++|+.+|.
T Consensus 152 ~~~~~~~fS~cS~~~i~~~l~ 172 (173)
T PF13574_consen 152 GICNNDKFSPCSIRQIRAVLG 172 (173)
T ss_dssp -GGGG----------------
T ss_pred CCCCCcccccccccccccccC
Confidence 545678999999999999986
No 15
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=99.79 E-value=1.3e-19 Score=141.89 Aligned_cols=122 Identities=31% Similarity=0.399 Sum_probs=78.0
Q ss_pred HHHHHHHHHHHhhcc-cccEEEEEEEEEEecCCCcccccCCHHHHHHHHHHhhhhccCCCCCCCceeeeeeeecCCCcee
Q psy1606 110 RHCKDISNVINALYE-KLNIFIALVGVVVWTEYDEITLNVNGDITLTNFLSYRKDRLVLSHPNDNAQLLTGMTFSDGVVG 188 (285)
Q Consensus 110 ~~v~~~~n~v~~iy~-~~~i~v~l~~l~iw~~~~~~~~~~~~~~~l~~f~~w~~~~~~~~~~~D~a~llt~~~~~~~~~G 188 (285)
.++..+++.+|++|+ +++|++.|+++.+|+..+......+....|..+..+..........+|++++|+..++ .+..|
T Consensus 2 ~~i~~~v~~~n~~y~~~~~i~l~l~~i~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~aD~v~~~~~~~~-~~~~G 80 (124)
T PF13582_consen 2 AYINALVNRANQAYESNSGIRLRLVGIIYTDYSDDPSSSSDSNSTLDNFYASFDARDRDQYGADLVHLFTGNDL-NGAGG 80 (124)
T ss_dssp -----HHHHHHHHHTTTGGGTEEEEEEEEE-S-----SSS-HHHHHHHHHHHHHC--GGGTT-TTSEEEEEESS-GGB-E
T ss_pred chHHHHHHHHHHHHHhCCCcEEEEEEEEEeccccccccCccHHHHHHHHHHhhhhhccccCCCCEEEEeccCCC-CCCce
Confidence 467788999999999 8899999999999998876655567788999998886543233457999999998876 67899
Q ss_pred ccccCCCCcccccCceeecCCCchhHHHHHHHHHhhhhcCCCCC
Q psy1606 189 KALKGPICTYEFSGGVNVDHKNVVGLVATTVAHEMGHNLGMEHD 232 (285)
Q Consensus 189 ~A~~g~~C~~~~~~~v~~~~~~~~~~~a~~~AHElGH~lG~~HD 232 (285)
+||++++|.+.+..++.............++||||||+||+.||
T Consensus 81 ~A~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGH~lGl~Hd 124 (124)
T PF13582_consen 81 IAYIGGVCSNNSGYGVNYDSSPYGDSGVDTFAHEIGHNLGLNHD 124 (124)
T ss_dssp EEECT---TTT--TT-SEEEEE--TTSTTHHHHHHHHHTT----
T ss_pred EEEeCCccccCCceEEEcccCCCCcccceEeeehhhHhcCCCCC
Confidence 99999999998777777665211122247999999999999998
No 16
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=98.94 E-value=5.6e-09 Score=86.40 Aligned_cols=73 Identities=29% Similarity=0.391 Sum_probs=58.5
Q ss_pred CCCceeeeeeeecCCCceeccccCCCCcccccCceeecCCCchh-------------HHHHHHHHHhhhhcCCCCCCCCC
Q psy1606 170 PNDNAQLLTGMTFSDGVVGKALKGPICTYEFSGGVNVDHKNVVG-------------LVATTVAHEMGHNLGMEHDTTEC 236 (285)
Q Consensus 170 ~~D~a~llt~~~~~~~~~G~A~~g~~C~~~~~~~v~~~~~~~~~-------------~~a~~~AHElGH~lG~~HD~~~c 236 (285)
..+.++.+|..|... .|.+|+.|.|.+..+++|+........ .+...++||+||+|||.|
T Consensus 69 ~~~~vl~vt~~DLy~--~~~nfvfG~a~~~~~~aVvS~~Rl~~~~~~~~~~~~l~~~R~~k~~~HElGH~lGL~H----- 141 (179)
T PRK13267 69 NGDKNIGITDCDLYY--RGLNFVFGLAYPNLRGAVISTYRLRPEFYGNKPDSELFEERVRKEVTHELGHTLGLEH----- 141 (179)
T ss_pred CCceEEEEEccccCC--CCCCceeEEeecCCCEEEEEccccChhhccccchHHHHHHHHHHHHHHHHHHHcCCcc-----
Confidence 578899999888743 589999999999888888877653221 245669999999999999
Q ss_pred CCCCCCcccCCCCC
Q psy1606 237 TCPSDRCIMAPSSS 250 (285)
Q Consensus 237 ~C~~~~cIM~~~~~ 250 (285)
|....|+|+...+
T Consensus 142 -C~~~~CvM~~s~s 154 (179)
T PRK13267 142 -CDNPRCVMNFSNS 154 (179)
T ss_pred -CCCCCccCCCCCC
Confidence 7778999997654
No 17
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=98.39 E-value=5.7e-07 Score=75.04 Aligned_cols=67 Identities=24% Similarity=0.333 Sum_probs=36.9
Q ss_pred eeeeeeecCCCceeccccCCCCcccccCceeecCC-----Cc----hhHHHHHHHHHhhhhcCCCCCCCCCCCCCCCccc
Q psy1606 175 QLLTGMTFSDGVVGKALKGPICTYEFSGGVNVDHK-----NV----VGLVATTVAHEMGHNLGMEHDTTECTCPSDRCIM 245 (285)
Q Consensus 175 ~llt~~~~~~~~~G~A~~g~~C~~~~~~~v~~~~~-----~~----~~~~a~~~AHElGH~lG~~HD~~~c~C~~~~cIM 245 (285)
..+|..|.. ..|+.++-|.-.+....+|..... .+ ...+...++||+||.||+.| |.. .|+|
T Consensus 98 l~it~~DlY--~~~~nfVFG~A~~~~~~aVvS~~~~~fy~~~~~l~~~R~~Kea~HElGH~~GL~H------C~~-~CvM 168 (194)
T PF07998_consen 98 LGITDRDLY--SPGLNFVFGLARPGGGVAVVSTSRNEFYGEDEELFLERVCKEAVHELGHLFGLDH------CEN-RCVM 168 (194)
T ss_dssp EEEESS-EE--ETTESEBSEEEECCSSEEEEEGGCGGGGTSSHHHHHHHHHHHHHHHHHHHTT----------SS-TSTT
T ss_pred EEEeccccC--CCCCceEEEEeecCCCeEEEEEeccccccccHHHHHHHHHHHHHHHHHHHcCCcC------CCC-CCcc
Confidence 666766652 234444444444333444543322 12 23467899999999999999 888 9999
Q ss_pred CCCCC
Q psy1606 246 APSSS 250 (285)
Q Consensus 246 ~~~~~ 250 (285)
++..+
T Consensus 169 ~~s~s 173 (194)
T PF07998_consen 169 NFSNS 173 (194)
T ss_dssp S--SS
T ss_pred CCCCC
Confidence 98654
No 18
>PF01562 Pep_M12B_propep: Reprolysin family propeptide; InterPro: IPR002870 This signature covers the region of the propeptide for members of the MEROPS peptidase family M12B (clan MA(M), adamalysin family). The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins, which mediate cell-cell or cell-matrix interactions.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis
Probab=98.02 E-value=2.6e-06 Score=67.06 Aligned_cols=36 Identities=25% Similarity=0.146 Sum_probs=31.4
Q ss_pred cccccccccccCCcceEEEeecccccCC--------ceEEeccCc
Q psy1606 2 LHCQHLSEKLEYGIETVAILSHSFLNPG--------GFIIPCRSA 38 (285)
Q Consensus 2 ~~c~y~g~~~~~~~~~v~~~~~~~~~~~--------~~i~p~~~~ 38 (285)
.+|+|+|++.+.+.|.||+|+|.|| .| |+|||++..
T Consensus 77 ~~c~Y~G~V~g~~~S~vals~c~gl-~G~i~~~~~~y~IeP~~~~ 120 (131)
T PF01562_consen 77 EHCHYQGHVEGDPGSSVALSLCGGL-RGVIQTDDETYFIEPLPRY 120 (131)
T ss_pred cceEEEeeeCCCCCCEEEEEECCce-EEEEEECCEEEEEEECCcC
Confidence 5899999999999999999999887 44 889998644
No 19
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=97.93 E-value=4.1e-06 Score=67.70 Aligned_cols=50 Identities=40% Similarity=0.699 Sum_probs=36.1
Q ss_pred HHHHHHHHhhhhcCCCCCCCCCCCCCCCcccCCCCCC--CCCCccCccC--HHHHHhHhc
Q psy1606 215 VATTVAHEMGHNLGMEHDTTECTCPSDRCIMAPSSSS--VSPTEWSSCS--LEYLALSFD 270 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~~--~~~~~fS~CS--~~~~~~~l~ 270 (285)
+-.-+.||+||.||+.| |+...|+|+...+. ..-+.|+-|. ++.++..+.
T Consensus 124 v~KEv~HElGH~~GL~H------C~N~~CVM~FSnSl~dvDrKs~~fC~~C~~kL~~~l~ 177 (181)
T COG1913 124 VVKEVLHELGHLLGLSH------CPNPRCVMNFSNSLRDVDRKSPNFCNSCLRKLERKLK 177 (181)
T ss_pred HHHHHHHHhhhhcCccc------CCCCCcEEeCCccHHhhhccchhhhHHHHHHHHHhhc
Confidence 45789999999999999 99999999998762 2334454443 455555443
No 20
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=97.72 E-value=9.1e-05 Score=63.73 Aligned_cols=117 Identities=19% Similarity=0.178 Sum_probs=61.9
Q ss_pred HHHHHHHHHHHhhccc------ccEEEEEEEEEEecCCCccc-ccCCHHHHHHHHHHhhhhccCCCCCCCceeeeeeeec
Q psy1606 110 RHCKDISNVINALYEK------LNIFIALVGVVVWTEYDEIT-LNVNGDITLTNFLSYRKDRLVLSHPNDNAQLLTGMTF 182 (285)
Q Consensus 110 ~~v~~~~n~v~~iy~~------~~i~v~l~~l~iw~~~~~~~-~~~~~~~~l~~f~~w~~~~~~~~~~~D~a~llt~~~~ 182 (285)
..+...+..+|..|+. .+|++.|.++.=-....... .....+..++....|- ....+-+.+... .
T Consensus 25 ~~i~~qi~vlN~~y~~~~~~~~~~i~F~Lagi~r~~n~~~~~~~~~~~~~~~k~~l~~~-------~~~~LNIy~~~~-~ 96 (225)
T cd04275 25 AQITDQIDVLNDDYSGLNAGVDLGIEFVLAGTTRTVNSAWPVFAGSGTEDAMKSALRKG-------GYKYLNIYVANF-L 96 (225)
T ss_pred HHHHHHHHHHHHHhhccccCCCCceEEEEeeeEEecCCCcccccCcchHHHHHHHhccC-------CcCeEEEEEECC-C
Confidence 3566778889999998 89999999864322222221 1112222333222111 112233333222 1
Q ss_pred CCCceeccccCCCCccc--ccCceeecCCC------chhHHHHHHHHHhhhhcCCCCCCC
Q psy1606 183 SDGVVGKALKGPICTYE--FSGGVNVDHKN------VVGLVATTVAHEMGHNLGMEHDTT 234 (285)
Q Consensus 183 ~~~~~G~A~~g~~C~~~--~~~~v~~~~~~------~~~~~a~~~AHElGH~lG~~HD~~ 234 (285)
.++.+|.|+.-..-... ..-||+..... .....+.|++||+||-|||.|.-.
T Consensus 97 ~~~~lG~a~fP~~~~~~~~~~dGvvi~~~~~~~~~~~~~n~g~t~~HEvGH~lGL~HtF~ 156 (225)
T cd04275 97 GGGLLGYATFPDSLVSLAFITDGVVINPSSLPGGSAAPYNLGDTATHEVGHWLGLYHTFQ 156 (225)
T ss_pred CCCcCEEEECCCcccCCccccceEEEeccccCCCCcccccccceeEEeccceeeeeeeec
Confidence 35677888754433321 12244433211 012235799999999999999754
No 21
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=97.72 E-value=3.7e-05 Score=61.90 Aligned_cols=48 Identities=21% Similarity=0.401 Sum_probs=34.6
Q ss_pred HHHHHHHHhhhhcCCCCCCCCCCCCCCCcccCCCCCCCCCCccCccCHHHHHhH
Q psy1606 215 VATTVAHEMGHNLGMEHDTTECTCPSDRCIMAPSSSSVSPTEWSSCSLEYLALS 268 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~~~~~~~fS~CS~~~~~~~ 268 (285)
+-.|++|||||.|||.|... ..-||+|.........++..-+.-++.+
T Consensus 105 ~~~v~~HEiGHaLGL~H~~~------~~svM~~~~~~~~~~~l~~~Di~~i~~l 152 (154)
T PF00413_consen 105 LQSVAIHEIGHALGLDHSND------PNSVMYPYYRGPDNKTLSEDDIDGIQYL 152 (154)
T ss_dssp HHHHHHHHHHHHTTBESSSS------TTSTTSSSCTSSSSTSTTHHHHHHHHHH
T ss_pred hhhhhhhccccccCcCcCCC------cccceeeecccCCCCCCCHHHHHHHHHH
Confidence 34799999999999999753 4569999977532236666666655544
No 22
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=97.54 E-value=0.00068 Score=55.19 Aligned_cols=35 Identities=29% Similarity=0.505 Sum_probs=25.6
Q ss_pred HHHHHHHhhhhcCCCCCCCCCCCC----------CCCcccCCCCC
Q psy1606 216 ATTVAHEMGHNLGMEHDTTECTCP----------SDRCIMAPSSS 250 (285)
Q Consensus 216 a~~~AHElGH~lG~~HD~~~c~C~----------~~~cIM~~~~~ 250 (285)
-.|++|||||.||+.|....-.-. ...-||.+...
T Consensus 95 ~~~~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SvM~y~~~ 139 (165)
T cd04268 95 RNTAEHELGHALGLRHNFAASDRDDNVDLLAEKGDTSSVMDYAPS 139 (165)
T ss_pred HHHHHHHHHHHhcccccCcCCccCCcchhhccCCCCcccCCCCcc
Confidence 479999999999999986521111 14689999764
No 23
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=97.27 E-value=0.00026 Score=57.40 Aligned_cols=49 Identities=24% Similarity=0.323 Sum_probs=35.1
Q ss_pred HHHHHHHHhhhhcCCCCCCCCCCCCCCCcccCCCCCCCC--CCccCccCHHHHHhH
Q psy1606 215 VATTVAHEMGHNLGMEHDTTECTCPSDRCIMAPSSSSVS--PTEWSSCSLEYLALS 268 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~~~~--~~~fS~CS~~~~~~~ 268 (285)
+-.|++|||||.||+.|.... ..=||+|...... ....+...++-++.+
T Consensus 104 ~~~~~~HEiGHaLGL~H~~~~-----~~siM~p~~~~~~~~~~~l~~~D~~~i~~l 154 (156)
T cd04279 104 LQAIALHELGHALGLWHHSDR-----PEDAMYPSQGQGPDGNPTLSARDVATLKRL 154 (156)
T ss_pred HHHHHHHHhhhhhcCCCCCCC-----ccceeeeecccCCcCcCCCCHHHHHHHHHH
Confidence 347999999999999998753 1368999876422 246777776666554
No 24
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=97.24 E-value=0.00066 Score=57.30 Aligned_cols=23 Identities=30% Similarity=0.745 Sum_probs=19.5
Q ss_pred chhHHHHHHHHHhhhhcCCCCCC
Q psy1606 211 VVGLVATTVAHEMGHNLGMEHDT 233 (285)
Q Consensus 211 ~~~~~a~~~AHElGH~lG~~HD~ 233 (285)
.....+.+|+|||||.+|+.|.+
T Consensus 129 ~~~~~~hvi~HEiGH~IGfRHTD 151 (211)
T PF12388_consen 129 SVNVIEHVITHEIGHCIGFRHTD 151 (211)
T ss_pred chhHHHHHHHHHhhhhccccccC
Confidence 34456889999999999999976
No 25
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=97.21 E-value=0.00032 Score=56.92 Aligned_cols=46 Identities=26% Similarity=0.385 Sum_probs=33.5
Q ss_pred HHHHHHHhhhhcCCCCCCCCCCCCCCCcccCCCCCCC-CCCccCccCHHHHHh
Q psy1606 216 ATTVAHEMGHNLGMEHDTTECTCPSDRCIMAPSSSSV-SPTEWSSCSLEYLAL 267 (285)
Q Consensus 216 a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~~~-~~~~fS~CS~~~~~~ 267 (285)
-.|+.|||||.||+.|... ..=||.|..... ....+|..-++-++.
T Consensus 108 ~~~~~HEiGHaLGL~H~~~------~~~vM~~~~~~~~~~~~l~~~D~~~~~~ 154 (157)
T cd04278 108 FSVAAHEIGHALGLGHSSD------PDSIMYPYYQGPVPKFKLSQDDIRGIQA 154 (157)
T ss_pred HHHHHHHhccccccCCCCC------CcCeecccccCCCcCCCCCHHHHHHHHH
Confidence 4699999999999999653 356999887642 245677666555554
No 26
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=96.88 E-value=0.0013 Score=59.34 Aligned_cols=70 Identities=23% Similarity=0.245 Sum_probs=38.7
Q ss_pred CceeeeeeeecCCCceeccccCCCCcccccCceeecCCCchhHHHHHHHHHhhhhcCCCCCCC-CCCCCCCCcccCCC
Q psy1606 172 DNAQLLTGMTFSDGVVGKALKGPICTYEFSGGVNVDHKNVVGLVATTVAHEMGHNLGMEHDTT-ECTCPSDRCIMAPS 248 (285)
Q Consensus 172 D~a~llt~~~~~~~~~G~A~~g~~C~~~~~~~v~~~~~~~~~~~a~~~AHElGH~lG~~HD~~-~c~C~~~~cIM~~~ 248 (285)
...+++.-..+..+..|+|.+.|-+..-+ +.... .. -..++.|||||||||.|-+. +|.=++..|.|...
T Consensus 114 ~h~vyvlP~~~~C~w~Gla~v~G~~~~~~---~~~~~---~~-~~~~~~HElgHN~GL~Ha~~~~~eYgD~s~~MG~g 184 (314)
T PF05548_consen 114 THRVYVLPPGFACGWAGLATVPGSQSWLW---ISGYG---VQ-DWATIMHELGHNLGLWHAGRNGCEYGDPSDPMGCG 184 (314)
T ss_pred ceEEEEcCCCCCCCceEEeecCCcceeee---ecCcc---cc-cHHHHHHHhhhhccccccCCCCccccCCCccCCCC
Confidence 33333333444356778888754433211 11110 01 12489999999999999864 33334567888633
No 27
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=96.67 E-value=0.0011 Score=51.15 Aligned_cols=31 Identities=32% Similarity=0.591 Sum_probs=18.7
Q ss_pred HHHHHHHhhhhcCCCCCCCCCCCCCCCcccCCCCC
Q psy1606 216 ATTVAHEMGHNLGMEHDTTECTCPSDRCIMAPSSS 250 (285)
Q Consensus 216 a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~ 250 (285)
-.++||||||.||++....+ .|. -+|+....
T Consensus 78 ~RIaaHE~GHiLGLPD~y~G-pCS---~LMSG~~a 108 (132)
T PF02031_consen 78 TRIAAHELGHILGLPDHYPG-PCS---ELMSGGSA 108 (132)
T ss_dssp HHHHHHHHHHHHT----TTS--TT----GGGTTTT
T ss_pred ceeeeehhccccCCCCCCCC-ccH---HhhcCCCC
Confidence 36999999999999876543 342 47877654
No 28
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=96.57 E-value=0.0011 Score=52.66 Aligned_cols=18 Identities=44% Similarity=0.870 Sum_probs=16.2
Q ss_pred HHHHHHhhhhcCCCCCCC
Q psy1606 217 TTVAHEMGHNLGMEHDTT 234 (285)
Q Consensus 217 ~~~AHElGH~lG~~HD~~ 234 (285)
.+++|||||.||+.|...
T Consensus 88 ~~~~HEigHaLGl~H~~~ 105 (140)
T smart00235 88 GVAAHELGHALGLYHEQS 105 (140)
T ss_pred ccHHHHHHHHhcCCcCCC
Confidence 389999999999999864
No 29
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=96.27 E-value=0.0016 Score=52.70 Aligned_cols=23 Identities=39% Similarity=0.723 Sum_probs=15.0
Q ss_pred HHHHHHHHhhhhcCCCCCCC-CCC
Q psy1606 215 VATTVAHEMGHNLGMEHDTT-ECT 237 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~HD~~-~c~ 237 (285)
.+.|++||+||-|||.|--. +|.
T Consensus 69 ~g~TltHEvGH~LGL~HtF~~~C~ 92 (154)
T PF05572_consen 69 FGKTLTHEVGHWLGLYHTFGGGCD 92 (154)
T ss_dssp SSHHHHHHHHHHTT---TT---TS
T ss_pred cccchhhhhhhhhcccccccCCCC
Confidence 35899999999999999753 454
No 30
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=96.08 E-value=0.0033 Score=52.38 Aligned_cols=35 Identities=31% Similarity=0.476 Sum_probs=25.2
Q ss_pred HHHHHHHhhhhcCCCCCCCC--CCC--------CCCCcccCCCCC
Q psy1606 216 ATTVAHEMGHNLGMEHDTTE--CTC--------PSDRCIMAPSSS 250 (285)
Q Consensus 216 a~~~AHElGH~lG~~HD~~~--c~C--------~~~~cIM~~~~~ 250 (285)
-.|+.|||||.|||.|.... ... ...--||++...
T Consensus 114 ~~t~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SVMSY~~~ 158 (186)
T cd04277 114 YQTIIHEIGHALGLEHPGDYNGGDPVPPTYALDSREYTVMSYNSG 158 (186)
T ss_pred HHHHHHHHHHHhcCCCCCcCCCCCCCCccccccCcceEEEeecCC
Confidence 37999999999999998742 111 113579988755
No 31
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=95.78 E-value=0.0085 Score=50.49 Aligned_cols=20 Identities=40% Similarity=0.773 Sum_probs=17.3
Q ss_pred HHHHHHHHhhhhcCCCCCCC
Q psy1606 215 VATTVAHEMGHNLGMEHDTT 234 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~HD~~ 234 (285)
...+++||+||.||+.|.-.
T Consensus 116 ~~~~~~he~gh~lGl~hn~~ 135 (197)
T cd04276 116 LRYLLAHEVGHTLGLRHNFK 135 (197)
T ss_pred HHHHHHHHHHHHhcCccccc
Confidence 45799999999999999754
No 32
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=94.78 E-value=0.014 Score=49.30 Aligned_cols=18 Identities=44% Similarity=0.804 Sum_probs=15.9
Q ss_pred HHHHHHhhhhcCCCCCCC
Q psy1606 217 TTVAHEMGHNLGMEHDTT 234 (285)
Q Consensus 217 ~~~AHElGH~lG~~HD~~ 234 (285)
.++.|||||.||+.|...
T Consensus 94 ~~i~HElgHaLG~~HEh~ 111 (198)
T cd04327 94 RVVLHEFGHALGFIHEHQ 111 (198)
T ss_pred HHHHHHHHHHhcCccccc
Confidence 589999999999999754
No 33
>PF10462 Peptidase_M66: Peptidase M66; InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=94.39 E-value=0.022 Score=51.01 Aligned_cols=16 Identities=44% Similarity=0.989 Sum_probs=13.1
Q ss_pred HHHHHHHhhhhcCCCC
Q psy1606 216 ATTVAHEMGHNLGMEH 231 (285)
Q Consensus 216 a~~~AHElGH~lG~~H 231 (285)
..+++||+||+||+.|
T Consensus 194 ~~~f~HE~GH~~GL~H 209 (305)
T PF10462_consen 194 GNEFSHELGHNFGLGH 209 (305)
T ss_dssp HHHHHHHHHHTTT--S
T ss_pred cceeehhhhhhcCCCC
Confidence 5799999999999999
No 34
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=94.12 E-value=0.024 Score=47.17 Aligned_cols=17 Identities=35% Similarity=0.649 Sum_probs=15.3
Q ss_pred HHHHHHhhhhcCCCCCC
Q psy1606 217 TTVAHEMGHNLGMEHDT 233 (285)
Q Consensus 217 ~~~AHElGH~lG~~HD~ 233 (285)
.+++|||||.||+.|..
T Consensus 79 G~i~HEl~HaLG~~HEh 95 (182)
T cd04283 79 GIIQHELLHALGFYHEQ 95 (182)
T ss_pred chHHHHHHHHhCCcccc
Confidence 48999999999999964
No 35
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=93.75 E-value=0.03 Score=47.35 Aligned_cols=17 Identities=41% Similarity=0.779 Sum_probs=15.1
Q ss_pred HHHHHHhhhhcCCCCCC
Q psy1606 217 TTVAHEMGHNLGMEHDT 233 (285)
Q Consensus 217 ~~~AHElGH~lG~~HD~ 233 (285)
.+++|||||.||+.|..
T Consensus 89 Gti~HEl~HaLGf~HEh 105 (200)
T cd04281 89 GIVVHELGHVIGFWHEH 105 (200)
T ss_pred chHHHHHHHHhcCcchh
Confidence 48999999999999954
No 36
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=93.67 E-value=0.029 Score=46.56 Aligned_cols=17 Identities=41% Similarity=0.733 Sum_probs=15.4
Q ss_pred HHHHHHhhhhcCCCCCC
Q psy1606 217 TTVAHEMGHNLGMEHDT 233 (285)
Q Consensus 217 ~~~AHElGH~lG~~HD~ 233 (285)
.++.|||||.||+.|..
T Consensus 76 g~v~HE~~HalG~~HEh 92 (180)
T cd04280 76 GTIVHELMHALGFYHEQ 92 (180)
T ss_pred chhHHHHHHHhcCcchh
Confidence 58999999999999964
No 37
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=93.35 E-value=0.033 Score=46.71 Aligned_cols=18 Identities=44% Similarity=0.822 Sum_probs=13.8
Q ss_pred HHHHHHHhhhhcCCCCCC
Q psy1606 216 ATTVAHEMGHNLGMEHDT 233 (285)
Q Consensus 216 a~~~AHElGH~lG~~HD~ 233 (285)
..+++|||||.||+.|..
T Consensus 80 ~~~i~HEl~HaLG~~HEh 97 (191)
T PF01400_consen 80 VGTILHELGHALGFWHEH 97 (191)
T ss_dssp HHHHHHHHHHHHTB--GG
T ss_pred ccchHHHHHHHHhhhhhh
Confidence 358999999999999954
No 38
>KOG1565|consensus
Probab=93.27 E-value=0.06 Score=51.34 Aligned_cols=30 Identities=37% Similarity=0.605 Sum_probs=24.2
Q ss_pred HHHHHHHHhhhhcCCCCCCCCCCCCCCCcccCCCCC
Q psy1606 215 VATTVAHEMGHNLGMEHDTTECTCPSDRCIMAPSSS 250 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~ 250 (285)
+..|+|||+||.||+.|-.. .+=||+|...
T Consensus 211 l~~Va~HEiGH~LGL~HS~~------~~aiM~P~y~ 240 (469)
T KOG1565|consen 211 LFLVAAHEIGHALGLGHSSD------PDAIMYPFYQ 240 (469)
T ss_pred hHHHhhhhcccccccCCCCC------cccccccccc
Confidence 45799999999999999633 4568888876
No 39
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=92.62 E-value=0.056 Score=46.65 Aligned_cols=17 Identities=47% Similarity=0.730 Sum_probs=15.2
Q ss_pred HHHHHHhhhhcCCCCCC
Q psy1606 217 TTVAHEMGHNLGMEHDT 233 (285)
Q Consensus 217 ~~~AHElGH~lG~~HD~ 233 (285)
.+++|||||.||+.|..
T Consensus 122 Gti~HEl~HalGf~HEq 138 (230)
T cd04282 122 ATVEHEFLHALGFYHEQ 138 (230)
T ss_pred chHHHHHHHHhCCcccc
Confidence 48999999999999954
No 40
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=91.91 E-value=0.044 Score=49.00 Aligned_cols=33 Identities=24% Similarity=0.405 Sum_probs=22.6
Q ss_pred HHHHHHHhhhhcCCCCCCCC-CCCC----CCCcccCCC
Q psy1606 216 ATTVAHEMGHNLGMEHDTTE-CTCP----SDRCIMAPS 248 (285)
Q Consensus 216 a~~~AHElGH~lG~~HD~~~-c~C~----~~~cIM~~~ 248 (285)
..|+|||+||.||+++=-.. .... ..-+||+..
T Consensus 166 igv~~HE~gH~lGLPDlY~~~~~~~~~~vG~w~lM~~G 203 (286)
T TIGR03296 166 VGVIAHELGHDLGLPDLYDTSYDGGGEPVGYWSLMSSG 203 (286)
T ss_pred eeeeehhhhcccCCCccccCcCCCCCCCccccccccCC
Confidence 36999999999999884321 1111 245999883
No 41
>smart00608 ACR ADAM Cysteine-Rich Domain.
Probab=89.63 E-value=0.43 Score=37.77 Aligned_cols=60 Identities=30% Similarity=0.455 Sum_probs=41.5
Q ss_pred CcccccccccccCCcceEEEeecccccCCceEEeccCccccccccccCCCCCCCCCCCCCCCcccc
Q psy1606 1 MLHCQHLSEKLEYGIETVAILSHSFLNPGGFIIPCRSAIVDLGLNQVDPGLSPDGARCGDGKTKRD 66 (285)
Q Consensus 1 ~~~c~y~g~~~~~~~~~v~~~~~~~~~~~~~i~p~~~~~~~~~l~~~~~~~~~~~~~Cg~~~~~~~ 66 (285)
.|+|.+..+.+..+.....+.+.. .-..|+++..+......|+++.+++++||.+|...+
T Consensus 72 kL~C~~~~~~p~~~~~~~~~~~~~------~~~~C~~~~~~~~~~~~d~g~V~dGT~CG~~kvC~n 131 (137)
T smart00608 72 KLQCTNVSSLPLLGEHATVIYSNI------GGLVCWSLDYHLGSDSTDLGMVKDGTKCGPGKVCIN 131 (137)
T ss_pred eEEecCCCccccccCcceEEEecc------CCCEEEEeeECCCCCCCCccccCCCCCcCCCCccCC
Confidence 478888888877666555544432 123577665555555568899999999999987654
No 42
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=88.90 E-value=0.19 Score=38.35 Aligned_cols=20 Identities=35% Similarity=0.762 Sum_probs=17.2
Q ss_pred HHHHHHHHHhhhhcCCCCCC
Q psy1606 214 LVATTVAHEMGHNLGMEHDT 233 (285)
Q Consensus 214 ~~a~~~AHElGH~lG~~HD~ 233 (285)
.++.++-|||||-||+.-|.
T Consensus 108 ~vthvliHEIgHhFGLsDdd 127 (136)
T COG3824 108 QVTHVLIHEIGHHFGLSDDD 127 (136)
T ss_pred HhhhhhhhhhhhhcCCChhH
Confidence 35789999999999998775
No 43
>KOG3714|consensus
Probab=88.81 E-value=0.16 Score=47.85 Aligned_cols=17 Identities=41% Similarity=0.696 Sum_probs=15.2
Q ss_pred HHHHHHhhhhcCCCCCC
Q psy1606 217 TTVAHEMGHNLGMEHDT 233 (285)
Q Consensus 217 ~~~AHElGH~lG~~HD~ 233 (285)
.|++|||+|.||..|..
T Consensus 161 G~i~HEl~HaLGf~Heh 177 (411)
T KOG3714|consen 161 GTIVHELMHALGFWHEH 177 (411)
T ss_pred chhHHHHHHHhhhhhcc
Confidence 48999999999999954
No 44
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=88.50 E-value=0.24 Score=41.80 Aligned_cols=16 Identities=31% Similarity=0.447 Sum_probs=13.9
Q ss_pred hhHHHHHHHHHhhhhc
Q psy1606 212 VGLVATTVAHEMGHNL 227 (285)
Q Consensus 212 ~~~~a~~~AHElGH~l 227 (285)
++.++.++||||+|.|
T Consensus 33 yg~lG~ilahel~haf 48 (206)
T PF01431_consen 33 YGGLGFILAHELMHAF 48 (206)
T ss_dssp HHTHHHHHHHHHHHCT
T ss_pred HHHHHHHHHHHHHHHH
Confidence 5567899999999988
No 45
>PF12044 Metallopep: Putative peptidase family; InterPro: IPR021917 This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length.
Probab=88.47 E-value=0.27 Score=46.18 Aligned_cols=18 Identities=33% Similarity=0.637 Sum_probs=15.6
Q ss_pred HHHHHhhhhcCCCCCCCC
Q psy1606 218 TVAHEMGHNLGMEHDTTE 235 (285)
Q Consensus 218 ~~AHElGH~lG~~HD~~~ 235 (285)
.+-|||||.||++|-..+
T Consensus 319 A~lHEiGH~fg~pH~~~G 336 (423)
T PF12044_consen 319 AFLHEIGHLFGCPHQEDG 336 (423)
T ss_pred HHHHHHHHhcCCCCCCCC
Confidence 588999999999997753
No 46
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=87.30 E-value=0.4 Score=42.30 Aligned_cols=20 Identities=35% Similarity=0.536 Sum_probs=12.7
Q ss_pred HHHHHHHHhhhhcCCCCCCC
Q psy1606 215 VATTVAHEMGHNLGMEHDTT 234 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~HD~~ 234 (285)
...++.||+||.||--=|+.
T Consensus 216 ~~~v~vHE~GHsf~~LaDEY 235 (264)
T PF09471_consen 216 FKQVVVHEFGHSFGGLADEY 235 (264)
T ss_dssp HHHHHHHHHHHHTT------
T ss_pred ccceeeeecccccccccccc
Confidence 46899999999999888874
No 47
>KOG3624|consensus
Probab=84.17 E-value=0.97 Score=45.49 Aligned_cols=17 Identities=24% Similarity=0.464 Sum_probs=14.6
Q ss_pred chhHHHHHHHHHhhhhc
Q psy1606 211 VVGLVATTVAHEMGHNL 227 (285)
Q Consensus 211 ~~~~~a~~~AHElGH~l 227 (285)
+++.++.+|||||+|.|
T Consensus 514 nyg~iG~vigHEl~H~F 530 (687)
T KOG3624|consen 514 NYGGIGFVIGHELTHGF 530 (687)
T ss_pred hhHHHHHHHHHHHhhcc
Confidence 45667899999999988
No 48
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=84.16 E-value=0.76 Score=34.09 Aligned_cols=19 Identities=32% Similarity=0.791 Sum_probs=14.4
Q ss_pred HHHHHHHHHhhhhcCCCCC
Q psy1606 214 LVATTVAHEMGHNLGMEHD 232 (285)
Q Consensus 214 ~~a~~~AHElGH~lG~~HD 232 (285)
.+..|+-||+||.||+.-|
T Consensus 72 ~I~~tlvhEiah~fG~~~e 90 (97)
T PF06262_consen 72 LIRDTLVHEIAHHFGISDE 90 (97)
T ss_dssp HHHHHHHHHHHHHTT--HH
T ss_pred HHHHHHHHHHHHHcCCCHH
Confidence 3578999999999998654
No 49
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=83.65 E-value=0.56 Score=42.59 Aligned_cols=19 Identities=32% Similarity=0.707 Sum_probs=16.4
Q ss_pred HHHHHHHHhhhhcCCCCCC
Q psy1606 215 VATTVAHEMGHNLGMEHDT 233 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~HD~ 233 (285)
...|+||||+|.+|....+
T Consensus 196 ~P~T~~HElAHq~G~a~E~ 214 (318)
T PF12725_consen 196 LPFTICHELAHQLGFASED 214 (318)
T ss_pred ccHHHHHHHHHHhCCCCHH
Confidence 4689999999999987755
No 50
>PF11350 DUF3152: Protein of unknown function (DUF3152); InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=83.64 E-value=0.91 Score=38.10 Aligned_cols=42 Identities=31% Similarity=0.612 Sum_probs=26.4
Q ss_pred HHHHHHHHhhhhcCCCCCCCCCCCCC---CCcccCCCCCCCCCCccCccCHHHH
Q psy1606 215 VATTVAHEMGHNLGMEHDTTECTCPS---DRCIMAPSSSSVSPTEWSSCSLEYL 265 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~HD~~~c~C~~---~~cIM~~~~~~~~~~~fS~CS~~~~ 265 (285)
-..+|=||+||-||-.|. .|+. .-=||-.. .+..++|-+..|
T Consensus 139 RqYvINHEVGH~LGh~H~----~Cpg~G~~APVMmQQ-----T~gL~g~~~~~~ 183 (203)
T PF11350_consen 139 RQYVINHEVGHALGHGHE----PCPGPGRPAPVMMQQ-----TLGLDGCELAKL 183 (203)
T ss_pred HHHhhhhhhhhhcccCCC----cCCCCCCcCCcchhh-----hcccCCcccccc
Confidence 368999999999997774 3554 23566554 233445544444
No 51
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=83.54 E-value=1.1 Score=33.59 Aligned_cols=18 Identities=39% Similarity=0.368 Sum_probs=13.5
Q ss_pred CchhHHHHHHHHHhhhhc
Q psy1606 210 NVVGLVATTVAHEMGHNL 227 (285)
Q Consensus 210 ~~~~~~a~~~AHElGH~l 227 (285)
.+...-..++||||||.+
T Consensus 37 ~~~~~~~f~laHELgH~~ 54 (122)
T PF06114_consen 37 LSPERQRFTLAHELGHIL 54 (122)
T ss_dssp S-HHHHHHHHHHHHHHHH
T ss_pred CCHHHHHHHHHHHHHHHH
Confidence 334455689999999987
No 52
>KOG4525|consensus
Probab=82.89 E-value=0.75 Score=42.86 Aligned_cols=19 Identities=42% Similarity=0.760 Sum_probs=16.2
Q ss_pred HHHHHHhhhhcCCCCCCCC
Q psy1606 217 TTVAHEMGHNLGMEHDTTE 235 (285)
Q Consensus 217 ~~~AHElGH~lG~~HD~~~ 235 (285)
-.|-|||||.||++|...+
T Consensus 303 GA~~HElGH~lgcpHq~~G 321 (614)
T KOG4525|consen 303 GAVCHELGHCLGCPHQSEG 321 (614)
T ss_pred HHHHHHhhhccCCCCCCCc
Confidence 4688999999999998753
No 53
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=81.50 E-value=0.69 Score=39.44 Aligned_cols=13 Identities=54% Similarity=0.792 Sum_probs=11.4
Q ss_pred HHHHHHHHhhhhc
Q psy1606 215 VATTVAHEMGHNL 227 (285)
Q Consensus 215 ~a~~~AHElGH~l 227 (285)
-=+|+||||||.|
T Consensus 72 ~rFtlAHELGH~l 84 (213)
T COG2856 72 KRFTLAHELGHAL 84 (213)
T ss_pred HHHHHHHHHhHHH
Confidence 3479999999998
No 54
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=80.27 E-value=1.2 Score=33.60 Aligned_cols=15 Identities=27% Similarity=0.530 Sum_probs=12.3
Q ss_pred hHHHHHHHHHhhhhc
Q psy1606 213 GLVATTVAHEMGHNL 227 (285)
Q Consensus 213 ~~~a~~~AHElGH~l 227 (285)
..+..+++||++|.+
T Consensus 23 ~~~~~~l~HE~~H~~ 37 (128)
T PF13485_consen 23 DWLDRVLAHELAHQW 37 (128)
T ss_pred HHHHHHHHHHHHHHH
Confidence 344589999999998
No 55
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=79.02 E-value=1.1 Score=42.41 Aligned_cols=17 Identities=35% Similarity=0.483 Sum_probs=14.2
Q ss_pred hhHHHHHHHHHhhhhcC
Q psy1606 212 VGLVATTVAHEMGHNLG 228 (285)
Q Consensus 212 ~~~~a~~~AHElGH~lG 228 (285)
..-+|.++||||||..|
T Consensus 127 esElagViAHEigHv~q 143 (484)
T COG4783 127 ESELAGVIAHEIGHVAQ 143 (484)
T ss_pred HHHHHHHHHHHHHHHhh
Confidence 34568999999999987
No 56
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=78.53 E-value=2.2 Score=35.76 Aligned_cols=22 Identities=23% Similarity=0.306 Sum_probs=18.3
Q ss_pred hHHHHHHHHHhhhhcCCCCCCC
Q psy1606 213 GLVATTVAHEMGHNLGMEHDTT 234 (285)
Q Consensus 213 ~~~a~~~AHElGH~lG~~HD~~ 234 (285)
..+..+++|||.|..-++|...
T Consensus 162 ~~idYVvvHEL~Hl~~~nHs~~ 183 (205)
T PF01863_consen 162 EVIDYVVVHELCHLRHPNHSKR 183 (205)
T ss_pred cHHHHHHHHHHHHhccCCCCHH
Confidence 3456899999999999999753
No 57
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=78.52 E-value=0.68 Score=44.66 Aligned_cols=25 Identities=28% Similarity=0.582 Sum_probs=20.5
Q ss_pred chhHHHHHHHHHhhhhc---CCCCCCCC
Q psy1606 211 VVGLVATTVAHEMGHNL---GMEHDTTE 235 (285)
Q Consensus 211 ~~~~~a~~~AHElGH~l---G~~HD~~~ 235 (285)
++++++.+|+|||||-| |...|+.+
T Consensus 483 NYGgIGaVIgHEI~HgFDdqGakfD~~G 510 (654)
T COG3590 483 NYGGIGAVIGHEIGHGFDDQGAKFDGDG 510 (654)
T ss_pred cccCccceehhhhcccccCCccccCCCC
Confidence 56777889999999999 77777754
No 58
>PF05547 Peptidase_M6: Immune inhibitor A peptidase M6; InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=75.40 E-value=0.52 Score=46.72 Aligned_cols=34 Identities=26% Similarity=0.344 Sum_probs=22.3
Q ss_pred HHHHHHHhhhhcCCCCCCCC--CCC--C-CCCcccCCCC
Q psy1606 216 ATTVAHEMGHNLGMEHDTTE--CTC--P-SDRCIMAPSS 249 (285)
Q Consensus 216 a~~~AHElGH~lG~~HD~~~--c~C--~-~~~cIM~~~~ 249 (285)
.-++|||+||.||++---.. -.= + ..-+||+...
T Consensus 222 iGVfaHEfGH~LGLPDlYDT~~~g~~~~vg~WsLMs~GS 260 (645)
T PF05547_consen 222 IGVFAHEFGHDLGLPDLYDTDYSGGGNGVGFWSLMSSGS 260 (645)
T ss_pred eEEEEeeccccCCCCCcccCCCCCCCCCcccceecCCCC
Confidence 35899999999999863321 000 1 1358998754
No 59
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=72.86 E-value=2.1 Score=36.13 Aligned_cols=16 Identities=38% Similarity=0.522 Sum_probs=13.2
Q ss_pred hHHHHHHHHHhhhhcC
Q psy1606 213 GLVATTVAHEMGHNLG 228 (285)
Q Consensus 213 ~~~a~~~AHElGH~lG 228 (285)
.-++.++|||+||.-+
T Consensus 87 ~el~aVlaHElgH~~~ 102 (226)
T PF01435_consen 87 DELAAVLAHELGHIKH 102 (226)
T ss_dssp HHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHHc
Confidence 3467899999999984
No 60
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=67.35 E-value=3.5 Score=34.79 Aligned_cols=21 Identities=38% Similarity=0.632 Sum_probs=17.0
Q ss_pred hHHHHHHHHHhhhhcCCC-CCC
Q psy1606 213 GLVATTVAHEMGHNLGME-HDT 233 (285)
Q Consensus 213 ~~~a~~~AHElGH~lG~~-HD~ 233 (285)
..+-.|+-||+||.||+. |..
T Consensus 185 e~L~~tarhElGhaLgi~ghsd 206 (236)
T COG5549 185 ENLNPTARHELGHALGIWGHSD 206 (236)
T ss_pred hhhhHHHHHhhcchheeccccc
Confidence 345579999999999998 754
No 61
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=66.33 E-value=5.2 Score=39.50 Aligned_cols=12 Identities=50% Similarity=0.916 Sum_probs=11.0
Q ss_pred HHHHHHHhhhhc
Q psy1606 216 ATTVAHEMGHNL 227 (285)
Q Consensus 216 a~~~AHElGH~l 227 (285)
..|+|||+||.+
T Consensus 376 v~TL~HE~GHa~ 387 (587)
T TIGR02290 376 VSTLAHELGHAY 387 (587)
T ss_pred HHHHHHHhhHHH
Confidence 589999999998
No 62
>PRK03001 M48 family peptidase; Provisional
Probab=65.94 E-value=3.5 Score=36.72 Aligned_cols=15 Identities=27% Similarity=0.361 Sum_probs=12.4
Q ss_pred hHHHHHHHHHhhhhc
Q psy1606 213 GLVATTVAHEMGHNL 227 (285)
Q Consensus 213 ~~~a~~~AHElGH~l 227 (285)
.-++.++||||||.-
T Consensus 122 ~El~aVlAHElgHi~ 136 (283)
T PRK03001 122 REIRGVMAHELAHVK 136 (283)
T ss_pred HHHHHHHHHHHHHHh
Confidence 345789999999986
No 63
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=65.79 E-value=5.5 Score=39.02 Aligned_cols=12 Identities=33% Similarity=0.689 Sum_probs=11.0
Q ss_pred HHHHHHHhhhhc
Q psy1606 216 ATTVAHEMGHNL 227 (285)
Q Consensus 216 a~~~AHElGH~l 227 (285)
..|+|||+||.+
T Consensus 338 v~TL~HElGHa~ 349 (549)
T TIGR02289 338 IDVLTHEAGHAF 349 (549)
T ss_pred HHHHHHHhhHHH
Confidence 589999999998
No 64
>PRK03982 heat shock protein HtpX; Provisional
Probab=65.12 E-value=3.7 Score=36.66 Aligned_cols=15 Identities=27% Similarity=0.355 Sum_probs=12.4
Q ss_pred hHHHHHHHHHhhhhc
Q psy1606 213 GLVATTVAHEMGHNL 227 (285)
Q Consensus 213 ~~~a~~~AHElGH~l 227 (285)
.-.+.++|||+||.-
T Consensus 123 ~El~AVlAHElgHi~ 137 (288)
T PRK03982 123 DELEGVIAHELTHIK 137 (288)
T ss_pred HHHHHHHHHHHHHHH
Confidence 345789999999986
No 65
>PRK01345 heat shock protein HtpX; Provisional
Probab=63.74 E-value=4.1 Score=36.96 Aligned_cols=15 Identities=40% Similarity=0.439 Sum_probs=12.6
Q ss_pred hHHHHHHHHHhhhhc
Q psy1606 213 GLVATTVAHEMGHNL 227 (285)
Q Consensus 213 ~~~a~~~AHElGH~l 227 (285)
.-++.++||||||.-
T Consensus 122 dEL~aVlAHElgHi~ 136 (317)
T PRK01345 122 EEVAGVMAHELAHVK 136 (317)
T ss_pred HHHHHHHHHHHHHHH
Confidence 346789999999998
No 66
>PRK03072 heat shock protein HtpX; Provisional
Probab=63.23 E-value=4.2 Score=36.35 Aligned_cols=16 Identities=19% Similarity=0.326 Sum_probs=12.7
Q ss_pred hHHHHHHHHHhhhhcC
Q psy1606 213 GLVATTVAHEMGHNLG 228 (285)
Q Consensus 213 ~~~a~~~AHElGH~lG 228 (285)
.-+..++|||+||.-.
T Consensus 125 ~El~aVlAHElgHi~~ 140 (288)
T PRK03072 125 RELRGVLGHELSHVYN 140 (288)
T ss_pred HHHHHHHHHHHHHHhc
Confidence 3457899999999863
No 67
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=61.71 E-value=5.1 Score=35.94 Aligned_cols=13 Identities=46% Similarity=0.647 Sum_probs=10.6
Q ss_pred HHHHHHHHHhhhh
Q psy1606 214 LVATTVAHEMGHN 226 (285)
Q Consensus 214 ~~a~~~AHElGH~ 226 (285)
..+.+||||.||-
T Consensus 169 a~ayVlAHEyGHH 181 (292)
T PF04228_consen 169 AQAYVLAHEYGHH 181 (292)
T ss_pred HHHHHHHHHHHHH
Confidence 3567999999994
No 68
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=61.53 E-value=4.8 Score=35.75 Aligned_cols=14 Identities=36% Similarity=0.570 Sum_probs=12.0
Q ss_pred HHHHHHHHHhhhhc
Q psy1606 214 LVATTVAHEMGHNL 227 (285)
Q Consensus 214 ~~a~~~AHElGH~l 227 (285)
-+..++|||+||..
T Consensus 156 El~aVlaHElgHi~ 169 (302)
T COG0501 156 ELEAVLAHELGHIK 169 (302)
T ss_pred HHHHHHHHHHHHHh
Confidence 45789999999986
No 69
>PF01433 Peptidase_M1: Peptidase family M1 This is family M1 in the peptidase classification.; InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=60.35 E-value=17 Score=33.50 Aligned_cols=14 Identities=36% Similarity=0.650 Sum_probs=11.8
Q ss_pred HHHHHHHHHhhhhc
Q psy1606 214 LVATTVAHEMGHNL 227 (285)
Q Consensus 214 ~~a~~~AHElGH~l 227 (285)
..+.++||||+|..
T Consensus 294 ~~~~~iahElahqW 307 (390)
T PF01433_consen 294 EIASLIAHELAHQW 307 (390)
T ss_dssp HHHHHHHHHHHTTT
T ss_pred hhHHHHHHHHHHHH
Confidence 45789999999995
No 70
>PRK05457 heat shock protein HtpX; Provisional
Probab=59.86 E-value=5.3 Score=35.67 Aligned_cols=15 Identities=33% Similarity=0.446 Sum_probs=12.2
Q ss_pred hHHHHHHHHHhhhhc
Q psy1606 213 GLVATTVAHEMGHNL 227 (285)
Q Consensus 213 ~~~a~~~AHElGH~l 227 (285)
.-...++|||+||.-
T Consensus 132 ~El~aVlAHElgHi~ 146 (284)
T PRK05457 132 DEVEAVLAHEISHIA 146 (284)
T ss_pred HHHHHHHHHHHHHHH
Confidence 345789999999975
No 71
>PRK04897 heat shock protein HtpX; Provisional
Probab=59.73 E-value=5.3 Score=35.90 Aligned_cols=15 Identities=20% Similarity=0.317 Sum_probs=12.4
Q ss_pred hHHHHHHHHHhhhhc
Q psy1606 213 GLVATTVAHEMGHNL 227 (285)
Q Consensus 213 ~~~a~~~AHElGH~l 227 (285)
.-...++|||+||.-
T Consensus 135 ~El~aVlAHElgHi~ 149 (298)
T PRK04897 135 EELEGVIGHEISHIR 149 (298)
T ss_pred HHHHHHHHHHHHHHh
Confidence 345689999999986
No 72
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=59.24 E-value=9.1 Score=37.97 Aligned_cols=12 Identities=50% Similarity=0.897 Sum_probs=11.0
Q ss_pred HHHHHHHhhhhc
Q psy1606 216 ATTVAHEMGHNL 227 (285)
Q Consensus 216 a~~~AHElGH~l 227 (285)
..|+|||+||.+
T Consensus 381 V~TLaHElGHs~ 392 (598)
T COG1164 381 VFTLAHELGHSV 392 (598)
T ss_pred HHHHHHHccHHH
Confidence 489999999998
No 73
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=58.68 E-value=4.5 Score=39.89 Aligned_cols=13 Identities=46% Similarity=0.828 Sum_probs=11.4
Q ss_pred HHHHHHHHhhhhc
Q psy1606 215 VATTVAHEMGHNL 227 (285)
Q Consensus 215 ~a~~~AHElGH~l 227 (285)
-..|+|||+||.+
T Consensus 378 dv~TLaHElGHa~ 390 (591)
T TIGR00181 378 SVFTLAHELGHSM 390 (591)
T ss_pred hHHHHHHHhhhHH
Confidence 3589999999997
No 74
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=57.88 E-value=5.3 Score=32.11 Aligned_cols=12 Identities=42% Similarity=0.670 Sum_probs=10.0
Q ss_pred HHHHHHHhhhhc
Q psy1606 216 ATTVAHEMGHNL 227 (285)
Q Consensus 216 a~~~AHElGH~l 227 (285)
..++||||+|-+
T Consensus 136 lDVvaHEltHGV 147 (150)
T PF01447_consen 136 LDVVAHELTHGV 147 (150)
T ss_dssp HHHHHHHHHHHH
T ss_pred cceeeecccccc
Confidence 479999999953
No 75
>KOG2719|consensus
Probab=57.45 E-value=5.6 Score=37.35 Aligned_cols=39 Identities=23% Similarity=0.148 Sum_probs=22.6
Q ss_pred ceeccccCCCCcccccCceeecCC------CchhHHHHHHHHHhhhh
Q psy1606 186 VVGKALKGPICTYEFSGGVNVDHK------NVVGLVATTVAHEMGHN 226 (285)
Q Consensus 186 ~~G~A~~g~~C~~~~~~~v~~~~~------~~~~~~a~~~AHElGH~ 226 (285)
.---||.-|.|...+ -+..|.= -+-..++.++|||+||-
T Consensus 247 ~hsNAyfyG~~~~KR--IvIyDtLl~~~~~~~~eel~AVl~HELGHW 291 (428)
T KOG2719|consen 247 SHSNAYFYGLCKNKR--IVIYDTLLLEEEHLNNEELVAVLAHELGHW 291 (428)
T ss_pred CCCCeeeeeccccce--EEEehhhhhhhhccccHHHHHHHHHHhhHH
Confidence 345678878887542 2222210 11233567999999995
No 76
>PRK01265 heat shock protein HtpX; Provisional
Probab=56.82 E-value=6.4 Score=35.87 Aligned_cols=14 Identities=29% Similarity=0.501 Sum_probs=11.4
Q ss_pred HHHHHHHHHhhhhc
Q psy1606 214 LVATTVAHEMGHNL 227 (285)
Q Consensus 214 ~~a~~~AHElGH~l 227 (285)
-+..++||||||.-
T Consensus 139 El~aVlAHElgHik 152 (324)
T PRK01265 139 EIKAVAGHELGHLK 152 (324)
T ss_pred HHHHHHHHHHHHHH
Confidence 35689999999973
No 77
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=56.81 E-value=7.9 Score=33.16 Aligned_cols=12 Identities=42% Similarity=0.819 Sum_probs=10.5
Q ss_pred HHHHHHHhhhhc
Q psy1606 216 ATTVAHEMGHNL 227 (285)
Q Consensus 216 a~~~AHElGH~l 227 (285)
..++|||.||-+
T Consensus 90 vaVAAHEvGHAi 101 (222)
T PF04298_consen 90 VAVAAHEVGHAI 101 (222)
T ss_pred HHHHHHHHhHHH
Confidence 469999999987
No 78
>PRK02870 heat shock protein HtpX; Provisional
Probab=56.64 E-value=6.4 Score=36.05 Aligned_cols=14 Identities=29% Similarity=0.465 Sum_probs=11.7
Q ss_pred hHHHHHHHHHhhhh
Q psy1606 213 GLVATTVAHEMGHN 226 (285)
Q Consensus 213 ~~~a~~~AHElGH~ 226 (285)
.-++.++||||||.
T Consensus 171 dEL~aVlAHELgHi 184 (336)
T PRK02870 171 DELQAVMAHELSHI 184 (336)
T ss_pred HHHHHHHHHHHHHH
Confidence 34578999999998
No 79
>PRK02391 heat shock protein HtpX; Provisional
Probab=56.59 E-value=6.5 Score=35.34 Aligned_cols=15 Identities=27% Similarity=0.410 Sum_probs=11.9
Q ss_pred hHHHHHHHHHhhhhc
Q psy1606 213 GLVATTVAHEMGHNL 227 (285)
Q Consensus 213 ~~~a~~~AHElGH~l 227 (285)
.-...++|||+||.-
T Consensus 131 ~El~aVlaHElgHi~ 145 (296)
T PRK02391 131 DELEAVLAHELSHVK 145 (296)
T ss_pred HHHHHHHHHHHHHHH
Confidence 345689999999965
No 80
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=55.18 E-value=5.8 Score=28.14 Aligned_cols=11 Identities=36% Similarity=0.667 Sum_probs=9.8
Q ss_pred HHHHHHhhhhc
Q psy1606 217 TTVAHEMGHNL 227 (285)
Q Consensus 217 ~~~AHElGH~l 227 (285)
.++||||.|.+
T Consensus 63 ~llaHEl~Hv~ 73 (79)
T PF13699_consen 63 ALLAHELAHVV 73 (79)
T ss_pred hhHhHHHHHHH
Confidence 68999999985
No 81
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=54.45 E-value=9.1 Score=32.90 Aligned_cols=20 Identities=25% Similarity=0.423 Sum_probs=17.0
Q ss_pred HHHHHHHHHhhhhcCCCCCC
Q psy1606 214 LVATTVAHEMGHNLGMEHDT 233 (285)
Q Consensus 214 ~~a~~~AHElGH~lG~~HD~ 233 (285)
..-.+++|||+|+.=++|-.
T Consensus 174 ~i~YVvvHELaHLke~nHs~ 193 (223)
T COG1451 174 VIDYVVVHELAHLKEKNHSK 193 (223)
T ss_pred HHHHHHHHHHHHHhhhhccH
Confidence 34689999999999999964
No 82
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=53.99 E-value=8 Score=25.92 Aligned_cols=18 Identities=22% Similarity=0.525 Sum_probs=15.6
Q ss_pred HHHHHHhhhhcCCCCCCC
Q psy1606 217 TTVAHEMGHNLGMEHDTT 234 (285)
Q Consensus 217 ~~~AHElGH~lG~~HD~~ 234 (285)
..+.|.++|.||+.|-..
T Consensus 31 Rriih~la~~lGL~~~s~ 48 (60)
T cd02639 31 RRIVHLLASRLGLNHVSD 48 (60)
T ss_pred HHHHHHHHHHcCCceEEe
Confidence 678999999999999654
No 83
>PF08325 WLM: WLM domain; InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=53.02 E-value=11 Score=31.38 Aligned_cols=23 Identities=30% Similarity=0.442 Sum_probs=19.2
Q ss_pred hhHHHHHHHHHhhhhcCCCCCCC
Q psy1606 212 VGLVATTVAHEMGHNLGMEHDTT 234 (285)
Q Consensus 212 ~~~~a~~~AHElGH~lG~~HD~~ 234 (285)
+..+-.||-|||.|+.=.+||..
T Consensus 79 ~~~i~~t~lHELaH~~~~~H~~~ 101 (186)
T PF08325_consen 79 YETILGTMLHELAHNVHGPHDDK 101 (186)
T ss_pred HHHHHHHHHHHHHhcccCCccHH
Confidence 34456899999999999999974
No 84
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=52.51 E-value=7.8 Score=37.74 Aligned_cols=59 Identities=24% Similarity=0.215 Sum_probs=26.0
Q ss_pred CCCceeeeeeeecCCCceeccccCCCCccc----ccCceeecCCC-----chhHHHHHHHHHhhhhcCCCC
Q psy1606 170 PNDNAQLLTGMTFSDGVVGKALKGPICTYE----FSGGVNVDHKN-----VVGLVATTVAHEMGHNLGMEH 231 (285)
Q Consensus 170 ~~D~a~llt~~~~~~~~~G~A~~g~~C~~~----~~~~v~~~~~~-----~~~~~a~~~AHElGH~lG~~H 231 (285)
..|+.+.++.++...++ +||- +.|... --.|+...... .......+++|||+|.||..-
T Consensus 159 ~~D~vlyV~~~~~~~~~--~A~A-~~C~~~~~~RP~~G~in~~p~~i~~~~~~~~~~~~~HEi~HaLGFs~ 226 (521)
T PF01457_consen 159 NADFVLYVTARPSSSST--LAWA-APCQQDSDGRPIAGVININPSYIPSFYFQEFFRTVIHEIAHALGFSS 226 (521)
T ss_dssp S-SEEEEEE----STT-----EE-EEEEE-TTS-EEEEEEE--GGG---S--HHHHHHHHHHHHHHTT-SH
T ss_pred CccEEEEEEEeeccCCc--eEEE-eehhhccCCCCeeEEEEEchhHccchhhhcccceeeeeeeeeeeecc
Confidence 57888888876653322 3322 334321 12233322111 122345799999999999755
No 85
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=52.34 E-value=7.2 Score=37.42 Aligned_cols=13 Identities=38% Similarity=0.585 Sum_probs=11.5
Q ss_pred HHHHHHHHhhhhc
Q psy1606 215 VATTVAHEMGHNL 227 (285)
Q Consensus 215 ~a~~~AHElGH~l 227 (285)
-..|++||+||.+
T Consensus 263 ~V~TLfHEfGHal 275 (472)
T cd06455 263 EVETFFHEFGHVI 275 (472)
T ss_pred HHHHHHHHHHHHH
Confidence 4689999999988
No 86
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=51.74 E-value=7.5 Score=36.97 Aligned_cols=12 Identities=42% Similarity=0.786 Sum_probs=11.0
Q ss_pred HHHHHHHhhhhc
Q psy1606 216 ATTVAHEMGHNL 227 (285)
Q Consensus 216 a~~~AHElGH~l 227 (285)
..|++||+||.+
T Consensus 243 v~tLfHE~GHa~ 254 (458)
T PF01432_consen 243 VETLFHEFGHAM 254 (458)
T ss_dssp HHHHHHHHHHHH
T ss_pred HHHHHHHHhHHH
Confidence 589999999987
No 87
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=51.66 E-value=27 Score=36.30 Aligned_cols=14 Identities=36% Similarity=0.506 Sum_probs=12.0
Q ss_pred HHHHHHHHHhhhhc
Q psy1606 214 LVATTVAHEMGHNL 227 (285)
Q Consensus 214 ~~a~~~AHElGH~l 227 (285)
.++.++|||++|+.
T Consensus 282 ~i~~VIaHElaHqW 295 (863)
T TIGR02414 282 RIESVIAHEYFHNW 295 (863)
T ss_pred HHHHHHHHHHHHHH
Confidence 35689999999997
No 88
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=51.58 E-value=13 Score=28.20 Aligned_cols=35 Identities=26% Similarity=0.448 Sum_probs=20.4
Q ss_pred CCCCcccccCceeecCCCchhHHHHHHHHHhhhhcC
Q psy1606 193 GPICTYEFSGGVNVDHKNVVGLVATTVAHEMGHNLG 228 (285)
Q Consensus 193 g~~C~~~~~~~v~~~~~~~~~~~a~~~AHElGH~lG 228 (285)
+..|. .+.+-|.-|....--+--.|+|||+.|..-
T Consensus 58 ~~LCG-~~~~~i~IDP~~~~KGC~~TL~HEL~H~WQ 92 (141)
T PHA02456 58 QDLCG-QFVGWIEIDPDYANKGCRDTLAHELNHAWQ 92 (141)
T ss_pred cchhh-cceeEEEECCcccccchHHHHHHHHHHHHh
Confidence 44565 334444444443223345799999999863
No 89
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=51.34 E-value=7.3 Score=36.45 Aligned_cols=12 Identities=50% Similarity=0.869 Sum_probs=10.5
Q ss_pred HHHHHHHhhhhc
Q psy1606 216 ATTVAHEMGHNL 227 (285)
Q Consensus 216 a~~~AHElGH~l 227 (285)
..|++||+||.+
T Consensus 223 v~tl~HE~GHa~ 234 (427)
T cd06459 223 VFTLAHELGHAF 234 (427)
T ss_pred HHHHHHHhhHHH
Confidence 589999999975
No 90
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=50.63 E-value=10 Score=31.95 Aligned_cols=16 Identities=38% Similarity=0.605 Sum_probs=12.6
Q ss_pred HHHHHHHHhhhhcCCC
Q psy1606 215 VATTVAHEMGHNLGME 230 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~ 230 (285)
...|+.||+||.+-.-
T Consensus 22 ~l~t~~HE~gHal~a~ 37 (200)
T PF13398_consen 22 LLVTFVHELGHALAAL 37 (200)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4579999999988543
No 91
>PRK14015 pepN aminopeptidase N; Provisional
Probab=50.05 E-value=34 Score=35.59 Aligned_cols=14 Identities=36% Similarity=0.506 Sum_probs=12.0
Q ss_pred HHHHHHHHHhhhhc
Q psy1606 214 LVATTVAHEMGHNL 227 (285)
Q Consensus 214 ~~a~~~AHElGH~l 227 (285)
.++.++|||++|+.
T Consensus 295 ~i~~vIaHElaHqW 308 (875)
T PRK14015 295 RIESVIAHEYFHNW 308 (875)
T ss_pred HHHHHHHHHHHHHH
Confidence 35689999999987
No 92
>PF08219 TOM13: Outer membrane protein TOM13; InterPro: IPR013262 The TOM13 family of proteins are mitochondrial outer membrane proteins that mediate the assembly of beta-barrel proteins [].; GO: 0005741 mitochondrial outer membrane
Probab=49.91 E-value=13 Score=26.02 Aligned_cols=18 Identities=28% Similarity=0.289 Sum_probs=14.9
Q ss_pred hHHHHHHHHHhhhhcCCC
Q psy1606 213 GLVATTVAHEMGHNLGME 230 (285)
Q Consensus 213 ~~~a~~~AHElGH~lG~~ 230 (285)
.+++-++|||+++-+|-.
T Consensus 50 LGFGEl~AhE~~fr~gW~ 67 (77)
T PF08219_consen 50 LGFGELFAHEIAFRLGWS 67 (77)
T ss_pred HhHHHHHHHHHHHHhccC
Confidence 446789999999999873
No 93
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=49.49 E-value=24 Score=34.98 Aligned_cols=58 Identities=21% Similarity=0.162 Sum_probs=31.3
Q ss_pred CCCceeeeeeeecCCCceeccccCCCCcccc----cCceeecCCC-----chhHHHHHHHHHhhhhcCCC
Q psy1606 170 PNDNAQLLTGMTFSDGVVGKALKGPICTYEF----SGGVNVDHKN-----VVGLVATTVAHEMGHNLGME 230 (285)
Q Consensus 170 ~~D~a~llt~~~~~~~~~G~A~~g~~C~~~~----~~~v~~~~~~-----~~~~~a~~~AHElGH~lG~~ 230 (285)
..|+.+.++.+.... +.....+.|.... ..|+...... .......+++|||.|.||..
T Consensus 205 naD~vLYVta~P~~~---~~lA~A~~C~~~~d~RPi~G~iNinp~~i~s~~~~~~~rv~~HEi~HALGFS 271 (622)
T PTZ00257 205 NADFVLYVASVPSEP---GVLAWATTCQVFSDDHPAVGVMNIPAANIVSRYDQGTTRTVTHEVAHALGFS 271 (622)
T ss_pred ccCEEEEEEEecCCC---CeeEEEEEeccccCCCceEEEEeeCHHHCCCccchHHHHHHHHHHHHHhcCC
Confidence 578888888765532 2222334454321 1232221110 01123579999999999984
No 94
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=47.81 E-value=17 Score=30.80 Aligned_cols=26 Identities=23% Similarity=0.061 Sum_probs=17.5
Q ss_pred ceeecCCCchhHHHHHHHHHhhhhcC
Q psy1606 203 GVNVDHKNVVGLVATTVAHEMGHNLG 228 (285)
Q Consensus 203 ~v~~~~~~~~~~~a~~~AHElGH~lG 228 (285)
+|....+-.-..++.++|||++|.+=
T Consensus 81 ~Ilvl~GLPrll~gsiLAHE~mHa~L 106 (212)
T PF12315_consen 81 AILVLYGLPRLLTGSILAHELMHAWL 106 (212)
T ss_pred EEEEECCCCHHHHhhHHHHHHHHHHh
Confidence 34444444445567899999999873
No 95
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=46.86 E-value=12 Score=34.21 Aligned_cols=18 Identities=50% Similarity=0.693 Sum_probs=13.7
Q ss_pred hHHHHHHHHHhhhhcCCCC
Q psy1606 213 GLVATTVAHEMGHNLGMEH 231 (285)
Q Consensus 213 ~~~a~~~AHElGH~lG~~H 231 (285)
.-+|.+++|||||.- ++|
T Consensus 122 sEvAAVl~HEmgHVt-AnH 139 (479)
T COG4784 122 SEVAAVLAHEMGHVT-ANH 139 (479)
T ss_pred HHHHHHHHhhhhhee-cch
Confidence 346889999999975 344
No 96
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=46.78 E-value=12 Score=33.14 Aligned_cols=14 Identities=43% Similarity=0.551 Sum_probs=11.2
Q ss_pred hHHHHHHHHHhhhh
Q psy1606 213 GLVATTVAHEMGHN 226 (285)
Q Consensus 213 ~~~a~~~AHElGH~ 226 (285)
+..+.++|||+||-
T Consensus 165 fAqaYViAHEVGHH 178 (295)
T COG2321 165 FAQAYVIAHEVGHH 178 (295)
T ss_pred HHHHHHHHhhhhHH
Confidence 34579999999984
No 97
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=46.24 E-value=21 Score=34.40 Aligned_cols=12 Identities=58% Similarity=0.645 Sum_probs=10.8
Q ss_pred HHHHHHhhhhcC
Q psy1606 217 TTVAHEMGHNLG 228 (285)
Q Consensus 217 ~~~AHElGH~lG 228 (285)
.|+.||+||.+.
T Consensus 249 ~t~~HE~GH~~y 260 (477)
T cd06461 249 VTVHHEMGHIQY 260 (477)
T ss_pred HHHHHHHHHHHH
Confidence 789999999994
No 98
>PF13402 M60-like: Peptidase M60-like family; PDB: 4FCA_A.
Probab=44.48 E-value=9.7 Score=34.00 Aligned_cols=14 Identities=36% Similarity=0.337 Sum_probs=10.6
Q ss_pred HHHHHHhhhhcCCC
Q psy1606 217 TTVAHEMGHNLGME 230 (285)
Q Consensus 217 ~~~AHElGH~lG~~ 230 (285)
--++||+||++=..
T Consensus 221 WG~~HE~GH~~Q~~ 234 (307)
T PF13402_consen 221 WGPWHELGHNHQQG 234 (307)
T ss_dssp HHHHHHHHHHH-BG
T ss_pred eeehhhhhhhcCcc
Confidence 46899999999544
No 99
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=44.07 E-value=12 Score=35.88 Aligned_cols=13 Identities=46% Similarity=0.746 Sum_probs=11.4
Q ss_pred HHHHHHHHhhhhc
Q psy1606 215 VATTVAHEMGHNL 227 (285)
Q Consensus 215 ~a~~~AHElGH~l 227 (285)
-+.|++||+||.+
T Consensus 248 ~v~TLfHEfGHal 260 (458)
T cd06457 248 EVETLFHEMGHAM 260 (458)
T ss_pred HHHHHHHHHhHHH
Confidence 3689999999988
No 100
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=42.11 E-value=17 Score=30.46 Aligned_cols=18 Identities=33% Similarity=0.547 Sum_probs=13.8
Q ss_pred HHHHHHHhhhhcCCCCCC
Q psy1606 216 ATTVAHEMGHNLGMEHDT 233 (285)
Q Consensus 216 a~~~AHElGH~lG~~HD~ 233 (285)
..++|||+||-+-=.||-
T Consensus 93 ~aVAAHEVGHAiQd~~~Y 110 (226)
T COG2738 93 IAVAAHEVGHAIQDQEDY 110 (226)
T ss_pred HHHHHHHhhHHHhhhccc
Confidence 468999999998655543
No 101
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=39.50 E-value=19 Score=29.63 Aligned_cols=19 Identities=32% Similarity=0.478 Sum_probs=14.8
Q ss_pred HHHHHHHHhhhhcCCCCCC
Q psy1606 215 VATTVAHEMGHNLGMEHDT 233 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~HD~ 233 (285)
...++.||+||.+-+..=+
T Consensus 7 ~i~i~~HE~gH~~~a~~~G 25 (192)
T PF02163_consen 7 LISIVLHELGHALAARLYG 25 (192)
T ss_dssp HHHHHHHHHHHHHHHHTTT
T ss_pred ccccccccccccccccccc
Confidence 4578999999998766643
No 102
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=39.33 E-value=18 Score=24.18 Aligned_cols=20 Identities=30% Similarity=0.667 Sum_probs=16.8
Q ss_pred HHHHHHHhhhhcCCCCCCCC
Q psy1606 216 ATTVAHEMGHNLGMEHDTTE 235 (285)
Q Consensus 216 a~~~AHElGH~lG~~HD~~~ 235 (285)
--.+.|+|++.+|+.|...+
T Consensus 30 eR~~vH~lA~~~gL~s~S~G 49 (60)
T cd02641 30 DRLLVHELAEELGLRHESTG 49 (60)
T ss_pred HHHHHHHHHHHcCCceEeeC
Confidence 36789999999999997654
No 103
>PTZ00337 surface protease GP63; Provisional
Probab=38.86 E-value=16 Score=35.98 Aligned_cols=17 Identities=41% Similarity=0.688 Sum_probs=14.3
Q ss_pred HHHHHHHHhhhhcCCCC
Q psy1606 215 VATTVAHEMGHNLGMEH 231 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~H 231 (285)
...+++|||.|.||...
T Consensus 229 ~~~v~~HEi~HALGFs~ 245 (567)
T PTZ00337 229 DVRVAAHELGHALGFVR 245 (567)
T ss_pred HHHHHHHHHHHHHccCH
Confidence 34799999999999854
No 104
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=37.76 E-value=16 Score=33.46 Aligned_cols=12 Identities=42% Similarity=0.750 Sum_probs=10.8
Q ss_pred HHHHHHHhhhhc
Q psy1606 216 ATTVAHEMGHNL 227 (285)
Q Consensus 216 a~~~AHElGH~l 227 (285)
..|++||+||.+
T Consensus 155 v~tl~HE~GHa~ 166 (365)
T cd06258 155 INTLFHEFGHAV 166 (365)
T ss_pred HHHHHHHHhHHH
Confidence 579999999988
No 105
>KOG2661|consensus
Probab=37.40 E-value=19 Score=32.71 Aligned_cols=15 Identities=33% Similarity=0.552 Sum_probs=12.9
Q ss_pred hHHHHHHHHHhhhhc
Q psy1606 213 GLVATTVAHEMGHNL 227 (285)
Q Consensus 213 ~~~a~~~AHElGH~l 227 (285)
..+|.++|||+||.+
T Consensus 273 dglAtvLgHE~aHaV 287 (424)
T KOG2661|consen 273 DGLATVLGHEIAHAV 287 (424)
T ss_pred HHHHHHHHHHHHHHH
Confidence 457899999999986
No 106
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=36.98 E-value=22 Score=33.30 Aligned_cols=20 Identities=30% Similarity=0.429 Sum_probs=14.3
Q ss_pred chhHHHHHHHHHhhhhc---CCC
Q psy1606 211 VVGLVATTVAHEMGHNL---GME 230 (285)
Q Consensus 211 ~~~~~a~~~AHElGH~l---G~~ 230 (285)
++.....+++||+||.+ |++
T Consensus 155 d~~~~l~t~iHE~GHalye~~l~ 177 (396)
T cd06460 155 DFRSALFSTIHETGHALYEQGLP 177 (396)
T ss_pred chHHHHHHHHHHhhHHHHHhcCC
Confidence 34444579999999987 554
No 107
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=36.00 E-value=24 Score=29.30 Aligned_cols=14 Identities=36% Similarity=0.401 Sum_probs=11.2
Q ss_pred HHHHHHHHhhhhcC
Q psy1606 215 VATTVAHEMGHNLG 228 (285)
Q Consensus 215 ~a~~~AHElGH~lG 228 (285)
...++.||+||-+=
T Consensus 9 ~~~v~iHElGH~~~ 22 (182)
T cd06163 9 GILIFVHELGHFLV 22 (182)
T ss_pred HHHHHHHHHHHHHH
Confidence 35789999999763
No 108
>KOG1047|consensus
Probab=34.48 E-value=24 Score=34.34 Aligned_cols=12 Identities=33% Similarity=0.678 Sum_probs=11.1
Q ss_pred HHHHHHHhhhhc
Q psy1606 216 ATTVAHEMGHNL 227 (285)
Q Consensus 216 a~~~AHElGH~l 227 (285)
+.+|||||+|+.
T Consensus 289 ~~vIaHEIAHSW 300 (613)
T KOG1047|consen 289 VDVIAHEIAHSW 300 (613)
T ss_pred hhHHHHHhhhhh
Confidence 689999999987
No 109
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=34.44 E-value=26 Score=28.61 Aligned_cols=17 Identities=29% Similarity=0.556 Sum_probs=13.3
Q ss_pred HHHHHHHHhhhhcCCCC
Q psy1606 215 VATTVAHEMGHNLGMEH 231 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~H 231 (285)
...++.||+||-+=+..
T Consensus 8 ~i~i~iHE~gH~~~A~~ 24 (180)
T cd05709 8 LISVTVHELGHALVARR 24 (180)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 45789999999985544
No 110
>PF14891 Peptidase_M91: Effector protein
Probab=34.16 E-value=18 Score=29.59 Aligned_cols=21 Identities=19% Similarity=0.127 Sum_probs=16.4
Q ss_pred HHHHHHHHHhhhhcCCCCCCC
Q psy1606 214 LVATTVAHEMGHNLGMEHDTT 234 (285)
Q Consensus 214 ~~a~~~AHElGH~lG~~HD~~ 234 (285)
.-+++++|||+|..-..++..
T Consensus 102 ~p~v~L~HEL~HA~~~~~Gt~ 122 (174)
T PF14891_consen 102 PPFVVLYHELIHAYDYMNGTM 122 (174)
T ss_pred HHHHHHHHHHHHHHHHHCCCC
Confidence 346899999999987776654
No 111
>PF00480 ROK: ROK family; InterPro: IPR000600 A family of bacterial proteins has been described which groups transcriptional repressors, sugar kinases and yet uncharacterised open reading frames []. This family, known as ROK (Repressor, ORF, Kinase) includes the xylose operon repressor, xylR, from Bacillus subtilis, Lactobacillus pentosus and Staphylococcus xylosus; N-acetylglucosamine repressor, nagC, from Escherichia coli; glucokinase 2.7.1.2 from EC from Streptomyces coelicolor; fructokinase 2.7.1.4 from EC from Pediococcus pentosaceus, Streptococcus mutans and Zymomonas mobilis; allokinase 2.7.1.55 from EC and mlc from E. coli; and E. coli hypothetical proteins yajF and yhcI and the corresponding Haemophilus influenzae proteins. The repressor proteins (xylR and nagC) from this family possess an N-terminal region not present in the sugar kinases and which contains an helix-turn-helix DNA-binding motif.; PDB: 2GUP_A 3LM2_B 3EO3_A 2YHY_A 2YHW_A 2YI1_A 3MCP_A 1Z05_A 3HTV_A 3OHR_A ....
Probab=33.14 E-value=19 Score=29.11 Aligned_cols=28 Identities=25% Similarity=0.593 Sum_probs=19.4
Q ss_pred HHHHHHhhhhcCCCCCCCCCCCCCCCccc
Q psy1606 217 TTVAHEMGHNLGMEHDTTECTCPSDRCIM 245 (285)
Q Consensus 217 ~~~AHElGH~lG~~HD~~~c~C~~~~cIM 245 (285)
.-.|-|+||.. +..++..|.|+..+|+-
T Consensus 144 ~~~aGeigh~~-~~~~~~~c~cG~~GClE 171 (179)
T PF00480_consen 144 NGFAGEIGHMP-VDPNGEPCYCGNRGCLE 171 (179)
T ss_dssp TS-TTGGGGSB-SSTTSSB-TTSSBSBHH
T ss_pred Cccccceeeee-ccCCCCcCCCCCcCcHH
Confidence 34677899877 45556789999989974
No 112
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=33.10 E-value=28 Score=29.97 Aligned_cols=19 Identities=37% Similarity=0.574 Sum_probs=14.3
Q ss_pred HHHHHHHHHhhhhcCCCCC
Q psy1606 214 LVATTVAHEMGHNLGMEHD 232 (285)
Q Consensus 214 ~~a~~~AHElGH~lG~~HD 232 (285)
.+..++.||+||.+=+.+=
T Consensus 52 l~~~v~iHElgH~~~A~~~ 70 (227)
T cd06164 52 LFASVLLHELGHSLVARRY 70 (227)
T ss_pred HHHHHHHHHHHHHHHHHHc
Confidence 3467899999999855553
No 113
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=32.77 E-value=23 Score=33.56 Aligned_cols=13 Identities=54% Similarity=0.774 Sum_probs=11.4
Q ss_pred HHHHHHHHhhhhc
Q psy1606 215 VATTVAHEMGHNL 227 (285)
Q Consensus 215 ~a~~~AHElGH~l 227 (285)
-..|++||+||.|
T Consensus 208 ~v~tLfHEfGHal 220 (422)
T cd06456 208 EVTTLFHEFGHAL 220 (422)
T ss_pred HHHHHHHHHHHHH
Confidence 3589999999998
No 114
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=32.66 E-value=29 Score=29.37 Aligned_cols=19 Identities=32% Similarity=0.490 Sum_probs=14.3
Q ss_pred HHHHHHHHHhhhhcCCCCC
Q psy1606 214 LVATTVAHEMGHNLGMEHD 232 (285)
Q Consensus 214 ~~a~~~AHElGH~lG~~HD 232 (285)
....++.||+||.+=+.+=
T Consensus 37 l~~~v~iHElgH~~~A~~~ 55 (208)
T cd06161 37 LFLSVLLHELGHALVARRY 55 (208)
T ss_pred HHHHHHHHHHHHHHHHHHc
Confidence 3468999999998855553
No 115
>KOG2414|consensus
Probab=31.69 E-value=18 Score=33.89 Aligned_cols=14 Identities=57% Similarity=1.120 Sum_probs=11.5
Q ss_pred HHhhhhcCCC-CCCC
Q psy1606 221 HEMGHNLGME-HDTT 234 (285)
Q Consensus 221 HElGH~lG~~-HD~~ 234 (285)
|-+||.|||+ ||.+
T Consensus 402 HhVgHyLGmDVHD~p 416 (488)
T KOG2414|consen 402 HHVGHYLGMDVHDCP 416 (488)
T ss_pred cccchhcCcccccCC
Confidence 6799999997 7764
No 116
>PF04450 BSP: Peptidase of plants and bacteria; InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=30.26 E-value=99 Score=26.19 Aligned_cols=81 Identities=20% Similarity=0.165 Sum_probs=40.4
Q ss_pred CCHHHHHHHHHHhhhhccCCC----CCCCceeeeeeeecCCCceeccccCCCCccc---ccCceeecCC---CchhHHHH
Q psy1606 148 VNGDITLTNFLSYRKDRLVLS----HPNDNAQLLTGMTFSDGVVGKALKGPICTYE---FSGGVNVDHK---NVVGLVAT 217 (285)
Q Consensus 148 ~~~~~~l~~f~~w~~~~~~~~----~~~D~a~llt~~~~~~~~~G~A~~g~~C~~~---~~~~v~~~~~---~~~~~~a~ 217 (285)
+++..+|..-.......|+.. .+-+.+.|+. ....|+|+..|-.... .|........ ..-.-+..
T Consensus 24 ~~a~~~L~~a~~~V~~~ly~~~~~~~~v~~Vt~~~-----~~~~gVA~t~gd~~~~~I~~S~~~i~~~~~~~~~~~Ei~G 98 (205)
T PF04450_consen 24 PDAEQVLRDASRFVWRLLYQSPADRKPVRSVTLIL-----DDMDGVAYTSGDDDHKEIHFSARYIAKYPADGDVRDEIIG 98 (205)
T ss_pred cCHHHHHHHHHHHHHHHhCCCCCCCCcccEEEEEE-----ECCCeeEEEecCCCccEEEEeHHHHhhcccccchHHHHHH
Confidence 466666666554444444432 2234455544 2356889888722110 0111111111 11123457
Q ss_pred HHHHHhhhhcCCCCCCCC
Q psy1606 218 TVAHEMGHNLGMEHDTTE 235 (285)
Q Consensus 218 ~~AHElGH~lG~~HD~~~ 235 (285)
++.|||.|.. .+|+.+
T Consensus 99 vl~HE~~H~~--Q~~~~~ 114 (205)
T PF04450_consen 99 VLYHEMVHCW--QWDGRG 114 (205)
T ss_pred HHHHHHHHHh--hcCCCC
Confidence 9999999996 555543
No 117
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=29.85 E-value=33 Score=30.24 Aligned_cols=19 Identities=26% Similarity=0.401 Sum_probs=14.3
Q ss_pred HHHHHHHHhhhhcCCCCCC
Q psy1606 215 VATTVAHEMGHNLGMEHDT 233 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~HD~ 233 (285)
...++.||+||-+=+.+-+
T Consensus 118 ~isv~iHElgHa~~Ar~~G 136 (263)
T cd06159 118 VVGVVVHELSHGILARVEG 136 (263)
T ss_pred HHHHHHHHHHHHHHHHHcC
Confidence 3578999999998665543
No 118
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=29.56 E-value=29 Score=27.52 Aligned_cols=16 Identities=25% Similarity=0.258 Sum_probs=12.4
Q ss_pred HHHHHHHHHhhhhcCC
Q psy1606 214 LVATTVAHEMGHNLGM 229 (285)
Q Consensus 214 ~~a~~~AHElGH~lG~ 229 (285)
.+..|+.|||.|..=.
T Consensus 58 ~l~~~l~HEm~H~~~~ 73 (146)
T smart00731 58 RLRETLLHELCHAALY 73 (146)
T ss_pred HHHhhHHHHHHHHHHH
Confidence 3456999999998844
No 119
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=29.52 E-value=35 Score=28.30 Aligned_cols=20 Identities=40% Similarity=0.524 Sum_probs=15.3
Q ss_pred HHHHHHHHHhhhhcCCCCCC
Q psy1606 214 LVATTVAHEMGHNLGMEHDT 233 (285)
Q Consensus 214 ~~a~~~AHElGH~lG~~HD~ 233 (285)
....++.||+||-+-+..=+
T Consensus 40 l~~~l~iHElgH~~~A~~~G 59 (183)
T cd06160 40 LLAILGIHEMGHYLAARRHG 59 (183)
T ss_pred HHHHHHHHHHHHHHHHHHCC
Confidence 34678999999998776543
No 120
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=29.31 E-value=35 Score=29.25 Aligned_cols=13 Identities=54% Similarity=0.790 Sum_probs=11.0
Q ss_pred HHHHHHHHhhhhc
Q psy1606 215 VATTVAHEMGHNL 227 (285)
Q Consensus 215 ~a~~~AHElGH~l 227 (285)
+..|+-||+||.|
T Consensus 92 ~~~~l~HE~GHAl 104 (220)
T PF14247_consen 92 VLFTLYHELGHAL 104 (220)
T ss_pred HHHHHHHHHHHHH
Confidence 4579999999976
No 121
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=29.00 E-value=94 Score=32.24 Aligned_cols=14 Identities=36% Similarity=0.375 Sum_probs=11.7
Q ss_pred HHHHHHHHHhhhhc
Q psy1606 214 LVATTVAHEMGHNL 227 (285)
Q Consensus 214 ~~a~~~AHElGH~l 227 (285)
..+.++||||+|..
T Consensus 286 ~~~~viaHElAHqW 299 (831)
T TIGR02412 286 NRAGVILHEMAHMW 299 (831)
T ss_pred HHHHHHHHHHHHHH
Confidence 35689999999976
No 122
>PF02130 UPF0054: Uncharacterized protein family UPF0054; InterPro: IPR002036 These, as yet, uncharacterised proteins are of 17 to 21 kDa. They contain a conserved region with three histidines at the C terminus. The protein family is represented by a single member sequence only in nearly every bacterium. The crystal structure of the protein from the hyperthermophilic bacteria Aquifex aeolicus has been determined. The overall fold consists of one central alpha-helix surrounded by a four-stranded beta-sheet and four other alpha-helices. Structure-based homology analysis reveals a good resemblance to the metal-dependent proteinases such as collagenases and gelatinases. However, experimental tests for collagenase and gelatinase-type function show no detectable activity under standard assay conditions [].; GO: 0046872 metal ion binding; PDB: 1TVI_A 1OZ9_A 1XM5_A 1XAX_A.
Probab=28.91 E-value=51 Score=26.16 Aligned_cols=19 Identities=37% Similarity=0.623 Sum_probs=16.3
Q ss_pred HHHHHHHHhhhhcCCCCCC
Q psy1606 215 VATTVAHEMGHNLGMEHDT 233 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~HD~ 233 (285)
++..++|-+-|++|-+|..
T Consensus 108 l~~l~vHG~LHLlGyDH~~ 126 (145)
T PF02130_consen 108 LARLLVHGLLHLLGYDHET 126 (145)
T ss_dssp HHHHHHHHHHHHTT-SSTT
T ss_pred HhHHHHHHHHHHcCCCCCC
Confidence 5789999999999999975
No 123
>KOG1046|consensus
Probab=28.63 E-value=92 Score=32.57 Aligned_cols=15 Identities=40% Similarity=0.598 Sum_probs=12.9
Q ss_pred hHHHHHHHHHhhhhc
Q psy1606 213 GLVATTVAHEMGHNL 227 (285)
Q Consensus 213 ~~~a~~~AHElGH~l 227 (285)
..+|.+|||||+|..
T Consensus 322 ~~va~vIaHElAHQW 336 (882)
T KOG1046|consen 322 QRVAEVIAHELAHQW 336 (882)
T ss_pred HHHHHHHHHHHHHHH
Confidence 457899999999985
No 124
>PRK00016 metal-binding heat shock protein; Provisional
Probab=28.01 E-value=48 Score=26.85 Aligned_cols=20 Identities=30% Similarity=0.405 Sum_probs=17.7
Q ss_pred HHHHHHHHhhhhcCCCCCCC
Q psy1606 215 VATTVAHEMGHNLGMEHDTT 234 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~HD~~ 234 (285)
++.+++|-+-|++|-+|...
T Consensus 113 ~~~l~iHG~LHLlGYDH~~~ 132 (159)
T PRK00016 113 LAHLTVHGILHLLGYDHIED 132 (159)
T ss_pred HHHHHHHhhHHhcCCCCCCh
Confidence 57899999999999999753
No 125
>TIGR00043 metalloprotein, YbeY/UPF0054 family. This minimally protein family is represented by a single member sequence only in nearly every bacterium. Crystallography demonstrates metal-binding activity, possibly to nickel. It is a predicted to be a metallohydrolase, and somewhat more weakly to be a metalloproteinase.
Probab=26.71 E-value=58 Score=24.59 Aligned_cols=21 Identities=33% Similarity=0.488 Sum_probs=18.1
Q ss_pred HHHHHHHHHhhhhcCCCCCCC
Q psy1606 214 LVATTVAHEMGHNLGMEHDTT 234 (285)
Q Consensus 214 ~~a~~~AHElGH~lG~~HD~~ 234 (285)
-++.+++|-+=|++|-+|...
T Consensus 72 e~~~l~iHG~LHLlGyDH~~~ 92 (110)
T TIGR00043 72 ELAHLTVHGLLHLLGYDHETE 92 (110)
T ss_pred HHHHHHHHHHHHHcCCCCCCh
Confidence 357899999999999999753
No 126
>PF10463 Peptidase_U49: Peptidase U49; InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death.
Probab=26.26 E-value=46 Score=28.27 Aligned_cols=19 Identities=26% Similarity=0.532 Sum_probs=14.6
Q ss_pred HHHHHHHHhhhhcCCCCCCC
Q psy1606 215 VATTVAHEMGHNLGMEHDTT 234 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~HD~~ 234 (285)
.+.+++||++| +=+.|.+.
T Consensus 101 ~~fil~HE~~H-v~~~h~~~ 119 (206)
T PF10463_consen 101 IAFILLHELAH-VVLGHEGD 119 (206)
T ss_pred HHHHHHHHHHH-HHHcCccc
Confidence 57899999999 44666654
No 127
>PRK04860 hypothetical protein; Provisional
Probab=26.23 E-value=32 Score=27.95 Aligned_cols=13 Identities=38% Similarity=0.675 Sum_probs=9.9
Q ss_pred HHHHHHHHhhhhc
Q psy1606 215 VATTVAHEMGHNL 227 (285)
Q Consensus 215 ~a~~~AHElGH~l 227 (285)
+..||+|||.|..
T Consensus 63 l~~~v~HEl~H~~ 75 (160)
T PRK04860 63 IDEVVPHELAHLL 75 (160)
T ss_pred HHhHHHHHHHHHH
Confidence 4578899999863
No 128
>PRK04351 hypothetical protein; Provisional
Probab=26.02 E-value=38 Score=27.15 Aligned_cols=13 Identities=23% Similarity=0.365 Sum_probs=10.8
Q ss_pred HHHHHHHHHhhhh
Q psy1606 214 LVATTVAHEMGHN 226 (285)
Q Consensus 214 ~~a~~~AHElGH~ 226 (285)
.+..|++|||.|.
T Consensus 60 ~l~~vv~HElcH~ 72 (149)
T PRK04351 60 ELIGIIKHELCHY 72 (149)
T ss_pred HHHhhHHHHHHHH
Confidence 3467999999997
No 129
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=25.71 E-value=35 Score=33.90 Aligned_cols=12 Identities=33% Similarity=0.686 Sum_probs=10.7
Q ss_pred HHHHHHHhhhhc
Q psy1606 216 ATTVAHEMGHNL 227 (285)
Q Consensus 216 a~~~AHElGH~l 227 (285)
+.++||||+|+.
T Consensus 280 ~~viaHElAHqW 291 (601)
T TIGR02411 280 VDVIAHELAHSW 291 (601)
T ss_pred hhhHHHHHHhhc
Confidence 479999999997
No 130
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=25.68 E-value=44 Score=22.29 Aligned_cols=20 Identities=10% Similarity=0.385 Sum_probs=16.7
Q ss_pred HHHHHHHhhhhcCCCCCCCC
Q psy1606 216 ATTVAHEMGHNLGMEHDTTE 235 (285)
Q Consensus 216 a~~~AHElGH~lG~~HD~~~ 235 (285)
--.+-|++++.+|+.|-..+
T Consensus 30 eR~~vH~~a~~~gL~s~S~G 49 (60)
T cd02640 30 ERALIHQIAQKYGLKSRSYG 49 (60)
T ss_pred HHHHHHHHHHHcCCceeeEe
Confidence 35789999999999997654
No 131
>PF10026 DUF2268: Predicted Zn-dependent protease (DUF2268); InterPro: IPR018728 This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function.
Probab=25.33 E-value=1.3e+02 Score=25.02 Aligned_cols=16 Identities=31% Similarity=0.389 Sum_probs=13.1
Q ss_pred hHHHHHHHHHhhhhcC
Q psy1606 213 GLVATTVAHEMGHNLG 228 (285)
Q Consensus 213 ~~~a~~~AHElGH~lG 228 (285)
..+..++|||+-|+.=
T Consensus 63 ~~l~~~iaHE~hH~~r 78 (195)
T PF10026_consen 63 EELPALIAHEYHHNCR 78 (195)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3467899999999973
No 132
>cd06007 R3H_DEXH_helicase R3H domain of a group of proteins which also contain a DEXH-box helicase domain, and may function as ATP-dependent DNA or RNA helicases. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=25.30 E-value=47 Score=22.11 Aligned_cols=20 Identities=20% Similarity=0.456 Sum_probs=17.0
Q ss_pred HHHHHHHhhhhcCCCCCCCC
Q psy1606 216 ATTVAHEMGHNLGMEHDTTE 235 (285)
Q Consensus 216 a~~~AHElGH~lG~~HD~~~ 235 (285)
--.+-|++++.+|+.|...+
T Consensus 29 eR~~vH~~a~~~gL~s~S~G 48 (59)
T cd06007 29 ERAVIHRLCRKLGLKSKSKG 48 (59)
T ss_pred HHHHHHHHHHHcCCCceeec
Confidence 36789999999999998654
No 133
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=24.65 E-value=28 Score=34.78 Aligned_cols=12 Identities=58% Similarity=0.911 Sum_probs=11.0
Q ss_pred HHHHHHHhhhhc
Q psy1606 216 ATTVAHEMGHNL 227 (285)
Q Consensus 216 a~~~AHElGH~l 227 (285)
+.|+=||+||-|
T Consensus 468 V~TLFHEfGHgL 479 (683)
T COG0339 468 VTTLFHEFGHGL 479 (683)
T ss_pred HHHHHHHhhhHH
Confidence 689999999987
No 134
>PRK10911 oligopeptidase A; Provisional
Probab=24.53 E-value=37 Score=34.30 Aligned_cols=13 Identities=46% Similarity=0.713 Sum_probs=11.5
Q ss_pred HHHHHHHHhhhhc
Q psy1606 215 VATTVAHEMGHNL 227 (285)
Q Consensus 215 ~a~~~AHElGH~l 227 (285)
-..|+.||+||.|
T Consensus 463 ~v~tlfHEfGHal 475 (680)
T PRK10911 463 EVITLFHEFGHGL 475 (680)
T ss_pred HHHHHHHHHhHHH
Confidence 3589999999998
No 135
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=23.87 E-value=47 Score=26.27 Aligned_cols=16 Identities=31% Similarity=0.428 Sum_probs=12.8
Q ss_pred HHHHHHHHHhhhhcCC
Q psy1606 214 LVATTVAHEMGHNLGM 229 (285)
Q Consensus 214 ~~a~~~AHElGH~lG~ 229 (285)
.+-.|++|||.|..=-
T Consensus 59 ~~~~tL~HEm~H~~~~ 74 (157)
T PF10263_consen 59 ELIDTLLHEMAHAAAY 74 (157)
T ss_pred HHHHHHHHHHHHHHhh
Confidence 4568999999998753
No 136
>TIGR00744 ROK_glcA_fam ROK family protein (putative glucokinase). This alignment models one branch of the ROK superfamily of proteins. The three members of the seed alignment for this model all have experimental evidence for activity as glucokinase, but the set of related proteins is crowded with paralogs of different or unknown function. Proteins scoring above the trusted_cutoff will show strong similarity to at least one known glucokinase and may be designated as putative glucokinases. However, definitive identification of glucokinases should be done only with extreme caution.
Probab=23.39 E-value=28 Score=31.12 Aligned_cols=29 Identities=34% Similarity=0.610 Sum_probs=21.5
Q ss_pred HHHHHHhhhhcCCCCCC-CCCCCCCCCcccC
Q psy1606 217 TTVAHEMGHNLGMEHDT-TECTCPSDRCIMA 246 (285)
Q Consensus 217 ~~~AHElGH~lG~~HD~-~~c~C~~~~cIM~ 246 (285)
.-.|-|+||.. +..++ ..|.|++.+|+-.
T Consensus 150 ~g~agEiGh~~-v~~~g~~~C~cG~~gclE~ 179 (318)
T TIGR00744 150 NGVGAEIGHIR-MVPDGRLLCNCGKQGCIET 179 (318)
T ss_pred CCCCcccCceE-eCCCCCcccCCCCcchHHH
Confidence 34678999986 34566 6899999888754
No 137
>PLN02791 Nudix hydrolase homolog
Probab=23.23 E-value=47 Score=33.98 Aligned_cols=17 Identities=29% Similarity=0.524 Sum_probs=14.8
Q ss_pred HHHHHHHHhhhhcCCCC
Q psy1606 215 VATTVAHEMGHNLGMEH 231 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~H 231 (285)
...|+-||+||.+|=.+
T Consensus 585 ~~~v~lHElgHGsG~~~ 601 (770)
T PLN02791 585 FTHTICHECCHGIGPHT 601 (770)
T ss_pred HHHHHHHHhhccccccc
Confidence 36899999999999866
No 138
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=22.28 E-value=56 Score=29.08 Aligned_cols=18 Identities=33% Similarity=0.645 Sum_probs=13.7
Q ss_pred HHHHHHHHhhhhcCCCCC
Q psy1606 215 VATTVAHEMGHNLGMEHD 232 (285)
Q Consensus 215 ~a~~~AHElGH~lG~~HD 232 (285)
+..++.||+||.+=+..-
T Consensus 135 ~isvvvHElgHal~A~~~ 152 (277)
T cd06162 135 LISGVVHEMGHGVAAVRE 152 (277)
T ss_pred HHHHHHHHHHHHHHHHHc
Confidence 347899999999866543
No 139
>COG1940 NagC Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]
Probab=21.88 E-value=32 Score=30.69 Aligned_cols=32 Identities=25% Similarity=0.443 Sum_probs=22.3
Q ss_pred HHHHHHhhhhcCCCCCCCCCCCCCCCcccCCCCC
Q psy1606 217 TTVAHEMGHNLGMEHDTTECTCPSDRCIMAPSSS 250 (285)
Q Consensus 217 ~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~ 250 (285)
.-.|+|+||..=..+.. |.|++.+|+-.....
T Consensus 159 ~g~age~Gh~~v~~~g~--c~cG~~GclE~~as~ 190 (314)
T COG1940 159 NGNAGEIGHMVVDPDGE--CGCGRRGCLETYASG 190 (314)
T ss_pred CCccccccceEECCCCc--cCCCCCCchHHhccH
Confidence 44688999986333322 999999998766543
No 140
>PRK05082 N-acetylmannosamine kinase; Provisional
Probab=20.62 E-value=38 Score=29.88 Aligned_cols=31 Identities=26% Similarity=0.510 Sum_probs=23.2
Q ss_pred HHHHHHhhhhcCCCCCCCCCCCCCCCcccCCC
Q psy1606 217 TTVAHEMGHNLGMEHDTTECTCPSDRCIMAPS 248 (285)
Q Consensus 217 ~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~ 248 (285)
.-.|-|+||..-.+ ++..|.|++.+|+-...
T Consensus 148 ~g~AGEiGh~~v~~-~g~~c~CG~~GclE~~~ 178 (291)
T PRK05082 148 GGLAGHIGHTLADP-HGPVCGCGRRGCVEAIA 178 (291)
T ss_pred CCccccccceEecC-CCCCCCCCCcCchhhhc
Confidence 35788999987433 35689999999987654
No 141
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=20.23 E-value=60 Score=30.63 Aligned_cols=13 Identities=38% Similarity=0.626 Sum_probs=10.9
Q ss_pred HHHHHHHHhhhhc
Q psy1606 215 VATTVAHEMGHNL 227 (285)
Q Consensus 215 ~a~~~AHElGH~l 227 (285)
...++.||+||-+
T Consensus 14 ~~~v~~HE~gH~~ 26 (420)
T TIGR00054 14 AVLIFVHELGHFL 26 (420)
T ss_pred HHHHHHHhHHHHH
Confidence 4578999999975
No 142
>COG4823 AbiF Abortive infection bacteriophage resistance protein [Defense mechanisms]
Probab=20.19 E-value=54 Score=28.79 Aligned_cols=14 Identities=36% Similarity=0.560 Sum_probs=11.4
Q ss_pred HHHHHHHhhhhcCC
Q psy1606 216 ATTVAHEMGHNLGM 229 (285)
Q Consensus 216 a~~~AHElGH~lG~ 229 (285)
-.+|||++||+-|.
T Consensus 93 Ks~iAyllg~~~gp 106 (299)
T COG4823 93 KSVIAYLLGHNCGP 106 (299)
T ss_pred HHHHHHHhccccCc
Confidence 36899999998763
Done!