Query psy5111
Match_columns 132
No_of_seqs 139 out of 1581
Neff 9.0
Searched_HMMs 46136
Date Fri Aug 16 17:32:33 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5111.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5111hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1046|consensus 99.9 2.6E-27 5.6E-32 194.9 2.8 99 1-102 306-404 (882)
2 PF01433 Peptidase_M1: Peptida 99.9 9.3E-24 2E-28 160.3 1.5 95 1-98 277-371 (390)
3 COG0308 PepN Aminopeptidase N 99.8 1.4E-21 3.1E-26 160.9 3.5 96 2-100 290-386 (859)
4 TIGR02412 pepN_strep_liv amino 99.8 2.4E-21 5.1E-26 159.4 3.8 86 11-99 279-364 (831)
5 KOG1046|consensus 99.8 6E-21 1.3E-25 157.5 3.6 85 48-132 265-365 (882)
6 TIGR02414 pepN_proteo aminopep 99.8 1.9E-20 4.1E-25 153.8 4.6 94 1-97 265-358 (863)
7 PRK14015 pepN aminopeptidase N 99.8 5.8E-20 1.3E-24 151.2 5.1 59 2-60 279-337 (875)
8 TIGR02411 leuko_A4_hydro leuko 99.8 4.2E-19 9.1E-24 141.7 4.5 78 54-131 234-319 (601)
9 TIGR02412 pepN_strep_liv amino 99.8 5.1E-19 1.1E-23 145.7 4.1 84 47-131 229-327 (831)
10 PF01433 Peptidase_M1: Peptida 99.7 1E-18 2.2E-23 132.8 3.7 86 46-131 234-335 (390)
11 TIGR02414 pepN_proteo aminopep 99.7 2E-18 4.2E-23 142.0 4.0 84 45-129 221-321 (863)
12 PRK14015 pepN aminopeptidase N 99.7 2.4E-18 5.2E-23 141.7 4.4 84 46-130 235-335 (875)
13 TIGR02411 leuko_A4_hydro leuko 99.7 3.1E-18 6.6E-23 136.8 3.8 47 19-65 279-325 (601)
14 COG0308 PepN Aminopeptidase N 99.7 3E-18 6.5E-23 141.4 3.0 86 46-131 247-347 (859)
15 KOG1047|consensus 99.6 4E-16 8.6E-21 120.7 2.1 65 67-131 264-328 (613)
16 KOG1047|consensus 99.4 2.4E-14 5.1E-19 111.0 1.6 49 17-65 286-334 (613)
17 PF13485 Peptidase_MA_2: Pepti 98.6 4.5E-08 9.7E-13 62.9 3.4 43 14-56 20-62 (128)
18 PF13485 Peptidase_MA_2: Pepti 98.5 7.1E-08 1.5E-12 62.0 2.5 43 87-129 21-63 (128)
19 KOG1932|consensus 98.4 1.4E-07 3.1E-12 78.6 2.3 108 24-132 261-382 (1180)
20 KOG1932|consensus 97.7 4.1E-05 8.9E-10 64.4 3.9 60 3-64 327-386 (1180)
21 PF05299 Peptidase_M61: M61 gl 97.1 9.3E-05 2E-09 48.0 -0.6 43 17-59 2-55 (122)
22 PF07607 DUF1570: Protein of u 96.9 0.00081 1.8E-08 44.0 2.8 39 20-59 2-43 (128)
23 PF05299 Peptidase_M61: M61 gl 95.7 0.0034 7.4E-08 40.8 0.3 39 92-130 5-54 (122)
24 PF10460 Peptidase_M30: Peptid 95.6 0.015 3.3E-07 44.4 3.6 49 15-63 135-187 (366)
25 COG3975 Predicted protease wit 95.5 0.0078 1.7E-07 47.7 1.7 55 4-58 231-297 (558)
26 COG4324 Predicted aminopeptida 93.8 0.063 1.4E-06 39.5 2.8 42 15-62 193-234 (376)
27 PF10023 DUF2265: Predicted am 93.5 0.062 1.3E-06 40.7 2.4 41 16-62 162-202 (337)
28 PF10026 DUF2268: Predicted Zn 93.2 0.26 5.7E-06 34.4 5.1 47 13-59 59-109 (195)
29 PF10460 Peptidase_M30: Peptid 92.7 0.11 2.4E-06 39.9 2.8 41 90-130 138-182 (366)
30 PF04450 BSP: Peptidase of pla 92.1 0.15 3.2E-06 36.1 2.7 36 90-129 95-131 (205)
31 PF10023 DUF2265: Predicted am 90.6 0.13 2.9E-06 38.9 1.3 41 84-130 158-198 (337)
32 PF04450 BSP: Peptidase of pla 89.4 0.44 9.5E-06 33.7 3.1 42 14-58 91-132 (205)
33 COG4324 Predicted aminopeptida 89.4 0.18 3.8E-06 37.2 1.1 41 84-130 190-230 (376)
34 smart00731 SprT SprT homologue 86.2 0.38 8.3E-06 32.0 1.1 29 5-33 45-73 (146)
35 PF06114 DUF955: Domain of unk 85.3 1.1 2.3E-05 27.8 2.9 31 3-37 30-60 (122)
36 PF01431 Peptidase_M13: Peptid 85.0 0.88 1.9E-05 31.7 2.6 40 5-44 22-68 (206)
37 PF07607 DUF1570: Protein of u 84.4 0.48 1E-05 31.0 0.9 14 116-129 28-41 (128)
38 PRK04860 hypothetical protein; 83.7 0.48 1.1E-05 32.2 0.8 27 6-32 50-76 (160)
39 COG3975 Predicted protease wit 83.6 0.48 1E-05 38.0 0.8 42 89-130 245-297 (558)
40 PF10263 SprT-like: SprT-like 83.1 0.63 1.4E-05 31.0 1.1 21 13-33 54-74 (157)
41 PRK04351 hypothetical protein; 82.8 0.46 9.9E-06 31.9 0.4 24 7-30 49-72 (149)
42 COG3590 PepO Predicted metallo 81.6 0.37 8E-06 39.0 -0.5 27 5-31 473-499 (654)
43 PF12725 DUF3810: Protein of u 81.6 0.8 1.7E-05 34.6 1.3 17 15-31 192-208 (318)
44 PF01435 Peptidase_M48: Peptid 80.7 1.2 2.5E-05 31.2 1.8 25 14-38 84-108 (226)
45 PF13699 DUF4157: Domain of un 80.4 1.3 2.8E-05 26.4 1.7 12 20-31 62-73 (79)
46 PHA02456 zinc metallopeptidase 79.7 1.1 2.3E-05 28.7 1.2 17 15-31 75-91 (141)
47 KOG2661|consensus 78.2 1.8 3.8E-05 33.1 2.1 22 12-33 268-289 (424)
48 COG5504 Predicted Zn-dependent 77.1 3.3 7.2E-05 30.3 3.2 48 14-64 135-191 (280)
49 PF08325 WLM: WLM domain; Int 76.6 2.1 4.5E-05 29.8 2.1 24 11-34 74-97 (186)
50 PRK05457 heat shock protein Ht 76.5 1.5 3.2E-05 32.6 1.3 22 14-35 129-150 (284)
51 PF12315 DUF3633: Protein of u 76.3 5.5 0.00012 28.3 4.1 41 19-61 93-133 (212)
52 PRK03072 heat shock protein Ht 76.1 1.9 4E-05 32.1 1.8 20 15-34 123-142 (288)
53 PRK03001 M48 family peptidase; 75.9 1.9 4.1E-05 31.9 1.8 20 15-34 120-139 (283)
54 COG0501 HtpX Zn-dependent prot 75.5 2 4.3E-05 31.6 1.8 25 13-37 151-175 (302)
55 PRK03982 heat shock protein Ht 75.5 2 4.3E-05 31.8 1.8 20 15-34 121-140 (288)
56 PF12315 DUF3633: Protein of u 74.1 4.7 0.0001 28.7 3.3 39 92-132 94-132 (212)
57 PF13574 Reprolysin_2: Metallo 74.0 2.4 5.1E-05 29.0 1.8 13 19-31 111-123 (173)
58 PRK04897 heat shock protein Ht 73.9 2.3 5E-05 31.8 1.8 20 15-34 133-152 (298)
59 PRK01345 heat shock protein Ht 73.9 2.3 5E-05 32.1 1.8 20 15-34 120-139 (317)
60 PRK02870 heat shock protein Ht 71.5 2.8 6.1E-05 32.0 1.8 16 15-30 169-184 (336)
61 PRK02391 heat shock protein Ht 71.0 2.9 6.4E-05 31.2 1.8 20 15-34 129-148 (296)
62 PRK01265 heat shock protein Ht 68.5 3.6 7.8E-05 31.2 1.8 19 15-33 136-154 (324)
63 PF05569 Peptidase_M56: BlaR1 65.2 5.9 0.00013 29.3 2.4 39 5-43 182-228 (299)
64 PF13203 DUF2201_N: Putative m 63.9 4.3 9.3E-05 30.0 1.5 38 3-40 44-81 (292)
65 COG3091 SprT Zn-dependent meta 63.0 2.5 5.4E-05 28.5 0.1 16 15-30 57-72 (156)
66 COG4783 Putative Zn-dependent 62.8 5 0.00011 32.0 1.7 22 15-36 126-147 (484)
67 KOG3624|consensus 60.5 15 0.00033 30.6 4.2 51 6-57 505-562 (687)
68 cd04269 ZnMc_adamalysin_II_lik 60.4 6.2 0.00013 27.2 1.7 15 17-31 129-143 (194)
69 COG3227 LasB Zinc metalloprote 57.8 7.5 0.00016 31.1 1.9 34 93-128 339-375 (507)
70 PF13688 Reprolysin_5: Metallo 57.4 7 0.00015 26.8 1.6 15 17-31 140-154 (196)
71 COG4900 Predicted metallopepti 57.3 7.7 0.00017 24.9 1.6 30 7-37 69-101 (133)
72 PF09768 Peptidase_M76: Peptid 54.3 11 0.00024 26.0 2.1 28 13-41 65-92 (173)
73 cd04272 ZnMc_salivary_gland_MP 52.3 9.5 0.00021 26.9 1.6 14 18-31 144-157 (220)
74 cd04267 ZnMc_ADAM_like Zinc-de 50.1 8.4 0.00018 26.4 1.0 15 17-31 131-145 (192)
75 cd04270 ZnMc_TACE_like Zinc-de 49.6 11 0.00024 27.3 1.5 13 19-31 167-179 (244)
76 PF01863 DUF45: Protein of unk 49.2 12 0.00025 25.9 1.6 19 16-34 161-179 (205)
77 PF01421 Reprolysin: Reprolysi 48.2 13 0.00028 25.7 1.7 16 15-30 127-142 (199)
78 KOG3314|consensus 46.8 19 0.0004 24.8 2.2 26 14-39 86-111 (194)
79 PF04228 Zn_peptidase: Putativ 45.7 14 0.00031 27.6 1.7 14 18-31 169-182 (292)
80 PF06262 DUF1025: Possibl zinc 42.2 24 0.00053 21.9 2.1 17 15-31 69-85 (97)
81 PF14521 Aspzincin_M35: Lysine 41.1 20 0.00042 23.9 1.6 13 17-29 94-106 (148)
82 PF01447 Peptidase_M4: Thermol 40.9 14 0.0003 24.8 0.9 11 92-102 136-146 (150)
83 PF10463 Peptidase_U49: Peptid 40.5 70 0.0015 22.8 4.4 20 18-37 100-119 (206)
84 PF14891 Peptidase_M91: Effect 40.0 21 0.00046 24.3 1.7 14 18-31 102-115 (174)
85 PF08653 DASH_Dam1: DASH compl 39.3 27 0.00058 19.6 1.7 12 118-129 34-45 (58)
86 COG1451 Predicted metal-depend 39.2 29 0.00062 24.9 2.4 27 15-42 171-197 (223)
87 COG2856 Predicted Zn peptidase 39.2 25 0.00054 25.1 2.0 40 16-56 69-114 (213)
88 PF01457 Peptidase_M8: Leishma 37.2 22 0.00048 28.7 1.7 25 6-30 197-221 (521)
89 PF13265 DUF4056: Protein of u 36.8 16 0.00035 26.8 0.8 10 23-33 152-161 (270)
90 PF08219 TOM13: Outer membrane 36.0 27 0.00059 20.7 1.5 13 19-31 52-64 (77)
91 COG3824 Predicted Zn-dependent 35.7 19 0.00041 23.4 0.9 15 17-31 107-121 (136)
92 cd04271 ZnMc_ADAM_fungal Zinc- 34.2 15 0.00033 26.3 0.3 11 21-31 147-157 (228)
93 KOG2719|consensus 33.7 26 0.00056 27.7 1.5 22 15-36 276-297 (428)
94 cd00203 ZnMc Zinc-dependent me 32.7 23 0.0005 23.4 1.0 14 18-31 95-108 (167)
95 PF00413 Peptidase_M10: Matrix 30.7 36 0.00078 22.1 1.6 15 16-30 102-116 (154)
96 cd04268 ZnMc_MMP_like Zinc-dep 30.5 41 0.00089 22.1 1.9 13 18-30 93-105 (165)
97 cd04273 ZnMc_ADAMTS_like Zinc- 30.1 7.4 0.00016 27.2 -1.9 14 18-31 139-152 (207)
98 PF11458 Mistic: Membrane-inte 29.7 43 0.00093 19.8 1.6 12 118-129 18-29 (84)
99 PTZ00257 Glycoprotein GP63 (le 29.1 33 0.00071 28.5 1.4 14 17-30 254-267 (622)
100 PF02031 Peptidase_M7: Strepto 28.5 40 0.00088 22.2 1.5 13 18-30 76-88 (132)
101 PF07998 Peptidase_M54: Peptid 27.6 51 0.0011 23.2 2.0 23 15-39 141-163 (194)
102 cd06007 R3H_DEXH_helicase R3H 27.3 73 0.0016 17.8 2.2 16 20-37 29-44 (59)
103 PRK13267 archaemetzincin-like 26.9 57 0.0012 22.6 2.1 17 15-31 121-137 (179)
104 PF12388 Peptidase_M57: Dual-a 26.4 47 0.001 23.7 1.6 14 17-30 131-144 (211)
105 PF02163 Peptidase_M50: Peptid 26.2 52 0.0011 22.4 1.9 22 19-40 7-28 (192)
106 PF13583 Reprolysin_4: Metallo 26.1 36 0.00079 23.9 1.1 11 21-31 139-149 (206)
107 PF07098 DUF1360: Protein of u 25.4 84 0.0018 19.8 2.5 33 19-51 41-73 (105)
108 PF06167 Peptidase_M90: Glucos 24.8 44 0.00096 24.5 1.3 11 21-31 156-166 (253)
109 COG3228 Uncharacterized protei 24.1 34 0.00074 25.0 0.6 11 20-30 148-158 (266)
110 PHA00527 hypothetical protein 23.7 67 0.0015 20.4 1.8 16 18-33 72-87 (129)
111 PRK15410 DgsA anti-repressor M 22.8 41 0.0009 24.8 0.9 12 20-31 143-154 (260)
112 PF13582 Reprolysin_3: Metallo 22.6 47 0.001 20.7 1.0 11 92-102 108-118 (124)
113 cd04279 ZnMc_MMP_like_1 Zinc-d 22.4 60 0.0013 21.4 1.5 14 17-30 102-115 (156)
114 cd02639 R3H_RRM R3H domain of 21.8 1E+02 0.0022 17.3 2.1 12 20-31 30-41 (60)
115 COG4219 MecR1 Antirepressor re 21.0 58 0.0013 24.9 1.3 48 6-53 177-232 (337)
116 PF05572 Peptidase_M43: Pregna 20.4 64 0.0014 21.6 1.4 17 16-33 66-82 (154)
117 cd06158 S2P-M50_like_1 Unchara 20.4 80 0.0017 21.7 1.9 13 20-32 10-22 (181)
No 1
>KOG1046|consensus
Probab=99.93 E-value=2.6e-27 Score=194.90 Aligned_cols=99 Identities=33% Similarity=0.575 Sum_probs=93.5
Q ss_pred CcceeccCCCCChhhhhhhHHHHHHHHHHHHhhCCCCchhhhHHHHHHHHHHHHhhhhhhhhhccCCCccccchhhhccc
Q psy5111 1 EKFLLTSKGQCTTADFQRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFGTRMLYVQEIPTPIREKFLLTSK 80 (132)
Q Consensus 1 e~~~l~~~~~~~~~~~~~~~~~i~he~ah~w~g~lvt~~~w~~~wl~Eg~a~y~~~~~~~~~~p~~~~~~~~e~~il~~~ 80 (132)
|+.+||+|..++..+|++++.+||||+||||||||||++||+++|||||||+|+++++++...|+ | ++.+.++....
T Consensus 306 e~~lL~~~~~ss~~~k~~va~vIaHElAHQWFGNLVTm~wW~dLWLnEGfAt~~~~~~v~~~~p~-~--~~~~~~~~~~l 382 (882)
T KOG1046|consen 306 ETALLYDPQTSSSSNKQRVAEVIAHELAHQWFGNLVTMKWWNDLWLNEGFATYVEYLAVDHLFPE-W--DIWEQFLLENL 382 (882)
T ss_pred ehhhccCCCcCcHHHHHHHHHHHHHHHHHHHhcCcccHhhhhhhhhcccHHHHHHHHhhccCCcc-h--hhHHHHHHHHH
Confidence 57899999999999999999999999999999999999999999999999999999999999999 9 88888999999
Q ss_pred ccccccccccccchhhhHhHhh
Q psy5111 81 HQCTTADFQRATSVVTHEFTHQ 102 (132)
Q Consensus 81 ~~al~~D~~~~s~~I~hEvahq 102 (132)
..++..|+...++||.-.+.|.
T Consensus 383 ~~~l~~D~l~~shpi~~~v~~~ 404 (882)
T KOG1046|consen 383 ERVLSLDALASSHPISVPVESP 404 (882)
T ss_pred HHHhhhhcccccCCeeeecCCc
Confidence 9999999999999998776665
No 2
>PF01433 Peptidase_M1: Peptidase family M1 This is family M1 in the peptidase classification.; InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=99.88 E-value=9.3e-24 Score=160.35 Aligned_cols=95 Identities=35% Similarity=0.605 Sum_probs=85.6
Q ss_pred CcceeccCCCCChhhhhhhHHHHHHHHHHHHhhCCCCchhhhHHHHHHHHHHHHhhhhhhhhhccCCCccccchhhhccc
Q psy5111 1 EKFLLTSKGQCTTADFQRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFGTRMLYVQEIPTPIREKFLLTSK 80 (132)
Q Consensus 1 e~~~l~~~~~~~~~~~~~~~~~i~he~ah~w~g~lvt~~~w~~~wl~Eg~a~y~~~~~~~~~~p~~~~~~~~e~~il~~~ 80 (132)
|..++++++.++..++..+..+||||+|||||||+||++||+++||+||||+|++++..+...|+ + .+.+.+.....
T Consensus 277 ~~~l~~~~~~~~~~~~~~~~~~iahElahqWfGn~vt~~~w~d~WL~Eg~a~y~~~~~~~~~~~~-~--~~~~~~~~~~~ 353 (390)
T PF01433_consen 277 ESYLLYDPDISTIGDKQEIASLIAHELAHQWFGNLVTPKWWSDLWLNEGFATYLEYLILEKLFGE-W--QMMELFLVQEM 353 (390)
T ss_dssp GGGTS-STTTS-HHHHHHHHHHHHHHHHTTTBTTTEEESSGGGHHHHHHHHHHHHHHHHHHHHGH-H--HHHHHHHHHHH
T ss_pred ccccccCcccccchhhhhhHHHHHHHHHHHHhccCCccccchhhhHHHHHHHHHHHHhHhhccCc-c--cchhhhhhhhH
Confidence 45688999999999999999999999999999999999999999999999999999999999997 7 78888888888
Q ss_pred ccccccccccccchhhhH
Q psy5111 81 HQCTTADFQRATSVVTHE 98 (132)
Q Consensus 81 ~~al~~D~~~~s~~I~hE 98 (132)
+.++..|....++||.++
T Consensus 354 ~~~~~~d~~~~~~pl~~~ 371 (390)
T PF01433_consen 354 QRALREDALPNSHPLSSE 371 (390)
T ss_dssp HHHHHHHTSTTCCCSSSS
T ss_pred HHHHHHhhcCCCcceEeC
Confidence 899999999999999844
No 3
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=99.83 E-value=1.4e-21 Score=160.93 Aligned_cols=96 Identities=29% Similarity=0.466 Sum_probs=83.2
Q ss_pred cceeccCCCCChhhhhhhHHHHHHHHHHHHhhCCCCchhhhHHHHHHHHHHHHhhhhhhhhhc-cCCCccccchhhhccc
Q psy5111 2 KFLLTSKGQCTTADFQRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFGTRMLYV-QEIPTPIREKFLLTSK 80 (132)
Q Consensus 2 ~~~l~~~~~~~~~~~~~~~~~i~he~ah~w~g~lvt~~~w~~~wl~Eg~a~y~~~~~~~~~~p-~~~~~~~~e~~il~~~ 80 (132)
..+|.+++.++...+++++.+|+||+|||||||+|||+||+++||||||++|+++...+...| . + ...+.+.....
T Consensus 290 ~~ll~~~~~at~~~~~~~~~viaHElaHqWfGnlVT~~~W~~lWLnEgfat~~e~~~~~~~~~~~-~--~~~~~~~~~~~ 366 (859)
T COG0308 290 KYLLADPETATDSDYENVEEVIAHELAHQWFGNLVTMKWWDDLWLNEGFATFREVLWSEDLGGRA-W--KRWEDFRTLRT 366 (859)
T ss_pred eEEeeCcccchhHHHHHHHHHHHHHHhhhcccceeeccCHHHHHHhhhhHHHHHHHHHHHhcchH-H--HHHHHHHHHhh
Confidence 458889999999999999999999999999999999999999999999999999999988887 4 5 44566554444
Q ss_pred ccccccccccccchhhhHhH
Q psy5111 81 HQCTTADFQRATSVVTHEFT 100 (132)
Q Consensus 81 ~~al~~D~~~~s~~I~hEva 100 (132)
..++..|+...++||...+.
T Consensus 367 ~~~~~~D~~~~~hPi~~~~~ 386 (859)
T COG0308 367 SIALAEDSLPSSHPIRVDVY 386 (859)
T ss_pred hHHHhhccccccCCcccCCC
Confidence 44888999999999987665
No 4
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=99.83 E-value=2.4e-21 Score=159.35 Aligned_cols=86 Identities=30% Similarity=0.455 Sum_probs=71.2
Q ss_pred CChhhhhhhHHHHHHHHHHHHhhCCCCchhhhHHHHHHHHHHHHhhhhhhhhhccCCCccccchhhhccccccccccccc
Q psy5111 11 CTTADFQRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFGTRMLYVQEIPTPIREKFLLTSKHQCTTADFQR 90 (132)
Q Consensus 11 ~~~~~~~~~~~~i~he~ah~w~g~lvt~~~w~~~wl~Eg~a~y~~~~~~~~~~p~~~~~~~~e~~il~~~~~al~~D~~~ 90 (132)
.+...++.++.+|+||+|||||||+||++||+++|||||||+|++++..+...|. + .....+.......++..|...
T Consensus 279 ~~~~~~~~~~~viaHElAHqWFGnlVT~~wW~dlWLnEGFAty~e~~~~~~~~~~-~--~~~~~f~~~~~~~a~~~D~~~ 355 (831)
T TIGR02412 279 ATRAEKENRAGVILHEMAHMWFGDLVTMRWWNDLWLNESFAEYMGTLASAEATEY-T--DAWTTFAAQGKQWAYEADQLP 355 (831)
T ss_pred CCHHHHHHHHHHHHHHHHHHHhCCEeccccccchhHHHHHHHHHHHHHHHhcCCc-c--hHHHHHHHHHHHHHHHHhccc
Confidence 3456788899999999999999999999999999999999999999999888776 6 555666555555667778877
Q ss_pred ccchhhhHh
Q psy5111 91 ATSVVTHEF 99 (132)
Q Consensus 91 ~s~~I~hEv 99 (132)
.++||..++
T Consensus 356 ~t~Pi~~~~ 364 (831)
T TIGR02412 356 TTHPIVADV 364 (831)
T ss_pred CCCCCccCC
Confidence 788886443
No 5
>KOG1046|consensus
Probab=99.82 E-value=6e-21 Score=157.48 Aligned_cols=85 Identities=35% Similarity=0.638 Sum_probs=71.7
Q ss_pred HHHHHHHhhhhhhhhhccC-------CC-c-----cc---cchhhhcccccccccccccccchhhhHhHhhhhhcccCCC
Q psy5111 48 EAFARLFEYFGTRMLYVQE-------IP-T-----PI---REKFLLTSKHQCTTADFQRATSVVTHEFTHQWFGDLVTPA 111 (132)
Q Consensus 48 Eg~a~y~~~~~~~~~~p~~-------~~-~-----~~---~e~~il~~~~~al~~D~~~~s~~I~hEvahqWfg~~v~~~ 111 (132)
..|..|..++++.|+.|+. +. | |+ ++..++.+.+.....+.++.+.+|+||+|||||||+||++
T Consensus 265 ~~L~~~e~~f~i~yPLpK~D~iavPdf~~GAMENwGLvtyre~~lL~~~~~ss~~~k~~va~vIaHElAHQWFGNLVTm~ 344 (882)
T KOG1046|consen 265 KVLEFYEDYFGIPYPLPKLDLVAVPDFSAGAMENWGLVTYRETALLYDPQTSSSSNKQRVAEVIAHELAHQWFGNLVTMK 344 (882)
T ss_pred HHHHHHHHHhCCCCCCccccEEecCCccccchhcCcceeeeehhhccCCCcCcHHHHHHHHHHHHHHHHHHHhcCcccHh
Confidence 5677888899999999863 10 0 22 6667777777777778888999999999999999999999
Q ss_pred CchhhHHhHHHHHHHHHhccC
Q psy5111 112 SWNFAWLNEAFARLFEYFGTR 132 (132)
Q Consensus 112 ~w~~~wl~e~~~~~~~~~~~~ 132 (132)
||+++|||||||+|+++++++
T Consensus 345 wW~dLWLnEGfAt~~~~~~v~ 365 (882)
T KOG1046|consen 345 WWNDLWLNEGFATYVEYLAVD 365 (882)
T ss_pred hhhhhhhcccHHHHHHHHhhc
Confidence 999999999999999999875
No 6
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=99.81 E-value=1.9e-20 Score=153.78 Aligned_cols=94 Identities=24% Similarity=0.331 Sum_probs=67.8
Q ss_pred CcceeccCCCCChhhhhhhHHHHHHHHHHHHhhCCCCchhhhHHHHHHHHHHHHhhhhhhhhhccCCCccccchhhhccc
Q psy5111 1 EKFLLTSKGQCTTADFQRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFGTRMLYVQEIPTPIREKFLLTSK 80 (132)
Q Consensus 1 e~~~l~~~~~~~~~~~~~~~~~i~he~ah~w~g~lvt~~~w~~~wl~Eg~a~y~~~~~~~~~~p~~~~~~~~e~~il~~~ 80 (132)
|..+|+++...++.+++++..+|+||++||||||+|||+||+++|||||||+|++........+. ...++.+...+..
T Consensus 265 e~~lL~~~~~~td~~~~~i~~VIaHElaHqWfGNlVT~~~W~~LWLnEGfAty~e~~~~~~~~~~-~~~~~~~~~~lr~- 342 (863)
T TIGR02414 265 SKYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSADMTSR-AVKRIEDVRLLRA- 342 (863)
T ss_pred cceEEeCCCCCCHHHHHHHHHHHHHHHHHHHhcceeeecchhhhhhhhhHHHHHHHHHHHHhhhH-HHHHHHHHHHHHh-
Confidence 35688898888888999999999999999999999999999999999999999986443333221 0002223322222
Q ss_pred ccccccccccccchhhh
Q psy5111 81 HQCTTADFQRATSVVTH 97 (132)
Q Consensus 81 ~~al~~D~~~~s~~I~h 97 (132)
.++..|....++||..
T Consensus 343 -~~f~~D~~p~~~Pi~~ 358 (863)
T TIGR02414 343 -HQFPEDAGPMAHPVRP 358 (863)
T ss_pred -hhhcccccccCCCCCC
Confidence 2455676776777653
No 7
>PRK14015 pepN aminopeptidase N; Provisional
Probab=99.79 E-value=5.8e-20 Score=151.20 Aligned_cols=59 Identities=31% Similarity=0.506 Sum_probs=52.5
Q ss_pred cceeccCCCCChhhhhhhHHHHHHHHHHHHhhCCCCchhhhHHHHHHHHHHHHhhhhhh
Q psy5111 2 KFLLTSKGQCTTADFQRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFGTR 60 (132)
Q Consensus 2 ~~~l~~~~~~~~~~~~~~~~~i~he~ah~w~g~lvt~~~w~~~wl~Eg~a~y~~~~~~~ 60 (132)
..+|++++..+..++.++..+|+||++||||||+|||+||+++|||||||+|++.....
T Consensus 279 ~~lL~~~~~~t~~~~~~i~~vIaHElaHqWFGNlVT~~~W~dLWLnEGFAty~e~~~~~ 337 (875)
T PRK14015 279 KYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSA 337 (875)
T ss_pred ceEecCcccCCHHHHHHHHHHHHHHHHHHHHhCcceecchhhhhhhhHHHHHHHHHHHH
Confidence 45778888888888999999999999999999999999999999999999999754433
No 8
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=99.76 E-value=4.2e-19 Score=141.71 Aligned_cols=78 Identities=29% Similarity=0.350 Sum_probs=53.0
Q ss_pred HhhhhhhhhhccC--------CCccccchhhhcccccccccccccccchhhhHhHhhhhhcccCCCCchhhHHhHHHHHH
Q psy5111 54 FEYFGTRMLYVQE--------IPTPIREKFLLTSKHQCTTADFQRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARL 125 (132)
Q Consensus 54 ~~~~~~~~~~p~~--------~~~~~~e~~il~~~~~al~~D~~~~s~~I~hEvahqWfg~~v~~~~w~~~wl~e~~~~~ 125 (132)
.+.++..|+.|+. ++.+-||+..+....+.+..+......+|+||||||||||+||+.||+++|||||||+|
T Consensus 234 ~e~~~~pYp~~k~d~vvlpp~f~~GgMEN~~ltf~~~~ll~~d~s~~~viaHElAHqWfGNlVT~~~W~d~WLnEGfaty 313 (601)
T TIGR02411 234 AEDLIFPYEWGQYDLLVLPPSFPYGGMENPNLTFATPTLIAGDRSNVDVIAHELAHSWSGNLVTNCSWEHFWLNEGWTVY 313 (601)
T ss_pred HHHhCCCCcCccceEEEecCcccccccccccceeeccccccCChhhhhhHHHHHHhhccCceeecCCchHHHHHhhHHHH
Confidence 3445566665541 22122344333333333333333346799999999999999999999999999999999
Q ss_pred HHHhcc
Q psy5111 126 FEYFGT 131 (132)
Q Consensus 126 ~~~~~~ 131 (132)
++++++
T Consensus 314 ~e~~~~ 319 (601)
T TIGR02411 314 LERRIV 319 (601)
T ss_pred HHHHHH
Confidence 999764
No 9
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=99.75 E-value=5.1e-19 Score=145.67 Aligned_cols=84 Identities=32% Similarity=0.560 Sum_probs=60.1
Q ss_pred HHHHHHHHhhhhhhhhhccCC-----C---ccccchh-hhcccc------cccccccccccchhhhHhHhhhhhcccCCC
Q psy5111 47 NEAFARLFEYFGTRMLYVQEI-----P---TPIREKF-LLTSKH------QCTTADFQRATSVVTHEFTHQWFGDLVTPA 111 (132)
Q Consensus 47 ~Eg~a~y~~~~~~~~~~p~~~-----~---~~~~e~~-il~~~~------~al~~D~~~~s~~I~hEvahqWfg~~v~~~ 111 (132)
.+++..|.++++..|++|+ + | .+-||+. ++.... .....+....+.+|+||+|||||||+||++
T Consensus 229 ~~~l~~~e~~fg~pYP~~k-~d~V~vP~f~~GaMEn~Glit~~e~~l~~~~~~~~~~~~~~~viaHElAHqWFGnlVT~~ 307 (831)
T TIGR02412 229 RQGLAFFHRKFGYPYPFKK-YDQIFVPEFNAGAMENAGCVTFAENFLHRAEATRAEKENRAGVILHEMAHMWFGDLVTMR 307 (831)
T ss_pred HHHHHHHHHHhCCCCCccc-CCEEEcCCCCCCcccccceeeechhhccCCcCCHHHHHHHHHHHHHHHHHHHhCCEeccc
Confidence 5788888999999998887 4 1 0112222 111111 111222334567999999999999999999
Q ss_pred CchhhHHhHHHHHHHHHhcc
Q psy5111 112 SWNFAWLNEAFARLFEYFGT 131 (132)
Q Consensus 112 ~w~~~wl~e~~~~~~~~~~~ 131 (132)
||+++|||||||+|++++++
T Consensus 308 wW~dlWLnEGFAty~e~~~~ 327 (831)
T TIGR02412 308 WWNDLWLNESFAEYMGTLAS 327 (831)
T ss_pred cccchhHHHHHHHHHHHHHH
Confidence 99999999999999999875
No 10
>PF01433 Peptidase_M1: Peptidase family M1 This is family M1 in the peptidase classification.; InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=99.74 E-value=1e-18 Score=132.75 Aligned_cols=86 Identities=40% Similarity=0.697 Sum_probs=59.8
Q ss_pred HHHHHHHHHhhhhhhhhhccC-------CCc-cc--------cchhhhcccccccccccccccchhhhHhHhhhhhcccC
Q psy5111 46 LNEAFARLFEYFGTRMLYVQE-------IPT-PI--------REKFLLTSKHQCTTADFQRATSVVTHEFTHQWFGDLVT 109 (132)
Q Consensus 46 l~Eg~a~y~~~~~~~~~~p~~-------~~~-~~--------~e~~il~~~~~al~~D~~~~s~~I~hEvahqWfg~~v~ 109 (132)
..+.+..|.++++..|++++. ++. ++ .+..++.........+......+|+||+|||||||+||
T Consensus 234 ~~~~l~~~~~~~g~~yp~~k~~~v~~p~~~~~~me~~g~i~~~~~~l~~~~~~~~~~~~~~~~~~iahElahqWfGn~vt 313 (390)
T PF01433_consen 234 APKALEYYEEYFGIPYPFKKLDIVAVPDFPFGGMENWGLITYRESYLLYDPDISTIGDKQEIASLIAHELAHQWFGNLVT 313 (390)
T ss_dssp HHHHHHHHHHHHTS--SSSEEEEEEEST-SSSEE--TTEEEEEGGGTS-STTTS-HHHHHHHHHHHHHHHHTTTBTTTEE
T ss_pred hHHHHHHHHhhccccceecceeEEEEeccccccccccccccccccccccCcccccchhhhhhHHHHHHHHHHHHhccCCc
Confidence 457777777888877766541 110 11 33333333333444455667889999999999999999
Q ss_pred CCCchhhHHhHHHHHHHHHhcc
Q psy5111 110 PASWNFAWLNEAFARLFEYFGT 131 (132)
Q Consensus 110 ~~~w~~~wl~e~~~~~~~~~~~ 131 (132)
+++|+++||+||||+|++++++
T Consensus 314 ~~~w~d~WL~Eg~a~y~~~~~~ 335 (390)
T PF01433_consen 314 PKWWSDLWLNEGFATYLEYLIL 335 (390)
T ss_dssp ESSGGGHHHHHHHHHHHHHHHH
T ss_pred cccchhhhHHHHHHHHHHHHhH
Confidence 9999999999999999999875
No 11
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=99.73 E-value=2e-18 Score=142.01 Aligned_cols=84 Identities=27% Similarity=0.343 Sum_probs=61.1
Q ss_pred HHHHHHHHHHhhhhhhhhhccCC--------Cccccchh---------hhcccccccccccccccchhhhHhHhhhhhcc
Q psy5111 45 WLNEAFARLFEYFGTRMLYVQEI--------PTPIREKF---------LLTSKHQCTTADFQRATSVVTHEFTHQWFGDL 107 (132)
Q Consensus 45 wl~Eg~a~y~~~~~~~~~~p~~~--------~~~~~e~~---------il~~~~~al~~D~~~~s~~I~hEvahqWfg~~ 107 (132)
-+.+.+..|.++++.+|++++ + ..+-||+. ++.........+......+|+||++||||||+
T Consensus 221 ~~~~~L~~~E~~fG~pYPl~k-~diVavpdf~~GaMEN~GLi~f~e~~lL~~~~~~td~~~~~i~~VIaHElaHqWfGNl 299 (863)
T TIGR02414 221 SLKKAMKWDEEVFGLEYDLDI-FMIVAVDDFNMGAMENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNR 299 (863)
T ss_pred HHHHHHHHHHHHhCCCCChhh-ccEEecCCCCCccccccceeccccceEEeCCCCCCHHHHHHHHHHHHHHHHHHHhcce
Confidence 356888889999998888776 4 00112322 22222222333444566899999999999999
Q ss_pred cCCCCchhhHHhHHHHHHHHHh
Q psy5111 108 VTPASWNFAWLNEAFARLFEYF 129 (132)
Q Consensus 108 v~~~~w~~~wl~e~~~~~~~~~ 129 (132)
|||+||+++|||||||+|++.+
T Consensus 300 VT~~~W~~LWLnEGfAty~e~~ 321 (863)
T TIGR02414 300 VTCRDWFQLSLKEGLTVFRDQE 321 (863)
T ss_pred eeecchhhhhhhhhHHHHHHHH
Confidence 9999999999999999999864
No 12
>PRK14015 pepN aminopeptidase N; Provisional
Probab=99.73 E-value=2.4e-18 Score=141.75 Aligned_cols=84 Identities=26% Similarity=0.328 Sum_probs=58.6
Q ss_pred HHHHHHHHHhhhhhhhhhccCC--------Cccccchh---------hhcccccccccccccccchhhhHhHhhhhhccc
Q psy5111 46 LNEAFARLFEYFGTRMLYVQEI--------PTPIREKF---------LLTSKHQCTTADFQRATSVVTHEFTHQWFGDLV 108 (132)
Q Consensus 46 l~Eg~a~y~~~~~~~~~~p~~~--------~~~~~e~~---------il~~~~~al~~D~~~~s~~I~hEvahqWfg~~v 108 (132)
+.+.+..|.+++|.+|++++ + ..+-+|+. ++.........+......+|+||++||||||+|
T Consensus 235 ~~~~L~~~E~~FG~pYP~~k-~diVavp~f~~GaMEN~Gl~~f~~~~lL~~~~~~t~~~~~~i~~vIaHElaHqWFGNlV 313 (875)
T PRK14015 235 LKKSMKWDEERFGLEYDLDI-FMIVAVDDFNMGAMENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNRV 313 (875)
T ss_pred HHHHHHHHHHHhCCCCChhh-hCEEeCCCCCCcccccccccccccceEecCcccCCHHHHHHHHHHHHHHHHHHHHhCcc
Confidence 45788888888888887766 4 00112222 222222222223344567999999999999999
Q ss_pred CCCCchhhHHhHHHHHHHHHhc
Q psy5111 109 TPASWNFAWLNEAFARLFEYFG 130 (132)
Q Consensus 109 ~~~~w~~~wl~e~~~~~~~~~~ 130 (132)
||+||+++|||||||+|++.++
T Consensus 314 T~~~W~dLWLnEGFAty~e~~~ 335 (875)
T PRK14015 314 TCRDWFQLSLKEGLTVFRDQEF 335 (875)
T ss_pred eecchhhhhhhhHHHHHHHHHH
Confidence 9999999999999999997543
No 13
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=99.72 E-value=3.1e-18 Score=136.80 Aligned_cols=47 Identities=38% Similarity=0.725 Sum_probs=43.4
Q ss_pred hHHHHHHHHHHHHhhCCCCchhhhHHHHHHHHHHHHhhhhhhhhhcc
Q psy5111 19 ATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFGTRMLYVQ 65 (132)
Q Consensus 19 ~~~~i~he~ah~w~g~lvt~~~w~~~wl~Eg~a~y~~~~~~~~~~p~ 65 (132)
...+||||+|||||||+||++||+++|||||||+|++.++.+.++|+
T Consensus 279 ~~~viaHElAHqWfGNlVT~~~W~d~WLnEGfaty~e~~~~~~~~~e 325 (601)
T TIGR02411 279 NVDVIAHELAHSWSGNLVTNCSWEHFWLNEGWTVYLERRIVGRLYGE 325 (601)
T ss_pred hhhhHHHHHHhhccCceeecCCchHHHHHhhHHHHHHHHHHHHhcCc
Confidence 35799999999999999999999999999999999999888877665
No 14
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=99.71 E-value=3e-18 Score=141.41 Aligned_cols=86 Identities=35% Similarity=0.640 Sum_probs=66.9
Q ss_pred HHHHHHHHHhhhhhhhhhccC------CCccc---------cchhhhcccccccccccccccchhhhHhHhhhhhcccCC
Q psy5111 46 LNEAFARLFEYFGTRMLYVQE------IPTPI---------REKFLLTSKHQCTTADFQRATSVVTHEFTHQWFGDLVTP 110 (132)
Q Consensus 46 l~Eg~a~y~~~~~~~~~~p~~------~~~~~---------~e~~il~~~~~al~~D~~~~s~~I~hEvahqWfg~~v~~ 110 (132)
+..++..|.++++.+|..|.+ +..|- ++.+++.....+...+..+.+.+|+||++||||||+|||
T Consensus 247 ~~~~~~~~e~~fg~~y~l~~~~V~v~~f~~GaMEN~Gl~tf~~~~ll~~~~~at~~~~~~~~~viaHElaHqWfGnlVT~ 326 (859)
T COG0308 247 TKRSIEFYEEYFGLPYALPIDIVAVPDFSAGAMENWGLVTFREKYLLADPETATDSDYENVEEVIAHELAHQWFGNLVTM 326 (859)
T ss_pred HHHHhhhHHHhcCCCCCCcccEEeccCCCCccccccceeEEeeeEEeeCcccchhHHHHHHHHHHHHHHhhhcccceeec
Confidence 457888888999999988741 11011 444455555556666667777799999999999999999
Q ss_pred CCchhhHHhHHHHHHHHHhcc
Q psy5111 111 ASWNFAWLNEAFARLFEYFGT 131 (132)
Q Consensus 111 ~~w~~~wl~e~~~~~~~~~~~ 131 (132)
+||+++|||||||+|+++.+.
T Consensus 327 ~~W~~lWLnEgfat~~e~~~~ 347 (859)
T COG0308 327 KWWDDLWLNEGFATFREVLWS 347 (859)
T ss_pred cCHHHHHHhhhhHHHHHHHHH
Confidence 999999999999999998764
No 15
>KOG1047|consensus
Probab=99.59 E-value=4e-16 Score=120.68 Aligned_cols=65 Identities=35% Similarity=0.483 Sum_probs=50.4
Q ss_pred CCccccchhhhcccccccccccccccchhhhHhHhhhhhcccCCCCchhhHHhHHHHHHHHHhcc
Q psy5111 67 IPTPIREKFLLTSKHQCTTADFQRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFGT 131 (132)
Q Consensus 67 ~~~~~~e~~il~~~~~al~~D~~~~s~~I~hEvahqWfg~~v~~~~w~~~wl~e~~~~~~~~~~~ 131 (132)
||-+-|++-.+.-..+.+.......+.+|+|||||.||||+||+..|+|.||||||++|++..|+
T Consensus 264 FP~gGMENPcltF~TpTllaGDrsl~~vIaHEIAHSWtGNlVTN~sWehfWLNEGfTvylErrI~ 328 (613)
T KOG1047|consen 264 FPFGGMENPCLTFVTPTLLAGDRSLVDVIAHEIAHSWTGNLVTNASWEHFWLNEGFTVYLERRIV 328 (613)
T ss_pred CCcccccCcceeeecchhhcCCcchhhHHHHHhhhhhcccccccCccchhhhcccchhhhhhhhh
Confidence 44233555544445444444444458999999999999999999999999999999999998774
No 16
>KOG1047|consensus
Probab=99.44 E-value=2.4e-14 Score=110.99 Aligned_cols=49 Identities=41% Similarity=0.731 Sum_probs=43.8
Q ss_pred hhhHHHHHHHHHHHHhhCCCCchhhhHHHHHHHHHHHHhhhhhhhhhcc
Q psy5111 17 QRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFGTRMLYVQ 65 (132)
Q Consensus 17 ~~~~~~i~he~ah~w~g~lvt~~~w~~~wl~Eg~a~y~~~~~~~~~~p~ 65 (132)
...+.+|||||||.|||||||...|.+.||||||+.|.|..++..++++
T Consensus 286 rsl~~vIaHEIAHSWtGNlVTN~sWehfWLNEGfTvylErrI~g~~~g~ 334 (613)
T KOG1047|consen 286 RSLVDVIAHEIAHSWTGNLVTNASWEHFWLNEGFTVYLERRIVGRLYGE 334 (613)
T ss_pred cchhhHHHHHhhhhhcccccccCccchhhhcccchhhhhhhhhhhhcch
Confidence 3458899999999999999999999999999999999998887766654
No 17
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=98.59 E-value=4.5e-08 Score=62.90 Aligned_cols=43 Identities=33% Similarity=0.577 Sum_probs=37.5
Q ss_pred hhhhhhHHHHHHHHHHHHhhCCCCchhhhHHHHHHHHHHHHhh
Q psy5111 14 ADFQRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEY 56 (132)
Q Consensus 14 ~~~~~~~~~i~he~ah~w~g~lvt~~~w~~~wl~Eg~a~y~~~ 56 (132)
.+...+..+++||++|+|+++.++.......|++||+|.|++.
T Consensus 20 ~~~~~~~~~l~HE~~H~~~~~~~~~~~~~~~W~~EG~A~y~~~ 62 (128)
T PF13485_consen 20 SDEDWLDRVLAHELAHQWFGNYFGGDDNAPRWFNEGLAEYVEG 62 (128)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHcCCCccCchHHHHHHHHHHhc
Confidence 4555566899999999999999987778889999999999984
No 18
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=98.50 E-value=7.1e-08 Score=61.97 Aligned_cols=43 Identities=33% Similarity=0.544 Sum_probs=36.4
Q ss_pred ccccccchhhhHhHhhhhhcccCCCCchhhHHhHHHHHHHHHh
Q psy5111 87 DFQRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYF 129 (132)
Q Consensus 87 D~~~~s~~I~hEvahqWfg~~v~~~~w~~~wl~e~~~~~~~~~ 129 (132)
+.....++++||++|+|+++.++.......|++||+|+|++..
T Consensus 21 ~~~~~~~~l~HE~~H~~~~~~~~~~~~~~~W~~EG~A~y~~~~ 63 (128)
T PF13485_consen 21 DEDWLDRVLAHELAHQWFGNYFGGDDNAPRWFNEGLAEYVEGR 63 (128)
T ss_pred CHHHHHHHHHHHHHHHHHHHHcCCCccCchHHHHHHHHHHhcC
Confidence 3334458999999999999999877788899999999999854
No 19
>KOG1932|consensus
Probab=98.39 E-value=1.4e-07 Score=78.62 Aligned_cols=108 Identities=17% Similarity=0.210 Sum_probs=75.7
Q ss_pred HHHHHHHHhhCCCCchhhhHHHHHHHHHHHHhhhhhhhhhccCC-----C---ccccchh-----hhccccccccccccc
Q psy5111 24 THEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFGTRMLYVQEI-----P---TPIREKF-----LLTSKHQCTTADFQR 90 (132)
Q Consensus 24 ~he~ah~w~g~lvt~~~w~~~wl~Eg~a~y~~~~~~~~~~p~~~-----~---~~~~e~~-----il~~~~~al~~D~~~ 90 (132)
+|++.|-....+-.+---.-..|.+.+..|.+.++..|+++- + + ..+.... -...+.+....|...
T Consensus 261 ~~~i~~f~LP~~~~~v~nt~~~l~k~iefye~~ls~rYPF~~-~k~VFvd~~~~~i~~~asl~I~st~lLy~~~iIDq~~ 339 (1180)
T KOG1932|consen 261 MIDITHFCLPGLEPLVKNTTVYLHKAIEFYEEELSSRYPFSC-YKTVFVDEAAVEISSYASLSIFSTSLLYSKNIIDQTF 339 (1180)
T ss_pred cCcceeEecCcchHHhhhHHHHHHHHHHHHHHHhccCCCcce-eeEEEecCCcceeeecceeeeeeccccchHhhhhHHH
Confidence 667777777777666667778899999999999999898875 4 0 0111110 000111112223333
Q ss_pred c-cchhhhHhHhhhhhcccCCCCchhhHHhHHHHHHHHHhccC
Q psy5111 91 A-TSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFGTR 132 (132)
Q Consensus 91 ~-s~~I~hEvahqWfg~~v~~~~w~~~wl~e~~~~~~~~~~~~ 132 (132)
. .+-++--+|.||||.++|+..|+|.||.+|+|.|+..+++|
T Consensus 340 ~tr~~La~aLA~Q~fg~yIsp~~wsD~Wl~~GiagYl~~l~~k 382 (1180)
T KOG1932|consen 340 LTRRKLAWALASQWFGVYISPVDWSDFWLLKGIAGYLTGLFVK 382 (1180)
T ss_pred HHHHHHHHHHHHhhhEEEeeccchhhhHHHHhHHHHHHHHHHH
Confidence 2 44677889999999999999999999999999999988764
No 20
>KOG1932|consensus
Probab=97.69 E-value=4.1e-05 Score=64.44 Aligned_cols=60 Identities=22% Similarity=0.372 Sum_probs=47.5
Q ss_pred ceeccCCCCChhhhhhhHHHHHHHHHHHHhhCCCCchhhhHHHHHHHHHHHHhhhhhhhhhc
Q psy5111 3 FLLTSKGQCTTADFQRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFGTRMLYV 64 (132)
Q Consensus 3 ~~l~~~~~~~~~~~~~~~~~i~he~ah~w~g~lvt~~~w~~~wl~Eg~a~y~~~~~~~~~~p 64 (132)
.+||+.+ ...+-......+|--+|.||||-.+||..|+|.||-+|+|.|+..+.+....+
T Consensus 327 ~lLy~~~--iIDq~~~tr~~La~aLA~Q~fg~yIsp~~wsD~Wl~~GiagYl~~l~~kk~lG 386 (1180)
T KOG1932|consen 327 SLLYSKN--IIDQTFLTRRKLAWALASQWFGVYISPVDWSDFWLLKGIAGYLTGLFVKKFLG 386 (1180)
T ss_pred cccchHh--hhhHHHHHHHHHHHHHHHhhhEEEeeccchhhhHHHHhHHHHHHHHHHHHHhC
Confidence 4666544 23344445667899999999999999999999999999999998777776555
No 21
>PF05299 Peptidase_M61: M61 glycyl aminopeptidase; InterPro: IPR007963 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M61 (glycyl aminopeptidase family, clan MA(E)).The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example is glycyl aminopeptidase from Sphingomonas capsulata.
Probab=97.08 E-value=9.3e-05 Score=48.03 Aligned_cols=43 Identities=26% Similarity=0.479 Sum_probs=33.4
Q ss_pred hhhHHHHHHHHHHHHhhCC-----------CCchhhhHHHHHHHHHHHHhhhhh
Q psy5111 17 QRATSVVTHEFTHQWFGDL-----------VTPASWNFAWLNEAFARLFEYFGT 59 (132)
Q Consensus 17 ~~~~~~i~he~ah~w~g~l-----------vt~~~w~~~wl~Eg~a~y~~~~~~ 59 (132)
++...++|||+.|.|-+-. -.+.--+.+|+.||+++|++.+..
T Consensus 2 ~~~l~l~sHEffH~WnvkrirP~~l~p~dy~~~~~t~~LWv~EG~T~Y~~~l~l 55 (122)
T PF05299_consen 2 QRFLGLLSHEFFHSWNVKRIRPAELGPFDYEKPNYTELLWVYEGFTSYYGDLLL 55 (122)
T ss_pred cchhhhhhhhccccccceEeccccccCCCCCCCCCCCCEeeeeCcHHHHHHHHH
Confidence 3456789999999998544 445556678999999999986554
No 22
>PF07607 DUF1570: Protein of unknown function (DUF1570); InterPro: IPR011464 This entry represents hypothetical proteins confined to bacteria.
Probab=96.93 E-value=0.00081 Score=44.03 Aligned_cols=39 Identities=33% Similarity=0.514 Sum_probs=28.3
Q ss_pred HHHHHHHHHHHHhhCC--CC-chhhhHHHHHHHHHHHHhhhhh
Q psy5111 20 TSVVTHEFTHQWFGDL--VT-PASWNFAWLNEAFARLFEYFGT 59 (132)
Q Consensus 20 ~~~i~he~ah~w~g~l--vt-~~~w~~~wl~Eg~a~y~~~~~~ 59 (132)
+.+|+||-+||=.-|. .+ ...| -.|+.||||.|++--..
T Consensus 2 ~~T~~HEa~HQl~~N~Gl~~r~~~~-P~Wv~EGlA~yFE~~~~ 43 (128)
T PF07607_consen 2 IATIAHEATHQLAFNTGLHPRLADW-PRWVSEGLATYFETPGM 43 (128)
T ss_pred chHHHHHHHHHHHHHccccccCCCC-chHHHHhHHHHcCCCcc
Confidence 4679999999966543 22 2233 39999999999996554
No 23
>PF05299 Peptidase_M61: M61 glycyl aminopeptidase; InterPro: IPR007963 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M61 (glycyl aminopeptidase family, clan MA(E)).The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example is glycyl aminopeptidase from Sphingomonas capsulata.
Probab=95.65 E-value=0.0034 Score=40.76 Aligned_cols=39 Identities=23% Similarity=0.536 Sum_probs=30.8
Q ss_pred cchhhhHhHhhhhhcccCC-----------CCchhhHHhHHHHHHHHHhc
Q psy5111 92 TSVVTHEFTHQWFGDLVTP-----------ASWNFAWLNEAFARLFEYFG 130 (132)
Q Consensus 92 s~~I~hEvahqWfg~~v~~-----------~~w~~~wl~e~~~~~~~~~~ 130 (132)
...++||.-|.|=+..+.| ..=+.+|+-||+++|++.++
T Consensus 5 l~l~sHEffH~WnvkrirP~~l~p~dy~~~~~t~~LWv~EG~T~Y~~~l~ 54 (122)
T PF05299_consen 5 LGLLSHEFFHSWNVKRIRPAELGPFDYEKPNYTELLWVYEGFTSYYGDLL 54 (122)
T ss_pred hhhhhhhccccccceEeccccccCCCCCCCCCCCCEeeeeCcHHHHHHHH
Confidence 4578999999999766654 33446899999999998765
No 24
>PF10460 Peptidase_M30: Peptidase M30; InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue.
Probab=95.59 E-value=0.015 Score=44.44 Aligned_cols=49 Identities=22% Similarity=0.239 Sum_probs=37.5
Q ss_pred hhhhhHHHHHHHHHHH--HhhCCCCch--hhhHHHHHHHHHHHHhhhhhhhhh
Q psy5111 15 DFQRATSVVTHEFTHQ--WFGDLVTPA--SWNFAWLNEAFARLFEYFGTRMLY 63 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~--w~g~lvt~~--~w~~~wl~Eg~a~y~~~~~~~~~~ 63 (132)
+...+..++|||+-|+ +.-+.|... .=.+.||+|+++...+.+....+.
T Consensus 135 ~~~~~~sTlAHEfQHmInfy~~~v~~g~~~~~dtWLnE~lS~~aEdl~s~~~~ 187 (366)
T PF10460_consen 135 GPDTVYSTLAHEFQHMINFYQRGVLHGKQYAMDTWLNEMLSMSAEDLYSSKID 187 (366)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccHHHHHHHHHHHHHHhcCCC
Confidence 3566789999999998 555555443 347999999999999987766553
No 25
>COG3975 Predicted protease with the C-terminal PDZ domain [General function prediction only]
Probab=95.48 E-value=0.0078 Score=47.73 Aligned_cols=55 Identities=24% Similarity=0.536 Sum_probs=40.8
Q ss_pred eeccCCCCChhh-hhhhHHHHHHHHHHHHhhCCCCchhh-----------hHHHHHHHHHHHHhhhh
Q psy5111 4 LLTSKGQCTTAD-FQRATSVVTHEFTHQWFGDLVTPASW-----------NFAWLNEAFARLFEYFG 58 (132)
Q Consensus 4 ~l~~~~~~~~~~-~~~~~~~i~he~ah~w~g~lvt~~~w-----------~~~wl~Eg~a~y~~~~~ 58 (132)
+.++.-.-++.+ .+....+++||+.|-|-+-.+-|.-- .-+|+.|||++|.....
T Consensus 231 l~~~r~~~~~~~ky~~~l~llsHEyfH~WNvKrIrpa~l~p~~~d~en~t~~lW~~EG~T~Yy~~ll 297 (558)
T COG3975 231 LIYDRFGFTDQDKYQDLLGLLSHEYFHAWNVKRIRPAALEPFNLDKENYTPLLWFSEGFTSYYDRLL 297 (558)
T ss_pred cccccccccchhHHHHHHHHHHHHHHHhccceeccccccCCccccccCCCcceeeecCchHHHHHHH
Confidence 445544444445 48889999999999999877765543 35999999999987544
No 26
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=93.75 E-value=0.063 Score=39.49 Aligned_cols=42 Identities=33% Similarity=0.283 Sum_probs=33.7
Q ss_pred hhhhhHHHHHHHHHHHHhhCCCCchhhhHHHHHHHHHHHHhhhhhhhh
Q psy5111 15 DFQRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFGTRML 62 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~w~g~lvt~~~w~~~wl~Eg~a~y~~~~~~~~~ 62 (132)
+...++.+|-||+|||=+- --+|.-.||+|+++.+..|+...
T Consensus 193 dd~~lA~LIFHELAHQk~Y------~~~DtAFNEsFAtaVEt~Gvr~W 234 (376)
T COG4324 193 DDTYLASLIFHELAHQKIY------VNNDTAFNESFATAVETSGVRKW 234 (376)
T ss_pred ChHHHHHHHHHHHhhheEe------ecCcchHhHHHHHHHHHHhHHHH
Confidence 3457899999999999654 23577789999999998887763
No 27
>PF10023 DUF2265: Predicted aminopeptidase (DUF2265); InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=93.47 E-value=0.062 Score=40.71 Aligned_cols=41 Identities=27% Similarity=0.244 Sum_probs=31.7
Q ss_pred hhhhHHHHHHHHHHHHhhCCCCchhhhHHHHHHHHHHHHhhhhhhhh
Q psy5111 16 FQRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFGTRML 62 (132)
Q Consensus 16 ~~~~~~~i~he~ah~w~g~lvt~~~w~~~wl~Eg~a~y~~~~~~~~~ 62 (132)
...++.+|-||+|||=+. ==+|.=+||+||++++..|+...
T Consensus 162 ~~~LA~LIfHELaHq~~Y------v~~dt~FNEsfAtfVe~~G~~~w 202 (337)
T PF10023_consen 162 DGELARLIFHELAHQTLY------VKGDTAFNESFATFVEREGARRW 202 (337)
T ss_pred chHHHHHHHHHHhhceee------cCCCchhhHHHHHHHHHHHHHHH
Confidence 357899999999999432 11377789999999998887653
No 28
>PF10026 DUF2268: Predicted Zn-dependent protease (DUF2268); InterPro: IPR018728 This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function.
Probab=93.15 E-value=0.26 Score=34.40 Aligned_cols=47 Identities=13% Similarity=0.213 Sum_probs=32.4
Q ss_pred hhhhhhhHHHHHHHHHHHH-hhCCC---CchhhhHHHHHHHHHHHHhhhhh
Q psy5111 13 TADFQRATSVVTHEFTHQW-FGDLV---TPASWNFAWLNEAFARLFEYFGT 59 (132)
Q Consensus 13 ~~~~~~~~~~i~he~ah~w-~g~lv---t~~~w~~~wl~Eg~a~y~~~~~~ 59 (132)
.....++..+||||+.|.+ +.... .-.---+..+.||+|.++.....
T Consensus 59 ~~~~~~l~~~iaHE~hH~~r~~~~~~~~~~~TLld~~I~EGlAe~f~~~~~ 109 (195)
T PF10026_consen 59 DYSLEELPALIAHEYHHNCRYEQIGWDPEDTTLLDSLIMEGLAEYFAEELY 109 (195)
T ss_pred cccHHHHHHHHHHHHHHHHHHhccCCCCCCCCHHHHHHHhhHHHHHHHHHc
Confidence 3445588999999999995 33332 12234477889999999875443
No 29
>PF10460 Peptidase_M30: Peptidase M30; InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue.
Probab=92.68 E-value=0.11 Score=39.88 Aligned_cols=41 Identities=27% Similarity=0.309 Sum_probs=29.6
Q ss_pred cccchhhhHhHhh--hhhcccCCC--CchhhHHhHHHHHHHHHhc
Q psy5111 90 RATSVVTHEFTHQ--WFGDLVTPA--SWNFAWLNEAFARLFEYFG 130 (132)
Q Consensus 90 ~~s~~I~hEvahq--Wfg~~v~~~--~w~~~wl~e~~~~~~~~~~ 130 (132)
....+++||..|. +--+.|... .=.++|||||+|.-++.++
T Consensus 138 ~~~sTlAHEfQHmInfy~~~v~~g~~~~~dtWLnE~lS~~aEdl~ 182 (366)
T PF10460_consen 138 TVYSTLAHEFQHMINFYQRGVLHGKQYAMDTWLNEMLSMSAEDLY 182 (366)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCcccccHHHHHHHHHHHHHH
Confidence 3456899999998 333444443 3468999999999888764
No 30
>PF04450 BSP: Peptidase of plants and bacteria; InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=92.12 E-value=0.15 Score=36.09 Aligned_cols=36 Identities=25% Similarity=0.232 Sum_probs=27.2
Q ss_pred cccchhhhHhHhhhhhcccC-CCCchhhHHhHHHHHHHHHh
Q psy5111 90 RATSVVTHEFTHQWFGDLVT-PASWNFAWLNEAFARLFEYF 129 (132)
Q Consensus 90 ~~s~~I~hEvahqWfg~~v~-~~~w~~~wl~e~~~~~~~~~ 129 (132)
...-+|.|||+|-|-.+--+ +.+ ||-||+|.|+...
T Consensus 95 Ei~Gvl~HE~~H~~Q~~~~~~~P~----~liEGIADyVRl~ 131 (205)
T PF04450_consen 95 EIIGVLYHEMVHCWQWDGRGTAPG----GLIEGIADYVRLK 131 (205)
T ss_pred HHHHHHHHHHHHHhhcCCCCCCCh----hheecHHHHHHHH
Confidence 34568999999988765433 333 7999999999875
No 31
>PF10023 DUF2265: Predicted aminopeptidase (DUF2265); InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=90.60 E-value=0.13 Score=38.95 Aligned_cols=41 Identities=27% Similarity=0.250 Sum_probs=30.0
Q ss_pred cccccccccchhhhHhHhhhhhcccCCCCchhhHHhHHHHHHHHHhc
Q psy5111 84 TTADFQRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFG 130 (132)
Q Consensus 84 l~~D~~~~s~~I~hEvahqWfg~~v~~~~w~~~wl~e~~~~~~~~~~ 130 (132)
+..+....+..|=||++||=+. |. +|+=.||+||+|++...
T Consensus 158 l~~~~~~LA~LIfHELaHq~~Y--v~----~dt~FNEsfAtfVe~~G 198 (337)
T PF10023_consen 158 LRYPDGELARLIFHELAHQTLY--VK----GDTAFNESFATFVEREG 198 (337)
T ss_pred cCCCchHHHHHHHHHHhhceee--cC----CCchhhHHHHHHHHHHH
Confidence 3456677889999999999222 21 35678999999998654
No 32
>PF04450 BSP: Peptidase of plants and bacteria; InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=89.41 E-value=0.44 Score=33.74 Aligned_cols=42 Identities=21% Similarity=0.096 Sum_probs=31.5
Q ss_pred hhhhhhHHHHHHHHHHHHhhCCCCchhhhHHHHHHHHHHHHhhhh
Q psy5111 14 ADFQRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFG 58 (132)
Q Consensus 14 ~~~~~~~~~i~he~ah~w~g~lvt~~~w~~~wl~Eg~a~y~~~~~ 58 (132)
..+..+..++.||++|-|-.+--. =.--||-||+|.|..+..
T Consensus 91 ~~~~Ei~Gvl~HE~~H~~Q~~~~~---~~P~~liEGIADyVRl~a 132 (205)
T PF04450_consen 91 DVRDEIIGVLYHEMVHCWQWDGRG---TAPGGLIEGIADYVRLKA 132 (205)
T ss_pred chHHHHHHHHHHHHHHHhhcCCCC---CCChhheecHHHHHHHHc
Confidence 455788999999999987544332 224589999999998763
No 33
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=89.40 E-value=0.18 Score=37.23 Aligned_cols=41 Identities=32% Similarity=0.255 Sum_probs=32.2
Q ss_pred cccccccccchhhhHhHhhhhhcccCCCCchhhHHhHHHHHHHHHhc
Q psy5111 84 TTADFQRATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFG 130 (132)
Q Consensus 84 l~~D~~~~s~~I~hEvahqWfg~~v~~~~w~~~wl~e~~~~~~~~~~ 130 (132)
+..|....++.|=||+|||=|. -.+|.=.||+|+++.+.-.
T Consensus 190 lr~dd~~lA~LIFHELAHQk~Y------~~~DtAFNEsFAtaVEt~G 230 (376)
T COG4324 190 LRQDDTYLASLIFHELAHQKIY------VNNDTAFNESFATAVETSG 230 (376)
T ss_pred hcCChHHHHHHHHHHHhhheEe------ecCcchHhHHHHHHHHHHh
Confidence 4667888899999999999554 2345678999999988643
No 34
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=86.19 E-value=0.38 Score=31.98 Aligned_cols=29 Identities=17% Similarity=0.092 Sum_probs=19.9
Q ss_pred eccCCCCChhhhhhhHHHHHHHHHHHHhh
Q psy5111 5 LTSKGQCTTADFQRATSVVTHEFTHQWFG 33 (132)
Q Consensus 5 l~~~~~~~~~~~~~~~~~i~he~ah~w~g 33 (132)
-+++..........+..+|.||++|.+..
T Consensus 45 ~ln~~l~~~~~~~~l~~~l~HEm~H~~~~ 73 (146)
T smart00731 45 RLNPKLLTENGRDRLRETLLHELCHAALY 73 (146)
T ss_pred EeCHHHHhhccHHHHHhhHHHHHHHHHHH
Confidence 34444434344567788999999999875
No 35
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=85.31 E-value=1.1 Score=27.80 Aligned_cols=31 Identities=13% Similarity=0.214 Sum_probs=19.9
Q ss_pred ceeccCCCCChhhhhhhHHHHHHHHHHHHhhCCCC
Q psy5111 3 FLLTSKGQCTTADFQRATSVVTHEFTHQWFGDLVT 37 (132)
Q Consensus 3 ~~l~~~~~~~~~~~~~~~~~i~he~ah~w~g~lvt 37 (132)
.+++|++. +. .+...+++||++|..++.--.
T Consensus 30 ~I~in~~~-~~---~~~~f~laHELgH~~~~~~~~ 60 (122)
T PF06114_consen 30 IIFINSNL-SP---ERQRFTLAHELGHILLHHGDE 60 (122)
T ss_dssp EEEEESSS--H---HHHHHHHHHHHHHHHHHH-HH
T ss_pred EEEECCCC-CH---HHHHHHHHHHHHHHHhhhccc
Confidence 35666662 22 233668999999999985553
No 36
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=84.97 E-value=0.88 Score=31.72 Aligned_cols=40 Identities=10% Similarity=0.182 Sum_probs=25.2
Q ss_pred eccCCCCChhhhhhhHHHHHHHHHHHHhhCCC-------CchhhhHH
Q psy5111 5 LTSKGQCTTADFQRATSVVTHEFTHQWFGDLV-------TPASWNFA 44 (132)
Q Consensus 5 l~~~~~~~~~~~~~~~~~i~he~ah~w~g~lv-------t~~~w~~~ 44 (132)
+|+++.....+.-.+..+|||||+|-.-..-+ ...||+..
T Consensus 22 ~f~~~~p~~~~yg~lG~ilahel~hafd~~g~~~D~~g~~~~wWs~~ 68 (206)
T PF01431_consen 22 FFDPNYPPALNYGGLGFILAHELMHAFDPEGINYDEDGNLRNWWSEE 68 (206)
T ss_dssp T--TTS-HHHHHHTHHHHHHHHHHHCTSTTGGGB-TTS-BS--S-HH
T ss_pred cCCCCCCHHHHHHHHHHHHHHHHHHHHHHhHhhcCcccccccccchh
Confidence 56777888888889999999999998633222 23577654
No 37
>PF07607 DUF1570: Protein of unknown function (DUF1570); InterPro: IPR011464 This entry represents hypothetical proteins confined to bacteria.
Probab=84.36 E-value=0.48 Score=31.03 Aligned_cols=14 Identities=36% Similarity=0.702 Sum_probs=12.2
Q ss_pred hHHhHHHHHHHHHh
Q psy5111 116 AWLNEAFARLFEYF 129 (132)
Q Consensus 116 ~wl~e~~~~~~~~~ 129 (132)
.|+.||||.|++..
T Consensus 28 ~Wv~EGlA~yFE~~ 41 (128)
T PF07607_consen 28 RWVSEGLATYFETP 41 (128)
T ss_pred hHHHHhHHHHcCCC
Confidence 69999999999754
No 38
>PRK04860 hypothetical protein; Provisional
Probab=83.72 E-value=0.48 Score=32.22 Aligned_cols=27 Identities=19% Similarity=0.102 Sum_probs=19.7
Q ss_pred ccCCCCChhhhhhhHHHHHHHHHHHHh
Q psy5111 6 TSKGQCTTADFQRATSVVTHEFTHQWF 32 (132)
Q Consensus 6 ~~~~~~~~~~~~~~~~~i~he~ah~w~ 32 (132)
+||..-.......+..+|.||+||-+-
T Consensus 50 ~Np~ll~~~~~~~l~~~v~HEl~H~~~ 76 (160)
T PRK04860 50 LNPVLLLENQQAFIDEVVPHELAHLLV 76 (160)
T ss_pred eCHHHHhhCcHHHHHhHHHHHHHHHHH
Confidence 444444445678889999999999864
No 39
>COG3975 Predicted protease with the C-terminal PDZ domain [General function prediction only]
Probab=83.59 E-value=0.48 Score=37.98 Aligned_cols=42 Identities=24% Similarity=0.612 Sum_probs=32.4
Q ss_pred ccccchhhhHhHhhhhhcccCCCCch-----------hhHHhHHHHHHHHHhc
Q psy5111 89 QRATSVVTHEFTHQWFGDLVTPASWN-----------FAWLNEAFARLFEYFG 130 (132)
Q Consensus 89 ~~~s~~I~hEvahqWfg~~v~~~~w~-----------~~wl~e~~~~~~~~~~ 130 (132)
+..-..++||..|-|=+..+.+..-. -+|+.|||++|+.-++
T Consensus 245 ~~~l~llsHEyfH~WNvKrIrpa~l~p~~~d~en~t~~lW~~EG~T~Yy~~ll 297 (558)
T COG3975 245 QDLLGLLSHEYFHAWNVKRIRPAALEPFNLDKENYTPLLWFSEGFTSYYDRLL 297 (558)
T ss_pred HHHHHHHHHHHHHhccceeccccccCCccccccCCCcceeeecCchHHHHHHH
Confidence 33356899999999988776655433 3899999999998764
No 40
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=83.05 E-value=0.63 Score=30.99 Aligned_cols=21 Identities=14% Similarity=0.064 Sum_probs=17.4
Q ss_pred hhhhhhhHHHHHHHHHHHHhh
Q psy5111 13 TADFQRATSVVTHEFTHQWFG 33 (132)
Q Consensus 13 ~~~~~~~~~~i~he~ah~w~g 33 (132)
......+..++.||++|.|..
T Consensus 54 ~~~~~~~~~tL~HEm~H~~~~ 74 (157)
T PF10263_consen 54 RNPEEELIDTLLHEMAHAAAY 74 (157)
T ss_pred hhHHHHHHHHHHHHHHHHHhh
Confidence 345678899999999999983
No 41
>PRK04351 hypothetical protein; Provisional
Probab=82.82 E-value=0.46 Score=31.94 Aligned_cols=24 Identities=13% Similarity=0.263 Sum_probs=17.8
Q ss_pred cCCCCChhhhhhhHHHHHHHHHHH
Q psy5111 7 SKGQCTTADFQRATSVVTHEFTHQ 30 (132)
Q Consensus 7 ~~~~~~~~~~~~~~~~i~he~ah~ 30 (132)
+|..-.......+..+|+||++|=
T Consensus 49 np~ll~~~~~~~l~~vv~HElcH~ 72 (149)
T PRK04351 49 NPKMLEEYGLEELIGIIKHELCHY 72 (149)
T ss_pred CHHHHhhccHHHHHhhHHHHHHHH
Confidence 344444455788999999999995
No 42
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=81.60 E-value=0.37 Score=39.00 Aligned_cols=27 Identities=19% Similarity=0.347 Sum_probs=25.3
Q ss_pred eccCCCCChhhhhhhHHHHHHHHHHHH
Q psy5111 5 LTSKGQCTTADFQRATSVVTHEFTHQW 31 (132)
Q Consensus 5 l~~~~~~~~~~~~~~~~~i~he~ah~w 31 (132)
+|+++.++..++=.|..+|+|||.|.+
T Consensus 473 Ffd~ea~~a~NYGgIGaVIgHEI~HgF 499 (654)
T COG3590 473 FFDPEADSAANYGGIGAVIGHEIGHGF 499 (654)
T ss_pred CCCCCcchhhcccCccceehhhhcccc
Confidence 689999999999999999999999985
No 43
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=81.56 E-value=0.8 Score=34.57 Aligned_cols=17 Identities=24% Similarity=0.247 Sum_probs=13.3
Q ss_pred hhhhhHHHHHHHHHHHH
Q psy5111 15 DFQRATSVVTHEFTHQW 31 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~w 31 (132)
-.-....+++||+|||=
T Consensus 192 p~~~~P~T~~HElAHq~ 208 (318)
T PF12725_consen 192 PPYSLPFTICHELAHQL 208 (318)
T ss_pred CcccccHHHHHHHHHHh
Confidence 34456789999999994
No 44
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=80.69 E-value=1.2 Score=31.20 Aligned_cols=25 Identities=16% Similarity=0.248 Sum_probs=19.6
Q ss_pred hhhhhhHHHHHHHHHHHHhhCCCCc
Q psy5111 14 ADFQRATSVVTHEFTHQWFGDLVTP 38 (132)
Q Consensus 14 ~~~~~~~~~i~he~ah~w~g~lvt~ 38 (132)
.+...++.++|||++|---++....
T Consensus 84 ~~~~el~aVlaHElgH~~~~h~~~~ 108 (226)
T PF01435_consen 84 LSEDELAAVLAHELGHIKHRHILKS 108 (226)
T ss_dssp SSHHHHHHHHHHHHHHHHTTHCCCC
T ss_pred ccHHHHHHHHHHHHHHHHcCCcchH
Confidence 4566889999999999987765544
No 45
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=80.39 E-value=1.3 Score=26.44 Aligned_cols=12 Identities=25% Similarity=0.534 Sum_probs=9.6
Q ss_pred HHHHHHHHHHHH
Q psy5111 20 TSVVTHEFTHQW 31 (132)
Q Consensus 20 ~~~i~he~ah~w 31 (132)
..+++||++|=+
T Consensus 62 ~~llaHEl~Hv~ 73 (79)
T PF13699_consen 62 RALLAHELAHVV 73 (79)
T ss_pred chhHhHHHHHHH
Confidence 567999999953
No 46
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=79.73 E-value=1.1 Score=28.72 Aligned_cols=17 Identities=24% Similarity=0.636 Sum_probs=13.2
Q ss_pred hhhhhHHHHHHHHHHHH
Q psy5111 15 DFQRATSVVTHEFTHQW 31 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~w 31 (132)
.......+++||++|-|
T Consensus 75 ~~KGC~~TL~HEL~H~W 91 (141)
T PHA02456 75 ANKGCRDTLAHELNHAW 91 (141)
T ss_pred cccchHHHHHHHHHHHH
Confidence 34455677999999999
No 47
>KOG2661|consensus
Probab=78.21 E-value=1.8 Score=33.10 Aligned_cols=22 Identities=23% Similarity=0.256 Sum_probs=17.5
Q ss_pred ChhhhhhhHHHHHHHHHHHHhh
Q psy5111 12 TTADFQRATSVVTHEFTHQWFG 33 (132)
Q Consensus 12 ~~~~~~~~~~~i~he~ah~w~g 33 (132)
.-.+...++.+++||+|||--+
T Consensus 268 ~ck~ddglAtvLgHE~aHaVar 289 (424)
T KOG2661|consen 268 SCKDDDGLATVLGHEIAHAVAR 289 (424)
T ss_pred cccChHHHHHHHHHHHHHHHHH
Confidence 3345678899999999999655
No 48
>COG5504 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=77.07 E-value=3.3 Score=30.32 Aligned_cols=48 Identities=27% Similarity=0.362 Sum_probs=35.8
Q ss_pred hhhhhhHHHHHHHHHHH-------HhhCCCCchhhhHHHHHHHHHHH--Hhhhhhhhhhc
Q psy5111 14 ADFQRATSVVTHEFTHQ-------WFGDLVTPASWNFAWLNEAFARL--FEYFGTRMLYV 64 (132)
Q Consensus 14 ~~~~~~~~~i~he~ah~-------w~g~lvt~~~w~~~wl~Eg~a~y--~~~~~~~~~~p 64 (132)
.+..++..+||||+=|. |-++-||.- +.-+-||||.+ ++.++.+..-|
T Consensus 135 ~~~~~v~aliaHE~HH~~R~~~i~~~eg~vtLl---e~lV~EGLAE~av~E~~Gr~~l~~ 191 (280)
T COG5504 135 STITSVPALIAHEYHHNCRLRYIDYGEGSVTLL---EALVMEGLAEHAVFELFGRGELGP 191 (280)
T ss_pred CCccchHHHHHHHHHhhheecccccCCCceeHH---HHHHHHHHHHHHHHHHhCccccCC
Confidence 45567888999999997 455555554 77888999999 67777766533
No 49
>PF08325 WLM: WLM domain; InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=76.57 E-value=2.1 Score=29.84 Aligned_cols=24 Identities=17% Similarity=0.281 Sum_probs=20.5
Q ss_pred CChhhhhhhHHHHHHHHHHHHhhC
Q psy5111 11 CTTADFQRATSVVTHEFTHQWFGD 34 (132)
Q Consensus 11 ~~~~~~~~~~~~i~he~ah~w~g~ 34 (132)
..--....+..++-||+||..+|+
T Consensus 74 ~~fl~~~~i~~t~lHELaH~~~~~ 97 (186)
T PF08325_consen 74 GGFLPYETILGTMLHELAHNVHGP 97 (186)
T ss_pred CCEeeHHHHHHHHHHHHHhcccCC
Confidence 555567889999999999999887
No 50
>PRK05457 heat shock protein HtpX; Provisional
Probab=76.51 E-value=1.5 Score=32.61 Aligned_cols=22 Identities=27% Similarity=0.431 Sum_probs=17.5
Q ss_pred hhhhhhHHHHHHHHHHHHhhCC
Q psy5111 14 ADFQRATSVVTHEFTHQWFGDL 35 (132)
Q Consensus 14 ~~~~~~~~~i~he~ah~w~g~l 35 (132)
-+.+.+..++|||++|.=-++.
T Consensus 129 L~~~El~aVlAHElgHi~~~d~ 150 (284)
T PRK05457 129 MSRDEVEAVLAHEISHIANGDM 150 (284)
T ss_pred CCHHHHHHHHHHHHHHHHcCCH
Confidence 3668889999999999865553
No 51
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=76.27 E-value=5.5 Score=28.31 Aligned_cols=41 Identities=17% Similarity=0.340 Sum_probs=29.5
Q ss_pred hHHHHHHHHHHHHhhCCCCchhhhHHHHHHHHHHHHhhhhhhh
Q psy5111 19 ATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFGTRM 61 (132)
Q Consensus 19 ~~~~i~he~ah~w~g~lvt~~~w~~~wl~Eg~a~y~~~~~~~~ 61 (132)
...++|||+.|-|.- +.--.--...+-||++..+.+...+.
T Consensus 93 ~gsiLAHE~mHa~Lr--l~g~~~L~~~vEEGiCqvla~~wL~~ 133 (212)
T PF12315_consen 93 TGSILAHELMHAWLR--LNGFPNLSPEVEEGICQVLAYLWLES 133 (212)
T ss_pred HhhHHHHHHHHHHhc--ccCCCCCChHHHHHHHHHHHHHHHhh
Confidence 467899999999973 22222335678899999998877653
No 52
>PRK03072 heat shock protein HtpX; Provisional
Probab=76.14 E-value=1.9 Score=32.09 Aligned_cols=20 Identities=25% Similarity=0.428 Sum_probs=15.7
Q ss_pred hhhhhHHHHHHHHHHHHhhC
Q psy5111 15 DFQRATSVVTHEFTHQWFGD 34 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~w~g~ 34 (132)
+.+.+..++|||++|-=-++
T Consensus 123 ~~~El~aVlAHElgHi~~~d 142 (288)
T PRK03072 123 NERELRGVLGHELSHVYNRD 142 (288)
T ss_pred CHHHHHHHHHHHHHHHhcCC
Confidence 55778999999999974444
No 53
>PRK03001 M48 family peptidase; Provisional
Probab=75.93 E-value=1.9 Score=31.86 Aligned_cols=20 Identities=25% Similarity=0.340 Sum_probs=15.6
Q ss_pred hhhhhHHHHHHHHHHHHhhC
Q psy5111 15 DFQRATSVVTHEFTHQWFGD 34 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~w~g~ 34 (132)
+.+.+..++|||++|-=-++
T Consensus 120 ~~~El~aVlAHElgHi~~~h 139 (283)
T PRK03001 120 SEREIRGVMAHELAHVKHRD 139 (283)
T ss_pred CHHHHHHHHHHHHHHHhCCC
Confidence 56778999999999974443
No 54
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=75.53 E-value=2 Score=31.57 Aligned_cols=25 Identities=16% Similarity=0.136 Sum_probs=20.0
Q ss_pred hhhhhhhHHHHHHHHHHHHhhCCCC
Q psy5111 13 TADFQRATSVVTHEFTHQWFGDLVT 37 (132)
Q Consensus 13 ~~~~~~~~~~i~he~ah~w~g~lvt 37 (132)
.-+.+.+..+++||++|.=-++.+.
T Consensus 151 ~l~~dEl~aVlaHElgHi~~rd~~~ 175 (302)
T COG0501 151 LLNDDELEAVLAHELGHIKNRHTLV 175 (302)
T ss_pred hCCHHHHHHHHHHHHHHHhcccHHH
Confidence 3456788999999999997776665
No 55
>PRK03982 heat shock protein HtpX; Provisional
Probab=75.46 E-value=2 Score=31.84 Aligned_cols=20 Identities=30% Similarity=0.353 Sum_probs=16.0
Q ss_pred hhhhhHHHHHHHHHHHHhhC
Q psy5111 15 DFQRATSVVTHEFTHQWFGD 34 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~w~g~ 34 (132)
+.+.+..++|||++|-=-++
T Consensus 121 ~~~El~AVlAHElgHi~~~h 140 (288)
T PRK03982 121 NEDELEGVIAHELTHIKNRD 140 (288)
T ss_pred CHHHHHHHHHHHHHHHHcCC
Confidence 56778999999999975443
No 56
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=74.11 E-value=4.7 Score=28.66 Aligned_cols=39 Identities=18% Similarity=0.296 Sum_probs=28.7
Q ss_pred cchhhhHhHhhhhhcccCCCCchhhHHhHHHHHHHHHhccC
Q psy5111 92 TSVVTHEFTHQWFGDLVTPASWNFAWLNEAFARLFEYFGTR 132 (132)
Q Consensus 92 s~~I~hEvahqWfg~~v~~~~w~~~wl~e~~~~~~~~~~~~ 132 (132)
..+++||+.|-|.-. -. .---...+-||++..+++++++
T Consensus 94 gsiLAHE~mHa~Lrl-~g-~~~L~~~vEEGiCqvla~~wL~ 132 (212)
T PF12315_consen 94 GSILAHELMHAWLRL-NG-FPNLSPEVEEGICQVLAYLWLE 132 (212)
T ss_pred hhHHHHHHHHHHhcc-cC-CCCCChHHHHHHHHHHHHHHHh
Confidence 457899999999842 12 2233568999999999998753
No 57
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=74.05 E-value=2.4 Score=29.01 Aligned_cols=13 Identities=31% Similarity=0.503 Sum_probs=11.1
Q ss_pred hHHHHHHHHHHHH
Q psy5111 19 ATSVVTHEFTHQW 31 (132)
Q Consensus 19 ~~~~i~he~ah~w 31 (132)
-..++|||+.||+
T Consensus 111 ~~~~~aHElGH~l 123 (173)
T PF13574_consen 111 GIDTFAHELGHQL 123 (173)
T ss_dssp HHHHHHHHHHHHH
T ss_pred eeeeehhhhHhhc
Confidence 4567999999997
No 58
>PRK04897 heat shock protein HtpX; Provisional
Probab=73.88 E-value=2.3 Score=31.76 Aligned_cols=20 Identities=25% Similarity=0.298 Sum_probs=16.0
Q ss_pred hhhhhHHHHHHHHHHHHhhC
Q psy5111 15 DFQRATSVVTHEFTHQWFGD 34 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~w~g~ 34 (132)
+.+.+..++|||++|-=-++
T Consensus 133 ~~~El~aVlAHElgHi~~~d 152 (298)
T PRK04897 133 NREELEGVIGHEISHIRNYD 152 (298)
T ss_pred CHHHHHHHHHHHHHHHhcCC
Confidence 56778999999999975444
No 59
>PRK01345 heat shock protein HtpX; Provisional
Probab=73.88 E-value=2.3 Score=32.10 Aligned_cols=20 Identities=25% Similarity=0.323 Sum_probs=16.1
Q ss_pred hhhhhHHHHHHHHHHHHhhC
Q psy5111 15 DFQRATSVVTHEFTHQWFGD 34 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~w~g~ 34 (132)
+.+.+..++|||++|-=-++
T Consensus 120 ~~dEL~aVlAHElgHi~~~d 139 (317)
T PRK01345 120 SPEEVAGVMAHELAHVKNRD 139 (317)
T ss_pred CHHHHHHHHHHHHHHHHcCC
Confidence 55788999999999985444
No 60
>PRK02870 heat shock protein HtpX; Provisional
Probab=71.50 E-value=2.8 Score=31.96 Aligned_cols=16 Identities=31% Similarity=0.532 Sum_probs=14.1
Q ss_pred hhhhhHHHHHHHHHHH
Q psy5111 15 DFQRATSVVTHEFTHQ 30 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~ 30 (132)
+.+.+..++|||++|-
T Consensus 169 ~~dEL~aVlAHELgHi 184 (336)
T PRK02870 169 DRDELQAVMAHELSHI 184 (336)
T ss_pred CHHHHHHHHHHHHHHH
Confidence 5678899999999996
No 61
>PRK02391 heat shock protein HtpX; Provisional
Probab=71.01 E-value=2.9 Score=31.20 Aligned_cols=20 Identities=30% Similarity=0.376 Sum_probs=15.8
Q ss_pred hhhhhHHHHHHHHHHHHhhC
Q psy5111 15 DFQRATSVVTHEFTHQWFGD 34 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~w~g~ 34 (132)
+...+..++|||++|-=-++
T Consensus 129 ~~~El~aVlaHElgHi~~~d 148 (296)
T PRK02391 129 DPDELEAVLAHELSHVKNRD 148 (296)
T ss_pred CHHHHHHHHHHHHHHHHcCC
Confidence 45778899999999975444
No 62
>PRK01265 heat shock protein HtpX; Provisional
Probab=68.50 E-value=3.6 Score=31.24 Aligned_cols=19 Identities=21% Similarity=0.237 Sum_probs=15.0
Q ss_pred hhhhhHHHHHHHHHHHHhh
Q psy5111 15 DFQRATSVVTHEFTHQWFG 33 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~w~g 33 (132)
+.+.+..++|||++|-=-+
T Consensus 136 ~~~El~aVlAHElgHik~~ 154 (324)
T PRK01265 136 NRDEIKAVAGHELGHLKHR 154 (324)
T ss_pred CHHHHHHHHHHHHHHHHcc
Confidence 5677899999999996333
No 63
>PF05569 Peptidase_M56: BlaR1 peptidase M56; InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription.
Probab=65.23 E-value=5.9 Score=29.27 Aligned_cols=39 Identities=13% Similarity=0.285 Sum_probs=24.6
Q ss_pred eccCCCCChhhhhhhHHHHHHHHHHH--------HhhCCCCchhhhH
Q psy5111 5 LTSKGQCTTADFQRATSVVTHEFTHQ--------WFGDLVTPASWNF 43 (132)
Q Consensus 5 l~~~~~~~~~~~~~~~~~i~he~ah~--------w~g~lvt~~~w~~ 43 (132)
+..+.....-+.+.+..++.||++|- ++..++.+-.|-+
T Consensus 182 ~lP~~~~~~~~~~el~~il~HEl~Hikr~D~~~~~l~~l~~~l~Wfn 228 (299)
T PF05569_consen 182 VLPESLLEDLSEEELRAILLHELAHIKRRDLLWKLLAELLCALHWFN 228 (299)
T ss_pred EecCccccccCHHHHHHHHHHHHHHHHCCChHHHHHHHHHHHHHHhh
Confidence 34444343445666678899999997 5556665555544
No 64
>PF13203 DUF2201_N: Putative metallopeptidase domain
Probab=63.90 E-value=4.3 Score=29.96 Aligned_cols=38 Identities=11% Similarity=0.029 Sum_probs=29.1
Q ss_pred ceeccCCCCChhhhhhhHHHHHHHHHHHHhhCCCCchh
Q psy5111 3 FLLTSKGQCTTADFQRATSVVTHEFTHQWFGDLVTPAS 40 (132)
Q Consensus 3 ~~l~~~~~~~~~~~~~~~~~i~he~ah~w~g~lvt~~~ 40 (132)
.|.|||..-..-....+..+++||+-|--|+...-..-
T Consensus 44 ~l~~nP~~~~~l~~~~~~~~l~HevlH~~~~H~~r~~~ 81 (292)
T PF13203_consen 44 RLYYNPEFLESLSPEERVGLLLHEVLHCLLRHPWRRGG 81 (292)
T ss_pred EEEECcHHHhcCCHHHHHHHHHHHHHHHHccchhhhcc
Confidence 47889888555556667889999999998887765444
No 65
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=62.97 E-value=2.5 Score=28.49 Aligned_cols=16 Identities=31% Similarity=0.349 Sum_probs=12.1
Q ss_pred hhhhhHHHHHHHHHHH
Q psy5111 15 DFQRATSVVTHEFTHQ 30 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~ 30 (132)
...-+..+|.||+||-
T Consensus 57 ~~~f~~~vV~HELaHl 72 (156)
T COG3091 57 GEDFIEQVVPHELAHL 72 (156)
T ss_pred cHHHHHHHHHHHHHHH
Confidence 4456778899999874
No 66
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=62.83 E-value=5 Score=31.99 Aligned_cols=22 Identities=23% Similarity=0.261 Sum_probs=15.5
Q ss_pred hhhhhHHHHHHHHHHHHhhCCC
Q psy5111 15 DFQRATSVVTHEFTHQWFGDLV 36 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~w~g~lv 36 (132)
+.-.++.|||||++|-==+.++
T Consensus 126 ~esElagViAHEigHv~qrH~a 147 (484)
T COG4783 126 NESELAGVIAHEIGHVAQRHLA 147 (484)
T ss_pred CHHHHHHHHHHHHHHHhhhhHH
Confidence 4456789999999996433443
No 67
>KOG3624|consensus
Probab=60.47 E-value=15 Score=30.57 Aligned_cols=51 Identities=16% Similarity=0.134 Sum_probs=32.8
Q ss_pred ccCCCCChhhhhhhHHHHHHHHHHHHhhCCC-------CchhhhHHHHHHHHHHHHhhh
Q psy5111 6 TSKGQCTTADFQRATSVVTHEFTHQWFGDLV-------TPASWNFAWLNEAFARLFEYF 57 (132)
Q Consensus 6 ~~~~~~~~~~~~~~~~~i~he~ah~w~g~lv-------t~~~w~~~wl~Eg~a~y~~~~ 57 (132)
|+......-+.-.+..+|+||+.|-.=.... ...||+..-..+ |..-.+.+
T Consensus 505 f~~~~P~~~nyg~iG~vigHEl~H~FD~~G~~~D~~Gn~~~ww~~~s~~~-f~~~~qC~ 562 (687)
T KOG3624|consen 505 FDLSYPDYLNYGGIGFVIGHELTHGFDDQGRQYDESGNLRDWWDTESESE-FQERAQCL 562 (687)
T ss_pred CCcccchhhhhHHHHHHHHHHHhhcccccccccCCCcccccCCCHhHHHH-HHHHHHHH
Confidence 4566666677888999999999997622211 223888776654 44443433
No 68
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=60.37 E-value=6.2 Score=27.15 Aligned_cols=15 Identities=20% Similarity=0.273 Sum_probs=12.7
Q ss_pred hhhHHHHHHHHHHHH
Q psy5111 17 QRATSVVTHEFTHQW 31 (132)
Q Consensus 17 ~~~~~~i~he~ah~w 31 (132)
...+.++|||++|+.
T Consensus 129 ~~~a~~~AHElGH~l 143 (194)
T cd04269 129 LLFAVTMAHELGHNL 143 (194)
T ss_pred HHHHHHHHHHHHhhc
Confidence 456889999999986
No 69
>COG3227 LasB Zinc metalloprotease (elastase) [Amino acid transport and metabolism]
Probab=57.82 E-value=7.5 Score=31.09 Aligned_cols=34 Identities=35% Similarity=0.467 Sum_probs=23.6
Q ss_pred chhhhHhHhhhhh---cccCCCCchhhHHhHHHHHHHHH
Q psy5111 93 SVVTHEFTHQWFG---DLVTPASWNFAWLNEAFARLFEY 128 (132)
Q Consensus 93 ~~I~hEvahqWfg---~~v~~~~w~~~wl~e~~~~~~~~ 128 (132)
.+++|||+|.-.. +|+-..-=-. |||+||.-+.-
T Consensus 339 DVvAHElTHGvtq~tA~L~Y~~qsGA--LNEsfSDvfG~ 375 (507)
T COG3227 339 DVVAHELTHGVTQQTAGLIYRGQSGA--LNESFSDVFGT 375 (507)
T ss_pred ceehhhhcchhhhhccCceecCCCCc--hhhHHHHHHHH
Confidence 4789999997544 5554444333 89999987773
No 70
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=57.43 E-value=7 Score=26.83 Aligned_cols=15 Identities=20% Similarity=0.315 Sum_probs=12.2
Q ss_pred hhhHHHHHHHHHHHH
Q psy5111 17 QRATSVVTHEFTHQW 31 (132)
Q Consensus 17 ~~~~~~i~he~ah~w 31 (132)
.....++|||+.|.+
T Consensus 140 ~~~~~~~AHEiGH~l 154 (196)
T PF13688_consen 140 YNGAITFAHEIGHNL 154 (196)
T ss_dssp HHHHHHHHHHHHHHT
T ss_pred CceehhhHHhHHHhc
Confidence 445678999999987
No 71
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=57.31 E-value=7.7 Score=24.86 Aligned_cols=30 Identities=23% Similarity=0.289 Sum_probs=18.4
Q ss_pred cCCCCChhhhhhhHHHHHHHHHHH---HhhCCCC
Q psy5111 7 SKGQCTTADFQRATSVVTHEFTHQ---WFGDLVT 37 (132)
Q Consensus 7 ~~~~~~~~~~~~~~~~i~he~ah~---w~g~lvt 37 (132)
.+.+...++.++ ..+|+||++|- .-|+|=.
T Consensus 69 sekF~rLs~~ek-vKviiHEllHIP~tfSGgLRa 101 (133)
T COG4900 69 SEKFKRLSCAEK-VKVIIHELLHIPATFSGGLRA 101 (133)
T ss_pred hhhcCCCChHHH-HHHHHHHHhcCcccccCcccc
Confidence 344445555555 46799999996 4455543
No 72
>PF09768 Peptidase_M76: Peptidase M76 family; InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=54.28 E-value=11 Score=25.97 Aligned_cols=28 Identities=18% Similarity=0.388 Sum_probs=19.3
Q ss_pred hhhhhhhHHHHHHHHHHHHhhCCCCchhh
Q psy5111 13 TADFQRATSVVTHEFTHQWFGDLVTPASW 41 (132)
Q Consensus 13 ~~~~~~~~~~i~he~ah~w~g~lvt~~~w 41 (132)
..++..+..+++||+.|-| .....--.|
T Consensus 65 ~~~~~~l~~~l~HELIHay-D~cr~kvD~ 92 (173)
T PF09768_consen 65 IRSQGHLEDTLTHELIHAY-DHCRAKVDW 92 (173)
T ss_pred CCCHHHHHHHHHHHHHHHH-HHHhccCCc
Confidence 3467778899999999986 434333344
No 73
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=52.32 E-value=9.5 Score=26.93 Aligned_cols=14 Identities=29% Similarity=0.401 Sum_probs=11.6
Q ss_pred hhHHHHHHHHHHHH
Q psy5111 18 RATSVVTHEFTHQW 31 (132)
Q Consensus 18 ~~~~~i~he~ah~w 31 (132)
..+.++|||++|..
T Consensus 144 ~~~~~~AHElGH~l 157 (220)
T cd04272 144 YGVYTMTHELAHLL 157 (220)
T ss_pred ccHHHHHHHHHHHh
Confidence 34789999999985
No 74
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=50.07 E-value=8.4 Score=26.41 Aligned_cols=15 Identities=20% Similarity=0.313 Sum_probs=12.0
Q ss_pred hhhHHHHHHHHHHHH
Q psy5111 17 QRATSVVTHEFTHQW 31 (132)
Q Consensus 17 ~~~~~~i~he~ah~w 31 (132)
...+.++|||+.|..
T Consensus 131 ~~~~~~~aHElGH~l 145 (192)
T cd04267 131 LLTALTMAHELGHNL 145 (192)
T ss_pred eeehhhhhhhHHhhc
Confidence 456789999999864
No 75
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=49.56 E-value=11 Score=27.26 Aligned_cols=13 Identities=31% Similarity=0.611 Sum_probs=11.5
Q ss_pred hHHHHHHHHHHHH
Q psy5111 19 ATSVVTHEFTHQW 31 (132)
Q Consensus 19 ~~~~i~he~ah~w 31 (132)
++.++|||+.|.+
T Consensus 167 ~a~t~AHElGHnl 179 (244)
T cd04270 167 SDLVTAHELGHNF 179 (244)
T ss_pred HHHHHHHHHHHhc
Confidence 5689999999997
No 76
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=49.22 E-value=12 Score=25.90 Aligned_cols=19 Identities=26% Similarity=0.254 Sum_probs=14.4
Q ss_pred hhhhHHHHHHHHHHHHhhC
Q psy5111 16 FQRATSVVTHEFTHQWFGD 34 (132)
Q Consensus 16 ~~~~~~~i~he~ah~w~g~ 34 (132)
..-+..+|+||+||--..|
T Consensus 161 ~~~idYVvvHEL~Hl~~~n 179 (205)
T PF01863_consen 161 PEVIDYVVVHELCHLRHPN 179 (205)
T ss_pred ccHHHHHHHHHHHHhccCC
Confidence 3456788999999986544
No 77
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=48.24 E-value=13 Score=25.67 Aligned_cols=16 Identities=19% Similarity=0.310 Sum_probs=12.9
Q ss_pred hhhhhHHHHHHHHHHH
Q psy5111 15 DFQRATSVVTHEFTHQ 30 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~ 30 (132)
.....+.++|||++|.
T Consensus 127 ~~~~~a~~~AHelGH~ 142 (199)
T PF01421_consen 127 SGLSFAVIIAHELGHN 142 (199)
T ss_dssp SHHHHHHHHHHHHHHH
T ss_pred hhHHHHHHHHHHHHHh
Confidence 4556788999999995
No 78
>KOG3314|consensus
Probab=46.82 E-value=19 Score=24.81 Aligned_cols=26 Identities=19% Similarity=0.148 Sum_probs=18.0
Q ss_pred hhhhhhHHHHHHHHHHHHhhCCCCch
Q psy5111 14 ADFQRATSVVTHEFTHQWFGDLVTPA 39 (132)
Q Consensus 14 ~~~~~~~~~i~he~ah~w~g~lvt~~ 39 (132)
-.|..+..+|+||+.|.+=--..-.+
T Consensus 86 ~~q~h~n~vv~HElIH~fDd~r~~vD 111 (194)
T KOG3314|consen 86 TIQDHVNQVVIHELIHAFDDCRAKVD 111 (194)
T ss_pred chHHHHHHHHHHHHHHHHHhhhhhcc
Confidence 35677889999999998533333333
No 79
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=45.72 E-value=14 Score=27.62 Aligned_cols=14 Identities=29% Similarity=0.522 Sum_probs=11.7
Q ss_pred hhHHHHHHHHHHHH
Q psy5111 18 RATSVVTHEFTHQW 31 (132)
Q Consensus 18 ~~~~~i~he~ah~w 31 (132)
..+.+||||+.|+-
T Consensus 169 a~ayVlAHEyGHHV 182 (292)
T PF04228_consen 169 AQAYVLAHEYGHHV 182 (292)
T ss_pred HHHHHHHHHHHHHH
Confidence 45678999999985
No 80
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=42.19 E-value=24 Score=21.88 Aligned_cols=17 Identities=18% Similarity=0.325 Sum_probs=13.3
Q ss_pred hhhhhHHHHHHHHHHHH
Q psy5111 15 DFQRATSVVTHEFTHQW 31 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~w 31 (132)
-...+..++.||+||..
T Consensus 69 L~~~I~~tlvhEiah~f 85 (97)
T PF06262_consen 69 LAELIRDTLVHEIAHHF 85 (97)
T ss_dssp HHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHHHc
Confidence 45677888999999874
No 81
>PF14521 Aspzincin_M35: Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=41.06 E-value=20 Score=23.89 Aligned_cols=13 Identities=23% Similarity=0.578 Sum_probs=10.1
Q ss_pred hhhHHHHHHHHHH
Q psy5111 17 QRATSVVTHEFTH 29 (132)
Q Consensus 17 ~~~~~~i~he~ah 29 (132)
..-+.+|.||++|
T Consensus 94 ~Sk~~TLiHE~SH 106 (148)
T PF14521_consen 94 DSKEGTLIHEWSH 106 (148)
T ss_dssp T-HHHHHHHHHHH
T ss_pred CchHHHHHHhhhh
Confidence 4447789999999
No 82
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=40.86 E-value=14 Score=24.83 Aligned_cols=11 Identities=55% Similarity=0.779 Sum_probs=9.2
Q ss_pred cchhhhHhHhh
Q psy5111 92 TSVVTHEFTHQ 102 (132)
Q Consensus 92 s~~I~hEvahq 102 (132)
..+|+|||+|-
T Consensus 136 lDVvaHEltHG 146 (150)
T PF01447_consen 136 LDVVAHELTHG 146 (150)
T ss_dssp HHHHHHHHHHH
T ss_pred cceeeeccccc
Confidence 44899999996
No 83
>PF10463 Peptidase_U49: Peptidase U49; InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death.
Probab=40.49 E-value=70 Score=22.76 Aligned_cols=20 Identities=20% Similarity=0.285 Sum_probs=17.4
Q ss_pred hhHHHHHHHHHHHHhhCCCC
Q psy5111 18 RATSVVTHEFTHQWFGDLVT 37 (132)
Q Consensus 18 ~~~~~i~he~ah~w~g~lvt 37 (132)
.++.++.||++|-=+|....
T Consensus 100 A~~fil~HE~~Hv~~~h~~~ 119 (206)
T PF10463_consen 100 AIAFILLHELAHVVLGHEGD 119 (206)
T ss_pred HHHHHHHHHHHHHHHcCccc
Confidence 45688999999999998877
No 84
>PF14891 Peptidase_M91: Effector protein
Probab=40.03 E-value=21 Score=24.32 Aligned_cols=14 Identities=29% Similarity=0.487 Sum_probs=11.5
Q ss_pred hhHHHHHHHHHHHH
Q psy5111 18 RATSVVTHEFTHQW 31 (132)
Q Consensus 18 ~~~~~i~he~ah~w 31 (132)
.-..+++||++|=|
T Consensus 102 ~p~v~L~HEL~HA~ 115 (174)
T PF14891_consen 102 PPFVVLYHELIHAY 115 (174)
T ss_pred HHHHHHHHHHHHHH
Confidence 44678999999987
No 85
>PF08653 DASH_Dam1: DASH complex subunit Dam1; InterPro: IPR013962 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ]. Components of the DASH complex, including Dam1, Duo1, Spc34, Dad1 and Ask1, are essential and connect the centromere to the plus end of spindle microtubules [].
Probab=39.27 E-value=27 Score=19.60 Aligned_cols=12 Identities=33% Similarity=0.528 Sum_probs=9.7
Q ss_pred HhHHHHHHHHHh
Q psy5111 118 LNEAFARLFEYF 129 (132)
Q Consensus 118 l~e~~~~~~~~~ 129 (132)
.||+||+|+-.+
T Consensus 34 FNESFasfLYGl 45 (58)
T PF08653_consen 34 FNESFASFLYGL 45 (58)
T ss_pred HHHHHHHHHHHH
Confidence 699999998654
No 86
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=39.23 E-value=29 Score=24.90 Aligned_cols=27 Identities=22% Similarity=0.243 Sum_probs=20.1
Q ss_pred hhhhhHHHHHHHHHHHHhhCCCCchhhh
Q psy5111 15 DFQRATSVVTHEFTHQWFGDLVTPASWN 42 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~w~g~lvt~~~w~ 42 (132)
...-+..+|+||+||-=..| =+++.|+
T Consensus 171 p~~~i~YVvvHELaHLke~n-Hs~~Fw~ 197 (223)
T COG1451 171 PEEVIDYVVVHELAHLKEKN-HSKRFWR 197 (223)
T ss_pred CHHHHHHHHHHHHHHHhhhh-ccHHHHH
Confidence 34556788999999998888 5555554
No 87
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=39.18 E-value=25 Score=25.11 Aligned_cols=40 Identities=23% Similarity=0.239 Sum_probs=24.8
Q ss_pred hhhhHHHHHHHHHHHHhhCCC------CchhhhHHHHHHHHHHHHhh
Q psy5111 16 FQRATSVVTHEFTHQWFGDLV------TPASWNFAWLNEAFARLFEY 56 (132)
Q Consensus 16 ~~~~~~~i~he~ah~w~g~lv------t~~~w~~~wl~Eg~a~y~~~ 56 (132)
..+...++|||+.|=++..-. .+. +..-=.-|..|.++..
T Consensus 69 ~~r~rFtlAHELGH~llH~~~~~~~~~~~~-~~~~~~~E~~AN~FAa 114 (213)
T COG2856 69 LERKRFTLAHELGHALLHTDLNTRFDAEPT-LQQDRKIEAEANAFAA 114 (213)
T ss_pred HHHHHHHHHHHHhHHHhccccchhhhcccc-cchhHHHHHHHHHHHH
Confidence 344467899999999998442 222 2223334777777654
No 88
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=37.23 E-value=22 Score=28.73 Aligned_cols=25 Identities=24% Similarity=0.255 Sum_probs=13.6
Q ss_pred ccCCCCChhhhhhhHHHHHHHHHHH
Q psy5111 6 TSKGQCTTADFQRATSVVTHEFTHQ 30 (132)
Q Consensus 6 ~~~~~~~~~~~~~~~~~i~he~ah~ 30 (132)
++|..-+....+....+++||++|-
T Consensus 197 ~~p~~i~~~~~~~~~~~~~HEi~Ha 221 (521)
T PF01457_consen 197 INPSYIPSFYFQEFFRTVIHEIAHA 221 (521)
T ss_dssp --GGG---S--HHHHHHHHHHHHHH
T ss_pred EchhHccchhhhcccceeeeeeeee
Confidence 3444333324566788999999996
No 89
>PF13265 DUF4056: Protein of unknown function (DUF4056)
Probab=36.81 E-value=16 Score=26.85 Aligned_cols=10 Identities=50% Similarity=1.159 Sum_probs=7.6
Q ss_pred HHHHHHHHHhh
Q psy5111 23 VTHEFTHQWFG 33 (132)
Q Consensus 23 i~he~ah~w~g 33 (132)
..|||| ||||
T Consensus 152 ~WHEIA-QWyG 161 (270)
T PF13265_consen 152 QWHEIA-QWYG 161 (270)
T ss_pred HHHHHH-HHcC
Confidence 348887 8998
No 90
>PF08219 TOM13: Outer membrane protein TOM13; InterPro: IPR013262 The TOM13 family of proteins are mitochondrial outer membrane proteins that mediate the assembly of beta-barrel proteins [].; GO: 0005741 mitochondrial outer membrane
Probab=36.01 E-value=27 Score=20.69 Aligned_cols=13 Identities=15% Similarity=0.289 Sum_probs=10.3
Q ss_pred hHHHHHHHHHHHH
Q psy5111 19 ATSVVTHEFTHQW 31 (132)
Q Consensus 19 ~~~~i~he~ah~w 31 (132)
..+++|||++..+
T Consensus 52 FGEl~AhE~~fr~ 64 (77)
T PF08219_consen 52 FGELFAHEIAFRL 64 (77)
T ss_pred HHHHHHHHHHHHh
Confidence 3478999999874
No 91
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=35.68 E-value=19 Score=23.45 Aligned_cols=15 Identities=33% Similarity=0.660 Sum_probs=11.8
Q ss_pred hhhHHHHHHHHHHHH
Q psy5111 17 QRATSVVTHEFTHQW 31 (132)
Q Consensus 17 ~~~~~~i~he~ah~w 31 (132)
..++-++.||+.|++
T Consensus 107 d~vthvliHEIgHhF 121 (136)
T COG3824 107 DQVTHVLIHEIGHHF 121 (136)
T ss_pred hHhhhhhhhhhhhhc
Confidence 456678899999985
No 92
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=34.21 E-value=15 Score=26.31 Aligned_cols=11 Identities=36% Similarity=0.709 Sum_probs=9.2
Q ss_pred HHHHHHHHHHH
Q psy5111 21 SVVTHEFTHQW 31 (132)
Q Consensus 21 ~~i~he~ah~w 31 (132)
.++|||+.|.-
T Consensus 147 ~t~AHElGHnL 157 (228)
T cd04271 147 QVFAHEIGHTF 157 (228)
T ss_pred eehhhhhhhhc
Confidence 48999999874
No 93
>KOG2719|consensus
Probab=33.75 E-value=26 Score=27.74 Aligned_cols=22 Identities=18% Similarity=0.359 Sum_probs=16.5
Q ss_pred hhhhhHHHHHHHHHHHHhhCCC
Q psy5111 15 DFQRATSVVTHEFTHQWFGDLV 36 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~w~g~lv 36 (132)
+.+++..+++||+.|---|..+
T Consensus 276 ~~eel~AVl~HELGHW~~~H~~ 297 (428)
T KOG2719|consen 276 NNEELVAVLAHELGHWKLNHVL 297 (428)
T ss_pred ccHHHHHHHHHHhhHHHHhhHH
Confidence 4578899999999996555443
No 94
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=32.65 E-value=23 Score=23.36 Aligned_cols=14 Identities=21% Similarity=0.501 Sum_probs=11.6
Q ss_pred hhHHHHHHHHHHHH
Q psy5111 18 RATSVVTHEFTHQW 31 (132)
Q Consensus 18 ~~~~~i~he~ah~w 31 (132)
....+++||+.|--
T Consensus 95 ~~~~~~~HElGH~L 108 (167)
T cd00203 95 EGAQTIAHELGHAL 108 (167)
T ss_pred cchhhHHHHHHHHh
Confidence 56788999999874
No 95
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=30.65 E-value=36 Score=22.11 Aligned_cols=15 Identities=33% Similarity=0.321 Sum_probs=11.5
Q ss_pred hhhhHHHHHHHHHHH
Q psy5111 16 FQRATSVVTHEFTHQ 30 (132)
Q Consensus 16 ~~~~~~~i~he~ah~ 30 (132)
...+..++.||+.|-
T Consensus 102 ~~~~~~v~~HEiGHa 116 (154)
T PF00413_consen 102 GNDLQSVAIHEIGHA 116 (154)
T ss_dssp SEEHHHHHHHHHHHH
T ss_pred hhhhhhhhhhccccc
Confidence 345678899998875
No 96
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=30.48 E-value=41 Score=22.14 Aligned_cols=13 Identities=31% Similarity=0.526 Sum_probs=10.8
Q ss_pred hhHHHHHHHHHHH
Q psy5111 18 RATSVVTHEFTHQ 30 (132)
Q Consensus 18 ~~~~~i~he~ah~ 30 (132)
....+++|||.|-
T Consensus 93 ~~~~~~~HEiGHa 105 (165)
T cd04268 93 RLRNTAEHELGHA 105 (165)
T ss_pred HHHHHHHHHHHHH
Confidence 4578899999986
No 97
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=30.07 E-value=7.4 Score=27.21 Aligned_cols=14 Identities=21% Similarity=0.399 Sum_probs=11.0
Q ss_pred hhHHHHHHHHHHHH
Q psy5111 18 RATSVVTHEFTHQW 31 (132)
Q Consensus 18 ~~~~~i~he~ah~w 31 (132)
..+.++|||++|.-
T Consensus 139 ~~a~~~aHElGH~L 152 (207)
T cd04273 139 SSAFTIAHELGHVL 152 (207)
T ss_pred eeEEeeeeechhhc
Confidence 34688999999874
No 98
>PF11458 Mistic: Membrane-integrating protein Mistic; InterPro: IPR021078 Mistic is an integral membrane protein that folds autonomously into the membrane []. It is conserved in the Bacilli bacteria. The protein forms a helical bundle with a polar lipid-facing surface. Mistic can be used for high-level production of other membrane proteins in their native conformations [].
Probab=29.66 E-value=43 Score=19.85 Aligned_cols=12 Identities=17% Similarity=0.611 Sum_probs=10.3
Q ss_pred HhHHHHHHHHHh
Q psy5111 118 LNEAFARLFEYF 129 (132)
Q Consensus 118 l~e~~~~~~~~~ 129 (132)
||||+.+|+..|
T Consensus 18 mnEGLD~fI~lY 29 (84)
T PF11458_consen 18 MNEGLDTFIQLY 29 (84)
T ss_pred HHhhHHHHHHHH
Confidence 799999998765
No 99
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=29.08 E-value=33 Score=28.55 Aligned_cols=14 Identities=50% Similarity=0.634 Sum_probs=11.5
Q ss_pred hhhHHHHHHHHHHH
Q psy5111 17 QRATSVVTHEFTHQ 30 (132)
Q Consensus 17 ~~~~~~i~he~ah~ 30 (132)
+....+++||++|-
T Consensus 254 ~~~~rv~~HEi~HA 267 (622)
T PTZ00257 254 QGTTRTVTHEVAHA 267 (622)
T ss_pred hHHHHHHHHHHHHH
Confidence 44578899999997
No 100
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=28.52 E-value=40 Score=22.16 Aligned_cols=13 Identities=31% Similarity=0.567 Sum_probs=9.6
Q ss_pred hhHHHHHHHHHHH
Q psy5111 18 RATSVVTHEFTHQ 30 (132)
Q Consensus 18 ~~~~~i~he~ah~ 30 (132)
.-.++++||+-|-
T Consensus 76 ~~~RIaaHE~GHi 88 (132)
T PF02031_consen 76 NSTRIAAHELGHI 88 (132)
T ss_dssp -HHHHHHHHHHHH
T ss_pred ccceeeeehhccc
Confidence 3457899999885
No 101
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=27.55 E-value=51 Score=23.20 Aligned_cols=23 Identities=22% Similarity=0.264 Sum_probs=15.7
Q ss_pred hhhhhHHHHHHHHHHHHhhCCCCch
Q psy5111 15 DFQRATSVVTHEFTHQWFGDLVTPA 39 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~w~g~lvt~~ 39 (132)
-..++...+.||+.|.+ +|-.|.
T Consensus 141 ~~~R~~Kea~HElGH~~--GL~HC~ 163 (194)
T PF07998_consen 141 FLERVCKEAVHELGHLF--GLDHCE 163 (194)
T ss_dssp HHHHHHHHHHHHHHHHT--T----S
T ss_pred HHHHHHHHHHHHHHHHc--CCcCCC
Confidence 35788899999999997 555554
No 102
>cd06007 R3H_DEXH_helicase R3H domain of a group of proteins which also contain a DEXH-box helicase domain, and may function as ATP-dependent DNA or RNA helicases. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=27.35 E-value=73 Score=17.79 Aligned_cols=16 Identities=19% Similarity=0.331 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHhhCCCC
Q psy5111 20 TSVVTHEFTHQWFGDLVT 37 (132)
Q Consensus 20 ~~~i~he~ah~w~g~lvt 37 (132)
.+.+.|++|+++ +|.+
T Consensus 29 eR~~vH~~a~~~--gL~s 44 (59)
T cd06007 29 ERAVIHRLCRKL--GLKS 44 (59)
T ss_pred HHHHHHHHHHHc--CCCc
Confidence 455899999994 4544
No 103
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=26.86 E-value=57 Score=22.58 Aligned_cols=17 Identities=35% Similarity=0.350 Sum_probs=13.2
Q ss_pred hhhhhHHHHHHHHHHHH
Q psy5111 15 DFQRATSVVTHEFTHQW 31 (132)
Q Consensus 15 ~~~~~~~~i~he~ah~w 31 (132)
-..++.+.++||+.|.+
T Consensus 121 ~~~R~~k~~~HElGH~l 137 (179)
T PRK13267 121 FEERVRKEVTHELGHTL 137 (179)
T ss_pred HHHHHHHHHHHHHHHHc
Confidence 34567777999999984
No 104
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=26.38 E-value=47 Score=23.74 Aligned_cols=14 Identities=36% Similarity=0.470 Sum_probs=11.0
Q ss_pred hhhHHHHHHHHHHH
Q psy5111 17 QRATSVVTHEFTHQ 30 (132)
Q Consensus 17 ~~~~~~i~he~ah~ 30 (132)
..++.+|+|||-|-
T Consensus 131 ~~~~hvi~HEiGH~ 144 (211)
T PF12388_consen 131 NVIEHVITHEIGHC 144 (211)
T ss_pred hHHHHHHHHHhhhh
Confidence 44567899999886
No 105
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=26.18 E-value=52 Score=22.37 Aligned_cols=22 Identities=27% Similarity=0.286 Sum_probs=15.5
Q ss_pred hHHHHHHHHHHHHhhCCCCchh
Q psy5111 19 ATSVVTHEFTHQWFGDLVTPAS 40 (132)
Q Consensus 19 ~~~~i~he~ah~w~g~lvt~~~ 40 (132)
+..++.||++|-+..-+...+-
T Consensus 7 ~i~i~~HE~gH~~~a~~~G~~~ 28 (192)
T PF02163_consen 7 LISIVLHELGHALAARLYGDKV 28 (192)
T ss_dssp HHHHHHHHHHHHHHHHTTT--B
T ss_pred cccccccccccccccccccccc
Confidence 3567899999998876665543
No 106
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=26.05 E-value=36 Score=23.88 Aligned_cols=11 Identities=27% Similarity=0.498 Sum_probs=9.6
Q ss_pred HHHHHHHHHHH
Q psy5111 21 SVVTHEFTHQW 31 (132)
Q Consensus 21 ~~i~he~ah~w 31 (132)
.+++|||.|..
T Consensus 139 ~~~aHEiGH~l 149 (206)
T PF13583_consen 139 QTFAHEIGHNL 149 (206)
T ss_pred hHHHHHHHHHh
Confidence 66999999986
No 107
>PF07098 DUF1360: Protein of unknown function (DUF1360); InterPro: IPR010773 This entry is represented by Mycobacterium phage PG1, Gp7. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several bacterial proteins of around 115 residues in length. Members of this family are found in Bacillus species and Streptomyces coelicolor, the function of the family is unknown.
Probab=25.38 E-value=84 Score=19.83 Aligned_cols=33 Identities=15% Similarity=0.327 Sum_probs=26.0
Q ss_pred hHHHHHHHHHHHHhhCCCCchhhhHHHHHHHHH
Q psy5111 19 ATSVVTHEFTHQWFGDLVTPASWNFAWLNEAFA 51 (132)
Q Consensus 19 ~~~~i~he~ah~w~g~lvt~~~w~~~wl~Eg~a 51 (132)
+...+...-.+.|+|.|++|-|=...|+.=+++
T Consensus 41 ~~~~~r~~~~r~~vGeLlsCpwC~gvWvA~~~~ 73 (105)
T PF07098_consen 41 VAEEPRGSGVRHWVGELLSCPWCTGVWVAAGLA 73 (105)
T ss_pred hHHhhcchhHHHHHHHHhcChhHHHHHHHHHHH
Confidence 344455566778999999999999999976665
No 108
>PF06167 Peptidase_M90: Glucose-regulated metallo-peptidase M90; InterPro: IPR010384 MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=24.84 E-value=44 Score=24.46 Aligned_cols=11 Identities=55% Similarity=0.942 Sum_probs=9.1
Q ss_pred HHHHHHHHHHH
Q psy5111 21 SVVTHEFTHQW 31 (132)
Q Consensus 21 ~~i~he~ah~w 31 (132)
.++-||+||+=
T Consensus 156 NVviHEfAH~L 166 (253)
T PF06167_consen 156 NVVIHEFAHKL 166 (253)
T ss_dssp -HHHHHHHHHH
T ss_pred chHHHHHHHHH
Confidence 67999999995
No 109
>COG3228 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.12 E-value=34 Score=24.98 Aligned_cols=11 Identities=45% Similarity=0.848 Sum_probs=9.6
Q ss_pred HHHHHHHHHHH
Q psy5111 20 TSVVTHEFTHQ 30 (132)
Q Consensus 20 ~~~i~he~ah~ 30 (132)
-.+|-||+||+
T Consensus 148 ~NlViHEfAHk 158 (266)
T COG3228 148 FNLVIHEFAHK 158 (266)
T ss_pred ceehHhHHhhh
Confidence 46799999999
No 110
>PHA00527 hypothetical protein
Probab=23.69 E-value=67 Score=20.36 Aligned_cols=16 Identities=25% Similarity=0.322 Sum_probs=12.5
Q ss_pred hhHHHHHHHHHHHHhh
Q psy5111 18 RATSVVTHEFTHQWFG 33 (132)
Q Consensus 18 ~~~~~i~he~ah~w~g 33 (132)
.-+.+|+||-||+=|.
T Consensus 72 GK~~T~~HECAH~AF~ 87 (129)
T PHA00527 72 GKAATLVHECAHVAFY 87 (129)
T ss_pred cHHHHHHHHHHHHHHH
Confidence 3467899999999664
No 111
>PRK15410 DgsA anti-repressor MtfA; Provisional
Probab=22.82 E-value=41 Score=24.80 Aligned_cols=12 Identities=25% Similarity=0.559 Sum_probs=10.2
Q ss_pred HHHHHHHHHHHH
Q psy5111 20 TSVVTHEFTHQW 31 (132)
Q Consensus 20 ~~~i~he~ah~w 31 (132)
..++-||+||+-
T Consensus 143 ~NVvIHEFAH~L 154 (260)
T PRK15410 143 FNLIIHEVAHKL 154 (260)
T ss_pred cchhHhHHHhHh
Confidence 467999999994
No 112
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=22.60 E-value=47 Score=20.74 Aligned_cols=11 Identities=27% Similarity=0.528 Sum_probs=9.7
Q ss_pred cchhhhHhHhh
Q psy5111 92 TSVVTHEFTHQ 102 (132)
Q Consensus 92 s~~I~hEvahq 102 (132)
..+++|||.|.
T Consensus 108 ~~~~~HEiGH~ 118 (124)
T PF13582_consen 108 VDTFAHEIGHN 118 (124)
T ss_dssp TTHHHHHHHHH
T ss_pred ceEeeehhhHh
Confidence 57899999997
No 113
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=22.40 E-value=60 Score=21.39 Aligned_cols=14 Identities=21% Similarity=0.480 Sum_probs=10.8
Q ss_pred hhhHHHHHHHHHHH
Q psy5111 17 QRATSVVTHEFTHQ 30 (132)
Q Consensus 17 ~~~~~~i~he~ah~ 30 (132)
..+..++.|||-|-
T Consensus 102 ~~~~~~~~HEiGHa 115 (156)
T cd04279 102 ENLQAIALHELGHA 115 (156)
T ss_pred hHHHHHHHHHhhhh
Confidence 45678899998774
No 114
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=21.79 E-value=1e+02 Score=17.29 Aligned_cols=12 Identities=8% Similarity=0.202 Sum_probs=8.5
Q ss_pred HHHHHHHHHHHH
Q psy5111 20 TSVVTHEFTHQW 31 (132)
Q Consensus 20 ~~~i~he~ah~w 31 (132)
.+.+.|.+||+.
T Consensus 30 eRriih~la~~l 41 (60)
T cd02639 30 ERRIVHLLASRL 41 (60)
T ss_pred HHHHHHHHHHHc
Confidence 344778888876
No 115
>COG4219 MecR1 Antirepressor regulating drug resistance, predicted signal transduction N-terminal membrane component [Transcription / Signal transduction mechanisms]
Probab=20.98 E-value=58 Score=24.90 Aligned_cols=48 Identities=8% Similarity=0.088 Sum_probs=31.1
Q ss_pred ccCCCCChhhhhhhHHHHHHHHHHH--------HhhCCCCchhhhHHHHHHHHHHH
Q psy5111 6 TSKGQCTTADFQRATSVVTHEFTHQ--------WFGDLVTPASWNFAWLNEAFARL 53 (132)
Q Consensus 6 ~~~~~~~~~~~~~~~~~i~he~ah~--------w~g~lvt~~~w~~~wl~Eg~a~y 53 (132)
........-..+++.-+|+||++|- |+-+.+.+--|-..-+.=+.+..
T Consensus 177 lP~d~~~r~~~ee~~yIilHEl~Hlk~gD~i~n~i~~~~~~l~WfNP~v~l~~~~~ 232 (337)
T COG4219 177 LPADFVERLTDEELKYIILHELSHLKRGDAIINLIVVVLGVLFWFNPLVHLGKRKI 232 (337)
T ss_pred ccHHHHhhcCHHhhhhhHhHHHhhhhcccHHHHHHHHHHhHHhhcChHHHHHHHHH
Confidence 3333444456677788899999996 66677777766665554444443
No 116
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=20.41 E-value=64 Score=21.60 Aligned_cols=17 Identities=35% Similarity=0.675 Sum_probs=11.9
Q ss_pred hhhhHHHHHHHHHHHHhh
Q psy5111 16 FQRATSVVTHEFTHQWFG 33 (132)
Q Consensus 16 ~~~~~~~i~he~ah~w~g 33 (132)
......+++||+-| |||
T Consensus 66 ~~~~g~TltHEvGH-~LG 82 (154)
T PF05572_consen 66 QYNFGKTLTHEVGH-WLG 82 (154)
T ss_dssp TS-SSHHHHHHHHH-HTT
T ss_pred ccccccchhhhhhh-hhc
Confidence 33447889999997 555
No 117
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=20.39 E-value=80 Score=21.72 Aligned_cols=13 Identities=31% Similarity=0.738 Sum_probs=10.2
Q ss_pred HHHHHHHHHHHHh
Q psy5111 20 TSVVTHEFTHQWF 32 (132)
Q Consensus 20 ~~~i~he~ah~w~ 32 (132)
..++.||++|-|-
T Consensus 10 ~~i~~HE~aHa~~ 22 (181)
T cd06158 10 LAITLHEFAHAYV 22 (181)
T ss_pred HHHHHHHHHHHHH
Confidence 4678899999863
Done!