Query psy7253
Match_columns 119
No_of_seqs 110 out of 1065
Neff 7.3
Searched_HMMs 46136
Date Fri Aug 16 21:57:20 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7253.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7253hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1046|consensus 100.0 1.8E-36 3.9E-41 260.5 7.1 110 9-118 238-365 (882)
2 TIGR02412 pepN_strep_liv amino 100.0 1.1E-31 2.3E-36 230.0 8.7 109 8-118 200-328 (831)
3 PF01433 Peptidase_M1: Peptida 100.0 2.1E-31 4.5E-36 209.1 7.0 109 10-118 210-336 (390)
4 TIGR02414 pepN_proteo aminopep 100.0 2.7E-31 5.8E-36 227.9 7.9 108 9-116 197-322 (863)
5 PRK14015 pepN aminopeptidase N 100.0 3.7E-31 7.9E-36 227.3 8.4 107 10-116 211-335 (875)
6 COG0308 PepN Aminopeptidase N 99.9 1.7E-28 3.7E-33 210.9 7.5 107 11-118 224-348 (859)
7 TIGR02411 leuko_A4_hydro leuko 99.9 9.4E-27 2E-31 194.0 5.9 98 10-117 202-319 (601)
8 KOG1047|consensus 99.8 3.4E-20 7.4E-25 151.3 5.4 101 5-115 206-326 (613)
9 KOG1932|consensus 98.5 1E-07 2.2E-12 83.6 5.2 114 2-118 251-382 (1180)
10 PF13485 Peptidase_MA_2: Pepti 98.3 6.7E-07 1.4E-11 59.2 2.8 34 81-114 22-62 (128)
11 COG3975 Predicted protease wit 96.2 0.027 6E-07 47.1 7.8 108 5-117 162-298 (558)
12 PF10026 DUF2268: Predicted Zn 95.1 0.064 1.4E-06 39.2 5.4 77 38-117 16-109 (195)
13 COG4324 Predicted aminopeptida 93.8 0.031 6.8E-07 43.5 1.5 34 82-115 195-229 (376)
14 PF10023 DUF2265: Predicted am 93.4 0.043 9.4E-07 43.6 1.6 35 82-116 163-198 (337)
15 PF12315 DUF3633: Protein of u 91.8 0.17 3.6E-06 37.8 2.8 36 84-119 93-133 (212)
16 smart00731 SprT SprT homologue 91.0 0.077 1.7E-06 37.0 0.3 61 32-98 13-73 (146)
17 PF06114 DUF955: Domain of unk 88.4 0.41 8.8E-06 30.7 2.3 32 66-101 28-59 (122)
18 PRK04860 hypothetical protein; 88.3 0.42 9E-06 34.2 2.4 63 27-98 13-77 (160)
19 PRK04351 hypothetical protein; 84.9 0.66 1.4E-05 32.8 1.9 16 81-96 58-73 (149)
20 COG4783 Putative Zn-dependent 83.5 1.2 2.6E-05 37.0 3.1 53 38-95 89-141 (484)
21 PF10263 SprT-like: SprT-like 83.4 0.43 9.4E-06 33.0 0.4 20 81-100 57-76 (157)
22 PF04450 BSP: Peptidase of pla 82.9 1.5 3.2E-05 32.6 3.1 36 80-115 92-131 (205)
23 PF12725 DUF3810: Protein of u 81.9 0.66 1.4E-05 36.5 1.0 29 83-116 195-223 (318)
24 PF10460 Peptidase_M30: Peptid 77.7 1.9 4.2E-05 34.8 2.4 38 81-118 136-184 (366)
25 cd04269 ZnMc_adamalysin_II_lik 76.5 5 0.00011 28.6 4.2 15 82-96 129-143 (194)
26 PF13574 Reprolysin_2: Metallo 75.5 1.6 3.5E-05 31.1 1.3 13 84-96 111-123 (173)
27 cd04272 ZnMc_salivary_gland_MP 75.2 12 0.00027 27.4 6.0 14 83-96 144-157 (220)
28 PF08325 WLM: WLM domain; Int 72.8 1.8 3.9E-05 31.6 1.0 20 80-99 78-97 (186)
29 PHA02456 zinc metallopeptidase 71.9 2.6 5.7E-05 28.7 1.6 15 83-97 78-92 (141)
30 KOG2661|consensus 70.9 2.8 6.1E-05 33.7 1.8 19 80-98 271-289 (424)
31 PF01435 Peptidase_M48: Peptid 70.2 2.7 5.8E-05 30.2 1.5 23 81-103 86-108 (226)
32 cd04273 ZnMc_ADAMTS_like Zinc- 68.1 8.1 0.00018 28.1 3.6 13 83-95 139-151 (207)
33 PF13699 DUF4157: Domain of un 64.5 3.3 7.3E-05 26.0 0.9 13 84-96 61-73 (79)
34 PF08014 DUF1704: Domain of un 63.5 23 0.00049 28.4 5.6 67 42-117 132-214 (349)
35 PRK03001 M48 family peptidase; 62.8 4.7 0.0001 31.0 1.6 19 81-99 121-139 (283)
36 PF01863 DUF45: Protein of unk 62.2 6.4 0.00014 28.2 2.1 35 57-99 145-179 (205)
37 PF13688 Reprolysin_5: Metallo 60.2 4.8 0.0001 28.6 1.2 15 82-96 140-154 (196)
38 COG5504 Predicted Zn-dependent 59.4 6.4 0.00014 30.5 1.7 40 80-119 136-186 (280)
39 PF01447 Peptidase_M4: Thermol 55.9 5.7 0.00012 28.0 0.9 64 28-95 81-146 (150)
40 COG0501 HtpX Zn-dependent prot 54.2 6.7 0.00015 29.6 1.1 56 40-102 117-175 (302)
41 PF01431 Peptidase_M13: Peptid 53.1 5.7 0.00012 28.6 0.5 25 72-96 24-48 (206)
42 PRK03072 heat shock protein Ht 52.9 7.2 0.00016 30.1 1.1 21 81-101 124-144 (288)
43 KOG2719|consensus 52.4 16 0.00035 30.2 3.0 65 26-95 222-291 (428)
44 PRK03982 heat shock protein Ht 51.7 7.8 0.00017 29.8 1.1 19 81-99 122-140 (288)
45 PRK01345 heat shock protein Ht 51.3 7.9 0.00017 30.4 1.1 21 81-101 121-141 (317)
46 PRK05457 heat shock protein Ht 50.6 8.3 0.00018 29.8 1.1 22 81-102 131-152 (284)
47 PRK04897 heat shock protein Ht 50.6 8.2 0.00018 29.9 1.1 21 81-101 134-154 (298)
48 PF09768 Peptidase_M76: Peptid 50.4 49 0.0011 23.9 5.0 17 80-96 67-83 (173)
49 PF13265 DUF4056: Protein of u 49.4 7.6 0.00016 30.0 0.7 21 89-110 153-174 (270)
50 cd04267 ZnMc_ADAM_like Zinc-de 48.9 7.9 0.00017 27.5 0.7 14 82-95 131-144 (192)
51 PF05569 Peptidase_M56: BlaR1 48.6 23 0.0005 27.0 3.3 15 81-95 193-207 (299)
52 COG3590 PepO Predicted metallo 46.9 7.4 0.00016 33.4 0.4 26 71-96 474-499 (654)
53 PRK02391 heat shock protein Ht 46.8 10 0.00022 29.5 1.1 21 81-101 130-150 (296)
54 PRK02870 heat shock protein Ht 46.4 10 0.00023 30.1 1.1 17 81-97 170-186 (336)
55 COG1451 Predicted metal-depend 45.4 11 0.00024 28.2 1.1 18 82-99 173-190 (223)
56 COG2856 Predicted Zn peptidase 45.4 32 0.00069 25.7 3.5 43 68-114 60-114 (213)
57 PF14891 Peptidase_M91: Effect 43.2 14 0.0003 26.3 1.3 15 83-97 102-116 (174)
58 cd04270 ZnMc_TACE_like Zinc-de 42.3 13 0.00029 27.9 1.1 13 84-96 167-179 (244)
59 PF15641 Tox-MPTase5: Metallop 42.1 19 0.0004 23.6 1.6 15 86-100 67-81 (109)
60 PF01421 Reprolysin: Reprolysi 42.1 14 0.00031 26.4 1.2 15 81-95 128-142 (199)
61 PF13582 Reprolysin_3: Metallo 42.0 14 0.0003 24.1 1.0 12 85-96 108-119 (124)
62 PRK01265 heat shock protein Ht 41.1 14 0.00031 29.3 1.1 20 81-100 137-156 (324)
63 PF08219 TOM13: Outer membrane 40.4 16 0.00036 23.0 1.1 13 84-96 52-64 (77)
64 TIGR02421 QEGLA conserved hypo 39.7 47 0.001 26.9 3.9 67 42-117 155-237 (366)
65 COG4900 Predicted metallopepti 39.7 15 0.00033 25.1 0.9 13 85-97 81-93 (133)
66 KOG3314|consensus 36.7 1.2E+02 0.0025 22.3 5.1 61 25-95 38-102 (194)
67 COG3091 SprT Zn-dependent meta 36.7 9.7 0.00021 27.2 -0.4 13 82-94 59-71 (156)
68 PF14521 Aspzincin_M35: Lysine 34.6 21 0.00046 24.8 1.1 13 82-94 94-106 (148)
69 cd04271 ZnMc_ADAM_fungal Zinc- 34.1 16 0.00034 27.3 0.4 10 86-95 147-156 (228)
70 COG5465 Uncharacterized conser 33.9 1.5E+02 0.0033 21.3 5.2 64 31-97 64-138 (166)
71 PF12388 Peptidase_M57: Dual-a 33.2 27 0.00059 26.1 1.5 27 82-109 131-167 (211)
72 PF04036 DUF372: Domain of unk 31.1 36 0.00078 18.6 1.4 23 91-113 13-36 (38)
73 PF07607 DUF1570: Protein of u 30.3 19 0.00041 24.7 0.3 12 105-116 31-42 (128)
74 cd00203 ZnMc Zinc-dependent me 29.6 26 0.00056 23.9 0.8 13 83-95 95-107 (167)
75 cd02639 R3H_RRM R3H domain of 27.4 41 0.0009 20.0 1.3 14 83-96 28-41 (60)
76 PF06262 DUF1025: Possibl zinc 27.0 39 0.00085 22.1 1.3 35 59-95 50-84 (97)
77 PF04228 Zn_peptidase: Putativ 25.9 37 0.00081 26.5 1.2 12 84-95 170-181 (292)
78 PF00413 Peptidase_M10: Matrix 25.9 38 0.00082 22.8 1.1 14 82-95 103-116 (154)
79 PF13583 Reprolysin_4: Metallo 24.9 35 0.00075 25.0 0.8 11 86-96 139-149 (206)
80 PF01457 Peptidase_M8: Leishma 23.9 49 0.0011 27.7 1.6 15 81-95 207-221 (521)
81 COG4101 Predicted mannose-6-ph 22.0 22 0.00047 24.6 -0.7 14 89-102 74-87 (142)
82 cd06007 R3H_DEXH_helicase R3H 21.9 54 0.0012 19.4 1.1 13 83-95 27-39 (59)
83 PF02163 Peptidase_M50: Peptid 21.6 47 0.001 23.4 0.9 21 84-104 7-27 (192)
84 PF14831 DUF4484: Domain of un 21.5 59 0.0013 23.7 1.4 14 85-98 145-158 (176)
85 PF02031 Peptidase_M7: Strepto 21.4 50 0.0011 23.0 1.0 12 84-95 77-88 (132)
86 COG4758 Predicted membrane pro 21.1 36 0.00078 25.9 0.3 14 91-104 115-128 (235)
87 PHA00527 hypothetical protein 20.9 62 0.0013 21.8 1.3 32 84-115 73-112 (129)
No 1
>KOG1046|consensus
Probab=100.00 E-value=1.8e-36 Score=260.46 Aligned_cols=110 Identities=37% Similarity=0.559 Sum_probs=103.7
Q ss_pred CCCCeEEEEEEcCCCCC-----------chhHHHHhhCCCCCCCccCEEEeCCCCCccccccccceeeccceeccCCCCc
Q psy7253 9 REPLIKVSVVGSAKDVN-----------SVPFLDSWARPAVRDCLDDLIAIPDFGTGAMENWGLITYRETSILYDEQETS 77 (119)
Q Consensus 9 ~~~~i~v~v~~~~~~~~-----------~l~~~e~~~~~~yp~~kld~v~~P~~~~~~men~Gli~~~e~~ll~~~~~~~ 77 (119)
+..|++||+|++|+..+ .++|+++||+++||++|+|+|++|+|..|||||||||+|+|..+|+++..++
T Consensus 238 ~~~~v~vrv~a~p~~~~~~~~al~~~~~~L~~~e~~f~i~yPLpK~D~iavPdf~~GAMENwGLvtyre~~lL~~~~~ss 317 (882)
T KOG1046|consen 238 TKSGVPVRVYARPEKINQGQFALEVATKVLEFYEDYFGIPYPLPKLDLVAVPDFSAGAMENWGLVTYRETALLYDPQTSS 317 (882)
T ss_pred cCCCceEEEEeChHHhhHHHHHHHHHHHHHHHHHHHhCCCCCCccccEEecCCccccchhcCcceeeeehhhccCCCcCc
Confidence 67789999999998764 4678999999999999999999999999999999999999999999999899
Q ss_pred cccchhhhhhHhhhhhHhhhcCCCCCC-------ChhHHHHHhhhhhc
Q psy7253 78 ASGHNWVAVVVAHELAPIRHNAKVAKP-------ANGKTTFVYLFDFK 118 (119)
Q Consensus 78 ~~~~~~~~~~iaHEiaHqWfGnlVt~~-------~EG~a~y~~~~~~~ 118 (119)
.+++++++.+||||+||||||||||++ |||||+|++++..+
T Consensus 318 ~~~k~~va~vIaHElAHQWFGNLVTm~wW~dLWLnEGfAt~~~~~~v~ 365 (882)
T KOG1046|consen 318 SSNKQRVAEVIAHELAHQWFGNLVTMKWWNDLWLNEGFATYVEYLAVD 365 (882)
T ss_pred HHHHHHHHHHHHHHHHHHHhcCcccHhhhhhhhhcccHHHHHHHHhhc
Confidence 999999999999999999999999999 99999999998754
No 2
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=99.97 E-value=1.1e-31 Score=229.99 Aligned_cols=109 Identities=18% Similarity=0.203 Sum_probs=94.2
Q ss_pred eCCCCeEEEEEEcCCCCC-------------chhHHHHhhCCCCCCCccCEEEeCCCCCccccccccceeeccceeccCC
Q psy7253 8 DREPLIKVSVVGSAKDVN-------------SVPFLDSWARPAVRDCLDDLIAIPDFGTGAMENWGLITYRETSILYDEQ 74 (119)
Q Consensus 8 ~~~~~i~v~v~~~~~~~~-------------~l~~~e~~~~~~yp~~kld~v~~P~~~~~~men~Gli~~~e~~ll~~~~ 74 (119)
.+.+|+||++|++|+..+ .+++++++||+|||++|+|+|++|+|..|||||||+|+|+|..+ +.+.
T Consensus 200 ~~~~gvpi~v~~~~~~~~~~~~~~al~~~~~~l~~~e~~fg~pYP~~k~d~V~vP~f~~GaMEn~Glit~~e~~l-~~~~ 278 (831)
T TIGR02412 200 DESRSYPLGIYARRSLAQYLDADAIFTITRQGLAFFHRKFGYPYPFKKYDQIFVPEFNAGAMENAGCVTFAENFL-HRAE 278 (831)
T ss_pred ecCCCEEEEEEECcchhhhhhHHHHHHHHHHHHHHHHHHhCCCCCcccCCEEEcCCCCCCcccccceeeechhhc-cCCc
Confidence 356899999999997532 35789999999999999999999999999999999999999964 4443
Q ss_pred CCccccchhhhhhHhhhhhHhhhcCCCCCC-------ChhHHHHHhhhhhc
Q psy7253 75 ETSASGHNWVAVVVAHELAPIRHNAKVAKP-------ANGKTTFVYLFDFK 118 (119)
Q Consensus 75 ~~~~~~~~~~~~~iaHEiaHqWfGnlVt~~-------~EG~a~y~~~~~~~ 118 (119)
.+...++.++.+||||+|||||||+||++ |||||+|+++++.+
T Consensus 279 -~~~~~~~~~~~viaHElAHqWFGnlVT~~wW~dlWLnEGFAty~e~~~~~ 328 (831)
T TIGR02412 279 -ATRAEKENRAGVILHEMAHMWFGDLVTMRWWNDLWLNESFAEYMGTLASA 328 (831)
T ss_pred -CCHHHHHHHHHHHHHHHHHHHhCCEeccccccchhHHHHHHHHHHHHHHH
Confidence 33455677889999999999999999999 99999999988754
No 3
>PF01433 Peptidase_M1: Peptidase family M1 This is family M1 in the peptidase classification.; InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=99.97 E-value=2.1e-31 Score=209.11 Aligned_cols=109 Identities=29% Similarity=0.465 Sum_probs=93.6
Q ss_pred CCCeEEEEEEcCCCCCc-----------hhHHHHhhCCCCCCCccCEEEeCCCCCccccccccceeeccceeccCCCCcc
Q psy7253 10 EPLIKVSVVGSAKDVNS-----------VPFLDSWARPAVRDCLDDLIAIPDFGTGAMENWGLITYRETSILYDEQETSA 78 (119)
Q Consensus 10 ~~~i~v~v~~~~~~~~~-----------l~~~e~~~~~~yp~~kld~v~~P~~~~~~men~Gli~~~e~~ll~~~~~~~~ 78 (119)
.+|++|++|++|+..+. ++++++++|.+||++|+|+|++|+|+.+||||||+|+|+|+.++++++.++.
T Consensus 210 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~yp~~k~~~v~~p~~~~~~me~~g~i~~~~~~l~~~~~~~~~ 289 (390)
T PF01433_consen 210 KSGVPVRVYARPGDEEQLQFALDIAPKALEYYEEYFGIPYPFKKLDIVAVPDFPFGGMENWGLITYRESYLLYDPDISTI 289 (390)
T ss_dssp TTEEEEEEEEECTCGGGHHHHHHHHHHHHHHHHHHHTS--SSSEEEEEEEST-SSSEE--TTEEEEEGGGTS-STTTS-H
T ss_pred ccccchheeehhhhHHHHHHHHHhhHHHHHHHHhhccccceecceeEEEEeccccccccccccccccccccccCcccccc
Confidence 34699999999997654 4689999999999999999999999999999999999999999999888887
Q ss_pred ccchhhhhhHhhhhhHhhhcCCCCCC-------ChhHHHHHhhhhhc
Q psy7253 79 SGHNWVAVVVAHELAPIRHNAKVAKP-------ANGKTTFVYLFDFK 118 (119)
Q Consensus 79 ~~~~~~~~~iaHEiaHqWfGnlVt~~-------~EG~a~y~~~~~~~ 118 (119)
.++..+..+||||+|||||||+||++ +||||+|+++++.+
T Consensus 290 ~~~~~~~~~iahElahqWfGn~vt~~~w~d~WL~Eg~a~y~~~~~~~ 336 (390)
T PF01433_consen 290 GDKQEIASLIAHELAHQWFGNLVTPKWWSDLWLNEGFATYLEYLILE 336 (390)
T ss_dssp HHHHHHHHHHHHHHHTTTBTTTEEESSGGGHHHHHHHHHHHHHHHHH
T ss_pred hhhhhhHHHHHHHHHHHHhccCCccccchhhhHHHHHHHHHHHHhHh
Confidence 78888999999999999999999999 99999999998765
No 4
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=99.97 E-value=2.7e-31 Score=227.85 Aligned_cols=108 Identities=19% Similarity=0.187 Sum_probs=96.7
Q ss_pred CCCCeEEEEEEcCCCCC-----------chhHHHHhhCCCCCCCccCEEEeCCCCCccccccccceeeccceeccCCCCc
Q psy7253 9 REPLIKVSVVGSAKDVN-----------SVPFLDSWARPAVRDCLDDLIAIPDFGTGAMENWGLITYRETSILYDEQETS 77 (119)
Q Consensus 9 ~~~~i~v~v~~~~~~~~-----------~l~~~e~~~~~~yp~~kld~v~~P~~~~~~men~Gli~~~e~~ll~~~~~~~ 77 (119)
..+++++++|++|+..+ .++|+|++||++||++|+|+|++|+|+.|||||||+++|+++.+|.++...+
T Consensus 197 sg~~v~l~iy~~p~~~~~~~~al~~~~~~L~~~E~~fG~pYPl~k~diVavpdf~~GaMEN~GLi~f~e~~lL~~~~~~t 276 (863)
T TIGR02414 197 SGREVALRVYVEEGNKDKCDHAMESLKKAMKWDEEVFGLEYDLDIFMIVAVDDFNMGAMENKGLNIFNSKYVLADPETAT 276 (863)
T ss_pred CCCceEEEEEEccCcHHHHHHHHHHHHHHHHHHHHHhCCCCChhhccEEecCCCCCccccccceeccccceEEeCCCCCC
Confidence 34678999999997643 4678999999999999999999999999999999999999999999887666
Q ss_pred cccchhhhhhHhhhhhHhhhcCCCCCC-------ChhHHHHHhhhh
Q psy7253 78 ASGHNWVAVVVAHELAPIRHNAKVAKP-------ANGKTTFVYLFD 116 (119)
Q Consensus 78 ~~~~~~~~~~iaHEiaHqWfGnlVt~~-------~EG~a~y~~~~~ 116 (119)
..++..+..+||||+|||||||+||++ |||||+|++.++
T Consensus 277 d~~~~~i~~VIaHElaHqWfGNlVT~~~W~~LWLnEGfAty~e~~~ 322 (863)
T TIGR02414 277 DADYERIESVIAHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEF 322 (863)
T ss_pred HHHHHHHHHHHHHHHHHHHhcceeeecchhhhhhhhhHHHHHHHHH
Confidence 666778899999999999999999999 999999999654
No 5
>PRK14015 pepN aminopeptidase N; Provisional
Probab=99.97 E-value=3.7e-31 Score=227.33 Aligned_cols=107 Identities=20% Similarity=0.204 Sum_probs=95.8
Q ss_pred CCCeEEEEEEcCCCC-----------CchhHHHHhhCCCCCCCccCEEEeCCCCCccccccccceeeccceeccCCCCcc
Q psy7253 10 EPLIKVSVVGSAKDV-----------NSVPFLDSWARPAVRDCLDDLIAIPDFGTGAMENWGLITYRETSILYDEQETSA 78 (119)
Q Consensus 10 ~~~i~v~v~~~~~~~-----------~~l~~~e~~~~~~yp~~kld~v~~P~~~~~~men~Gli~~~e~~ll~~~~~~~~ 78 (119)
.++++|++|++|+.. +.++|+|++||++||++|+|+|++|+|+.|||||||+++|++..++.++...+.
T Consensus 211 g~~vpl~iy~~p~~~~~~~~al~~~~~~L~~~E~~FG~pYP~~k~diVavp~f~~GaMEN~Gl~~f~~~~lL~~~~~~t~ 290 (875)
T PRK14015 211 GREVALEIYVEPGNLDKCDHAMDSLKKSMKWDEERFGLEYDLDIFMIVAVDDFNMGAMENKGLNIFNSKYVLADPETATD 290 (875)
T ss_pred CCeEEEEEEEeCCcHHHHHHHHHHHHHHHHHHHHHhCCCCChhhhCEEeCCCCCCcccccccccccccceEecCcccCCH
Confidence 356999999999764 356789999999999999999999999999999999999999999988775566
Q ss_pred ccchhhhhhHhhhhhHhhhcCCCCCC-------ChhHHHHHhhhh
Q psy7253 79 SGHNWVAVVVAHELAPIRHNAKVAKP-------ANGKTTFVYLFD 116 (119)
Q Consensus 79 ~~~~~~~~~iaHEiaHqWfGnlVt~~-------~EG~a~y~~~~~ 116 (119)
.++..+..+||||+|||||||+||++ |||||+|++..+
T Consensus 291 ~~~~~i~~vIaHElaHqWFGNlVT~~~W~dLWLnEGFAty~e~~~ 335 (875)
T PRK14015 291 ADYERIESVIAHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEF 335 (875)
T ss_pred HHHHHHHHHHHHHHHHHHHhCcceecchhhhhhhhHHHHHHHHHH
Confidence 66777899999999999999999998 999999997654
No 6
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=99.95 E-value=1.7e-28 Score=210.88 Aligned_cols=107 Identities=30% Similarity=0.430 Sum_probs=97.4
Q ss_pred CCeEEEEEEcCCC-----------CCchhHHHHhhCCCCCCCccCEEEeCCCCCccccccccceeeccceeccCCCCccc
Q psy7253 11 PLIKVSVVGSAKD-----------VNSVPFLDSWARPAVRDCLDDLIAIPDFGTGAMENWGLITYRETSILYDEQETSAS 79 (119)
Q Consensus 11 ~~i~v~v~~~~~~-----------~~~l~~~e~~~~~~yp~~kld~v~~P~~~~~~men~Gli~~~e~~ll~~~~~~~~~ 79 (119)
+++++++|+++.. .++++|++++||.+||+++ ++|++|+|+.|||||||+++|++..+|.++...+..
T Consensus 224 ~~v~l~iy~~~g~~~~a~~~~~~~~~~~~~~e~~fg~~y~l~~-~~V~v~~f~~GaMEN~Gl~tf~~~~ll~~~~~at~~ 302 (859)
T COG0308 224 RDVPLEIYVPPGVLDRAKYALDETKRSIEFYEEYFGLPYALPI-DIVAVPDFSAGAMENWGLVTFREKYLLADPETATDS 302 (859)
T ss_pred CCeeEEEEecCcchhhhhhhHHHHHHHhhhHHHhcCCCCCCcc-cEEeccCCCCccccccceeEEeeeEEeeCcccchhH
Confidence 5899999999933 2467899999999999999 999999999999999999999999999887767777
Q ss_pred cchhhhhhHhhhhhHhhhcCCCCCC-------ChhHHHHHhhhhhc
Q psy7253 80 GHNWVAVVVAHELAPIRHNAKVAKP-------ANGKTTFVYLFDFK 118 (119)
Q Consensus 80 ~~~~~~~~iaHEiaHqWfGnlVt~~-------~EG~a~y~~~~~~~ 118 (119)
.++.++.+|+||+|||||||+||++ |||||+|++..+.+
T Consensus 303 ~~~~~~~viaHElaHqWfGnlVT~~~W~~lWLnEgfat~~e~~~~~ 348 (859)
T COG0308 303 DYENVEEVIAHELAHQWFGNLVTMKWWDDLWLNEGFATFREVLWSE 348 (859)
T ss_pred HHHHHHHHHHHHHhhhcccceeeccCHHHHHHhhhhHHHHHHHHHH
Confidence 7788999999999999999999999 99999999987754
No 7
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=99.93 E-value=9.4e-27 Score=193.97 Aligned_cols=98 Identities=20% Similarity=0.184 Sum_probs=78.4
Q ss_pred CCCeEEEEEEcCCCCC------------chhHHHHhhCCCCCCCccCEEEe-CCCCCccccccccceeeccceeccCCCC
Q psy7253 10 EPLIKVSVVGSAKDVN------------SVPFLDSWARPAVRDCLDDLIAI-PDFGTGAMENWGLITYRETSILYDEQET 76 (119)
Q Consensus 10 ~~~i~v~v~~~~~~~~------------~l~~~e~~~~~~yp~~kld~v~~-P~~~~~~men~Gli~~~e~~ll~~~~~~ 76 (119)
..|..+++|++|+..+ .+++++++ +.|||++|+|+|++ |+|+.||||||| ++|.+..++.+.
T Consensus 202 ~~g~~~~v~~~p~~~~~~~~~~~~~~~~~l~~~e~~-~~pYp~~k~d~vvlpp~f~~GgMEN~~-ltf~~~~ll~~d--- 276 (601)
T TIGR02411 202 PIGPRSSVYSEPEQLEKCQYEFEHDTENFIKTAEDL-IFPYEWGQYDLLVLPPSFPYGGMENPN-LTFATPTLIAGD--- 276 (601)
T ss_pred ccCCceEEEccchhHHHHHHHHHHhHHHHHHHHHHh-CCCCcCccceEEEecCccccccccccc-ceeeccccccCC---
Confidence 3467899999986443 34455666 45899999999987 799999999999 577777766432
Q ss_pred ccccchhhhhhHhhhhhHhhhcCCCCCC-------ChhHHHHHhhhhh
Q psy7253 77 SASGHNWVAVVVAHELAPIRHNAKVAKP-------ANGKTTFVYLFDF 117 (119)
Q Consensus 77 ~~~~~~~~~~~iaHEiaHqWfGnlVt~~-------~EG~a~y~~~~~~ 117 (119)
+ ....+||||||||||||+||++ |||||+|+|+...
T Consensus 277 ----~-s~~~viaHElAHqWfGNlVT~~~W~d~WLnEGfaty~e~~~~ 319 (601)
T TIGR02411 277 ----R-SNVDVIAHELAHSWSGNLVTNCSWEHFWLNEGWTVYLERRIV 319 (601)
T ss_pred ----h-hhhhhHHHHHHhhccCceeecCCchHHHHHhhHHHHHHHHHH
Confidence 1 2357999999999999999999 9999999998654
No 8
>KOG1047|consensus
Probab=99.80 E-value=3.4e-20 Score=151.26 Aligned_cols=101 Identities=21% Similarity=0.175 Sum_probs=78.8
Q ss_pred EEeeCCCCeEEEEEEcCCCCCc------------hhHHHHhhCCCCCCCccCEEEeC-CCCCccccccccceeeccceec
Q psy7253 5 HVSDREPLIKVSVVGSAKDVNS------------VPFLDSWARPAVRDCLDDLIAIP-DFGTGAMENWGLITYRETSILY 71 (119)
Q Consensus 5 ~~~~~~~~i~v~v~~~~~~~~~------------l~~~e~~~~~~yp~~kld~v~~P-~~~~~~men~Gli~~~e~~ll~ 71 (119)
.++..+-|..-+||++|...++ |+-.|+.+| ||++..+|+|++| +|+.|||||+-+.+.. ..+|-
T Consensus 206 ~L~s~eIgpRs~VwaEp~~~~a~~~ef~~~~e~~L~~Ae~l~G-pY~WgryDllvlPpSFP~gGMENPcltF~T-pTlla 283 (613)
T KOG1047|consen 206 DLESREIGPRSRVWAEPCLLDACQEEFAGETEDFLKAAEKLFG-PYVWGRYDLLVLPPSFPFGGMENPCLTFVT-PTLLA 283 (613)
T ss_pred cccccccCCccceecchhhhHHHHHHHHhhhHHHHHHHHHHcC-CcccccceEEEecCCCCcccccCcceeeec-chhhc
Confidence 3455566777899999976543 344567887 6999999999975 9999999999655444 44443
Q ss_pred cCCCCccccchhhhhhHhhhhhHhhhcCCCCCC-------ChhHHHHHhhh
Q psy7253 72 DEQETSASGHNWVAVVVAHELAPIRHNAKVAKP-------ANGKTTFVYLF 115 (119)
Q Consensus 72 ~~~~~~~~~~~~~~~~iaHEiaHqWfGnlVt~~-------~EG~a~y~~~~ 115 (119)
. + .....+|||||||-|||||||.. ||||++|+|.-
T Consensus 284 G-D-------rsl~~vIaHEIAHSWtGNlVTN~sWehfWLNEGfTvylErr 326 (613)
T KOG1047|consen 284 G-D-------RSLVDVIAHEIAHSWTGNLVTNASWEHFWLNEGFTVYLERR 326 (613)
T ss_pred C-C-------cchhhHHHHHhhhhhcccccccCccchhhhcccchhhhhhh
Confidence 2 2 23478999999999999999999 99999999874
No 9
>KOG1932|consensus
Probab=98.54 E-value=1e-07 Score=83.58 Aligned_cols=114 Identities=10% Similarity=0.049 Sum_probs=91.3
Q ss_pred cceEEeeCCCCeEEEEEEcCCCCC-----------chhHHHHhhCCCCCCCccCEEEeCCCCCccccccccceeecccee
Q psy7253 2 GLIHVSDREPLIKVSVVGSAKDVN-----------SVPFLDSWARPAVRDCLDDLIAIPDFGTGAMENWGLITYRETSIL 70 (119)
Q Consensus 2 ~~~~~~~~~~~i~v~v~~~~~~~~-----------~l~~~e~~~~~~yp~~kld~v~~P~~~~~~men~Gli~~~e~~ll 70 (119)
|-|+..+...+++|..||.|...+ .++|+|+.++..||+.-+.+|.+|.-....+....+.++.-+ +|
T Consensus 251 G~F~~~~~P~~~~i~~f~LP~~~~~v~nt~~~l~k~iefye~~ls~rYPF~~~k~VFvd~~~~~i~~~asl~I~st~-lL 329 (1180)
T KOG1932|consen 251 GPFKSYVEPSMIDITHFCLPGLEPLVKNTTVYLHKAIEFYEEELSSRYPFSCYKTVFVDEAAVEISSYASLSIFSTS-LL 329 (1180)
T ss_pred ccccccCCCccCcceeEecCcchHHhhhHHHHHHHHHHHHHHHhccCCCcceeeEEEecCCcceeeecceeeeeecc-cc
Confidence 445556667799999999997654 457999999988999999999999777667777767666665 57
Q ss_pred ccCCCCccccchhhhhhHhhhhhHhhhcCCCCCC-------ChhHHHHHhhhhhc
Q psy7253 71 YDEQETSASGHNWVAVVVAHELAPIRHNAKVAKP-------ANGKTTFVYLFDFK 118 (119)
Q Consensus 71 ~~~~~~~~~~~~~~~~~iaHEiaHqWfGnlVt~~-------~EG~a~y~~~~~~~ 118 (119)
|+... ..+.....+.+|-.+|-||||-.+|+. -+|+|.|+..++.|
T Consensus 330 y~~~i--IDq~~~tr~~La~aLA~Q~fg~yIsp~~wsD~Wl~~GiagYl~~l~~k 382 (1180)
T KOG1932|consen 330 YSKNI--IDQTFLTRRKLAWALASQWFGVYISPVDWSDFWLLKGIAGYLTGLFVK 382 (1180)
T ss_pred chHhh--hhHHHHHHHHHHHHHHHhhhEEEeeccchhhhHHHHhHHHHHHHHHHH
Confidence 76542 334455677899999999999999999 89999999988765
No 10
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=98.26 E-value=6.7e-07 Score=59.16 Aligned_cols=34 Identities=26% Similarity=0.393 Sum_probs=28.8
Q ss_pred chhhhhhHhhhhhHhhhcCCCCC---C----ChhHHHHHhh
Q psy7253 81 HNWVAVVVAHELAPIRHNAKVAK---P----ANGKTTFVYL 114 (119)
Q Consensus 81 ~~~~~~~iaHEiaHqWfGnlVt~---~----~EG~a~y~~~ 114 (119)
......+++||++|+|+++.++. . +||+|+|++.
T Consensus 22 ~~~~~~~l~HE~~H~~~~~~~~~~~~~~~W~~EG~A~y~~~ 62 (128)
T PF13485_consen 22 EDWLDRVLAHELAHQWFGNYFGGDDNAPRWFNEGLAEYVEG 62 (128)
T ss_pred HHHHHHHHHHHHHHHHHHHHcCCCccCchHHHHHHHHHHhc
Confidence 34556899999999999999972 2 9999999985
No 11
>COG3975 Predicted protease with the C-terminal PDZ domain [General function prediction only]
Probab=96.21 E-value=0.027 Score=47.06 Aligned_cols=108 Identities=8% Similarity=-0.020 Sum_probs=64.9
Q ss_pred EEeeCCCCeEEEEEEcCC--CCCc--------hhHHHHhhCCCCCCCccCEEE-eCCCCCccccccccceeeccceeccC
Q psy7253 5 HVSDREPLIKVSVVGSAK--DVNS--------VPFLDSWARPAVRDCLDDLIA-IPDFGTGAMENWGLITYRETSILYDE 73 (119)
Q Consensus 5 ~~~~~~~~i~v~v~~~~~--~~~~--------l~~~e~~~~~~yp~~kld~v~-~P~~~~~~men~Gli~~~e~~ll~~~ 73 (119)
+.+.+..+..+.++..-. +.+. ++.-.+.||. -|++|+.+++ +-+-..||||+---........-+.
T Consensus 162 ~~e~~g~ph~~~~~g~~p~~d~~~~~~~~k~ii~~~~~vFg~-~~~~~Y~Fl~~~s~q~~GGlEH~~St~l~~~r~~~~- 239 (558)
T COG3975 162 DFEVTGAPHTIALRGELPNFDKERLASDTKKIIEAEIKVFGS-APFDKYVFLLHLSDQIYGGLEHRRSTALIYDRFGFT- 239 (558)
T ss_pred eeeccCCceeEEEeeccccccHHHHHHHHHHHHHHHHHHhcC-CCccceEEEEEecCCCCCCceecccccccccccccc-
Confidence 334455555566665531 2222 2233457775 8889998876 4555567899753333332221111
Q ss_pred CCCccccchhhhhhHhhhhhHhhhcCCCCCC------------------ChhHHHHHhhhhh
Q psy7253 74 QETSASGHNWVAVVVAHELAPIRHNAKVAKP------------------ANGKTTFVYLFDF 117 (119)
Q Consensus 74 ~~~~~~~~~~~~~~iaHEiaHqWfGnlVt~~------------------~EG~a~y~~~~~~ 117 (119)
+....+....+++||..|-|-+--+.|+ .|||+.|...+.-
T Consensus 240 ---~~~ky~~~l~llsHEyfH~WNvKrIrpa~l~p~~~d~en~t~~lW~~EG~T~Yy~~ll~ 298 (558)
T COG3975 240 ---DQDKYQDLLGLLSHEYFHAWNVKRIRPAALEPFNLDKENYTPLLWFSEGFTSYYDRLLA 298 (558)
T ss_pred ---chhHHHHHHHHHHHHHHHhccceeccccccCCccccccCCCcceeeecCchHHHHHHHH
Confidence 1111345578999999999988666555 8999999887643
No 12
>PF10026 DUF2268: Predicted Zn-dependent protease (DUF2268); InterPro: IPR018728 This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function.
Probab=95.08 E-value=0.064 Score=39.16 Aligned_cols=77 Identities=9% Similarity=-0.039 Sum_probs=45.1
Q ss_pred CCCCccCEEEeCCCCCc--cccc---cccceeeccceec-cCCCCccccchhhhhhHhhhhhHhhh-cCCC------CCC
Q psy7253 38 VRDCLDDLIAIPDFGTG--AMEN---WGLITYRETSILY-DEQETSASGHNWVAVVVAHELAPIRH-NAKV------AKP 104 (119)
Q Consensus 38 yp~~kld~v~~P~~~~~--~men---~Gli~~~e~~ll~-~~~~~~~~~~~~~~~~iaHEiaHqWf-GnlV------t~~ 104 (119)
+|.+.+++.++|.-+.+ .+++ .|-..+....+++ -+. ......+..+||||+.|-+- ...- |.-
T Consensus 16 ~~~~~i~v~i~p~~~~~~~~~~~~g~~g~~~~~~~i~l~~~~~---~~~~~~l~~~iaHE~hH~~r~~~~~~~~~~~TLl 92 (195)
T PF10026_consen 16 LPGPDIPVFIFPADPENPFLIPELGGKGGGAIPGYIFLFLLPN---DYSLEELPALIAHEYHHNCRYEQIGWDPEDTTLL 92 (195)
T ss_pred cCCCCCCEEEEeccCCCcccccccCcccccCCCCEEEEEecCC---cccHHHHHHHHHHHHHHHHHHhccCCCCCCCCHH
Confidence 56668888777644333 2222 3344444443333 222 22345788999999999853 3332 111
Q ss_pred ----ChhHHHHHhhhhh
Q psy7253 105 ----ANGKTTFVYLFDF 117 (119)
Q Consensus 105 ----~EG~a~y~~~~~~ 117 (119)
.||+|.+++...+
T Consensus 93 d~~I~EGlAe~f~~~~~ 109 (195)
T PF10026_consen 93 DSLIMEGLAEYFAEELY 109 (195)
T ss_pred HHHHHhhHHHHHHHHHc
Confidence 9999999987654
No 13
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=93.85 E-value=0.031 Score=43.48 Aligned_cols=34 Identities=21% Similarity=0.163 Sum_probs=26.4
Q ss_pred hhhhhhHhhhhhHhhhcCCCCCC-ChhHHHHHhhh
Q psy7253 82 NWVAVVVAHELAPIRHNAKVAKP-ANGKTTFVYLF 115 (119)
Q Consensus 82 ~~~~~~iaHEiaHqWfGnlVt~~-~EG~a~y~~~~ 115 (119)
..++++|-||+|||=|.----.. ||.|||+.|..
T Consensus 195 ~~lA~LIFHELAHQk~Y~~~DtAFNEsFAtaVEt~ 229 (376)
T COG4324 195 TYLASLIFHELAHQKIYVNNDTAFNESFATAVETS 229 (376)
T ss_pred HHHHHHHHHHHhhheEeecCcchHhHHHHHHHHHH
Confidence 46899999999999776322222 99999999864
No 14
>PF10023 DUF2265: Predicted aminopeptidase (DUF2265); InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=93.40 E-value=0.043 Score=43.60 Aligned_cols=35 Identities=26% Similarity=0.126 Sum_probs=25.6
Q ss_pred hhhhhhHhhhhhHhhhcCCCCCC-ChhHHHHHhhhh
Q psy7253 82 NWVAVVVAHELAPIRHNAKVAKP-ANGKTTFVYLFD 116 (119)
Q Consensus 82 ~~~~~~iaHEiaHqWfGnlVt~~-~EG~a~y~~~~~ 116 (119)
...+.+|-||+|||=+.--=... ||+|||++|..-
T Consensus 163 ~~LA~LIfHELaHq~~Yv~~dt~FNEsfAtfVe~~G 198 (337)
T PF10023_consen 163 GELARLIFHELAHQTLYVKGDTAFNESFATFVEREG 198 (337)
T ss_pred hHHHHHHHHHHhhceeecCCCchhhHHHHHHHHHHH
Confidence 35789999999999544222222 999999998653
No 15
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=91.79 E-value=0.17 Score=37.79 Aligned_cols=36 Identities=14% Similarity=0.191 Sum_probs=28.0
Q ss_pred hhhhHhhhhhHhhhcC----CCCCC-ChhHHHHHhhhhhcC
Q psy7253 84 VAVVVAHELAPIRHNA----KVAKP-ANGKTTFVYLFDFKR 119 (119)
Q Consensus 84 ~~~~iaHEiaHqWfGn----lVt~~-~EG~a~y~~~~~~~~ 119 (119)
...++|||+-|-|.-- ..++. -||+|+.++++|++|
T Consensus 93 ~gsiLAHE~mHa~Lrl~g~~~L~~~vEEGiCqvla~~wL~~ 133 (212)
T PF12315_consen 93 TGSILAHELMHAWLRLNGFPNLSPEVEEGICQVLAYLWLES 133 (212)
T ss_pred HhhHHHHHHHHHHhcccCCCCCChHHHHHHHHHHHHHHHhh
Confidence 4679999999999731 11122 899999999999985
No 16
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=91.01 E-value=0.077 Score=36.96 Aligned_cols=61 Identities=10% Similarity=0.054 Sum_probs=32.9
Q ss_pred HhhCCCCCCCccCEEEeCCCCCccccccccceeeccceeccCCCCccccchhhhhhHhhhhhHhhhc
Q psy7253 32 SWARPAVRDCLDDLIAIPDFGTGAMENWGLITYRETSILYDEQETSASGHNWVAVVVAHELAPIRHN 98 (119)
Q Consensus 32 ~~~~~~yp~~kld~v~~P~~~~~~men~Gli~~~e~~ll~~~~~~~~~~~~~~~~~iaHEiaHqWfG 98 (119)
.+|+.++|.+ .|.. +.-..-.+|.-..+...|-+++..........+..+|.|||||.+..
T Consensus 13 ~~F~~~l~~~---~i~w---~~r~~~~~G~~~~~~~~I~ln~~l~~~~~~~~l~~~l~HEm~H~~~~ 73 (146)
T smart00731 13 RVFGRKLPHP---KVVW---NKRLRKTGGRCLLKSAEIRLNPKLLTENGRDRLRETLLHELCHAALY 73 (146)
T ss_pred HHHCCCCCCC---EEEE---ehhhhhhhHHhhcCCCEEEeCHHHHhhccHHHHHhhHHHHHHHHHHH
Confidence 6888777755 2221 11111224444433444444443322222345678999999999986
No 17
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=88.41 E-value=0.41 Score=30.71 Aligned_cols=32 Identities=19% Similarity=0.169 Sum_probs=20.7
Q ss_pred ccceeccCCCCccccchhhhhhHhhhhhHhhhcCCC
Q psy7253 66 ETSILYDEQETSASGHNWVAVVVAHELAPIRHNAKV 101 (119)
Q Consensus 66 e~~ll~~~~~~~~~~~~~~~~~iaHEiaHqWfGnlV 101 (119)
...+++++..+ ......+++||++|.+++.--
T Consensus 28 ~~~I~in~~~~----~~~~~f~laHELgH~~~~~~~ 59 (122)
T PF06114_consen 28 NPIIFINSNLS----PERQRFTLAHELGHILLHHGD 59 (122)
T ss_dssp TTEEEEESSS-----HHHHHHHHHHHHHHHHHHH-H
T ss_pred CCEEEECCCCC----HHHHHHHHHHHHHHHHhhhcc
Confidence 45555665522 233467999999999998655
No 18
>PRK04860 hypothetical protein; Provisional
Probab=88.33 E-value=0.42 Score=34.19 Aligned_cols=63 Identities=13% Similarity=0.066 Sum_probs=35.1
Q ss_pred hhHHHHhhCCCCCCCccCEEEeCCCCCcccc--ccccceeeccceeccCCCCccccchhhhhhHhhhhhHhhhc
Q psy7253 27 VPFLDSWARPAVRDCLDDLIAIPDFGTGAME--NWGLITYRETSILYDEQETSASGHNWVAVVVAHELAPIRHN 98 (119)
Q Consensus 27 l~~~e~~~~~~yp~~kld~v~~P~~~~~~me--n~Gli~~~e~~ll~~~~~~~~~~~~~~~~~iaHEiaHqWfG 98 (119)
++..+++|+.+++-+...+=. ....+|+. ...-|-++... +.+ .....+..+|+||+||-|--
T Consensus 13 ~~~a~~~f~~~f~~p~~~f~~--R~rtaG~~~l~~~~I~~Np~l--l~~-----~~~~~l~~~v~HEl~H~~~~ 77 (160)
T PRK04860 13 LAQANLYFKRTFPEPKVSYTQ--RGTSAGTAWLQSNEIRLNPVL--LLE-----NQQAFIDEVVPHELAHLLVY 77 (160)
T ss_pred HHHHHHHhCCCCCCCEEEEee--cchhhcchhHhcCCeeeCHHH--Hhh-----CcHHHHHhHHHHHHHHHHHH
Confidence 456788999888875543322 12223332 22234444332 221 23456778999999998743
No 19
>PRK04351 hypothetical protein; Provisional
Probab=84.90 E-value=0.66 Score=32.77 Aligned_cols=16 Identities=19% Similarity=0.233 Sum_probs=13.0
Q ss_pred chhhhhhHhhhhhHhh
Q psy7253 81 HNWVAVVVAHELAPIR 96 (119)
Q Consensus 81 ~~~~~~~iaHEiaHqW 96 (119)
...+..+|+||++|=.
T Consensus 58 ~~~l~~vv~HElcH~~ 73 (149)
T PRK04351 58 LEELIGIIKHELCHYH 73 (149)
T ss_pred HHHHHhhHHHHHHHHH
Confidence 4567899999999953
No 20
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=83.49 E-value=1.2 Score=37.03 Aligned_cols=53 Identities=19% Similarity=0.192 Sum_probs=35.7
Q ss_pred CCCCccCEEEeCCCCCccccccccceeeccceeccCCCCccccchhhhhhHhhhhhHh
Q psy7253 38 VRDCLDDLIAIPDFGTGAMENWGLITYRETSILYDEQETSASGHNWVAVVVAHELAPI 95 (119)
Q Consensus 38 yp~~kld~v~~P~~~~~~men~Gli~~~e~~ll~~~~~~~~~~~~~~~~~iaHEiaHq 95 (119)
.|-.-++++.|-|=..++-..+|--++--+-++...+. ...++.+||||++|-
T Consensus 89 ~~~~~f~f~lV~d~~iNAFA~~Gg~v~vntGLll~ae~-----esElagViAHEigHv 141 (484)
T COG4783 89 LVKTPFTFFLVNDDSINAFATPGGYVVVNTGLLLTAEN-----ESELAGVIAHEIGHV 141 (484)
T ss_pred CCCCCeEEEEecCCccchhhcCCceEEEehHHHHhcCC-----HHHHHHHHHHHHHHH
Confidence 45566889998877777777766555544444443332 246889999999994
No 21
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=83.38 E-value=0.43 Score=33.02 Aligned_cols=20 Identities=15% Similarity=0.046 Sum_probs=16.2
Q ss_pred chhhhhhHhhhhhHhhhcCC
Q psy7253 81 HNWVAVVVAHELAPIRHNAK 100 (119)
Q Consensus 81 ~~~~~~~iaHEiaHqWfGnl 100 (119)
...+..+|.|||+|.|..-.
T Consensus 57 ~~~~~~tL~HEm~H~~~~~~ 76 (157)
T PF10263_consen 57 EEELIDTLLHEMAHAAAYVF 76 (157)
T ss_pred HHHHHHHHHHHHHHHHhhhc
Confidence 45678899999999998543
No 22
>PF04450 BSP: Peptidase of plants and bacteria; InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=82.90 E-value=1.5 Score=32.56 Aligned_cols=36 Identities=17% Similarity=0.104 Sum_probs=27.1
Q ss_pred cchhhhhhHhhhhhHhhhcCCCC--CC--ChhHHHHHhhh
Q psy7253 80 GHNWVAVVVAHELAPIRHNAKVA--KP--ANGKTTFVYLF 115 (119)
Q Consensus 80 ~~~~~~~~iaHEiaHqWfGnlVt--~~--~EG~a~y~~~~ 115 (119)
.+..+..++.||++|-|=.+--. |. -||+|.|+-..
T Consensus 92 ~~~Ei~Gvl~HE~~H~~Q~~~~~~~P~~liEGIADyVRl~ 131 (205)
T PF04450_consen 92 VRDEIIGVLYHEMVHCWQWDGRGTAPGGLIEGIADYVRLK 131 (205)
T ss_pred hHHHHHHHHHHHHHHHhhcCCCCCCChhheecHHHHHHHH
Confidence 34678899999999977655433 22 99999998654
No 23
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=81.92 E-value=0.66 Score=36.54 Aligned_cols=29 Identities=17% Similarity=0.287 Sum_probs=21.0
Q ss_pred hhhhhHhhhhhHhhhcCCCCCCChhHHHHHhhhh
Q psy7253 83 WVAVVVAHELAPIRHNAKVAKPANGKTTFVYLFD 116 (119)
Q Consensus 83 ~~~~~iaHEiaHqWfGnlVt~~~EG~a~y~~~~~ 116 (119)
..-.++|||+|||= .-.+|.=|+|+++++
T Consensus 195 ~~P~T~~HElAHq~-----G~a~E~EANFiayLa 223 (318)
T PF12725_consen 195 SLPFTICHELAHQL-----GFASEDEANFIAYLA 223 (318)
T ss_pred cccHHHHHHHHHHh-----CCCCHHHHHHHHHHH
Confidence 34579999999992 233777777777765
No 24
>PF10460 Peptidase_M30: Peptidase M30; InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue.
Probab=77.74 E-value=1.9 Score=34.78 Aligned_cols=38 Identities=13% Similarity=-0.065 Sum_probs=27.3
Q ss_pred chhhhhhHhhhhhHhh--hcCCCC------CC---ChhHHHHHhhhhhc
Q psy7253 81 HNWVAVVVAHELAPIR--HNAKVA------KP---ANGKTTFVYLFDFK 118 (119)
Q Consensus 81 ~~~~~~~iaHEiaHqW--fGnlVt------~~---~EG~a~y~~~~~~~ 118 (119)
...+..++|||+-|+= --+.|. ++ +||+|.-.|.++-.
T Consensus 136 ~~~~~sTlAHEfQHmInfy~~~v~~g~~~~~dtWLnE~lS~~aEdl~s~ 184 (366)
T PF10460_consen 136 PDTVYSTLAHEFQHMINFYQRGVLHGKQYAMDTWLNEMLSMSAEDLYSS 184 (366)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccHHHHHHHHHHHHHHhc
Confidence 3457789999999983 323332 23 99999999988654
No 25
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=76.52 E-value=5 Score=28.64 Aligned_cols=15 Identities=40% Similarity=0.317 Sum_probs=12.4
Q ss_pred hhhhhhHhhhhhHhh
Q psy7253 82 NWVAVVVAHELAPIR 96 (119)
Q Consensus 82 ~~~~~~iaHEiaHqW 96 (119)
...+.++||||+|+.
T Consensus 129 ~~~a~~~AHElGH~l 143 (194)
T cd04269 129 LLFAVTMAHELGHNL 143 (194)
T ss_pred HHHHHHHHHHHHhhc
Confidence 345789999999985
No 26
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=75.50 E-value=1.6 Score=31.11 Aligned_cols=13 Identities=31% Similarity=0.115 Sum_probs=11.1
Q ss_pred hhhhHhhhhhHhh
Q psy7253 84 VAVVVAHELAPIR 96 (119)
Q Consensus 84 ~~~~iaHEiaHqW 96 (119)
...++||||.||+
T Consensus 111 ~~~~~aHElGH~l 123 (173)
T PF13574_consen 111 GIDTFAHELGHQL 123 (173)
T ss_dssp HHHHHHHHHHHHH
T ss_pred eeeeehhhhHhhc
Confidence 4568999999996
No 27
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=75.22 E-value=12 Score=27.35 Aligned_cols=14 Identities=29% Similarity=0.418 Sum_probs=11.7
Q ss_pred hhhhhHhhhhhHhh
Q psy7253 83 WVAVVVAHELAPIR 96 (119)
Q Consensus 83 ~~~~~iaHEiaHqW 96 (119)
..+.++|||++|..
T Consensus 144 ~~~~~~AHElGH~l 157 (220)
T cd04272 144 YGVYTMTHELAHLL 157 (220)
T ss_pred ccHHHHHHHHHHHh
Confidence 45789999999974
No 28
>PF08325 WLM: WLM domain; InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=72.85 E-value=1.8 Score=31.59 Aligned_cols=20 Identities=25% Similarity=0.159 Sum_probs=17.0
Q ss_pred cchhhhhhHhhhhhHhhhcC
Q psy7253 80 GHNWVAVVVAHELAPIRHNA 99 (119)
Q Consensus 80 ~~~~~~~~iaHEiaHqWfGn 99 (119)
....+..++.||+||.++|+
T Consensus 78 ~~~~i~~t~lHELaH~~~~~ 97 (186)
T PF08325_consen 78 PYETILGTMLHELAHNVHGP 97 (186)
T ss_pred eHHHHHHHHHHHHHhcccCC
Confidence 44678899999999999887
No 29
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=71.92 E-value=2.6 Score=28.73 Aligned_cols=15 Identities=27% Similarity=0.173 Sum_probs=12.9
Q ss_pred hhhhhHhhhhhHhhh
Q psy7253 83 WVAVVVAHELAPIRH 97 (119)
Q Consensus 83 ~~~~~iaHEiaHqWf 97 (119)
-+..+++||++|-|=
T Consensus 78 GC~~TL~HEL~H~WQ 92 (141)
T PHA02456 78 GCRDTLAHELNHAWQ 92 (141)
T ss_pred chHHHHHHHHHHHHh
Confidence 467899999999994
No 30
>KOG2661|consensus
Probab=70.91 E-value=2.8 Score=33.75 Aligned_cols=19 Identities=26% Similarity=0.335 Sum_probs=14.9
Q ss_pred cchhhhhhHhhhhhHhhhc
Q psy7253 80 GHNWVAVVVAHELAPIRHN 98 (119)
Q Consensus 80 ~~~~~~~~iaHEiaHqWfG 98 (119)
+...++.+++||+|||=-+
T Consensus 271 ~ddglAtvLgHE~aHaVar 289 (424)
T KOG2661|consen 271 DDDGLATVLGHEIAHAVAR 289 (424)
T ss_pred ChHHHHHHHHHHHHHHHHH
Confidence 3456899999999999443
No 31
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=70.20 E-value=2.7 Score=30.25 Aligned_cols=23 Identities=35% Similarity=0.505 Sum_probs=18.3
Q ss_pred chhhhhhHhhhhhHhhhcCCCCC
Q psy7253 81 HNWVAVVVAHELAPIRHNAKVAK 103 (119)
Q Consensus 81 ~~~~~~~iaHEiaHqWfGnlVt~ 103 (119)
...++.++|||++|---++....
T Consensus 86 ~~el~aVlaHElgH~~~~h~~~~ 108 (226)
T PF01435_consen 86 EDELAAVLAHELGHIKHRHILKS 108 (226)
T ss_dssp HHHHHHHHHHHHHHHHTTHCCCC
T ss_pred HHHHHHHHHHHHHHHHcCCcchH
Confidence 35688999999999988866544
No 32
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=68.14 E-value=8.1 Score=28.06 Aligned_cols=13 Identities=38% Similarity=0.508 Sum_probs=10.8
Q ss_pred hhhhhHhhhhhHh
Q psy7253 83 WVAVVVAHELAPI 95 (119)
Q Consensus 83 ~~~~~iaHEiaHq 95 (119)
..+.++||||+|.
T Consensus 139 ~~a~~~aHElGH~ 151 (207)
T cd04273 139 SSAFTIAHELGHV 151 (207)
T ss_pred eeEEeeeeechhh
Confidence 3578999999996
No 33
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=64.47 E-value=3.3 Score=25.96 Aligned_cols=13 Identities=38% Similarity=0.468 Sum_probs=10.5
Q ss_pred hhhhHhhhhhHhh
Q psy7253 84 VAVVVAHELAPIR 96 (119)
Q Consensus 84 ~~~~iaHEiaHqW 96 (119)
-..+++||++|=+
T Consensus 61 ~~~llaHEl~Hv~ 73 (79)
T PF13699_consen 61 GRALLAHELAHVV 73 (79)
T ss_pred cchhHhHHHHHHH
Confidence 4579999999954
No 34
>PF08014 DUF1704: Domain of unknown function (DUF1704); InterPro: IPR012548 This family contains many hypothetical proteins.
Probab=63.54 E-value=23 Score=28.38 Aligned_cols=67 Identities=9% Similarity=0.025 Sum_probs=42.3
Q ss_pred ccCEEEeCCCCCccccccccceeeccceeccCCCCccccchhhhhhHhhhhhHh-------------hhcCCCCCC---C
Q psy7253 42 LDDLIAIPDFGTGAMENWGLITYRETSILYDEQETSASGHNWVAVVVAHELAPI-------------RHNAKVAKP---A 105 (119)
Q Consensus 42 kld~v~~P~~~~~~men~Gli~~~e~~ll~~~~~~~~~~~~~~~~~iaHEiaHq-------------WfGnlVt~~---~ 105 (119)
+..+.+.++....+|-.-+-|..+.+..+ ....+..++.||+.-+ |++.=.-+. -
T Consensus 132 ~~~V~~sddl~a~A~v~~~~l~I~~~~~f---------s~~~l~~L~~HEigvH~lt~~Ng~~QPl~~l~~Glp~~~~TQ 202 (349)
T PF08014_consen 132 EVKVELSDDLLARAMVSGDRLKINKNAMF---------SERDLEALLHHEIGVHLLTTLNGRAQPLKILSLGLPGYTPTQ 202 (349)
T ss_pred eEEEEEcCCcchhhcccCCeeEEcCCCCc---------CHHHHHHHHHHhhhhhhccccccccCCcHHhCCCCCCCCCCc
Confidence 45555566777777766655555554321 2456788999999433 554222121 9
Q ss_pred hhHHHHHhhhhh
Q psy7253 106 NGKTTFVYLFDF 117 (119)
Q Consensus 106 EG~a~y~~~~~~ 117 (119)
||+|.+.|++..
T Consensus 203 EGLAvl~E~l~g 214 (349)
T PF08014_consen 203 EGLAVLSEYLSG 214 (349)
T ss_pred hHHHHHHHHHhC
Confidence 999999998753
No 35
>PRK03001 M48 family peptidase; Provisional
Probab=62.85 E-value=4.7 Score=30.97 Aligned_cols=19 Identities=37% Similarity=0.522 Sum_probs=14.4
Q ss_pred chhhhhhHhhhhhHhhhcC
Q psy7253 81 HNWVAVVVAHELAPIRHNA 99 (119)
Q Consensus 81 ~~~~~~~iaHEiaHqWfGn 99 (119)
...+.-++|||++|-=-++
T Consensus 121 ~~El~aVlAHElgHi~~~h 139 (283)
T PRK03001 121 EREIRGVMAHELAHVKHRD 139 (283)
T ss_pred HHHHHHHHHHHHHHHhCCC
Confidence 3568899999999964443
No 36
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=62.21 E-value=6.4 Score=28.17 Aligned_cols=35 Identities=29% Similarity=0.340 Sum_probs=22.1
Q ss_pred cccccceeeccceeccCCCCccccchhhhhhHhhhhhHhhhcC
Q psy7253 57 ENWGLITYRETSILYDEQETSASGHNWVAVVVAHELAPIRHNA 99 (119)
Q Consensus 57 en~Gli~~~e~~ll~~~~~~~~~~~~~~~~~iaHEiaHqWfGn 99 (119)
..-|.|.++-.-+.+. ...++.+|+||+||--.-|
T Consensus 145 ~~~~~I~ln~~L~~~P--------~~~idYVvvHEL~Hl~~~n 179 (205)
T PF01863_consen 145 SSKGNITLNWRLVMAP--------PEVIDYVVVHELCHLRHPN 179 (205)
T ss_pred CCCCcEEeecccccCC--------ccHHHHHHHHHHHHhccCC
Confidence 4455666665543322 2357889999999976533
No 37
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=60.23 E-value=4.8 Score=28.65 Aligned_cols=15 Identities=27% Similarity=0.147 Sum_probs=12.1
Q ss_pred hhhhhhHhhhhhHhh
Q psy7253 82 NWVAVVVAHELAPIR 96 (119)
Q Consensus 82 ~~~~~~iaHEiaHqW 96 (119)
.....++||||.|.+
T Consensus 140 ~~~~~~~AHEiGH~l 154 (196)
T PF13688_consen 140 YNGAITFAHEIGHNL 154 (196)
T ss_dssp HHHHHHHHHHHHHHT
T ss_pred CceehhhHHhHHHhc
Confidence 345689999999975
No 38
>COG5504 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=59.36 E-value=6.4 Score=30.47 Aligned_cols=40 Identities=20% Similarity=0.140 Sum_probs=30.3
Q ss_pred cchhhhhhHhhhhhHh-------hhcCCCCCC----ChhHHHHHhhhhhcC
Q psy7253 80 GHNWVAVVVAHELAPI-------RHNAKVAKP----ANGKTTFVYLFDFKR 119 (119)
Q Consensus 80 ~~~~~~~~iaHEiaHq-------WfGnlVt~~----~EG~a~y~~~~~~~~ 119 (119)
..+++-.+||||+=|- |-+..||.- .||+|.+.....|-|
T Consensus 136 ~~~~v~aliaHE~HH~~R~~~i~~~eg~vtLle~lV~EGLAE~av~E~~Gr 186 (280)
T COG5504 136 TITSVPALIAHEYHHNCRLRYIDYGEGSVTLLEALVMEGLAEHAVFELFGR 186 (280)
T ss_pred CccchHHHHHHHHHhhheecccccCCCceeHHHHHHHHHHHHHHHHHHhCc
Confidence 3456788999999886 666667666 999999976665544
No 39
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=55.87 E-value=5.7 Score=27.97 Aligned_cols=64 Identities=16% Similarity=0.160 Sum_probs=26.7
Q ss_pred hHHHHhhCC-CCCCCccCEEEeCCCCCccccccccceeeccceeccCCCC-ccccchhhhhhHhhhhhHh
Q psy7253 28 PFLDSWARP-AVRDCLDDLIAIPDFGTGAMENWGLITYRETSILYDEQET-SASGHNWVAVVVAHELAPI 95 (119)
Q Consensus 28 ~~~e~~~~~-~yp~~kld~v~~P~~~~~~men~Gli~~~e~~ll~~~~~~-~~~~~~~~~~~iaHEiaHq 95 (119)
+|+.++++- ++--.-..+++.=.+. ....|. +..-+.+.|..... .........-++||||+|-
T Consensus 81 d~y~~~~gr~siD~~G~~~~~~Vhyg-~~~~NA---fW~g~~m~yGdG~~~~f~~~~~~lDVvaHEltHG 146 (150)
T PF01447_consen 81 DYYKNVFGRNSIDGNGMPIISRVHYG-KNYNNA---FWNGSQMVYGDGDGQIFKPFASSLDVVAHELTHG 146 (150)
T ss_dssp HHHHHHHSS-STTSS-S-EEEEESES-SSTT-E---EE-SSSEEEE---SSSBS-GGG-HHHHHHHHHHH
T ss_pred HHHHHHHCCCCcCCCCcEEEEEEeCC-CCccCc---cccCCEEEEECCCCcccccCccccceeeeccccc
Confidence 477788874 3443344455432332 123332 12222334433211 1111122246999999994
No 40
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=54.21 E-value=6.7 Score=29.62 Aligned_cols=56 Identities=18% Similarity=0.160 Sum_probs=33.9
Q ss_pred CCccCEEEeCCCCCccccc---cccceeeccceeccCCCCccccchhhhhhHhhhhhHhhhcCCCC
Q psy7253 40 DCLDDLIAIPDFGTGAMEN---WGLITYRETSILYDEQETSASGHNWVAVVVAHELAPIRHNAKVA 102 (119)
Q Consensus 40 ~~kld~v~~P~~~~~~men---~Gli~~~e~~ll~~~~~~~~~~~~~~~~~iaHEiaHqWfGnlVt 102 (119)
.+++.++-.|.-+.-++.+ .|.|+..+.- .+ . -....+.-++|||++|.=-++.++
T Consensus 117 ~~~v~i~~~~~~NAFa~g~~~~~~~V~vt~gL--l~-~----l~~dEl~aVlaHElgHi~~rd~~~ 175 (302)
T COG0501 117 MPEVYILETPQPNAFALGGGPKNGRVVVTTGL--LD-L----LNDDELEAVLAHELGHIKNRHTLV 175 (302)
T ss_pred CCeeEEecCCCccceecCCCCCCeeEEecHHH--Hh-h----CCHHHHHHHHHHHHHHHhcccHHH
Confidence 3555555555433334444 3666666652 22 1 123568889999999998887775
No 41
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=53.09 E-value=5.7 Score=28.59 Aligned_cols=25 Identities=24% Similarity=0.203 Sum_probs=15.4
Q ss_pred cCCCCccccchhhhhhHhhhhhHhh
Q psy7253 72 DEQETSASGHNWVAVVVAHELAPIR 96 (119)
Q Consensus 72 ~~~~~~~~~~~~~~~~iaHEiaHqW 96 (119)
+.........-.+..+|||||.|--
T Consensus 24 ~~~~p~~~~yg~lG~ilahel~haf 48 (206)
T PF01431_consen 24 DPNYPPALNYGGLGFILAHELMHAF 48 (206)
T ss_dssp -TTS-HHHHHHTHHHHHHHHHHHCT
T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHH
Confidence 3333333334457789999999964
No 42
>PRK03072 heat shock protein HtpX; Provisional
Probab=52.94 E-value=7.2 Score=30.14 Aligned_cols=21 Identities=19% Similarity=0.253 Sum_probs=15.6
Q ss_pred chhhhhhHhhhhhHhhhcCCC
Q psy7253 81 HNWVAVVVAHELAPIRHNAKV 101 (119)
Q Consensus 81 ~~~~~~~iaHEiaHqWfGnlV 101 (119)
...+.-++|||++|-==|+..
T Consensus 124 ~~El~aVlAHElgHi~~~d~~ 144 (288)
T PRK03072 124 ERELRGVLGHELSHVYNRDIL 144 (288)
T ss_pred HHHHHHHHHHHHHHHhcCCHH
Confidence 356888999999997555543
No 43
>KOG2719|consensus
Probab=52.35 E-value=16 Score=30.19 Aligned_cols=65 Identities=18% Similarity=0.155 Sum_probs=35.3
Q ss_pred chhHHHHhhCCCCCCCccCEEEeCCCCCcccccc---ccceeeccceeccCCCC--ccccchhhhhhHhhhhhHh
Q psy7253 26 SVPFLDSWARPAVRDCLDDLIAIPDFGTGAMENW---GLITYRETSILYDEQET--SASGHNWVAVVVAHELAPI 95 (119)
Q Consensus 26 ~l~~~e~~~~~~yp~~kld~v~~P~~~~~~men~---Gli~~~e~~ll~~~~~~--~~~~~~~~~~~iaHEiaHq 95 (119)
+++.+.+-. .+|..|+.++-... .++-.|. |+ .-.++.++||.-.. ....-.++..+++||+.|=
T Consensus 222 ~Ie~la~s~--gfp~~k~~vi~~s~--rs~hsNAyfyG~-~~~KRIvIyDtLl~~~~~~~~eel~AVl~HELGHW 291 (428)
T KOG2719|consen 222 KIERLADSV--GFPLSKYRVIDGSK--RSSHSNAYFYGL-CKNKRIVIYDTLLLEEEHLNNEELVAVLAHELGHW 291 (428)
T ss_pred HHHHHHHhc--CCCceEEEEEecCC--CCCCCCeeeeec-cccceEEEehhhhhhhhccccHHHHHHHHHHhhHH
Confidence 444444444 48999998887422 1233442 33 22333455554321 0012246788999999994
No 44
>PRK03982 heat shock protein HtpX; Provisional
Probab=51.68 E-value=7.8 Score=29.81 Aligned_cols=19 Identities=32% Similarity=0.470 Sum_probs=14.9
Q ss_pred chhhhhhHhhhhhHhhhcC
Q psy7253 81 HNWVAVVVAHELAPIRHNA 99 (119)
Q Consensus 81 ~~~~~~~iaHEiaHqWfGn 99 (119)
...+.-++|||++|-=-++
T Consensus 122 ~~El~AVlAHElgHi~~~h 140 (288)
T PRK03982 122 EDELEGVIAHELTHIKNRD 140 (288)
T ss_pred HHHHHHHHHHHHHHHHcCC
Confidence 4568899999999975444
No 45
>PRK01345 heat shock protein HtpX; Provisional
Probab=51.27 E-value=7.9 Score=30.41 Aligned_cols=21 Identities=38% Similarity=0.465 Sum_probs=16.0
Q ss_pred chhhhhhHhhhhhHhhhcCCC
Q psy7253 81 HNWVAVVVAHELAPIRHNAKV 101 (119)
Q Consensus 81 ~~~~~~~iaHEiaHqWfGnlV 101 (119)
...+.-++|||++|.=-++..
T Consensus 121 ~dEL~aVlAHElgHi~~~d~~ 141 (317)
T PRK01345 121 PEEVAGVMAHELAHVKNRDTL 141 (317)
T ss_pred HHHHHHHHHHHHHHHHcCCHH
Confidence 356889999999998555543
No 46
>PRK05457 heat shock protein HtpX; Provisional
Probab=50.64 E-value=8.3 Score=29.81 Aligned_cols=22 Identities=32% Similarity=0.445 Sum_probs=17.7
Q ss_pred chhhhhhHhhhhhHhhhcCCCC
Q psy7253 81 HNWVAVVVAHELAPIRHNAKVA 102 (119)
Q Consensus 81 ~~~~~~~iaHEiaHqWfGnlVt 102 (119)
...+.-++|||++|.=-|+...
T Consensus 131 ~~El~aVlAHElgHi~~~d~~~ 152 (284)
T PRK05457 131 RDEVEAVLAHEISHIANGDMVT 152 (284)
T ss_pred HHHHHHHHHHHHHHHHcCCHHH
Confidence 3568899999999997777554
No 47
>PRK04897 heat shock protein HtpX; Provisional
Probab=50.58 E-value=8.2 Score=29.94 Aligned_cols=21 Identities=24% Similarity=0.306 Sum_probs=15.9
Q ss_pred chhhhhhHhhhhhHhhhcCCC
Q psy7253 81 HNWVAVVVAHELAPIRHNAKV 101 (119)
Q Consensus 81 ~~~~~~~iaHEiaHqWfGnlV 101 (119)
...+.-++|||++|-=-|+..
T Consensus 134 ~~El~aVlAHElgHi~~~d~~ 154 (298)
T PRK04897 134 REELEGVIGHEISHIRNYDIR 154 (298)
T ss_pred HHHHHHHHHHHHHHHhcCCHH
Confidence 356888999999997555543
No 48
>PF09768 Peptidase_M76: Peptidase M76 family; InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=50.43 E-value=49 Score=23.92 Aligned_cols=17 Identities=18% Similarity=0.229 Sum_probs=13.9
Q ss_pred cchhhhhhHhhhhhHhh
Q psy7253 80 GHNWVAVVVAHELAPIR 96 (119)
Q Consensus 80 ~~~~~~~~iaHEiaHqW 96 (119)
.+..+..+++|||.|-|
T Consensus 67 ~~~~l~~~l~HELIHay 83 (173)
T PF09768_consen 67 SQGHLEDTLTHELIHAY 83 (173)
T ss_pred CHHHHHHHHHHHHHHHH
Confidence 34567899999999975
No 49
>PF13265 DUF4056: Protein of unknown function (DUF4056)
Probab=49.39 E-value=7.6 Score=30.01 Aligned_cols=21 Identities=19% Similarity=0.172 Sum_probs=11.6
Q ss_pred hhhhhHhhhcCCCCCC-ChhHHH
Q psy7253 89 AHELAPIRHNAKVAKP-ANGKTT 110 (119)
Q Consensus 89 aHEiaHqWfGnlVt~~-~EG~a~ 110 (119)
-|||| ||||-.=-+. .|+.+.
T Consensus 153 WHEIA-QWyG~~Sv~GfsE~~SA 174 (270)
T PF13265_consen 153 WHEIA-QWYGYQSVPGFSEGISA 174 (270)
T ss_pred HHHHH-HHcCccccCCCCccccc
Confidence 37776 8999432222 444443
No 50
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=48.86 E-value=7.9 Score=27.50 Aligned_cols=14 Identities=36% Similarity=0.351 Sum_probs=11.4
Q ss_pred hhhhhhHhhhhhHh
Q psy7253 82 NWVAVVVAHELAPI 95 (119)
Q Consensus 82 ~~~~~~iaHEiaHq 95 (119)
...+.++|||++|.
T Consensus 131 ~~~~~~~aHElGH~ 144 (192)
T cd04267 131 LLTALTMAHELGHN 144 (192)
T ss_pred eeehhhhhhhHHhh
Confidence 34678999999985
No 51
>PF05569 Peptidase_M56: BlaR1 peptidase M56; InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription.
Probab=48.59 E-value=23 Score=27.02 Aligned_cols=15 Identities=33% Similarity=0.466 Sum_probs=12.1
Q ss_pred chhhhhhHhhhhhHh
Q psy7253 81 HNWVAVVVAHELAPI 95 (119)
Q Consensus 81 ~~~~~~~iaHEiaHq 95 (119)
.+....+++||++|-
T Consensus 193 ~~el~~il~HEl~Hi 207 (299)
T PF05569_consen 193 EEELRAILLHELAHI 207 (299)
T ss_pred HHHHHHHHHHHHHHH
Confidence 345678999999996
No 52
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=46.93 E-value=7.4 Score=33.42 Aligned_cols=26 Identities=23% Similarity=0.336 Sum_probs=19.1
Q ss_pred ccCCCCccccchhhhhhHhhhhhHhh
Q psy7253 71 YDEQETSASGHNWVAVVVAHELAPIR 96 (119)
Q Consensus 71 ~~~~~~~~~~~~~~~~~iaHEiaHqW 96 (119)
|+++.++....=-+..+|+|||.|..
T Consensus 474 fd~ea~~a~NYGgIGaVIgHEI~HgF 499 (654)
T COG3590 474 FDPEADSAANYGGIGAVIGHEIGHGF 499 (654)
T ss_pred CCCCcchhhcccCccceehhhhcccc
Confidence 55555555566667889999999973
No 53
>PRK02391 heat shock protein HtpX; Provisional
Probab=46.79 E-value=10 Score=29.49 Aligned_cols=21 Identities=24% Similarity=0.403 Sum_probs=16.0
Q ss_pred chhhhhhHhhhhhHhhhcCCC
Q psy7253 81 HNWVAVVVAHELAPIRHNAKV 101 (119)
Q Consensus 81 ~~~~~~~iaHEiaHqWfGnlV 101 (119)
...+.-++|||++|---++..
T Consensus 130 ~~El~aVlaHElgHi~~~di~ 150 (296)
T PRK02391 130 PDELEAVLAHELSHVKNRDVA 150 (296)
T ss_pred HHHHHHHHHHHHHHHHcCCHH
Confidence 356788999999997666543
No 54
>PRK02870 heat shock protein HtpX; Provisional
Probab=46.44 E-value=10 Score=30.14 Aligned_cols=17 Identities=47% Similarity=0.704 Sum_probs=13.6
Q ss_pred chhhhhhHhhhhhHhhh
Q psy7253 81 HNWVAVVVAHELAPIRH 97 (119)
Q Consensus 81 ~~~~~~~iaHEiaHqWf 97 (119)
...+.-++|||++|-=-
T Consensus 170 ~dEL~aVlAHELgHik~ 186 (336)
T PRK02870 170 RDELQAVMAHELSHIRH 186 (336)
T ss_pred HHHHHHHHHHHHHHHHc
Confidence 45688999999999633
No 55
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=45.43 E-value=11 Score=28.21 Aligned_cols=18 Identities=33% Similarity=0.523 Sum_probs=14.3
Q ss_pred hhhhhhHhhhhhHhhhcC
Q psy7253 82 NWVAVVVAHELAPIRHNA 99 (119)
Q Consensus 82 ~~~~~~iaHEiaHqWfGn 99 (119)
..+..+++||+||-=..|
T Consensus 173 ~~i~YVvvHELaHLke~n 190 (223)
T COG1451 173 EVIDYVVVHELAHLKEKN 190 (223)
T ss_pred HHHHHHHHHHHHHHhhhh
Confidence 346789999999976665
No 56
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=45.42 E-value=32 Score=25.69 Aligned_cols=43 Identities=12% Similarity=0.125 Sum_probs=26.1
Q ss_pred ceeccCCCCccccchhhhhhHhhhhhHhhhcCCC------CCC------ChhHHHHHhh
Q psy7253 68 SILYDEQETSASGHNWVAVVVAHELAPIRHNAKV------AKP------ANGKTTFVYL 114 (119)
Q Consensus 68 ~ll~~~~~~~~~~~~~~~~~iaHEiaHqWfGnlV------t~~------~EG~a~y~~~ 114 (119)
.++++.+.+. .+-..++|||+.|=|+.--. .+. .|-.|++++.
T Consensus 60 ~I~iN~n~~~----~r~rFtlAHELGH~llH~~~~~~~~~~~~~~~~~~~E~~AN~FAa 114 (213)
T COG2856 60 VIYINANNSL----ERKRFTLAHELGHALLHTDLNTRFDAEPTLQQDRKIEAEANAFAA 114 (213)
T ss_pred eEEEeCCCCH----HHHHHHHHHHHhHHHhccccchhhhcccccchhHHHHHHHHHHHH
Confidence 4555554322 23457999999999997442 121 5666666654
No 57
>PF14891 Peptidase_M91: Effector protein
Probab=43.15 E-value=14 Score=26.27 Aligned_cols=15 Identities=33% Similarity=0.153 Sum_probs=12.1
Q ss_pred hhhhhHhhhhhHhhh
Q psy7253 83 WVAVVVAHELAPIRH 97 (119)
Q Consensus 83 ~~~~~iaHEiaHqWf 97 (119)
.-..+++|||+|=|-
T Consensus 102 ~p~v~L~HEL~HA~~ 116 (174)
T PF14891_consen 102 PPFVVLYHELIHAYD 116 (174)
T ss_pred HHHHHHHHHHHHHHH
Confidence 346799999999874
No 58
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=42.33 E-value=13 Score=27.92 Aligned_cols=13 Identities=38% Similarity=0.342 Sum_probs=11.1
Q ss_pred hhhhHhhhhhHhh
Q psy7253 84 VAVVVAHELAPIR 96 (119)
Q Consensus 84 ~~~~iaHEiaHqW 96 (119)
.+.++||||.|-+
T Consensus 167 ~a~t~AHElGHnl 179 (244)
T cd04270 167 SDLVTAHELGHNF 179 (244)
T ss_pred HHHHHHHHHHHhc
Confidence 4679999999985
No 59
>PF15641 Tox-MPTase5: Metallopeptidase toxin 5
Probab=42.14 E-value=19 Score=23.58 Aligned_cols=15 Identities=13% Similarity=0.131 Sum_probs=9.9
Q ss_pred hhHhhhhhHhhhcCC
Q psy7253 86 VVVAHELAPIRHNAK 100 (119)
Q Consensus 86 ~~iaHEiaHqWfGnl 100 (119)
.+|-.|+-|.||---
T Consensus 67 ~iiheelhhrw~~rg 81 (109)
T PF15641_consen 67 TIIHEELHHRWWKRG 81 (109)
T ss_pred HHHHHHHHHHHHHhh
Confidence 345557888899643
No 60
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=42.06 E-value=14 Score=26.39 Aligned_cols=15 Identities=40% Similarity=0.379 Sum_probs=12.3
Q ss_pred chhhhhhHhhhhhHh
Q psy7253 81 HNWVAVVVAHELAPI 95 (119)
Q Consensus 81 ~~~~~~~iaHEiaHq 95 (119)
....+.++||||+|-
T Consensus 128 ~~~~a~~~AHelGH~ 142 (199)
T PF01421_consen 128 GLSFAVIIAHELGHN 142 (199)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHh
Confidence 455788999999995
No 61
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=41.96 E-value=14 Score=24.14 Aligned_cols=12 Identities=25% Similarity=0.182 Sum_probs=10.1
Q ss_pred hhhHhhhhhHhh
Q psy7253 85 AVVVAHELAPIR 96 (119)
Q Consensus 85 ~~~iaHEiaHqW 96 (119)
..+++|||.|..
T Consensus 108 ~~~~~HEiGH~l 119 (124)
T PF13582_consen 108 VDTFAHEIGHNL 119 (124)
T ss_dssp TTHHHHHHHHHT
T ss_pred ceEeeehhhHhc
Confidence 489999999973
No 62
>PRK01265 heat shock protein HtpX; Provisional
Probab=41.08 E-value=14 Score=29.25 Aligned_cols=20 Identities=25% Similarity=0.469 Sum_probs=15.1
Q ss_pred chhhhhhHhhhhhHhhhcCC
Q psy7253 81 HNWVAVVVAHELAPIRHNAK 100 (119)
Q Consensus 81 ~~~~~~~iaHEiaHqWfGnl 100 (119)
...+.-++|||++|-=-++.
T Consensus 137 ~~El~aVlAHElgHik~~d~ 156 (324)
T PRK01265 137 RDEIKAVAGHELGHLKHRDV 156 (324)
T ss_pred HHHHHHHHHHHHHHHHcccH
Confidence 45688899999999654444
No 63
>PF08219 TOM13: Outer membrane protein TOM13; InterPro: IPR013262 The TOM13 family of proteins are mitochondrial outer membrane proteins that mediate the assembly of beta-barrel proteins [].; GO: 0005741 mitochondrial outer membrane
Probab=40.39 E-value=16 Score=22.99 Aligned_cols=13 Identities=31% Similarity=0.243 Sum_probs=10.6
Q ss_pred hhhhHhhhhhHhh
Q psy7253 84 VAVVVAHELAPIR 96 (119)
Q Consensus 84 ~~~~iaHEiaHqW 96 (119)
..+++|||++..|
T Consensus 52 FGEl~AhE~~fr~ 64 (77)
T PF08219_consen 52 FGELFAHEIAFRL 64 (77)
T ss_pred HHHHHHHHHHHHh
Confidence 4579999999874
No 64
>TIGR02421 QEGLA conserved hypothetical protein. Members of this family include a possible metal-binding motif HEXXXH and, nearby, a perfectly conserved motif QEGLA. All members belong to the Proteobacteria, including Agrobacterium tumefaciens and several species of Vibrio and Pseudomonas, and are found in only one copy per chromosome (Vibrio vulnificus, with two chromosomes, has two). The function is unknown.
Probab=39.74 E-value=47 Score=26.91 Aligned_cols=67 Identities=10% Similarity=0.112 Sum_probs=40.2
Q ss_pred ccCEEEeCCCCCccccccccceeeccceeccCCCCccccchhhhhhHhhhhhHh-------------hhcCCC---CCCC
Q psy7253 42 LDDLIAIPDFGTGAMENWGLITYRETSILYDEQETSASGHNWVAVVVAHELAPI-------------RHNAKV---AKPA 105 (119)
Q Consensus 42 kld~v~~P~~~~~~men~Gli~~~e~~ll~~~~~~~~~~~~~~~~~iaHEiaHq-------------WfGnlV---t~~~ 105 (119)
+..+.+..+....+|..-.-|..+.+..+ ....+..++.||+--+ |++.=. |...
T Consensus 155 ~~~V~~sd~l~a~a~v~~~~l~i~~~a~f---------s~~~l~~L~~HEigvH~~T~~Ng~~Qp~~~l~~G~p~~t~tQ 225 (366)
T TIGR02421 155 TIRVTLSDDLPAGAMVSGDKLKLNSDAMF---------SERDLEALIHHEIGVHLLTTLNGRAQPLRLLSIGLPGYTATQ 225 (366)
T ss_pred ceEEEECcchhHHHhccCCeEEECCCCCc---------CHHHHHHHHHHhHHhhhhhccccccCchHHHhcCCCCCCCcc
Confidence 34444556676678876654444444321 1245778999998644 443221 1119
Q ss_pred hhHHHHHhhhhh
Q psy7253 106 NGKTTFVYLFDF 117 (119)
Q Consensus 106 EG~a~y~~~~~~ 117 (119)
||+|.+.|++..
T Consensus 226 EGLAvl~E~l~g 237 (366)
T TIGR02421 226 EGLAILAEYLAG 237 (366)
T ss_pred HHHHHHHHHHhc
Confidence 999999998754
No 65
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=39.67 E-value=15 Score=25.08 Aligned_cols=13 Identities=38% Similarity=0.455 Sum_probs=10.4
Q ss_pred hhhHhhhhhHhhh
Q psy7253 85 AVVVAHELAPIRH 97 (119)
Q Consensus 85 ~~~iaHEiaHqWf 97 (119)
..+++||++|-=+
T Consensus 81 vKviiHEllHIP~ 93 (133)
T COG4900 81 VKVIIHELLHIPA 93 (133)
T ss_pred HHHHHHHHhcCcc
Confidence 4689999999744
No 66
>KOG3314|consensus
Probab=36.75 E-value=1.2e+02 Score=22.25 Aligned_cols=61 Identities=23% Similarity=0.180 Sum_probs=36.4
Q ss_pred CchhHHHH---hhCCCCCCCccCEEEeCCCCCccc-cccccceeeccceeccCCCCccccchhhhhhHhhhhhHh
Q psy7253 25 NSVPFLDS---WARPAVRDCLDDLIAIPDFGTGAM-ENWGLITYRETSILYDEQETSASGHNWVAVVVAHELAPI 95 (119)
Q Consensus 25 ~~l~~~e~---~~~~~yp~~kld~v~~P~~~~~~m-en~Gli~~~e~~ll~~~~~~~~~~~~~~~~~iaHEiaHq 95 (119)
+.++|+-+ ..|-..-..++.-+..|+...||- .-.| |+.-++.| + .+..+..+++||+.|.
T Consensus 38 P~Vkfl~~am~k~~c~~~d~~isc~~C~~~~~GGy~~~~g-IvlCqN~l-~--------~q~h~n~vv~HElIH~ 102 (194)
T KOG3314|consen 38 PIVKFLMEAMEKSGCRVGDNFISCVVCTGPVAGGYTPGRG-IVLCQNRL-T--------IQDHVNQVVIHELIHA 102 (194)
T ss_pred cHHHHHHHHHHHcCCCccCCceEEeeCCCCccCCccCCCc-eEEecccc-c--------hHHHHHHHHHHHHHHH
Confidence 34455433 334444455555666677665543 3456 56666653 2 2456788999999997
No 67
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=36.71 E-value=9.7 Score=27.17 Aligned_cols=13 Identities=46% Similarity=0.754 Sum_probs=9.9
Q ss_pred hhhhhhHhhhhhH
Q psy7253 82 NWVAVVVAHELAP 94 (119)
Q Consensus 82 ~~~~~~iaHEiaH 94 (119)
..+..+|.||+||
T Consensus 59 ~f~~~vV~HELaH 71 (156)
T COG3091 59 DFIEQVVPHELAH 71 (156)
T ss_pred HHHHHHHHHHHHH
Confidence 3567788888887
No 68
>PF14521 Aspzincin_M35: Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=34.60 E-value=21 Score=24.78 Aligned_cols=13 Identities=15% Similarity=0.100 Sum_probs=10.4
Q ss_pred hhhhhhHhhhhhH
Q psy7253 82 NWVAVVVAHELAP 94 (119)
Q Consensus 82 ~~~~~~iaHEiaH 94 (119)
..-+.+|.||++|
T Consensus 94 ~Sk~~TLiHE~SH 106 (148)
T PF14521_consen 94 DSKEGTLIHEWSH 106 (148)
T ss_dssp T-HHHHHHHHHHH
T ss_pred CchHHHHHHhhhh
Confidence 4457899999999
No 69
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=34.07 E-value=16 Score=27.31 Aligned_cols=10 Identities=40% Similarity=0.521 Sum_probs=8.7
Q ss_pred hhHhhhhhHh
Q psy7253 86 VVVAHELAPI 95 (119)
Q Consensus 86 ~~iaHEiaHq 95 (119)
.++||||+|-
T Consensus 147 ~t~AHElGHn 156 (228)
T cd04271 147 QVFAHEIGHT 156 (228)
T ss_pred eehhhhhhhh
Confidence 5899999995
No 70
>COG5465 Uncharacterized conserved protein [Function unknown]
Probab=33.85 E-value=1.5e+02 Score=21.29 Aligned_cols=64 Identities=19% Similarity=0.084 Sum_probs=37.8
Q ss_pred HHhhCCCCCCCcc-----------CEEEeCCCCCccccccccceeeccceeccCCCCccccchhhhhhHhhhhhHhhh
Q psy7253 31 DSWARPAVRDCLD-----------DLIAIPDFGTGAMENWGLITYRETSILYDEQETSASGHNWVAVVVAHELAPIRH 97 (119)
Q Consensus 31 e~~~~~~yp~~kl-----------d~v~~P~~~~~~men~Gli~~~e~~ll~~~~~~~~~~~~~~~~~iaHEiaHqWf 97 (119)
..+|.+..|.++- ..|-+|.|. -=|-.|.|+||..-++.+ +..+.+++..+..-.|-|.+-+.|
T Consensus 64 ~c~fd~kvpes~~~~~~rLL~liN~~v~~ghF~--lweeeg~I~fR~~L~l~~-G~ep~~~q~~v~idta~eacdsff 138 (166)
T COG5465 64 ACAFDIKVPESRRAEVARLLSLINRDVLIGHFD--LWEEEGAIFFRQSLLLAG-GVEPTSQQVEVLIDTALEACDSFF 138 (166)
T ss_pred hhhccccCchhHHHHHHHHHHHhhcccccCCcC--cccccCeeeeeeehhccC-CCCccHHHHHHHHHHHHHHHHHHH
Confidence 4566655555442 245567764 456789999999865543 334444444454455666666554
No 71
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=33.22 E-value=27 Score=26.13 Aligned_cols=27 Identities=22% Similarity=0.324 Sum_probs=20.4
Q ss_pred hhhhhhHhhhhhHh-------hhcCCCCCC---ChhHH
Q psy7253 82 NWVAVVVAHELAPI-------RHNAKVAKP---ANGKT 109 (119)
Q Consensus 82 ~~~~~~iaHEiaHq-------WfGnlVt~~---~EG~a 109 (119)
...+.+|+|||-|- ||-.. +|. +||-+
T Consensus 131 ~~~~hvi~HEiGH~IGfRHTD~~~R~-SCG~~~nEg~~ 167 (211)
T PF12388_consen 131 NVIEHVITHEIGHCIGFRHTDYFNRS-SCGSGGNEGSA 167 (211)
T ss_pred hHHHHHHHHHhhhhccccccCcCCcc-cccccCCcCcc
Confidence 45778999999997 88766 666 66643
No 72
>PF04036 DUF372: Domain of unknown function (DUF372); InterPro: IPR007179 This is a group of proteins of unknown function. It is found N-terminal to another domain of unknown function (IPR007181 from INTERPRO).; PDB: 2I52_B 2IEC_D 2OGF_C.
Probab=31.15 E-value=36 Score=18.56 Aligned_cols=23 Identities=13% Similarity=-0.039 Sum_probs=15.6
Q ss_pred hhhHhhhcCCCCCC-ChhHHHHHh
Q psy7253 91 ELAPIRHNAKVAKP-ANGKTTFVY 113 (119)
Q Consensus 91 EiaHqWfGnlVt~~-~EG~a~y~~ 113 (119)
-+-|||-|-=|+++ .+.+..-+|
T Consensus 13 aLyHQF~GtPvs~~~~~~le~aie 36 (38)
T PF04036_consen 13 ALYHQFVGTPVSPETAESLEKAIE 36 (38)
T ss_dssp HHHHHHTT-EESTTTHHHHHHHHH
T ss_pred HHHHHhcCCcCCcchHHHHHHHHh
Confidence 47899999999887 555554443
No 73
>PF07607 DUF1570: Protein of unknown function (DUF1570); InterPro: IPR011464 This entry represents hypothetical proteins confined to bacteria.
Probab=30.33 E-value=19 Score=24.73 Aligned_cols=12 Identities=17% Similarity=0.199 Sum_probs=10.3
Q ss_pred ChhHHHHHhhhh
Q psy7253 105 ANGKTTFVYLFD 116 (119)
Q Consensus 105 ~EG~a~y~~~~~ 116 (119)
+||||+|+|.--
T Consensus 31 ~EGlA~yFE~~~ 42 (128)
T PF07607_consen 31 SEGLATYFETPG 42 (128)
T ss_pred HHhHHHHcCCCc
Confidence 999999998654
No 74
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=29.58 E-value=26 Score=23.89 Aligned_cols=13 Identities=38% Similarity=0.284 Sum_probs=10.8
Q ss_pred hhhhhHhhhhhHh
Q psy7253 83 WVAVVVAHELAPI 95 (119)
Q Consensus 83 ~~~~~iaHEiaHq 95 (119)
....+++|||.|-
T Consensus 95 ~~~~~~~HElGH~ 107 (167)
T cd00203 95 EGAQTIAHELGHA 107 (167)
T ss_pred cchhhHHHHHHHH
Confidence 4678999999985
No 75
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=27.37 E-value=41 Score=20.00 Aligned_cols=14 Identities=21% Similarity=-0.005 Sum_probs=10.7
Q ss_pred hhhhhHhhhhhHhh
Q psy7253 83 WVAVVVAHELAPIR 96 (119)
Q Consensus 83 ~~~~~iaHEiaHqW 96 (119)
...+.+.|.+||++
T Consensus 28 ~~eRriih~la~~l 41 (60)
T cd02639 28 PAERRIVHLLASRL 41 (60)
T ss_pred HHHHHHHHHHHHHc
Confidence 34578889999875
No 76
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=27.03 E-value=39 Score=22.07 Aligned_cols=35 Identities=14% Similarity=0.158 Sum_probs=21.2
Q ss_pred cccceeeccceeccCCCCccccchhhhhhHhhhhhHh
Q psy7253 59 WGLITYRETSILYDEQETSASGHNWVAVVVAHELAPI 95 (119)
Q Consensus 59 ~Gli~~~e~~ll~~~~~~~~~~~~~~~~~iaHEiaHq 95 (119)
.-+++|+..-. ....+...-...+..++.||+||.
T Consensus 50 ~rI~lyR~pl~--~~~~~~~eL~~~I~~tlvhEiah~ 84 (97)
T PF06262_consen 50 DRIVLYRRPLE--RRARSREELAELIRDTLVHEIAHH 84 (97)
T ss_dssp EEEEEEHHHHH--HT-SSHHHHHHHHHHHHHHHHHHH
T ss_pred CEEEEehHHHH--HHhCCHHHHHHHHHHHHHHHHHHH
Confidence 44777877742 222222223356788999999996
No 77
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=25.92 E-value=37 Score=26.55 Aligned_cols=12 Identities=42% Similarity=0.396 Sum_probs=10.1
Q ss_pred hhhhHhhhhhHh
Q psy7253 84 VAVVVAHELAPI 95 (119)
Q Consensus 84 ~~~~iaHEiaHq 95 (119)
.+.+||||+.|+
T Consensus 170 ~ayVlAHEyGHH 181 (292)
T PF04228_consen 170 QAYVLAHEYGHH 181 (292)
T ss_pred HHHHHHHHHHHH
Confidence 466899999997
No 78
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=25.88 E-value=38 Score=22.76 Aligned_cols=14 Identities=29% Similarity=0.309 Sum_probs=11.2
Q ss_pred hhhhhhHhhhhhHh
Q psy7253 82 NWVAVVVAHELAPI 95 (119)
Q Consensus 82 ~~~~~~iaHEiaHq 95 (119)
..+..++.|||.|-
T Consensus 103 ~~~~~v~~HEiGHa 116 (154)
T PF00413_consen 103 NDLQSVAIHEIGHA 116 (154)
T ss_dssp EEHHHHHHHHHHHH
T ss_pred hhhhhhhhhccccc
Confidence 45678999999884
No 79
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=24.95 E-value=35 Score=24.96 Aligned_cols=11 Identities=27% Similarity=0.238 Sum_probs=9.3
Q ss_pred hhHhhhhhHhh
Q psy7253 86 VVVAHELAPIR 96 (119)
Q Consensus 86 ~~iaHEiaHqW 96 (119)
.+++|||.|..
T Consensus 139 ~~~aHEiGH~l 149 (206)
T PF13583_consen 139 QTFAHEIGHNL 149 (206)
T ss_pred hHHHHHHHHHh
Confidence 67999999974
No 80
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=23.93 E-value=49 Score=27.74 Aligned_cols=15 Identities=27% Similarity=0.171 Sum_probs=11.6
Q ss_pred chhhhhhHhhhhhHh
Q psy7253 81 HNWVAVVVAHELAPI 95 (119)
Q Consensus 81 ~~~~~~~iaHEiaHq 95 (119)
.+...++++|||+|-
T Consensus 207 ~~~~~~~~~HEi~Ha 221 (521)
T PF01457_consen 207 FQEFFRTVIHEIAHA 221 (521)
T ss_dssp -HHHHHHHHHHHHHH
T ss_pred hhcccceeeeeeeee
Confidence 455678999999995
No 81
>COG4101 Predicted mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=22.02 E-value=22 Score=24.63 Aligned_cols=14 Identities=7% Similarity=-0.080 Sum_probs=10.2
Q ss_pred hhhhhHhhhcCCCC
Q psy7253 89 AHELAPIRHNAKVA 102 (119)
Q Consensus 89 aHEiaHqWfGnlVt 102 (119)
.-.-||.||||-+-
T Consensus 74 lsG~ah~w~G~rLE 87 (142)
T COG4101 74 LSGEAHTWYGNRLE 87 (142)
T ss_pred Eeceeeeeecccee
Confidence 33568999999653
No 82
>cd06007 R3H_DEXH_helicase R3H domain of a group of proteins which also contain a DEXH-box helicase domain, and may function as ATP-dependent DNA or RNA helicases. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=21.95 E-value=54 Score=19.39 Aligned_cols=13 Identities=23% Similarity=0.079 Sum_probs=11.0
Q ss_pred hhhhhHhhhhhHh
Q psy7253 83 WVAVVVAHELAPI 95 (119)
Q Consensus 83 ~~~~~iaHEiaHq 95 (119)
...+.+.|++|++
T Consensus 27 ~~eR~~vH~~a~~ 39 (59)
T cd06007 27 NHERAVIHRLCRK 39 (59)
T ss_pred HHHHHHHHHHHHH
Confidence 4568899999999
No 83
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=21.56 E-value=47 Score=23.42 Aligned_cols=21 Identities=19% Similarity=0.173 Sum_probs=15.7
Q ss_pred hhhhHhhhhhHhhhcCCCCCC
Q psy7253 84 VAVVVAHELAPIRHNAKVAKP 104 (119)
Q Consensus 84 ~~~~iaHEiaHqWfGnlVt~~ 104 (119)
...++.||++|-+.+.....+
T Consensus 7 ~i~i~~HE~gH~~~a~~~G~~ 27 (192)
T PF02163_consen 7 LISIVLHELGHALAARLYGDK 27 (192)
T ss_dssp HHHHHHHHHHHHHHHHTTT--
T ss_pred ccccccccccccccccccccc
Confidence 346899999999988777665
No 84
>PF14831 DUF4484: Domain of unknown function (DUF4484)
Probab=21.46 E-value=59 Score=23.65 Aligned_cols=14 Identities=14% Similarity=0.124 Sum_probs=12.0
Q ss_pred hhhHhhhhhHhhhc
Q psy7253 85 AVVVAHELAPIRHN 98 (119)
Q Consensus 85 ~~~iaHEiaHqWfG 98 (119)
.....+|++++|||
T Consensus 145 D~~FV~el~~~yf~ 158 (176)
T PF14831_consen 145 DAAFVKELVAKYFG 158 (176)
T ss_pred HHHHHHHHHHHHcC
Confidence 35778999999999
No 85
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=21.41 E-value=50 Score=22.97 Aligned_cols=12 Identities=42% Similarity=0.562 Sum_probs=9.9
Q ss_pred hhhhHhhhhhHh
Q psy7253 84 VAVVVAHELAPI 95 (119)
Q Consensus 84 ~~~~iaHEiaHq 95 (119)
-.++++||+-|-
T Consensus 77 ~~RIaaHE~GHi 88 (132)
T PF02031_consen 77 STRIAAHELGHI 88 (132)
T ss_dssp HHHHHHHHHHHH
T ss_pred cceeeeehhccc
Confidence 457999999985
No 86
>COG4758 Predicted membrane protein [Function unknown]
Probab=21.10 E-value=36 Score=25.91 Aligned_cols=14 Identities=0% Similarity=-0.192 Sum_probs=11.2
Q ss_pred hhhHhhhcCCCCCC
Q psy7253 91 ELAPIRHNAKVAKP 104 (119)
Q Consensus 91 EiaHqWfGnlVt~~ 104 (119)
.+-||||||+=++.
T Consensus 115 ~~r~~w~Gn~~~~~ 128 (235)
T COG4758 115 DFRNQWFGNQRYYY 128 (235)
T ss_pred HHHhhccccceeee
Confidence 45899999997655
No 87
>PHA00527 hypothetical protein
Probab=20.88 E-value=62 Score=21.82 Aligned_cols=32 Identities=16% Similarity=0.096 Sum_probs=21.5
Q ss_pred hhhhHhhhhhHhhhcCC----CCCC----ChhHHHHHhhh
Q psy7253 84 VAVVVAHELAPIRHNAK----VAKP----ANGKTTFVYLF 115 (119)
Q Consensus 84 ~~~~iaHEiaHqWfGnl----Vt~~----~EG~a~y~~~~ 115 (119)
-+.+|+||-||.=|.-. |+.+ ||-.+-.+..+
T Consensus 73 K~~T~~HECAH~AF~vC~~VGV~~E~G~ANETYCY~~~R~ 112 (129)
T PHA00527 73 KAATLVHECAHVAFYVCRDVGVTTEPGDANETYCYMLDRM 112 (129)
T ss_pred HHHHHHHHHHHHHHHHHHhcCcccCCCccchhHHHHHHHH
Confidence 36799999999987521 2222 88777666554
Done!