Query 010667
Match_columns 504
No_of_seqs 317 out of 571
Neff 4.5
Searched_HMMs 46136
Date Fri Mar 29 03:11:28 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/010667.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/010667hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF08325 WLM: WLM domain; Int 100.0 5E-44 1.1E-48 340.0 11.5 155 2-162 20-186 (186)
2 KOG4842 Protein involved in si 99.9 4.7E-23 1E-27 202.9 4.3 143 1-158 136-278 (278)
3 PF09409 PUB: PUB domain; Int 99.8 5.7E-19 1.2E-23 148.4 8.1 83 409-498 2-87 (87)
4 KOG4842 Protein involved in si 99.7 2.7E-18 5.8E-23 169.6 6.6 166 2-173 49-236 (278)
5 smart00580 PUG domain in prote 99.0 6E-10 1.3E-14 88.5 4.8 49 415-469 1-49 (58)
6 PF01863 DUF45: Protein of unk 98.5 9.3E-08 2E-12 90.4 4.7 62 38-104 138-199 (205)
7 COG1451 Predicted metal-depend 97.9 7.2E-06 1.6E-10 81.0 4.1 62 38-104 149-210 (223)
8 smart00731 SprT SprT homologue 96.8 0.00097 2.1E-08 61.3 3.2 34 64-97 54-93 (146)
9 PF10263 SprT-like: SprT-like 95.6 0.012 2.6E-07 53.7 3.8 59 39-98 30-97 (157)
10 KOG2699 Predicted ubiquitin re 94.8 0.011 2.4E-07 63.3 1.0 64 416-486 310-373 (407)
11 PRK04351 hypothetical protein; 91.9 0.16 3.6E-06 47.7 3.7 31 66-96 58-95 (149)
12 PRK04860 hypothetical protein; 90.4 0.3 6.4E-06 46.5 3.7 31 65-95 59-96 (160)
13 KOG2699 Predicted ubiquitin re 87.3 0.28 6.1E-06 52.9 1.5 62 398-463 152-214 (407)
14 COG3091 SprT Zn-dependent meta 85.5 0.94 2E-05 43.1 3.8 30 67-96 59-95 (156)
15 KOG3854 SPRT-like metalloprote 80.7 1.3 2.8E-05 48.7 3.0 41 46-91 333-380 (505)
16 PF13485 Peptidase_MA_2: Pepti 79.8 1.8 3.9E-05 36.5 3.1 22 65-86 21-42 (128)
17 cd04270 ZnMc_TACE_like Zinc-de 76.5 2 4.2E-05 43.0 2.7 28 62-89 160-187 (244)
18 PF06114 DUF955: Domain of unk 74.3 3.1 6.7E-05 34.8 3.0 26 64-89 37-62 (122)
19 PF04450 BSP: Peptidase of pla 69.4 3.2 6.9E-05 41.0 2.2 53 49-104 74-131 (205)
20 COG4900 Predicted metallopepti 68.5 6.3 0.00014 36.2 3.7 73 19-95 26-114 (133)
21 PF05569 Peptidase_M56: BlaR1 68.5 5 0.00011 40.6 3.5 23 64-86 191-213 (299)
22 PRK04897 heat shock protein Ht 68.2 3.7 8E-05 42.3 2.5 66 12-85 88-153 (298)
23 PRK03072 heat shock protein Ht 67.8 3.8 8.3E-05 42.0 2.5 23 63-85 121-143 (288)
24 KOG3931 Uncharacterized conser 67.5 11 0.00023 40.7 5.7 51 46-97 83-142 (484)
25 PRK03982 heat shock protein Ht 66.7 4.2 9E-05 41.5 2.5 66 12-85 76-141 (288)
26 COG0501 HtpX Zn-dependent prot 65.7 4.1 8.9E-05 40.6 2.2 24 63-86 151-174 (302)
27 PRK03001 M48 family peptidase; 64.8 4.8 0.0001 41.0 2.5 65 12-84 75-139 (283)
28 PF10460 Peptidase_M30: Peptid 64.4 5.9 0.00013 42.5 3.2 41 44-84 114-154 (366)
29 PRK02391 heat shock protein Ht 62.9 5.4 0.00012 41.3 2.5 21 64-84 128-148 (296)
30 PF01435 Peptidase_M48: Peptid 62.7 5.4 0.00012 37.8 2.3 44 40-88 65-108 (226)
31 PRK02870 heat shock protein Ht 62.4 5.4 0.00012 42.2 2.5 21 64-84 168-188 (336)
32 PRK05457 heat shock protein Ht 62.2 5.7 0.00012 40.8 2.5 21 64-84 129-149 (284)
33 cd04271 ZnMc_ADAM_fungal Zinc- 60.8 3.5 7.5E-05 41.0 0.7 18 71-88 147-164 (228)
34 PRK01345 heat shock protein Ht 60.3 6.4 0.00014 41.1 2.5 21 64-84 119-139 (317)
35 PRK01265 heat shock protein Ht 58.5 7.1 0.00015 41.2 2.5 65 12-84 91-155 (324)
36 PF12388 Peptidase_M57: Dual-a 56.9 10 0.00022 37.9 3.1 40 44-88 113-152 (211)
37 PF13688 Reprolysin_5: Metallo 54.8 8.8 0.00019 36.1 2.3 25 64-88 137-161 (196)
38 cd04272 ZnMc_salivary_gland_MP 53.6 8.3 0.00018 37.5 2.0 20 69-88 145-164 (220)
39 PF13582 Reprolysin_3: Metallo 53.0 7.4 0.00016 33.8 1.4 16 71-86 109-124 (124)
40 PF13058 DUF3920: Protein of u 52.5 8.3 0.00018 35.3 1.6 20 63-82 70-89 (126)
41 cd04269 ZnMc_adamalysin_II_lik 50.6 12 0.00025 35.5 2.4 23 66-88 128-150 (194)
42 PRK14015 pepN aminopeptidase N 49.3 10 0.00022 44.9 2.1 20 65-84 292-311 (875)
43 TIGR02414 pepN_proteo aminopep 48.9 11 0.00023 44.8 2.1 21 64-84 278-298 (863)
44 cd04267 ZnMc_ADAM_like Zinc-de 48.6 9 0.0002 36.1 1.3 22 67-88 131-152 (192)
45 COG0308 PepN Aminopeptidase N 48.5 11 0.00023 44.5 2.1 25 62-86 300-324 (859)
46 TIGR02412 pepN_strep_liv amino 48.3 11 0.00025 44.0 2.3 18 66-83 284-301 (831)
47 PF01447 Peptidase_M4: Thermol 48.0 12 0.00026 35.2 2.0 20 62-81 128-147 (150)
48 PF14521 Aspzincin_M35: Lysine 46.6 11 0.00024 35.0 1.5 31 50-82 78-109 (148)
49 PF13574 Reprolysin_2: Metallo 46.2 12 0.00027 35.4 1.8 20 69-88 111-130 (173)
50 TIGR02411 leuko_A4_hydro leuko 45.2 12 0.00026 42.4 1.7 16 69-84 279-294 (601)
51 PF01433 Peptidase_M1: Peptida 43.3 17 0.00036 37.5 2.4 19 66-84 292-310 (390)
52 KOG1046 Puromycin-sensitive am 42.1 16 0.00035 43.2 2.3 21 64-84 319-339 (882)
53 PF01421 Reprolysin: Reprolysi 40.1 22 0.00047 33.9 2.4 25 64-88 126-150 (199)
54 PF13699 DUF4157: Domain of un 39.3 15 0.00032 31.0 1.1 33 47-83 43-75 (79)
55 PF06262 DUF1025: Possibl zinc 39.1 21 0.00046 31.5 2.0 17 66-82 70-86 (97)
56 PF08919 F_actin_bind: F-actin 38.6 41 0.0009 30.5 3.8 73 387-467 7-85 (110)
57 PF02102 Peptidase_M35: Deuter 37.8 17 0.00037 39.1 1.4 21 68-88 296-318 (359)
58 PF00413 Peptidase_M10: Matrix 37.6 22 0.00048 31.8 2.0 24 64-87 100-123 (154)
59 PF10023 DUF2265: Predicted am 37.0 40 0.00087 36.0 4.0 43 65-107 161-204 (337)
60 COG4219 MecR1 Antirepressor re 35.5 23 0.0005 37.7 1.9 21 64-84 185-205 (337)
61 cd04273 ZnMc_ADAMTS_like Zinc- 35.1 9 0.0002 36.9 -1.0 19 70-88 141-159 (207)
62 PHA02456 zinc metallopeptidase 34.8 22 0.00048 32.8 1.5 39 40-82 53-92 (141)
63 cd00203 ZnMc Zinc-dependent me 34.7 20 0.00044 32.4 1.3 22 67-88 94-115 (167)
64 PF12725 DUF3810: Protein of u 34.6 17 0.00038 38.0 0.9 35 64-98 191-227 (318)
65 cd04268 ZnMc_MMP_like Zinc-dep 33.6 31 0.00068 31.3 2.3 22 67-88 92-113 (165)
66 PF13203 DUF2201_N: Putative m 33.2 32 0.00069 34.8 2.5 20 65-84 56-75 (292)
67 KOG2719 Metalloprotease [Gener 33.1 44 0.00096 36.8 3.7 73 9-83 222-294 (428)
68 PF10463 Peptidase_U49: Peptid 32.0 38 0.00082 33.8 2.7 35 69-103 101-135 (206)
69 PF14891 Peptidase_M91: Effect 31.0 23 0.00049 33.7 1.0 19 70-88 104-122 (174)
70 PF09768 Peptidase_M76: Peptid 28.9 43 0.00093 32.5 2.5 20 62-81 64-83 (173)
71 PF15639 Tox-MPTase3: Metallop 28.8 24 0.00052 33.0 0.7 77 4-80 19-111 (135)
72 cd04279 ZnMc_MMP_like_1 Zinc-d 28.2 41 0.0009 30.7 2.2 22 67-88 102-123 (156)
73 PF07998 Peptidase_M54: Peptid 26.7 46 0.001 32.9 2.3 23 66-88 142-164 (194)
74 PRK13267 archaemetzincin-like 26.0 48 0.001 32.1 2.3 24 65-88 121-144 (179)
75 KOG3607 Meltrins, fertilins an 25.7 39 0.00084 39.5 1.8 22 66-87 320-341 (716)
76 PRK09672 phage exclusion prote 24.1 69 0.0015 33.9 3.1 36 69-104 165-200 (305)
77 PF14247 DUF4344: Domain of un 22.8 48 0.001 33.3 1.6 21 66-86 89-109 (220)
78 cd04277 ZnMc_serralysin_like Z 22.4 54 0.0012 30.9 1.8 38 49-87 94-131 (186)
79 cd04278 ZnMc_MMP Zinc-dependen 22.2 49 0.0011 30.3 1.5 23 65-87 103-125 (157)
80 KOG1047 Bifunctional leukotrie 21.8 43 0.00093 38.2 1.1 18 68-85 287-304 (613)
81 PF13583 Reprolysin_4: Metallo 20.4 56 0.0012 31.9 1.5 19 70-88 138-156 (206)
No 1
>PF08325 WLM: WLM domain; InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=100.00 E-value=5e-44 Score=339.95 Aligned_cols=155 Identities=43% Similarity=0.590 Sum_probs=133.7
Q ss_pred cccccCcc-hhhHhhhcCeeeceecccCCCCCCCCCCCcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhh
Q 010667 2 HMLAADPG-IIAIMNKHRWRVGIMTELAPVGYVGVSPKCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHM 80 (504)
Q Consensus 2 ~~L~~~a~-i~pIM~~~~w~V~~L~Ef~P~~~~~~s~~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~ 80 (504)
++|.++|. |.|||++|+|+|++|+||+|++ ..+||+|+|+|++|+||||+++.++|+||++|+.|||||||||
T Consensus 20 ~lL~rlA~~v~pIM~~~~~~V~~L~E~~P~~------~~llG~N~N~G~~I~lrLR~~~~~~fl~~~~i~~t~lHELaH~ 93 (186)
T PF08325_consen 20 ELLERLAADVKPIMRKHGWRVGSLEEFYPNG------ERLLGLNVNKGEKICLRLRTPDDGGFLPYETILGTMLHELAHN 93 (186)
T ss_pred HHHHHHHHHHHHHHHHcCcccCeeeccCCCC------CCCcceecCCCcEEEEEeCCCCCCCEeeHHHHHHHHHHHHHhc
Confidence 57888874 5599999999999999999996 4599999999999999999965599999999999999999999
Q ss_pred hcCccchhHHHHHHHHHHHHHHhccc------cCCCeeecCCCCCCcccccccccCCCCCCccccCCCC-----CCCccH
Q 010667 81 VYSEHDANFYGLDKQLNQEAVALDWT------KSRGHTLSGVRHTSHHEDDLFVGDSRSFSQKLGGNIS-----DQLASA 149 (504)
Q Consensus 81 vhg~Hd~~F~~L~~~L~~E~~~l~~~------~s~G~~LgG~~~~~~~~~~~~~~~~~~~g~rLGG~~~-----~~~~s~ 149 (504)
+|++||.+||++|++|..||++|+|+ ++.|++|||.........+.......++|++|||++. .++.++
T Consensus 94 ~~~~H~~~F~~l~~~l~~e~~~l~~~G~~~gf~~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~LgG~s~~~~~~~~~~~~ 173 (186)
T PF08325_consen 94 VHGPHDDKFWKLLDELRKECEELDAKGYTEGFWSSGRRLGGSSGQPSEERELRGNGLSGGGQRLGGGSSSRPRKAQPKSP 173 (186)
T ss_pred ccCCccHHHHHHHHHHHHHHHHHHhcCCccccCCCCcccCCCCcccchhhhhhccccCCCCeeCCCCCCCCCCcCCCcCH
Confidence 99999999999999999999999996 7999999987654321111223334668999999975 467899
Q ss_pred HHHHHHHHHHHHh
Q 010667 150 RASSVAAAYRRLA 162 (504)
Q Consensus 150 Re~aa~AAerR~~ 162 (504)
|+++|+||+||++
T Consensus 174 Re~~a~AAerR~~ 186 (186)
T PF08325_consen 174 REAAAAAAERRLR 186 (186)
T ss_pred HHHHHHHHHhhcC
Confidence 9999999999975
No 2
>KOG4842 consensus Protein involved in sister chromatid separation and/or segregation [Cell cycle control, cell division, chromosome partitioning]
Probab=99.87 E-value=4.7e-23 Score=202.95 Aligned_cols=143 Identities=43% Similarity=0.522 Sum_probs=130.0
Q ss_pred CcccccCcchhhHhhhcCeeeceecccCCCCCCCCCCCcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhh
Q 010667 1 MHMLAADPGIIAIMNKHRWRVGIMTELAPVGYVGVSPKCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHM 80 (504)
Q Consensus 1 ~~~L~~~a~i~pIM~~~~w~V~~L~Ef~P~~~~~~s~~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~ 80 (504)
|++|+.+++|..+|-.|.|.|+.|+|+.|..++++++.++||+|.|.| +|.+|||+++.++||.|+.|-.|++|||||+
T Consensus 136 L~~Lrad~gii~~mg~hrW~vg~l~el~g~~nt~v~~~~tLg~stnqG-~i~lrlrtdrkkgfR~y~tissTl~heLtr~ 214 (278)
T KOG4842|consen 136 LDELRADQGIIEQMGLHRWFVGNLQELGGRANTRVNRYPTLGISTNQG-VIVLRLRTDRKKGFRHYETISSTLRHELTRE 214 (278)
T ss_pred HHHHhhchhHHHHhcccceechhhhhcccccceeecCccceeeccccc-eEEEecccchhcccccCCCchHHHHhhhhhh
Confidence 689999999999999999999999999999999999999999999999 9999999999999999999999999999999
Q ss_pred hcCccchhHHHHHHHHHHHHHHhccccCCCeeecCCCCCCcccccccccCCCCCCccccCCCCCCCccHHHHHHHHHH
Q 010667 81 VYSEHDANFYGLDKQLNQEAVALDWTKSRGHTLSGVRHTSHHEDDLFVGDSRSFSQKLGGNISDQLASARASSVAAAY 158 (504)
Q Consensus 81 vhg~Hd~~F~~L~~~L~~E~~~l~~~~s~G~~LgG~~~~~~~~~~~~~~~~~~~g~rLGG~~~~~~~s~Re~aa~AAe 158 (504)
+|+.|+..||+|..++.+|..+++|..++|++.+.... --.+.+|+.......+.|+...+|||
T Consensus 215 v~~ehde~fyrLdrql~kek~~ad~~~srg~~~sde~~--------------~vdq~de~~p~D~l~~~rdl~~aaae 278 (278)
T KOG4842|consen 215 VAAEHDERFYRLDRQLGKEKNNADQIISRGISSSDEVV--------------IVDQDDEVLPGDTLIEVRDLTYAAAE 278 (278)
T ss_pred HhhhhhhHHHHHHHHhCcccchhhhhcCCCceeeccee--------------ecccccCCCcccccchhhhhhhhhcC
Confidence 99999999999999999999999999999987653211 12467777766677788998888885
No 3
>PF09409 PUB: PUB domain; InterPro: IPR018997 The PUB (also known as PUG) domain is found in peptide N-glycanase where it functions as a AAA ATPase binding domain []. This domain is also found on other proteins linked to the ubiquitin-proteasome system. ; PDB: 2CM0_A 2CCQ_A 2D5U_A 2HPL_A 2HPJ_A.
Probab=99.77 E-value=5.7e-19 Score=148.43 Aligned_cols=83 Identities=37% Similarity=0.527 Sum_probs=67.8
Q ss_pred ChhHHHHHHHHHHHHHHHHhcCCCccccchhcccChhHHHhhcChhhHHHHHHHcCCcccccccccCCcccEEEEc---C
Q 010667 409 SPLESTTVLQTLCKIIRNVIEHPDETKYKRLRKANPIIQRSVANYKAAMEILFLVGFNEDVVLDEIGKAETYLVLK---R 485 (504)
Q Consensus 409 ~~e~~~~al~tL~Kil~NIl~~P~e~KyRrIR~sN~~Fq~kV~~~~Ga~E~L~alGF~e~~~~de~g~ge~~LVL~---~ 485 (504)
+.++..+|+++|.+||.||+.||.++|||+||++|++|+++|++++||.+||.++||++.. .+.||+|+ .
T Consensus 2 ~~~~~~~al~~L~~il~NI~~~P~~~kyR~Ir~~N~~f~~~i~~~~g~~~~L~~~GF~~~~-------~~~~~vl~~~~~ 74 (87)
T PF09409_consen 2 DPEAFQKALETLEKILSNILSNPNEEKYRRIRLSNKTFQEKILPVPGARELLEALGFREVT-------DEEFLVLPEDSE 74 (87)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHSTT-CGGGEEETTSHHHHHHTTTSTTHHHHHHHHT-EE----------SSEEE----TT
T ss_pred CHHHHHHHHHHHHHHHHHHccCCCcccceEeecCcchHHHHhcCChhHHHHHHHCCCEEec-------CCCEEEEecCCc
Confidence 4578899999999999999999999999999999999999999999999999999999986 26899999 3
Q ss_pred CCHHHHHHHHHHh
Q 010667 486 NDLALLWLAKSSL 498 (504)
Q Consensus 486 ~d~~lL~~Aks~L 498 (504)
.++..|+.|++.|
T Consensus 75 ~~~~~l~~~~~~l 87 (87)
T PF09409_consen 75 PDLQHLQKALSEL 87 (87)
T ss_dssp S-HHHHHHHHH--
T ss_pred CCHHHHHHHHhcC
Confidence 3899999888765
No 4
>KOG4842 consensus Protein involved in sister chromatid separation and/or segregation [Cell cycle control, cell division, chromosome partitioning]
Probab=99.73 E-value=2.7e-18 Score=169.58 Aligned_cols=166 Identities=22% Similarity=0.235 Sum_probs=123.8
Q ss_pred cccccCcchh-hHhhhcCeeeceecccCCCCCCCCCCCcccccccCCccEEEEEeec-CCCCCccchHHHHHHHHHHHhh
Q 010667 2 HMLAADPGII-AIMNKHRWRVGIMTELAPVGYVGVSPKCVLGFNKNHGEEISLRLRT-DDLKGFRKYESIKKTLLHELAH 79 (504)
Q Consensus 2 ~~L~~~a~i~-pIM~~~~w~V~~L~Ef~P~~~~~~s~~~lLGlN~N~G~~I~LRLR~-~d~~~Flp~~~V~~tlLHELaH 79 (504)
..|++++..+ |||++|+|.|.++.++++.+ ..|+|+|||+|.+|.|+||. .+..+|+|++.+++|++|||||
T Consensus 49 ~llk~iahl~~p~mkEh~f~vti~~Dk~irn------q~~sg~nvn~gski~lslr~~~~e~~~lp~e~pmgtylhel~h 122 (278)
T KOG4842|consen 49 NLLKEIAHLVSPLMKEHHFKVTILVDKYIRN------QRLSGMNVNHGSKIMLSLRCSTDEFQFLPMECPMGTYLHELTH 122 (278)
T ss_pred hhhhhhhhhhhhhhccccceeEEeehhHHHh------hhhhccccCCcceEEEEeeccccccccccccccchhhhhhhhh
Confidence 3578899888 99999999999999999986 78999999999999999997 5677899999999999999999
Q ss_pred hhcCccchhHHHHHHHHHHHHH------HhccccCCCeeecCCCCCCccccc--cc---ccC--CCCCCccccCCCC---
Q 010667 80 MVYSEHDANFYGLDKQLNQEAV------ALDWTKSRGHTLSGVRHTSHHEDD--LF---VGD--SRSFSQKLGGNIS--- 143 (504)
Q Consensus 80 ~vhg~Hd~~F~~L~~~L~~E~~------~l~~~~s~G~~LgG~~~~~~~~~~--~~---~~~--~~~~g~rLGG~~~--- 143 (504)
++.+||+++|++.++.|+..-- .+.|+.+-++.|+|+..-.....- .+ .+. -+.+..|+-|...
T Consensus 123 ~lqgPhd~rfl~~L~~Lrad~gii~~mg~hrW~vg~l~el~g~~nt~v~~~~tLg~stnqG~i~lrlrtdrkkgfR~y~t 202 (278)
T KOG4842|consen 123 NLQGPHDKRFLNKLDELRADQGIIEQMGLHRWFVGNLQELGGRANTRVNRYPTLGISTNQGVIVLRLRTDRKKGFRHYET 202 (278)
T ss_pred hhcCCChHHHHHHHHHHhhchhHHHHhcccceechhhhhcccccceeecCccceeeccccceEEEecccchhcccccCCC
Confidence 9999999999999999987542 233566777888875432100000 00 000 0112222222211
Q ss_pred ----CCCccHHHHHHHHHHHHHhhhccccCCCCC
Q 010667 144 ----DQLASARASSVAAAYRRLANASANSLGVSE 173 (504)
Q Consensus 144 ----~~~~s~Re~aa~AAerR~~d~~~~~~~~s~ 173 (504)
.....+|++++++++++.+...+|..+...
T Consensus 203 issTl~heLtr~v~~ehde~fyrLdrql~kek~~ 236 (278)
T KOG4842|consen 203 ISSTLRHELTREVAAEHDERFYRLDRQLGKEKNN 236 (278)
T ss_pred chHHHHhhhhhhHhhhhhhHHHHHHHHhCcccch
Confidence 124678999999999999999998755443
No 5
>smart00580 PUG domain in protein kinases, N-glycanases and other nuclear proteins.
Probab=98.97 E-value=6e-10 Score=88.50 Aligned_cols=49 Identities=31% Similarity=0.632 Sum_probs=46.6
Q ss_pred HHHHHHHHHHHHHhcCCCccccchhcccChhHHHhhcChhhHHHHHHHcCCcccc
Q 010667 415 TVLQTLCKIIRNVIEHPDETKYKRLRKANPIIQRSVANYKAAMEILFLVGFNEDV 469 (504)
Q Consensus 415 ~al~tL~Kil~NIl~~P~e~KyRrIR~sN~~Fq~kV~~~~Ga~E~L~alGF~e~~ 469 (504)
.|+++|+|+++||++||+|.| +|++||++|++++||.++|.++||....
T Consensus 1 ~sv~dLLr~irNi~~hp~e~k------~n~~~~~~l~~~pg~~~~l~~~gFp~l~ 49 (58)
T smart00580 1 ESVRDLLRALRNILHHPREEK------GNPAIKERLGDVPGGFELYFTVGFPRLL 49 (58)
T ss_pred CcHHHHHHHHHHHhhCcchhh------cCHHHHHHhcCCCcHHHHHHHcCCCccc
Confidence 378999999999999999999 9999999999999999999999999875
No 6
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=98.51 E-value=9.3e-08 Score=90.36 Aligned_cols=62 Identities=23% Similarity=0.287 Sum_probs=53.0
Q ss_pred CcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCccchhHHHHHHHHHHHHHHhc
Q 010667 38 KCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSEHDANFYGLDKQLNQEAVALD 104 (504)
Q Consensus 38 ~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~Hd~~F~~L~~~L~~E~~~l~ 104 (504)
++.||-. +.--.|.|=+|- .++|.+.|.+|++|||||..|.||++.||+++.++-+++.+..
T Consensus 138 ksrWGsc-~~~~~I~ln~~L----~~~P~~~idYVvvHEL~Hl~~~nHs~~Fw~~v~~~~Pd~k~~~ 199 (205)
T PF01863_consen 138 KSRWGSC-SSKGNITLNWRL----VMAPPEVIDYVVVHELCHLRHPNHSKRFWALVEKYMPDYKERR 199 (205)
T ss_pred hhccccC-CCCCcEEeeccc----ccCCccHHHHHHHHHHHHhccCCCCHHHHHHHHHHCcCHHHHH
Confidence 5578988 544567777666 4789999999999999999999999999999999999887765
No 7
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=97.94 E-value=7.2e-06 Score=80.97 Aligned_cols=62 Identities=27% Similarity=0.336 Sum_probs=50.7
Q ss_pred CcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCccchhHHHHHHHHHHHHHHhc
Q 010667 38 KCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSEHDANFYGLDKQLNQEAVALD 104 (504)
Q Consensus 38 ~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~Hd~~F~~L~~~L~~E~~~l~ 104 (504)
++.||=..-.| .|.+-+|-- -.|.+.+.++++|||||..+++|+.+||+++..+-+++.++.
T Consensus 149 k~~WGScs~~~-~i~~~~~l~----~~p~~~i~YVvvHELaHLke~nHs~~Fw~lv~~~~P~~~~~~ 210 (223)
T COG1451 149 KRRWGSCSKAG-EIRFNWRLV----MAPEEVIDYVVVHELAHLKEKNHSKRFWRLVEKYMPDYRAAK 210 (223)
T ss_pred cceeeeecCCC-cEEeehhhh----cCCHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHCCChHHHH
Confidence 45677555566 666666653 378899999999999999999999999999999998887765
No 8
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=96.80 E-value=0.00097 Score=61.34 Aligned_cols=34 Identities=29% Similarity=0.475 Sum_probs=30.3
Q ss_pred cchHHHHHHHHHHHhhhhcC------ccchhHHHHHHHHH
Q 010667 64 RKYESIKKTLLHELAHMVYS------EHDANFYGLDKQLN 97 (504)
Q Consensus 64 lp~~~V~~tlLHELaH~vhg------~Hd~~F~~L~~~L~ 97 (504)
.|.+.|.+|++|||||.... +|++.||+++.++.
T Consensus 54 ~~~~~l~~~l~HEm~H~~~~~~g~~~~Hg~~f~~~~~~~~ 93 (146)
T smart00731 54 NGRDRLRETLLHELCHAALYLFGRGYGHGDEWKRWMRQVN 93 (146)
T ss_pred ccHHHHHhhHHHHHHHHHHHHhCCCCCcCHHHHHHHHHHc
Confidence 46789999999999999988 99999999987664
No 9
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=95.60 E-value=0.012 Score=53.70 Aligned_cols=59 Identities=24% Similarity=0.296 Sum_probs=39.7
Q ss_pred cccccc-cCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhc--------CccchhHHHHHHHHHH
Q 010667 39 CVLGFN-KNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVY--------SEHDANFYGLDKQLNQ 98 (504)
Q Consensus 39 ~lLGlN-~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vh--------g~Hd~~F~~L~~~L~~ 98 (504)
..+|.- .+......|+|-.. .-...+.+.++.||+|||||... .+|+..|.++..++..
T Consensus 30 ~~~G~~~~~~~~~~~I~ls~~-~~~~~~~~~~~~tL~HEm~H~~~~~~~~~~~~~Hg~~fk~~~~~ig~ 97 (157)
T PF10263_consen 30 RTAGRCRYKRRSPCEIRLSPK-LLDRNPEEELIDTLLHEMAHAAAYVFGGGRRRGHGKEFKQWARRIGA 97 (157)
T ss_pred CceEEEEECCCCceEEEECHH-HHHhhHHHHHHHHHHHHHHHHHhhhccCCCCCCCCHHHHHHHHHHCC
Confidence 445652 23332344444441 11126789999999999999887 6999999998877754
No 10
>KOG2699 consensus Predicted ubiquitin regulatory protein [Posttranslational modification, protein turnover, chaperones]
Probab=94.77 E-value=0.011 Score=63.32 Aligned_cols=64 Identities=20% Similarity=0.218 Sum_probs=59.5
Q ss_pred HHHHHHHHHHHHhcCCCccccchhcccChhHHHhhcChhhHHHHHHHcCCcccccccccCCcccEEEEcCC
Q 010667 416 VLQTLCKIIRNVIEHPDETKYKRLRKANPIIQRSVANYKAAMEILFLVGFNEDVVLDEIGKAETYLVLKRN 486 (504)
Q Consensus 416 al~tL~Kil~NIl~~P~e~KyRrIR~sN~~Fq~kV~~~~Ga~E~L~alGF~e~~~~de~g~ge~~LVL~~~ 486 (504)
+..++.+|..|++++|+...|+.|+..|..|...++.+.+|.+++..+||-... +.++++++..
T Consensus 310 ~~~~~~~~~~~~ak~pd~~l~q~~f~~~~~~~~~~g~~~~a~~~~~l~~el~~~-------~~e~~~~p~~ 373 (407)
T KOG2699|consen 310 AFLTLYTPFKNVAKDPDGELLQGIFLPNELLLARYGFVSEALEFMELPGELLQE-------GSEFLSLPSV 373 (407)
T ss_pred HHHHhhcccchhccCcchhhhhhhhchhHHHHHHHhccccchhhhhhhhHHhhc-------cccccccccc
Confidence 888999999999999999999999999999999999999999999999999885 3678888764
No 11
>PRK04351 hypothetical protein; Provisional
Probab=91.95 E-value=0.16 Score=47.66 Aligned_cols=31 Identities=35% Similarity=0.525 Sum_probs=24.8
Q ss_pred hHHHHHHHHHHHhhh------h-cCccchhHHHHHHHH
Q 010667 66 YESIKKTLLHELAHM------V-YSEHDANFYGLDKQL 96 (504)
Q Consensus 66 ~~~V~~tlLHELaH~------v-hg~Hd~~F~~L~~~L 96 (504)
.+.+..|+.|||||. . +.+|+..|..++.++
T Consensus 58 ~~~l~~vv~HElcH~~~~~~g~g~~h~g~~fk~~~~~v 95 (149)
T PRK04351 58 LEELIGIIKHELCHYHLHLEGKGYQHRDRDFKELLKQV 95 (149)
T ss_pred HHHHHhhHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHh
Confidence 678999999999997 2 456889998877554
No 12
>PRK04860 hypothetical protein; Provisional
Probab=90.36 E-value=0.3 Score=46.50 Aligned_cols=31 Identities=26% Similarity=0.455 Sum_probs=25.3
Q ss_pred chHHHHHHHHHHHhhhh-c---C---ccchhHHHHHHH
Q 010667 65 KYESIKKTLLHELAHMV-Y---S---EHDANFYGLDKQ 95 (504)
Q Consensus 65 p~~~V~~tlLHELaH~v-h---g---~Hd~~F~~L~~~ 95 (504)
+.+.+..|+.|||||.+ | | +|+..|..++.+
T Consensus 59 ~~~~l~~~v~HEl~H~~~~~~~g~~~~Hg~ewk~lm~~ 96 (160)
T PRK04860 59 QQAFIDEVVPHELAHLLVYQLFGRVAPHGKEWQWMMES 96 (160)
T ss_pred cHHHHHhHHHHHHHHHHHHHHcCCCCCCCHHHHHHHHH
Confidence 46889999999999975 2 3 999998888765
No 13
>KOG2699 consensus Predicted ubiquitin regulatory protein [Posttranslational modification, protein turnover, chaperones]
Probab=87.34 E-value=0.28 Score=52.86 Aligned_cols=62 Identities=16% Similarity=0.122 Sum_probs=51.1
Q ss_pred HHHHHHHHhcCCh-hHHHHHHHHHHHHHHHHhcCCCccccchhcccChhHHHhhcChhhHHHHHHHc
Q 010667 398 QKAIEMLRAEVSP-LESTTVLQTLCKIIRNVIEHPDETKYKRLRKANPIIQRSVANYKAAMEILFLV 463 (504)
Q Consensus 398 ~~Al~~L~~~n~~-e~~~~al~tL~Kil~NIl~~P~e~KyRrIR~sN~~Fq~kV~~~~Ga~E~L~al 463 (504)
-.++..+...+.+ +-...+.++..|+..|+-.||..+|||++ +.|..|+++| .|+.+.|.++
T Consensus 152 ~ss~~~~~~~~k~~e~se~~~d~~~kqe~~~~~~pE~ek~R~r-l~~el~~~~v---~g~he~le~v 214 (407)
T KOG2699|consen 152 PSSLTKIFVNTKAKELSESGRDIRQKQEQNTERNPEREKERKR-LGNELLKEKV---LGCHEPLEAV 214 (407)
T ss_pred chhhhhccccccchhhhhhhhhhHHHhhhhhccchhhhhhhhc-cccchhhccc---ccccchHHHH
Confidence 3444455555444 44678899999999999999999999988 9999999999 6888888887
No 14
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=85.53 E-value=0.94 Score=43.13 Aligned_cols=30 Identities=33% Similarity=0.523 Sum_probs=24.8
Q ss_pred HHHHHHHHHHHhhhh-------cCccchhHHHHHHHH
Q 010667 67 ESIKKTLLHELAHMV-------YSEHDANFYGLDKQL 96 (504)
Q Consensus 67 ~~V~~tlLHELaH~v-------hg~Hd~~F~~L~~~L 96 (504)
+.|..|+.|||||.. +-|||..|..++.++
T Consensus 59 ~f~~~vV~HELaHl~ly~~~gr~~phg~ewk~lm~qV 95 (156)
T COG3091 59 DFIEQVVPHELAHLHLYQEFGRYKPHGKEWKLLMQQV 95 (156)
T ss_pred HHHHHHHHHHHHHHHHHHHcCCCCCCchhHHHHHHHh
Confidence 468899999999974 349999999888755
No 15
>KOG3854 consensus SPRT-like metalloprotease [Function unknown]
Probab=80.74 E-value=1.3 Score=48.74 Aligned_cols=41 Identities=27% Similarity=0.331 Sum_probs=28.7
Q ss_pred CCccEEEEEeecCCCCCccchHHHHHHHHHHHhhh-------hcCccchhHHH
Q 010667 46 NHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHM-------VYSEHDANFYG 91 (504)
Q Consensus 46 N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~-------vhg~Hd~~F~~ 91 (504)
++-.+|+|-+--- -.-+.|+.||+|||||- .+..|++.|..
T Consensus 333 kr~A~IeLs~kV~-----tTAERir~TLiHEmCHaAaWv~dr~e~gHGd~wKr 380 (505)
T KOG3854|consen 333 KRYAKIELSDKVC-----TTAERIRDTLIHEMCHAAAWVFDREELGHGDNWKR 380 (505)
T ss_pred CceeEEEehhhhh-----hHHHHHHHHHHHHHHHHHHhhccccccCcchHHHH
Confidence 3445677755441 12378999999999994 27899988654
No 16
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=79.80 E-value=1.8 Score=36.55 Aligned_cols=22 Identities=27% Similarity=0.366 Sum_probs=17.5
Q ss_pred chHHHHHHHHHHHhhhhcCccc
Q 010667 65 KYESIKKTLLHELAHMVYSEHD 86 (504)
Q Consensus 65 p~~~V~~tlLHELaH~vhg~Hd 86 (504)
+...+..++.|||+|.+++...
T Consensus 21 ~~~~~~~~l~HE~~H~~~~~~~ 42 (128)
T PF13485_consen 21 DEDWLDRVLAHELAHQWFGNYF 42 (128)
T ss_pred CHHHHHHHHHHHHHHHHHHHHc
Confidence 3466779999999999987544
No 17
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=76.49 E-value=2 Score=43.02 Aligned_cols=28 Identities=25% Similarity=0.200 Sum_probs=23.5
Q ss_pred CccchHHHHHHHHHHHhhhhcCccchhH
Q 010667 62 GFRKYESIKKTLLHELAHMVYSEHDANF 89 (504)
Q Consensus 62 ~Flp~~~V~~tlLHELaH~vhg~Hd~~F 89 (504)
...|+..+..||.|||.|+.-++||..-
T Consensus 160 ~~~~~~~~a~t~AHElGHnlGm~HD~~~ 187 (244)
T cd04270 160 KRVPTKESDLVTAHELGHNFGSPHDPDI 187 (244)
T ss_pred CccchhHHHHHHHHHHHHhcCCCCCCCC
Confidence 3456666789999999999999999864
No 18
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=74.31 E-value=3.1 Score=34.82 Aligned_cols=26 Identities=35% Similarity=0.468 Sum_probs=19.8
Q ss_pred cchHHHHHHHHHHHhhhhcCccchhH
Q 010667 64 RKYESIKKTLLHELAHMVYSEHDANF 89 (504)
Q Consensus 64 lp~~~V~~tlLHELaH~vhg~Hd~~F 89 (504)
++...-+.|++|||+|.....+...+
T Consensus 37 ~~~~~~~f~laHELgH~~~~~~~~~~ 62 (122)
T PF06114_consen 37 LSPERQRFTLAHELGHILLHHGDETF 62 (122)
T ss_dssp S-HHHHHHHHHHHHHHHHHHH-HHHH
T ss_pred CCHHHHHHHHHHHHHHHHhhhccccc
Confidence 44566788999999999998887654
No 19
>PF04450 BSP: Peptidase of plants and bacteria; InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=69.36 E-value=3.2 Score=40.99 Aligned_cols=53 Identities=25% Similarity=0.298 Sum_probs=40.4
Q ss_pred cEEEEEeec-CCC---CCccchHHHHHHHHHHHhhhhc-CccchhHHHHHHHHHHHHHHhc
Q 010667 49 EEISLRLRT-DDL---KGFRKYESIKKTLLHELAHMVY-SEHDANFYGLDKQLNQEAVALD 104 (504)
Q Consensus 49 ~~I~LRLR~-~d~---~~Flp~~~V~~tlLHELaH~vh-g~Hd~~F~~L~~~L~~E~~~l~ 104 (504)
-+|.+-++. ... +.+. .+|.++|.|||+||+. .+++..++.|...|. ++..+.
T Consensus 74 ~~I~~S~~~i~~~~~~~~~~--~Ei~Gvl~HE~~H~~Q~~~~~~~P~~liEGIA-DyVRl~ 131 (205)
T PF04450_consen 74 KEIHFSARYIAKYPADGDVR--DEIIGVLYHEMVHCWQWDGRGTAPGGLIEGIA-DYVRLK 131 (205)
T ss_pred cEEEEeHHHHhhcccccchH--HHHHHHHHHHHHHHhhcCCCCCCChhheecHH-HHHHHH
Confidence 788888888 221 2233 7899999999999996 568899999998886 455554
No 20
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=68.52 E-value=6.3 Score=36.17 Aligned_cols=73 Identities=22% Similarity=0.226 Sum_probs=46.2
Q ss_pred eeeceecccCCCCCCCCCCCccccccc--------CCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcC------c
Q 010667 19 WRVGIMTELAPVGYVGVSPKCVLGFNK--------NHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYS------E 84 (504)
Q Consensus 19 w~V~~L~Ef~P~~~~~~s~~~lLGlN~--------N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg------~ 84 (504)
+.+....=+.|.+++.- -.++||.|. |.+-.|+|- +.. ..-||-++=+.+|+|||.|+-.. +
T Consensus 26 ~~~rv~vVys~gS~~~A-~ARIwg~pki~~e~lglnP~YviEl~-sek--F~rLs~~ekvKviiHEllHIP~tfSGgLRa 101 (133)
T COG4900 26 FQVRVVVVYSPGSHSKA-VARIWGIPKIFQEVLGLNPVYVIELL-SEK--FKRLSCAEKVKVIIHELLHIPATFSGGLRA 101 (133)
T ss_pred eeeeEEEEECCCCccee-hhhhhcccHHHHHHhCCCCeeeeeee-hhh--cCCCChHHHHHHHHHHHhcCcccccCcccc
Confidence 34444444455543211 145777663 778888774 221 12377888999999999999753 8
Q ss_pred cc--hhHHHHHHH
Q 010667 85 HD--ANFYGLDKQ 95 (504)
Q Consensus 85 Hd--~~F~~L~~~ 95 (504)
|+ .+||..++-
T Consensus 102 Hg~~vn~rrv~kL 114 (133)
T COG4900 102 HGPLVNFRRVYKL 114 (133)
T ss_pred CCcchhHHHHHHH
Confidence 88 788875543
No 21
>PF05569 Peptidase_M56: BlaR1 peptidase M56; InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription.
Probab=68.47 E-value=5 Score=40.65 Aligned_cols=23 Identities=35% Similarity=0.423 Sum_probs=19.4
Q ss_pred cchHHHHHHHHHHHhhhhcCccc
Q 010667 64 RKYESIKKTLLHELAHMVYSEHD 86 (504)
Q Consensus 64 lp~~~V~~tlLHELaH~vhg~Hd 86 (504)
+.-+++..+++|||+|..++..=
T Consensus 191 ~~~~el~~il~HEl~Hikr~D~~ 213 (299)
T PF05569_consen 191 LSEEELRAILLHELAHIKRRDLL 213 (299)
T ss_pred cCHHHHHHHHHHHHHHHHCCChH
Confidence 45678999999999999988654
No 22
>PRK04897 heat shock protein HtpX; Provisional
Probab=68.23 E-value=3.7 Score=42.31 Aligned_cols=66 Identities=11% Similarity=0.113 Sum_probs=37.7
Q ss_pred hHhhhcCeeeceecccCCCCCCCCCCCcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCcc
Q 010667 12 AIMNKHRWRVGIMTELAPVGYVGVSPKCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSEH 85 (504)
Q Consensus 12 pIM~~~~w~V~~L~Ef~P~~~~~~s~~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~H 85 (504)
-+.++.|+.++.+..+. .. ....-..|++.+++ .|.+- + ..-.-++-+++..||.|||+|.+|+..
T Consensus 88 ~la~~~gip~p~v~v~~-~~---~~NAfa~G~~~~~~-~v~vt--~-gLl~~l~~~El~aVlAHElgHi~~~d~ 153 (298)
T PRK04897 88 DMAMVAQIPMPRVFIID-DP---SPNAFATGSSPKNA-AVAVT--T-GLLAIMNREELEGVIGHEISHIRNYDI 153 (298)
T ss_pred HHHHHcCCCCCcEEEec-CC---CCceEEeccCCCCc-EEEee--h-HHHhhCCHHHHHHHHHHHHHHHhcCCH
Confidence 55666677666665542 11 01134566655433 23332 1 000123569999999999999998764
No 23
>PRK03072 heat shock protein HtpX; Provisional
Probab=67.78 E-value=3.8 Score=42.04 Aligned_cols=23 Identities=22% Similarity=0.315 Sum_probs=19.7
Q ss_pred ccchHHHHHHHHHHHhhhhcCcc
Q 010667 63 FRKYESIKKTLLHELAHMVYSEH 85 (504)
Q Consensus 63 Flp~~~V~~tlLHELaH~vhg~H 85 (504)
-++.+++..||.|||+|..++.-
T Consensus 121 ~l~~~El~aVlAHElgHi~~~d~ 143 (288)
T PRK03072 121 ILNERELRGVLGHELSHVYNRDI 143 (288)
T ss_pred hCCHHHHHHHHHHHHHHHhcCCH
Confidence 34679999999999999999853
No 24
>KOG3931 consensus Uncharacterized conserved protein [Function unknown]
Probab=67.53 E-value=11 Score=40.67 Aligned_cols=51 Identities=29% Similarity=0.432 Sum_probs=39.5
Q ss_pred CCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhh---------cCccchhHHHHHHHHH
Q 010667 46 NHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMV---------YSEHDANFYGLDKQLN 97 (504)
Q Consensus 46 N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~v---------hg~Hd~~F~~L~~~L~ 97 (504)
.+|--..|||-.+ .-..+|..+++.|||||+-|-. .+.|.+.|.+.+..|+
T Consensus 83 g~gg~csIRLSeP-LLkLRPRkDLVETLLHEMIHAYlFV~n~~~dr~GHGP~F~~hMhrIN 142 (484)
T KOG3931|consen 83 GKGGMCSIRLSEP-LLKLRPRKDLVETLLHEMIHAYLFVTNNDKDREGHGPEFCKHMHRIN 142 (484)
T ss_pred cccceEEEEecch-hhccCchHHHHHHHHHHHHHHheeEecccccCCCCChHHHHHHHHHh
Confidence 3455577777664 2358899999999999999953 3589999999887764
No 25
>PRK03982 heat shock protein HtpX; Provisional
Probab=66.71 E-value=4.2 Score=41.53 Aligned_cols=66 Identities=12% Similarity=0.252 Sum_probs=36.9
Q ss_pred hHhhhcCeeeceecccCCCCCCCCCCCcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCcc
Q 010667 12 AIMNKHRWRVGIMTELAPVGYVGVSPKCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSEH 85 (504)
Q Consensus 12 pIM~~~~w~V~~L~Ef~P~~~~~~s~~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~H 85 (504)
.+.++.|+..+.+.-+ +... ...=.+|++.++|- |.| ++..-.-++-++|..||.|||+|..|+..
T Consensus 76 ~la~~~g~~~p~v~v~-~~~~---~NAfa~G~~~~~~~-V~v---t~gLl~~l~~~El~AVlAHElgHi~~~h~ 141 (288)
T PRK03982 76 RLAERANIPKPKVAIV-PTQT---PNAFATGRDPKHAV-VAV---TEGILNLLNEDELEGVIAHELTHIKNRDT 141 (288)
T ss_pred HHHHHcCCCCCeEEEE-eCCC---cceEEeccCCCCeE-EEe---ehHHHhhCCHHHHHHHHHHHHHHHHcCCH
Confidence 5666666655544333 1110 11335666554433 322 11111124678999999999999999854
No 26
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=65.73 E-value=4.1 Score=40.60 Aligned_cols=24 Identities=21% Similarity=0.314 Sum_probs=20.6
Q ss_pred ccchHHHHHHHHHHHhhhhcCccc
Q 010667 63 FRKYESIKKTLLHELAHMVYSEHD 86 (504)
Q Consensus 63 Flp~~~V~~tlLHELaH~vhg~Hd 86 (504)
-+.-++|..||.||++|.+|+..=
T Consensus 151 ~l~~dEl~aVlaHElgHi~~rd~~ 174 (302)
T COG0501 151 LLNDDELEAVLAHELGHIKNRHTL 174 (302)
T ss_pred hCCHHHHHHHHHHHHHHHhcccHH
Confidence 466789999999999999998544
No 27
>PRK03001 M48 family peptidase; Provisional
Probab=64.78 E-value=4.8 Score=40.98 Aligned_cols=65 Identities=15% Similarity=0.187 Sum_probs=36.3
Q ss_pred hHhhhcCeeeceecccCCCCCCCCCCCcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCc
Q 010667 12 AIMNKHRWRVGIMTELAPVGYVGVSPKCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSE 84 (504)
Q Consensus 12 pIM~~~~w~V~~L~Ef~P~~~~~~s~~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~ 84 (504)
-+.++.|+.++.+.-+.-. ....-..|.+.+++. |.|- . ..-.-+.-+++..||+|||+|.+|+.
T Consensus 75 ~la~~~g~~~p~v~v~~~~----~~NAfa~G~~~~~~~-Ivvt--~-gLl~~l~~~El~aVlAHElgHi~~~h 139 (283)
T PRK03001 75 ELAQRAGLPMPKVYLINED----QPNAFATGRNPEHAA-VAAT--T-GILRVLSEREIRGVMAHELAHVKHRD 139 (283)
T ss_pred HHHHHcCCCCCeEEEecCC----CcceEEecCCCCCeE-EEec--H-HHHhhCCHHHHHHHHHHHHHHHhCCC
Confidence 5566667766655443211 011335676555443 2221 0 00011356899999999999999873
No 28
>PF10460 Peptidase_M30: Peptidase M30; InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue.
Probab=64.41 E-value=5.9 Score=42.54 Aligned_cols=41 Identities=27% Similarity=0.248 Sum_probs=28.1
Q ss_pred ccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCc
Q 010667 44 NKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSE 84 (504)
Q Consensus 44 N~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~ 84 (504)
.-|.|+.+.|-.-+-...+.--.+.+..||+||+-|+++-.
T Consensus 114 ~SNe~e~~YiD~~~~~~~~~~~~~~~~sTlAHEfQHmInfy 154 (366)
T PF10460_consen 114 YSNESEYFYIDSETLYLGGNSGPDTVYSTLAHEFQHMINFY 154 (366)
T ss_pred CCcceeEEEEecHHhhccCCccHHHHHHHHHHHHHHHHHHH
Confidence 34778888887665211112226889999999999998653
No 29
>PRK02391 heat shock protein HtpX; Provisional
Probab=62.92 E-value=5.4 Score=41.25 Aligned_cols=21 Identities=24% Similarity=0.417 Sum_probs=18.7
Q ss_pred cchHHHHHHHHHHHhhhhcCc
Q 010667 64 RKYESIKKTLLHELAHMVYSE 84 (504)
Q Consensus 64 lp~~~V~~tlLHELaH~vhg~ 84 (504)
+.-+++..++.|||+|..|+.
T Consensus 128 L~~~El~aVlaHElgHi~~~d 148 (296)
T PRK02391 128 LDPDELEAVLAHELSHVKNRD 148 (296)
T ss_pred CCHHHHHHHHHHHHHHHHcCC
Confidence 456999999999999999985
No 30
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=62.69 E-value=5.4 Score=37.77 Aligned_cols=44 Identities=16% Similarity=0.073 Sum_probs=28.4
Q ss_pred ccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCccchh
Q 010667 40 VLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 40 lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~Hd~~ 88 (504)
.+|.+. +..|.|--.-= .-++-+++..+|.|||+|..++..-..
T Consensus 65 ~~g~~~--~~~I~v~~~ll---~~~~~~el~aVlaHElgH~~~~h~~~~ 108 (226)
T PF01435_consen 65 ATGGGP--RKRIVVTSGLL---ESLSEDELAAVLAHELGHIKHRHILKS 108 (226)
T ss_dssp EETTTC----EEEEEHHHH---HHSSHHHHHHHHHHHHHHHHTTHCCCC
T ss_pred EEccCC--CcEEEEeChhh---hcccHHHHHHHHHHHHHHHHcCCcchH
Confidence 344433 44555543321 235668999999999999998876655
No 31
>PRK02870 heat shock protein HtpX; Provisional
Probab=62.42 E-value=5.4 Score=42.19 Aligned_cols=21 Identities=19% Similarity=0.430 Sum_probs=18.0
Q ss_pred cchHHHHHHHHHHHhhhhcCc
Q 010667 64 RKYESIKKTLLHELAHMVYSE 84 (504)
Q Consensus 64 lp~~~V~~tlLHELaH~vhg~ 84 (504)
++-+++..||.|||+|..|+.
T Consensus 168 L~~dEL~aVlAHELgHik~~d 188 (336)
T PRK02870 168 LDRDELQAVMAHELSHIRHGD 188 (336)
T ss_pred CCHHHHHHHHHHHHHHHHccc
Confidence 466899999999999998753
No 32
>PRK05457 heat shock protein HtpX; Provisional
Probab=62.19 E-value=5.7 Score=40.82 Aligned_cols=21 Identities=19% Similarity=0.477 Sum_probs=18.8
Q ss_pred cchHHHHHHHHHHHhhhhcCc
Q 010667 64 RKYESIKKTLLHELAHMVYSE 84 (504)
Q Consensus 64 lp~~~V~~tlLHELaH~vhg~ 84 (504)
++-+++..++.|||+|..|+.
T Consensus 129 L~~~El~aVlAHElgHi~~~d 149 (284)
T PRK05457 129 MSRDEVEAVLAHEISHIANGD 149 (284)
T ss_pred CCHHHHHHHHHHHHHHHHcCC
Confidence 566999999999999999974
No 33
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=60.78 E-value=3.5 Score=41.01 Aligned_cols=18 Identities=28% Similarity=0.512 Sum_probs=16.6
Q ss_pred HHHHHHHhhhhcCccchh
Q 010667 71 KTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 71 ~tlLHELaH~vhg~Hd~~ 88 (504)
.||.|||.||.-++||..
T Consensus 147 ~t~AHElGHnLGm~HD~~ 164 (228)
T cd04271 147 QVFAHEIGHTFGAVHDCT 164 (228)
T ss_pred eehhhhhhhhcCCCCCCC
Confidence 699999999999999974
No 34
>PRK01345 heat shock protein HtpX; Provisional
Probab=60.27 E-value=6.4 Score=41.12 Aligned_cols=21 Identities=29% Similarity=0.395 Sum_probs=18.3
Q ss_pred cchHHHHHHHHHHHhhhhcCc
Q 010667 64 RKYESIKKTLLHELAHMVYSE 84 (504)
Q Consensus 64 lp~~~V~~tlLHELaH~vhg~ 84 (504)
+.-++|..||.|||+|.+|+.
T Consensus 119 L~~dEL~aVlAHElgHi~~~d 139 (317)
T PRK01345 119 LSPEEVAGVMAHELAHVKNRD 139 (317)
T ss_pred CCHHHHHHHHHHHHHHHHcCC
Confidence 456899999999999999864
No 35
>PRK01265 heat shock protein HtpX; Provisional
Probab=58.53 E-value=7.1 Score=41.18 Aligned_cols=65 Identities=17% Similarity=0.129 Sum_probs=36.7
Q ss_pred hHhhhcCeeeceecccCCCCCCCCCCCcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCc
Q 010667 12 AIMNKHRWRVGIMTELAPVGYVGVSPKCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSE 84 (504)
Q Consensus 12 pIM~~~~w~V~~L~Ef~P~~~~~~s~~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~ 84 (504)
-+.++.|+.++.+.-+.-. ....-..|.+... ..|.|---- -.-+.-+++..|+.|||+|..|+.
T Consensus 91 ~la~~~g~~~p~vyv~~~~----~~NAfa~G~~~~~-~~Ivvt~gL---l~~l~~~El~aVlAHElgHik~~d 155 (324)
T PRK01265 91 EVAKYNGIRVPKVYIADVP----FPNAFAYGSPIAG-KRIAITLPL---LKILNRDEIKAVAGHELGHLKHRD 155 (324)
T ss_pred HHHHHcCCCCCeEEEecCC----CCCeEEeccCCCC-CEEEEehHH---HhhCCHHHHHHHHHHHHHHHHccc
Confidence 4555666666555443211 0113345655433 445442111 112567999999999999998874
No 36
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=56.92 E-value=10 Score=37.95 Aligned_cols=40 Identities=20% Similarity=0.189 Sum_probs=29.1
Q ss_pred ccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCccchh
Q 010667 44 NKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 44 N~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~Hd~~ 88 (504)
|-+.+-.|.| +. ..-.....+..||.|||=||+--.|.+-
T Consensus 113 ~G~P~~~I~I---~~--~~~~~~~~~~hvi~HEiGH~IGfRHTD~ 152 (211)
T PF12388_consen 113 NGNPYKFIQI---YG--LSNYSVNVIEHVITHEIGHCIGFRHTDY 152 (211)
T ss_pred CCCCCceEEE---Ee--cCCCchhHHHHHHHHHhhhhccccccCc
Confidence 4456778888 31 1223556789999999999999999763
No 37
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=54.83 E-value=8.8 Score=36.14 Aligned_cols=25 Identities=24% Similarity=0.263 Sum_probs=16.5
Q ss_pred cchHHHHHHHHHHHhhhhcCccchh
Q 010667 64 RKYESIKKTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 64 lp~~~V~~tlLHELaH~vhg~Hd~~ 88 (504)
.+...-..|+.|||.|+.-.+||..
T Consensus 137 ~~~~~~~~~~AHEiGH~lGa~HD~~ 161 (196)
T PF13688_consen 137 PPTYNGAITFAHEIGHNLGAPHDGD 161 (196)
T ss_dssp --HHHHHHHHHHHHHHHTT-----S
T ss_pred CCCCceehhhHHhHHHhcCCCCCCC
Confidence 5577888999999999999999876
No 38
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=53.61 E-value=8.3 Score=37.53 Aligned_cols=20 Identities=40% Similarity=0.517 Sum_probs=18.2
Q ss_pred HHHHHHHHHhhhhcCccchh
Q 010667 69 IKKTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 69 V~~tlLHELaH~vhg~Hd~~ 88 (504)
...||.|||+|+.-++||..
T Consensus 145 ~~~~~AHElGH~lG~~HD~~ 164 (220)
T cd04272 145 GVYTMTHELAHLLGAPHDGS 164 (220)
T ss_pred cHHHHHHHHHHHhCCCCCCC
Confidence 36999999999999999976
No 39
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=53.02 E-value=7.4 Score=33.82 Aligned_cols=16 Identities=38% Similarity=0.580 Sum_probs=11.7
Q ss_pred HHHHHHHhhhhcCccc
Q 010667 71 KTLLHELAHMVYSEHD 86 (504)
Q Consensus 71 ~tlLHELaH~vhg~Hd 86 (504)
.|+.|||-|+.-.+||
T Consensus 109 ~~~~HEiGH~lGl~Hd 124 (124)
T PF13582_consen 109 DTFAHEIGHNLGLNHD 124 (124)
T ss_dssp THHHHHHHHHTT----
T ss_pred eEeeehhhHhcCCCCC
Confidence 8999999999999996
No 40
>PF13058 DUF3920: Protein of unknown function (DUF3920)
Probab=52.51 E-value=8.3 Score=35.27 Aligned_cols=20 Identities=45% Similarity=0.795 Sum_probs=17.6
Q ss_pred ccchHHHHHHHHHHHhhhhc
Q 010667 63 FRKYESIKKTLLHELAHMVY 82 (504)
Q Consensus 63 Flp~~~V~~tlLHELaH~vh 82 (504)
|-.|+.|+.|++||+-|-..
T Consensus 70 we~y~qvlktllhefrh~mQ 89 (126)
T PF13058_consen 70 WEEYEQVLKTLLHEFRHAMQ 89 (126)
T ss_pred hHHHHHHHHHHHHHHHHHHH
Confidence 66789999999999999765
No 41
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=50.65 E-value=12 Score=35.50 Aligned_cols=23 Identities=35% Similarity=0.390 Sum_probs=20.2
Q ss_pred hHHHHHHHHHHHhhhhcCccchh
Q 010667 66 YESIKKTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 66 ~~~V~~tlLHELaH~vhg~Hd~~ 88 (504)
+.....|+.|||.|+.-++||..
T Consensus 128 ~~~~a~~~AHElGH~lG~~HD~~ 150 (194)
T cd04269 128 LLLFAVTMAHELGHNLGMEHDDG 150 (194)
T ss_pred hHHHHHHHHHHHHhhcCCCcCCC
Confidence 44567899999999999999987
No 42
>PRK14015 pepN aminopeptidase N; Provisional
Probab=49.31 E-value=10 Score=44.94 Aligned_cols=20 Identities=30% Similarity=0.443 Sum_probs=17.0
Q ss_pred chHHHHHHHHHHHhhhhcCc
Q 010667 65 KYESIKKTLLHELAHMVYSE 84 (504)
Q Consensus 65 p~~~V~~tlLHELaH~vhg~ 84 (504)
-+..+..++.|||+|+++||
T Consensus 292 ~~~~i~~vIaHElaHqWFGN 311 (875)
T PRK14015 292 DYERIESVIAHEYFHNWTGN 311 (875)
T ss_pred HHHHHHHHHHHHHHHHHHhC
Confidence 34568899999999999995
No 43
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=48.88 E-value=11 Score=44.77 Aligned_cols=21 Identities=29% Similarity=0.412 Sum_probs=18.0
Q ss_pred cchHHHHHHHHHHHhhhhcCc
Q 010667 64 RKYESIKKTLLHELAHMVYSE 84 (504)
Q Consensus 64 lp~~~V~~tlLHELaH~vhg~ 84 (504)
..+..|..++.||++|+++|+
T Consensus 278 ~~~~~i~~VIaHElaHqWfGN 298 (863)
T TIGR02414 278 ADYERIESVIAHEYFHNWTGN 298 (863)
T ss_pred HHHHHHHHHHHHHHHHHHhcc
Confidence 346778999999999999995
No 44
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=48.56 E-value=9 Score=36.10 Aligned_cols=22 Identities=36% Similarity=0.427 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHhhhhcCccchh
Q 010667 67 ESIKKTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 67 ~~V~~tlLHELaH~vhg~Hd~~ 88 (504)
.....|+.|||.|+.-..||..
T Consensus 131 ~~~~~~~aHElGH~lG~~HD~~ 152 (192)
T cd04267 131 LLTALTMAHELGHNLGAEHDGG 152 (192)
T ss_pred eeehhhhhhhHHhhcCCcCCCC
Confidence 4556799999999999999976
No 45
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=48.46 E-value=11 Score=44.50 Aligned_cols=25 Identities=28% Similarity=0.456 Sum_probs=20.5
Q ss_pred CccchHHHHHHHHHHHhhhhcCccc
Q 010667 62 GFRKYESIKKTLLHELAHMVYSEHD 86 (504)
Q Consensus 62 ~Flp~~~V~~tlLHELaH~vhg~Hd 86 (504)
.--.|+.|-.|.+|||||++.|+==
T Consensus 300 t~~~~~~~~~viaHElaHqWfGnlV 324 (859)
T COG0308 300 TDSDYENVEEVIAHELAHQWFGNLV 324 (859)
T ss_pred hhHHHHHHHHHHHHHHhhhccccee
Confidence 3445688889999999999999754
No 46
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=48.32 E-value=11 Score=44.03 Aligned_cols=18 Identities=39% Similarity=0.617 Sum_probs=15.9
Q ss_pred hHHHHHHHHHHHhhhhcC
Q 010667 66 YESIKKTLLHELAHMVYS 83 (504)
Q Consensus 66 ~~~V~~tlLHELaH~vhg 83 (504)
...+..+++|||||.++|
T Consensus 284 ~~~~~~viaHElAHqWFG 301 (831)
T TIGR02412 284 KENRAGVILHEMAHMWFG 301 (831)
T ss_pred HHHHHHHHHHHHHHHHhC
Confidence 456789999999999999
No 47
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=48.02 E-value=12 Score=35.20 Aligned_cols=20 Identities=35% Similarity=0.487 Sum_probs=16.4
Q ss_pred CccchHHHHHHHHHHHhhhh
Q 010667 62 GFRKYESIKKTLLHELAHMV 81 (504)
Q Consensus 62 ~Flp~~~V~~tlLHELaH~v 81 (504)
.|.||-.-+.|+.|||+|=|
T Consensus 128 ~f~~~~~~lDVvaHEltHGV 147 (150)
T PF01447_consen 128 IFKPFASSLDVVAHELTHGV 147 (150)
T ss_dssp SBS-GGG-HHHHHHHHHHHH
T ss_pred ccccCccccceeeecccccc
Confidence 59999999999999999976
No 48
>PF14521 Aspzincin_M35: Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=46.58 E-value=11 Score=35.02 Aligned_cols=31 Identities=26% Similarity=0.227 Sum_probs=17.6
Q ss_pred EEEEEeec-CCCCCccchHHHHHHHHHHHhhhhc
Q 010667 50 EISLRLRT-DDLKGFRKYESIKKTLLHELAHMVY 82 (504)
Q Consensus 50 ~I~LRLR~-~d~~~Flp~~~V~~tlLHELaH~vh 82 (504)
+|.|.-.- ..+. --...=..||+|||+|..-
T Consensus 78 ~IyLc~~F~~~p~--~g~~Sk~~TLiHE~SHf~~ 109 (148)
T PF14521_consen 78 TIYLCPAFFSAPT--TGKDSKEGTLIHEWSHFTA 109 (148)
T ss_dssp EEEE-HHHHHS-S--SSTT-HHHHHHHHHHHSCC
T ss_pred EEEEChhhcCCCC--CCCCchHHHHHHhhhhhhh
Confidence 57776554 2221 1223458999999999543
No 49
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=46.17 E-value=12 Score=35.37 Aligned_cols=20 Identities=35% Similarity=0.423 Sum_probs=15.6
Q ss_pred HHHHHHHHHhhhhcCccchh
Q 010667 69 IKKTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 69 V~~tlLHELaH~vhg~Hd~~ 88 (504)
-..|+.|||.|..-.+|+..
T Consensus 111 ~~~~~aHElGH~lGa~Hd~~ 130 (173)
T PF13574_consen 111 GIDTFAHELGHQLGAPHDFD 130 (173)
T ss_dssp HHHHHHHHHHHHHT---SSS
T ss_pred eeeeehhhhHhhcCCCCCCC
Confidence 67789999999999999988
No 50
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=45.19 E-value=12 Score=42.43 Aligned_cols=16 Identities=31% Similarity=0.314 Sum_probs=14.2
Q ss_pred HHHHHHHHHhhhhcCc
Q 010667 69 IKKTLLHELAHMVYSE 84 (504)
Q Consensus 69 V~~tlLHELaH~vhg~ 84 (504)
...|++|||||.++|+
T Consensus 279 ~~~viaHElAHqWfGN 294 (601)
T TIGR02411 279 NVDVIAHELAHSWSGN 294 (601)
T ss_pred hhhhHHHHHHhhccCc
Confidence 4689999999999994
No 51
>PF01433 Peptidase_M1: Peptidase family M1 This is family M1 in the peptidase classification.; InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=43.27 E-value=17 Score=37.54 Aligned_cols=19 Identities=32% Similarity=0.503 Sum_probs=17.0
Q ss_pred hHHHHHHHHHHHhhhhcCc
Q 010667 66 YESIKKTLLHELAHMVYSE 84 (504)
Q Consensus 66 ~~~V~~tlLHELaH~vhg~ 84 (504)
...+..+++|||||.++|+
T Consensus 292 ~~~~~~~iahElahqWfGn 310 (390)
T PF01433_consen 292 KQEIASLIAHELAHQWFGN 310 (390)
T ss_dssp HHHHHHHHHHHHHTTTBTT
T ss_pred hhhhHHHHHHHHHHHHhcc
Confidence 4678999999999999995
No 52
>KOG1046 consensus Puromycin-sensitive aminopeptidase and related aminopeptidases [Amino acid transport and metabolism; Posttranslational modification, protein turnover, chaperones]
Probab=42.11 E-value=16 Score=43.23 Aligned_cols=21 Identities=24% Similarity=0.482 Sum_probs=18.4
Q ss_pred cchHHHHHHHHHHHhhhhcCc
Q 010667 64 RKYESIKKTLLHELAHMVYSE 84 (504)
Q Consensus 64 lp~~~V~~tlLHELaH~vhg~ 84 (504)
.-+..|..+++|||||.++||
T Consensus 319 ~~k~~va~vIaHElAHQWFGN 339 (882)
T KOG1046|consen 319 SNKQRVAEVIAHELAHQWFGN 339 (882)
T ss_pred HHHHHHHHHHHHHHHHHHhcC
Confidence 336789999999999999996
No 53
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=40.07 E-value=22 Score=33.87 Aligned_cols=25 Identities=32% Similarity=0.365 Sum_probs=18.5
Q ss_pred cchHHHHHHHHHHHhhhhcCccchh
Q 010667 64 RKYESIKKTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 64 lp~~~V~~tlLHELaH~vhg~Hd~~ 88 (504)
..+-..-.+|.|||+|+.-++||..
T Consensus 126 ~~~~~~a~~~AHelGH~lGm~HD~~ 150 (199)
T PF01421_consen 126 RSGLSFAVIIAHELGHNLGMPHDGD 150 (199)
T ss_dssp SSHHHHHHHHHHHHHHHTT---TTT
T ss_pred chhHHHHHHHHHHHHHhcCCCCCCC
Confidence 3456677899999999999999988
No 54
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=39.29 E-value=15 Score=31.01 Aligned_cols=33 Identities=27% Similarity=0.315 Sum_probs=19.8
Q ss_pred CccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcC
Q 010667 47 HGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYS 83 (504)
Q Consensus 47 ~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg 83 (504)
.|..|.++--..++.. ..=...|.|||+|.+.-
T Consensus 43 ~G~~I~f~~g~~~~~s----~~~~~llaHEl~Hv~Qq 75 (79)
T PF13699_consen 43 VGNDIYFAPGKYNPDS----PEGRALLAHELAHVVQQ 75 (79)
T ss_pred ECCEEEEcCCCcCCCC----CCcchhHhHHHHHHHhh
Confidence 5888988421111100 11367899999998854
No 55
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=39.08 E-value=21 Score=31.51 Aligned_cols=17 Identities=47% Similarity=0.596 Sum_probs=14.0
Q ss_pred hHHHHHHHHHHHhhhhc
Q 010667 66 YESIKKTLLHELAHMVY 82 (504)
Q Consensus 66 ~~~V~~tlLHELaH~vh 82 (504)
-+.|+.|++||+||..-
T Consensus 70 ~~~I~~tlvhEiah~fG 86 (97)
T PF06262_consen 70 AELIRDTLVHEIAHHFG 86 (97)
T ss_dssp HHHHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHHHHHHHcC
Confidence 35799999999999753
No 56
>PF08919 F_actin_bind: F-actin binding; InterPro: IPR015015 The F-actin binding domain forms a compact bundle of four antiparallel alpha-helices, which are arranged in a left-handed topology. Binding of F-actin to the F-actin binding domain may result in cytoplasmic retention and subcellular distribution of the protein, as well as possible inhibition of protein function []. ; GO: 0004715 non-membrane spanning protein tyrosine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1ZZP_A 2KK1_A.
Probab=38.58 E-value=41 Score=30.49 Aligned_cols=73 Identities=12% Similarity=0.170 Sum_probs=54.3
Q ss_pred hhhhhhhhHHHHHHHHHHHhc----CChhHHHHHHHHHHHHHHHHhc--CCCccccchhcccChhHHHhhcChhhHHHHH
Q 010667 387 QDTVTVVCSRLQKAIEMLRAE----VSPLESTTVLQTLCKIIRNVIE--HPDETKYKRLRKANPIIQRSVANYKAAMEIL 460 (504)
Q Consensus 387 ~e~v~~~~e~L~~Al~~L~~~----n~~e~~~~al~tL~Kil~NIl~--~P~e~KyRrIR~sN~~Fq~kV~~~~Ga~E~L 460 (504)
++.|...++-|+.+|..+..+ ..--++...+..|+.+|.+.+. -|--.|| .|.+.|.+++.-..-|
T Consensus 7 k~~Ile~~~~L~~~l~~l~~~~~~~~~~~qL~D~~~~l~~~C~~yaD~~~~p~~KF--------~FREllsrLE~~~rqL 78 (110)
T PF08919_consen 7 KESILELAELLENALSNLKSSSISMAQWSQLSDKLQQLHSSCSGYADSIIQPHAKF--------AFRELLSRLESQSRQL 78 (110)
T ss_dssp HHHHHHHHHHHHHHCCCTTTC------HHHHHHHHHHHHHHHHHHGGG-S-CCCHH--------HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHhchhhhhHHHHHHHHHHHHHHHHHHHHHHHHcCcCcchhh--------hHHHHHHHHHHHHHHH
Confidence 455666777788888777666 2335667788889999999887 3456788 5999999999999999
Q ss_pred HHcCCcc
Q 010667 461 FLVGFNE 467 (504)
Q Consensus 461 ~alGF~e 467 (504)
..+|=.-
T Consensus 79 r~~~s~~ 85 (110)
T PF08919_consen 79 RSCGSSN 85 (110)
T ss_dssp CHSSSSS
T ss_pred HhccCCC
Confidence 9887544
No 57
>PF02102 Peptidase_M35: Deuterolysin metalloprotease (M35) family; InterPro: IPR001384 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M35 (deuterolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Deuterolysin is a microbial zinc-containing metalloprotease that shows some similarity to thermolysin []. The protein is expressed with a possible 19-residue signal sequence, a 155-residue propeptide, and an active peptide of 177 residues []. The latter contains an HEXXH motif towards the C terminus, but the other zinc ligands are as yet undetermined [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 1EB6_A.
Probab=37.76 E-value=17 Score=39.06 Aligned_cols=21 Identities=33% Similarity=0.477 Sum_probs=12.5
Q ss_pred HHHHHHHHHHhhhh--cCccchh
Q 010667 68 SIKKTLLHELAHMV--YSEHDAN 88 (504)
Q Consensus 68 ~V~~tlLHELaH~v--hg~Hd~~ 88 (504)
+--.|+||||+|+. ++|+-.+
T Consensus 296 Dqatt~LHE~TH~~~V~~pgt~D 318 (359)
T PF02102_consen 296 DQATTTLHEMTHAPAVYSPGTDD 318 (359)
T ss_dssp -HHHHHHHHHHT-TTTSSS--B-
T ss_pred CccchhhhhhhccccccCCCccc
Confidence 35689999999995 5554443
No 58
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=37.57 E-value=22 Score=31.80 Aligned_cols=24 Identities=17% Similarity=0.362 Sum_probs=20.1
Q ss_pred cchHHHHHHHHHHHhhhhcCccch
Q 010667 64 RKYESIKKTLLHELAHMVYSEHDA 87 (504)
Q Consensus 64 lp~~~V~~tlLHELaH~vhg~Hd~ 87 (504)
.+...+..|++|||-|..-..|+.
T Consensus 100 ~~~~~~~~v~~HEiGHaLGL~H~~ 123 (154)
T PF00413_consen 100 DSGNDLQSVAIHEIGHALGLDHSN 123 (154)
T ss_dssp SSSEEHHHHHHHHHHHHTTBESSS
T ss_pred hhhhhhhhhhhhccccccCcCcCC
Confidence 345578999999999999999873
No 59
>PF10023 DUF2265: Predicted aminopeptidase (DUF2265); InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=36.98 E-value=40 Score=36.04 Aligned_cols=43 Identities=23% Similarity=0.358 Sum_probs=34.2
Q ss_pred chHHHHHHHHHHHhhhh-cCccchhHHHHHHHHHHHHHHhcccc
Q 010667 65 KYESIKKTLLHELAHMV-YSEHDANFYGLDKQLNQEAVALDWTK 107 (504)
Q Consensus 65 p~~~V~~tlLHELaH~v-hg~Hd~~F~~L~~~L~~E~~~l~~~~ 107 (504)
|-..+..+++|||||-+ +-+-|..|-+-+.....+.-...|..
T Consensus 161 ~~~~LA~LIfHELaHq~~Yv~~dt~FNEsfAtfVe~~G~~~wl~ 204 (337)
T PF10023_consen 161 PDGELARLIFHELAHQTLYVKGDTAFNESFATFVEREGARRWLA 204 (337)
T ss_pred CchHHHHHHHHHHhhceeecCCCchhhHHHHHHHHHHHHHHHHH
Confidence 44578999999999987 66888889888888887776777743
No 60
>COG4219 MecR1 Antirepressor regulating drug resistance, predicted signal transduction N-terminal membrane component [Transcription / Signal transduction mechanisms]
Probab=35.45 E-value=23 Score=37.72 Aligned_cols=21 Identities=33% Similarity=0.510 Sum_probs=18.3
Q ss_pred cchHHHHHHHHHHHhhhhcCc
Q 010667 64 RKYESIKKTLLHELAHMVYSE 84 (504)
Q Consensus 64 lp~~~V~~tlLHELaH~vhg~ 84 (504)
++-+++.++++|||+|..+|.
T Consensus 185 ~~~ee~~yIilHEl~Hlk~gD 205 (337)
T COG4219 185 LTDEELKYIILHELSHLKRGD 205 (337)
T ss_pred cCHHhhhhhHhHHHhhhhccc
Confidence 566889999999999999884
No 61
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=35.06 E-value=9 Score=36.93 Aligned_cols=19 Identities=37% Similarity=0.552 Sum_probs=17.6
Q ss_pred HHHHHHHHhhhhcCccchh
Q 010667 70 KKTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 70 ~~tlLHELaH~vhg~Hd~~ 88 (504)
..||.|||.|+.-.+||..
T Consensus 141 a~~~aHElGH~LG~~HD~~ 159 (207)
T cd04273 141 AFTIAHELGHVLGMPHDGD 159 (207)
T ss_pred EEeeeeechhhcCCCCCCC
Confidence 5799999999999999987
No 62
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=34.75 E-value=22 Score=32.76 Aligned_cols=39 Identities=21% Similarity=0.233 Sum_probs=22.3
Q ss_pred ccccccC-CccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhc
Q 010667 40 VLGFNKN-HGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVY 82 (504)
Q Consensus 40 lLGlN~N-~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vh 82 (504)
|--+|.| .|+-+-...=.+|. .-.-.+.||.|||+|.+.
T Consensus 53 F~~L~~~LCG~~~~~i~IDP~~----~~KGC~~TL~HEL~H~WQ 92 (141)
T PHA02456 53 FRALPQDLCGQFVGWIEIDPDY----ANKGCRDTLAHELNHAWQ 92 (141)
T ss_pred hhhcCcchhhcceeEEEECCcc----cccchHHHHHHHHHHHHh
Confidence 3335555 45544333333221 113478999999999874
No 63
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=34.70 E-value=20 Score=32.43 Aligned_cols=22 Identities=32% Similarity=0.394 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHhhhhcCccchh
Q 010667 67 ESIKKTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 67 ~~V~~tlLHELaH~vhg~Hd~~ 88 (504)
.....|++|||.|..-..|+..
T Consensus 94 ~~~~~~~~HElGH~LGl~H~~~ 115 (167)
T cd00203 94 KEGAQTIAHELGHALGFYHDHD 115 (167)
T ss_pred ccchhhHHHHHHHHhCCCccCc
Confidence 4578999999999999999865
No 64
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=34.57 E-value=17 Score=38.01 Aligned_cols=35 Identities=37% Similarity=0.354 Sum_probs=24.0
Q ss_pred cchHHHHHHHHHHHhhhhc--CccchhHHHHHHHHHH
Q 010667 64 RKYESIKKTLLHELAHMVY--SEHDANFYGLDKQLNQ 98 (504)
Q Consensus 64 lp~~~V~~tlLHELaH~vh--g~Hd~~F~~L~~~L~~ 98 (504)
+|.-.+=-|+||||||-.- ..=.++|++.+.-.+.
T Consensus 191 ~p~~~~P~T~~HElAHq~G~a~E~EANFiayLac~~s 227 (318)
T PF12725_consen 191 LPPYSLPFTICHELAHQLGFASEDEANFIAYLACINS 227 (318)
T ss_pred CCcccccHHHHHHHHHHhCCCCHHHHHHHHHHHHhcC
Confidence 4555566799999999872 2445778877654443
No 65
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=33.63 E-value=31 Score=31.28 Aligned_cols=22 Identities=27% Similarity=0.391 Sum_probs=19.3
Q ss_pred HHHHHHHHHHHhhhhcCccchh
Q 010667 67 ESIKKTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 67 ~~V~~tlLHELaH~vhg~Hd~~ 88 (504)
.....|++|||-|..-..|...
T Consensus 92 ~~~~~~~~HEiGHaLGL~H~~~ 113 (165)
T cd04268 92 ARLRNTAEHELGHALGLRHNFA 113 (165)
T ss_pred HHHHHHHHHHHHHHhcccccCc
Confidence 4689999999999999988765
No 66
>PF13203 DUF2201_N: Putative metallopeptidase domain
Probab=33.18 E-value=32 Score=34.85 Aligned_cols=20 Identities=30% Similarity=0.328 Sum_probs=17.1
Q ss_pred chHHHHHHHHHHHhhhhcCc
Q 010667 65 KYESIKKTLLHELAHMVYSE 84 (504)
Q Consensus 65 p~~~V~~tlLHELaH~vhg~ 84 (504)
+.+.+..+++||+-||+++.
T Consensus 56 ~~~~~~~~l~HevlH~~~~H 75 (292)
T PF13203_consen 56 SPEERVGLLLHEVLHCLLRH 75 (292)
T ss_pred CHHHHHHHHHHHHHHHHccc
Confidence 45778899999999999874
No 67
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=33.09 E-value=44 Score=36.79 Aligned_cols=73 Identities=18% Similarity=0.136 Sum_probs=48.2
Q ss_pred chhhHhhhcCeeeceecccCCCCCCCCCCCcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcC
Q 010667 9 GIIAIMNKHRWRVGIMTELAPVGYVGVSPKCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYS 83 (504)
Q Consensus 9 ~i~pIM~~~~w~V~~L~Ef~P~~~~~~s~~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg 83 (504)
.|.-.++..||-...+-+.--.-..+.|.+-+.|+-.|+.=+|.=-|--. ..-+.=++|..+++|||.|-.|+
T Consensus 222 ~Ie~la~s~gfp~~k~~vi~~s~rs~hsNAyfyG~~~~KRIvIyDtLl~~--~~~~~~eel~AVl~HELGHW~~~ 294 (428)
T KOG2719|consen 222 KIERLADSVGFPLSKYRVIDGSKRSSHSNAYFYGLCKNKRIVIYDTLLLE--EEHLNNEELVAVLAHELGHWKLN 294 (428)
T ss_pred HHHHHHHhcCCCceEEEEEecCCCCCCCCeeeeeccccceEEEehhhhhh--hhccccHHHHHHHHHHhhHHHHh
Confidence 45577788888777776666222234566789999888776665333310 00034589999999999998776
No 68
>PF10463 Peptidase_U49: Peptidase U49; InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death.
Probab=32.04 E-value=38 Score=33.81 Aligned_cols=35 Identities=34% Similarity=0.329 Sum_probs=22.0
Q ss_pred HHHHHHHHHhhhhcCccchhHHHHHHHHHHHHHHh
Q 010667 69 IKKTLLHELAHMVYSEHDANFYGLDKQLNQEAVAL 103 (504)
Q Consensus 69 V~~tlLHELaH~vhg~Hd~~F~~L~~~L~~E~~~l 103 (504)
+.-.|+||++|.+++.=.......-.+.+.+|+..
T Consensus 101 ~~fil~HE~~Hv~~~h~~~~~~~~~~~eE~~AD~~ 135 (206)
T PF10463_consen 101 IAFILLHELAHVVLGHEGDSSPSQSIQEEKEADSY 135 (206)
T ss_pred HHHHHHHHHHHHHHcCccccccchhHHHHHhhhHH
Confidence 34679999999887743332555555555555543
No 69
>PF14891 Peptidase_M91: Effector protein
Probab=31.01 E-value=23 Score=33.67 Aligned_cols=19 Identities=26% Similarity=0.160 Sum_probs=15.2
Q ss_pred HHHHHHHHhhhhcCccchh
Q 010667 70 KKTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 70 ~~tlLHELaH~vhg~Hd~~ 88 (504)
.-+|.|||+|..|.-++..
T Consensus 104 ~v~L~HEL~HA~~~~~Gt~ 122 (174)
T PF14891_consen 104 FVVLYHELIHAYDYMNGTM 122 (174)
T ss_pred HHHHHHHHHHHHHHHCCCC
Confidence 5689999999988766654
No 70
>PF09768 Peptidase_M76: Peptidase M76 family; InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=28.94 E-value=43 Score=32.51 Aligned_cols=20 Identities=35% Similarity=0.481 Sum_probs=16.2
Q ss_pred CccchHHHHHHHHHHHhhhh
Q 010667 62 GFRKYESIKKTLLHELAHMV 81 (504)
Q Consensus 62 ~Flp~~~V~~tlLHELaH~v 81 (504)
.+.....+..||.|||.|..
T Consensus 64 ~~~~~~~l~~~l~HELIHay 83 (173)
T PF09768_consen 64 RIRSQGHLEDTLTHELIHAY 83 (173)
T ss_pred CCCCHHHHHHHHHHHHHHHH
Confidence 35567889999999999953
No 71
>PF15639 Tox-MPTase3: Metallopeptidase toxin 3
Probab=28.79 E-value=24 Score=33.03 Aligned_cols=77 Identities=18% Similarity=0.204 Sum_probs=36.8
Q ss_pred cccCcchhhHhhhcC-eeeceecccCCCCCC---CCC---CCccccccc--CCccEEEEEeec-CCCCCcc------chH
Q 010667 4 LAADPGIIAIMNKHR-WRVGIMTELAPVGYV---GVS---PKCVLGFNK--NHGEEISLRLRT-DDLKGFR------KYE 67 (504)
Q Consensus 4 L~~~a~i~pIM~~~~-w~V~~L~Ef~P~~~~---~~s---~~~lLGlN~--N~G~~I~LRLR~-~d~~~Fl------p~~ 67 (504)
|...|.|+-+|+|.+ .+-.-|.|=+-+++- .|. ..+-.|+-. -+|++|.|+--. .....=. -+=
T Consensus 19 ~~~~p~iv~A~~K~~~l~~~~l~~~l~WGkGP~IkI~~~~~~~A~G~t~p~~~~~~i~I~~~lV~~~Ek~~~~~r~~~~~ 98 (135)
T PF15639_consen 19 VLKVPAIVKAYKKWGGLNKAKLQEALTWGKGPYIKITPLVGGTAYGYTTPDDGSQVIRIDGDLVNMFEKGPGERRAGNVY 98 (135)
T ss_pred hhcChHHHHHHHHHcccchhhhhhccccCCCCEEEEeecccceeeeeecCCCCCcEEEeeHHHhhhhhcCCcccccceEE
Confidence 355677777777732 223334443333320 010 012233322 367888655433 1110000 112
Q ss_pred HHHHHHHHHHhhh
Q 010667 68 SIKKTLLHELAHM 80 (504)
Q Consensus 68 ~V~~tlLHELaH~ 80 (504)
.|-.|+||||.|-
T Consensus 99 ~v~~TlLHEliHw 111 (135)
T PF15639_consen 99 LVGSTLLHELIHW 111 (135)
T ss_pred EeehHHHHHHHHh
Confidence 4678999999994
No 72
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=28.17 E-value=41 Score=30.74 Aligned_cols=22 Identities=32% Similarity=0.472 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHhhhhcCccchh
Q 010667 67 ESIKKTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 67 ~~V~~tlLHELaH~vhg~Hd~~ 88 (504)
..+..|++|||-|..-..|...
T Consensus 102 ~~~~~~~~HEiGHaLGL~H~~~ 123 (156)
T cd04279 102 ENLQAIALHELGHALGLWHHSD 123 (156)
T ss_pred hHHHHHHHHHhhhhhcCCCCCC
Confidence 5789999999999998888755
No 73
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=26.71 E-value=46 Score=32.89 Aligned_cols=23 Identities=35% Similarity=0.488 Sum_probs=16.5
Q ss_pred hHHHHHHHHHHHhhhhcCccchh
Q 010667 66 YESIKKTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 66 ~~~V~~tlLHELaH~vhg~Hd~~ 88 (504)
.+.+..+++|||.|+.-..|=..
T Consensus 142 ~~R~~Kea~HElGH~~GL~HC~~ 164 (194)
T PF07998_consen 142 LERVCKEAVHELGHLFGLDHCEN 164 (194)
T ss_dssp HHHHHHHHHHHHHHHTT----SS
T ss_pred HHHHHHHHHHHHHHHcCCcCCCC
Confidence 57889999999999998888544
No 74
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=26.04 E-value=48 Score=32.14 Aligned_cols=24 Identities=38% Similarity=0.444 Sum_probs=19.6
Q ss_pred chHHHHHHHHHHHhhhhcCccchh
Q 010667 65 KYESIKKTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 65 p~~~V~~tlLHELaH~vhg~Hd~~ 88 (504)
-.+.+.++++|||.|+.-..|-..
T Consensus 121 ~~~R~~k~~~HElGH~lGL~HC~~ 144 (179)
T PRK13267 121 FEERVRKEVTHELGHTLGLEHCDN 144 (179)
T ss_pred HHHHHHHHHHHHHHHHcCCccCCC
Confidence 346788889999999999999543
No 75
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=25.72 E-value=39 Score=39.46 Aligned_cols=22 Identities=32% Similarity=0.339 Sum_probs=20.2
Q ss_pred hHHHHHHHHHHHhhhhcCccch
Q 010667 66 YESIKKTLLHELAHMVYSEHDA 87 (504)
Q Consensus 66 ~~~V~~tlLHELaH~vhg~Hd~ 87 (504)
+..+..||.|||+|++-+.||.
T Consensus 320 ~~~~a~v~AhelgH~lGm~hD~ 341 (716)
T KOG3607|consen 320 LLAFAVVLAHELGHNLGMIHDE 341 (716)
T ss_pred chhHHHHHHHHHHhhcCccccc
Confidence 5667899999999999999998
No 76
>PRK09672 phage exclusion protein Lit; Provisional
Probab=24.08 E-value=69 Score=33.85 Aligned_cols=36 Identities=25% Similarity=0.262 Sum_probs=22.1
Q ss_pred HHHHHHHHHhhhhcCccchhHHHHHHHHHHHHHHhc
Q 010667 69 IKKTLLHELAHMVYSEHDANFYGLDKQLNQEAVALD 104 (504)
Q Consensus 69 V~~tlLHELaH~vhg~Hd~~F~~L~~~L~~E~~~l~ 104 (504)
+-.+++||++|++++.=.-.=.+.-.+.+.||+..-
T Consensus 165 ~a~i~~HEiaHv~~~h~~~~~~e~s~~eE~eaDs~A 200 (305)
T PRK09672 165 LAWILLHEIAHVEFQHSSLESNEDSIQEEKEADSYA 200 (305)
T ss_pred HHHHHHHHHHHHHhccccccCchHHHHHHHHHHHHH
Confidence 346789999999987433233334445555665543
No 77
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=22.78 E-value=48 Score=33.30 Aligned_cols=21 Identities=33% Similarity=0.479 Sum_probs=16.5
Q ss_pred hHHHHHHHHHHHhhhhcCccc
Q 010667 66 YESIKKTLLHELAHMVYSEHD 86 (504)
Q Consensus 66 ~~~V~~tlLHELaH~vhg~Hd 86 (504)
...++.||+||+.|....-.+
T Consensus 89 ~~~~~~~l~HE~GHAlI~~~~ 109 (220)
T PF14247_consen 89 IGNVLFTLYHELGHALIDDLD 109 (220)
T ss_pred HHHHHHHHHHHHHHHHHHHhc
Confidence 345789999999998876554
No 78
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=22.42 E-value=54 Score=30.89 Aligned_cols=38 Identities=21% Similarity=0.158 Sum_probs=24.3
Q ss_pred cEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCccch
Q 010667 49 EEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSEHDA 87 (504)
Q Consensus 49 ~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~Hd~ 87 (504)
-.|.|........ ..+=.....|++|||-|..-..|.-
T Consensus 94 g~i~~~~~~~~~~-~~~g~~~~~t~~HEiGHaLGL~H~~ 131 (186)
T cd04277 94 GDIWFNSSYDTNS-DSPGSYGYQTIIHEIGHALGLEHPG 131 (186)
T ss_pred ceeEEecCccccc-CCCChhhHHHHHHHHHHHhcCCCCC
Confidence 3466655442211 1233557899999999999887764
No 79
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=22.20 E-value=49 Score=30.33 Aligned_cols=23 Identities=17% Similarity=0.193 Sum_probs=19.6
Q ss_pred chHHHHHHHHHHHhhhhcCccch
Q 010667 65 KYESIKKTLLHELAHMVYSEHDA 87 (504)
Q Consensus 65 p~~~V~~tlLHELaH~vhg~Hd~ 87 (504)
+...+..|++|||-|..-..|..
T Consensus 103 ~~~~~~~~~~HEiGHaLGL~H~~ 125 (157)
T cd04278 103 GGTDLFSVAAHEIGHALGLGHSS 125 (157)
T ss_pred ccchHHHHHHHHhccccccCCCC
Confidence 34679999999999999888874
No 80
>KOG1047 consensus Bifunctional leukotriene A4 hydrolase/aminopeptidase LTA4H [Lipid transport and metabolism; Posttranslational modification, protein turnover, chaperones; Defense mechanisms; Amino acid transport and metabolism]
Probab=21.84 E-value=43 Score=38.16 Aligned_cols=18 Identities=28% Similarity=0.366 Sum_probs=15.3
Q ss_pred HHHHHHHHHHhhhhcCcc
Q 010667 68 SIKKTLLHELAHMVYSEH 85 (504)
Q Consensus 68 ~V~~tlLHELaH~vhg~H 85 (504)
....+++||+||-+.||=
T Consensus 287 sl~~vIaHEIAHSWtGNl 304 (613)
T KOG1047|consen 287 SLVDVIAHEIAHSWTGNL 304 (613)
T ss_pred chhhHHHHHhhhhhcccc
Confidence 347899999999999973
No 81
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=20.41 E-value=56 Score=31.88 Aligned_cols=19 Identities=32% Similarity=0.459 Sum_probs=17.4
Q ss_pred HHHHHHHHhhhhcCccchh
Q 010667 70 KKTLLHELAHMVYSEHDAN 88 (504)
Q Consensus 70 ~~tlLHELaH~vhg~Hd~~ 88 (504)
..|+.|||-|+.-..|+..
T Consensus 138 ~~~~aHEiGH~lGl~H~~~ 156 (206)
T PF13583_consen 138 YQTFAHEIGHNLGLRHDFD 156 (206)
T ss_pred chHHHHHHHHHhcCCCCcc
Confidence 3789999999999999987
Done!