Query psy14374
Match_columns 180
No_of_seqs 181 out of 1857
Neff 7.7
Searched_HMMs 46136
Date Fri Aug 16 18:13:29 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14374.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14374hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF01434 Peptidase_M41: Peptid 100.0 1.2E-29 2.6E-34 202.8 4.4 104 1-111 29-132 (213)
2 KOG0734|consensus 99.9 9.8E-29 2.1E-33 215.3 4.1 136 1-145 558-693 (752)
3 PRK10733 hflB ATP-dependent me 99.9 2.3E-25 5.1E-30 202.8 12.7 104 1-111 409-512 (644)
4 CHL00176 ftsH cell division pr 99.9 6.4E-26 1.4E-30 205.6 8.7 103 1-110 439-541 (638)
5 KOG0734|consensus 99.9 1.2E-25 2.6E-30 196.1 8.8 119 53-180 526-655 (752)
6 PF01434 Peptidase_M41: Peptid 99.9 1.7E-25 3.8E-30 178.8 9.0 115 57-180 1-126 (213)
7 COG0465 HflB ATP-dependent Zn 99.9 5E-25 1.1E-29 196.5 7.8 105 1-113 407-511 (596)
8 COG0465 HflB ATP-dependent Zn 99.9 1.6E-23 3.4E-28 186.9 5.0 124 49-180 371-503 (596)
9 CHL00176 ftsH cell division pr 99.9 9.8E-23 2.1E-27 184.9 8.5 121 52-180 407-536 (638)
10 PRK10733 hflB ATP-dependent me 99.9 1.8E-22 4E-27 183.9 9.3 122 52-180 376-506 (644)
11 TIGR01241 FtsH_fam ATP-depende 99.9 3.9E-22 8.4E-27 177.0 8.9 102 1-111 312-413 (495)
12 KOG0731|consensus 99.9 7.8E-22 1.7E-26 179.2 8.3 103 1-113 570-672 (774)
13 KOG0731|consensus 99.8 7.8E-22 1.7E-26 179.2 6.7 121 50-180 535-664 (774)
14 TIGR01241 FtsH_fam ATP-depende 99.8 6.9E-19 1.5E-23 156.2 8.2 115 54-179 281-406 (495)
15 CHL00206 ycf2 Ycf2; Provisiona 99.0 5.1E-10 1.1E-14 110.1 7.3 72 2-82 1892-1969(2281)
16 CHL00206 ycf2 Ycf2; Provisiona 98.4 2.7E-07 5.8E-12 91.5 5.8 93 52-156 1861-1968(2281)
17 cd06163 S2P-M50_PDZ_RseP-like 96.6 0.0024 5.1E-08 50.0 4.0 64 3-78 12-104 (182)
18 PF13398 Peptidase_M50B: Pepti 96.6 0.0026 5.6E-08 50.4 4.2 36 3-49 25-60 (200)
19 cd06164 S2P-M50_SpoIVFB_CBS Sp 96.4 0.007 1.5E-07 48.9 5.3 58 3-79 56-113 (227)
20 TIGR00054 RIP metalloprotease 96.2 0.0062 1.3E-07 53.5 4.1 68 3-75 17-107 (420)
21 cd06161 S2P-M50_SpoIVFB SpoIVF 96.0 0.0097 2.1E-07 47.4 4.3 59 3-80 41-99 (208)
22 cd05709 S2P-M50 Site-2 proteas 95.7 0.0048 1E-07 47.7 1.5 66 3-79 11-90 (180)
23 PRK10779 zinc metallopeptidase 95.7 0.014 3.1E-07 51.6 4.5 69 2-75 17-107 (449)
24 cd06160 S2P-M50_like_2 Unchara 95.0 0.037 8.1E-07 43.3 4.3 60 3-78 44-106 (183)
25 cd06162 S2P-M50_PDZ_SREBP Ster 94.6 0.059 1.3E-06 44.9 4.8 65 3-80 138-203 (277)
26 PF02163 Peptidase_M50: Peptid 93.9 0.044 9.6E-07 42.5 2.6 17 3-19 10-26 (192)
27 cd06158 S2P-M50_like_1 Unchara 93.0 0.074 1.6E-06 41.5 2.4 69 3-78 12-95 (181)
28 cd06159 S2P-M50_PDZ_Arch Uncha 91.5 0.051 1.1E-06 45.0 -0.2 61 3-79 121-185 (263)
29 PF04298 Zn_peptidase_2: Putat 86.0 0.3 6.5E-06 39.4 0.7 12 3-14 92-103 (222)
30 cd04279 ZnMc_MMP_like_1 Zinc-d 85.6 0.31 6.7E-06 36.6 0.5 13 2-14 106-118 (156)
31 cd04268 ZnMc_MMP_like Zinc-dep 82.6 0.53 1.1E-05 35.3 0.7 12 2-13 96-107 (165)
32 smart00235 ZnMc Zinc-dependent 79.2 0.92 2E-05 33.2 1.0 12 3-14 89-100 (140)
33 PF14247 DUF4344: Domain of un 78.5 1.1 2.5E-05 36.1 1.4 14 3-16 95-108 (220)
34 cd04277 ZnMc_serralysin_like Z 76.2 1.1 2.5E-05 34.5 0.7 12 2-13 115-126 (186)
35 cd04327 ZnMc_MMP_like_3 Zinc-d 70.6 1.9 4.1E-05 33.8 0.8 12 3-14 95-106 (198)
36 cd00203 ZnMc Zinc-dependent me 66.5 2.6 5.7E-05 31.4 0.8 13 2-14 98-110 (167)
37 PF02031 Peptidase_M7: Strepto 63.0 3.6 7.8E-05 30.4 0.9 12 2-13 79-90 (132)
38 KOG2921|consensus 62.0 4.1 8.9E-05 35.8 1.2 12 4-15 135-146 (484)
39 COG2738 Predicted Zn-dependent 60.8 3.6 7.8E-05 32.7 0.6 11 3-13 95-105 (226)
40 cd04280 ZnMc_astacin_like Zinc 54.2 6.2 0.00013 30.5 1.0 13 2-14 76-88 (180)
41 PF05572 Peptidase_M43: Pregna 54.1 5 0.00011 30.4 0.4 11 3-13 72-82 (154)
42 cd04283 ZnMc_hatching_enzyme Z 50.6 7.6 0.00016 30.3 0.9 13 3-15 80-92 (182)
43 cd04281 ZnMc_BMP1_TLD Zinc-dep 48.0 8.8 0.00019 30.4 0.9 12 3-14 90-101 (200)
44 PF01400 Astacin: Astacin (Pep 45.5 10 0.00022 29.6 0.9 11 3-13 82-92 (191)
45 cd04275 ZnMc_pappalysin_like Z 42.7 4.8 0.0001 32.5 -1.3 12 2-13 139-150 (225)
46 KOG3714|consensus 42.5 11 0.00025 33.0 0.9 13 3-15 162-174 (411)
47 cd04276 ZnMc_MMP_like_2 Zinc-d 41.0 8.8 0.00019 30.4 -0.1 12 2-13 118-129 (197)
48 KOG1565|consensus 40.8 7.7 0.00017 34.8 -0.5 12 2-13 213-224 (469)
49 cd04282 ZnMc_meprin Zinc-depen 40.3 14 0.0003 30.0 1.0 12 3-14 123-134 (230)
50 PF02074 Peptidase_M32: Carbox 38.2 16 0.00035 33.1 1.1 15 4-18 263-277 (494)
51 PF06114 DUF955: Domain of unk 36.5 17 0.00037 24.8 0.9 13 3-15 45-57 (122)
52 PF13583 Reprolysin_4: Metallo 36.1 16 0.00035 28.8 0.7 13 3-15 140-152 (206)
53 COG2317 Zn-dependent carboxype 35.6 16 0.00034 32.8 0.7 15 4-18 263-277 (497)
54 COG2856 Predicted Zn peptidase 35.0 18 0.0004 29.0 0.9 13 3-15 75-87 (213)
55 cd06258 Peptidase_M3_like The 33.2 19 0.0004 30.7 0.7 16 3-18 157-172 (365)
56 PF13574 Reprolysin_2: Metallo 33.0 21 0.00045 27.3 0.9 13 3-15 114-126 (173)
57 PF12388 Peptidase_M57: Dual-a 32.8 17 0.00037 29.2 0.4 11 3-13 136-146 (211)
58 cd06460 M32_Taq Peptidase fami 32.7 20 0.00044 31.4 0.9 16 3-18 162-177 (396)
59 PF11350 DUF3152: Protein of u 25.4 27 0.00059 27.8 0.3 9 5-13 144-152 (203)
60 cd06459 M3B_Oligoendopeptidase 25.3 46 0.001 28.6 1.8 18 3-20 225-242 (427)
61 TIGR02289 M3_not_pepF oligoend 23.6 53 0.0011 29.9 1.9 16 4-19 341-356 (549)
62 cd06455 M3A_TOP Peptidase M3 T 21.8 44 0.00095 29.8 1.0 15 4-18 267-281 (472)
63 PF13485 Peptidase_MA_2: Pepti 20.6 65 0.0014 22.0 1.5 16 3-18 28-43 (128)
64 PF01432 Peptidase_M3: Peptida 20.5 70 0.0015 28.1 2.0 16 4-19 246-261 (458)
65 PF04315 DUF462: Protein of un 20.4 52 0.0011 25.3 1.0 41 3-63 44-84 (164)
No 1
>PF01434 Peptidase_M41: Peptidase family M41 This is family M41 in the peptidase classification. ; InterPro: IPR000642 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The peptidase M41 family belong to a larger family of zinc metalloproteases. This family includes the cell division protein FtsH, and the yeast mitochondrial respiratory chain complexes assembly protein, which is a putative ATP-dependent protease required for assembly of the mitochondrial respiratory chain and ATPase complexes. FtsH is an integral membrane protein, which seems to act as an ATP-dependent zinc metallopeptidase that binds one zinc ion.; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0006508 proteolysis; PDB: 4EIW_C 2DHR_E 1IY1_A 1IY2_A 1IY0_A 1IXZ_A 2CE7_F 2CEA_F 3KDS_E 2QZ4_A ....
Probab=99.96 E-value=1.2e-29 Score=202.83 Aligned_cols=104 Identities=47% Similarity=0.738 Sum_probs=92.1
Q ss_pred CeehhhHHHHHHHHhHhCCCCCCcccCCCceeEEEeeCCCCceEEEeecCccccccCHHHHHHHHHHHhchhhhhhhhhC
Q psy14374 1 MITAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVTIIPRGVSLGHTAYIPEKERYHVTKSQLMAMMDTMMGGRAAEELVFG 80 (180)
Q Consensus 1 ~~~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vsi~~~g~~~G~~~~~~~~~~~~~t~~~l~~~i~v~LaGraaE~v~~g 80 (180)
|++|||||||||+++++++..|+ .++||+|||+++||+.+.|.+++...|+++|+++|+++||||+||+++||
T Consensus 29 ~~~A~HEAGhAvva~~l~~~~~v-------~~vsi~prg~~~G~~~~~~~~~~~~~t~~~l~~~i~v~LaGraAEe~~~g 101 (213)
T PF01434_consen 29 RRIAYHEAGHAVVAYLLPPADPV-------SKVSIVPRGSALGFTQFTPDEDRYIRTRSYLEDRICVLLAGRAAEELFFG 101 (213)
T ss_dssp HHHHHHHHHHHHHHHHSSS---E-------EEEESSTTCCCCHCCEECHHTT-SS-BHHHHHHHHHHHHHHHHHHHHHHS
T ss_pred HHHHHHHHHHHHHHHHhcccccE-------EEEEEecCCCcceeEEeccchhcccccHHHHHhhHHHHHHHHHHHHhhcC
Confidence 46899999999999999987676 99999999999999999999887789999999999999999999999999
Q ss_pred CcccccCcchHHHHHhhhhccceeeeeeeee
Q psy14374 81 AEKVTSGASSDLVQATSIATHMVREFVVTII 111 (180)
Q Consensus 81 ~~~~s~g~~~dl~~A~~~A~~~v~~~g~si~ 111 (180)
.++.++|+++|+.+|+.+|.+||..|||+..
T Consensus 102 ~~~~stGa~~DL~~At~iA~~mv~~~Gm~~~ 132 (213)
T PF01434_consen 102 EDNVSTGASSDLQQATEIARKMVASYGMGDS 132 (213)
T ss_dssp CCS-BGGGHHHHHHHHHHHHHHHHTST-TTT
T ss_pred cceecccchhHHHHHHHHHHHHHHHhCCCCC
Confidence 8889999999999999999999999999863
No 2
>KOG0734|consensus
Probab=99.95 E-value=9.8e-29 Score=215.25 Aligned_cols=136 Identities=51% Similarity=0.760 Sum_probs=117.2
Q ss_pred CeehhhHHHHHHHHhHhCCCCCCcccCCCceeEEEeeCCCCceEEEeecCccccccCHHHHHHHHHHHhchhhhhhhhhC
Q psy14374 1 MITAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVTIIPRGVSLGHTAYIPEKERYHVTKSQLMAMMDTMMGGRAAEELVFG 80 (180)
Q Consensus 1 ~~~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vsi~~~g~~~G~~~~~~~~~~~~~t~~~l~~~i~v~LaGraaE~v~~g 80 (180)
+++||||+|||+||...++..|+ +|+||.|||.++|+|.+.|+.+++.+|+.+++..+++.||||+||+++||
T Consensus 558 ~~TAyHE~GHAivA~yTk~A~Pl-------hKaTImPRG~sLG~t~~LPe~D~~~~Tk~q~LA~lDV~MGGRvAEELIfG 630 (752)
T KOG0734|consen 558 KITAYHEGGHAIVALYTKGAMPL-------HKATIMPRGPSLGHTSQLPEKDRYSITKAQLLARLDVCMGGRVAEELIFG 630 (752)
T ss_pred hhhhhhccCceEEEeecCCCccc-------cceeeccCCccccceeecCccchhhHHHHHHHHHHHHhhcchHHHHHhcc
Confidence 57999999999999999999887 99999999999999999999999999999999999999999999999999
Q ss_pred CcccccCcchHHHHHhhhhccceeeeeeeeecCCccCCcccccCccccccccHHHHHHHHHHHhh
Q psy14374 81 AEKVTSGASSDLVQATSIATHMVREFVVTIIPRGVSLGHTAYIPEKERYHVTKSQLMAMMDTMMG 145 (180)
Q Consensus 81 ~~~~s~g~~~dl~~A~~~A~~~v~~~g~si~prg~~lG~~~~~p~e~~~~~tk~~l~~~i~v~L~ 145 (180)
.+++++|+++|+.+||.+|.+||..||||...+...+-..+ +..+....|++-+.++|+.+|-
T Consensus 631 ~D~iTsGAssDl~qAT~lA~~MVt~fGMSd~vG~v~~~~~~--~~~s~~~~t~~lidaEi~~lL~ 693 (752)
T KOG0734|consen 631 TDKITSGASSDLDQATKLARRMVTKFGMSDKVGPVTLSAED--NSSSLSPRTQELIDAEIKRLLR 693 (752)
T ss_pred CCcccccccchHHHHHHHHHHHHHHcCccccccceeeeccC--CCCCCCchhHHHHHHHHHHHHH
Confidence 99999999999999999999999999999874433322211 1122334566677777777775
No 3
>PRK10733 hflB ATP-dependent metalloprotease; Reviewed
Probab=99.93 E-value=2.3e-25 Score=202.79 Aligned_cols=104 Identities=41% Similarity=0.724 Sum_probs=98.1
Q ss_pred CeehhhHHHHHHHHhHhCCCCCCcccCCCceeEEEeeCCCCceEEEeecCccccccCHHHHHHHHHHHhchhhhhhhhhC
Q psy14374 1 MITAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVTIIPRGVSLGHTAYIPEKERYHVTKSQLMAMMDTMMGGRAAEELVFG 80 (180)
Q Consensus 1 ~~~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vsi~~~g~~~G~~~~~~~~~~~~~t~~~l~~~i~v~LaGraaE~v~~g 80 (180)
+.+||||+||||++++++..+|+ .+|||+|||+++||+.+.|.+++...|+++|+++|+++||||+||+++||
T Consensus 409 ~~~a~he~gha~~~~~~~~~~~~-------~~v~i~prg~~~g~~~~~~~~~~~~~~~~~l~~~i~~~lgGraAE~~~~g 481 (644)
T PRK10733 409 ESTAYHEAGHAIIGRLVPEHDPV-------HKVTIIPRGRALGVTFFLPEGDAISASRQKLESQISTLYGGRLAEEIIYG 481 (644)
T ss_pred HHHHHHHHHHHHHHHHccCCCce-------eEEEEeccCCCcceeEECCCcccccccHHHHHHHHHHHHhhHHHHHHHhC
Confidence 35799999999999999988887 99999999999999999998888788999999999999999999999999
Q ss_pred CcccccCcchHHHHHhhhhccceeeeeeeee
Q psy14374 81 AEKVTSGASSDLVQATSIATHMVREFVVTII 111 (180)
Q Consensus 81 ~~~~s~g~~~dl~~A~~~A~~~v~~~g~si~ 111 (180)
..+.++|+++|+.+||.+|..||..|||+..
T Consensus 482 ~~~~ttGa~~Dl~~AT~lA~~mv~~~Gms~~ 512 (644)
T PRK10733 482 PEHVSTGASNDIKVATNLARNMVTQWGFSEK 512 (644)
T ss_pred CCCCCCCcHHHHHHHHHHHHHHHHHhCCCcc
Confidence 8788999999999999999999999999875
No 4
>CHL00176 ftsH cell division protein; Validated
Probab=99.93 E-value=6.4e-26 Score=205.63 Aligned_cols=103 Identities=46% Similarity=0.730 Sum_probs=97.9
Q ss_pred CeehhhHHHHHHHHhHhCCCCCCcccCCCceeEEEeeCCCCceEEEeecCccccccCHHHHHHHHHHHhchhhhhhhhhC
Q psy14374 1 MITAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVTIIPRGVSLGHTAYIPEKERYHVTKSQLMAMMDTMMGGRAAEELVFG 80 (180)
Q Consensus 1 ~~~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vsi~~~g~~~G~~~~~~~~~~~~~t~~~l~~~i~v~LaGraaE~v~~g 80 (180)
+++|||||||||++++++..+|+ .+|||+|||+++||+.+.|.+++..+|+.+++++|+++||||+||+++||
T Consensus 439 ~~vA~hEaGhA~v~~~l~~~~~v-------~kvtI~prg~~~G~~~~~p~~~~~~~t~~~l~~~i~~~LgGraAE~~~fg 511 (638)
T CHL00176 439 RLIAYHEVGHAIVGTLLPNHDPV-------QKVTLIPRGQAKGLTWFTPEEDQSLVSRSQILARIVGALGGRAAEEVVFG 511 (638)
T ss_pred HHHHHHhhhhHHHHhhccCCCce-------EEEEEeecCCCCCceEecCCcccccccHHHHHHHHHHHhhhHHHHHHhcC
Confidence 46899999999999999988777 99999999999999999998888999999999999999999999999999
Q ss_pred CcccccCcchHHHHHhhhhccceeeeeeee
Q psy14374 81 AEKVTSGASSDLVQATSIATHMVREFVVTI 110 (180)
Q Consensus 81 ~~~~s~g~~~dl~~A~~~A~~~v~~~g~si 110 (180)
+...++|+++|+++||.+|.+||..|||+.
T Consensus 512 ~~~~~~Ga~~Dl~~AT~iA~~mv~~~Gm~~ 541 (638)
T CHL00176 512 STEVTTGASNDLQQVTNLARQMVTRFGMSS 541 (638)
T ss_pred CCCcCCCchhHHHHHHHHHHHHHHHhCCCc
Confidence 777899999999999999999999999984
No 5
>KOG0734|consensus
Probab=99.92 E-value=1.2e-25 Score=196.08 Aligned_cols=119 Identities=45% Similarity=0.671 Sum_probs=100.7
Q ss_pred ccccCHHHHHHHHHHHhchhhhhhhhhCCcccccCcchHHHH--Hhhhhccceeee---------eeeeecCCccCCccc
Q psy14374 53 RYHVTKSQLMAMMDTMMGGRAAEELVFGAEKVTSGASSDLVQ--ATSIATHMVREF---------VVTIIPRGVSLGHTA 121 (180)
Q Consensus 53 ~~~~t~~~l~~~i~v~LaGraaE~v~~g~~~~s~g~~~dl~~--A~~~A~~~v~~~---------g~si~prg~~lG~~~ 121 (180)
...+|.++|+- +.+++++|++.++...+++-++ |+|++||++.++ +.+|.|||.++|.+.
T Consensus 526 a~~VtM~~LE~---------akDrIlMG~ERks~~i~~eak~~TAyHE~GHAivA~yTk~A~PlhKaTImPRG~sLG~t~ 596 (752)
T KOG0734|consen 526 AEMVTMKHLEF---------AKDRILMGPERKSMVIDEEAKKITAYHEGGHAIVALYTKGAMPLHKATIMPRGPSLGHTS 596 (752)
T ss_pred cccccHHHHhh---------hhhheeecccccccccChhhhhhhhhhccCceEEEeecCCCccccceeeccCCcccccee
Confidence 34445555555 5566666666555554444444 889999999884 899999999999999
Q ss_pred ccCccccccccHHHHHHHHHHHhhhHHHHHHHhCCCccccChhhHHHHHHHHHHHHhhC
Q psy14374 122 YIPEKERYHVTKSQLMAMMDTMMGGRAAEELVFGAEKVTSGASSDLVQATSIATHMVRE 180 (180)
Q Consensus 122 ~~p~e~~~~~tk~~l~~~i~v~L~Gr~Ae~~~~g~~~~~~ga~~Dl~~at~~a~~mv~~ 180 (180)
++|+.|++..||.||++++++.||||+|||++||++++||||++||+|||++|+.||++
T Consensus 597 ~LPe~D~~~~Tk~q~LA~lDV~MGGRvAEELIfG~D~iTsGAssDl~qAT~lA~~MVt~ 655 (752)
T KOG0734|consen 597 QLPEKDRYSITKAQLLARLDVCMGGRVAEELIFGTDKITSGASSDLDQATKLARRMVTK 655 (752)
T ss_pred ecCccchhhHHHHHHHHHHHHhhcchHHHHHhccCCcccccccchHHHHHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999974
No 6
>PF01434 Peptidase_M41: Peptidase family M41 This is family M41 in the peptidase classification. ; InterPro: IPR000642 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The peptidase M41 family belong to a larger family of zinc metalloproteases. This family includes the cell division protein FtsH, and the yeast mitochondrial respiratory chain complexes assembly protein, which is a putative ATP-dependent protease required for assembly of the mitochondrial respiratory chain and ATPase complexes. FtsH is an integral membrane protein, which seems to act as an ATP-dependent zinc metallopeptidase that binds one zinc ion.; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0006508 proteolysis; PDB: 4EIW_C 2DHR_E 1IY1_A 1IY2_A 1IY0_A 1IXZ_A 2CE7_F 2CEA_F 3KDS_E 2QZ4_A ....
Probab=99.92 E-value=1.7e-25 Score=178.78 Aligned_cols=115 Identities=40% Similarity=0.579 Sum_probs=92.4
Q ss_pred CHHHHHHHHHHHhchhhhhhhhhCCcccccCcchHHH--HHhhhhccceeee---------eeeeecCCccCCcccccCc
Q psy14374 57 TKSQLMAMMDTMMGGRAAEELVFGAEKVTSGASSDLV--QATSIATHMVREF---------VVTIIPRGVSLGHTAYIPE 125 (180)
Q Consensus 57 t~~~l~~~i~v~LaGraaE~v~~g~~~~s~g~~~dl~--~A~~~A~~~v~~~---------g~si~prg~~lG~~~~~p~ 125 (180)
|+++|++++++++.|.. +.+...+++.+ -|+|+|+|+|.+| ++|++||+.++|++.+.|.
T Consensus 1 ~~~d~~~a~drv~~G~~---------~~~~~~~~~~~~~~A~HEAGhAvva~~l~~~~~v~~vsi~prg~~~G~~~~~~~ 71 (213)
T PF01434_consen 1 TMEDIEEAIDRVLMGPE---------KKSRKLSEEEKRRIAYHEAGHAVVAYLLPPADPVSKVSIVPRGSALGFTQFTPD 71 (213)
T ss_dssp -HHHHHHHHHHHHCCSC---------CTTS---HHHHHHHHHHHHHHHHHHHHSSS---EEEEESSTTCCCCHCCEECHH
T ss_pred CHHHHHHHHHHHhcCcC---------cCCCCCCHHHHHHHHHHHHHHHHHHHHhcccccEEEEEEecCCCcceeEEeccc
Confidence 67899997776665543 32222233322 3899999998775 7899999999999999998
Q ss_pred cccccccHHHHHHHHHHHhhhHHHHHHHhCCCccccChhhHHHHHHHHHHHHhhC
Q psy14374 126 KERYHVTKSQLMAMMDTMMGGRAAEELVFGAEKVTSGASSDLVQATSIATHMVRE 180 (180)
Q Consensus 126 e~~~~~tk~~l~~~i~v~L~Gr~Ae~~~~g~~~~~~ga~~Dl~~at~~a~~mv~~ 180 (180)
++++..||++|+++|+++||||+||+++||.+++|+|+++||++||++|++||.+
T Consensus 72 ~~~~~~t~~~l~~~i~v~LaGraAEe~~~g~~~~stGa~~DL~~At~iA~~mv~~ 126 (213)
T PF01434_consen 72 EDRYIRTRSYLEDRICVLLAGRAAEELFFGEDNVSTGASSDLQQATEIARKMVAS 126 (213)
T ss_dssp TT-SS-BHHHHHHHHHHHHHHHHHHHHHHSCCS-BGGGHHHHHHHHHHHHHHHHT
T ss_pred hhcccccHHHHHhhHHHHHHHHHHHHhhcCcceecccchhHHHHHHHHHHHHHHH
Confidence 8877899999999999999999999999998899999999999999999999964
No 7
>COG0465 HflB ATP-dependent Zn proteases [Posttranslational modification, protein turnover, chaperones]
Probab=99.91 E-value=5e-25 Score=196.45 Aligned_cols=105 Identities=50% Similarity=0.852 Sum_probs=100.6
Q ss_pred CeehhhHHHHHHHHhHhCCCCCCcccCCCceeEEEeeCCCCceEEEeecCccccccCHHHHHHHHHHHhchhhhhhhhhC
Q psy14374 1 MITAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVTIIPRGVSLGHTAYIPEKERYHVTKSQLMAMMDTMMGGRAAEELVFG 80 (180)
Q Consensus 1 ~~~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vsi~~~g~~~G~~~~~~~~~~~~~t~~~l~~~i~v~LaGraaE~v~~g 80 (180)
+++||||+|||+++++++..+|+ +|+||+|||+++|||.+.|.+++..+|+++++++|++++|||+||+++||
T Consensus 407 ~~~AYhEaghalv~~~l~~~d~v-------~KvtIiPrG~alG~t~~~Pe~d~~l~sk~~l~~~i~~~lgGRaAEel~~g 479 (596)
T COG0465 407 KITAYHEAGHALVGLLLPDADPV-------HKVTIIPRGRALGYTLFLPEEDKYLMSKEELLDRIDVLLGGRAAEELIFG 479 (596)
T ss_pred cchHHHHHHHHHHHHhCCCCccc-------ceeeeccCchhhcchhcCCccccccccHHHHHHHHHHHhCCcHhhhhhhc
Confidence 57999999999999999999887 99999999999999999999999999999999999999999999999999
Q ss_pred CcccccCcchHHHHHhhhhccceeeeeeeeecC
Q psy14374 81 AEKVTSGASSDLVQATSIATHMVREFVVTIIPR 113 (180)
Q Consensus 81 ~~~~s~g~~~dl~~A~~~A~~~v~~~g~si~pr 113 (180)
. ++++|+++|+.+||.+|..||..|||+...+
T Consensus 480 ~-e~ttGa~~D~~~at~~ar~mVt~~Gms~~lG 511 (596)
T COG0465 480 Y-EITTGASNDLEKATDLARAMVTEYGMSAKLG 511 (596)
T ss_pred c-cccccchhhHHHHHHHHHHhhhhcCcchhhC
Confidence 8 7999999999999999999999999997644
No 8
>COG0465 HflB ATP-dependent Zn proteases [Posttranslational modification, protein turnover, chaperones]
Probab=99.88 E-value=1.6e-23 Score=186.89 Aligned_cols=124 Identities=36% Similarity=0.624 Sum_probs=104.9
Q ss_pred cCccccccCHHHHHHHHHHHhchhhhhhhhhCCcccccCcchHHHHHhhhhccceee---------eeeeeecCCccCCc
Q psy14374 49 PEKERYHVTKSQLMAMMDTMMGGRAAEELVFGAEKVTSGASSDLVQATSIATHMVRE---------FVVTIIPRGVSLGH 119 (180)
Q Consensus 49 ~~~~~~~~t~~~l~~~i~v~LaGraaE~v~~g~~~~s~g~~~dl~~A~~~A~~~v~~---------~g~si~prg~~lG~ 119 (180)
.+.++..++..+|.++++++++|.+....++.+ .+....|+|+++|++.. +++|++|||.++||
T Consensus 371 ar~n~~~i~~~~i~ea~drv~~G~erks~vise-------~ek~~~AYhEaghalv~~~l~~~d~v~KvtIiPrG~alG~ 443 (596)
T COG0465 371 ARRNKKEITMRDIEEAIDRVIAGPERKSRVISE-------AEKKITAYHEAGHALVGLLLPDADPVHKVTIIPRGRALGY 443 (596)
T ss_pred HHhcCeeEeccchHHHHHHHhcCcCcCCcccCh-------hhhcchHHHHHHHHHHHHhCCCCcccceeeeccCchhhcc
Confidence 344667788889988877777776544433332 12223489999998766 49999999999999
Q ss_pred ccccCccccccccHHHHHHHHHHHhhhHHHHHHHhCCCccccChhhHHHHHHHHHHHHhhC
Q psy14374 120 TAYIPEKERYHVTKSQLMAMMDTMMGGRAAEELVFGAEKVTSGASSDLVQATSIATHMVRE 180 (180)
Q Consensus 120 ~~~~p~e~~~~~tk~~l~~~i~v~L~Gr~Ae~~~~g~~~~~~ga~~Dl~~at~~a~~mv~~ 180 (180)
+++.|++|++++|++++.++|+++||||+|||++||. ++||||+|||++||++|+.||++
T Consensus 444 t~~~Pe~d~~l~sk~~l~~~i~~~lgGRaAEel~~g~-e~ttGa~~D~~~at~~ar~mVt~ 503 (596)
T COG0465 444 TLFLPEEDKYLMSKEELLDRIDVLLGGRAAEELIFGY-EITTGASNDLEKATDLARAMVTE 503 (596)
T ss_pred hhcCCccccccccHHHHHHHHHHHhCCcHhhhhhhcc-cccccchhhHHHHHHHHHHhhhh
Confidence 9999999999999999999999999999999999998 89999999999999999999985
No 9
>CHL00176 ftsH cell division protein; Validated
Probab=99.88 E-value=9.8e-23 Score=184.93 Aligned_cols=121 Identities=35% Similarity=0.527 Sum_probs=99.1
Q ss_pred cccccCHHHHHHHHHHHhchhhhhhhhhCCcccccCcchHHHHHhhhhccceeee---------eeeeecCCccCCcccc
Q psy14374 52 ERYHVTKSQLMAMMDTMMGGRAAEELVFGAEKVTSGASSDLVQATSIATHMVREF---------VVTIIPRGVSLGHTAY 122 (180)
Q Consensus 52 ~~~~~t~~~l~~~i~v~LaGraaE~v~~g~~~~s~g~~~dl~~A~~~A~~~v~~~---------g~si~prg~~lG~~~~ 122 (180)
++..+|.++|+.++..+++|.....+. ........|+|+|||++..+ ++|++|||.++|++++
T Consensus 407 ~~~~It~~dl~~Ai~rv~~g~~~~~~~--------~~~~~~~vA~hEaGhA~v~~~l~~~~~v~kvtI~prg~~~G~~~~ 478 (638)
T CHL00176 407 KKATITMKEIDTAIDRVIAGLEGTPLE--------DSKNKRLIAYHEVGHAIVGTLLPNHDPVQKVTLIPRGQAKGLTWF 478 (638)
T ss_pred CCCCcCHHHHHHHHHHHHhhhccCccc--------cHHHHHHHHHHhhhhHHHHhhccCCCceEEEEEeecCCCCCceEe
Confidence 444577778887777766665322211 01122334999999998764 8999999999999999
Q ss_pred cCccccccccHHHHHHHHHHHhhhHHHHHHHhCCCccccChhhHHHHHHHHHHHHhhC
Q psy14374 123 IPEKERYHVTKSQLMAMMDTMMGGRAAEELVFGAEKVTSGASSDLVQATSIATHMVRE 180 (180)
Q Consensus 123 ~p~e~~~~~tk~~l~~~i~v~L~Gr~Ae~~~~g~~~~~~ga~~Dl~~at~~a~~mv~~ 180 (180)
.|+++++.+|+++|+++|+++|||||||+++||++++|+||+|||++||++|+.||++
T Consensus 479 ~p~~~~~~~t~~~l~~~i~~~LgGraAE~~~fg~~~~~~Ga~~Dl~~AT~iA~~mv~~ 536 (638)
T CHL00176 479 TPEEDQSLVSRSQILARIVGALGGRAAEEVVFGSTEVTTGASNDLQQVTNLARQMVTR 536 (638)
T ss_pred cCCcccccccHHHHHHHHHHHhhhHHHHHHhcCCCCcCCCchhHHHHHHHHHHHHHHH
Confidence 9988999999999999999999999999999997689999999999999999999973
No 10
>PRK10733 hflB ATP-dependent metalloprotease; Reviewed
Probab=99.87 E-value=1.8e-22 Score=183.88 Aligned_cols=122 Identities=33% Similarity=0.514 Sum_probs=97.2
Q ss_pred cccccCHHHHHHHHHHHhchhhhhhhhhCCcccccCcchHHHHHhhhhccceeee---------eeeeecCCccCCcccc
Q psy14374 52 ERYHVTKSQLMAMMDTMMGGRAAEELVFGAEKVTSGASSDLVQATSIATHMVREF---------VVTIIPRGVSLGHTAY 122 (180)
Q Consensus 52 ~~~~~t~~~l~~~i~v~LaGraaE~v~~g~~~~s~g~~~dl~~A~~~A~~~v~~~---------g~si~prg~~lG~~~~ 122 (180)
++..++..+|+++++.++.|.......+.+ ......|+|+|+|++..+ +++++|||.++|++++
T Consensus 376 ~~~~i~~~d~~~a~~~v~~g~~~~~~~~~~-------~~~~~~a~he~gha~~~~~~~~~~~~~~v~i~prg~~~g~~~~ 448 (644)
T PRK10733 376 NKRVVSMVEFEKAKDKIMMGAERRSMVMTE-------AQKESTAYHEAGHAIIGRLVPEHDPVHKVTIIPRGRALGVTFF 448 (644)
T ss_pred CCCcccHHHHHHHHHHHhcccccccccccH-------HHHHHHHHHHHHHHHHHHHccCCCceeEEEEeccCCCcceeEE
Confidence 445567777777666555443222211111 122344899999997653 7899999999999999
Q ss_pred cCccccccccHHHHHHHHHHHhhhHHHHHHHhCCCccccChhhHHHHHHHHHHHHhhC
Q psy14374 123 IPEKERYHVTKSQLMAMMDTMMGGRAAEELVFGAEKVTSGASSDLVQATSIATHMVRE 180 (180)
Q Consensus 123 ~p~e~~~~~tk~~l~~~i~v~L~Gr~Ae~~~~g~~~~~~ga~~Dl~~at~~a~~mv~~ 180 (180)
.|+++.+..||++|+++|+++|||||||+++||++++||||+|||++||+||++||++
T Consensus 449 ~~~~~~~~~~~~~l~~~i~~~lgGraAE~~~~g~~~~ttGa~~Dl~~AT~lA~~mv~~ 506 (644)
T PRK10733 449 LPEGDAISASRQKLESQISTLYGGRLAEEIIYGPEHVSTGASNDIKVATNLARNMVTQ 506 (644)
T ss_pred CCCcccccccHHHHHHHHHHHHhhHHHHHHHhCCCCCCCCcHHHHHHHHHHHHHHHHH
Confidence 9998888899999999999999999999999997789999999999999999999973
No 11
>TIGR01241 FtsH_fam ATP-dependent metalloprotease FtsH. HflB(FtsH) is a pleiotropic protein required for correct cell division in bacteria. It has ATP-dependent zinc metalloprotease activity. It was formerly designated cell division protein FtsH.
Probab=99.87 E-value=3.9e-22 Score=176.96 Aligned_cols=102 Identities=54% Similarity=0.887 Sum_probs=94.8
Q ss_pred CeehhhHHHHHHHHhHhCCCCCCcccCCCceeEEEeeCCCCceEEEeecCccccccCHHHHHHHHHHHhchhhhhhhhhC
Q psy14374 1 MITAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVTIIPRGVSLGHTAYIPEKERYHVTKSQLMAMMDTMMGGRAAEELVFG 80 (180)
Q Consensus 1 ~~~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vsi~~~g~~~G~~~~~~~~~~~~~t~~~l~~~i~v~LaGraaE~v~~g 80 (180)
|.+|||||||||++++++...|+ .++||.|||+++||+.+.|.++....|+++|++.|+++||||+||+++||
T Consensus 312 ~~~A~hEaGhAlv~~~l~~~~~v-------~~vsi~prg~~~G~~~~~~~~~~~~~t~~~l~~~i~v~LaGraAE~~~~G 384 (495)
T TIGR01241 312 KLVAYHEAGHALVGLLLKDADPV-------HKVTIIPRGQALGYTQFLPEEDKYLYTKSQLLAQIAVLLGGRAAEEIIFG 384 (495)
T ss_pred HHHHHHHHhHHHHHHhcCCCCce-------EEEEEeecCCccceEEecCccccccCCHHHHHHHHHHHhhHHHHHHHHhc
Confidence 35899999999999999877676 99999999999999999988777889999999999999999999999999
Q ss_pred CcccccCcchHHHHHhhhhccceeeeeeeee
Q psy14374 81 AEKVTSGASSDLVQATSIATHMVREFVVTII 111 (180)
Q Consensus 81 ~~~~s~g~~~dl~~A~~~A~~~v~~~g~si~ 111 (180)
+.++|+++|+.+|+.+|.+|+..|||+..
T Consensus 385 --~~s~Ga~~Dl~~At~lA~~mv~~~Gm~~~ 413 (495)
T TIGR01241 385 --EVTTGASNDIKQATNIARAMVTEWGMSDK 413 (495)
T ss_pred --CCCCCchHHHHHHHHHHHHHHHHhCCCcc
Confidence 68999999999999999999999999874
No 12
>KOG0731|consensus
Probab=99.86 E-value=7.8e-22 Score=179.23 Aligned_cols=103 Identities=46% Similarity=0.740 Sum_probs=96.2
Q ss_pred CeehhhHHHHHHHHhHhCCCCCCcccCCCceeEEEeeCCCCceEEEeecCccccccCHHHHHHHHHHHhchhhhhhhhhC
Q psy14374 1 MITAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVTIIPRGVSLGHTAYIPEKERYHVTKSQLMAMMDTMMGGRAAEELVFG 80 (180)
Q Consensus 1 ~~~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vsi~~~g~~~G~~~~~~~~~~~~~t~~~l~~~i~v~LaGraaE~v~~g 80 (180)
+.+||||||||+++|++++.+|+ .++||.| |+++||++|.|+++ ++.++++|.++|++.||||+||+++||
T Consensus 570 ~~~a~~eagha~~g~~l~~~dpl-------~kvsIiP-GqalG~a~~~P~~~-~l~sk~ql~~rm~m~LGGRaAEev~fg 640 (774)
T KOG0731|consen 570 KTVAYHEAGHAVVGWLLEHADPL-------LKVSIIP-GQALGYAQYLPTDD-YLLSKEQLFDRMVMALGGRAAEEVVFG 640 (774)
T ss_pred hhhhhhhccchhhhccccccCcc-------eeEEecc-CCccceEEECCccc-ccccHHHHHHHHHHHhCcchhhheecC
Confidence 46899999999999999998887 9999999 56999999999977 889999999999999999999999999
Q ss_pred CcccccCcchHHHHHhhhhccceeeeeeeeecC
Q psy14374 81 AEKVTSGASSDLVQATSIATHMVREFVVTIIPR 113 (180)
Q Consensus 81 ~~~~s~g~~~dl~~A~~~A~~~v~~~g~si~pr 113 (180)
.++|+|+++|+.++|.+|..+|..|||+...+
T Consensus 641 -~~iTtga~ddl~kvT~~A~~~V~~~Gms~kig 672 (774)
T KOG0731|consen 641 -SEITTGAQDDLEKVTKIARAMVASFGMSEKIG 672 (774)
T ss_pred -CccCchhhccHHHHHHHHHHHHHHcCcccccC
Confidence 68999999999999999999999999997643
No 13
>KOG0731|consensus
Probab=99.85 E-value=7.8e-22 Score=179.23 Aligned_cols=121 Identities=35% Similarity=0.521 Sum_probs=103.1
Q ss_pred CccccccCHHHHHHHHHHHhchhhhhhhhhCCcccccCcchHHHHHhhhhccceeee---------eeeeecCCccCCcc
Q psy14374 50 EKERYHVTKSQLMAMMDTMMGGRAAEELVFGAEKVTSGASSDLVQATSIATHMVREF---------VVTIIPRGVSLGHT 120 (180)
Q Consensus 50 ~~~~~~~t~~~l~~~i~v~LaGraaE~v~~g~~~~s~g~~~dl~~A~~~A~~~v~~~---------g~si~prg~~lG~~ 120 (180)
+++...+++.+|+++++++++|..........+ ..-.-|+|+|+|+++.| +++|+|+. ++||+
T Consensus 535 r~~~~~i~~~~~~~a~~Rvi~G~~~~~~~~~~~-------~~~~~a~~eagha~~g~~l~~~dpl~kvsIiPGq-alG~a 606 (774)
T KOG0731|consen 535 RKGLREIGTKDLEYAIERVIAGMEKKSRVLSLE-------EKKTVAYHEAGHAVVGWLLEHADPLLKVSIIPGQ-ALGYA 606 (774)
T ss_pred HhccCccchhhHHHHHHHHhccccccchhcCHh-------hhhhhhhhhccchhhhccccccCcceeEEeccCC-ccceE
Confidence 445677899999999999999986544444322 12234899999999996 89999955 99999
Q ss_pred cccCccccccccHHHHHHHHHHHhhhHHHHHHHhCCCccccChhhHHHHHHHHHHHHhhC
Q psy14374 121 AYIPEKERYHVTKSQLMAMMDTMMGGRAAEELVFGAEKVTSGASSDLVQATSIATHMVRE 180 (180)
Q Consensus 121 ~~~p~e~~~~~tk~~l~~~i~v~L~Gr~Ae~~~~g~~~~~~ga~~Dl~~at~~a~~mv~~ 180 (180)
+|.|.++ ++++++||.++||+.||||||||++|| +++||||.|||++||++|+.||++
T Consensus 607 ~~~P~~~-~l~sk~ql~~rm~m~LGGRaAEev~fg-~~iTtga~ddl~kvT~~A~~~V~~ 664 (774)
T KOG0731|consen 607 QYLPTDD-YLLSKEQLFDRMVMALGGRAAEEVVFG-SEITTGAQDDLEKVTKIARAMVAS 664 (774)
T ss_pred EECCccc-ccccHHHHHHHHHHHhCcchhhheecC-CccCchhhccHHHHHHHHHHHHHH
Confidence 9999866 889999999999999999999999999 589999999999999999999963
No 14
>TIGR01241 FtsH_fam ATP-dependent metalloprotease FtsH. HflB(FtsH) is a pleiotropic protein required for correct cell division in bacteria. It has ATP-dependent zinc metalloprotease activity. It was formerly designated cell division protein FtsH.
Probab=99.77 E-value=6.9e-19 Score=156.21 Aligned_cols=115 Identities=43% Similarity=0.654 Sum_probs=91.6
Q ss_pred cccCHHHHHHHHHHHhchhhhhhhhhCCcccccCc--chHHHHHhhhhccceeee---------eeeeecCCccCCcccc
Q psy14374 54 YHVTKSQLMAMMDTMMGGRAAEELVFGAEKVTSGA--SSDLVQATSIATHMVREF---------VVTIIPRGVSLGHTAY 122 (180)
Q Consensus 54 ~~~t~~~l~~~i~v~LaGraaE~v~~g~~~~s~g~--~~dl~~A~~~A~~~v~~~---------g~si~prg~~lG~~~~ 122 (180)
..++.++|+.++.....| .++..... ......|+|+|+|++..| ++++.||+.++|++++
T Consensus 281 ~~i~~~~l~~a~~~~~~~---------~~~~~~~~~~~~~~~~A~hEaGhAlv~~~l~~~~~v~~vsi~prg~~~G~~~~ 351 (495)
T TIGR01241 281 TEITMNDIEEAIDRVIAG---------PEKKSRVISEKEKKLVAYHEAGHALVGLLLKDADPVHKVTIIPRGQALGYTQF 351 (495)
T ss_pred CCCCHHHHHHHHHHHhcc---------cccccccccHHHHHHHHHHHHhHHHHHHhcCCCCceEEEEEeecCCccceEEe
Confidence 345666666655544433 22212222 223455999999987654 6899999999999999
Q ss_pred cCccccccccHHHHHHHHHHHhhhHHHHHHHhCCCccccChhhHHHHHHHHHHHHhh
Q psy14374 123 IPEKERYHVTKSQLMAMMDTMMGGRAAEELVFGAEKVTSGASSDLVQATSIATHMVR 179 (180)
Q Consensus 123 ~p~e~~~~~tk~~l~~~i~v~L~Gr~Ae~~~~g~~~~~~ga~~Dl~~at~~a~~mv~ 179 (180)
.|+++....|+++|+++|+++|||||||+++|| ++|+|+++||++||++|+.||.
T Consensus 352 ~~~~~~~~~t~~~l~~~i~v~LaGraAE~~~~G--~~s~Ga~~Dl~~At~lA~~mv~ 406 (495)
T TIGR01241 352 LPEEDKYLYTKSQLLAQIAVLLGGRAAEEIIFG--EVTTGASNDIKQATNIARAMVT 406 (495)
T ss_pred cCccccccCCHHHHHHHHHHHhhHHHHHHHHhc--CCCCCchHHHHHHHHHHHHHHH
Confidence 998777889999999999999999999999999 8999999999999999999986
No 15
>CHL00206 ycf2 Ycf2; Provisional
Probab=99.02 E-value=5.1e-10 Score=110.10 Aligned_cols=72 Identities=17% Similarity=0.138 Sum_probs=63.0
Q ss_pred eehhhHHHHHHHHhHhCCCCCCcccCCCceeEEEeeC------CCCceEEEeecCccccccCHHHHHHHHHHHhchhhhh
Q psy14374 2 ITAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVTIIPR------GVSLGHTAYIPEKERYHVTKSQLMAMMDTMMGGRAAE 75 (180)
Q Consensus 2 ~~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vsi~~~------g~~~G~~~~~~~~~~~~~t~~~l~~~i~v~LaGraaE 75 (180)
-+|+||+||||++.+++...|+ ++|||.++ |.+.|++.+.|.+ ..+++..++..|.++|||+|||
T Consensus 1892 ~ia~yEiGhAvvq~~L~~~~pv-------~kISIy~~~~~~r~~~~yl~~wyle~~--~~mkk~tiL~~Il~cLAGraAe 1962 (2281)
T CHL00206 1892 GILFYQIGRAVAQNVLLSNCPI-------DPISIYMKKKSCKEGDSYLYKWYFELG--TSMKKLTILLYLLSCSAGSVAQ 1962 (2281)
T ss_pred hhhhhHHhHHHHHHhccCCCCc-------ceEEEecCCccccCcccceeEeecCCc--ccCCHHHHHHHHHHHhhhhhhh
Confidence 3699999999999999988887 99999532 4567999999875 7899999999999999999999
Q ss_pred hhhhCCc
Q psy14374 76 ELVFGAE 82 (180)
Q Consensus 76 ~v~~g~~ 82 (180)
+++|.+.
T Consensus 1963 dlwf~~~ 1969 (2281)
T CHL00206 1963 DLWSLPG 1969 (2281)
T ss_pred hhccCcc
Confidence 9999754
No 16
>CHL00206 ycf2 Ycf2; Provisional
Probab=98.44 E-value=2.7e-07 Score=91.53 Aligned_cols=93 Identities=11% Similarity=0.036 Sum_probs=71.4
Q ss_pred cccccCHHHHHHHHHHHhchhhhhhhhhCCcccccCcchHHHHHhhhhccceeee---------eeeeec------CCcc
Q psy14374 52 ERYHVTKSQLMAMMDTMMGGRAAEELVFGAEKVTSGASSDLVQATSIATHMVREF---------VVTIIP------RGVS 116 (180)
Q Consensus 52 ~~~~~t~~~l~~~i~v~LaGraaE~v~~g~~~~s~g~~~dl~~A~~~A~~~v~~~---------g~si~p------rg~~ 116 (180)
++..++..++..++.+.++|... ++ ....+-..|.+++||||+++ ++||.+ ++.+
T Consensus 1861 ~ks~Id~~~I~~Al~Rq~~g~~~-~~---------~~~~~~~ia~yEiGhAvvq~~L~~~~pv~kISIy~~~~~~r~~~~ 1930 (2281)
T CHL00206 1861 KKSIIDTNTIRSALHRQTWDLRS-QV---------RSVQDHGILFYQIGRAVAQNVLLSNCPIDPISIYMKKKSCKEGDS 1930 (2281)
T ss_pred CCCccCHHHHHHHHHHHHhhhhh-cc---------cCcchhhhhhhHHhHHHHHHhccCCCCcceEEEecCCccccCccc
Confidence 45667888999988888877532 11 11122245889999998774 788742 2567
Q ss_pred CCcccccCccccccccHHHHHHHHHHHhhhHHHHHHHhCC
Q psy14374 117 LGHTAYIPEKERYHVTKSQLMAMMDTMMGGRAAEELVFGA 156 (180)
Q Consensus 117 lG~~~~~p~e~~~~~tk~~l~~~i~v~L~Gr~Ae~~~~g~ 156 (180)
+|+++|.|.+ ..+++.+++.+|.++|||||||+++|.+
T Consensus 1931 yl~~wyle~~--~~mkk~tiL~~Il~cLAGraAedlwf~~ 1968 (2281)
T CHL00206 1931 YLYKWYFELG--TSMKKLTILLYLLSCSAGSVAQDLWSLP 1968 (2281)
T ss_pred ceeEeecCCc--ccCCHHHHHHHHHHHhhhhhhhhhccCc
Confidence 8999999965 7889999999999999999999999975
No 17
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=96.65 E-value=0.0024 Score=50.00 Aligned_cols=64 Identities=22% Similarity=0.248 Sum_probs=39.6
Q ss_pred ehhhHHHHHHHHhHhCCCCCCcccCCCcee--------------------EEEeeCCCCceEEEeecCccc----cccCH
Q psy14374 3 TAYHEGGHAVVAFFTKDSHPLHKVTHPLHK--------------------VTIIPRGVSLGHTAYIPEKER----YHVTK 58 (180)
Q Consensus 3 ~A~HEaGhav~a~~~~~~~~~~~~~~~v~~--------------------vsi~~~g~~~G~~~~~~~~~~----~~~t~ 58 (180)
+.+||.||.++|.+++- ++ .+ +++.|=| ||+.+...++. .....
T Consensus 12 v~iHElGH~~~Ar~~Gv--~v-------~~f~iGfGp~l~~~~~~~t~~~i~~iPlG---Gyv~~~~~~~~~~~~~~~~~ 79 (182)
T cd06163 12 IFVHELGHFLVAKLFGV--KV-------EEFSIGFGPKLFSFKKGETEYSISAIPLG---GYVKMLGEDPEEEADPEDDP 79 (182)
T ss_pred HHHHHHHHHHHHHHcCC--ee-------eEeeeecCceeeeeecCCeEEEEEEEEec---cEEEecCCCcccccccccch
Confidence 57899999999998762 22 33 4555655 78887532211 11222
Q ss_pred HHH-----HHHHHHHhchhhhhhhh
Q psy14374 59 SQL-----MAMMDTMMGGRAAEELV 78 (180)
Q Consensus 59 ~~l-----~~~i~v~LaGraaE~v~ 78 (180)
.++ .+++.+.+||..+.-++
T Consensus 80 ~~f~~~~~~~ri~V~lAGP~~Nlil 104 (182)
T cd06163 80 RSFNSKPVWQRILIVFAGPLANFLL 104 (182)
T ss_pred HHHccCCcchhhhhhhhHHHHHHHH
Confidence 223 35677889998776443
No 18
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=96.64 E-value=0.0026 Score=50.44 Aligned_cols=36 Identities=31% Similarity=0.408 Sum_probs=29.2
Q ss_pred ehhhHHHHHHHHhHhCCCCCCcccCCCceeEEEeeCCCCceEEEeec
Q psy14374 3 TAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVTIIPRGVSLGHTAYIP 49 (180)
Q Consensus 3 ~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vsi~~~g~~~G~~~~~~ 49 (180)
+.+||.|||+++.++++ ++.++++.|. ..|.+.+.+
T Consensus 25 t~~HE~gHal~a~l~G~---------~v~~i~l~~~--~~G~~~~~~ 60 (200)
T PF13398_consen 25 TFVHELGHALAALLTGG---------RVKGIVLFPD--GSGVTVSSG 60 (200)
T ss_pred HHHHHHHHHHHHHHhCC---------CcceEEEEeC--CCceEEEec
Confidence 57899999999998884 4589999986 347777765
No 19
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=96.38 E-value=0.007 Score=48.93 Aligned_cols=58 Identities=24% Similarity=0.223 Sum_probs=41.8
Q ss_pred ehhhHHHHHHHHhHhCCCCCCcccCCCceeEEEeeCCCCceEEEeecCccccccCHHHHHHHHHHHhchhhhhhhhh
Q psy14374 3 TAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVTIIPRGVSLGHTAYIPEKERYHVTKSQLMAMMDTMMGGRAAEELVF 79 (180)
Q Consensus 3 ~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vsi~~~g~~~G~~~~~~~~~~~~~t~~~l~~~i~v~LaGraaE~v~~ 79 (180)
+.+||.||++++..++- + +.++++.|=| |++.+.+... + -.+++.+.+||+.+.-++.
T Consensus 56 v~iHElgH~~~A~~~G~--~-------v~~i~l~p~G---g~~~~~~~~~----~---~~~~~~IalAGPl~Nllla 113 (227)
T cd06164 56 VLLHELGHSLVARRYGI--P-------VRSITLFLFG---GVARLEREPE----T---PGQEFVIAIAGPLVSLVLA 113 (227)
T ss_pred HHHHHHHHHHHHHHcCC--e-------ECeEEEEeee---EEEEecCCCC----C---HHHHhhhhhhHHHHHHHHH
Confidence 46899999999998773 3 3788898866 7776654322 2 2466788999998876553
No 20
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=96.15 E-value=0.0062 Score=53.51 Aligned_cols=68 Identities=21% Similarity=0.205 Sum_probs=40.9
Q ss_pred ehhhHHHHHHHHhHhCCCCCCc-------------ccCCCceeEEEeeCCCCceEEEee---cCcc-------ccccCHH
Q psy14374 3 TAYHEGGHAVVAFFTKDSHPLH-------------KVTHPLHKVTIIPRGVSLGHTAYI---PEKE-------RYHVTKS 59 (180)
Q Consensus 3 ~A~HEaGhav~a~~~~~~~~~~-------------~~~~~v~~vsi~~~g~~~G~~~~~---~~~~-------~~~~t~~ 59 (180)
+.+||.||.++|.+++- .++ +..+.-..++..|-| ||+.+. +.++ .......
T Consensus 17 v~~HE~gH~~~a~~~g~--~v~~FsiGfGp~l~~~~~~~tey~i~~~plG---g~v~~~g~~~~~~~~~~~~~~~~f~~~ 91 (420)
T TIGR00054 17 IFVHELGHFLAARLCGI--KVERFSIGFGPKILKFKKNGTEYAISLIPLG---GYVKMKGLDKEMEVKPPETDGDLFNNK 91 (420)
T ss_pred HHHHhHHHHHHHHHcCC--EEEEEEEccCchheEEecCCeEEEEEEecCc---ceEeeccCCcccccCCcchhhhhhccC
Confidence 67999999999998773 110 000111136666666 888874 1111 1123344
Q ss_pred HHHHHHHHHhchhhhh
Q psy14374 60 QLMAMMDTMMGGRAAE 75 (180)
Q Consensus 60 ~l~~~i~v~LaGraaE 75 (180)
...+++.+++||..+.
T Consensus 92 ~~~~r~~i~~aGp~~N 107 (420)
T TIGR00054 92 SVFQKAIIIFAGPLAN 107 (420)
T ss_pred CHHHHHHhhhcccHHH
Confidence 5667788889998776
No 21
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=96.00 E-value=0.0097 Score=47.39 Aligned_cols=59 Identities=25% Similarity=0.268 Sum_probs=41.4
Q ss_pred ehhhHHHHHHHHhHhCCCCCCcccCCCceeEEEeeCCCCceEEEeecCccccccCHHHHHHHHHHHhchhhhhhhhhC
Q psy14374 3 TAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVTIIPRGVSLGHTAYIPEKERYHVTKSQLMAMMDTMMGGRAAEELVFG 80 (180)
Q Consensus 3 ~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vsi~~~g~~~G~~~~~~~~~~~~~t~~~l~~~i~v~LaGraaE~v~~g 80 (180)
+..||.||++++..++. + +.++++.|=| |++..... .. .-.+.+.+.+||+...-++..
T Consensus 41 v~iHElgH~~~A~~~G~--~-------v~~i~l~p~G---g~~~~~~~---~~----~~~~~~lIalAGPl~n~~la~ 99 (208)
T cd06161 41 VLLHELGHALVARRYGI--R-------VRSITLLPFG---GVAELEEE---PE----TPKEEFVIALAGPLVSLLLAG 99 (208)
T ss_pred HHHHHHHHHHHHHHcCC--C-------ccceEEEeee---eeeeeccC---CC----ChhHheeeeeehHHHHHHHHH
Confidence 56899999999998873 3 3788998876 66655322 11 235677788999988765543
No 22
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=95.75 E-value=0.0048 Score=47.67 Aligned_cols=66 Identities=23% Similarity=0.164 Sum_probs=38.0
Q ss_pred ehhhHHHHHHHHhHhCCCCCCcccCCCceeEE----EeeCC----------CCceEEEeecCccccccCHHHHHHHHHHH
Q psy14374 3 TAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVT----IIPRG----------VSLGHTAYIPEKERYHVTKSQLMAMMDTM 68 (180)
Q Consensus 3 ~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vs----i~~~g----------~~~G~~~~~~~~~~~~~t~~~l~~~i~v~ 68 (180)
+..||.||+++|..++.. + .+.+ ..|.. ...||+...+..... .+..-.+++.+.
T Consensus 11 i~iHE~gH~~~A~~~G~~--~-------~~~~~~~~~~~~~~~~~~~~~~ip~gG~~~~~~~~~~~--~~~~~~~~~~i~ 79 (180)
T cd05709 11 VTVHELGHALVARRLGVK--V-------ARFSGGFTLNPLKHGDPYGIILIPLGGYAKPVGENPRA--FKKPRWQRLLVA 79 (180)
T ss_pred HHHHHHHHHHHHHHcCCC--c-------hheeeeEEECCcCCCCEehHHHHhccCeeccCCCChhh--hccchhhhhhhh
Confidence 578999999999988742 1 2222 22211 124777655432211 122345667788
Q ss_pred hchhhhhhhhh
Q psy14374 69 MGGRAAEELVF 79 (180)
Q Consensus 69 LaGraaE~v~~ 79 (180)
+||..+.-++.
T Consensus 80 laGPl~nllla 90 (180)
T cd05709 80 LAGPLANLLLA 90 (180)
T ss_pred hhhHHHHHHHH
Confidence 99988775543
No 23
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=95.71 E-value=0.014 Score=51.60 Aligned_cols=69 Identities=20% Similarity=0.196 Sum_probs=41.0
Q ss_pred eehhhHHHHHHHHhHhCCCCCCc-------c-------cCCCceeEEEeeCCCCceEEEeecCccc--------cccCHH
Q psy14374 2 ITAYHEGGHAVVAFFTKDSHPLH-------K-------VTHPLHKVTIIPRGVSLGHTAYIPEKER--------YHVTKS 59 (180)
Q Consensus 2 ~~A~HEaGhav~a~~~~~~~~~~-------~-------~~~~v~~vsi~~~g~~~G~~~~~~~~~~--------~~~t~~ 59 (180)
+|.+||.||.++|.+++- .++ | ..+.-..++..|=| ||+.+.-.+.. .....+
T Consensus 17 li~vHElGHfl~Ar~~gv--~V~~FsiGfGp~l~~~~~~~~Tey~i~~iPlG---GyVk~~~e~~~~~~~~~~~~~f~~k 91 (449)
T PRK10779 17 LITVHEFGHFWVARRCGV--RVERFSIGFGKALWRRTDRQGTEYVIALIPLG---GYVKMLDERVEPVAPELRHHAFNNK 91 (449)
T ss_pred HHHHHHHHHHHHHHHcCC--eeeEEEeecChhHeeEecCCCcEEEEEEEcCC---CeeecCCCCCCcCChhhhhhhhccC
Confidence 367999999999998772 221 0 11222335666665 88876532110 112233
Q ss_pred HHHHHHHHHhchhhhh
Q psy14374 60 QLMAMMDTMMGGRAAE 75 (180)
Q Consensus 60 ~l~~~i~v~LaGraaE 75 (180)
-.-+++-+++||..+.
T Consensus 92 ~~~~R~~i~~AGp~~N 107 (449)
T PRK10779 92 TVGQRAAIIAAGPIAN 107 (449)
T ss_pred CHHHhhhhhhhhHHHH
Confidence 3566778889998776
No 24
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=95.00 E-value=0.037 Score=43.32 Aligned_cols=60 Identities=22% Similarity=0.122 Sum_probs=36.9
Q ss_pred ehhhHHHHHHHHhHhCCCCCCcccCCCceeEEEeeC---CCCceEEEeecCccccccCHHHHHHHHHHHhchhhhhhhh
Q psy14374 3 TAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVTIIPR---GVSLGHTAYIPEKERYHVTKSQLMAMMDTMMGGRAAEELV 78 (180)
Q Consensus 3 ~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vsi~~~---g~~~G~~~~~~~~~~~~~t~~~l~~~i~v~LaGraaE~v~ 78 (180)
+..||.||+++|...+. +. .+..+.|- |.-++++.+... ..+ -.+++.+.++|+.+.-++
T Consensus 44 l~iHElgH~~~A~~~G~--~~-------~~~~l~P~~~~G~~G~~~~~~~~----~~~---~~~~~~IalAGPl~nl~l 106 (183)
T cd06160 44 LGIHEMGHYLAARRHGV--KA-------SLPYFIPFPFIGTFGAFIRMRSP----IPN---RKALFDIALAGPLAGLLL 106 (183)
T ss_pred HHHHHHHHHHHHHHCCC--Cc-------cceeeeehHhcCcEEEEEEecCC----CCC---hhHhehhhhhHHHHHHHH
Confidence 56899999999998773 33 55667773 222244433211 112 246678889998877443
No 25
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=94.64 E-value=0.059 Score=44.88 Aligned_cols=65 Identities=12% Similarity=0.026 Sum_probs=40.1
Q ss_pred ehhhHHHHHHHHhHhCCCCCCcccCCCceeEEEeeCC-CCceEEEeecCccccccCHHHHHHHHHHHhchhhhhhhhhC
Q psy14374 3 TAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVTIIPRG-VSLGHTAYIPEKERYHVTKSQLMAMMDTMMGGRAAEELVFG 80 (180)
Q Consensus 3 ~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vsi~~~g-~~~G~~~~~~~~~~~~~t~~~l~~~i~v~LaGraaE~v~~g 80 (180)
+.+||.|||++|...+ .++ +.+.+.+=- -..||+...+++... .+ -.+++.+.+||....-++-.
T Consensus 138 vvvHElgHal~A~~~g--i~V-------~~iGl~l~~~~pGa~ve~~~e~~~~-~~---~~~~l~Ia~AGp~~NlvLa~ 203 (277)
T cd06162 138 GVVHEMGHGVAAVREQ--VRV-------NGFGIFFFIIYPGAYVDLFTDHLNL-IS---PVQQLRIFCAGVWHNFVLGL 203 (277)
T ss_pred HHHHHHHHHHHHHHcC--Cee-------ceEEEeeeeccCeeEEeeccccccc-CC---hhhhhheehhhHHHHHHHHH
Confidence 4689999999998765 333 777765430 114787765442221 12 22477788999887755443
No 26
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=93.95 E-value=0.044 Score=42.52 Aligned_cols=17 Identities=41% Similarity=0.481 Sum_probs=14.1
Q ss_pred ehhhHHHHHHHHhHhCC
Q psy14374 3 TAYHEGGHAVVAFFTKD 19 (180)
Q Consensus 3 ~A~HEaGhav~a~~~~~ 19 (180)
+..||.||+++|...+.
T Consensus 10 i~~HE~gH~~~a~~~G~ 26 (192)
T PF02163_consen 10 IVLHELGHALAARLYGD 26 (192)
T ss_dssp HHHHHHHHHHHHHTTT-
T ss_pred ccccccccccccccccc
Confidence 57899999999997763
No 27
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=93.02 E-value=0.074 Score=41.49 Aligned_cols=69 Identities=25% Similarity=0.310 Sum_probs=39.4
Q ss_pred ehhhHHHHHHHHhHhCCCCCCcccCCCceeEEEeeC------CC---------CceEEEeecCccccccCHHHHHHHHHH
Q psy14374 3 TAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVTIIPR------GV---------SLGHTAYIPEKERYHVTKSQLMAMMDT 67 (180)
Q Consensus 3 ~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vsi~~~------g~---------~~G~~~~~~~~~~~~~t~~~l~~~i~v 67 (180)
+..||.|||++|+.++...+. .--++|+.|. |. ..|+....|-+.+. .+..-.+.+.+
T Consensus 12 i~~HE~aHa~~A~~~Gd~t~~-----~~Grltlnp~~hid~~g~l~~~~~~~~~~G~a~p~~~~~~~--~~~~r~~~~~v 84 (181)
T cd06158 12 ITLHEFAHAYVAYRLGDPTAR-----RAGRLTLNPLAHIDPIGTIILPLLLPFLFGWAKPVPVNPRN--FKNPRRGMLLV 84 (181)
T ss_pred HHHHHHHHHHHHHHcCCcHHH-----HcCceecCcHHhcCcchHHHHHHHHHhCeEEecccccChHh--hcccHhhHhhh
Confidence 578999999999988854332 1125676663 10 23443322221111 13334567778
Q ss_pred Hhchhhhhhhh
Q psy14374 68 MMGGRAAEELV 78 (180)
Q Consensus 68 ~LaGraaE~v~ 78 (180)
.+||..+.-++
T Consensus 85 alAGP~~n~~l 95 (181)
T cd06158 85 SLAGPLSNLLL 95 (181)
T ss_pred hhhhHHHHHHH
Confidence 89999877543
No 28
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=91.54 E-value=0.051 Score=44.98 Aligned_cols=61 Identities=20% Similarity=0.212 Sum_probs=36.1
Q ss_pred ehhhHHHHHHHHhHhCCCCCCcccCCCceeEEE----eeCCCCceEEEeecCccccccCHHHHHHHHHHHhchhhhhhhh
Q psy14374 3 TAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVTI----IPRGVSLGHTAYIPEKERYHVTKSQLMAMMDTMMGGRAAEELV 78 (180)
Q Consensus 3 ~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vsi----~~~g~~~G~~~~~~~~~~~~~t~~~l~~~i~v~LaGraaE~v~ 78 (180)
+.+||.||+++|...+ .++ +.+.+ .|-| ||+...++ ...+..-.+++.+.+||..+.-++
T Consensus 121 v~iHElgHa~~Ar~~G--~~V-------~~iGl~l~~ip~G---g~v~~~~~----~~~~~~~~~~~~Ia~AGP~~Nlvl 184 (263)
T cd06159 121 VVVHELSHGILARVEG--IKV-------KSGGLLLLIIPPG---AFVEPDEE----ELNKADRRIRLRIFAAGVTANFVV 184 (263)
T ss_pred HHHHHHHHHHHHHHcC--CEE-------CchhHHHHhhhcE---EEEEecch----hhccCChhheeeeeeehHHHHHHH
Confidence 5689999999999766 232 33332 2223 67654321 111222346777889998877544
Q ss_pred h
Q psy14374 79 F 79 (180)
Q Consensus 79 ~ 79 (180)
.
T Consensus 185 a 185 (263)
T cd06159 185 A 185 (263)
T ss_pred H
Confidence 3
No 29
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=86.01 E-value=0.3 Score=39.38 Aligned_cols=12 Identities=50% Similarity=0.620 Sum_probs=10.5
Q ss_pred ehhhHHHHHHHH
Q psy14374 3 TAYHEGGHAVVA 14 (180)
Q Consensus 3 ~A~HEaGhav~a 14 (180)
+|-||+|||+=.
T Consensus 92 VAAHEvGHAiQ~ 103 (222)
T PF04298_consen 92 VAAHEVGHAIQH 103 (222)
T ss_pred HHHHHHhHHHhc
Confidence 689999999974
No 30
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=85.61 E-value=0.31 Score=36.61 Aligned_cols=13 Identities=46% Similarity=0.475 Sum_probs=11.1
Q ss_pred eehhhHHHHHHHH
Q psy14374 2 ITAYHEGGHAVVA 14 (180)
Q Consensus 2 ~~A~HEaGhav~a 14 (180)
.|++||.||+|--
T Consensus 106 ~~~~HEiGHaLGL 118 (156)
T cd04279 106 AIALHELGHALGL 118 (156)
T ss_pred HHHHHHhhhhhcC
Confidence 3789999999984
No 31
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=82.55 E-value=0.53 Score=35.30 Aligned_cols=12 Identities=58% Similarity=0.504 Sum_probs=10.5
Q ss_pred eehhhHHHHHHH
Q psy14374 2 ITAYHEGGHAVV 13 (180)
Q Consensus 2 ~~A~HEaGhav~ 13 (180)
.+++||.||||-
T Consensus 96 ~~~~HEiGHaLG 107 (165)
T cd04268 96 NTAEHELGHALG 107 (165)
T ss_pred HHHHHHHHHHhc
Confidence 368999999997
No 32
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=79.22 E-value=0.92 Score=33.21 Aligned_cols=12 Identities=50% Similarity=0.493 Sum_probs=10.5
Q ss_pred ehhhHHHHHHHH
Q psy14374 3 TAYHEGGHAVVA 14 (180)
Q Consensus 3 ~A~HEaGhav~a 14 (180)
++.||.||||--
T Consensus 89 ~~~HEigHaLGl 100 (140)
T smart00235 89 VAAHELGHALGL 100 (140)
T ss_pred cHHHHHHHHhcC
Confidence 689999999973
No 33
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=78.55 E-value=1.1 Score=36.05 Aligned_cols=14 Identities=50% Similarity=0.747 Sum_probs=11.8
Q ss_pred ehhhHHHHHHHHhH
Q psy14374 3 TAYHEGGHAVVAFF 16 (180)
Q Consensus 3 ~A~HEaGhav~a~~ 16 (180)
|-+||.|||++..+
T Consensus 95 ~l~HE~GHAlI~~~ 108 (220)
T PF14247_consen 95 TLYHELGHALIDDL 108 (220)
T ss_pred HHHHHHHHHHHHHh
Confidence 46899999999763
No 34
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=76.16 E-value=1.1 Score=34.51 Aligned_cols=12 Identities=50% Similarity=0.493 Sum_probs=10.5
Q ss_pred eehhhHHHHHHH
Q psy14374 2 ITAYHEGGHAVV 13 (180)
Q Consensus 2 ~~A~HEaGhav~ 13 (180)
.+++||.||||-
T Consensus 115 ~t~~HEiGHaLG 126 (186)
T cd04277 115 QTIIHEIGHALG 126 (186)
T ss_pred HHHHHHHHHHhc
Confidence 368999999997
No 35
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=70.56 E-value=1.9 Score=33.81 Aligned_cols=12 Identities=42% Similarity=0.437 Sum_probs=10.4
Q ss_pred ehhhHHHHHHHH
Q psy14374 3 TAYHEGGHAVVA 14 (180)
Q Consensus 3 ~A~HEaGhav~a 14 (180)
+++||.||||--
T Consensus 95 ~i~HElgHaLG~ 106 (198)
T cd04327 95 VVLHEFGHALGF 106 (198)
T ss_pred HHHHHHHHHhcC
Confidence 678999999983
No 36
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=66.54 E-value=2.6 Score=31.39 Aligned_cols=13 Identities=46% Similarity=0.360 Sum_probs=11.0
Q ss_pred eehhhHHHHHHHH
Q psy14374 2 ITAYHEGGHAVVA 14 (180)
Q Consensus 2 ~~A~HEaGhav~a 14 (180)
.+..||.||+|--
T Consensus 98 ~~~~HElGH~LGl 110 (167)
T cd00203 98 QTIAHELGHALGF 110 (167)
T ss_pred hhHHHHHHHHhCC
Confidence 4688999999984
No 37
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=63.01 E-value=3.6 Score=30.42 Aligned_cols=12 Identities=42% Similarity=0.354 Sum_probs=10.3
Q ss_pred eehhhHHHHHHH
Q psy14374 2 ITAYHEGGHAVV 13 (180)
Q Consensus 2 ~~A~HEaGhav~ 13 (180)
.|+.||.||.|-
T Consensus 79 RIaaHE~GHiLG 90 (132)
T PF02031_consen 79 RIAAHELGHILG 90 (132)
T ss_dssp HHHHHHHHHHHT
T ss_pred eeeeehhccccC
Confidence 378999999886
No 38
>KOG2921|consensus
Probab=62.05 E-value=4.1 Score=35.76 Aligned_cols=12 Identities=50% Similarity=0.589 Sum_probs=10.8
Q ss_pred hhhHHHHHHHHh
Q psy14374 4 AYHEGGHAVVAF 15 (180)
Q Consensus 4 A~HEaGhav~a~ 15 (180)
-+||.|||++|.
T Consensus 135 vvHElGHalAA~ 146 (484)
T KOG2921|consen 135 VVHELGHALAAA 146 (484)
T ss_pred HHHHhhHHHHHH
Confidence 479999999986
No 39
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=60.81 E-value=3.6 Score=32.69 Aligned_cols=11 Identities=55% Similarity=0.700 Sum_probs=9.9
Q ss_pred ehhhHHHHHHH
Q psy14374 3 TAYHEGGHAVV 13 (180)
Q Consensus 3 ~A~HEaGhav~ 13 (180)
+|-||.|||+=
T Consensus 95 VAAHEVGHAiQ 105 (226)
T COG2738 95 VAAHEVGHAIQ 105 (226)
T ss_pred HHHHHhhHHHh
Confidence 68899999996
No 40
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=54.16 E-value=6.2 Score=30.50 Aligned_cols=13 Identities=38% Similarity=0.064 Sum_probs=10.7
Q ss_pred eehhhHHHHHHHH
Q psy14374 2 ITAYHEGGHAVVA 14 (180)
Q Consensus 2 ~~A~HEaGhav~a 14 (180)
-+.+||.||||--
T Consensus 76 g~v~HE~~HalG~ 88 (180)
T cd04280 76 GTIVHELMHALGF 88 (180)
T ss_pred chhHHHHHHHhcC
Confidence 3678999999874
No 41
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=54.10 E-value=5 Score=30.41 Aligned_cols=11 Identities=45% Similarity=0.401 Sum_probs=9.5
Q ss_pred ehhhHHHHHHH
Q psy14374 3 TAYHEGGHAVV 13 (180)
Q Consensus 3 ~A~HEaGhav~ 13 (180)
+..||.||.|-
T Consensus 72 TltHEvGH~LG 82 (154)
T PF05572_consen 72 TLTHEVGHWLG 82 (154)
T ss_dssp HHHHHHHHHTT
T ss_pred chhhhhhhhhc
Confidence 57899999886
No 42
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=50.56 E-value=7.6 Score=30.34 Aligned_cols=13 Identities=31% Similarity=0.191 Sum_probs=10.8
Q ss_pred ehhhHHHHHHHHh
Q psy14374 3 TAYHEGGHAVVAF 15 (180)
Q Consensus 3 ~A~HEaGhav~a~ 15 (180)
+.+||.||||--+
T Consensus 80 ~i~HEl~HaLG~~ 92 (182)
T cd04283 80 IIQHELLHALGFY 92 (182)
T ss_pred hHHHHHHHHhCCc
Confidence 5799999999843
No 43
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=47.97 E-value=8.8 Score=30.43 Aligned_cols=12 Identities=33% Similarity=0.385 Sum_probs=10.2
Q ss_pred ehhhHHHHHHHH
Q psy14374 3 TAYHEGGHAVVA 14 (180)
Q Consensus 3 ~A~HEaGhav~a 14 (180)
+.+||.||||--
T Consensus 90 ti~HEl~HaLGf 101 (200)
T cd04281 90 IVVHELGHVIGF 101 (200)
T ss_pred hHHHHHHHHhcC
Confidence 578999999983
No 44
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=45.48 E-value=10 Score=29.62 Aligned_cols=11 Identities=55% Similarity=0.619 Sum_probs=9.8
Q ss_pred ehhhHHHHHHH
Q psy14374 3 TAYHEGGHAVV 13 (180)
Q Consensus 3 ~A~HEaGhav~ 13 (180)
++.||.||||-
T Consensus 82 ~i~HEl~HaLG 92 (191)
T PF01400_consen 82 TILHELGHALG 92 (191)
T ss_dssp HHHHHHHHHHT
T ss_pred chHHHHHHHHh
Confidence 57899999998
No 45
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=42.70 E-value=4.8 Score=32.48 Aligned_cols=12 Identities=50% Similarity=0.363 Sum_probs=10.3
Q ss_pred eehhhHHHHHHH
Q psy14374 2 ITAYHEGGHAVV 13 (180)
Q Consensus 2 ~~A~HEaGhav~ 13 (180)
.+++||.||+|-
T Consensus 139 ~t~~HEvGH~lG 150 (225)
T cd04275 139 DTATHEVGHWLG 150 (225)
T ss_pred ceeEEeccceee
Confidence 378999999987
No 46
>KOG3714|consensus
Probab=42.51 E-value=11 Score=33.04 Aligned_cols=13 Identities=38% Similarity=0.271 Sum_probs=11.5
Q ss_pred ehhhHHHHHHHHh
Q psy14374 3 TAYHEGGHAVVAF 15 (180)
Q Consensus 3 ~A~HEaGhav~a~ 15 (180)
|++||.+|||--|
T Consensus 162 ~i~HEl~HaLGf~ 174 (411)
T KOG3714|consen 162 TIVHELMHALGFW 174 (411)
T ss_pred hhHHHHHHHhhhh
Confidence 6899999999965
No 47
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=40.98 E-value=8.8 Score=30.36 Aligned_cols=12 Identities=33% Similarity=0.351 Sum_probs=10.0
Q ss_pred eehhhHHHHHHH
Q psy14374 2 ITAYHEGGHAVV 13 (180)
Q Consensus 2 ~~A~HEaGhav~ 13 (180)
.+..||.||+|-
T Consensus 118 ~~~~he~gh~lG 129 (197)
T cd04276 118 YLLAHEVGHTLG 129 (197)
T ss_pred HHHHHHHHHHhc
Confidence 357899999987
No 48
>KOG1565|consensus
Probab=40.83 E-value=7.7 Score=34.82 Aligned_cols=12 Identities=50% Similarity=0.601 Sum_probs=10.6
Q ss_pred eehhhHHHHHHH
Q psy14374 2 ITAYHEGGHAVV 13 (180)
Q Consensus 2 ~~A~HEaGhav~ 13 (180)
.+|+||.||+|-
T Consensus 213 ~Va~HEiGH~LG 224 (469)
T KOG1565|consen 213 LVAAHEIGHALG 224 (469)
T ss_pred HHhhhhcccccc
Confidence 479999999986
No 49
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=40.28 E-value=14 Score=30.03 Aligned_cols=12 Identities=42% Similarity=0.127 Sum_probs=10.2
Q ss_pred ehhhHHHHHHHH
Q psy14374 3 TAYHEGGHAVVA 14 (180)
Q Consensus 3 ~A~HEaGhav~a 14 (180)
+.+||.||||-=
T Consensus 123 ti~HEl~HalGf 134 (230)
T cd04282 123 TVEHEFLHALGF 134 (230)
T ss_pred hHHHHHHHHhCC
Confidence 578999999973
No 50
>PF02074 Peptidase_M32: Carboxypeptidase Taq (M32) metallopeptidase; InterPro: IPR001333 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M32 (carboxypeptidase Taq family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Carboxypeptidase Taq is a zinc-containing thermostable metallopeptidase. It was originally discovered and purified from Thermus aquaticus; optimal enzymatic activity occurs at 80 celcius. Although very little is known about this enzyme, it is thought either to be associated with a membrane or to be particle bound.; GO: 0004181 metallocarboxypeptidase activity, 0006508 proteolysis; PDB: 1K9X_A 1KA4_A 1KA2_A 3DWC_A 1WGZ_A 3HQ2_A 3HOA_B.
Probab=38.24 E-value=16 Score=33.06 Aligned_cols=15 Identities=33% Similarity=0.321 Sum_probs=11.1
Q ss_pred hhhHHHHHHHHhHhC
Q psy14374 4 AYHEGGHAVVAFFTK 18 (180)
Q Consensus 4 A~HEaGhav~a~~~~ 18 (180)
.+||+|||+=..-++
T Consensus 263 ~iHE~GHalYEq~~~ 277 (494)
T PF02074_consen 263 TIHETGHALYEQGLP 277 (494)
T ss_dssp HHHHHHHHHHHHTS-
T ss_pred HHHHHHHHHHHcCCC
Confidence 579999999865444
No 51
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=36.53 E-value=17 Score=24.79 Aligned_cols=13 Identities=38% Similarity=0.483 Sum_probs=10.8
Q ss_pred ehhhHHHHHHHHh
Q psy14374 3 TAYHEGGHAVVAF 15 (180)
Q Consensus 3 ~A~HEaGhav~a~ 15 (180)
+..||.||.+...
T Consensus 45 ~laHELgH~~~~~ 57 (122)
T PF06114_consen 45 TLAHELGHILLHH 57 (122)
T ss_dssp HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhhh
Confidence 5689999999865
No 52
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=36.05 E-value=16 Score=28.84 Aligned_cols=13 Identities=38% Similarity=0.245 Sum_probs=10.6
Q ss_pred ehhhHHHHHHHHh
Q psy14374 3 TAYHEGGHAVVAF 15 (180)
Q Consensus 3 ~A~HEaGhav~a~ 15 (180)
+-.||.||++-+.
T Consensus 140 ~~aHEiGH~lGl~ 152 (206)
T PF13583_consen 140 TFAHEIGHNLGLR 152 (206)
T ss_pred HHHHHHHHHhcCC
Confidence 4579999999864
No 53
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=35.61 E-value=16 Score=32.84 Aligned_cols=15 Identities=33% Similarity=0.392 Sum_probs=10.9
Q ss_pred hhhHHHHHHHHhHhC
Q psy14374 4 AYHEGGHAVVAFFTK 18 (180)
Q Consensus 4 A~HEaGhav~a~~~~ 18 (180)
.+||+|||+=---++
T Consensus 263 ~iHE~GHAlYEqn~~ 277 (497)
T COG2317 263 TIHETGHALYEQNLD 277 (497)
T ss_pred HHHhhhhHHHHcCCC
Confidence 489999999754333
No 54
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=34.96 E-value=18 Score=28.95 Aligned_cols=13 Identities=46% Similarity=0.511 Sum_probs=10.7
Q ss_pred ehhhHHHHHHHHh
Q psy14374 3 TAYHEGGHAVVAF 15 (180)
Q Consensus 3 ~A~HEaGhav~a~ 15 (180)
+..||.||+|...
T Consensus 75 tlAHELGH~llH~ 87 (213)
T COG2856 75 TLAHELGHALLHT 87 (213)
T ss_pred HHHHHHhHHHhcc
Confidence 5679999999853
No 55
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=33.24 E-value=19 Score=30.66 Aligned_cols=16 Identities=44% Similarity=0.621 Sum_probs=12.7
Q ss_pred ehhhHHHHHHHHhHhC
Q psy14374 3 TAYHEGGHAVVAFFTK 18 (180)
Q Consensus 3 ~A~HEaGhav~a~~~~ 18 (180)
+-.||.|||+=..+.+
T Consensus 157 tl~HE~GHa~h~~l~~ 172 (365)
T cd06258 157 TLFHEFGHAVHFLLIQ 172 (365)
T ss_pred HHHHHHhHHHHHHHhc
Confidence 4589999999776655
No 56
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=33.00 E-value=21 Score=27.32 Aligned_cols=13 Identities=46% Similarity=0.327 Sum_probs=9.9
Q ss_pred ehhhHHHHHHHHh
Q psy14374 3 TAYHEGGHAVVAF 15 (180)
Q Consensus 3 ~A~HEaGhav~a~ 15 (180)
+..||.||.+-+.
T Consensus 114 ~~aHElGH~lGa~ 126 (173)
T PF13574_consen 114 TFAHELGHQLGAP 126 (173)
T ss_dssp HHHHHHHHHHT--
T ss_pred eehhhhHhhcCCC
Confidence 5789999999975
No 57
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=32.79 E-value=17 Score=29.18 Aligned_cols=11 Identities=36% Similarity=0.465 Sum_probs=9.4
Q ss_pred ehhhHHHHHHH
Q psy14374 3 TAYHEGGHAVV 13 (180)
Q Consensus 3 ~A~HEaGhav~ 13 (180)
+-.||.||+|-
T Consensus 136 vi~HEiGH~IG 146 (211)
T PF12388_consen 136 VITHEIGHCIG 146 (211)
T ss_pred HHHHHhhhhcc
Confidence 45799999997
No 58
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=32.75 E-value=20 Score=31.44 Aligned_cols=16 Identities=31% Similarity=0.333 Sum_probs=12.4
Q ss_pred ehhhHHHHHHHHhHhC
Q psy14374 3 TAYHEGGHAVVAFFTK 18 (180)
Q Consensus 3 ~A~HEaGhav~a~~~~ 18 (180)
..+||.|||+=..-++
T Consensus 162 t~iHE~GHalye~~l~ 177 (396)
T cd06460 162 STIHETGHALYEQGLP 177 (396)
T ss_pred HHHHHhhHHHHHhcCC
Confidence 3589999999877444
No 59
>PF11350 DUF3152: Protein of unknown function (DUF3152); InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=25.38 E-value=27 Score=27.83 Aligned_cols=9 Identities=56% Similarity=0.682 Sum_probs=7.7
Q ss_pred hhHHHHHHH
Q psy14374 5 YHEGGHAVV 13 (180)
Q Consensus 5 ~HEaGhav~ 13 (180)
=||.||+|-
T Consensus 144 NHEVGH~LG 152 (203)
T PF11350_consen 144 NHEVGHALG 152 (203)
T ss_pred hhhhhhhcc
Confidence 399999995
No 60
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=25.28 E-value=46 Score=28.64 Aligned_cols=18 Identities=39% Similarity=0.656 Sum_probs=14.0
Q ss_pred ehhhHHHHHHHHhHhCCC
Q psy14374 3 TAYHEGGHAVVAFFTKDS 20 (180)
Q Consensus 3 ~A~HEaGhav~a~~~~~~ 20 (180)
+-.||.|||+=.++....
T Consensus 225 tl~HE~GHa~h~~~~~~~ 242 (427)
T cd06459 225 TLAHELGHAFHSYLSRDN 242 (427)
T ss_pred HHHHHhhHHHHHHHHccC
Confidence 347999999988876643
No 61
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=23.61 E-value=53 Score=29.86 Aligned_cols=16 Identities=31% Similarity=0.544 Sum_probs=12.2
Q ss_pred hhhHHHHHHHHhHhCC
Q psy14374 4 AYHEGGHAVVAFFTKD 19 (180)
Q Consensus 4 A~HEaGhav~a~~~~~ 19 (180)
=.||+|||+=.++...
T Consensus 341 L~HElGHa~H~~~s~~ 356 (549)
T TIGR02289 341 LTHEAGHAFHVYESRK 356 (549)
T ss_pred HHHHhhHHHHHHHhcC
Confidence 3699999997766553
No 62
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=21.80 E-value=44 Score=29.79 Aligned_cols=15 Identities=27% Similarity=0.574 Sum_probs=11.7
Q ss_pred hhhHHHHHHHHhHhC
Q psy14374 4 AYHEGGHAVVAFFTK 18 (180)
Q Consensus 4 A~HEaGhav~a~~~~ 18 (180)
=.||.|||+=.++..
T Consensus 267 LfHEfGHalH~~ls~ 281 (472)
T cd06455 267 FFHEFGHVIHHLLGR 281 (472)
T ss_pred HHHHHHHHHHHHhcC
Confidence 369999999876554
No 63
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=20.56 E-value=65 Score=22.00 Aligned_cols=16 Identities=19% Similarity=0.102 Sum_probs=12.5
Q ss_pred ehhhHHHHHHHHhHhC
Q psy14374 3 TAYHEGGHAVVAFFTK 18 (180)
Q Consensus 3 ~A~HEaGhav~a~~~~ 18 (180)
+..||.+|.+.....+
T Consensus 28 ~l~HE~~H~~~~~~~~ 43 (128)
T PF13485_consen 28 VLAHELAHQWFGNYFG 43 (128)
T ss_pred HHHHHHHHHHHHHHcC
Confidence 5789999999876433
No 64
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=20.51 E-value=70 Score=28.11 Aligned_cols=16 Identities=31% Similarity=0.671 Sum_probs=12.9
Q ss_pred hhhHHHHHHHHhHhCC
Q psy14374 4 AYHEGGHAVVAFFTKD 19 (180)
Q Consensus 4 A~HEaGhav~a~~~~~ 19 (180)
=.||.|||+=..+...
T Consensus 246 LfHE~GHa~H~~ls~~ 261 (458)
T PF01432_consen 246 LFHEFGHAMHSLLSRT 261 (458)
T ss_dssp HHHHHHHHHHHHHCCC
T ss_pred HHHHHhHHHHHHHhcc
Confidence 4799999999877654
No 65
>PF04315 DUF462: Protein of unknown function, DUF462; InterPro: IPR007411 This family consists of bacterial proteins of uncharacterised function.
Probab=20.39 E-value=52 Score=25.31 Aligned_cols=41 Identities=22% Similarity=0.291 Sum_probs=25.3
Q ss_pred ehhhHHHHHHHHhHhCCCCCCcccCCCceeEEEeeCCCCceEEEeecCccccccCHHHHHH
Q psy14374 3 TAYHEGGHAVVAFFTKDSHPLHKVTHPLHKVTIIPRGVSLGHTAYIPEKERYHVTKSQLMA 63 (180)
Q Consensus 3 ~A~HEaGhav~a~~~~~~~~~~~~~~~v~~vsi~~~g~~~G~~~~~~~~~~~~~t~~~l~~ 63 (180)
-|.||..|..+|- + .+-.+. .-|| .|.|+ +|..-+++.|+.
T Consensus 44 SALHEIaHWciAG--~------------~RR~l~----DfGY-WY~PD-GR~~~qQ~~FE~ 84 (164)
T PF04315_consen 44 SALHEIAHWCIAG--P------------ERRQLE----DFGY-WYCPD-GRDAEQQAEFEQ 84 (164)
T ss_pred HHHHHHHHHHhcc--c------------cccccc----cCCC-CcCCC-CCCHHHHHHHHh
Confidence 3899999999984 1 121221 2356 45665 777666766654
Done!