Query 045937
Match_columns 79
No_of_seqs 114 out of 302
Neff 5.4
Searched_HMMs 46136
Date Fri Mar 29 07:03:55 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/045937.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/045937hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF14541 TAXi_C: Xylanase inhi 99.8 4.1E-19 8.9E-24 119.7 5.2 60 3-62 102-161 (161)
2 cd05489 xylanase_inhibitor_I_l 99.7 1.6E-17 3.4E-22 125.1 8.0 62 3-64 301-362 (362)
3 PLN03146 aspartyl protease fam 99.6 8.3E-15 1.8E-19 112.6 7.5 61 3-73 370-430 (431)
4 cd05475 nucellin_like Nucellin 99.4 8E-13 1.7E-17 95.0 7.3 58 4-61 211-269 (273)
5 KOG1339 Aspartyl protease [Pos 99.3 3.8E-12 8.3E-17 96.3 7.7 65 3-72 331-397 (398)
6 cd05472 cnd41_like Chloroplast 99.3 1.8E-11 4E-16 88.3 7.8 62 3-71 237-299 (299)
7 cd05476 pepsin_A_like_plant Ch 99.1 3E-10 6.5E-15 81.1 7.7 61 3-71 205-265 (265)
8 cd05473 beta_secretase_like Be 98.9 1.5E-09 3.3E-14 80.8 4.5 64 4-75 285-351 (364)
9 cd06096 Plasmepsin_5 Plasmepsi 98.9 4.8E-09 1E-13 77.1 5.7 60 3-72 267-326 (326)
10 cd05474 SAP_like SAPs, pepsin- 98.7 2.2E-08 4.7E-13 71.5 5.5 58 3-63 233-295 (295)
11 cd05486 Cathespin_E Cathepsin 98.7 4E-08 8.7E-13 71.7 6.6 60 3-62 252-316 (316)
12 cd05478 pepsin_A Pepsin A, asp 98.6 4.9E-08 1.1E-12 71.3 5.3 56 3-62 261-317 (317)
13 cd05490 Cathepsin_D2 Cathepsin 98.6 1.2E-07 2.5E-12 69.4 6.2 60 3-62 261-325 (325)
14 cd05477 gastricsin Gastricsins 98.6 1.4E-07 3.1E-12 68.7 6.1 58 3-62 256-317 (318)
15 cd05485 Cathepsin_D_like Cathe 98.5 2.3E-07 5E-12 68.3 6.5 59 3-61 265-328 (329)
16 cd05487 renin_like Renin stimu 98.5 2.5E-07 5.4E-12 67.9 6.5 61 3-63 261-326 (326)
17 cd05488 Proteinase_A_fungi Fun 98.5 1.5E-07 3.3E-12 68.8 5.4 58 3-62 260-320 (320)
18 cd06098 phytepsin Phytepsin, a 98.4 5.4E-07 1.2E-11 66.0 6.0 60 3-62 253-317 (317)
19 PTZ00147 plasmepsin-1; Provisi 98.3 1.2E-06 2.6E-11 68.5 6.1 61 3-64 387-450 (453)
20 PTZ00013 plasmepsin 4 (PM4); P 98.3 1.1E-06 2.5E-11 68.7 5.8 60 3-63 386-448 (450)
21 PTZ00165 aspartyl protease; Pr 98.3 2.6E-06 5.7E-11 67.1 7.0 61 4-64 380-447 (482)
22 PF00026 Asp: Eukaryotic aspar 97.5 0.00019 4.1E-09 51.1 4.9 60 3-62 253-316 (317)
23 cd05471 pepsin_like Pepsin-lik 97.1 0.0004 8.7E-09 48.5 3.0 26 37-62 258-283 (283)
24 cd06097 Aspergillopepsin_like 96.7 0.0013 2.9E-08 47.0 2.6 26 37-62 253-278 (278)
25 PRK12442 translation initiatio 54.4 25 0.00054 22.4 3.6 55 7-62 16-71 (87)
26 TIGR00008 infA translation ini 33.1 1.1E+02 0.0024 18.4 3.9 53 7-60 14-67 (68)
27 PF14781 BBS2_N: Ciliary BBSom 31.9 62 0.0013 22.1 2.9 26 16-41 90-115 (136)
28 COG5343 Uncharacterized protei 29.9 56 0.0012 24.4 2.6 28 36-63 134-161 (240)
29 PF12768 Rax2: Cortical protei 29.0 1.9E+02 0.0041 21.6 5.4 48 3-58 25-76 (281)
30 PTZ00371 aspartyl aminopeptida 28.6 1.2E+02 0.0025 24.3 4.4 40 3-43 44-87 (465)
31 PF02186 TFIIE_beta: TFIIE bet 28.2 51 0.0011 19.5 1.8 19 45-63 44-62 (65)
32 PF13418 Kelch_4: Galactose ox 22.1 1.3E+02 0.0029 15.5 2.8 23 35-57 14-40 (49)
33 COG5428 Uncharacterized conser 20.4 1.2E+02 0.0026 18.5 2.4 18 44-61 29-46 (69)
34 PF03930 Flp_N: Recombinase Fl 20.1 1.2E+02 0.0026 19.1 2.5 17 48-64 24-40 (82)
35 PF08446 PAS_2: PAS fold; Int 20.1 89 0.0019 19.6 1.9 29 34-62 4-32 (110)
No 1
>PF14541 TAXi_C: Xylanase inhibitor C-terminal; PDB: 3AUP_D 3HD8_A 1T6G_A 1T6E_X 2B42_A 3VLB_A 3VLA_A.
Probab=99.77 E-value=4.1e-19 Score=119.73 Aligned_cols=60 Identities=33% Similarity=0.474 Sum_probs=52.4
Q ss_pred cceEEEcCCCeEEEeCCceEEEEEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeee
Q 045937 3 KVLWSIIGANSIVRVSNDVSCLGFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSN 62 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~~v~CLafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~ 62 (79)
|++|+|+++|||+...++++||||++++....+.+|||++||||++|+||++++||||+|
T Consensus 102 ga~l~l~~~~y~~~~~~~~~Cla~~~~~~~~~~~~viG~~~~~~~~v~fDl~~~~igF~~ 161 (161)
T PF14541_consen 102 GADLTLPPENYFVQVSPGVFCLAFVPSDADDDGVSVIGNFQQQNYHVVFDLENGRIGFAP 161 (161)
T ss_dssp SEEEEE-HHHHEEEECTTEEEESEEEETSTTSSSEEE-HHHCCTEEEEEETTTTEEEEEE
T ss_pred CcceeeeccceeeeccCCCEEEEEEccCCCCCCcEEECHHHhcCcEEEEECCCCEEEEeC
Confidence 789999999999999999999999988322345789999999999999999999999996
No 2
>cd05489 xylanase_inhibitor_I_like TAXI-I inhibits degradation of xylan in the cell wall. Xylanase inhibitor-I (TAXI-I) is a member of potent TAXI-type inhibitors of fungal and bacterial family 11 xylanases. Plants developed a diverse battery of defense mechanisms in response to continual challenges by a broad spectrum of pathogenic microorganisms. Their defense arsenal includes inhibitors of cell wall-degrading enzymes, which hinder a possible invasion and colonization by antagonists. Xylanases of fungal and bacterial pathogens are the key enzymes in the degradation of xylan in the cell wall. Plants secrete proteins that inhibit these degradation glycosidases, including xylanase. Surprisingly, TAXI-I displays structural homology with the pepsin-like family of aspartic proteases but is proteolytically nonfunctional, because one or more residues of the essential catalytic triad are absent. The structure of the TAXI-inhibitor, Aspergillus niger xylanase I complex, illustrates the ability
Probab=99.72 E-value=1.6e-17 Score=125.07 Aligned_cols=62 Identities=58% Similarity=1.086 Sum_probs=55.9
Q ss_pred cceEEEcCCCeEEEeCCceEEEEEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeecc
Q 045937 3 KVLWSIIGANSIVRVSNDVSCLGFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNSL 64 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~~v~CLafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~l 64 (79)
|++|+|+++|||+++.+++.||||++++..+.++.|||++||||++|+||++++||||+++|
T Consensus 301 g~~~~l~~~ny~~~~~~~~~Cl~f~~~~~~~~~~~IlG~~~~~~~~vvyD~~~~riGfa~~~ 362 (362)
T cd05489 301 GVNWTIFGANSMVQVKGGVACLAFVDGGSEPRPAVVIGGHQMEDNLLVFDLEKSRLGFSSSL 362 (362)
T ss_pred CeEEEEcCCceEEEcCCCcEEEEEeeCCCCCCceEEEeeheecceEEEEECCCCEeecccCC
Confidence 69999999999999988899999998754334578999999999999999999999999886
No 3
>PLN03146 aspartyl protease family protein; Provisional
Probab=99.56 E-value=8.3e-15 Score=112.56 Aligned_cols=61 Identities=23% Similarity=0.416 Sum_probs=55.5
Q ss_pred cceEEEcCCCeEEEeCCceEEEEEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeecccccccccCC
Q 045937 3 KVLWSIIGANSIVRVSNDVSCLGFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNSLLLQRTMCSN 73 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~~v~CLafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~l~~~~~~Cs~ 73 (79)
|+++.|+++|||+++++++.||||+++.. ..|||++||||++|+||++++||||+ +++|++
T Consensus 370 Ga~~~l~~~~~~~~~~~~~~Cl~~~~~~~----~~IlG~~~q~~~~vvyDl~~~~igFa------~~~C~~ 430 (431)
T PLN03146 370 GADVKLQPLNTFVKVSEDLVCFAMIPTSS----IAIFGNLAQMNFLVGYDLESKTVSFK------PTDCTK 430 (431)
T ss_pred CCeeecCcceeEEEcCCCcEEEEEecCCC----ceEECeeeEeeEEEEEECCCCEEeee------cCCcCc
Confidence 78999999999999988889999986532 47999999999999999999999999 888976
No 4
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=99.41 E-value=8e-13 Score=94.97 Aligned_cols=58 Identities=21% Similarity=0.398 Sum_probs=50.7
Q ss_pred ceEEEcCCCeEEEeCCceEEEEEEeCCCC-CceeEEEechhcceeEEEEECCCCeEEee
Q 045937 4 VLWSIIGANSIVRVSNDVSCLGFVDGGVT-PKTSIVIGGHQLDNNLVQFDIATSRLGFS 61 (79)
Q Consensus 4 a~~~l~~~N~~v~v~~~v~CLafv~~~~~-~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~ 61 (79)
++|+||++||++...++..|++++.+... ..+..|||+.+||+++++||++++||||+
T Consensus 211 ~~~~l~~~~y~~~~~~~~~Cl~~~~~~~~~~~~~~ilG~~~l~~~~~vfD~~~~riGfa 269 (273)
T cd05475 211 RLLEIPPENYLIISEKGNVCLGILNGSEIGLGNTNIIGDISMQGLMVIYDNEKQQIGWV 269 (273)
T ss_pred eEEEeCCCceEEEcCCCCEEEEEecCCCcCCCceEEECceEEEeeEEEEECcCCEeCcc
Confidence 79999999999988778899999865421 23468999999999999999999999999
No 5
>KOG1339 consensus Aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.34 E-value=3.8e-12 Score=96.32 Aligned_cols=65 Identities=35% Similarity=0.578 Sum_probs=57.0
Q ss_pred cceEEEcCCCeEEEeCCceE-EEEEEeCCCCCceeEEEechhcceeEEEEECC-CCeEEeeecccccccccC
Q 045937 3 KVLWSIIGANSIVRVSNDVS-CLGFVDGGVTPKTSIVIGGHQLDNNLVQFDIA-TSRLGFSNSLLLQRTMCS 72 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~~v~-CLafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~-~~~lgF~~~l~~~~~~Cs 72 (79)
|+.|.++++||++++.++.. |++++.+...+ +..|||++|||+++++||+. ++|+||++.+ +.|+
T Consensus 331 g~~~~l~~~~y~~~~~~~~~~Cl~~~~~~~~~-~~~ilG~~~~~~~~~~~D~~~~~riGfa~~~----~~c~ 397 (398)
T KOG1339|consen 331 GAVFSLPPKNYLVEVSDGGGVCLAFFNGMDSG-PLWILGDVFQQNYLVVFDLGENSRVGFAPAL----TNCS 397 (398)
T ss_pred CcEEEeCccceEEEECCCCCceeeEEecCCCC-ceEEEchHHhCCEEEEEeCCCCCEEEecccc----ccCC
Confidence 79999999999999987655 99999875432 67999999999999999999 9999999877 6675
No 6
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=99.27 E-value=1.8e-11 Score=88.30 Aligned_cols=62 Identities=23% Similarity=0.340 Sum_probs=52.2
Q ss_pred cceEEEcCCCeEEEe-CCceEEEEEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeeccccccccc
Q 045937 3 KVLWSIIGANSIVRV-SNDVSCLGFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNSLLLQRTMC 71 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v-~~~v~CLafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~l~~~~~~C 71 (79)
|++|++++++|++.. ..+..|++|.+.+. ..+..|||+.+|++++++||++++||||+ +.+|
T Consensus 237 g~~~~l~~~~y~~~~~~~~~~C~~~~~~~~-~~~~~ilG~~fl~~~~vvfD~~~~~igfa------~~~C 299 (299)
T cd05472 237 GADVELDASGVLYPVDDSSQVCLAFAGTSD-DGGLSIIGNVQQQTFRVVYDVAGGRIGFA------PGGC 299 (299)
T ss_pred CCEEEeCcccEEEEecCCCCEEEEEeCCCC-CCCCEEEchHHccceEEEEECCCCEEeEe------cCCC
Confidence 789999999999954 45678999987532 23467999999999999999999999999 6666
No 7
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which
Probab=99.12 E-value=3e-10 Score=81.05 Aligned_cols=61 Identities=30% Similarity=0.492 Sum_probs=53.5
Q ss_pred cceEEEcCCCeEEEeCCceEEEEEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeeccccccccc
Q 045937 3 KVLWSIIGANSIVRVSNDVSCLGFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNSLLLQRTMC 71 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~~v~CLafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~l~~~~~~C 71 (79)
|+.|.++++||+++...+..|++++..+. .+..|||...++++.++||++++||||+ ..+|
T Consensus 205 ~~~~~i~~~~y~~~~~~~~~C~~~~~~~~--~~~~ilG~~fl~~~~~vFD~~~~~iGfa------~~~C 265 (265)
T cd05476 205 GADLELPPENYFVDVGEGVVCLAILSSSS--GGVSILGNIQQQNFLVEYDLENSRLGFA------PADC 265 (265)
T ss_pred CCEEEeCcccEEEECCCCCEEEEEecCCC--CCcEEEChhhcccEEEEEECCCCEEeee------cCCC
Confidence 68999999999998777889999997632 3468999999999999999999999999 6666
No 8
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two
Probab=98.91 E-value=1.5e-09 Score=80.76 Aligned_cols=64 Identities=23% Similarity=0.430 Sum_probs=50.1
Q ss_pred ceEEEcCCCeEEEeCC---ceEEEEEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeecccccccccCCcc
Q 045937 4 VLWSIIGANSIVRVSN---DVSCLGFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNSLLLQRTMCSNFN 75 (79)
Q Consensus 4 a~~~l~~~N~~v~v~~---~v~CLafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~l~~~~~~Cs~~~ 75 (79)
++++|++++|+..... +..|+++..... .+..|||+++||++.++||++++||||+ ++.|....
T Consensus 285 ~~l~l~p~~Y~~~~~~~~~~~~C~~~~~~~~--~~~~ILG~~flr~~yvvfD~~~~rIGfa------~~~C~~~~ 351 (364)
T cd05473 285 FRITILPQLYLRPVEDHGTQLDCYKFAISQS--TNGTVIGAVIMEGFYVVFDRANKRVGFA------VSTCAEHD 351 (364)
T ss_pred EEEEECHHHhhhhhccCCCcceeeEEeeecC--CCceEEeeeeEcceEEEEECCCCEEeeE------eccccccc
Confidence 4678999999986532 467987653221 2357999999999999999999999999 77887743
No 9
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=98.86 E-value=4.8e-09 Score=77.05 Aligned_cols=60 Identities=15% Similarity=0.247 Sum_probs=50.4
Q ss_pred cceEEEcCCCeEEEeCCceEEEEEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeecccccccccC
Q 045937 3 KVLWSIIGANSIVRVSNDVSCLGFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNSLLLQRTMCS 72 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~~v~CLafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~l~~~~~~Cs 72 (79)
|+.+++++++|++...+...|+++.... ...|||+..|+++.++||++++||||+ +.+|.
T Consensus 267 g~~~~i~p~~y~~~~~~~~c~~~~~~~~----~~~ILG~~flr~~y~vFD~~~~riGfa------~~~C~ 326 (326)
T cd06096 267 NLKIDWKPSSYLYKKESFWCKGGEKSVS----NKPILGASFFKNKQIIFDLDNNRIGFV------ESNCP 326 (326)
T ss_pred CcEEEECHHHhccccCCceEEEEEecCC----CceEEChHHhcCcEEEEECcCCEEeeE------cCCCC
Confidence 7899999999999876666677765432 257999999999999999999999999 67773
No 10
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=98.73 E-value=2.2e-08 Score=71.50 Aligned_cols=58 Identities=17% Similarity=0.331 Sum_probs=49.1
Q ss_pred cceEEEcCCCeEEEeCC----ceEE-EEEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeec
Q 045937 3 KVLWSIIGANSIVRVSN----DVSC-LGFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNS 63 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~----~v~C-Lafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~ 63 (79)
|+.+.+|+++|+.+... +..| ++|.+.+. +..|||...|+++.++||.+++||||+++
T Consensus 233 g~~~~i~~~~~~~~~~~~~~~~~~C~~~i~~~~~---~~~iLG~~fl~~~y~vfD~~~~~ig~a~a 295 (295)
T cd05474 233 GATISVPLSDLVLPASTDDGGDGACYLGIQPSTS---DYNILGDTFLRSAYVVYDLDNNEISLAQA 295 (295)
T ss_pred CeEEEEEHHHhEeccccCCCCCCCeEEEEEeCCC---CcEEeChHHhhcEEEEEECCCCEEEeecC
Confidence 67899999999998752 5678 68887643 35899999999999999999999999964
No 11
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=98.72 E-value=4e-08 Score=71.68 Aligned_cols=60 Identities=17% Similarity=0.292 Sum_probs=47.6
Q ss_pred cceEEEcCCCeEEEe--CCceEEE-EEEeCCC--CCceeEEEechhcceeEEEEECCCCeEEeee
Q 045937 3 KVLWSIIGANSIVRV--SNDVSCL-GFVDGGV--TPKTSIVIGGHQLDNNLVQFDIATSRLGFSN 62 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v--~~~v~CL-afv~~~~--~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~ 62 (79)
|+.++|++++|++.. +.+..|+ +|..... ...+..|||+..|+++.++||++++||||+|
T Consensus 252 g~~~~l~~~~y~~~~~~~~~~~C~~~~~~~~~~~~~~~~~ILGd~flr~~y~vfD~~~~~IGfA~ 316 (316)
T cd05486 252 GIPYSLSPQAYTLEDQSDGGGYCSSGFQGLDIPPPAGPLWILGDVFIRQYYSVFDRGNNRVGFAP 316 (316)
T ss_pred CEEEEeCHHHeEEecccCCCCEEeeEEEECCCCCCCCCeEEEchHHhcceEEEEeCCCCEeeccC
Confidence 789999999999975 3346797 4654321 1234679999999999999999999999985
No 12
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which
Probab=98.65 E-value=4.9e-08 Score=71.25 Aligned_cols=56 Identities=20% Similarity=0.341 Sum_probs=46.7
Q ss_pred cceEEEcCCCeEEEeCCceEEEE-EEeCCCCCceeEEEechhcceeEEEEECCCCeEEeee
Q 045937 3 KVLWSIIGANSIVRVSNDVSCLG-FVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSN 62 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~~v~CLa-fv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~ 62 (79)
|+.+.+|+++|+... +..|++ |.+.+. ....|||+.+|+++.++||++++||||++
T Consensus 261 g~~~~i~~~~y~~~~--~~~C~~~~~~~~~--~~~~IlG~~fl~~~y~vfD~~~~~iG~A~ 317 (317)
T cd05478 261 GVQYPLPPSAYILQD--QGSCTSGFQSMGL--GELWILGDVFIRQYYSVFDRANNKVGLAP 317 (317)
T ss_pred CEEEEECHHHheecC--CCEEeEEEEeCCC--CCeEEechHHhcceEEEEeCCCCEEeecC
Confidence 789999999999865 457986 555432 24579999999999999999999999985
No 13
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank
Probab=98.60 E-value=1.2e-07 Score=69.36 Aligned_cols=60 Identities=22% Similarity=0.472 Sum_probs=47.7
Q ss_pred cceEEEcCCCeEEEeCC--ceEEEE-EEeCCC--CCceeEEEechhcceeEEEEECCCCeEEeee
Q 045937 3 KVLWSIIGANSIVRVSN--DVSCLG-FVDGGV--TPKTSIVIGGHQLDNNLVQFDIATSRLGFSN 62 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~--~v~CLa-fv~~~~--~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~ 62 (79)
|+.++|++++|++.... ...|++ |...+. ...+..|||+..|+++.++||++++||||++
T Consensus 261 g~~~~l~~~~y~~~~~~~~~~~C~~~~~~~~~~~~~~~~~ilGd~flr~~y~vfD~~~~~IGfA~ 325 (325)
T cd05490 261 GKVYPLTGEDYILKVSQRGTTICLSGFMGLDIPPPAGPLWILGDVFIGRYYTVFDRDNDRVGFAK 325 (325)
T ss_pred CEEEEEChHHeEEeccCCCCCEEeeEEEECCCCCCCCceEEEChHhheeeEEEEEcCCcEeeccC
Confidence 67899999999998643 367985 654321 1234679999999999999999999999984
No 14
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=98.58 E-value=1.4e-07 Score=68.68 Aligned_cols=58 Identities=16% Similarity=0.292 Sum_probs=46.8
Q ss_pred cceEEEcCCCeEEEeCCceEEE-EEEeCC---CCCceeEEEechhcceeEEEEECCCCeEEeee
Q 045937 3 KVLWSIIGANSIVRVSNDVSCL-GFVDGG---VTPKTSIVIGGHQLDNNLVQFDIATSRLGFSN 62 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~~v~CL-afv~~~---~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~ 62 (79)
|+++.+|+++|+... +..|+ +|.+.. ..+.+..|||..+|++..++||++++||||++
T Consensus 256 g~~~~v~~~~y~~~~--~~~C~~~i~~~~~~~~~~~~~~ilG~~fl~~~y~vfD~~~~~ig~a~ 317 (318)
T cd05477 256 GVSFPLPPSAYILQN--NGYCTVGIEPTYLPSQNGQPLWILGDVFLRQYYSVYDLGNNQVGFAT 317 (318)
T ss_pred CEEEEECHHHeEecC--CCeEEEEEEecccCCCCCCceEEEcHHHhhheEEEEeCCCCEEeeee
Confidence 688999999999875 34695 786531 11224689999999999999999999999996
No 15
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=98.54 E-value=2.3e-07 Score=68.35 Aligned_cols=59 Identities=20% Similarity=0.511 Sum_probs=47.5
Q ss_pred cceEEEcCCCeEEEeCC--ceEEEE-EEeCCC--CCceeEEEechhcceeEEEEECCCCeEEee
Q 045937 3 KVLWSIIGANSIVRVSN--DVSCLG-FVDGGV--TPKTSIVIGGHQLDNNLVQFDIATSRLGFS 61 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~--~v~CLa-fv~~~~--~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~ 61 (79)
|+.+.+++++|++++.+ ...|++ |..... ...+..|||...|++..++||++++||||+
T Consensus 265 g~~~~i~~~~yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~IlG~~fl~~~y~vFD~~~~~ig~a 328 (329)
T cd05485 265 GKSFSLTGKDYVLKVTQMGQTICLSGFMGIDIPPPAGPLWILGDVFIGKYYTEFDLGNNRVGFA 328 (329)
T ss_pred CEEeEEChHHeEEEecCCCCCEEeeeEEECcCCCCCCCeEEEchHHhccceEEEeCCCCEEeec
Confidence 67899999999998753 467985 664321 123457999999999999999999999997
No 16
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate r
Probab=98.54 E-value=2.5e-07 Score=67.89 Aligned_cols=61 Identities=18% Similarity=0.357 Sum_probs=48.9
Q ss_pred cceEEEcCCCeEEEeCC--ceEEE-EEEeCCC--CCceeEEEechhcceeEEEEECCCCeEEeeec
Q 045937 3 KVLWSIIGANSIVRVSN--DVSCL-GFVDGGV--TPKTSIVIGGHQLDNNLVQFDIATSRLGFSNS 63 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~--~v~CL-afv~~~~--~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~ 63 (79)
|+.++|++++|++...+ +..|+ +|...+. ...+..|||...|+++.++||++++||||+++
T Consensus 261 g~~~~v~~~~yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~ilG~~flr~~y~vfD~~~~~IGfA~a 326 (326)
T cd05487 261 GKEYTLSSSDYVLQDSDFSDKLCTVAFHAMDIPPPTGPLWVLGATFIRKFYTEFDRQNNRIGFALA 326 (326)
T ss_pred CEEEEeCHHHhEEeccCCCCCEEEEEEEeCCCCCCCCCeEEEehHHhhccEEEEeCCCCEEeeeeC
Confidence 67899999999998754 56796 6765321 12246899999999999999999999999963
No 17
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme. Proteinase A preferentially hydro
Probab=98.54 E-value=1.5e-07 Score=68.83 Aligned_cols=58 Identities=16% Similarity=0.293 Sum_probs=45.7
Q ss_pred cceEEEcCCCeEEEeCCceEEEEEEeC-CC--CCceeEEEechhcceeEEEEECCCCeEEeee
Q 045937 3 KVLWSIIGANSIVRVSNDVSCLGFVDG-GV--TPKTSIVIGGHQLDNNLVQFDIATSRLGFSN 62 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~~v~CLafv~~-~~--~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~ 62 (79)
|+++.||++||+++.. ..|++.+.. .. ...+..|||+.+|++..++||++++||||++
T Consensus 260 g~~~~i~~~~y~~~~~--g~C~~~~~~~~~~~~~~~~~ilG~~fl~~~y~vfD~~~~~iG~a~ 320 (320)
T cd05488 260 GYNFTLGPFDYTLEVS--GSCISAFTGMDFPEPVGPLAIVGDAFLRKYYSVYDLGNNAVGLAK 320 (320)
T ss_pred CEEEEECHHHheecCC--CeEEEEEEECcCCCCCCCeEEEchHHhhheEEEEeCCCCEEeecC
Confidence 6899999999998653 369975532 11 1123579999999999999999999999984
No 18
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=98.44 E-value=5.4e-07 Score=66.01 Aligned_cols=60 Identities=17% Similarity=0.333 Sum_probs=47.5
Q ss_pred cceEEEcCCCeEEEeCC--ceEEEE-EEeCC--CCCceeEEEechhcceeEEEEECCCCeEEeee
Q 045937 3 KVLWSIIGANSIVRVSN--DVSCLG-FVDGG--VTPKTSIVIGGHQLDNNLVQFDIATSRLGFSN 62 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~--~v~CLa-fv~~~--~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~ 62 (79)
|+.++|++++|+++..+ ...|++ |...+ ....+..|||...|+++.++||++++||||++
T Consensus 253 g~~~~l~~~~yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~IlGd~Flr~~y~VfD~~~~~iGfA~ 317 (317)
T cd06098 253 GKTFELTPEQYILKVGEGAAAQCISGFTALDVPPPRGPLWILGDVFMGAYHTVFDYGNLRVGFAE 317 (317)
T ss_pred CEEEEEChHHeEEeecCCCCCEEeceEEECCCCCCCCCeEEechHHhcccEEEEeCCCCEEeecC
Confidence 68999999999998754 357975 54322 11234679999999999999999999999984
No 19
>PTZ00147 plasmepsin-1; Provisional
Probab=98.33 E-value=1.2e-06 Score=68.54 Aligned_cols=61 Identities=11% Similarity=0.340 Sum_probs=48.8
Q ss_pred cceEEEcCCCeEEEeCC--ceEEE-EEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeecc
Q 045937 3 KVLWSIIGANSIVRVSN--DVSCL-GFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNSL 64 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~--~v~CL-afv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~l 64 (79)
|+.++|++++|+.+..+ ...|+ +|++.+. ..+..|||+..|+++.++||++++||||+++-
T Consensus 387 g~~~~L~p~~yi~~~~~~~~~~C~~~i~~~~~-~~~~~ILGd~FLr~~YtVFD~~n~rIGfA~a~ 450 (453)
T PTZ00147 387 NKVYTLEPEYYLQPIEDIGSALCMLNIIPIDL-EKNTFILGDPFMRKYFTVFDYDNHTVGFALAK 450 (453)
T ss_pred CEEEEECHHHheeccccCCCcEEEEEEEECCC-CCCCEEECHHHhccEEEEEECCCCEEEEEEec
Confidence 67899999999986533 35797 5776542 22357999999999999999999999999753
No 20
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=98.32 E-value=1.1e-06 Score=68.72 Aligned_cols=60 Identities=12% Similarity=0.321 Sum_probs=47.8
Q ss_pred cceEEEcCCCeEEEeC--CceEEE-EEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeec
Q 045937 3 KVLWSIIGANSIVRVS--NDVSCL-GFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNS 63 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~--~~v~CL-afv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~ 63 (79)
|+.++|++++|+.+.. ++..|+ ++.+.+. ..+..|||...|+++.++||++++||||++.
T Consensus 386 g~~~~L~p~~Yi~~~~~~~~~~C~~~i~~~~~-~~~~~ILGd~FLr~~Y~VFD~~n~rIGfA~a 448 (450)
T PTZ00013 386 NNTYTLEPEYYMNPLLDVDDTLCMITMLPVDI-DDNTFILGDPFMRKYFTVFDYDKESVGFAIA 448 (450)
T ss_pred CEEEEECHHHheehhccCCCCeeEEEEEECCC-CCCCEEECHHHhccEEEEEECCCCEEEEEEe
Confidence 6789999999998642 346796 6665432 1245799999999999999999999999975
No 21
>PTZ00165 aspartyl protease; Provisional
Probab=98.28 E-value=2.6e-06 Score=67.13 Aligned_cols=61 Identities=13% Similarity=0.293 Sum_probs=48.6
Q ss_pred ceEEEcCCCeEEEe----CCceEEE-EEEeCCC--CCceeEEEechhcceeEEEEECCCCeEEeeecc
Q 045937 4 VLWSIIGANSIVRV----SNDVSCL-GFVDGGV--TPKTSIVIGGHQLDNNLVQFDIATSRLGFSNSL 64 (79)
Q Consensus 4 a~~~l~~~N~~v~v----~~~v~CL-afv~~~~--~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~l 64 (79)
+.+.+++++|++++ ..+..|. +|.+.+. ..++..|||+..++++.++||+++.||||++..
T Consensus 380 v~~~l~p~dYi~~~~~~~~~~~~C~~g~~~~d~~~~~g~~~ILGd~Flr~yy~VFD~~n~rIGfA~a~ 447 (482)
T PTZ00165 380 IKFDMDPEDYVIEEGDSEEQEHQCVIGIIPMDVPAPRGPLFVLGNNFIRKYYSIFDRDHMMVGLVPAK 447 (482)
T ss_pred EEEEEchHHeeeecccCCCCCCeEEEEEEECCCCCCCCceEEEchhhheeEEEEEeCCCCEEEEEeec
Confidence 48999999999985 2346894 7876432 124578999999999999999999999999653
No 22
>PF00026 Asp: Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.; InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) . More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=97.51 E-value=0.00019 Score=51.11 Aligned_cols=60 Identities=22% Similarity=0.337 Sum_probs=47.4
Q ss_pred cceEEEcCCCeEEEeCCc--eEEEEE-EeCC-CCCceeEEEechhcceeEEEEECCCCeEEeee
Q 045937 3 KVLWSIIGANSIVRVSND--VSCLGF-VDGG-VTPKTSIVIGGHQLDNNLVQFDIATSRLGFSN 62 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~~--v~CLaf-v~~~-~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~ 62 (79)
|..+++|+++|+...... ..|... .+.+ .......|+|..=++++.++||.+++|+||++
T Consensus 253 ~~~~~i~~~~~~~~~~~~~~~~C~~~i~~~~~~~~~~~~iLG~~fl~~~y~vfD~~~~~ig~A~ 316 (317)
T PF00026_consen 253 GVTFTIPPSDYIFKIEDGNGGYCYLGIQPMDSSDDSDDWILGSPFLRNYYVVFDYENNRIGFAQ 316 (317)
T ss_dssp TEEEEEEHHHHEEEESSTTSSEEEESEEEESSTTSSSEEEEEHHHHTTEEEEEETTTTEEEEEE
T ss_pred CEEEEecchHhcccccccccceeEeeeecccccccCCceEecHHHhhceEEEEeCCCCEEEEec
Confidence 678999999999988654 378654 4411 12235689999999999999999999999996
No 23
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=97.14 E-value=0.0004 Score=48.47 Aligned_cols=26 Identities=31% Similarity=0.612 Sum_probs=24.9
Q ss_pred EEEechhcceeEEEEECCCCeEEeee
Q 045937 37 IVIGGHQLDNNLVQFDIATSRLGFSN 62 (79)
Q Consensus 37 ~ViGn~Qqqn~~v~fDl~~~~lgF~~ 62 (79)
.|||..+|+++.++||+++++|||++
T Consensus 258 ~ilG~~fl~~~y~vfD~~~~~igfa~ 283 (283)
T cd05471 258 WILGDVFLRNYYTVFDLDNNRIGFAP 283 (283)
T ss_pred EEccHhhhhheEEEEeCCCCEEeecC
Confidence 79999999999999999999999984
No 24
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=96.68 E-value=0.0013 Score=47.03 Aligned_cols=26 Identities=31% Similarity=0.617 Sum_probs=24.7
Q ss_pred EEEechhcceeEEEEECCCCeEEeee
Q 045937 37 IVIGGHQLDNNLVQFDIATSRLGFSN 62 (79)
Q Consensus 37 ~ViGn~Qqqn~~v~fDl~~~~lgF~~ 62 (79)
.|||...++++.++||++++||||++
T Consensus 253 ~ilGd~fl~~~y~vfD~~~~~ig~A~ 278 (278)
T cd06097 253 SILGDVFLKAQYVVFDVGGPKLGFAP 278 (278)
T ss_pred EEEcchhhCceeEEEcCCCceeeecC
Confidence 58999999999999999999999985
No 25
>PRK12442 translation initiation factor IF-1; Reviewed
Probab=54.41 E-value=25 Score=22.42 Aligned_cols=55 Identities=16% Similarity=0.279 Sum_probs=36.0
Q ss_pred EEcCCCeEEEeCCceEEEEEEeCCCCCcee-EEEechhcceeEEEEECCCCeEEeee
Q 045937 7 SIIGANSIVRVSNDVSCLGFVDGGVTPKTS-IVIGGHQLDNNLVQFDIATSRLGFSN 62 (79)
Q Consensus 7 ~l~~~N~~v~v~~~v~CLafv~~~~~~~~~-~ViGn~Qqqn~~v~fDl~~~~lgF~~ 62 (79)
.||...+.|..+++...+|-+++.--..-. ++.|..-.-.+. -||+.++||=|.-
T Consensus 16 ~Lp~~~frV~LenG~~vla~isGKmR~~rIrIl~GD~V~VE~s-pYDltkGRIiyR~ 71 (87)
T PRK12442 16 VLPDSRFRVTLENGVEVGAYASGRMRKHRIRILAGDRVTLELS-PYDLTKGRINFRH 71 (87)
T ss_pred ECCCCEEEEEeCCCCEEEEEeccceeeeeEEecCCCEEEEEEC-cccCCceeEEEEe
Confidence 467777788888888888988764211011 334444444443 6999999999874
No 26
>TIGR00008 infA translation initiation factor IF-1. This family consists of translation initiation factor IF-1 as found in bacteria and chloroplasts. This protein, about 70 residues in length, consists largely of an S1 RNA binding domain (pfam00575).
Probab=33.12 E-value=1.1e+02 Score=18.43 Aligned_cols=53 Identities=21% Similarity=0.312 Sum_probs=33.2
Q ss_pred EEcCCCeEEEeCCceEEEEEEeCCCCC-ceeEEEechhcceeEEEEECCCCeEEe
Q 045937 7 SIIGANSIVRVSNDVSCLGFVDGGVTP-KTSIVIGGHQLDNNLVQFDIATSRLGF 60 (79)
Q Consensus 7 ~l~~~N~~v~v~~~v~CLafv~~~~~~-~~~~ViGn~Qqqn~~v~fDl~~~~lgF 60 (79)
.||...+.|...+|..-+|.+++.-.. .--+..|..-.-. +-.||+.++++=|
T Consensus 14 ~L~~~~f~V~l~ng~~vla~i~GKmr~~rI~I~~GD~V~Ve-~spyd~tkgrIi~ 67 (68)
T TIGR00008 14 SLPNAMFRVELENGHEVLAHISGKIRMHYIRILPGDKVKVE-LSPYDLTRGRITY 67 (68)
T ss_pred ECCCCEEEEEECCCCEEEEEecCcchhccEEECCCCEEEEE-ECcccCCcEeEEe
Confidence 467778888888888888888764210 1113334443333 3358999988765
No 27
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=31.85 E-value=62 Score=22.15 Aligned_cols=26 Identities=23% Similarity=0.461 Sum_probs=17.2
Q ss_pred EeCCceEEEEEEeCCCCCceeEEEec
Q 045937 16 RVSNDVSCLGFVDGGVTPKTSIVIGG 41 (79)
Q Consensus 16 ~v~~~v~CLafv~~~~~~~~~~ViGn 41 (79)
++.+|+-|+.+..-+..+.|.+|+|+
T Consensus 90 e~~DGvn~i~~g~~~~~~~~l~ivGG 115 (136)
T PF14781_consen 90 EVPDGVNAIVIGKLGDIPSPLVIVGG 115 (136)
T ss_pred hCccceeEEEEEecCCCCCcEEEECc
Confidence 55677778777655544566777765
No 28
>COG5343 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.90 E-value=56 Score=24.39 Aligned_cols=28 Identities=29% Similarity=0.280 Sum_probs=23.4
Q ss_pred eEEEechhcceeEEEEECCCCeEEeeec
Q 045937 36 SIVIGGHQLDNNLVQFDIATSRLGFSNS 63 (79)
Q Consensus 36 ~~ViGn~Qqqn~~v~fDl~~~~lgF~~~ 63 (79)
+++.-+=-|.+.+++||-+++|++-++.
T Consensus 134 a~L~~~~s~v~~l~~~D~~tg~~sl~~l 161 (240)
T COG5343 134 ASLAANGSQVNLLVVYDSETGRLSLSPL 161 (240)
T ss_pred EEeecCCCcceEEEEEecCCCcEEEEec
Confidence 3555666788999999999999999875
No 29
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=28.95 E-value=1.9e+02 Score=21.63 Aligned_cols=48 Identities=25% Similarity=0.442 Sum_probs=34.8
Q ss_pred cceEEEcCCCeEEEeCCceEEEEEEeCCCCCceeEEEechhcce----eEEEEECCCCeE
Q 045937 3 KVLWSIIGANSIVRVSNDVSCLGFVDGGVTPKTSIVIGGHQLDN----NLVQFDIATSRL 58 (79)
Q Consensus 3 Ga~~~l~~~N~~v~v~~~v~CLafv~~~~~~~~~~ViGn~Qqqn----~~v~fDl~~~~l 58 (79)
+..|.=|+.+ +...|.-+.+++.. ...|-|++...+ .+..||.+++..
T Consensus 25 ~~qW~~~g~~----i~G~V~~l~~~~~~----~Llv~G~ft~~~~~~~~la~yd~~~~~w 76 (281)
T PF12768_consen 25 NSQWSSPGNG----ISGTVTDLQWASNN----QLLVGGNFTLNGTNSSNLATYDFKNQTW 76 (281)
T ss_pred CCEeecCCCC----ceEEEEEEEEecCC----EEEEEEeeEECCCCceeEEEEecCCCee
Confidence 5678888887 44457788888543 356777776555 899999999976
No 30
>PTZ00371 aspartyl aminopeptidase; Provisional
Probab=28.58 E-value=1.2e+02 Score=24.33 Aligned_cols=40 Identities=15% Similarity=0.297 Sum_probs=23.1
Q ss_pred cceEEEc-CCCeEEEeCCceEEEEEE-eCCC--CCceeEEEechh
Q 045937 3 KVLWSII-GANSIVRVSNDVSCLGFV-DGGV--TPKTSIVIGGHQ 43 (79)
Q Consensus 3 Ga~~~l~-~~N~~v~v~~~v~CLafv-~~~~--~~~~~~ViGn~Q 43 (79)
...|+|. +.-|++.-++..+ .||. +... ...++.|+|.|-
T Consensus 44 ~~~w~l~~g~kyyv~r~~ssl-~Af~vg~~~~~~~~g~~ivgaHt 87 (465)
T PTZ00371 44 GENWKLEKGGKYYLTRNNSTI-VAFTVGKKFDAPNGGFKIVGAHT 87 (465)
T ss_pred cccCccCCCCEEEEEcCCcEE-EEEEeCCCCccCCCCeEEEEEec
Confidence 4569974 5678886655443 4554 3321 223577888874
No 31
>PF02186 TFIIE_beta: TFIIE beta subunit core domain; InterPro: IPR003166 Initiation of eukaryotic mRNA transcription requires melting of promoter DNA with the help of the general transcription factors TFIIE and TFIIH. In higher eukaryotes, the general transcription factor TFIIE consists of two subunits: the large alpha subunit (IPR002853 from INTERPRO) and the small beta (IPR003166 from INTERPRO). TFIIE beta has been found to bind to the region where the promoter starts to open to be single-stranded upon transcription initiation by RNA polymerase II. The approximately 120-residue central core domain of TFIIE beta plays a role in double-stranded DNA binding of TFIIE []. The TFIIE beta central core DNA-binding domain consists of three helices with a beta hairpin at the C terminus, resembling the winged helix proteins. It shows a novel double-stranded DNA-binding activity where the DNA-binding surface locates on the opposite side to the previously reported winged helix motif by forming a positively charged furrow []. This entry represents the beta subunit of the transcription factor TFIIE.; GO: 0006367 transcription initiation from RNA polymerase II promoter, 0005673 transcription factor TFIIE complex; PDB: 1D8K_A 1D8J_A.
Probab=28.16 E-value=51 Score=19.49 Aligned_cols=19 Identities=16% Similarity=0.405 Sum_probs=12.4
Q ss_pred ceeEEEEECCCCeEEeeec
Q 045937 45 DNNLVQFDIATSRLGFSNS 63 (79)
Q Consensus 45 qn~~v~fDl~~~~lgF~~~ 63 (79)
.|=.|+||-+++++.|.|.
T Consensus 44 ~npKI~~d~~~~~f~fkp~ 62 (65)
T PF02186_consen 44 NNPKIEYDPDGNTFSFKPK 62 (65)
T ss_dssp H-TTEEEE-TT-CEEE--T
T ss_pred cCCCEEEecCCCEEEeccC
Confidence 6778999999999999874
No 32
>PF13418 Kelch_4: Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=22.06 E-value=1.3e+02 Score=15.46 Aligned_cols=23 Identities=35% Similarity=0.544 Sum_probs=14.2
Q ss_pred eeEEEechhcce----eEEEEECCCCe
Q 045937 35 TSIVIGGHQLDN----NLVQFDIATSR 57 (79)
Q Consensus 35 ~~~ViGn~Qqqn----~~v~fDl~~~~ 57 (79)
...|+|+....+ -...||++.++
T Consensus 14 ~i~v~GG~~~~~~~~~d~~~~d~~~~~ 40 (49)
T PF13418_consen 14 SIYVFGGRDSSGSPLNDLWIFDIETNT 40 (49)
T ss_dssp EEEEE--EEE-TEE---EEEEETTTTE
T ss_pred eEEEECCCCCCCcccCCEEEEECCCCE
Confidence 367888877763 56789999886
No 33
>COG5428 Uncharacterized conserved small protein [Function unknown]
Probab=20.40 E-value=1.2e+02 Score=18.54 Aligned_cols=18 Identities=17% Similarity=0.298 Sum_probs=16.0
Q ss_pred cceeEEEEECCCCeEEee
Q 045937 44 LDNNLVQFDIATSRLGFS 61 (79)
Q Consensus 44 qqn~~v~fDl~~~~lgF~ 61 (79)
-+|.+|.+|-.++.+||.
T Consensus 29 ~edi~Idide~GkV~GiE 46 (69)
T COG5428 29 GEDILIDIDENGKVIGIE 46 (69)
T ss_pred CCcEEEEecCCCcEEEEE
Confidence 368899999999999996
No 34
>PF03930 Flp_N: Recombinase Flp protein N-terminus; InterPro: IPR022647 This entry describes the N-terminal domain of Recombinase Flp protein and is found in fungi. FLP proteins catalyse recombination between large inverted repetitions of the plasmid.; PDB: 1P4E_C 1FLO_C 1M6X_A.
Probab=20.12 E-value=1.2e+02 Score=19.14 Aligned_cols=17 Identities=29% Similarity=0.423 Sum_probs=13.0
Q ss_pred EEEEECCCCeEEeeecc
Q 045937 48 LVQFDIATSRLGFSNSL 64 (79)
Q Consensus 48 ~v~fDl~~~~lgF~~~l 64 (79)
-|.||...+.|.|.-.|
T Consensus 24 sl~fD~~~k~v~F~Yhl 40 (82)
T PF03930_consen 24 SLQFDSDDKIVSFEYHL 40 (82)
T ss_dssp H-EEETTTTEEEEEES-
T ss_pred ceEEecCCcEEEEEEec
Confidence 37899999999998544
No 35
>PF08446 PAS_2: PAS fold; InterPro: IPR013654 The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs []. The PAS fold appears in archaea, eubacteria and eukarya. ; GO: 0008020 G-protein coupled photoreceptor activity, 0006355 regulation of transcription, DNA-dependent, 0009584 detection of visible light, 0018298 protein-chromophore linkage; PDB: 3S7O_A 2O9B_A 3S7P_A 1ZTU_A 3S7N_A 3S7Q_A 2O9C_A 2OOL_A 3C2W_G 3NHQ_B ....
Probab=20.11 E-value=89 Score=19.65 Aligned_cols=29 Identities=17% Similarity=0.168 Sum_probs=22.3
Q ss_pred ceeEEEechhcceeEEEEECCCCeEEeee
Q 045937 34 KTSIVIGGHQLDNNLVQFDIATSRLGFSN 62 (79)
Q Consensus 34 ~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~ 62 (79)
+|.-+.|.+|--..++.+|.+..+|-..+
T Consensus 4 EPIH~pG~IQphG~LLa~d~~~~~I~~~S 32 (110)
T PF08446_consen 4 EPIHIPGSIQPHGALLALDPDDLRIVQAS 32 (110)
T ss_dssp S-TTC-SEE-TTSEEEEEETTTTBEEEEE
T ss_pred ccccCCCccCCCEEEEEEECCCCEEEEEc
Confidence 35667899999999999999988887764
Done!