Query         045937
Match_columns 79
No_of_seqs    114 out of 302
Neff          5.4 
Searched_HMMs 46136
Date          Fri Mar 29 07:03:55 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/045937.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/045937hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF14541 TAXi_C:  Xylanase inhi  99.8 4.1E-19 8.9E-24  119.7   5.2   60    3-62    102-161 (161)
  2 cd05489 xylanase_inhibitor_I_l  99.7 1.6E-17 3.4E-22  125.1   8.0   62    3-64    301-362 (362)
  3 PLN03146 aspartyl protease fam  99.6 8.3E-15 1.8E-19  112.6   7.5   61    3-73    370-430 (431)
  4 cd05475 nucellin_like Nucellin  99.4   8E-13 1.7E-17   95.0   7.3   58    4-61    211-269 (273)
  5 KOG1339 Aspartyl protease [Pos  99.3 3.8E-12 8.3E-17   96.3   7.7   65    3-72    331-397 (398)
  6 cd05472 cnd41_like Chloroplast  99.3 1.8E-11   4E-16   88.3   7.8   62    3-71    237-299 (299)
  7 cd05476 pepsin_A_like_plant Ch  99.1   3E-10 6.5E-15   81.1   7.7   61    3-71    205-265 (265)
  8 cd05473 beta_secretase_like Be  98.9 1.5E-09 3.3E-14   80.8   4.5   64    4-75    285-351 (364)
  9 cd06096 Plasmepsin_5 Plasmepsi  98.9 4.8E-09   1E-13   77.1   5.7   60    3-72    267-326 (326)
 10 cd05474 SAP_like SAPs, pepsin-  98.7 2.2E-08 4.7E-13   71.5   5.5   58    3-63    233-295 (295)
 11 cd05486 Cathespin_E Cathepsin   98.7   4E-08 8.7E-13   71.7   6.6   60    3-62    252-316 (316)
 12 cd05478 pepsin_A Pepsin A, asp  98.6 4.9E-08 1.1E-12   71.3   5.3   56    3-62    261-317 (317)
 13 cd05490 Cathepsin_D2 Cathepsin  98.6 1.2E-07 2.5E-12   69.4   6.2   60    3-62    261-325 (325)
 14 cd05477 gastricsin Gastricsins  98.6 1.4E-07 3.1E-12   68.7   6.1   58    3-62    256-317 (318)
 15 cd05485 Cathepsin_D_like Cathe  98.5 2.3E-07   5E-12   68.3   6.5   59    3-61    265-328 (329)
 16 cd05487 renin_like Renin stimu  98.5 2.5E-07 5.4E-12   67.9   6.5   61    3-63    261-326 (326)
 17 cd05488 Proteinase_A_fungi Fun  98.5 1.5E-07 3.3E-12   68.8   5.4   58    3-62    260-320 (320)
 18 cd06098 phytepsin Phytepsin, a  98.4 5.4E-07 1.2E-11   66.0   6.0   60    3-62    253-317 (317)
 19 PTZ00147 plasmepsin-1; Provisi  98.3 1.2E-06 2.6E-11   68.5   6.1   61    3-64    387-450 (453)
 20 PTZ00013 plasmepsin 4 (PM4); P  98.3 1.1E-06 2.5E-11   68.7   5.8   60    3-63    386-448 (450)
 21 PTZ00165 aspartyl protease; Pr  98.3 2.6E-06 5.7E-11   67.1   7.0   61    4-64    380-447 (482)
 22 PF00026 Asp:  Eukaryotic aspar  97.5 0.00019 4.1E-09   51.1   4.9   60    3-62    253-316 (317)
 23 cd05471 pepsin_like Pepsin-lik  97.1  0.0004 8.7E-09   48.5   3.0   26   37-62    258-283 (283)
 24 cd06097 Aspergillopepsin_like   96.7  0.0013 2.9E-08   47.0   2.6   26   37-62    253-278 (278)
 25 PRK12442 translation initiatio  54.4      25 0.00054   22.4   3.6   55    7-62     16-71  (87)
 26 TIGR00008 infA translation ini  33.1 1.1E+02  0.0024   18.4   3.9   53    7-60     14-67  (68)
 27 PF14781 BBS2_N:  Ciliary BBSom  31.9      62  0.0013   22.1   2.9   26   16-41     90-115 (136)
 28 COG5343 Uncharacterized protei  29.9      56  0.0012   24.4   2.6   28   36-63    134-161 (240)
 29 PF12768 Rax2:  Cortical protei  29.0 1.9E+02  0.0041   21.6   5.4   48    3-58     25-76  (281)
 30 PTZ00371 aspartyl aminopeptida  28.6 1.2E+02  0.0025   24.3   4.4   40    3-43     44-87  (465)
 31 PF02186 TFIIE_beta:  TFIIE bet  28.2      51  0.0011   19.5   1.8   19   45-63     44-62  (65)
 32 PF13418 Kelch_4:  Galactose ox  22.1 1.3E+02  0.0029   15.5   2.8   23   35-57     14-40  (49)
 33 COG5428 Uncharacterized conser  20.4 1.2E+02  0.0026   18.5   2.4   18   44-61     29-46  (69)
 34 PF03930 Flp_N:  Recombinase Fl  20.1 1.2E+02  0.0026   19.1   2.5   17   48-64     24-40  (82)
 35 PF08446 PAS_2:  PAS fold;  Int  20.1      89  0.0019   19.6   1.9   29   34-62      4-32  (110)

No 1  
>PF14541 TAXi_C:  Xylanase inhibitor C-terminal; PDB: 3AUP_D 3HD8_A 1T6G_A 1T6E_X 2B42_A 3VLB_A 3VLA_A.
Probab=99.77  E-value=4.1e-19  Score=119.73  Aligned_cols=60  Identities=33%  Similarity=0.474  Sum_probs=52.4

Q ss_pred             cceEEEcCCCeEEEeCCceEEEEEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeee
Q 045937            3 KVLWSIIGANSIVRVSNDVSCLGFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSN   62 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~~v~CLafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~   62 (79)
                      |++|+|+++|||+...++++||||++++....+.+|||++||||++|+||++++||||+|
T Consensus       102 ga~l~l~~~~y~~~~~~~~~Cla~~~~~~~~~~~~viG~~~~~~~~v~fDl~~~~igF~~  161 (161)
T PF14541_consen  102 GADLTLPPENYFVQVSPGVFCLAFVPSDADDDGVSVIGNFQQQNYHVVFDLENGRIGFAP  161 (161)
T ss_dssp             SEEEEE-HHHHEEEECTTEEEESEEEETSTTSSSEEE-HHHCCTEEEEEETTTTEEEEEE
T ss_pred             CcceeeeccceeeeccCCCEEEEEEccCCCCCCcEEECHHHhcCcEEEEECCCCEEEEeC
Confidence            789999999999999999999999988322345789999999999999999999999996


No 2  
>cd05489 xylanase_inhibitor_I_like TAXI-I inhibits degradation of xylan in the cell wall. Xylanase inhibitor-I (TAXI-I) is a member of potent TAXI-type inhibitors of fungal and bacterial family 11 xylanases. Plants developed a diverse battery of defense mechanisms in response to continual challenges by a broad spectrum of pathogenic microorganisms. Their defense arsenal includes inhibitors of cell wall-degrading enzymes, which hinder a possible invasion and colonization by antagonists. Xylanases of fungal and bacterial pathogens are the key enzymes in the degradation of xylan in the cell wall. Plants secrete proteins that inhibit these degradation glycosidases, including xylanase. Surprisingly, TAXI-I displays structural homology with the pepsin-like family of aspartic proteases but is proteolytically nonfunctional, because one or more residues of the essential catalytic triad are absent. The structure of the TAXI-inhibitor, Aspergillus niger xylanase I complex, illustrates the ability 
Probab=99.72  E-value=1.6e-17  Score=125.07  Aligned_cols=62  Identities=58%  Similarity=1.086  Sum_probs=55.9

Q ss_pred             cceEEEcCCCeEEEeCCceEEEEEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeecc
Q 045937            3 KVLWSIIGANSIVRVSNDVSCLGFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNSL   64 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~~v~CLafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~l   64 (79)
                      |++|+|+++|||+++.+++.||||++++..+.++.|||++||||++|+||++++||||+++|
T Consensus       301 g~~~~l~~~ny~~~~~~~~~Cl~f~~~~~~~~~~~IlG~~~~~~~~vvyD~~~~riGfa~~~  362 (362)
T cd05489         301 GVNWTIFGANSMVQVKGGVACLAFVDGGSEPRPAVVIGGHQMEDNLLVFDLEKSRLGFSSSL  362 (362)
T ss_pred             CeEEEEcCCceEEEcCCCcEEEEEeeCCCCCCceEEEeeheecceEEEEECCCCEeecccCC
Confidence            69999999999999988899999998754334578999999999999999999999999886


No 3  
>PLN03146 aspartyl protease family protein; Provisional
Probab=99.56  E-value=8.3e-15  Score=112.56  Aligned_cols=61  Identities=23%  Similarity=0.416  Sum_probs=55.5

Q ss_pred             cceEEEcCCCeEEEeCCceEEEEEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeecccccccccCC
Q 045937            3 KVLWSIIGANSIVRVSNDVSCLGFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNSLLLQRTMCSN   73 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~~v~CLafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~l~~~~~~Cs~   73 (79)
                      |+++.|+++|||+++++++.||||+++..    ..|||++||||++|+||++++||||+      +++|++
T Consensus       370 Ga~~~l~~~~~~~~~~~~~~Cl~~~~~~~----~~IlG~~~q~~~~vvyDl~~~~igFa------~~~C~~  430 (431)
T PLN03146        370 GADVKLQPLNTFVKVSEDLVCFAMIPTSS----IAIFGNLAQMNFLVGYDLESKTVSFK------PTDCTK  430 (431)
T ss_pred             CCeeecCcceeEEEcCCCcEEEEEecCCC----ceEECeeeEeeEEEEEECCCCEEeee------cCCcCc
Confidence            78999999999999988889999986532    47999999999999999999999999      888976


No 4  
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=99.41  E-value=8e-13  Score=94.97  Aligned_cols=58  Identities=21%  Similarity=0.398  Sum_probs=50.7

Q ss_pred             ceEEEcCCCeEEEeCCceEEEEEEeCCCC-CceeEEEechhcceeEEEEECCCCeEEee
Q 045937            4 VLWSIIGANSIVRVSNDVSCLGFVDGGVT-PKTSIVIGGHQLDNNLVQFDIATSRLGFS   61 (79)
Q Consensus         4 a~~~l~~~N~~v~v~~~v~CLafv~~~~~-~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~   61 (79)
                      ++|+||++||++...++..|++++.+... ..+..|||+.+||+++++||++++||||+
T Consensus       211 ~~~~l~~~~y~~~~~~~~~Cl~~~~~~~~~~~~~~ilG~~~l~~~~~vfD~~~~riGfa  269 (273)
T cd05475         211 RLLEIPPENYLIISEKGNVCLGILNGSEIGLGNTNIIGDISMQGLMVIYDNEKQQIGWV  269 (273)
T ss_pred             eEEEeCCCceEEEcCCCCEEEEEecCCCcCCCceEEECceEEEeeEEEEECcCCEeCcc
Confidence            79999999999988778899999865421 23468999999999999999999999999


No 5  
>KOG1339 consensus Aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.34  E-value=3.8e-12  Score=96.32  Aligned_cols=65  Identities=35%  Similarity=0.578  Sum_probs=57.0

Q ss_pred             cceEEEcCCCeEEEeCCceE-EEEEEeCCCCCceeEEEechhcceeEEEEECC-CCeEEeeecccccccccC
Q 045937            3 KVLWSIIGANSIVRVSNDVS-CLGFVDGGVTPKTSIVIGGHQLDNNLVQFDIA-TSRLGFSNSLLLQRTMCS   72 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~~v~-CLafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~-~~~lgF~~~l~~~~~~Cs   72 (79)
                      |+.|.++++||++++.++.. |++++.+...+ +..|||++|||+++++||+. ++|+||++.+    +.|+
T Consensus       331 g~~~~l~~~~y~~~~~~~~~~Cl~~~~~~~~~-~~~ilG~~~~~~~~~~~D~~~~~riGfa~~~----~~c~  397 (398)
T KOG1339|consen  331 GAVFSLPPKNYLVEVSDGGGVCLAFFNGMDSG-PLWILGDVFQQNYLVVFDLGENSRVGFAPAL----TNCS  397 (398)
T ss_pred             CcEEEeCccceEEEECCCCCceeeEEecCCCC-ceEEEchHHhCCEEEEEeCCCCCEEEecccc----ccCC
Confidence            79999999999999987655 99999875432 67999999999999999999 9999999877    6675


No 6  
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco.  CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=99.27  E-value=1.8e-11  Score=88.30  Aligned_cols=62  Identities=23%  Similarity=0.340  Sum_probs=52.2

Q ss_pred             cceEEEcCCCeEEEe-CCceEEEEEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeeccccccccc
Q 045937            3 KVLWSIIGANSIVRV-SNDVSCLGFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNSLLLQRTMC   71 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v-~~~v~CLafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~l~~~~~~C   71 (79)
                      |++|++++++|++.. ..+..|++|.+.+. ..+..|||+.+|++++++||++++||||+      +.+|
T Consensus       237 g~~~~l~~~~y~~~~~~~~~~C~~~~~~~~-~~~~~ilG~~fl~~~~vvfD~~~~~igfa------~~~C  299 (299)
T cd05472         237 GADVELDASGVLYPVDDSSQVCLAFAGTSD-DGGLSIIGNVQQQTFRVVYDVAGGRIGFA------PGGC  299 (299)
T ss_pred             CCEEEeCcccEEEEecCCCCEEEEEeCCCC-CCCCEEEchHHccceEEEEECCCCEEeEe------cCCC
Confidence            789999999999954 45678999987532 23467999999999999999999999999      6666


No 7  
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  The enzymes specifically cleave bonds in peptides which 
Probab=99.12  E-value=3e-10  Score=81.05  Aligned_cols=61  Identities=30%  Similarity=0.492  Sum_probs=53.5

Q ss_pred             cceEEEcCCCeEEEeCCceEEEEEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeeccccccccc
Q 045937            3 KVLWSIIGANSIVRVSNDVSCLGFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNSLLLQRTMC   71 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~~v~CLafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~l~~~~~~C   71 (79)
                      |+.|.++++||+++...+..|++++..+.  .+..|||...++++.++||++++||||+      ..+|
T Consensus       205 ~~~~~i~~~~y~~~~~~~~~C~~~~~~~~--~~~~ilG~~fl~~~~~vFD~~~~~iGfa------~~~C  265 (265)
T cd05476         205 GADLELPPENYFVDVGEGVVCLAILSSSS--GGVSILGNIQQQNFLVEYDLENSRLGFA------PADC  265 (265)
T ss_pred             CCEEEeCcccEEEECCCCCEEEEEecCCC--CCcEEEChhhcccEEEEEECCCCEEeee------cCCC
Confidence            68999999999998777889999997632  3468999999999999999999999999      6666


No 8  
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two 
Probab=98.91  E-value=1.5e-09  Score=80.76  Aligned_cols=64  Identities=23%  Similarity=0.430  Sum_probs=50.1

Q ss_pred             ceEEEcCCCeEEEeCC---ceEEEEEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeecccccccccCCcc
Q 045937            4 VLWSIIGANSIVRVSN---DVSCLGFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNSLLLQRTMCSNFN   75 (79)
Q Consensus         4 a~~~l~~~N~~v~v~~---~v~CLafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~l~~~~~~Cs~~~   75 (79)
                      ++++|++++|+.....   +..|+++.....  .+..|||+++||++.++||++++||||+      ++.|....
T Consensus       285 ~~l~l~p~~Y~~~~~~~~~~~~C~~~~~~~~--~~~~ILG~~flr~~yvvfD~~~~rIGfa------~~~C~~~~  351 (364)
T cd05473         285 FRITILPQLYLRPVEDHGTQLDCYKFAISQS--TNGTVIGAVIMEGFYVVFDRANKRVGFA------VSTCAEHD  351 (364)
T ss_pred             EEEEECHHHhhhhhccCCCcceeeEEeeecC--CCceEEeeeeEcceEEEEECCCCEEeeE------eccccccc
Confidence            4678999999986532   467987653221  2357999999999999999999999999      77887743


No 9  
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5.  Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=98.86  E-value=4.8e-09  Score=77.05  Aligned_cols=60  Identities=15%  Similarity=0.247  Sum_probs=50.4

Q ss_pred             cceEEEcCCCeEEEeCCceEEEEEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeecccccccccC
Q 045937            3 KVLWSIIGANSIVRVSNDVSCLGFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNSLLLQRTMCS   72 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~~v~CLafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~l~~~~~~Cs   72 (79)
                      |+.+++++++|++...+...|+++....    ...|||+..|+++.++||++++||||+      +.+|.
T Consensus       267 g~~~~i~p~~y~~~~~~~~c~~~~~~~~----~~~ILG~~flr~~y~vFD~~~~riGfa------~~~C~  326 (326)
T cd06096         267 NLKIDWKPSSYLYKKESFWCKGGEKSVS----NKPILGASFFKNKQIIFDLDNNRIGFV------ESNCP  326 (326)
T ss_pred             CcEEEECHHHhccccCCceEEEEEecCC----CceEEChHHhcCcEEEEECcCCEEeeE------cCCCC
Confidence            7899999999999876666677765432    257999999999999999999999999      67773


No 10 
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=98.73  E-value=2.2e-08  Score=71.50  Aligned_cols=58  Identities=17%  Similarity=0.331  Sum_probs=49.1

Q ss_pred             cceEEEcCCCeEEEeCC----ceEE-EEEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeec
Q 045937            3 KVLWSIIGANSIVRVSN----DVSC-LGFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNS   63 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~----~v~C-Lafv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~   63 (79)
                      |+.+.+|+++|+.+...    +..| ++|.+.+.   +..|||...|+++.++||.+++||||+++
T Consensus       233 g~~~~i~~~~~~~~~~~~~~~~~~C~~~i~~~~~---~~~iLG~~fl~~~y~vfD~~~~~ig~a~a  295 (295)
T cd05474         233 GATISVPLSDLVLPASTDDGGDGACYLGIQPSTS---DYNILGDTFLRSAYVVYDLDNNEISLAQA  295 (295)
T ss_pred             CeEEEEEHHHhEeccccCCCCCCCeEEEEEeCCC---CcEEeChHHhhcEEEEEECCCCEEEeecC
Confidence            67899999999998752    5678 68887643   35899999999999999999999999964


No 11 
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=98.72  E-value=4e-08  Score=71.68  Aligned_cols=60  Identities=17%  Similarity=0.292  Sum_probs=47.6

Q ss_pred             cceEEEcCCCeEEEe--CCceEEE-EEEeCCC--CCceeEEEechhcceeEEEEECCCCeEEeee
Q 045937            3 KVLWSIIGANSIVRV--SNDVSCL-GFVDGGV--TPKTSIVIGGHQLDNNLVQFDIATSRLGFSN   62 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v--~~~v~CL-afv~~~~--~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~   62 (79)
                      |+.++|++++|++..  +.+..|+ +|.....  ...+..|||+..|+++.++||++++||||+|
T Consensus       252 g~~~~l~~~~y~~~~~~~~~~~C~~~~~~~~~~~~~~~~~ILGd~flr~~y~vfD~~~~~IGfA~  316 (316)
T cd05486         252 GIPYSLSPQAYTLEDQSDGGGYCSSGFQGLDIPPPAGPLWILGDVFIRQYYSVFDRGNNRVGFAP  316 (316)
T ss_pred             CEEEEeCHHHeEEecccCCCCEEeeEEEECCCCCCCCCeEEEchHHhcceEEEEeCCCCEeeccC
Confidence            789999999999975  3346797 4654321  1234679999999999999999999999985


No 12 
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which 
Probab=98.65  E-value=4.9e-08  Score=71.25  Aligned_cols=56  Identities=20%  Similarity=0.341  Sum_probs=46.7

Q ss_pred             cceEEEcCCCeEEEeCCceEEEE-EEeCCCCCceeEEEechhcceeEEEEECCCCeEEeee
Q 045937            3 KVLWSIIGANSIVRVSNDVSCLG-FVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSN   62 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~~v~CLa-fv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~   62 (79)
                      |+.+.+|+++|+...  +..|++ |.+.+.  ....|||+.+|+++.++||++++||||++
T Consensus       261 g~~~~i~~~~y~~~~--~~~C~~~~~~~~~--~~~~IlG~~fl~~~y~vfD~~~~~iG~A~  317 (317)
T cd05478         261 GVQYPLPPSAYILQD--QGSCTSGFQSMGL--GELWILGDVFIRQYYSVFDRANNKVGLAP  317 (317)
T ss_pred             CEEEEECHHHheecC--CCEEeEEEEeCCC--CCeEEechHHhcceEEEEeCCCCEEeecC
Confidence            789999999999865  457986 555432  24579999999999999999999999985


No 13 
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank 
Probab=98.60  E-value=1.2e-07  Score=69.36  Aligned_cols=60  Identities=22%  Similarity=0.472  Sum_probs=47.7

Q ss_pred             cceEEEcCCCeEEEeCC--ceEEEE-EEeCCC--CCceeEEEechhcceeEEEEECCCCeEEeee
Q 045937            3 KVLWSIIGANSIVRVSN--DVSCLG-FVDGGV--TPKTSIVIGGHQLDNNLVQFDIATSRLGFSN   62 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~--~v~CLa-fv~~~~--~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~   62 (79)
                      |+.++|++++|++....  ...|++ |...+.  ...+..|||+..|+++.++||++++||||++
T Consensus       261 g~~~~l~~~~y~~~~~~~~~~~C~~~~~~~~~~~~~~~~~ilGd~flr~~y~vfD~~~~~IGfA~  325 (325)
T cd05490         261 GKVYPLTGEDYILKVSQRGTTICLSGFMGLDIPPPAGPLWILGDVFIGRYYTVFDRDNDRVGFAK  325 (325)
T ss_pred             CEEEEEChHHeEEeccCCCCCEEeeEEEECCCCCCCCceEEEChHhheeeEEEEEcCCcEeeccC
Confidence            67899999999998643  367985 654321  1234679999999999999999999999984


No 14 
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=98.58  E-value=1.4e-07  Score=68.68  Aligned_cols=58  Identities=16%  Similarity=0.292  Sum_probs=46.8

Q ss_pred             cceEEEcCCCeEEEeCCceEEE-EEEeCC---CCCceeEEEechhcceeEEEEECCCCeEEeee
Q 045937            3 KVLWSIIGANSIVRVSNDVSCL-GFVDGG---VTPKTSIVIGGHQLDNNLVQFDIATSRLGFSN   62 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~~v~CL-afv~~~---~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~   62 (79)
                      |+++.+|+++|+...  +..|+ +|.+..   ..+.+..|||..+|++..++||++++||||++
T Consensus       256 g~~~~v~~~~y~~~~--~~~C~~~i~~~~~~~~~~~~~~ilG~~fl~~~y~vfD~~~~~ig~a~  317 (318)
T cd05477         256 GVSFPLPPSAYILQN--NGYCTVGIEPTYLPSQNGQPLWILGDVFLRQYYSVYDLGNNQVGFAT  317 (318)
T ss_pred             CEEEEECHHHeEecC--CCeEEEEEEecccCCCCCCceEEEcHHHhhheEEEEeCCCCEEeeee
Confidence            688999999999875  34695 786531   11224689999999999999999999999996


No 15 
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=98.54  E-value=2.3e-07  Score=68.35  Aligned_cols=59  Identities=20%  Similarity=0.511  Sum_probs=47.5

Q ss_pred             cceEEEcCCCeEEEeCC--ceEEEE-EEeCCC--CCceeEEEechhcceeEEEEECCCCeEEee
Q 045937            3 KVLWSIIGANSIVRVSN--DVSCLG-FVDGGV--TPKTSIVIGGHQLDNNLVQFDIATSRLGFS   61 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~--~v~CLa-fv~~~~--~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~   61 (79)
                      |+.+.+++++|++++.+  ...|++ |.....  ...+..|||...|++..++||++++||||+
T Consensus       265 g~~~~i~~~~yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~IlG~~fl~~~y~vFD~~~~~ig~a  328 (329)
T cd05485         265 GKSFSLTGKDYVLKVTQMGQTICLSGFMGIDIPPPAGPLWILGDVFIGKYYTEFDLGNNRVGFA  328 (329)
T ss_pred             CEEeEEChHHeEEEecCCCCCEEeeeEEECcCCCCCCCeEEEchHHhccceEEEeCCCCEEeec
Confidence            67899999999998753  467985 664321  123457999999999999999999999997


No 16 
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate  r
Probab=98.54  E-value=2.5e-07  Score=67.89  Aligned_cols=61  Identities=18%  Similarity=0.357  Sum_probs=48.9

Q ss_pred             cceEEEcCCCeEEEeCC--ceEEE-EEEeCCC--CCceeEEEechhcceeEEEEECCCCeEEeeec
Q 045937            3 KVLWSIIGANSIVRVSN--DVSCL-GFVDGGV--TPKTSIVIGGHQLDNNLVQFDIATSRLGFSNS   63 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~--~v~CL-afv~~~~--~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~   63 (79)
                      |+.++|++++|++...+  +..|+ +|...+.  ...+..|||...|+++.++||++++||||+++
T Consensus       261 g~~~~v~~~~yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~ilG~~flr~~y~vfD~~~~~IGfA~a  326 (326)
T cd05487         261 GKEYTLSSSDYVLQDSDFSDKLCTVAFHAMDIPPPTGPLWVLGATFIRKFYTEFDRQNNRIGFALA  326 (326)
T ss_pred             CEEEEeCHHHhEEeccCCCCCEEEEEEEeCCCCCCCCCeEEEehHHhhccEEEEeCCCCEEeeeeC
Confidence            67899999999998754  56796 6765321  12246899999999999999999999999963


No 17 
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme.  Proteinase A preferentially hydro
Probab=98.54  E-value=1.5e-07  Score=68.83  Aligned_cols=58  Identities=16%  Similarity=0.293  Sum_probs=45.7

Q ss_pred             cceEEEcCCCeEEEeCCceEEEEEEeC-CC--CCceeEEEechhcceeEEEEECCCCeEEeee
Q 045937            3 KVLWSIIGANSIVRVSNDVSCLGFVDG-GV--TPKTSIVIGGHQLDNNLVQFDIATSRLGFSN   62 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~~v~CLafv~~-~~--~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~   62 (79)
                      |+++.||++||+++..  ..|++.+.. ..  ...+..|||+.+|++..++||++++||||++
T Consensus       260 g~~~~i~~~~y~~~~~--g~C~~~~~~~~~~~~~~~~~ilG~~fl~~~y~vfD~~~~~iG~a~  320 (320)
T cd05488         260 GYNFTLGPFDYTLEVS--GSCISAFTGMDFPEPVGPLAIVGDAFLRKYYSVYDLGNNAVGLAK  320 (320)
T ss_pred             CEEEEECHHHheecCC--CeEEEEEEECcCCCCCCCeEEEchHHhhheEEEEeCCCCEEeecC
Confidence            6899999999998653  369975532 11  1123579999999999999999999999984


No 18 
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases.  They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=98.44  E-value=5.4e-07  Score=66.01  Aligned_cols=60  Identities=17%  Similarity=0.333  Sum_probs=47.5

Q ss_pred             cceEEEcCCCeEEEeCC--ceEEEE-EEeCC--CCCceeEEEechhcceeEEEEECCCCeEEeee
Q 045937            3 KVLWSIIGANSIVRVSN--DVSCLG-FVDGG--VTPKTSIVIGGHQLDNNLVQFDIATSRLGFSN   62 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~--~v~CLa-fv~~~--~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~   62 (79)
                      |+.++|++++|+++..+  ...|++ |...+  ....+..|||...|+++.++||++++||||++
T Consensus       253 g~~~~l~~~~yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~IlGd~Flr~~y~VfD~~~~~iGfA~  317 (317)
T cd06098         253 GKTFELTPEQYILKVGEGAAAQCISGFTALDVPPPRGPLWILGDVFMGAYHTVFDYGNLRVGFAE  317 (317)
T ss_pred             CEEEEEChHHeEEeecCCCCCEEeceEEECCCCCCCCCeEEechHHhcccEEEEeCCCCEEeecC
Confidence            68999999999998754  357975 54322  11234679999999999999999999999984


No 19 
>PTZ00147 plasmepsin-1; Provisional
Probab=98.33  E-value=1.2e-06  Score=68.54  Aligned_cols=61  Identities=11%  Similarity=0.340  Sum_probs=48.8

Q ss_pred             cceEEEcCCCeEEEeCC--ceEEE-EEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeecc
Q 045937            3 KVLWSIIGANSIVRVSN--DVSCL-GFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNSL   64 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~--~v~CL-afv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~l   64 (79)
                      |+.++|++++|+.+..+  ...|+ +|++.+. ..+..|||+..|+++.++||++++||||+++-
T Consensus       387 g~~~~L~p~~yi~~~~~~~~~~C~~~i~~~~~-~~~~~ILGd~FLr~~YtVFD~~n~rIGfA~a~  450 (453)
T PTZ00147        387 NKVYTLEPEYYLQPIEDIGSALCMLNIIPIDL-EKNTFILGDPFMRKYFTVFDYDNHTVGFALAK  450 (453)
T ss_pred             CEEEEECHHHheeccccCCCcEEEEEEEECCC-CCCCEEECHHHhccEEEEEECCCCEEEEEEec
Confidence            67899999999986533  35797 5776542 22357999999999999999999999999753


No 20 
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=98.32  E-value=1.1e-06  Score=68.72  Aligned_cols=60  Identities=12%  Similarity=0.321  Sum_probs=47.8

Q ss_pred             cceEEEcCCCeEEEeC--CceEEE-EEEeCCCCCceeEEEechhcceeEEEEECCCCeEEeeec
Q 045937            3 KVLWSIIGANSIVRVS--NDVSCL-GFVDGGVTPKTSIVIGGHQLDNNLVQFDIATSRLGFSNS   63 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~--~~v~CL-afv~~~~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~   63 (79)
                      |+.++|++++|+.+..  ++..|+ ++.+.+. ..+..|||...|+++.++||++++||||++.
T Consensus       386 g~~~~L~p~~Yi~~~~~~~~~~C~~~i~~~~~-~~~~~ILGd~FLr~~Y~VFD~~n~rIGfA~a  448 (450)
T PTZ00013        386 NNTYTLEPEYYMNPLLDVDDTLCMITMLPVDI-DDNTFILGDPFMRKYFTVFDYDKESVGFAIA  448 (450)
T ss_pred             CEEEEECHHHheehhccCCCCeeEEEEEECCC-CCCCEEECHHHhccEEEEEECCCCEEEEEEe
Confidence            6789999999998642  346796 6665432 1245799999999999999999999999975


No 21 
>PTZ00165 aspartyl protease; Provisional
Probab=98.28  E-value=2.6e-06  Score=67.13  Aligned_cols=61  Identities=13%  Similarity=0.293  Sum_probs=48.6

Q ss_pred             ceEEEcCCCeEEEe----CCceEEE-EEEeCCC--CCceeEEEechhcceeEEEEECCCCeEEeeecc
Q 045937            4 VLWSIIGANSIVRV----SNDVSCL-GFVDGGV--TPKTSIVIGGHQLDNNLVQFDIATSRLGFSNSL   64 (79)
Q Consensus         4 a~~~l~~~N~~v~v----~~~v~CL-afv~~~~--~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~~l   64 (79)
                      +.+.+++++|++++    ..+..|. +|.+.+.  ..++..|||+..++++.++||+++.||||++..
T Consensus       380 v~~~l~p~dYi~~~~~~~~~~~~C~~g~~~~d~~~~~g~~~ILGd~Flr~yy~VFD~~n~rIGfA~a~  447 (482)
T PTZ00165        380 IKFDMDPEDYVIEEGDSEEQEHQCVIGIIPMDVPAPRGPLFVLGNNFIRKYYSIFDRDHMMVGLVPAK  447 (482)
T ss_pred             EEEEEchHHeeeecccCCCCCCeEEEEEEECCCCCCCCceEEEchhhheeEEEEEeCCCCEEEEEeec
Confidence            48999999999985    2346894 7876432  124578999999999999999999999999653


No 22 
>PF00026 Asp:  Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.;  InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) .  More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=97.51  E-value=0.00019  Score=51.11  Aligned_cols=60  Identities=22%  Similarity=0.337  Sum_probs=47.4

Q ss_pred             cceEEEcCCCeEEEeCCc--eEEEEE-EeCC-CCCceeEEEechhcceeEEEEECCCCeEEeee
Q 045937            3 KVLWSIIGANSIVRVSND--VSCLGF-VDGG-VTPKTSIVIGGHQLDNNLVQFDIATSRLGFSN   62 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~~--v~CLaf-v~~~-~~~~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~   62 (79)
                      |..+++|+++|+......  ..|... .+.+ .......|+|..=++++.++||.+++|+||++
T Consensus       253 ~~~~~i~~~~~~~~~~~~~~~~C~~~i~~~~~~~~~~~~iLG~~fl~~~y~vfD~~~~~ig~A~  316 (317)
T PF00026_consen  253 GVTFTIPPSDYIFKIEDGNGGYCYLGIQPMDSSDDSDDWILGSPFLRNYYVVFDYENNRIGFAQ  316 (317)
T ss_dssp             TEEEEEEHHHHEEEESSTTSSEEEESEEEESSTTSSSEEEEEHHHHTTEEEEEETTTTEEEEEE
T ss_pred             CEEEEecchHhcccccccccceeEeeeecccccccCCceEecHHHhhceEEEEeCCCCEEEEec
Confidence            678999999999988654  378654 4411 12235689999999999999999999999996


No 23 
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=97.14  E-value=0.0004  Score=48.47  Aligned_cols=26  Identities=31%  Similarity=0.612  Sum_probs=24.9

Q ss_pred             EEEechhcceeEEEEECCCCeEEeee
Q 045937           37 IVIGGHQLDNNLVQFDIATSRLGFSN   62 (79)
Q Consensus        37 ~ViGn~Qqqn~~v~fDl~~~~lgF~~   62 (79)
                      .|||..+|+++.++||+++++|||++
T Consensus       258 ~ilG~~fl~~~y~vfD~~~~~igfa~  283 (283)
T cd05471         258 WILGDVFLRNYYTVFDLDNNRIGFAP  283 (283)
T ss_pred             EEccHhhhhheEEEEeCCCCEEeecC
Confidence            79999999999999999999999984


No 24 
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=96.68  E-value=0.0013  Score=47.03  Aligned_cols=26  Identities=31%  Similarity=0.617  Sum_probs=24.7

Q ss_pred             EEEechhcceeEEEEECCCCeEEeee
Q 045937           37 IVIGGHQLDNNLVQFDIATSRLGFSN   62 (79)
Q Consensus        37 ~ViGn~Qqqn~~v~fDl~~~~lgF~~   62 (79)
                      .|||...++++.++||++++||||++
T Consensus       253 ~ilGd~fl~~~y~vfD~~~~~ig~A~  278 (278)
T cd06097         253 SILGDVFLKAQYVVFDVGGPKLGFAP  278 (278)
T ss_pred             EEEcchhhCceeEEEcCCCceeeecC
Confidence            58999999999999999999999985


No 25 
>PRK12442 translation initiation factor IF-1; Reviewed
Probab=54.41  E-value=25  Score=22.42  Aligned_cols=55  Identities=16%  Similarity=0.279  Sum_probs=36.0

Q ss_pred             EEcCCCeEEEeCCceEEEEEEeCCCCCcee-EEEechhcceeEEEEECCCCeEEeee
Q 045937            7 SIIGANSIVRVSNDVSCLGFVDGGVTPKTS-IVIGGHQLDNNLVQFDIATSRLGFSN   62 (79)
Q Consensus         7 ~l~~~N~~v~v~~~v~CLafv~~~~~~~~~-~ViGn~Qqqn~~v~fDl~~~~lgF~~   62 (79)
                      .||...+.|..+++...+|-+++.--..-. ++.|..-.-.+. -||+.++||=|.-
T Consensus        16 ~Lp~~~frV~LenG~~vla~isGKmR~~rIrIl~GD~V~VE~s-pYDltkGRIiyR~   71 (87)
T PRK12442         16 VLPDSRFRVTLENGVEVGAYASGRMRKHRIRILAGDRVTLELS-PYDLTKGRINFRH   71 (87)
T ss_pred             ECCCCEEEEEeCCCCEEEEEeccceeeeeEEecCCCEEEEEEC-cccCCceeEEEEe
Confidence            467777788888888888988764211011 334444444443 6999999999874


No 26 
>TIGR00008 infA translation initiation factor IF-1. This family consists of translation initiation factor IF-1 as found in bacteria and chloroplasts. This protein, about 70 residues in length, consists largely of an S1 RNA binding domain (pfam00575).
Probab=33.12  E-value=1.1e+02  Score=18.43  Aligned_cols=53  Identities=21%  Similarity=0.312  Sum_probs=33.2

Q ss_pred             EEcCCCeEEEeCCceEEEEEEeCCCCC-ceeEEEechhcceeEEEEECCCCeEEe
Q 045937            7 SIIGANSIVRVSNDVSCLGFVDGGVTP-KTSIVIGGHQLDNNLVQFDIATSRLGF   60 (79)
Q Consensus         7 ~l~~~N~~v~v~~~v~CLafv~~~~~~-~~~~ViGn~Qqqn~~v~fDl~~~~lgF   60 (79)
                      .||...+.|...+|..-+|.+++.-.. .--+..|..-.-. +-.||+.++++=|
T Consensus        14 ~L~~~~f~V~l~ng~~vla~i~GKmr~~rI~I~~GD~V~Ve-~spyd~tkgrIi~   67 (68)
T TIGR00008        14 SLPNAMFRVELENGHEVLAHISGKIRMHYIRILPGDKVKVE-LSPYDLTRGRITY   67 (68)
T ss_pred             ECCCCEEEEEECCCCEEEEEecCcchhccEEECCCCEEEEE-ECcccCCcEeEEe
Confidence            467778888888888888888764210 1113334443333 3358999988765


No 27 
>PF14781 BBS2_N:  Ciliary BBSome complex subunit 2, N-terminal
Probab=31.85  E-value=62  Score=22.15  Aligned_cols=26  Identities=23%  Similarity=0.461  Sum_probs=17.2

Q ss_pred             EeCCceEEEEEEeCCCCCceeEEEec
Q 045937           16 RVSNDVSCLGFVDGGVTPKTSIVIGG   41 (79)
Q Consensus        16 ~v~~~v~CLafv~~~~~~~~~~ViGn   41 (79)
                      ++.+|+-|+.+..-+..+.|.+|+|+
T Consensus        90 e~~DGvn~i~~g~~~~~~~~l~ivGG  115 (136)
T PF14781_consen   90 EVPDGVNAIVIGKLGDIPSPLVIVGG  115 (136)
T ss_pred             hCccceeEEEEEecCCCCCcEEEECc
Confidence            55677778777655544566777765


No 28 
>COG5343 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.90  E-value=56  Score=24.39  Aligned_cols=28  Identities=29%  Similarity=0.280  Sum_probs=23.4

Q ss_pred             eEEEechhcceeEEEEECCCCeEEeeec
Q 045937           36 SIVIGGHQLDNNLVQFDIATSRLGFSNS   63 (79)
Q Consensus        36 ~~ViGn~Qqqn~~v~fDl~~~~lgF~~~   63 (79)
                      +++.-+=-|.+.+++||-+++|++-++.
T Consensus       134 a~L~~~~s~v~~l~~~D~~tg~~sl~~l  161 (240)
T COG5343         134 ASLAANGSQVNLLVVYDSETGRLSLSPL  161 (240)
T ss_pred             EEeecCCCcceEEEEEecCCCcEEEEec
Confidence            3555666788999999999999999875


No 29 
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=28.95  E-value=1.9e+02  Score=21.63  Aligned_cols=48  Identities=25%  Similarity=0.442  Sum_probs=34.8

Q ss_pred             cceEEEcCCCeEEEeCCceEEEEEEeCCCCCceeEEEechhcce----eEEEEECCCCeE
Q 045937            3 KVLWSIIGANSIVRVSNDVSCLGFVDGGVTPKTSIVIGGHQLDN----NLVQFDIATSRL   58 (79)
Q Consensus         3 Ga~~~l~~~N~~v~v~~~v~CLafv~~~~~~~~~~ViGn~Qqqn----~~v~fDl~~~~l   58 (79)
                      +..|.=|+.+    +...|.-+.+++..    ...|-|++...+    .+..||.+++..
T Consensus        25 ~~qW~~~g~~----i~G~V~~l~~~~~~----~Llv~G~ft~~~~~~~~la~yd~~~~~w   76 (281)
T PF12768_consen   25 NSQWSSPGNG----ISGTVTDLQWASNN----QLLVGGNFTLNGTNSSNLATYDFKNQTW   76 (281)
T ss_pred             CCEeecCCCC----ceEEEEEEEEecCC----EEEEEEeeEECCCCceeEEEEecCCCee
Confidence            5678888887    44457788888543    356777776555    899999999976


No 30 
>PTZ00371 aspartyl aminopeptidase; Provisional
Probab=28.58  E-value=1.2e+02  Score=24.33  Aligned_cols=40  Identities=15%  Similarity=0.297  Sum_probs=23.1

Q ss_pred             cceEEEc-CCCeEEEeCCceEEEEEE-eCCC--CCceeEEEechh
Q 045937            3 KVLWSII-GANSIVRVSNDVSCLGFV-DGGV--TPKTSIVIGGHQ   43 (79)
Q Consensus         3 Ga~~~l~-~~N~~v~v~~~v~CLafv-~~~~--~~~~~~ViGn~Q   43 (79)
                      ...|+|. +.-|++.-++..+ .||. +...  ...++.|+|.|-
T Consensus        44 ~~~w~l~~g~kyyv~r~~ssl-~Af~vg~~~~~~~~g~~ivgaHt   87 (465)
T PTZ00371         44 GENWKLEKGGKYYLTRNNSTI-VAFTVGKKFDAPNGGFKIVGAHT   87 (465)
T ss_pred             cccCccCCCCEEEEEcCCcEE-EEEEeCCCCccCCCCeEEEEEec
Confidence            4569974 5678886655443 4554 3321  223577888874


No 31 
>PF02186 TFIIE_beta:  TFIIE beta subunit core domain;  InterPro: IPR003166 Initiation of eukaryotic mRNA transcription requires melting of promoter DNA with the help of the general transcription factors TFIIE and TFIIH. In higher eukaryotes, the general transcription factor TFIIE consists of two subunits: the large alpha subunit (IPR002853 from INTERPRO) and the small beta (IPR003166 from INTERPRO). TFIIE beta has been found to bind to the region where the promoter starts to open to be single-stranded upon transcription initiation by RNA polymerase II. The approximately 120-residue central core domain of TFIIE beta plays a role in double-stranded DNA binding of TFIIE []. The TFIIE beta central core DNA-binding domain consists of three helices with a beta hairpin at the C terminus, resembling the winged helix proteins. It shows a novel double-stranded DNA-binding activity where the DNA-binding surface locates on the opposite side to the previously reported winged helix motif by forming a positively charged furrow []. This entry represents the beta subunit of the transcription factor TFIIE.; GO: 0006367 transcription initiation from RNA polymerase II promoter, 0005673 transcription factor TFIIE complex; PDB: 1D8K_A 1D8J_A.
Probab=28.16  E-value=51  Score=19.49  Aligned_cols=19  Identities=16%  Similarity=0.405  Sum_probs=12.4

Q ss_pred             ceeEEEEECCCCeEEeeec
Q 045937           45 DNNLVQFDIATSRLGFSNS   63 (79)
Q Consensus        45 qn~~v~fDl~~~~lgF~~~   63 (79)
                      .|=.|+||-+++++.|.|.
T Consensus        44 ~npKI~~d~~~~~f~fkp~   62 (65)
T PF02186_consen   44 NNPKIEYDPDGNTFSFKPK   62 (65)
T ss_dssp             H-TTEEEE-TT-CEEE--T
T ss_pred             cCCCEEEecCCCEEEeccC
Confidence            6778999999999999874


No 32 
>PF13418 Kelch_4:  Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=22.06  E-value=1.3e+02  Score=15.46  Aligned_cols=23  Identities=35%  Similarity=0.544  Sum_probs=14.2

Q ss_pred             eeEEEechhcce----eEEEEECCCCe
Q 045937           35 TSIVIGGHQLDN----NLVQFDIATSR   57 (79)
Q Consensus        35 ~~~ViGn~Qqqn----~~v~fDl~~~~   57 (79)
                      ...|+|+....+    -...||++.++
T Consensus        14 ~i~v~GG~~~~~~~~~d~~~~d~~~~~   40 (49)
T PF13418_consen   14 SIYVFGGRDSSGSPLNDLWIFDIETNT   40 (49)
T ss_dssp             EEEEE--EEE-TEE---EEEEETTTTE
T ss_pred             eEEEECCCCCCCcccCCEEEEECCCCE
Confidence            367888877763    56789999886


No 33 
>COG5428 Uncharacterized conserved small protein [Function unknown]
Probab=20.40  E-value=1.2e+02  Score=18.54  Aligned_cols=18  Identities=17%  Similarity=0.298  Sum_probs=16.0

Q ss_pred             cceeEEEEECCCCeEEee
Q 045937           44 LDNNLVQFDIATSRLGFS   61 (79)
Q Consensus        44 qqn~~v~fDl~~~~lgF~   61 (79)
                      -+|.+|.+|-.++.+||.
T Consensus        29 ~edi~Idide~GkV~GiE   46 (69)
T COG5428          29 GEDILIDIDENGKVIGIE   46 (69)
T ss_pred             CCcEEEEecCCCcEEEEE
Confidence            368899999999999996


No 34 
>PF03930 Flp_N:  Recombinase Flp protein N-terminus;  InterPro: IPR022647 This entry describes the N-terminal domain of Recombinase Flp protein and is found in fungi. FLP proteins catalyse recombination between large inverted repetitions of the plasmid.; PDB: 1P4E_C 1FLO_C 1M6X_A.
Probab=20.12  E-value=1.2e+02  Score=19.14  Aligned_cols=17  Identities=29%  Similarity=0.423  Sum_probs=13.0

Q ss_pred             EEEEECCCCeEEeeecc
Q 045937           48 LVQFDIATSRLGFSNSL   64 (79)
Q Consensus        48 ~v~fDl~~~~lgF~~~l   64 (79)
                      -|.||...+.|.|.-.|
T Consensus        24 sl~fD~~~k~v~F~Yhl   40 (82)
T PF03930_consen   24 SLQFDSDDKIVSFEYHL   40 (82)
T ss_dssp             H-EEETTTTEEEEEES-
T ss_pred             ceEEecCCcEEEEEEec
Confidence            37899999999998544


No 35 
>PF08446 PAS_2:  PAS fold;  InterPro: IPR013654 The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs []. The PAS fold appears in archaea, eubacteria and eukarya. ; GO: 0008020 G-protein coupled photoreceptor activity, 0006355 regulation of transcription, DNA-dependent, 0009584 detection of visible light, 0018298 protein-chromophore linkage; PDB: 3S7O_A 2O9B_A 3S7P_A 1ZTU_A 3S7N_A 3S7Q_A 2O9C_A 2OOL_A 3C2W_G 3NHQ_B ....
Probab=20.11  E-value=89  Score=19.65  Aligned_cols=29  Identities=17%  Similarity=0.168  Sum_probs=22.3

Q ss_pred             ceeEEEechhcceeEEEEECCCCeEEeee
Q 045937           34 KTSIVIGGHQLDNNLVQFDIATSRLGFSN   62 (79)
Q Consensus        34 ~~~~ViGn~Qqqn~~v~fDl~~~~lgF~~   62 (79)
                      +|.-+.|.+|--..++.+|.+..+|-..+
T Consensus         4 EPIH~pG~IQphG~LLa~d~~~~~I~~~S   32 (110)
T PF08446_consen    4 EPIHIPGSIQPHGALLALDPDDLRIVQAS   32 (110)
T ss_dssp             S-TTC-SEE-TTSEEEEEETTTTBEEEEE
T ss_pred             ccccCCCccCCCEEEEEEECCCCEEEEEc
Confidence            35667899999999999999988887764


Done!