RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3212
         (177 letters)



>gnl|CDD|133152 cd05485, Cathepsin_D_like, Cathepsin_D_like, pepsin family of
           proteinases.  Cathepsin D is the major aspartic
           proteinase of the lysosomal compartment where it
           functions in protein catabolism. It is a member of the
           pepsin family of proteinases. This enzyme is
           distinguished from other members of the pepsin family by
           two features that are characteristic of lysosomal
           hydrolases. First, mature Cathepsin D is found
           predominantly in a two-chain form due to a
           posttranslational cleavage event. Second, it contains
           phosphorylated, N-linked oligosaccharides that target
           the enzyme to lysosomes via mannose-6-phosphate
           receptors. Cathepsin D preferentially attacks peptide
           bonds flanked by bulky hydrophobic amino acids and its
           pH optimum is between pH 2.8 and 4.0. Two active site
           aspartic acid residues are essential for the catalytic
           activity of aspartic proteinases. Like other aspartic
           proteinases, Cathepsin D is a bilobed molecule; the two
           evolutionary related lobes are mostly made up of
           beta-sheets and flank a deep active site cleft. Each of
           the two related lobes contributes one active site
           aspartic acid residue and contains a single carbohydrate
           group. Cathepsin D is an essential enzyme. Mice
           deficient for proteinase cathepsin D, generated by gene
           targeting, develop normally during the first 2 weeks,
           stop thriving in the third week and die in a state of
           anorexia in the fourth week. The mice develop atrophy of
           ileal mucosa followed by other degradation of intestinal
           organs. In these knockout mice, lysosomal proteolysis
           was normal. These results suggest that vital functions
           of cathepsin D are exerted by limited proteolysis of
           proteins regulating cell growth and/or tissue
           homeostasis, while its contribution to bulk proteolysis
           in lysosomes appears to be non-critical. This family of
           aspartate proteases is classified by MEROPS as the
           peptidase family A1 (pepsin A, clan AA).
          Length = 329

 Score =  141 bits (356), Expect = 2e-41
 Identities = 55/115 (47%), Positives = 78/115 (67%), Gaps = 1/115 (0%)

Query: 61  RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
           R+ S + GGE++ GG D   + G+ TY PV+RKGYWQF ++S+ + +  +CS  CQAIAD
Sbjct: 157 RDPSAKEGGELILGGSDPKHYTGNFTYLPVTRKGYWQFKMDSVSVGEGEFCSGGCQAIAD 216

Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
           TGTSLI GP   I +LN  IGA P+  G   V+C  +  +P++  +LGGK+F+LT
Sbjct: 217 TGTSLIAGPVDEIEKLNNAIGAKPIIGGEYMVNCSAIPSLPDITFVLGGKSFSLT 271



 Score = 80.3 bits (198), Expect = 2e-18
 Identities = 31/41 (75%), Positives = 36/41 (87%)

Query: 7  AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
          AQYYG I++GTPPQ FKV+FDTGSSNLW+PS+ CS  NIAC
Sbjct: 10 AQYYGVITIGTPPQSFKVVFDTGSSNLWVPSKKCSWTNIAC 50


>gnl|CDD|133162 cd06098, phytepsin, Phytepsin, a plant homolog of mammalian
           lysosomal pepsins.  Phytepsin, a plant homolog of
           mammalian lysosomal pepsins, resides in grains, roots,
           stems, leaves and flowers. Phytepsin may participate in
           metabolic turnover and in protein processing events. In
           addition, it highly expressed in several plant tissues
           undergoing apoptosis. Phytepsin contains an internal
           region consisting of about 100 residues not present in
           animal or microbial pepsins. This region is thus called
           a plant specific insert. The insert is highly similar to
           saponins, which are lysosomal sphingolipid-activating
           proteins in mammalian cells. The saponin-like domain may
           have a role in the vacuolar targeting of phytepsin.
           Phytepsin, as its animal counterparts, possesses a
           topology typical of all aspartic proteases.  They are
           bilobal enzymes, each lobe contributing a catalytic Asp
           residue, with an extended active site cleft localized
           between the two lobes of the molecule. One lobe has
           probably evolved from the other through a gene
           duplication event in the distant past. This family of
           aspartate proteases is classified by MEROPS as the
           peptidase family A1 (pepsin A, clan AA).
          Length = 317

 Score =  119 bits (299), Expect = 5e-33
 Identities = 52/117 (44%), Positives = 72/117 (61%), Gaps = 15/117 (12%)

Query: 61  RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
           RN  +E GGE++FGGVD   F G+ TY PV+RKGYWQF +  + I      +C+  C AI
Sbjct: 155 RNPDEEEGGELVFGGVDPKHFKGEHTYVPVTRKGYWQFEMGDVLIGGKSTGFCAGGCAAI 214

Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
           AD+GTSL+ GP+ ++ ++N            + VDC++L  MPNV   +GGK F LT
Sbjct: 215 ADSGTSLLAGPTTIVTQIN------------SAVDCNSLSSMPNVSFTIGGKTFELT 259



 Score = 76.6 bits (189), Expect = 4e-17
 Identities = 28/41 (68%), Positives = 33/41 (80%), Gaps = 1/41 (2%)

Query: 7  AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
          AQY+G I +GTPPQ+F VIFDTGSSNLW+PS  C   +IAC
Sbjct: 9  AQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKC-YFSIAC 48


>gnl|CDD|215663 pfam00026, Asp, Eukaryotic aspartyl protease.  Aspartyl (acid)
           proteases include pepsins, cathepsins, and renins.
           Two-domain structure, probably arising from ancestral
           duplication. This family does not include the retroviral
           nor retrotransposon proteases (pfam00077), which are
           much smaller and appear to be homologous to a single
           domain of the eukaryotic asp proteases.
          Length = 316

 Score =  115 bits (290), Expect = 1e-31
 Identities = 52/128 (40%), Positives = 80/128 (62%), Gaps = 5/128 (3%)

Query: 49  VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
           +D   F   S + N+ D  GGEI+FGGVD  K+ G +T+ PV+ +GYWQ  ++SI +   
Sbjct: 137 IDSPAF---SVYLNSDDAGGGEIIFGGVDPSKYTGSLTWVPVTSQGYWQITLDSITVGGS 193

Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
             +CS+ CQAI DTGTSL+ GP+ +++++ K +GA     G   VDCD++  +P+V   +
Sbjct: 194 ATFCSSGCQAILDTGTSLLYGPTSIVSKIAKAVGASLSEYGGYVVDCDSISSLPDVTFFI 253

Query: 167 GGKNFTLT 174
           GG   T+ 
Sbjct: 254 GGAKITVP 261



 Score = 71.5 bits (176), Expect = 2e-15
 Identities = 26/41 (63%), Positives = 34/41 (82%), Gaps = 1/41 (2%)

Query: 8  QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
          +YYGTIS+GTPPQ+F V+FDTGSS+LW+PS +C+    AC 
Sbjct: 1  EYYGTISIGTPPQKFTVVFDTGSSDLWVPSVYCTSSY-ACK 40


>gnl|CDD|133145 cd05478, pepsin_A, Pepsin A, aspartic protease produced in gastric
           mucosa of mammals.  Pepsin, a well-known aspartic
           protease, is produced by the human gastric mucosa in
           seven different zymogen isoforms, subdivided into two
           types: pepsinogen A and pepsinogen C. The prosequence of
           the zymogens are self cleaved under acidic pH. The
           mature enzymes are called pepsin A and pepsin C,
           correspondingly. The well researched porcine pepsin is
           also in this pepsin A family. Pepsins play an integral
           role in the digestion process of vertebrates. Pepsins
           are bilobal enzymes, each lobe contributing a catalytic
           Asp residue, with an extended active site cleft
           localized between the two lobes of the molecule. One
           lobe may be evolved from the other through ancient
           gene-duplication event. More recently evolved enzymes
           have similar three-dimensional structures, however their
           amino acid sequences are more divergent except for the
           conserved catalytic site motif. Pepsins specifically
           cleave bonds in peptides which have at least six
           residues in length with hydrophobic residues in both the
           P1 and P1' positions. The active site is located at the
           groove formed by the two lobes, with an extended loop
           projecting over the cleft to form an 11-residue flap,
           which encloses substrates and inhibitors in the active
           site. Specificity is determined by nearest-neighbor
           hydrophobic residues surrounding the catalytic
           aspartates, and by three residues in the flap. This
           family of aspartate proteases is classified by MEROPS as
           the peptidase family A1 (pepsin A, clan AA).
          Length = 317

 Score =  111 bits (279), Expect = 5e-30
 Identities = 46/128 (35%), Positives = 70/128 (54%), Gaps = 5/128 (3%)

Query: 49  VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
           V +D F   S + +++ + G  + FGG+D   + G + + PV+ + YWQ  V+S+ I   
Sbjct: 143 VSQDLF---SVYLSSNGQQGSVVTFGGIDPSYYTGSLNWVPVTAETYWQITVDSVTINGQ 199

Query: 109 VY--CSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
           V      CQAI DTGTSL++GPS  IA +   IGA    NG   V+C ++  MP+V   +
Sbjct: 200 VVACSGGCQAIVDTGTSLLVGPSSDIANIQSDIGASQNQNGEMVVNCSSISSMPDVVFTI 259

Query: 167 GGKNFTLT 174
            G  + L 
Sbjct: 260 NGVQYPLP 267



 Score = 79.4 bits (196), Expect = 4e-18
 Identities = 31/48 (64%), Positives = 39/48 (81%), Gaps = 3/48 (6%)

Query: 7  AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
           +YYGTIS+GTPPQ+F VIFDTGSSNLW+PS +CS    ACS + ++F
Sbjct: 9  MEYYGTISIGTPPQDFTVIFDTGSSNLWVPSVYCSSQ--ACS-NHNRF 53


>gnl|CDD|133157 cd05490, Cathepsin_D2, Cathepsin_D2, pepsin family of proteinases. 
           Cathepsin D is the major aspartic proteinase of the
           lysosomal compartment where it functions in protein
           catabolism. It is a member of the pepsin family of
           proteinases. This enzyme is distinguished from other
           members of the pepsin family by two features that are
           characteristic of lysosomal hydrolases. First, mature
           Cathepsin D is found predominantly in a two-chain form
           due to a posttranslational cleavage event. Second, it
           contains phosphorylated, N-linked oligosaccharides that
           target the enzyme to lysosomes via mannose-6-phosphate
           receptors. Cathepsin D preferentially attacks peptide
           bonds flanked by bulky hydrophobic amino acids and its
           pH optimum is between pH 2.8 and 4.0. Two active site
           aspartic acid residues are essential for the catalytic
           activity of aspartic proteinases. Like other aspartic
           proteinases, Cathepsin D is a bilobed molecule; the two
           evolutionary related lobes are mostly made up of
           beta-sheets and flank a deep active site cleft. Each of
           the two related lobes contributes one active site
           aspartic acid residue and contains a single carbohydrate
           group. Cathepsin D is an essential enzyme. Mice
           deficient for proteinase cathepsin D, generated by gene
           targeting, develop normally during the first 2 weeks,
           stop thriving in the third week and die in a state of
           anorexia in the fourth week. The mice develop atrophy of
           ileal mucosa followed by other degradation of intestinal
           organs. In these knockout mice, lysosomal proteolysis
           was normal. These results suggest that vital functions
           of cathepsin D are exerted by limited proteolysis of
           proteins regulating cell growth and/or tissue
           homeostasis, while its contribution to bulk proteolysis
           in lysosomes appears to be non-critical. This family of
           aspartate proteases is classified by MEROPS as the
           peptidase family A1 (pepsin A, clan AA).
          Length = 325

 Score =  109 bits (275), Expect = 2e-29
 Identities = 48/116 (41%), Positives = 68/116 (58%), Gaps = 2/116 (1%)

Query: 61  RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YC-SNCQAIA 118
           R+   + GGE+M GG D   + GD+ Y  V+RK YWQ  ++ + +   +  C   C+AI 
Sbjct: 152 RDPDAQPGGELMLGGTDPKYYTGDLHYVNVTRKAYWQIHMDQVDVGSGLTLCKGGCEAIV 211

Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
           DTGTSLI GP + +  L K IGAVPL  G   +DC+ +  +P +   LGGK + LT
Sbjct: 212 DTGTSLITGPVEEVRALQKAIGAVPLIQGEYMIDCEKIPTLPVISFSLGGKVYPLT 267



 Score = 86.4 bits (214), Expect = 1e-20
 Identities = 31/43 (72%), Positives = 36/43 (83%)

Query: 5  YQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
            AQYYG I +GTPPQ F V+FDTGSSNLW+PS HCS+L+IAC
Sbjct: 3  MDAQYYGEIGIGTPPQTFTVVFDTGSSNLWVPSVHCSLLDIAC 45


>gnl|CDD|133155 cd05488, Proteinase_A_fungi, Fungal Proteinase A , aspartic
           proteinase superfamily.  Fungal Proteinase A, a
           proteolytic enzyme distributed among a variety of
           organisms, is a member of the aspartic proteinase
           superfamily. In Saccharomyces cerevisiae, targeted to
           the vacuole as a zymogen, activation of proteinases A at
           acidic pH can occur by two different pathways: a
           one-step process to release mature proteinase A,
           involving the intervention of proteinase B, or a
           step-wise pathway via the auto-activation product known
           as pseudo-proteinase A. Once active, S. cerevisiae
           proteinase A is essential to the activities of other
           yeast vacuolar hydrolases, including proteinase B and
           carboxypeptidase Y. The mature enzyme is bilobal, with
           each lobe providing one of the two catalytically
           essential aspartic acid residues in the active site. The
           crystal structure of free proteinase A shows that flap
           loop is atypically pointing directly into the S(1)
           pocket of the enzyme.  Proteinase A preferentially
           hydrolyzes hydrophobic residues such as Phe, Leu or Glu
           at the P1 position and Phe, Ile, Leu or Ala at P1'.
           Moreover, the enzyme is inhibited by IA3, a natural and
           highly specific inhibitor produced by S. cerevisiae.
           This family of aspartate proteases is classified by
           MEROPS as the peptidase family A1 (pepsin A, clan AA).
          Length = 320

 Score =  109 bits (273), Expect = 3e-29
 Identities = 52/127 (40%), Positives = 73/127 (57%), Gaps = 4/127 (3%)

Query: 49  VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
           +D+  F   S +  +S+E+GGE  FGG+D+ +F G IT+ PV RK YW+  +E I + ++
Sbjct: 143 LDEPVF---SFYLGSSEEDGGEATFGGIDESRFTGKITWLPVRRKAYWEVELEKIGLGDE 199

Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
            +   N  A  DTGTSLI  PS +   LN  IGA    NG   VDC  +D +P++     
Sbjct: 200 ELELENTGAAIDTGTSLIALPSDLAEMLNAEIGAKKSWNGQYTVDCSKVDSLPDLTFNFD 259

Query: 168 GKNFTLT 174
           G NFTL 
Sbjct: 260 GYNFTLG 266



 Score = 75.6 bits (186), Expect = 1e-16
 Identities = 37/86 (43%), Positives = 52/86 (60%), Gaps = 8/86 (9%)

Query: 7  AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
          AQY+  I+LGTPPQ+FKVI DTGSSNLW+PS  C   +IAC +    D     ++  N +
Sbjct: 9  AQYFTDITLGTPPQKFKVILDTGSSNLWVPSVKCG--SIACFLHSKYDSSASSTYKANGT 66

Query: 65 D---ENGGEIMFGGVDKDKF-VGDIT 86
          +   + G   + G V +D   +GD+T
Sbjct: 67 EFKIQYGSGSLEGFVSQDTLSIGDLT 92


>gnl|CDD|133153 cd05486, Cathespin_E, Cathepsin E, non-lysosomal aspartic protease.
            Cathepsin E is an intracellular, non-lysosomal aspartic
           protease expressed in a variety of cells and tissues.
           The protease has proposed physiological roles in antigen
           presentation by the MHC class II system, in the
           biogenesis of the vasoconstrictor peptide endothelin,
           and in neurodegeneration associated with brain ischemia
           and aging. Cathepsin E is the only A1 aspartic protease
           that exists as a homodimer with a disulfide bridge
           linking the two monomers. Like many other aspartic
           proteases, it is synthesized as a zymogen which is
           catalytically inactive towards its natural substrates at
           neutral pH and which auto-activates in an acidic
           environment. The overall structure follows the general
           fold of aspartic proteases of the A1 family, it is
           composed of two structurally similar beta barrel lobes,
           each lobe contributing an aspartic acid residue to form
           a catalytic dyad that acts to cleave the substrate
           peptide bond. The catalytic Asp residues are contained
           in an Asp-Thr-Gly-Ser/thr motif in both N- and
           C-terminal lobes of the enzyme. The aspartic acid
           residues act together to allow a water molecule to
           attack the peptide bond. One aspartic acid residue (in
           its deprotonated form) activates the attacking water
           molecule, whereas the other aspartic acid residue (in
           its protonated form) polarizes the peptide carbonyl,
           increasing its susceptibility to attack. This family of
           aspartate proteases is classified by MEROPS as the
           peptidase family A1 (pepsin A, clan AA).
          Length = 316

 Score =  107 bits (269), Expect = 1e-28
 Identities = 50/116 (43%), Positives = 73/116 (62%), Gaps = 3/116 (2%)

Query: 61  RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
           RN +  +GGE++FGG D  +F G + + PV+ +GYWQ  +++I++   V +CS+ CQAI 
Sbjct: 144 RNPNSADGGELVFGGFDTSRFSGQLNWVPVTVQGYWQIQLDNIQVGGTVIFCSDGCQAIV 203

Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
           DTGTSLI GPS  I +L   IGA    +G   VDC  L  MP+V   + G  ++L+
Sbjct: 204 DTGTSLITGPSGDIKQLQNYIGATA-TDGEYGVDCSTLSLMPSVTFTINGIPYSLS 258



 Score = 64.9 bits (158), Expect = 6e-13
 Identities = 24/33 (72%), Positives = 29/33 (87%)

Query: 9  YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
          Y+G IS+GTPPQ F VIFDTGSSNLW+PS +C+
Sbjct: 1  YFGQISIGTPPQNFTVIFDTGSSNLWVPSIYCT 33


>gnl|CDD|133138 cd05471, pepsin_like, Pepsin-like aspartic proteases, bilobal
           enzymes that cleave bonds in peptides at acidic pH.
           Pepsin-like aspartic proteases are found in mammals,
           plants, fungi and bacteria. These well known and
           extensively characterized enzymes include pepsins,
           chymosin, renin, cathepsins, and fungal aspartic
           proteases. Several have long been known to be medically
           (renin, cathepsin D and E, pepsin) or commercially
           (chymosin) important. Structurally, aspartic proteases
           are bilobal enzymes, each lobe contributing a catalytic
           Aspartate residue, with an extended active site cleft
           localized between the two lobes of the molecule. The N-
           and C-terminal domains, although structurally related by
           a 2-fold axis, have only limited sequence homology
           except the vicinity of the active site. This suggests
           that the enzymes evolved by an ancient duplication
           event.  Most members of the pepsin family specifically
           cleave bonds in peptides that are at least six residues
           in length, with hydrophobic residues in both the P1 and
           P1' positions. The active site is located at the groove
           formed by the two lobes, with an extended loop
           projecting over the cleft to form an 11-residue flap,
           which encloses substrates and inhibitors in the active
           site. Specificity is determined by nearest-neighbor
           hydrophobic residues surrounding the catalytic
           aspartates, and by three residues in the flap.The
           enzymes are mostly secreted from cells as inactive
           proenzymes that activate autocatalytically at acidic pH.
           This family of aspartate proteases is classified by
           MEROPS as the peptidase family A1 (pepsin A, clan AA).
          Length = 283

 Score = 97.5 bits (243), Expect = 4e-25
 Identities = 45/117 (38%), Positives = 63/117 (53%), Gaps = 10/117 (8%)

Query: 61  RNASDENGGEIMFGGVDKDKFVGDITYSPV--SRKGYWQFGVESIKI---EKNVYCSNCQ 115
           R+    NGGE+ FGG+D  K+ GD+TY+PV  +  GYWQ  ++ I +             
Sbjct: 145 RDGDGGNGGELTFGGIDPSKYTGDLTYTPVVSNGPGYWQVPLDGISVGGKSVISSSGGGG 204

Query: 116 AIADTGTSLIIGPSKVIAELNKLIGA-VPLANGPAKVDCDNLDKMPNVDI----ILG 167
           AI D+GTSLI  PS V   + K +GA V  ++G   VDC   D +P++      ILG
Sbjct: 205 AIVDSGTSLIYLPSSVYDAILKALGAAVSSSDGGYGVDCSPCDTLPDITFTFLWILG 261



 Score = 68.2 bits (167), Expect = 3e-14
 Identities = 23/40 (57%), Positives = 30/40 (75%)

Query: 9  YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
          YYG I++GTPPQ+F VIFDTGSS LW+PS +C+  +    
Sbjct: 1  YYGEITIGTPPQKFSVIFDTGSSLLWVPSSNCTSCSCQKH 40


>gnl|CDD|133144 cd05477, gastricsin, Gastricsins, asparate proteases produced in
           gastric mucosa.  Gastricsin is also called pepsinogen C.
           Gastricsins are produced in gastric mucosa of mammals.
           It is synthesized by the chief cells in the stomach as
           an inactive zymogen. It is self-converted to a mature
           enzyme under acidic conditions. Human gastricsin is
           distributed throughout all parts of the stomach.
           Gastricsin is synthesized as an inactive progastricsin
           that has an approximately 40 residue prosequence. It is
           self-converting to a mature enzyme being triggered by a
           drop in pH from neutrality to acidic conditions. Like
           other aspartic proteases, gastricsin are characterized
           by two catalytic aspartic residues at the active site,
           and display optimal activity at acidic pH. Mature enzyme
           has a pseudo-2-fold symmetry that passes through the
           active site between the catalytic aspartate residues.
           Structurally, aspartic proteases are bilobal enzymes,
           each lobe contributing a catalytic aspartate residue,
           with an extended active site cleft localized between the
           two lobes of the molecule. One lobe may be evolved from
           the other through ancient gene-duplication event.
           Although the three-dimensional structures of the two
           lobes are very similar, the amino acid sequences are
           more divergent, except for the conserved catalytic site
           motif. This family of aspartate proteases is classified
           by MEROPS as the peptidase family A1 (pepsin A, clan
           AA).
          Length = 318

 Score = 96.9 bits (241), Expect = 1e-24
 Identities = 43/115 (37%), Positives = 69/115 (60%), Gaps = 3/115 (2%)

Query: 62  NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIA 118
               + GGE++FGGVD + + G I ++PV+ + YWQ G++  +I  +   +CS  CQAI 
Sbjct: 147 GQQGQQGGELVFGGVDNNLYTGQIYWTPVTSETYWQIGIQGFQINGQATGWCSQGCQAIV 206

Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
           DTGTSL+  P +V++ L + IGA     G   V+C+N+  +P +   + G +F L
Sbjct: 207 DTGTSLLTAPQQVMSTLMQSIGAQQDQYGQYVVNCNNIQNLPTLTFTINGVSFPL 261



 Score = 63.4 bits (154), Expect = 2e-12
 Identities = 24/34 (70%), Positives = 27/34 (79%)

Query: 7  AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
            YYG IS+GTPPQ F V+FDTGSSNLW+PS  C
Sbjct: 2  MSYYGEISIGTPPQNFLVLFDTGSSNLWVPSVLC 35


>gnl|CDD|133154 cd05487, renin_like, Renin stimulates production of angiotensin and
           thus affects blood pressure.  Renin, also known as
           angiotensinogenase, is a circulating enzyme that
           participates in the renin-angiotensin system that
           mediates extracellular volume, arterial
           vasoconstriction, and consequently mean arterial blood
           pressure. The enzyme is secreted by the kidneys from
           specialized juxtaglomerular cells in response to
           decreases in glomerular filtration rate (a consequence
           of low blood volume), diminished filtered sodium
           chloride and sympathetic nervous system innervation. The
           enzyme circulates in the blood stream and hydrolyzes
           angiotensinogen secreted from the liver into the peptide
           angiotensin I. Angiotensin I is further cleaved in the
           lungs by endothelial bound angiotensin converting enzyme
           (ACE) into angiotensin II, the final active peptide.
           Renin is a member of the aspartic protease family.
           Structurally, aspartic proteases are bilobal enzymes,
           each lobe contributing a catalytic Aspartate  residue,
           with an extended active site cleft localized between the
           two lobes of the molecule. The N- and C-terminal
           domains, although structurally related by a 2-fold axis,
           have only limited sequence homology except the vicinity
           of the active site. This suggests that the enzymes
           evolved by an ancient duplication event. The active site
           is located at the groove formed by the two lobes, with
           an extended loop projecting over the cleft to form an
           11-residue flap, which encloses substrates and
           inhibitors in the active site. Specificity is determined
           by nearest-neighbor hydrophobic residues surrounding the
           catalytic aspartates, and by three residues in the flap.
           The enzymes are mostly secreted from cells as inactive
           proenzymes that activate autocatalytically at acidic pH.
           This family of aspartate proteases is classified by
           MEROPS as the peptidase family A1 (pepsin A, clan AA).
          Length = 326

 Score = 87.5 bits (217), Expect = 4e-21
 Identities = 51/176 (28%), Positives = 85/176 (48%), Gaps = 21/176 (11%)

Query: 7   AQYYGTISLGTPPQEFKVI---FDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNA 63
           A++ G + +G P Q    +   FD       I SQ          V K+      + R++
Sbjct: 111 AKFDGVLGMGYPKQAIGGVTPVFDN------IMSQG---------VLKEDVFSVYYSRDS 155

Query: 64  SDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIADTG 121
           S   GGEI+ GG D   + GD  Y   S+ G+WQ  ++ + +  +   C + C A+ DTG
Sbjct: 156 SHSLGGEIVLGGSDPQHYQGDFHYINTSKTGFWQIQMKGVSVGSSTLLCEDGCTAVVDTG 215

Query: 122 TSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
            S I GP+  I++L + +GA         V C+ +  +P++   LGGK +TL+ ++
Sbjct: 216 ASFISGPTSSISKLMEALGAKERLGDYV-VKCNEVPTLPDISFHLGGKEYTLSSSD 270



 Score = 76.7 bits (189), Expect = 4e-17
 Identities = 38/70 (54%), Positives = 44/70 (62%), Gaps = 13/70 (18%)

Query: 7  AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
           QYYG I +GTPPQ FKV+FDTGSSNLW+PS  CS L  AC       V H+ + +ASD 
Sbjct: 7  TQYYGEIGIGTPPQTFKVVFDTGSSNLWVPSSKCSPLYTAC-------VTHNLY-DASDS 58

Query: 66 ----ENGGEI 71
              ENG E 
Sbjct: 59 STYKENGTEF 68


>gnl|CDD|240300 PTZ00165, PTZ00165, aspartyl protease; Provisional.
          Length = 482

 Score = 69.8 bits (171), Expect = 2e-14
 Identities = 23/34 (67%), Positives = 28/34 (82%)

Query: 8   QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
           QY+G I +GTPP+ F V+FDTGSSNLWIPS+ C 
Sbjct: 120 QYFGEIQVGTPPKSFVVVFDTGSSNLWIPSKECK 153



 Score = 69.0 bits (169), Expect = 4e-14
 Identities = 40/104 (38%), Positives = 54/104 (51%), Gaps = 15/104 (14%)

Query: 69  GEIMFGGVDKDKFV--GD-ITYSPVSRKGYWQFGVESIKIE-KNV-YCSN-CQAIADTGT 122
           G I FG  D  K+   G  I + PV    YW+  V  I I+ K++ +C   C+A  DTG+
Sbjct: 278 GSISFGSADP-KYTLEGHKIWWFPVISTDYWEIEVVDILIDGKSLGFCDRKCKAAIDTGS 336

Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
           SLI GPS VI   N L+  +PL       DC N D +P +  +L
Sbjct: 337 SLITGPSSVI---NPLLEKIPLEE-----DCSNKDSLPRISFVL 372


>gnl|CDD|133161 cd06097, Aspergillopepsin_like, Aspergillopepsin_like, aspartic
           proteases of fungal origin.  The members of this family
           are aspartic proteases of fungal origin, including
           aspergillopepsin, rhizopuspepsin, endothiapepsin, and
           rodosporapepsin. The various fungal species in this
           family may be the most economically important genus of
           fungi. They may serve as virulence factors or as
           industrial aids. For example, Aspergillopepsin from A.
           fumigatus is involved in invasive aspergillosis owing to
           its elastolytic activity and Aspergillopepsins from the
           mold A. saitoi are used in fermentation industry.
           Aspartic proteinases are a group of proteolytic enzymes
           in which the scissile peptide bond is attacked by a
           nucleophilic water molecule activated by two aspartic
           residues in a DT(S)G motif at the active site. They have
           a similar fold composed of two beta-barrel domains.
           Between the N-terminal and C-terminal domains, each of
           which contributes one catalytic aspartic residue, there
           is an extended active-site cleft capable of interacting
           with multiple residues of a substrate. Although members
           of the aspartic protease family of enzymes have very
           similar three-dimensional structures and catalytic
           mechanisms, each has unique substrate specificity. The
           members of this family has an optimal acidic pH (5.5)
           and cleaves protein substrates with similar specificity
           to that of porcine pepsin A, preferring hydrophobic
           residues at P1 and P1' in the cleave site.  This family
           of aspartate proteases is classified by MEROPS as the
           peptidase family A1 (pepsin A, clan AA).
          Length = 278

 Score = 61.9 bits (151), Expect = 6e-12
 Identities = 24/62 (38%), Positives = 37/62 (59%), Gaps = 3/62 (4%)

Query: 73  FGGVDKDKFVGDITYSPVSR-KGYWQFGVESIKI--EKNVYCSNCQAIADTGTSLIIGPS 129
           FG +D+ K+ G+I+++PV    G+WQF   S  +  +     S   AIADTGT+LI+ P 
Sbjct: 155 FGYIDESKYKGEISWTPVDNSSGFWQFTSTSYTVGGDAPWSRSGFSAIADTGTTLILLPD 214

Query: 130 KV 131
            +
Sbjct: 215 AI 216



 Score = 46.1 bits (110), Expect = 2e-06
 Identities = 14/38 (36%), Positives = 19/38 (50%)

Query: 9  YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIA 46
          Y   + +GTPPQ   +  DTGSS+LW+ S         
Sbjct: 1  YLTPVKIGTPPQTLNLDLDTGSSDLWVFSSETPAAQQG 38


>gnl|CDD|133137 cd05470, pepsin_retropepsin_like, Cellular and retroviral
          pepsin-like aspartate proteases.  This family includes
          both cellular and retroviral pepsin-like aspartate
          proteases. The cellular pepsin and pepsin-like enzymes
          are twice as long as their retroviral counterparts. The
          cellular pepsin-like aspartic proteases are found in
          mammals, plants, fungi and bacteria. These well known
          and extensively characterized enzymes include pepsins,
          chymosin, rennin, cathepsins, and fungal aspartic
          proteases. Several have long been known to be medically
          (rennin, cathepsin D and E, pepsin) or commercially
          (chymosin) important. The eukaryotic pepsin-like
          proteases contain two domains possessing similar
          topological features. The N- and C-terminal domains,
          although structurally related by a 2-fold axis, have
          only limited sequence homology except in the vicinity
          of the active site. This suggests that the enzymes
          evolved by an ancient duplication event. The eukaryotic
          pepsin-like proteases have two active site ASP residues
          with each N- and C-terminal lobe contributing one
          residue. While the fungal and mammalian pepsins are
          bilobal proteins, retropepsins function as dimers and
          the monomer resembles structure of the N- or C-terminal
          domains of eukaryotic enzyme. The active site motif
          (Asp-Thr/Ser-Gly-Ser) is conserved between the
          retroviral and eukaryotic proteases and between the
          N-and C-terminal of eukaryotic pepsin-like proteases.
          The retropepsin-like family includes pepsin-like
          aspartate proteases from retroviruses, retrotransposons
          and retroelements; as well as eukaryotic
          DNA-damage-inducible proteins (DDIs), and bacterial
          aspartate peptidases. Retropepsin is synthesized as
          part of the POL polyprotein that contains an
          aspartyl-protease, a reverse transcriptase, RNase H,
          and an integrase. The POL polyprotein undergoes
          specific enzymatic cleavage to yield the mature
          proteins. This family of aspartate proteases is
          classified by MEROPS as the peptidase family A1 (pepsin
          A) and A2 (retropepsin family).
          Length = 109

 Score = 57.4 bits (139), Expect = 2e-11
 Identities = 24/55 (43%), Positives = 29/55 (52%), Gaps = 9/55 (16%)

Query: 11 GTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD 65
            I +GTPPQ F V+ DTGSSNLW+PS  C  L I           HS + + S 
Sbjct: 1  IEIGIGTPPQTFNVLLDTGSSNLWVPSVDCQSLAIYS---------HSSYDDPSA 46


>gnl|CDD|133141 cd05474, SAP_like, SAPs, pepsin-like proteinases secreted from
          pathogens to degrade host proteins.  SAPs (Secreted
          aspartic proteinases) are secreted from a group of
          pathogenic fungi, predominantly Candida species. They
          are secreted from the pathogen to degrade host
          proteins. SAP is one of the most significant
          extracellular hydrolytic enzymes produced by C.
          albicans. SAP proteins, encoded by a family of 10 SAP
          genes. All 10 SAP genes of C. albicans encode
          preproenzymes, approximately 60 amino acid longer than
          the mature enzyme, which are processed when transported
          via the secretory pathway. The mature enzymes contain
          sequence motifs typical for all aspartyl proteinases,
          including the two conserved aspartate residues other
          active site and conserved cysteine residues implicated
          in the maintenance of the three-dimensional structure.
          Most Sap proteins contain putative N-glycosylation
          sites, but it remains to be determined which Sap
          proteins are glycosylated. This family of aspartate
          proteases is classified by MEROPS as the peptidase
          family A1 (pepsin A, clan AA).  The overall structure
          of Sap protein conforms to the classical aspartic
          proteinase fold typified by pepsin. SAP is a bilobal
          enzyme, each lobe contributing a catalytic Asp residue,
          with an extended active site cleft localized between
          the two lobes of the molecule. One lobe may be evolved
          from the other through ancient gene-duplication event.
          More recently evolved enzymes have similar
          three-dimensional structures, however their amino acid
          sequences are more divergent except for the conserved
          catalytic site motif. This family of aspartate
          proteases is classified by MEROPS as the peptidase
          family A1 (pepsin A, clan AA).
          Length = 295

 Score = 53.0 bits (128), Expect = 9e-09
 Identities = 16/28 (57%), Positives = 22/28 (78%)

Query: 9  YYGTISLGTPPQEFKVIFDTGSSNLWIP 36
          Y   +S+GTPPQ+  V+ DTGSS+LW+P
Sbjct: 3  YSAELSVGTPPQKVTVLLDTGSSDLWVP 30



 Score = 51.0 bits (123), Expect = 4e-08
 Identities = 30/116 (25%), Positives = 47/116 (40%), Gaps = 13/116 (11%)

Query: 69  GEIMFGGVDKDKFVGDITYSPVSRKGYW-----------QFGVESIKIEKNVYCSNCQAI 117
           G I+FGGVD  K+ GD+   P+                    V        +   N  A+
Sbjct: 123 GSILFGGVDTAKYSGDLVTLPIVNDNGGSEPSELSVTLSSISVNGSSGNTTLLSKNLPAL 182

Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK-VDCDNLDKMPNVDIILGGKNFT 172
            D+GT+L   PS ++  + K +GA   ++     VDCD  D   ++    GG   +
Sbjct: 183 LDSGTTLTYLPSDIVDAIAKQLGATYDSDEGLYVVDCDAKDDG-SLTFNFGGATIS 237


>gnl|CDD|140051 PTZ00013, PTZ00013, plasmepsin 4 (PM4); Provisional.
          Length = 450

 Score = 51.1 bits (122), Expect = 5e-08
 Identities = 30/88 (34%), Positives = 45/88 (51%), Gaps = 17/88 (19%)

Query: 9   YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENG 68
           +YG   +G   Q+F +IFDTGS+NLW+PS+ C   +I CS+         +  ++S    
Sbjct: 139 FYGEGEVGDNHQKFMLIFDTGSANLWVPSKKCD--SIGCSI--------KNLYDSSKSKS 188

Query: 69  GEIMFGGVDKDKFVGDITYSPVSRKGYW 96
            E       KD    DITY   + KG++
Sbjct: 189 YE-------KDGTKVDITYGSGTVKGFF 209



 Score = 44.6 bits (105), Expect = 9e-06
 Identities = 28/110 (25%), Positives = 52/110 (47%), Gaps = 4/110 (3%)

Query: 65  DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADTGTSL 124
           D + G +  GG+++  + G+ITY  ++   YWQ  ++ +   K         I D+GT+ 
Sbjct: 285 DVHAGYLTIGGIEEKFYEGNITYEKLNHDLYWQIDLD-VHFGKQTM-QKANVIVDSGTTT 342

Query: 125 IIGPSKVIAELNKLIGAVPLANGPAKV-DCDNLDKMPNVDIILGGKNFTL 173
           I  PS+ + +    +  + +   P  V  CDN  +MP ++       +TL
Sbjct: 343 ITAPSEFLNKFFANLNVIKVPFLPFYVTTCDN-KEMPTLEFKSANNTYTL 391


>gnl|CDD|133140 cd05473, beta_secretase_like, Beta-secretase, aspartic-acid
           protease important in the pathogenesis of Alzheimer's
           disease.  Beta-secretase also called BACE (beta-site of
           APP cleaving enzyme) or memapsin-2. Beta-secretase is an
           aspartic-acid protease important in the pathogenesis of
           Alzheimer's disease, and in the formation of myelin
           sheaths in peripheral nerve cells. It cleaves amyloid
           precursor protein (APP) to reveal the N-terminus of the
           beta-amyloid peptides. The beta-amyloid peptides are the
           major components of the amyloid plaques formed in the
           brain of patients with Alzheimer's disease (AD). Since
           BACE mediates one of the cleavages responsible for
           generation of AD, it is regarded as a potential target
           for pharmacological intervention in AD. Beta-secretase
           is a member of pepsin family of aspartic proteases. Same
           as other aspartic proteases, beta-secretase is a bilobal
           enzyme, each lobe contributing a catalytic Asp residue,
           with an extended active site cleft localized between the
           two lobes of the molecule.  The N- and C-terminal
           domains, although structurally related by a 2-fold axis,
           have only limited sequence homology except the vicinity
           of the active site. This suggests that the enzymes
           evolved by an ancient duplication event. The enzymes
           specifically cleave bonds in peptides which have at
           least six residues in length with hydrophobic residues
           in both the P1 and P1' positions. The active site is
           located at the groove formed by the two lobes, with an
           extended loop projecting over the cleft to form an
           11-residue flap, which encloses substrates and
           inhibitors in the active site. Specificity is determined
           by nearest-neighbor hydrophobic residues surrounding the
           catalytic aspartates, and by three residues in the flap.
           The enzymes are mostly secreted from cells as inactive
           proenzymes that activate autocatalytically at acidic pH.
           This family of aspartate proteases is classified by
           MEROPS as the peptidase family A1 (pepsin A, clan AA).
          Length = 364

 Score = 47.4 bits (113), Expect = 8e-07
 Identities = 26/85 (30%), Positives = 43/85 (50%), Gaps = 12/85 (14%)

Query: 56  GHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN------- 108
           G     +AS   GG ++ GG+D   + GDI Y+P+  + Y++  V  +K+E         
Sbjct: 148 GLPVNGSASGTVGGSMVIGGIDPSLYKGDIWYTPIREEWYYE--VIILKLEVGGQSLNLD 205

Query: 109 --VYCSNCQAIADTGTSLIIGPSKV 131
              Y  + +AI D+GT+ +  P KV
Sbjct: 206 CKEYNYD-KAIVDSGTTNLRLPVKV 229



 Score = 37.4 bits (87), Expect = 0.002
 Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 9  YYGTISLGTPPQEFKVIFDTGSSN 32
          YY  + +GTPPQ+  ++ DTGSSN
Sbjct: 4  YYIEMLIGTPPQKLNILVDTGSSN 27


>gnl|CDD|133143 cd05476, pepsin_A_like_plant, Chroloplast Nucleoids DNA-binding
          Protease and Nucellin, pepsin-like aspartic proteases
          from plants.  This family contains pepsin like aspartic
          proteases from plants including Chloroplast Nucleoids
          DNA-binding Protease and Nucellin. Chloroplast
          Nucleoids DNA-binding Protease catalyzes the
          degradation of ribulose-1,5-bisphosphate
          carboxylase/oxygenase (Rubisco) in senescent leaves of
          tobacco and Nucellins are important regulators of
          nucellar cell's progressive degradation after ovule
          fertilization. Structurally, aspartic proteases are
          bilobal enzymes, each lobe contributing a catalytic Asp
          residue, with an extended active site cleft localized
          between the two lobes of the molecule. The N- and
          C-terminal domains, although structurally related by a
          2-fold axis, have only limited sequence homology except
          the vicinity of the active site. This suggests that the
          enzymes evolved by an ancient duplication event.  The
          enzymes specifically cleave bonds in peptides which
          have at least six residues in length with hydrophobic
          residues in both the P1 and P1' positions. The active
          site is located at the groove formed by the two lobes,
          with an extended loop projecting over the cleft to form
          an 11-residue flap, which encloses substrates and
          inhibitors in the active site. Specificity is
          determined by nearest-neighbor hydrophobic residues
          surrounding the catalytic aspartates, and by three
          residues in the flap.  The enzymes are mostly secreted
          from cells as inactive proenzymes that activate
          autocatalytically at acidic pH.
          Length = 265

 Score = 45.7 bits (109), Expect = 3e-06
 Identities = 15/29 (51%), Positives = 19/29 (65%)

Query: 8  QYYGTISLGTPPQEFKVIFDTGSSNLWIP 36
          +Y  T+S+GTPPQ F +I DTGS   W  
Sbjct: 1  EYLVTLSIGTPPQPFSLIVDTGSDLTWTQ 29


>gnl|CDD|140176 PTZ00147, PTZ00147, plasmepsin-1; Provisional.
          Length = 453

 Score = 45.2 bits (107), Expect = 5e-06
 Identities = 18/33 (54%), Positives = 22/33 (66%)

Query: 9   YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
            YG   LG   Q+F  IFDTGS+NLW+PS  C+
Sbjct: 140 SYGEAKLGDNGQKFNFIFDTGSANLWVPSIKCT 172



 Score = 35.6 bits (82), Expect = 0.010
 Identities = 29/115 (25%), Positives = 52/115 (45%), Gaps = 14/115 (12%)

Query: 65  DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQ------FGVESIKIEKNVYCSNCQAIA 118
           D++ G +  GG+++  + G +TY  ++   YWQ      FG        NV       I 
Sbjct: 286 DKHKGYLTIGGIEERFYEGPLTYEKLNHDLYWQVDLDVHFG--------NVSSEKANVIV 337

Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
           D+GTS+I  P++ + +  + +    +   P  V   N  K+P ++     K +TL
Sbjct: 338 DSGTSVITVPTEFLNKFVESLDVFKVPFLPLYVTTCNNTKLPTLEFRSPNKVYTL 392


>gnl|CDD|133139 cd05472, cnd41_like, Chloroplast Nucleoids DNA-binding Protease,
          catalyzes the degradation of ribulose-1,5-bisphosphate
          carboxylase/oxygenase.  Chloroplast Nucleoids
          DNA-binding Protease catalyzes the degradation of
          ribulose-1,5-bisphosphate carboxylase/oxygenase
          (Rubisco) in senescent leaves of tobacco. Antisense
          tobacco with reduced amount of CND41 maintained green
          leaves and constant protein levels, especially Rubisco.
           CND41 has DNA-binding as well as aspartic protease
          activities. The pepsin-like aspartic protease domain is
          located at the C-terminus of the protein. The enzyme is
          characterized by having two aspartic protease catalytic
          site motifs, the Asp-Thr-Gly-Ser in the N-terminal and
          Asp-Ser-Gly-Ser in the C-terminal region. Aspartic
          proteases are bilobal enzymes, each lobe contributing a
          catalytic Asp residue, with an extended active site
          cleft localized between the two lobes of the molecule.
          One lobe may be evolved from the other through ancient
          gene-duplication event. This family of aspartate
          proteases is classified by MEROPS as the peptidase
          family A1 (pepsin A, clan AA).
          Length = 299

 Score = 39.9 bits (94), Expect = 3e-04
 Identities = 15/33 (45%), Positives = 20/33 (60%)

Query: 8  QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
          +Y  T+ LGTP ++  VI DTGS   W+  Q C
Sbjct: 1  EYVVTVGLGTPARDQTVIVDTGSDLTWVQCQPC 33


>gnl|CDD|178691 PLN03146, PLN03146, aspartyl protease family protein; Provisional.
          Length = 431

 Score = 37.3 bits (87), Expect = 0.002
 Identities = 16/41 (39%), Positives = 19/41 (46%), Gaps = 3/41 (7%)

Query: 3   SSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWI---PSQHC 40
            S   +Y   IS+GTPP     I DTGS  +W    P   C
Sbjct: 79  ISNGGEYLMNISIGTPPVPILAIADTGSDLIWTQCKPCDDC 119


>gnl|CDD|133160 cd06096, Plasmepsin_5, Plasmepsins are a class of aspartic
           proteinases produced by the plasmodium parasite.  The
           family contains a group of aspartic proteinases
           homologous to plasmepsin 5.  Plasmepsins are a class of
           at least 10 enzymes produced by the plasmodium parasite.
           Through their haemoglobin-degrading activity, they are
           an important cause of symptoms in malaria sufferers.
           This family of enzymes is a potential target for
           anti-malarial drugs. Plasmepsins are aspartic acid
           proteases, which means their active site contains two
           aspartic acid residues. These two aspartic acid residue
           act respectively as proton donor and proton acceptor,
           catalyzing the hydrolysis of peptide bond in proteins.
           Aspartic proteinases are composed of two structurally
           similar beta barrel lobes, each lobe contributing an
           aspartic acid residue to form a catalytic dyad that acts
           to cleave the substrate peptide bond. The catalytic Asp
           residues are contained in an Asp-Thr-Gly-Ser/thr motif
           in both N- and C-terminal lobes of the enzyme.  There
           are four types of plasmepsins, closely related but
           varying in the specificity of cleavage site. The name
           plasmepsin may come from plasmodium (the organism) and
           pepsin (a common aspartic acid protease with similar
           molecular structure). This family of aspartate proteases
           is classified by MEROPS as the peptidase family A1
           (pepsin A, clan AA).
          Length = 326

 Score = 36.2 bits (84), Expect = 0.005
 Identities = 21/88 (23%), Positives = 39/88 (44%), Gaps = 14/88 (15%)

Query: 66  ENGGEIMFGGVDKDKFVG----------DITYSPVSRKGYWQFGVESIKI--EKN--VYC 111
           E+GGE+  GG DKD  V            I ++P++RK Y+   +E + +    +     
Sbjct: 170 EDGGELTIGGYDKDYTVRNSSIGNNKVSKIVWTPITRKYYYYVKLEGLSVYGTTSNSGNT 229

Query: 112 SNCQAIADTGTSLIIGPSKVIAELNKLI 139
                + D+G++L   P  +  ++N   
Sbjct: 230 KGLGMLVDSGSTLSHFPEDLYNKINNFF 257



 Score = 35.4 bits (82), Expect = 0.009
 Identities = 16/34 (47%), Positives = 21/34 (61%)

Query: 7  AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
          A Y+  I +G PPQ+  +I DTGSS+L  P   C
Sbjct: 2  AYYFIDIFIGNPPQKQSLILDTGSSSLSFPCSQC 35


>gnl|CDD|133142 cd05475, nucellin_like, Nucellins, plant aspartic proteases
          specifically expressed in nucellar cells during
          degradation.  Nucellins are important regulators of
          nucellar cell's progressive degradation after ovule
          fertilization. This degradation is a characteristic of
          programmed cell death. Nucellins are plant aspartic
          proteases specifically expressed in nucellar cells
          during degradation. The enzyme is characterized by
          having two aspartic protease catalytic site motifs, the
          Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser
          in the C-terminal region, and two other regions nearly
          identical to two regions of plant aspartic proteases.
          Aspartic proteases are bilobal enzymes, each lobe
          contributing a catalytic Asp residue, with an extended
          active site cleft localized between the two lobes of
          the molecule. One lobe may be evolved from the other
          through ancient gene-duplication event. Although the
          three-dimensional structures of the two lobes are very
          similar, the amino acid sequences are more divergent,
          except for the conserved catalytic site motif.
          Length = 273

 Score = 29.6 bits (67), Expect = 0.63
 Identities = 12/28 (42%), Positives = 18/28 (64%)

Query: 8  QYYGTISLGTPPQEFKVIFDTGSSNLWI 35
           YY TI++G PP+ + +  DTGS   W+
Sbjct: 2  YYYVTINIGNPPKPYFLDIDTGSDLTWL 29



 Score = 26.2 bits (58), Expect = 8.9
 Identities = 12/45 (26%), Positives = 20/45 (44%), Gaps = 7/45 (15%)

Query: 1   MRSSYQAQYYGT-------ISLGTPPQEFKVIFDTGSSNLWIPSQ 38
           MR   Q ++Y             T  +  +V+FD+GSS  +  +Q
Sbjct: 150 MRRESQKKHYSPGPASLLFNGQPTGGKGLEVVFDSGSSYTYFNAQ 194


>gnl|CDD|212583 cd11688, THUMP, THUMP domain, predicted to bind RNA.  The THUMP
           domain is named after THioUridine synthases, RNA
           Methyltransferases and Pseudo-uridine synthases. It is
           predicted to be an RNA-binding domain and  probably
           functions by delivering a variety of RNA modification
           enzymes to their targets.
          Length = 148

 Score = 28.6 bits (64), Expect = 1.0
 Identities = 9/32 (28%), Positives = 16/32 (50%)

Query: 134 ELNKLIGAVPLANGPAKVDCDNLDKMPNVDII 165
           E+   +G   +     +VD DN D + NV++ 
Sbjct: 110 EIAMKVGDAIVDAFNPEVDLDNPDIVVNVEVH 141


>gnl|CDD|235966 PRK07207, PRK07207, ribonucleotide-diphosphate reductase subunit
           alpha; Validated.
          Length = 965

 Score = 28.4 bits (64), Expect = 2.3
 Identities = 14/22 (63%), Positives = 15/22 (68%), Gaps = 1/22 (4%)

Query: 112 SNCQAIADTGT-SLIIGPSKVI 132
           SNC AIA T T S IIG S+ I
Sbjct: 766 SNCMAIAPTATISNIIGVSQSI 787


>gnl|CDD|182788 PRK10862, PRK10862, SoxR reducing system protein RseC; Provisional.
          Length = 154

 Score = 27.6 bits (62), Expect = 2.5
 Identities = 12/36 (33%), Positives = 20/36 (55%), Gaps = 2/36 (5%)

Query: 96  WQFGVESIKIEKNVYCSNCQAIADTGTSLI--IGPS 129
           WQ G+  ++ E    CS+C + A  G+ L+  +GP 
Sbjct: 11  WQNGIALLRCEVKAGCSSCASRAGCGSRLLNKLGPQ 46


>gnl|CDD|235624 PRK05850, PRK05850, acyl-CoA synthetase; Validated.
          Length = 578

 Score = 27.6 bits (62), Expect = 3.7
 Identities = 10/42 (23%), Positives = 22/42 (52%)

Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLD 157
            + DT  S+++  S V+ ++ + +   P  + P  ++ D LD
Sbjct: 104 VLRDTSPSVVLTTSAVVDDVTEYVAPQPGQSAPPVIEVDLLD 145


>gnl|CDD|188492 TIGR03977, rSAM_pair_HxsC, His-Xaa-Ser system radical SAM maturase
           HxsC.  This model describes the downstream member, HxsC,
           of a pair of uncharacterized radical SAM proteins,
           regularly found in the context of a small protein with
           four or more repeats of the tripeptide His-Xaa-Ser
           (HXS). This enzyme appears to be part of a peptide
           modification system.
          Length = 292

 Score = 27.2 bits (61), Expect = 4.1
 Identities = 8/26 (30%), Positives = 11/26 (42%), Gaps = 3/26 (11%)

Query: 93  KGYWQFGVESIKIEKNVY---CSNCQ 115
              W +  +SI   KN Y   C +C 
Sbjct: 255 PSLWPYARQSISDWKNYYPEECEDCA 280


>gnl|CDD|182461 PRK10438, PRK10438, C-N hydrolase family amidase; Provisional.
          Length = 256

 Score = 27.4 bits (61), Expect = 4.3
 Identities = 14/67 (20%), Positives = 27/67 (40%), Gaps = 2/67 (2%)

Query: 89  PVSRKGYWQFGVESIKIEKNVYCSNCQAIADTGTSLII-GPSKVIAELNKLIGAVPLANG 147
           P  R  +WQ  + +  IE   Y + C  +   G      G S++I    ++I      + 
Sbjct: 166 PAPRSLHWQTLLTARAIENQAYVAGCNRVGSDGNGHHYRGDSRIINPQGEIIATAE-PHQ 224

Query: 148 PAKVDCD 154
             ++D +
Sbjct: 225 ATRIDAE 231


>gnl|CDD|162204 TIGR01107, Na_K_ATPase_bet, Sodium Potassium ATPase beta subunit.
           This model describes the Na+/K+ ATPase beta subunit in
           eukaryotes. Na+/K+ ATPase(also called Sodium-Potassium
           pump) is intimately associated with the plasma membrane.
           It couples the energy released by the hydrolysis of ATP
           to extrude 3 Na+ ions, with the concomitant uptake of
           2K+ ions, against their ionic gradients [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 289

 Score = 27.3 bits (61), Expect = 4.8
 Identities = 14/52 (26%), Positives = 22/52 (42%)

Query: 113 NCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDI 164
           NC  + D       G   +I ++N++IG  P   G   VDC       + +I
Sbjct: 160 NCSGLNDPTFGYSEGKPCIIIKMNRIIGFKPKNPGVPPVDCTGKRDEDDENI 211


>gnl|CDD|227658 COG5360, COG5360, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 566

 Score = 27.5 bits (61), Expect = 4.9
 Identities = 9/27 (33%), Positives = 16/27 (59%), Gaps = 1/27 (3%)

Query: 136 NKLIGAVPLANGPAKVDCDNLDKMPNV 162
           ++L+G  PL +GP+KV    +D +   
Sbjct: 409 SRLLGP-PLISGPSKVQAKRIDDVGGQ 434


>gnl|CDD|222288 pfam13650, Asp_protease_2, Aspartyl protease.  This family
          consists of predicted aspartic proteases, typically
          from 180 to 230 amino acids in length, in MEROPS clan
          AA. This model describes the well-conserved 121-residue
          C-terminal region. The poorly conserved, variable
          length N-terminal region usually contains a predicted
          transmembrane helix.
          Length = 89

 Score = 25.7 bits (57), Expect = 6.8
 Identities = 12/71 (16%), Positives = 19/71 (26%), Gaps = 8/71 (11%)

Query: 20 QEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKD 79
          +  + + DTG+S   +       L +                  S  NG           
Sbjct: 8  KPVRFLLDTGASGTLLSRSLAERLGLK--------PTKLTIGRVSTANGTVRAALVRLDS 59

Query: 80 KFVGDITYSPV 90
            +G IT   V
Sbjct: 60 LKIGGITLRNV 70


>gnl|CDD|235227 PRK04149, sat, sulfate adenylyltransferase; Reviewed.
          Length = 391

 Score = 26.7 bits (60), Expect = 8.2
 Identities = 8/26 (30%), Positives = 12/26 (46%)

Query: 98  FGVESIKIEKNVYCSNCQAIADTGTS 123
            G+  +K E+  YC  C  +A   T 
Sbjct: 314 LGITPLKFEEAFYCPKCGGMASEKTC 339


>gnl|CDD|115814 pfam07184, CTV_P33, Citrus tristeza virus P33 protein.  This family
           consists of several Citrus tristeza virus (CTV) P33
           proteins. The function of P33 is unclear although it is
           known that the protein is not needed for virion
           formation.
          Length = 303

 Score = 26.1 bits (57), Expect = 9.7
 Identities = 11/41 (26%), Positives = 17/41 (41%)

Query: 58  SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQF 98
           +H  N  +  G       VDK +F   I   P   +G++ F
Sbjct: 87  AHIDNQPNCEGNVTYIKTVDKSRFAILIKAMPAPMRGHYSF 127


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.136    0.414 

Gapped
Lambda     K      H
   0.267   0.0740    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,909,910
Number of extensions: 799677
Number of successful extensions: 647
Number of sequences better than 10.0: 1
Number of HSP's gapped: 624
Number of HSP's successfully gapped: 53
Length of query: 177
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 86
Effective length of database: 6,901,388
Effective search space: 593519368
Effective search space used: 593519368
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 56 (25.3 bits)