RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3214
         (336 letters)



>gnl|CDD|133157 cd05490, Cathepsin_D2, Cathepsin_D2, pepsin family of proteinases. 
           Cathepsin D is the major aspartic proteinase of the
           lysosomal compartment where it functions in protein
           catabolism. It is a member of the pepsin family of
           proteinases. This enzyme is distinguished from other
           members of the pepsin family by two features that are
           characteristic of lysosomal hydrolases. First, mature
           Cathepsin D is found predominantly in a two-chain form
           due to a posttranslational cleavage event. Second, it
           contains phosphorylated, N-linked oligosaccharides that
           target the enzyme to lysosomes via mannose-6-phosphate
           receptors. Cathepsin D preferentially attacks peptide
           bonds flanked by bulky hydrophobic amino acids and its
           pH optimum is between pH 2.8 and 4.0. Two active site
           aspartic acid residues are essential for the catalytic
           activity of aspartic proteinases. Like other aspartic
           proteinases, Cathepsin D is a bilobed molecule; the two
           evolutionary related lobes are mostly made up of
           beta-sheets and flank a deep active site cleft. Each of
           the two related lobes contributes one active site
           aspartic acid residue and contains a single carbohydrate
           group. Cathepsin D is an essential enzyme. Mice
           deficient for proteinase cathepsin D, generated by gene
           targeting, develop normally during the first 2 weeks,
           stop thriving in the third week and die in a state of
           anorexia in the fourth week. The mice develop atrophy of
           ileal mucosa followed by other degradation of intestinal
           organs. In these knockout mice, lysosomal proteolysis
           was normal. These results suggest that vital functions
           of cathepsin D are exerted by limited proteolysis of
           proteins regulating cell growth and/or tissue
           homeostasis, while its contribution to bulk proteolysis
           in lysosomes appears to be non-critical. This family of
           aspartate proteases is classified by MEROPS as the
           peptidase family A1 (pepsin A, clan AA).
          Length = 325

 Score =  154 bits (392), Expect = 1e-44
 Identities = 64/92 (69%), Positives = 79/92 (85%)

Query: 70  KNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGY 129
           KNGT+FAI+YGSG++SGYLSQDT+ IG L ++ Q+FGEA+K+PG TF+AAKFDGILGM Y
Sbjct: 61  KNGTEFAIQYGSGSLSGYLSQDTVSIGGLQVEGQLFGEAVKQPGITFIAAKFDGILGMAY 120

Query: 130 DNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNR 161
             I+VDGV P F NI+ QKL+E+NVF FYLNR
Sbjct: 121 PRISVDGVTPVFDNIMAQKLVEQNVFSFYLNR 152



 Score =  117 bits (294), Expect = 2e-30
 Identities = 43/69 (62%), Positives = 56/69 (81%)

Query: 1   MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
           +P +   LGGK + LT  DY+LKVS   +++CLSGF G+DIPPPAGP+WILGDVFIG++Y
Sbjct: 252 LPVISFSLGGKVYPLTGEDYILKVSQRGTTICLSGFMGLDIPPPAGPLWILGDVFIGRYY 311

Query: 61  TVFDMDNNQ 69
           TVFD DN++
Sbjct: 312 TVFDRDNDR 320


>gnl|CDD|133155 cd05488, Proteinase_A_fungi, Fungal Proteinase A , aspartic
           proteinase superfamily.  Fungal Proteinase A, a
           proteolytic enzyme distributed among a variety of
           organisms, is a member of the aspartic proteinase
           superfamily. In Saccharomyces cerevisiae, targeted to
           the vacuole as a zymogen, activation of proteinases A at
           acidic pH can occur by two different pathways: a
           one-step process to release mature proteinase A,
           involving the intervention of proteinase B, or a
           step-wise pathway via the auto-activation product known
           as pseudo-proteinase A. Once active, S. cerevisiae
           proteinase A is essential to the activities of other
           yeast vacuolar hydrolases, including proteinase B and
           carboxypeptidase Y. The mature enzyme is bilobal, with
           each lobe providing one of the two catalytically
           essential aspartic acid residues in the active site. The
           crystal structure of free proteinase A shows that flap
           loop is atypically pointing directly into the S(1)
           pocket of the enzyme.  Proteinase A preferentially
           hydrolyzes hydrophobic residues such as Phe, Leu or Glu
           at the P1 position and Phe, Ile, Leu or Ala at P1'.
           Moreover, the enzyme is inhibited by IA3, a natural and
           highly specific inhibitor produced by S. cerevisiae.
           This family of aspartate proteases is classified by
           MEROPS as the peptidase family A1 (pepsin A, clan AA).
          Length = 320

 Score =  151 bits (382), Expect = 3e-43
 Identities = 55/97 (56%), Positives = 70/97 (72%)

Query: 69  QKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMG 128
           + NGT+F I+YGSG++ G++SQDTL IGDLTIK Q F EA  EPG  F   KFDGILG+ 
Sbjct: 62  KANGTEFKIQYGSGSLEGFVSQDTLSIGDLTIKKQDFAEATSEPGLAFAFGKFDGILGLA 121

Query: 129 YDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTE 165
           YD I+V+ + PPFYN+I Q LL++ VF FYL     +
Sbjct: 122 YDTISVNKIVPPFYNMINQGLLDEPVFSFYLGSSEED 158



 Score = 87.9 bits (218), Expect = 9e-20
 Identities = 32/68 (47%), Positives = 44/68 (64%), Gaps = 4/68 (5%)

Query: 1   MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
           +P++     G NFTL P DY L+VS      C+S F GMD P P GP+ I+GD F+ K+Y
Sbjct: 251 LPDLTFNFDGYNFTLGPFDYTLEVS----GSCISAFTGMDFPEPVGPLAIVGDAFLRKYY 306

Query: 61  TVFDMDNN 68
           +V+D+ NN
Sbjct: 307 SVYDLGNN 314


>gnl|CDD|133152 cd05485, Cathepsin_D_like, Cathepsin_D_like, pepsin family of
           proteinases.  Cathepsin D is the major aspartic
           proteinase of the lysosomal compartment where it
           functions in protein catabolism. It is a member of the
           pepsin family of proteinases. This enzyme is
           distinguished from other members of the pepsin family by
           two features that are characteristic of lysosomal
           hydrolases. First, mature Cathepsin D is found
           predominantly in a two-chain form due to a
           posttranslational cleavage event. Second, it contains
           phosphorylated, N-linked oligosaccharides that target
           the enzyme to lysosomes via mannose-6-phosphate
           receptors. Cathepsin D preferentially attacks peptide
           bonds flanked by bulky hydrophobic amino acids and its
           pH optimum is between pH 2.8 and 4.0. Two active site
           aspartic acid residues are essential for the catalytic
           activity of aspartic proteinases. Like other aspartic
           proteinases, Cathepsin D is a bilobed molecule; the two
           evolutionary related lobes are mostly made up of
           beta-sheets and flank a deep active site cleft. Each of
           the two related lobes contributes one active site
           aspartic acid residue and contains a single carbohydrate
           group. Cathepsin D is an essential enzyme. Mice
           deficient for proteinase cathepsin D, generated by gene
           targeting, develop normally during the first 2 weeks,
           stop thriving in the third week and die in a state of
           anorexia in the fourth week. The mice develop atrophy of
           ileal mucosa followed by other degradation of intestinal
           organs. In these knockout mice, lysosomal proteolysis
           was normal. These results suggest that vital functions
           of cathepsin D are exerted by limited proteolysis of
           proteins regulating cell growth and/or tissue
           homeostasis, while its contribution to bulk proteolysis
           in lysosomes appears to be non-critical. This family of
           aspartate proteases is classified by MEROPS as the
           peptidase family A1 (pepsin A, clan AA).
          Length = 329

 Score =  146 bits (371), Expect = 2e-41
 Identities = 68/113 (60%), Positives = 87/113 (76%), Gaps = 2/113 (1%)

Query: 69  QKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMG 128
           +KNGT+FAI+YGSG++SG+LS DT+ +G +++K Q F EAI EPG TFVAAKFDGILGMG
Sbjct: 65  KKNGTEFAIQYGSGSLSGFLSTDTVSVGGVSVKGQTFAEAINEPGLTFVAAKFDGILGMG 124

Query: 129 YDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGEVPIPSAHDPK 181
           Y +I+VDGV P FYN++ QKL++  VF FYLNR  + K   GE+ I    DPK
Sbjct: 125 YSSISVDGVVPVFYNMVNQKLVDAPVFSFYLNRDPSAK-EGGEL-ILGGSDPK 175



 Score =  115 bits (290), Expect = 9e-30
 Identities = 44/69 (63%), Positives = 58/69 (84%)

Query: 1   MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
           +P++  +LGGK+F+LT  DYVLKV+    ++CLSGF G+DIPPPAGP+WILGDVFIGK+Y
Sbjct: 256 LPDITFVLGGKSFSLTGKDYVLKVTQMGQTICLSGFMGIDIPPPAGPLWILGDVFIGKYY 315

Query: 61  TVFDMDNNQ 69
           T FD+ NN+
Sbjct: 316 TEFDLGNNR 324


>gnl|CDD|133162 cd06098, phytepsin, Phytepsin, a plant homolog of mammalian
           lysosomal pepsins.  Phytepsin, a plant homolog of
           mammalian lysosomal pepsins, resides in grains, roots,
           stems, leaves and flowers. Phytepsin may participate in
           metabolic turnover and in protein processing events. In
           addition, it highly expressed in several plant tissues
           undergoing apoptosis. Phytepsin contains an internal
           region consisting of about 100 residues not present in
           animal or microbial pepsins. This region is thus called
           a plant specific insert. The insert is highly similar to
           saponins, which are lysosomal sphingolipid-activating
           proteins in mammalian cells. The saponin-like domain may
           have a role in the vacuolar targeting of phytepsin.
           Phytepsin, as its animal counterparts, possesses a
           topology typical of all aspartic proteases.  They are
           bilobal enzymes, each lobe contributing a catalytic Asp
           residue, with an extended active site cleft localized
           between the two lobes of the molecule. One lobe has
           probably evolved from the other through a gene
           duplication event in the distant past. This family of
           aspartate proteases is classified by MEROPS as the
           peptidase family A1 (pepsin A, clan AA).
          Length = 317

 Score =  141 bits (358), Expect = 1e-39
 Identities = 54/104 (51%), Positives = 80/104 (76%), Gaps = 1/104 (0%)

Query: 69  QKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMG 128
           +KNGT  +I+YG+G++SG+ SQD++ +GDL +KNQVF EA KEPG TF+ AKFDGILG+G
Sbjct: 63  KKNGTSASIQYGTGSISGFFSQDSVTVGDLVVKNQVFIEATKEPGLTFLLAKFDGILGLG 122

Query: 129 YDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGEV 172
           +  I+V    P +YN+++Q L+++ VF F+LNR N ++   GE+
Sbjct: 123 FQEISVGKAVPVWYNMVEQGLVKEPVFSFWLNR-NPDEEEGGEL 165



 Score =  107 bits (268), Expect = 9e-27
 Identities = 38/69 (55%), Positives = 51/69 (73%)

Query: 1   MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
           MPNV   +GGK F LTP  Y+LKV   +++ C+SGF  +D+PPP GP+WILGDVF+G ++
Sbjct: 244 MPNVSFTIGGKTFELTPEQYILKVGEGAAAQCISGFTALDVPPPRGPLWILGDVFMGAYH 303

Query: 61  TVFDMDNNQ 69
           TVFD  N +
Sbjct: 304 TVFDYGNLR 312


>gnl|CDD|215663 pfam00026, Asp, Eukaryotic aspartyl protease.  Aspartyl (acid)
           proteases include pepsins, cathepsins, and renins.
           Two-domain structure, probably arising from ancestral
           duplication. This family does not include the retroviral
           nor retrotransposon proteases (pfam00077), which are
           much smaller and appear to be homologous to a single
           domain of the eukaryotic asp proteases.
          Length = 316

 Score =  125 bits (315), Expect = 2e-33
 Identities = 44/95 (46%), Positives = 58/95 (61%), Gaps = 2/95 (2%)

Query: 69  QKNGTQFAIRYGSG-AVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGM 127
           +  GT F+I YG G + SG+L QDT+ +G +T+ NQ FG A KEPG  F  A FDGILG+
Sbjct: 54  KSLGTTFSISYGDGSSASGFLGQDTVTVGGITVTNQQFGLATKEPGSFFATAVFDGILGL 113

Query: 128 GYDNIAVDG-VEPPFYNIIQQKLLEKNVFGFYLNR 161
           G+ +I   G   P F N+  Q L++   F  YLN 
Sbjct: 114 GFPSIEAVGTYTPVFDNLKSQGLIDSPAFSVYLNS 148



 Score = 84.3 bits (209), Expect = 2e-18
 Identities = 35/69 (50%), Positives = 44/69 (63%), Gaps = 4/69 (5%)

Query: 1   MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
           +P+V   +GG   T+ P+DYVL+ S   SS CLSGF       P GP+WILGDVF+   Y
Sbjct: 246 LPDVTFFIGGAKITVPPSDYVLQPSSGGSSTCLSGFQSS----PGGPLWILGDVFLRSAY 301

Query: 61  TVFDMDNNQ 69
            VFD DNN+
Sbjct: 302 VVFDRDNNR 310


>gnl|CDD|133154 cd05487, renin_like, Renin stimulates production of angiotensin and
           thus affects blood pressure.  Renin, also known as
           angiotensinogenase, is a circulating enzyme that
           participates in the renin-angiotensin system that
           mediates extracellular volume, arterial
           vasoconstriction, and consequently mean arterial blood
           pressure. The enzyme is secreted by the kidneys from
           specialized juxtaglomerular cells in response to
           decreases in glomerular filtration rate (a consequence
           of low blood volume), diminished filtered sodium
           chloride and sympathetic nervous system innervation. The
           enzyme circulates in the blood stream and hydrolyzes
           angiotensinogen secreted from the liver into the peptide
           angiotensin I. Angiotensin I is further cleaved in the
           lungs by endothelial bound angiotensin converting enzyme
           (ACE) into angiotensin II, the final active peptide.
           Renin is a member of the aspartic protease family.
           Structurally, aspartic proteases are bilobal enzymes,
           each lobe contributing a catalytic Aspartate  residue,
           with an extended active site cleft localized between the
           two lobes of the molecule. The N- and C-terminal
           domains, although structurally related by a 2-fold axis,
           have only limited sequence homology except the vicinity
           of the active site. This suggests that the enzymes
           evolved by an ancient duplication event. The active site
           is located at the groove formed by the two lobes, with
           an extended loop projecting over the cleft to form an
           11-residue flap, which encloses substrates and
           inhibitors in the active site. Specificity is determined
           by nearest-neighbor hydrophobic residues surrounding the
           catalytic aspartates, and by three residues in the flap.
           The enzymes are mostly secreted from cells as inactive
           proenzymes that activate autocatalytically at acidic pH.
           This family of aspartate proteases is classified by
           MEROPS as the peptidase family A1 (pepsin A, clan AA).
          Length = 326

 Score =  118 bits (297), Expect = 9e-31
 Identities = 48/110 (43%), Positives = 68/110 (61%), Gaps = 3/110 (2%)

Query: 71  NGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYD 130
           NGT+F I Y SG V G+LSQD + +G + +  Q+FGE    P   F+ AKFDG+LGMGY 
Sbjct: 64  NGTEFTIHYASGTVKGFLSQDIVTVGGIPV-TQMFGEVTALPAIPFMLAKFDGVLGMGYP 122

Query: 131 NIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGEVPIPSAHDP 180
             A+ GV P F NI+ Q +L+++VF  Y +R ++     GE+ +    DP
Sbjct: 123 KQAIGGVTPVFDNIMSQGVLKEDVFSVYYSR-DSSHSLGGEI-VLGGSDP 170



 Score = 88.7 bits (220), Expect = 5e-20
 Identities = 36/69 (52%), Positives = 47/69 (68%)

Query: 1   MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
           +P++   LGGK +TL+ +DYVL+ S  S  +C   F  MDIPPP GP+W+LG  FI KFY
Sbjct: 252 LPDISFHLGGKEYTLSSSDYVLQDSDFSDKLCTVAFHAMDIPPPTGPLWVLGATFIRKFY 311

Query: 61  TVFDMDNNQ 69
           T FD  NN+
Sbjct: 312 TEFDRQNNR 320


>gnl|CDD|133138 cd05471, pepsin_like, Pepsin-like aspartic proteases, bilobal
           enzymes that cleave bonds in peptides at acidic pH.
           Pepsin-like aspartic proteases are found in mammals,
           plants, fungi and bacteria. These well known and
           extensively characterized enzymes include pepsins,
           chymosin, renin, cathepsins, and fungal aspartic
           proteases. Several have long been known to be medically
           (renin, cathepsin D and E, pepsin) or commercially
           (chymosin) important. Structurally, aspartic proteases
           are bilobal enzymes, each lobe contributing a catalytic
           Aspartate residue, with an extended active site cleft
           localized between the two lobes of the molecule. The N-
           and C-terminal domains, although structurally related by
           a 2-fold axis, have only limited sequence homology
           except the vicinity of the active site. This suggests
           that the enzymes evolved by an ancient duplication
           event.  Most members of the pepsin family specifically
           cleave bonds in peptides that are at least six residues
           in length, with hydrophobic residues in both the P1 and
           P1' positions. The active site is located at the groove
           formed by the two lobes, with an extended loop
           projecting over the cleft to form an 11-residue flap,
           which encloses substrates and inhibitors in the active
           site. Specificity is determined by nearest-neighbor
           hydrophobic residues surrounding the catalytic
           aspartates, and by three residues in the flap.The
           enzymes are mostly secreted from cells as inactive
           proenzymes that activate autocatalytically at acidic pH.
           This family of aspartate proteases is classified by
           MEROPS as the peptidase family A1 (pepsin A, clan AA).
          Length = 283

 Score =  115 bits (289), Expect = 6e-30
 Identities = 44/101 (43%), Positives = 58/101 (57%), Gaps = 1/101 (0%)

Query: 66  DNNQKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGIL 125
              +  G  F+I YG G+V+G L  DT+ IG LTI NQ FG A  E G  F ++ FDGIL
Sbjct: 51  STYKDTGCTFSITYGDGSVTGGLGTDTVTIGGLTIPNQTFGCATSESGD-FSSSGFDGIL 109

Query: 126 GMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEK 166
           G+G+ +++VDGV   F  +  Q L+   VF FYL R     
Sbjct: 110 GLGFPSLSVDGVPSFFDQLKSQGLISSPVFSFYLGRDGDGG 150



 Score = 44.0 bits (104), Expect = 6e-05
 Identities = 15/21 (71%), Positives = 19/21 (90%)

Query: 49  WILGDVFIGKFYTVFDMDNNQ 69
           WILGDVF+  +YTVFD+DNN+
Sbjct: 258 WILGDVFLRNYYTVFDLDNNR 278


>gnl|CDD|133153 cd05486, Cathespin_E, Cathepsin E, non-lysosomal aspartic protease.
            Cathepsin E is an intracellular, non-lysosomal aspartic
           protease expressed in a variety of cells and tissues.
           The protease has proposed physiological roles in antigen
           presentation by the MHC class II system, in the
           biogenesis of the vasoconstrictor peptide endothelin,
           and in neurodegeneration associated with brain ischemia
           and aging. Cathepsin E is the only A1 aspartic protease
           that exists as a homodimer with a disulfide bridge
           linking the two monomers. Like many other aspartic
           proteases, it is synthesized as a zymogen which is
           catalytically inactive towards its natural substrates at
           neutral pH and which auto-activates in an acidic
           environment. The overall structure follows the general
           fold of aspartic proteases of the A1 family, it is
           composed of two structurally similar beta barrel lobes,
           each lobe contributing an aspartic acid residue to form
           a catalytic dyad that acts to cleave the substrate
           peptide bond. The catalytic Asp residues are contained
           in an Asp-Thr-Gly-Ser/thr motif in both N- and
           C-terminal lobes of the enzyme. The aspartic acid
           residues act together to allow a water molecule to
           attack the peptide bond. One aspartic acid residue (in
           its deprotonated form) activates the attacking water
           molecule, whereas the other aspartic acid residue (in
           its protonated form) polarizes the peptide carbonyl,
           increasing its susceptibility to attack. This family of
           aspartate proteases is classified by MEROPS as the
           peptidase family A1 (pepsin A, clan AA).
          Length = 316

 Score =  106 bits (265), Expect = 3e-26
 Identities = 43/104 (41%), Positives = 69/104 (66%), Gaps = 1/104 (0%)

Query: 69  QKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMG 128
             NG  F+I+YG+G+++G +  D + +  +T++NQ F E++ EPG TF  ++FDGILG+ 
Sbjct: 52  VSNGEAFSIQYGTGSLTGIIGIDQVTVEGITVQNQQFAESVSEPGSTFQDSEFDGILGLA 111

Query: 129 YDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGEV 172
           Y ++AVDGV P F N++ Q L+E  +F  Y++R N      GE+
Sbjct: 112 YPSLAVDGVTPVFDNMMAQNLVELPMFSVYMSR-NPNSADGGEL 154



 Score = 94.6 bits (235), Expect = 4e-22
 Identities = 35/69 (50%), Positives = 47/69 (68%)

Query: 1   MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
           MP+V   + G  ++L+P  Y L+        C SGF G+DIPPPAGP+WILGDVFI ++Y
Sbjct: 243 MPSVTFTINGIPYSLSPQAYTLEDQSDGGGYCSSGFQGLDIPPPAGPLWILGDVFIRQYY 302

Query: 61  TVFDMDNNQ 69
           +VFD  NN+
Sbjct: 303 SVFDRGNNR 311


>gnl|CDD|133145 cd05478, pepsin_A, Pepsin A, aspartic protease produced in gastric
           mucosa of mammals.  Pepsin, a well-known aspartic
           protease, is produced by the human gastric mucosa in
           seven different zymogen isoforms, subdivided into two
           types: pepsinogen A and pepsinogen C. The prosequence of
           the zymogens are self cleaved under acidic pH. The
           mature enzymes are called pepsin A and pepsin C,
           correspondingly. The well researched porcine pepsin is
           also in this pepsin A family. Pepsins play an integral
           role in the digestion process of vertebrates. Pepsins
           are bilobal enzymes, each lobe contributing a catalytic
           Asp residue, with an extended active site cleft
           localized between the two lobes of the molecule. One
           lobe may be evolved from the other through ancient
           gene-duplication event. More recently evolved enzymes
           have similar three-dimensional structures, however their
           amino acid sequences are more divergent except for the
           conserved catalytic site motif. Pepsins specifically
           cleave bonds in peptides which have at least six
           residues in length with hydrophobic residues in both the
           P1 and P1' positions. The active site is located at the
           groove formed by the two lobes, with an extended loop
           projecting over the cleft to form an 11-residue flap,
           which encloses substrates and inhibitors in the active
           site. Specificity is determined by nearest-neighbor
           hydrophobic residues surrounding the catalytic
           aspartates, and by three residues in the flap. This
           family of aspartate proteases is classified by MEROPS as
           the peptidase family A1 (pepsin A, clan AA).
          Length = 317

 Score =  102 bits (255), Expect = 7e-25
 Identities = 38/93 (40%), Positives = 60/93 (64%)

Query: 69  QKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMG 128
           Q  G   +I+YG+G+++G L  DT+++G ++  NQ+FG +  EPG  F  A FDGILG+ 
Sbjct: 62  QSTGQPLSIQYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFFYYAPFDGILGLA 121

Query: 129 YDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNR 161
           Y +IA  G  P F N++ Q L+ +++F  YL+ 
Sbjct: 122 YPSIASSGATPVFDNMMSQGLVSQDLFSVYLSS 154



 Score = 65.5 bits (160), Expect = 4e-12
 Identities = 28/69 (40%), Positives = 39/69 (56%), Gaps = 8/69 (11%)

Query: 1   MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
           MP+V   + G  + L P+ Y+L+        C SGF  M      G +WILGDVFI ++Y
Sbjct: 252 MPDVVFTINGVQYPLPPSAYILQ----DQGSCTSGFQSMG----LGELWILGDVFIRQYY 303

Query: 61  TVFDMDNNQ 69
           +VFD  NN+
Sbjct: 304 SVFDRANNK 312


>gnl|CDD|133144 cd05477, gastricsin, Gastricsins, asparate proteases produced in
           gastric mucosa.  Gastricsin is also called pepsinogen C.
           Gastricsins are produced in gastric mucosa of mammals.
           It is synthesized by the chief cells in the stomach as
           an inactive zymogen. It is self-converted to a mature
           enzyme under acidic conditions. Human gastricsin is
           distributed throughout all parts of the stomach.
           Gastricsin is synthesized as an inactive progastricsin
           that has an approximately 40 residue prosequence. It is
           self-converting to a mature enzyme being triggered by a
           drop in pH from neutrality to acidic conditions. Like
           other aspartic proteases, gastricsin are characterized
           by two catalytic aspartic residues at the active site,
           and display optimal activity at acidic pH. Mature enzyme
           has a pseudo-2-fold symmetry that passes through the
           active site between the catalytic aspartate residues.
           Structurally, aspartic proteases are bilobal enzymes,
           each lobe contributing a catalytic aspartate residue,
           with an extended active site cleft localized between the
           two lobes of the molecule. One lobe may be evolved from
           the other through ancient gene-duplication event.
           Although the three-dimensional structures of the two
           lobes are very similar, the amino acid sequences are
           more divergent, except for the conserved catalytic site
           motif. This family of aspartate proteases is classified
           by MEROPS as the peptidase family A1 (pepsin A, clan
           AA).
          Length = 318

 Score = 96.1 bits (239), Expect = 1e-22
 Identities = 38/91 (41%), Positives = 57/91 (62%)

Query: 71  NGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYD 130
           NG  F+++YGSG+++G    DT+ +  + I NQ FG +  EPG  FV A+FDGILG+ Y 
Sbjct: 57  NGETFSLQYGSGSLTGIFGYDTVTVQGIIITNQEFGLSETEPGTNFVYAQFDGILGLAYP 116

Query: 131 NIAVDGVEPPFYNIIQQKLLEKNVFGFYLNR 161
           +I+  G       ++QQ LL+  +F FYL+ 
Sbjct: 117 SISAGGATTVMQGMMQQNLLQAPIFSFYLSG 147



 Score = 58.0 bits (140), Expect = 1e-09
 Identities = 25/70 (35%), Positives = 42/70 (60%), Gaps = 5/70 (7%)

Query: 1   MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAG-PIWILGDVFIGKF 59
           +P +   + G +F L P+ Y+L+    ++  C  G     +P   G P+WILGDVF+ ++
Sbjct: 247 LPTLTFTINGVSFPLPPSAYILQ----NNGYCTVGIEPTYLPSQNGQPLWILGDVFLRQY 302

Query: 60  YTVFDMDNNQ 69
           Y+V+D+ NNQ
Sbjct: 303 YSVYDLGNNQ 312


>gnl|CDD|217694 pfam03723, Hemocyanin_C, Hemocyanin, ig-like domain.  This family
           includes arthropod hemocyanins and insect larval storage
           proteins.
          Length = 249

 Score = 84.1 bits (209), Expect = 6e-19
 Identities = 38/115 (33%), Positives = 57/115 (49%), Gaps = 20/115 (17%)

Query: 229 LEYPGVELKSLRILNKQGPTDQLETFWQ-----LIRMDEVYVRDKELTEEYNIP---YHI 280
           LE+PGV+++S+ +  K      L T++      L       +   E   + ++      +
Sbjct: 8   LEFPGVKIESVEVDGK------LVTYFDDFDVDLSNA----LDFTEEVPDVSVKARQTRL 57

Query: 281 QHEDFKYEITIENNSNETKTGIMRIFLGAKNDIEGNELPLSAQRSLFIELDKFRV 335
            H+ F Y I + ++     T  +RIFLG K D  GNEL L+ QR  FIELDKF V
Sbjct: 58  NHKPFTYNIDVNSDKAADAT--VRIFLGPKYDENGNELSLNEQRRNFIELDKFVV 110


>gnl|CDD|133137 cd05470, pepsin_retropepsin_like, Cellular and retroviral
           pepsin-like aspartate proteases.  This family includes
           both cellular and retroviral pepsin-like aspartate
           proteases. The cellular pepsin and pepsin-like enzymes
           are twice as long as their retroviral counterparts. The
           cellular pepsin-like aspartic proteases are found in
           mammals, plants, fungi and bacteria. These well known
           and extensively characterized enzymes include pepsins,
           chymosin, rennin, cathepsins, and fungal aspartic
           proteases. Several have long been known to be medically
           (rennin, cathepsin D and E, pepsin) or commercially
           (chymosin) important. The eukaryotic pepsin-like
           proteases contain two domains possessing similar
           topological features. The N- and C-terminal domains,
           although structurally related by a 2-fold axis, have
           only limited sequence homology except in the vicinity of
           the active site. This suggests that the enzymes evolved
           by an ancient duplication event. The eukaryotic
           pepsin-like proteases have two active site ASP residues
           with each N- and C-terminal lobe contributing one
           residue. While the fungal and mammalian pepsins are
           bilobal proteins, retropepsins function as dimers and
           the monomer resembles structure of the N- or C-terminal
           domains of eukaryotic enzyme. The active site motif
           (Asp-Thr/Ser-Gly-Ser) is conserved between the
           retroviral and eukaryotic proteases and between the
           N-and C-terminal of eukaryotic pepsin-like proteases.
           The retropepsin-like family includes pepsin-like
           aspartate proteases from retroviruses, retrotransposons
           and retroelements; as well as eukaryotic
           DNA-damage-inducible proteins (DDIs), and bacterial
           aspartate peptidases. Retropepsin is synthesized as part
           of the POL polyprotein that contains an
           aspartyl-protease, a reverse transcriptase, RNase H, and
           an integrase. The POL polyprotein undergoes specific
           enzymatic cleavage to yield the mature proteins. This
           family of aspartate proteases is classified by MEROPS as
           the peptidase family A1 (pepsin A) and A2 (retropepsin
           family).
          Length = 109

 Score = 80.1 bits (198), Expect = 1e-18
 Identities = 32/59 (54%), Positives = 41/59 (69%)

Query: 69  QKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGM 127
             NG  F+I YG+G++SG LS DT+ IGD+ +  Q FG A  EPG TF+ A FDGILG+
Sbjct: 51  SDNGCTFSITYGTGSLSGGLSTDTVSIGDIEVVGQAFGCATDEPGATFLPALFDGILGL 109


>gnl|CDD|240300 PTZ00165, PTZ00165, aspartyl protease; Provisional.
          Length = 482

 Score = 70.6 bits (173), Expect = 2e-13
 Identities = 33/88 (37%), Positives = 52/88 (59%), Gaps = 3/88 (3%)

Query: 77  IRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGY---DNIA 133
           I+YG+G     L +DT++IG L +K+Q  G AI+E    F    FDG++G+G+   D   
Sbjct: 185 IQYGTGECVLALGKDTVKIGGLKVKHQSIGLAIEESLHPFADLPFDGLVGLGFPDKDFKE 244

Query: 134 VDGVEPPFYNIIQQKLLEKNVFGFYLNR 161
                P   NI +Q LL++N+F FY+++
Sbjct: 245 SKKALPIVDNIKKQNLLKRNIFSFYMSK 272



 Score = 65.9 bits (161), Expect = 6e-12
 Identities = 22/57 (38%), Positives = 36/57 (63%), Gaps = 2/57 (3%)

Query: 13  FTLTPTDYVLKVSIAS--SSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYTVFDMDN 67
           F + P DYV++   +      C+ G   MD+P P GP+++LG+ FI K+Y++FD D+
Sbjct: 382 FDMDPEDYVIEEGDSEEQEHQCVIGIIPMDVPAPRGPLFVLGNNFIRKYYSIFDRDH 438


>gnl|CDD|140176 PTZ00147, PTZ00147, plasmepsin-1; Provisional.
          Length = 453

 Score = 62.6 bits (152), Expect = 8e-11
 Identities = 34/100 (34%), Positives = 58/100 (58%), Gaps = 3/100 (3%)

Query: 69  QKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGF--TFVAAKFDGILG 126
           +K+GT+  + Y SG VSG+ S+D + IG+L++  + F E     GF   +  + FDGI G
Sbjct: 191 EKDGTKVEMNYVSGTVSGFFSKDLVTIGNLSVPYK-FIEVTDTNGFEPFYTESDFDGIFG 249

Query: 127 MGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEK 166
           +G+ ++++  V+P    +  Q  +E+ VF FYL   +  K
Sbjct: 250 LGWKDLSIGSVDPYVVELKNQNKIEQAVFTFYLPPEDKHK 289



 Score = 43.7 bits (103), Expect = 9e-05
 Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 3/68 (4%)

Query: 1   MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
           +P ++     K +TL P  Y+  +    S++C+     +D+       +ILGD F+ K++
Sbjct: 378 LPTLEFRSPNKVYTLEPEYYLQPIEDIGSALCMLNIIPIDLEKNT---FILGDPFMRKYF 434

Query: 61  TVFDMDNN 68
           TVFD DN+
Sbjct: 435 TVFDYDNH 442


>gnl|CDD|140051 PTZ00013, PTZ00013, plasmepsin 4 (PM4); Provisional.
          Length = 450

 Score = 60.0 bits (145), Expect = 4e-10
 Identities = 31/95 (32%), Positives = 58/95 (61%), Gaps = 7/95 (7%)

Query: 69  QKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIK----EPGFTFVAAKFDGI 124
           +K+GT+  I YGSG V G+ S+D + +G L++  + F E       EP ++  +++FDGI
Sbjct: 190 EKDGTKVDITYGSGTVKGFFSKDLVTLGHLSMPYK-FIEVTDTDDLEPIYS--SSEFDGI 246

Query: 125 LGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYL 159
           LG+G+ ++++  ++P    +  Q  ++  +F FYL
Sbjct: 247 LGLGWKDLSIGSIDPIVVELKNQNKIDNALFTFYL 281



 Score = 35.7 bits (82), Expect = 0.031
 Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 6/77 (7%)

Query: 1   MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
           MP ++       +TL P  Y+  +     ++C+     +DI       +ILGD F+ K++
Sbjct: 377 MPTLEFKSANNTYTLEPEYYMNPLLDVDDTLCMITMLPVDIDDNT---FILGDPFMRKYF 433

Query: 61  TVFDMDNNQKNGTQFAI 77
           TVFD D   K    FAI
Sbjct: 434 TVFDYD---KESVGFAI 447


>gnl|CDD|133141 cd05474, SAP_like, SAPs, pepsin-like proteinases secreted from
           pathogens to degrade host proteins.  SAPs (Secreted
           aspartic proteinases) are secreted from a group of
           pathogenic fungi, predominantly Candida species. They
           are secreted from the pathogen to degrade host proteins.
           SAP is one of the most significant extracellular
           hydrolytic enzymes produced by C. albicans. SAP
           proteins, encoded by a family of 10 SAP genes. All 10
           SAP genes of C. albicans encode preproenzymes,
           approximately 60 amino acid longer than the mature
           enzyme, which are processed when transported via the
           secretory pathway. The mature enzymes contain sequence
           motifs typical for all aspartyl proteinases, including
           the two conserved aspartate residues other active site
           and conserved cysteine residues implicated in the
           maintenance of the three-dimensional structure. Most Sap
           proteins contain putative N-glycosylation sites, but it
           remains to be determined which Sap proteins are
           glycosylated. This family of aspartate proteases is
           classified by MEROPS as the peptidase family A1 (pepsin
           A, clan AA).  The overall structure of Sap protein
           conforms to the classical aspartic proteinase fold
           typified by pepsin. SAP is a bilobal enzyme, each lobe
           contributing a catalytic Asp residue, with an extended
           active site cleft localized between the two lobes of the
           molecule. One lobe may be evolved from the other through
           ancient gene-duplication event. More recently evolved
           enzymes have similar three-dimensional structures,
           however their amino acid sequences are more divergent
           except for the conserved catalytic site motif. This
           family of aspartate proteases is classified by MEROPS as
           the peptidase family A1 (pepsin A, clan AA).
          Length = 295

 Score = 56.4 bits (137), Expect = 4e-09
 Identities = 31/92 (33%), Positives = 41/92 (44%), Gaps = 13/92 (14%)

Query: 75  FAIRYGSG-AVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYD-NI 132
           F+I YG G + SG    DT+ IG  T+KN  F  A              G+LG+G   N 
Sbjct: 32  FSISYGDGTSASGTWGTDTVSIGGATVKNLQFAVANSTSSDV-------GVLGIGLPGNE 84

Query: 133 AVDGVEPPFYNIIQ----QKLLEKNVFGFYLN 160
           A  G    + N       Q L++KN +  YLN
Sbjct: 85  ATYGTGYTYPNFPIALKKQGLIKKNAYSLYLN 116



 Score = 39.5 bits (93), Expect = 0.002
 Identities = 20/69 (28%), Positives = 29/69 (42%), Gaps = 7/69 (10%)

Query: 3   NVDIILGGKNFTLTPTDYVLKVSI--ASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
           ++    GG   ++  +D VL  S        C  G       P      ILGD F+   Y
Sbjct: 226 SLTFNFGGATISVPLSDLVLPASTDDGGDGACYLGIQ-----PSTSDYNILGDTFLRSAY 280

Query: 61  TVFDMDNNQ 69
            V+D+DNN+
Sbjct: 281 VVYDLDNNE 289


>gnl|CDD|133161 cd06097, Aspergillopepsin_like, Aspergillopepsin_like, aspartic
           proteases of fungal origin.  The members of this family
           are aspartic proteases of fungal origin, including
           aspergillopepsin, rhizopuspepsin, endothiapepsin, and
           rodosporapepsin. The various fungal species in this
           family may be the most economically important genus of
           fungi. They may serve as virulence factors or as
           industrial aids. For example, Aspergillopepsin from A.
           fumigatus is involved in invasive aspergillosis owing to
           its elastolytic activity and Aspergillopepsins from the
           mold A. saitoi are used in fermentation industry.
           Aspartic proteinases are a group of proteolytic enzymes
           in which the scissile peptide bond is attacked by a
           nucleophilic water molecule activated by two aspartic
           residues in a DT(S)G motif at the active site. They have
           a similar fold composed of two beta-barrel domains.
           Between the N-terminal and C-terminal domains, each of
           which contributes one catalytic aspartic residue, there
           is an extended active-site cleft capable of interacting
           with multiple residues of a substrate. Although members
           of the aspartic protease family of enzymes have very
           similar three-dimensional structures and catalytic
           mechanisms, each has unique substrate specificity. The
           members of this family has an optimal acidic pH (5.5)
           and cleaves protein substrates with similar specificity
           to that of porcine pepsin A, preferring hydrophobic
           residues at P1 and P1' in the cleave site.  This family
           of aspartate proteases is classified by MEROPS as the
           peptidase family A1 (pepsin A, clan AA).
          Length = 278

 Score = 52.7 bits (127), Expect = 7e-08
 Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 10/97 (10%)

Query: 71  NGTQFAIRYGSGA-VSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGY 129
            G  ++I YG G+  SG +  DT+ IG + + NQ    A       F     DG+LG+ +
Sbjct: 55  PGATWSISYGDGSSASGIVYTDTVSIGGVEVPNQAIELATAVSASFFSDTASDGLLGLAF 114

Query: 130 DNIAVDGVEPP-----FYNIIQQKLLEKNVFGFYLNR 161
            +I  + V+PP     F N +    L+  +F   L +
Sbjct: 115 SSI--NTVQPPKQKTFFENALSS--LDAPLFTADLRK 147


>gnl|CDD|133143 cd05476, pepsin_A_like_plant, Chroloplast Nucleoids DNA-binding
           Protease and Nucellin, pepsin-like aspartic proteases
           from plants.  This family contains pepsin like aspartic
           proteases from plants including Chloroplast Nucleoids
           DNA-binding Protease and Nucellin. Chloroplast Nucleoids
           DNA-binding Protease catalyzes the degradation of
           ribulose-1,5-bisphosphate carboxylase/oxygenase
           (Rubisco) in senescent leaves of tobacco and Nucellins
           are important regulators of nucellar cell's progressive
           degradation after ovule fertilization. Structurally,
           aspartic proteases are bilobal enzymes, each lobe
           contributing a catalytic Asp residue, with an extended
           active site cleft localized between the two lobes of the
           molecule. The N- and C-terminal domains, although
           structurally related by a 2-fold axis, have only limited
           sequence homology except the vicinity of the active
           site. This suggests that the enzymes evolved by an
           ancient duplication event.  The enzymes specifically
           cleave bonds in peptides which have at least six
           residues in length with hydrophobic residues in both the
           P1 and P1' positions. The active site is located at the
           groove formed by the two lobes, with an extended loop
           projecting over the cleft to form an 11-residue flap,
           which encloses substrates and inhibitors in the active
           site. Specificity is determined by nearest-neighbor
           hydrophobic residues surrounding the catalytic
           aspartates, and by three residues in the flap.  The
           enzymes are mostly secreted from cells as inactive
           proenzymes that activate autocatalytically at acidic pH.
          Length = 265

 Score = 40.7 bits (96), Expect = 6e-04
 Identities = 20/58 (34%), Positives = 30/58 (51%), Gaps = 5/58 (8%)

Query: 74  QFAIRYGSGAVS-GYLSQDTLRIGD--LTIKNQVFGEAIKEPGFTFVAAKFDGILGMG 128
            +   YG G+ + G L+ +T   GD  +++ N  FG      G +F  A  DGILG+G
Sbjct: 32  SYEYSYGDGSSTSGVLATETFTFGDSSVSVPNVAFGCGTDNEGGSFGGA--DGILGLG 87


>gnl|CDD|133140 cd05473, beta_secretase_like, Beta-secretase, aspartic-acid
           protease important in the pathogenesis of Alzheimer's
           disease.  Beta-secretase also called BACE (beta-site of
           APP cleaving enzyme) or memapsin-2. Beta-secretase is an
           aspartic-acid protease important in the pathogenesis of
           Alzheimer's disease, and in the formation of myelin
           sheaths in peripheral nerve cells. It cleaves amyloid
           precursor protein (APP) to reveal the N-terminus of the
           beta-amyloid peptides. The beta-amyloid peptides are the
           major components of the amyloid plaques formed in the
           brain of patients with Alzheimer's disease (AD). Since
           BACE mediates one of the cleavages responsible for
           generation of AD, it is regarded as a potential target
           for pharmacological intervention in AD. Beta-secretase
           is a member of pepsin family of aspartic proteases. Same
           as other aspartic proteases, beta-secretase is a bilobal
           enzyme, each lobe contributing a catalytic Asp residue,
           with an extended active site cleft localized between the
           two lobes of the molecule.  The N- and C-terminal
           domains, although structurally related by a 2-fold axis,
           have only limited sequence homology except the vicinity
           of the active site. This suggests that the enzymes
           evolved by an ancient duplication event. The enzymes
           specifically cleave bonds in peptides which have at
           least six residues in length with hydrophobic residues
           in both the P1 and P1' positions. The active site is
           located at the groove formed by the two lobes, with an
           extended loop projecting over the cleft to form an
           11-residue flap, which encloses substrates and
           inhibitors in the active site. Specificity is determined
           by nearest-neighbor hydrophobic residues surrounding the
           catalytic aspartates, and by three residues in the flap.
           The enzymes are mostly secreted from cells as inactive
           proenzymes that activate autocatalytically at acidic pH.
           This family of aspartate proteases is classified by
           MEROPS as the peptidase family A1 (pepsin A, clan AA).
          Length = 364

 Score = 35.1 bits (81), Expect = 0.042
 Identities = 26/90 (28%), Positives = 39/90 (43%), Gaps = 9/90 (10%)

Query: 72  GTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFG---EAIKEP-GFTFVAAKFDGILGM 127
           G    + Y  G+  G L  D + I      N  F     AI E   F    + ++GILG+
Sbjct: 54  GKGVTVPYTQGSWEGELGTDLVSIPKGP--NVTFRANIAAITESENFFLNGSNWEGILGL 111

Query: 128 GYDNIAV--DGVEPPFYNIIQQKLLEKNVF 155
            Y  +A     VEP F ++++Q     +VF
Sbjct: 112 AYAELARPDSSVEPFFDSLVKQ-TGIPDVF 140


>gnl|CDD|222288 pfam13650, Asp_protease_2, Aspartyl protease.  This family consists
           of predicted aspartic proteases, typically from 180 to
           230 amino acids in length, in MEROPS clan AA. This model
           describes the well-conserved 121-residue C-terminal
           region. The poorly conserved, variable length N-terminal
           region usually contains a predicted transmembrane helix.
          Length = 89

 Score = 28.8 bits (65), Expect = 1.1
 Identities = 14/48 (29%), Positives = 23/48 (47%), Gaps = 8/48 (16%)

Query: 81  SGAVSGYLSQ-DTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGM 127
           +G V   L + D+L+IG +T++N              +    DG+LGM
Sbjct: 47  NGTVRAALVRLDSLKIGGITLRNVPA-------VVLDLGDLIDGLLGM 87


>gnl|CDD|188595 TIGR04080, rSAM_pep_cyc, KxxxW cyclic peptide radical SAM maturase.
            Members of this family are radical SAM enzymes that
           appear to perform a cyclization on an adjacent cognate
           peptide from family TIGR04079. Genomes with the complete
           system include Streptococcus thermophilus LMD-9 and
           Lactococcus lactis subsp. cremoris MG1363, among others.
           The gene symbol assigned is kwcM, for KxxxW Cyclic
           peptide Maturase [Protein fate, Protein modification and
           repair].
          Length = 440

 Score = 29.1 bits (65), Expect = 3.1
 Identities = 17/74 (22%), Positives = 32/74 (43%), Gaps = 4/74 (5%)

Query: 142 YNIIQQKLLEKNVFGFYLNRYNTEKLANGEVPIPSAHDPKELESRIENKYQLPEYDHVGD 201
           Y+I   + LE+ V  + L+     ++AN                 IE   +  E++ +  
Sbjct: 291 YDIADLRKLEEKVRDYILDN----EIANFNFATEGCFLFTAFPELIEEGEEFSEFEEMYF 346

Query: 202 GCSASYTEMQQWTN 215
           GC A +T+M+  +N
Sbjct: 347 GCRAKFTKMEILSN 360


>gnl|CDD|133150 cd05483, retropepsin_like_bacteria, Bacterial aspartate proteases,
           retropepsin-like protease family.  This family of
           bacteria aspartate proteases is a subfamily of
           retropepsin-like protease family, which includes enzymes
           from retrovirus and retrotransposons. While fungal and
           mammalian pepsin-like aspartate proteases are bilobal
           proteins with structurally related N- and C-termini,
           this family of bacteria aspartate proteases is half as
           long as their fungal and mammalian counterparts. The
           monomers are structurally related to one lobe of the
           pepsin molecule and function as homodimers. The active
           site aspartate occurs within a motif (Asp-Thr/Ser-Gly),
           as it does in pepsin. In aspartate peptidases, Asp
           residues are ligands of an activated water molecule in
           all examples where catalytic residues have been
           identified. This group of aspartate proteases is
           classified by MEROPS as the peptidase family A2
           (retropepsin family, clan AA), subfamily A2A.
          Length = 96

 Score = 27.6 bits (62), Expect = 3.7
 Identities = 14/49 (28%), Positives = 23/49 (46%), Gaps = 7/49 (14%)

Query: 80  GSGAVSGYLSQ-DTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGM 127
            +G V     + D+L+IG +T++N      +       V    DG+LGM
Sbjct: 50  ANGRVRAARVRLDSLQIGGITLRNVPA--VVLPGDALGV----DGLLGM 92


>gnl|CDD|130293 TIGR01226, phe_am_lyase, phenylalanine ammonia-lyase.  Members of
           this family are found, so far, in plants and fungi. From
           phenylalanine, this enzyme yields cinnaminic acid, a
           precursor of many important plant compounds. This
           protein shows extensive homology to histidine
           ammonia-lyase, the first enzyme of a histidine
           degradation pathway.
          Length = 680

 Score = 29.0 bits (65), Expect = 3.8
 Identities = 23/106 (21%), Positives = 37/106 (34%), Gaps = 22/106 (20%)

Query: 139 PPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGEVPIPSAHDPKELESRIENKYQLPEYDH 198
               NI+ +    K V G             GE+  P     K+L   ++ +     +D+
Sbjct: 509 STVKNIVSEVA--KKVLG---------MGLTGEL-HPKRFCEKDLLKVLDRETV---FDY 553

Query: 199 VGDGCSASYTEMQQWTNNIIEAINQGYAVTLEYPGVELKSLRILNK 244
           + D CSA+Y  MQ+    ++E         L     E   L    K
Sbjct: 554 IDDPCSANYPLMQKLRQVLVEH-------ALSNGEREKDLLSSFQK 592


>gnl|CDD|99816 cd06220, DHOD_e_trans_like2, FAD/NAD binding domain in the electron
           transfer subunit of dihydroorotate dehydrogenase-like
           proteins. Dihydroorotate dehydrogenases (DHODs) catalyze
           the only redox reaction in pyrimidine de novo
           biosynthesis. They catalyze the oxidation of
           (S)-dihydroorotate to orotate coupled with the reduction
           of NAD+. In L. lactis, DHOD B (encoded by pyrDa) is
           co-expressed with pyrK and both gene products are
           required for full activity, as well as 3 cofactors: FMN,
           FAD, and an [2Fe-2S] cluster.
          Length = 233

 Score = 28.4 bits (64), Expect = 5.2
 Identities = 16/48 (33%), Positives = 21/48 (43%), Gaps = 2/48 (4%)

Query: 116 FVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFG-FYLNRY 162
           FV      +    YD I V G E   Y ++ + L E+ V   F L RY
Sbjct: 154 FVTDLLKELDLEEYDAIYVCGPEIMMYKVL-EILDERGVRAQFSLERY 200


>gnl|CDD|176543 cd08601, GDPD_SaGlpQ_like, Glycerophosphodiester phosphodiesterase
           domain of Staphylococcus aureus and similar proteins.
           This subfamily corresponds to the glycerophosphodiester
           phosphodiesterase domain (GDPD) present in
           uncharacterized glycerophosphodiester phosphodiesterase
           (GP-GDE, EC 3.1.4.46) from Staphylococcus aureus,
           Bacillus subtilis and similar proteins. Members in this
           family show very high sequence similarity to Escherichia
           coli periplasmic phosphodiesterase GlpQ, which catalyzes
           the Ca2+-dependent degradation of periplasmic
           glycerophosphodiesters to produce
           sn-glycerol-3-phosphate (G3P) and the corresponding
           alcohols.
          Length = 256

 Score = 28.1 bits (63), Expect = 5.7
 Identities = 11/28 (39%), Positives = 14/28 (50%)

Query: 231 YPGVELKSLRILNKQGPTDQLETFWQLI 258
           YPG+E K L  L+K G         Q+I
Sbjct: 124 YPGMEEKLLATLDKYGLLTDNLKNGQVI 151


>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
          Length = 977

 Score = 28.4 bits (63), Expect = 6.5
 Identities = 13/28 (46%), Positives = 18/28 (64%), Gaps = 2/28 (7%)

Query: 294 NSNETKTGIMRIFLG--AKNDIEGNELP 319
           N+     G++++FLG     DIEGNELP
Sbjct: 392 NNTRDHPGMIQVFLGETGARDIEGNELP 419


>gnl|CDD|215881 pfam00372, Hemocyanin_M, Hemocyanin, copper containing domain.
           This family includes arthropod hemocyanins and insect
           larval storage proteins.
          Length = 278

 Score = 27.7 bits (62), Expect = 8.5
 Identities = 19/82 (23%), Positives = 34/82 (41%), Gaps = 10/82 (12%)

Query: 159 LNRYNTEKLANGEVPIPSAHDPKELES------RIENKY---QLPEYDHVGD-GCSASYT 208
           L RY+ E+L+NG   IP       ++        + +       P+  H+ D        
Sbjct: 113 LARYDLERLSNGLGRIPPFSWFDPIKEGYAPLTSLVSGVEFPVRPDNYHIHDVDNVDVVR 172

Query: 209 EMQQWTNNIIEAINQGYAVTLE 230
           +M +W   I++AI +GY    +
Sbjct: 173 DMDRWEERILDAIQKGYIEAFD 194


>gnl|CDD|223047 PHA03332, PHA03332, membrane glycoprotein; Provisional.
          Length = 1328

 Score = 28.0 bits (62), Expect = 8.9
 Identities = 12/40 (30%), Positives = 17/40 (42%), Gaps = 1/40 (2%)

Query: 21  VLKVSIASSSMCLSGFAGMDIPPPAGPIWI-LGDVFIGKF 59
           VL +   +S  CL  + G+  P P  P    LG +  G  
Sbjct: 200 VLSILRGTSYTCLLSYVGISEPEPNSPCVTTLGPMCYGPL 239


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.139    0.410 

Gapped
Lambda     K      H
   0.267   0.0740    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,721,488
Number of extensions: 1748788
Number of successful extensions: 1469
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1452
Number of HSP's successfully gapped: 58
Length of query: 336
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 239
Effective length of database: 6,635,264
Effective search space: 1585828096
Effective search space used: 1585828096
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.5 bits)