RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3214
(336 letters)
>gnl|CDD|133157 cd05490, Cathepsin_D2, Cathepsin_D2, pepsin family of proteinases.
Cathepsin D is the major aspartic proteinase of the
lysosomal compartment where it functions in protein
catabolism. It is a member of the pepsin family of
proteinases. This enzyme is distinguished from other
members of the pepsin family by two features that are
characteristic of lysosomal hydrolases. First, mature
Cathepsin D is found predominantly in a two-chain form
due to a posttranslational cleavage event. Second, it
contains phosphorylated, N-linked oligosaccharides that
target the enzyme to lysosomes via mannose-6-phosphate
receptors. Cathepsin D preferentially attacks peptide
bonds flanked by bulky hydrophobic amino acids and its
pH optimum is between pH 2.8 and 4.0. Two active site
aspartic acid residues are essential for the catalytic
activity of aspartic proteinases. Like other aspartic
proteinases, Cathepsin D is a bilobed molecule; the two
evolutionary related lobes are mostly made up of
beta-sheets and flank a deep active site cleft. Each of
the two related lobes contributes one active site
aspartic acid residue and contains a single carbohydrate
group. Cathepsin D is an essential enzyme. Mice
deficient for proteinase cathepsin D, generated by gene
targeting, develop normally during the first 2 weeks,
stop thriving in the third week and die in a state of
anorexia in the fourth week. The mice develop atrophy of
ileal mucosa followed by other degradation of intestinal
organs. In these knockout mice, lysosomal proteolysis
was normal. These results suggest that vital functions
of cathepsin D are exerted by limited proteolysis of
proteins regulating cell growth and/or tissue
homeostasis, while its contribution to bulk proteolysis
in lysosomes appears to be non-critical. This family of
aspartate proteases is classified by MEROPS as the
peptidase family A1 (pepsin A, clan AA).
Length = 325
Score = 154 bits (392), Expect = 1e-44
Identities = 64/92 (69%), Positives = 79/92 (85%)
Query: 70 KNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGY 129
KNGT+FAI+YGSG++SGYLSQDT+ IG L ++ Q+FGEA+K+PG TF+AAKFDGILGM Y
Sbjct: 61 KNGTEFAIQYGSGSLSGYLSQDTVSIGGLQVEGQLFGEAVKQPGITFIAAKFDGILGMAY 120
Query: 130 DNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNR 161
I+VDGV P F NI+ QKL+E+NVF FYLNR
Sbjct: 121 PRISVDGVTPVFDNIMAQKLVEQNVFSFYLNR 152
Score = 117 bits (294), Expect = 2e-30
Identities = 43/69 (62%), Positives = 56/69 (81%)
Query: 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
+P + LGGK + LT DY+LKVS +++CLSGF G+DIPPPAGP+WILGDVFIG++Y
Sbjct: 252 LPVISFSLGGKVYPLTGEDYILKVSQRGTTICLSGFMGLDIPPPAGPLWILGDVFIGRYY 311
Query: 61 TVFDMDNNQ 69
TVFD DN++
Sbjct: 312 TVFDRDNDR 320
>gnl|CDD|133155 cd05488, Proteinase_A_fungi, Fungal Proteinase A , aspartic
proteinase superfamily. Fungal Proteinase A, a
proteolytic enzyme distributed among a variety of
organisms, is a member of the aspartic proteinase
superfamily. In Saccharomyces cerevisiae, targeted to
the vacuole as a zymogen, activation of proteinases A at
acidic pH can occur by two different pathways: a
one-step process to release mature proteinase A,
involving the intervention of proteinase B, or a
step-wise pathway via the auto-activation product known
as pseudo-proteinase A. Once active, S. cerevisiae
proteinase A is essential to the activities of other
yeast vacuolar hydrolases, including proteinase B and
carboxypeptidase Y. The mature enzyme is bilobal, with
each lobe providing one of the two catalytically
essential aspartic acid residues in the active site. The
crystal structure of free proteinase A shows that flap
loop is atypically pointing directly into the S(1)
pocket of the enzyme. Proteinase A preferentially
hydrolyzes hydrophobic residues such as Phe, Leu or Glu
at the P1 position and Phe, Ile, Leu or Ala at P1'.
Moreover, the enzyme is inhibited by IA3, a natural and
highly specific inhibitor produced by S. cerevisiae.
This family of aspartate proteases is classified by
MEROPS as the peptidase family A1 (pepsin A, clan AA).
Length = 320
Score = 151 bits (382), Expect = 3e-43
Identities = 55/97 (56%), Positives = 70/97 (72%)
Query: 69 QKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMG 128
+ NGT+F I+YGSG++ G++SQDTL IGDLTIK Q F EA EPG F KFDGILG+
Sbjct: 62 KANGTEFKIQYGSGSLEGFVSQDTLSIGDLTIKKQDFAEATSEPGLAFAFGKFDGILGLA 121
Query: 129 YDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTE 165
YD I+V+ + PPFYN+I Q LL++ VF FYL +
Sbjct: 122 YDTISVNKIVPPFYNMINQGLLDEPVFSFYLGSSEED 158
Score = 87.9 bits (218), Expect = 9e-20
Identities = 32/68 (47%), Positives = 44/68 (64%), Gaps = 4/68 (5%)
Query: 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
+P++ G NFTL P DY L+VS C+S F GMD P P GP+ I+GD F+ K+Y
Sbjct: 251 LPDLTFNFDGYNFTLGPFDYTLEVS----GSCISAFTGMDFPEPVGPLAIVGDAFLRKYY 306
Query: 61 TVFDMDNN 68
+V+D+ NN
Sbjct: 307 SVYDLGNN 314
>gnl|CDD|133152 cd05485, Cathepsin_D_like, Cathepsin_D_like, pepsin family of
proteinases. Cathepsin D is the major aspartic
proteinase of the lysosomal compartment where it
functions in protein catabolism. It is a member of the
pepsin family of proteinases. This enzyme is
distinguished from other members of the pepsin family by
two features that are characteristic of lysosomal
hydrolases. First, mature Cathepsin D is found
predominantly in a two-chain form due to a
posttranslational cleavage event. Second, it contains
phosphorylated, N-linked oligosaccharides that target
the enzyme to lysosomes via mannose-6-phosphate
receptors. Cathepsin D preferentially attacks peptide
bonds flanked by bulky hydrophobic amino acids and its
pH optimum is between pH 2.8 and 4.0. Two active site
aspartic acid residues are essential for the catalytic
activity of aspartic proteinases. Like other aspartic
proteinases, Cathepsin D is a bilobed molecule; the two
evolutionary related lobes are mostly made up of
beta-sheets and flank a deep active site cleft. Each of
the two related lobes contributes one active site
aspartic acid residue and contains a single carbohydrate
group. Cathepsin D is an essential enzyme. Mice
deficient for proteinase cathepsin D, generated by gene
targeting, develop normally during the first 2 weeks,
stop thriving in the third week and die in a state of
anorexia in the fourth week. The mice develop atrophy of
ileal mucosa followed by other degradation of intestinal
organs. In these knockout mice, lysosomal proteolysis
was normal. These results suggest that vital functions
of cathepsin D are exerted by limited proteolysis of
proteins regulating cell growth and/or tissue
homeostasis, while its contribution to bulk proteolysis
in lysosomes appears to be non-critical. This family of
aspartate proteases is classified by MEROPS as the
peptidase family A1 (pepsin A, clan AA).
Length = 329
Score = 146 bits (371), Expect = 2e-41
Identities = 68/113 (60%), Positives = 87/113 (76%), Gaps = 2/113 (1%)
Query: 69 QKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMG 128
+KNGT+FAI+YGSG++SG+LS DT+ +G +++K Q F EAI EPG TFVAAKFDGILGMG
Sbjct: 65 KKNGTEFAIQYGSGSLSGFLSTDTVSVGGVSVKGQTFAEAINEPGLTFVAAKFDGILGMG 124
Query: 129 YDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGEVPIPSAHDPK 181
Y +I+VDGV P FYN++ QKL++ VF FYLNR + K GE+ I DPK
Sbjct: 125 YSSISVDGVVPVFYNMVNQKLVDAPVFSFYLNRDPSAK-EGGEL-ILGGSDPK 175
Score = 115 bits (290), Expect = 9e-30
Identities = 44/69 (63%), Positives = 58/69 (84%)
Query: 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
+P++ +LGGK+F+LT DYVLKV+ ++CLSGF G+DIPPPAGP+WILGDVFIGK+Y
Sbjct: 256 LPDITFVLGGKSFSLTGKDYVLKVTQMGQTICLSGFMGIDIPPPAGPLWILGDVFIGKYY 315
Query: 61 TVFDMDNNQ 69
T FD+ NN+
Sbjct: 316 TEFDLGNNR 324
>gnl|CDD|133162 cd06098, phytepsin, Phytepsin, a plant homolog of mammalian
lysosomal pepsins. Phytepsin, a plant homolog of
mammalian lysosomal pepsins, resides in grains, roots,
stems, leaves and flowers. Phytepsin may participate in
metabolic turnover and in protein processing events. In
addition, it highly expressed in several plant tissues
undergoing apoptosis. Phytepsin contains an internal
region consisting of about 100 residues not present in
animal or microbial pepsins. This region is thus called
a plant specific insert. The insert is highly similar to
saponins, which are lysosomal sphingolipid-activating
proteins in mammalian cells. The saponin-like domain may
have a role in the vacuolar targeting of phytepsin.
Phytepsin, as its animal counterparts, possesses a
topology typical of all aspartic proteases. They are
bilobal enzymes, each lobe contributing a catalytic Asp
residue, with an extended active site cleft localized
between the two lobes of the molecule. One lobe has
probably evolved from the other through a gene
duplication event in the distant past. This family of
aspartate proteases is classified by MEROPS as the
peptidase family A1 (pepsin A, clan AA).
Length = 317
Score = 141 bits (358), Expect = 1e-39
Identities = 54/104 (51%), Positives = 80/104 (76%), Gaps = 1/104 (0%)
Query: 69 QKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMG 128
+KNGT +I+YG+G++SG+ SQD++ +GDL +KNQVF EA KEPG TF+ AKFDGILG+G
Sbjct: 63 KKNGTSASIQYGTGSISGFFSQDSVTVGDLVVKNQVFIEATKEPGLTFLLAKFDGILGLG 122
Query: 129 YDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGEV 172
+ I+V P +YN+++Q L+++ VF F+LNR N ++ GE+
Sbjct: 123 FQEISVGKAVPVWYNMVEQGLVKEPVFSFWLNR-NPDEEEGGEL 165
Score = 107 bits (268), Expect = 9e-27
Identities = 38/69 (55%), Positives = 51/69 (73%)
Query: 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
MPNV +GGK F LTP Y+LKV +++ C+SGF +D+PPP GP+WILGDVF+G ++
Sbjct: 244 MPNVSFTIGGKTFELTPEQYILKVGEGAAAQCISGFTALDVPPPRGPLWILGDVFMGAYH 303
Query: 61 TVFDMDNNQ 69
TVFD N +
Sbjct: 304 TVFDYGNLR 312
>gnl|CDD|215663 pfam00026, Asp, Eukaryotic aspartyl protease. Aspartyl (acid)
proteases include pepsins, cathepsins, and renins.
Two-domain structure, probably arising from ancestral
duplication. This family does not include the retroviral
nor retrotransposon proteases (pfam00077), which are
much smaller and appear to be homologous to a single
domain of the eukaryotic asp proteases.
Length = 316
Score = 125 bits (315), Expect = 2e-33
Identities = 44/95 (46%), Positives = 58/95 (61%), Gaps = 2/95 (2%)
Query: 69 QKNGTQFAIRYGSG-AVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGM 127
+ GT F+I YG G + SG+L QDT+ +G +T+ NQ FG A KEPG F A FDGILG+
Sbjct: 54 KSLGTTFSISYGDGSSASGFLGQDTVTVGGITVTNQQFGLATKEPGSFFATAVFDGILGL 113
Query: 128 GYDNIAVDG-VEPPFYNIIQQKLLEKNVFGFYLNR 161
G+ +I G P F N+ Q L++ F YLN
Sbjct: 114 GFPSIEAVGTYTPVFDNLKSQGLIDSPAFSVYLNS 148
Score = 84.3 bits (209), Expect = 2e-18
Identities = 35/69 (50%), Positives = 44/69 (63%), Gaps = 4/69 (5%)
Query: 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
+P+V +GG T+ P+DYVL+ S SS CLSGF P GP+WILGDVF+ Y
Sbjct: 246 LPDVTFFIGGAKITVPPSDYVLQPSSGGSSTCLSGFQSS----PGGPLWILGDVFLRSAY 301
Query: 61 TVFDMDNNQ 69
VFD DNN+
Sbjct: 302 VVFDRDNNR 310
>gnl|CDD|133154 cd05487, renin_like, Renin stimulates production of angiotensin and
thus affects blood pressure. Renin, also known as
angiotensinogenase, is a circulating enzyme that
participates in the renin-angiotensin system that
mediates extracellular volume, arterial
vasoconstriction, and consequently mean arterial blood
pressure. The enzyme is secreted by the kidneys from
specialized juxtaglomerular cells in response to
decreases in glomerular filtration rate (a consequence
of low blood volume), diminished filtered sodium
chloride and sympathetic nervous system innervation. The
enzyme circulates in the blood stream and hydrolyzes
angiotensinogen secreted from the liver into the peptide
angiotensin I. Angiotensin I is further cleaved in the
lungs by endothelial bound angiotensin converting enzyme
(ACE) into angiotensin II, the final active peptide.
Renin is a member of the aspartic protease family.
Structurally, aspartic proteases are bilobal enzymes,
each lobe contributing a catalytic Aspartate residue,
with an extended active site cleft localized between the
two lobes of the molecule. The N- and C-terminal
domains, although structurally related by a 2-fold axis,
have only limited sequence homology except the vicinity
of the active site. This suggests that the enzymes
evolved by an ancient duplication event. The active site
is located at the groove formed by the two lobes, with
an extended loop projecting over the cleft to form an
11-residue flap, which encloses substrates and
inhibitors in the active site. Specificity is determined
by nearest-neighbor hydrophobic residues surrounding the
catalytic aspartates, and by three residues in the flap.
The enzymes are mostly secreted from cells as inactive
proenzymes that activate autocatalytically at acidic pH.
This family of aspartate proteases is classified by
MEROPS as the peptidase family A1 (pepsin A, clan AA).
Length = 326
Score = 118 bits (297), Expect = 9e-31
Identities = 48/110 (43%), Positives = 68/110 (61%), Gaps = 3/110 (2%)
Query: 71 NGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYD 130
NGT+F I Y SG V G+LSQD + +G + + Q+FGE P F+ AKFDG+LGMGY
Sbjct: 64 NGTEFTIHYASGTVKGFLSQDIVTVGGIPV-TQMFGEVTALPAIPFMLAKFDGVLGMGYP 122
Query: 131 NIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGEVPIPSAHDP 180
A+ GV P F NI+ Q +L+++VF Y +R ++ GE+ + DP
Sbjct: 123 KQAIGGVTPVFDNIMSQGVLKEDVFSVYYSR-DSSHSLGGEI-VLGGSDP 170
Score = 88.7 bits (220), Expect = 5e-20
Identities = 36/69 (52%), Positives = 47/69 (68%)
Query: 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
+P++ LGGK +TL+ +DYVL+ S S +C F MDIPPP GP+W+LG FI KFY
Sbjct: 252 LPDISFHLGGKEYTLSSSDYVLQDSDFSDKLCTVAFHAMDIPPPTGPLWVLGATFIRKFY 311
Query: 61 TVFDMDNNQ 69
T FD NN+
Sbjct: 312 TEFDRQNNR 320
>gnl|CDD|133138 cd05471, pepsin_like, Pepsin-like aspartic proteases, bilobal
enzymes that cleave bonds in peptides at acidic pH.
Pepsin-like aspartic proteases are found in mammals,
plants, fungi and bacteria. These well known and
extensively characterized enzymes include pepsins,
chymosin, renin, cathepsins, and fungal aspartic
proteases. Several have long been known to be medically
(renin, cathepsin D and E, pepsin) or commercially
(chymosin) important. Structurally, aspartic proteases
are bilobal enzymes, each lobe contributing a catalytic
Aspartate residue, with an extended active site cleft
localized between the two lobes of the molecule. The N-
and C-terminal domains, although structurally related by
a 2-fold axis, have only limited sequence homology
except the vicinity of the active site. This suggests
that the enzymes evolved by an ancient duplication
event. Most members of the pepsin family specifically
cleave bonds in peptides that are at least six residues
in length, with hydrophobic residues in both the P1 and
P1' positions. The active site is located at the groove
formed by the two lobes, with an extended loop
projecting over the cleft to form an 11-residue flap,
which encloses substrates and inhibitors in the active
site. Specificity is determined by nearest-neighbor
hydrophobic residues surrounding the catalytic
aspartates, and by three residues in the flap.The
enzymes are mostly secreted from cells as inactive
proenzymes that activate autocatalytically at acidic pH.
This family of aspartate proteases is classified by
MEROPS as the peptidase family A1 (pepsin A, clan AA).
Length = 283
Score = 115 bits (289), Expect = 6e-30
Identities = 44/101 (43%), Positives = 58/101 (57%), Gaps = 1/101 (0%)
Query: 66 DNNQKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGIL 125
+ G F+I YG G+V+G L DT+ IG LTI NQ FG A E G F ++ FDGIL
Sbjct: 51 STYKDTGCTFSITYGDGSVTGGLGTDTVTIGGLTIPNQTFGCATSESGD-FSSSGFDGIL 109
Query: 126 GMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEK 166
G+G+ +++VDGV F + Q L+ VF FYL R
Sbjct: 110 GLGFPSLSVDGVPSFFDQLKSQGLISSPVFSFYLGRDGDGG 150
Score = 44.0 bits (104), Expect = 6e-05
Identities = 15/21 (71%), Positives = 19/21 (90%)
Query: 49 WILGDVFIGKFYTVFDMDNNQ 69
WILGDVF+ +YTVFD+DNN+
Sbjct: 258 WILGDVFLRNYYTVFDLDNNR 278
>gnl|CDD|133153 cd05486, Cathespin_E, Cathepsin E, non-lysosomal aspartic protease.
Cathepsin E is an intracellular, non-lysosomal aspartic
protease expressed in a variety of cells and tissues.
The protease has proposed physiological roles in antigen
presentation by the MHC class II system, in the
biogenesis of the vasoconstrictor peptide endothelin,
and in neurodegeneration associated with brain ischemia
and aging. Cathepsin E is the only A1 aspartic protease
that exists as a homodimer with a disulfide bridge
linking the two monomers. Like many other aspartic
proteases, it is synthesized as a zymogen which is
catalytically inactive towards its natural substrates at
neutral pH and which auto-activates in an acidic
environment. The overall structure follows the general
fold of aspartic proteases of the A1 family, it is
composed of two structurally similar beta barrel lobes,
each lobe contributing an aspartic acid residue to form
a catalytic dyad that acts to cleave the substrate
peptide bond. The catalytic Asp residues are contained
in an Asp-Thr-Gly-Ser/thr motif in both N- and
C-terminal lobes of the enzyme. The aspartic acid
residues act together to allow a water molecule to
attack the peptide bond. One aspartic acid residue (in
its deprotonated form) activates the attacking water
molecule, whereas the other aspartic acid residue (in
its protonated form) polarizes the peptide carbonyl,
increasing its susceptibility to attack. This family of
aspartate proteases is classified by MEROPS as the
peptidase family A1 (pepsin A, clan AA).
Length = 316
Score = 106 bits (265), Expect = 3e-26
Identities = 43/104 (41%), Positives = 69/104 (66%), Gaps = 1/104 (0%)
Query: 69 QKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMG 128
NG F+I+YG+G+++G + D + + +T++NQ F E++ EPG TF ++FDGILG+
Sbjct: 52 VSNGEAFSIQYGTGSLTGIIGIDQVTVEGITVQNQQFAESVSEPGSTFQDSEFDGILGLA 111
Query: 129 YDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGEV 172
Y ++AVDGV P F N++ Q L+E +F Y++R N GE+
Sbjct: 112 YPSLAVDGVTPVFDNMMAQNLVELPMFSVYMSR-NPNSADGGEL 154
Score = 94.6 bits (235), Expect = 4e-22
Identities = 35/69 (50%), Positives = 47/69 (68%)
Query: 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
MP+V + G ++L+P Y L+ C SGF G+DIPPPAGP+WILGDVFI ++Y
Sbjct: 243 MPSVTFTINGIPYSLSPQAYTLEDQSDGGGYCSSGFQGLDIPPPAGPLWILGDVFIRQYY 302
Query: 61 TVFDMDNNQ 69
+VFD NN+
Sbjct: 303 SVFDRGNNR 311
>gnl|CDD|133145 cd05478, pepsin_A, Pepsin A, aspartic protease produced in gastric
mucosa of mammals. Pepsin, a well-known aspartic
protease, is produced by the human gastric mucosa in
seven different zymogen isoforms, subdivided into two
types: pepsinogen A and pepsinogen C. The prosequence of
the zymogens are self cleaved under acidic pH. The
mature enzymes are called pepsin A and pepsin C,
correspondingly. The well researched porcine pepsin is
also in this pepsin A family. Pepsins play an integral
role in the digestion process of vertebrates. Pepsins
are bilobal enzymes, each lobe contributing a catalytic
Asp residue, with an extended active site cleft
localized between the two lobes of the molecule. One
lobe may be evolved from the other through ancient
gene-duplication event. More recently evolved enzymes
have similar three-dimensional structures, however their
amino acid sequences are more divergent except for the
conserved catalytic site motif. Pepsins specifically
cleave bonds in peptides which have at least six
residues in length with hydrophobic residues in both the
P1 and P1' positions. The active site is located at the
groove formed by the two lobes, with an extended loop
projecting over the cleft to form an 11-residue flap,
which encloses substrates and inhibitors in the active
site. Specificity is determined by nearest-neighbor
hydrophobic residues surrounding the catalytic
aspartates, and by three residues in the flap. This
family of aspartate proteases is classified by MEROPS as
the peptidase family A1 (pepsin A, clan AA).
Length = 317
Score = 102 bits (255), Expect = 7e-25
Identities = 38/93 (40%), Positives = 60/93 (64%)
Query: 69 QKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMG 128
Q G +I+YG+G+++G L DT+++G ++ NQ+FG + EPG F A FDGILG+
Sbjct: 62 QSTGQPLSIQYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFFYYAPFDGILGLA 121
Query: 129 YDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNR 161
Y +IA G P F N++ Q L+ +++F YL+
Sbjct: 122 YPSIASSGATPVFDNMMSQGLVSQDLFSVYLSS 154
Score = 65.5 bits (160), Expect = 4e-12
Identities = 28/69 (40%), Positives = 39/69 (56%), Gaps = 8/69 (11%)
Query: 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
MP+V + G + L P+ Y+L+ C SGF M G +WILGDVFI ++Y
Sbjct: 252 MPDVVFTINGVQYPLPPSAYILQ----DQGSCTSGFQSMG----LGELWILGDVFIRQYY 303
Query: 61 TVFDMDNNQ 69
+VFD NN+
Sbjct: 304 SVFDRANNK 312
>gnl|CDD|133144 cd05477, gastricsin, Gastricsins, asparate proteases produced in
gastric mucosa. Gastricsin is also called pepsinogen C.
Gastricsins are produced in gastric mucosa of mammals.
It is synthesized by the chief cells in the stomach as
an inactive zymogen. It is self-converted to a mature
enzyme under acidic conditions. Human gastricsin is
distributed throughout all parts of the stomach.
Gastricsin is synthesized as an inactive progastricsin
that has an approximately 40 residue prosequence. It is
self-converting to a mature enzyme being triggered by a
drop in pH from neutrality to acidic conditions. Like
other aspartic proteases, gastricsin are characterized
by two catalytic aspartic residues at the active site,
and display optimal activity at acidic pH. Mature enzyme
has a pseudo-2-fold symmetry that passes through the
active site between the catalytic aspartate residues.
Structurally, aspartic proteases are bilobal enzymes,
each lobe contributing a catalytic aspartate residue,
with an extended active site cleft localized between the
two lobes of the molecule. One lobe may be evolved from
the other through ancient gene-duplication event.
Although the three-dimensional structures of the two
lobes are very similar, the amino acid sequences are
more divergent, except for the conserved catalytic site
motif. This family of aspartate proteases is classified
by MEROPS as the peptidase family A1 (pepsin A, clan
AA).
Length = 318
Score = 96.1 bits (239), Expect = 1e-22
Identities = 38/91 (41%), Positives = 57/91 (62%)
Query: 71 NGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYD 130
NG F+++YGSG+++G DT+ + + I NQ FG + EPG FV A+FDGILG+ Y
Sbjct: 57 NGETFSLQYGSGSLTGIFGYDTVTVQGIIITNQEFGLSETEPGTNFVYAQFDGILGLAYP 116
Query: 131 NIAVDGVEPPFYNIIQQKLLEKNVFGFYLNR 161
+I+ G ++QQ LL+ +F FYL+
Sbjct: 117 SISAGGATTVMQGMMQQNLLQAPIFSFYLSG 147
Score = 58.0 bits (140), Expect = 1e-09
Identities = 25/70 (35%), Positives = 42/70 (60%), Gaps = 5/70 (7%)
Query: 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAG-PIWILGDVFIGKF 59
+P + + G +F L P+ Y+L+ ++ C G +P G P+WILGDVF+ ++
Sbjct: 247 LPTLTFTINGVSFPLPPSAYILQ----NNGYCTVGIEPTYLPSQNGQPLWILGDVFLRQY 302
Query: 60 YTVFDMDNNQ 69
Y+V+D+ NNQ
Sbjct: 303 YSVYDLGNNQ 312
>gnl|CDD|217694 pfam03723, Hemocyanin_C, Hemocyanin, ig-like domain. This family
includes arthropod hemocyanins and insect larval storage
proteins.
Length = 249
Score = 84.1 bits (209), Expect = 6e-19
Identities = 38/115 (33%), Positives = 57/115 (49%), Gaps = 20/115 (17%)
Query: 229 LEYPGVELKSLRILNKQGPTDQLETFWQ-----LIRMDEVYVRDKELTEEYNIP---YHI 280
LE+PGV+++S+ + K L T++ L + E + ++ +
Sbjct: 8 LEFPGVKIESVEVDGK------LVTYFDDFDVDLSNA----LDFTEEVPDVSVKARQTRL 57
Query: 281 QHEDFKYEITIENNSNETKTGIMRIFLGAKNDIEGNELPLSAQRSLFIELDKFRV 335
H+ F Y I + ++ T +RIFLG K D GNEL L+ QR FIELDKF V
Sbjct: 58 NHKPFTYNIDVNSDKAADAT--VRIFLGPKYDENGNELSLNEQRRNFIELDKFVV 110
>gnl|CDD|133137 cd05470, pepsin_retropepsin_like, Cellular and retroviral
pepsin-like aspartate proteases. This family includes
both cellular and retroviral pepsin-like aspartate
proteases. The cellular pepsin and pepsin-like enzymes
are twice as long as their retroviral counterparts. The
cellular pepsin-like aspartic proteases are found in
mammals, plants, fungi and bacteria. These well known
and extensively characterized enzymes include pepsins,
chymosin, rennin, cathepsins, and fungal aspartic
proteases. Several have long been known to be medically
(rennin, cathepsin D and E, pepsin) or commercially
(chymosin) important. The eukaryotic pepsin-like
proteases contain two domains possessing similar
topological features. The N- and C-terminal domains,
although structurally related by a 2-fold axis, have
only limited sequence homology except in the vicinity of
the active site. This suggests that the enzymes evolved
by an ancient duplication event. The eukaryotic
pepsin-like proteases have two active site ASP residues
with each N- and C-terminal lobe contributing one
residue. While the fungal and mammalian pepsins are
bilobal proteins, retropepsins function as dimers and
the monomer resembles structure of the N- or C-terminal
domains of eukaryotic enzyme. The active site motif
(Asp-Thr/Ser-Gly-Ser) is conserved between the
retroviral and eukaryotic proteases and between the
N-and C-terminal of eukaryotic pepsin-like proteases.
The retropepsin-like family includes pepsin-like
aspartate proteases from retroviruses, retrotransposons
and retroelements; as well as eukaryotic
DNA-damage-inducible proteins (DDIs), and bacterial
aspartate peptidases. Retropepsin is synthesized as part
of the POL polyprotein that contains an
aspartyl-protease, a reverse transcriptase, RNase H, and
an integrase. The POL polyprotein undergoes specific
enzymatic cleavage to yield the mature proteins. This
family of aspartate proteases is classified by MEROPS as
the peptidase family A1 (pepsin A) and A2 (retropepsin
family).
Length = 109
Score = 80.1 bits (198), Expect = 1e-18
Identities = 32/59 (54%), Positives = 41/59 (69%)
Query: 69 QKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGM 127
NG F+I YG+G++SG LS DT+ IGD+ + Q FG A EPG TF+ A FDGILG+
Sbjct: 51 SDNGCTFSITYGTGSLSGGLSTDTVSIGDIEVVGQAFGCATDEPGATFLPALFDGILGL 109
>gnl|CDD|240300 PTZ00165, PTZ00165, aspartyl protease; Provisional.
Length = 482
Score = 70.6 bits (173), Expect = 2e-13
Identities = 33/88 (37%), Positives = 52/88 (59%), Gaps = 3/88 (3%)
Query: 77 IRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGY---DNIA 133
I+YG+G L +DT++IG L +K+Q G AI+E F FDG++G+G+ D
Sbjct: 185 IQYGTGECVLALGKDTVKIGGLKVKHQSIGLAIEESLHPFADLPFDGLVGLGFPDKDFKE 244
Query: 134 VDGVEPPFYNIIQQKLLEKNVFGFYLNR 161
P NI +Q LL++N+F FY+++
Sbjct: 245 SKKALPIVDNIKKQNLLKRNIFSFYMSK 272
Score = 65.9 bits (161), Expect = 6e-12
Identities = 22/57 (38%), Positives = 36/57 (63%), Gaps = 2/57 (3%)
Query: 13 FTLTPTDYVLKVSIAS--SSMCLSGFAGMDIPPPAGPIWILGDVFIGKFYTVFDMDN 67
F + P DYV++ + C+ G MD+P P GP+++LG+ FI K+Y++FD D+
Sbjct: 382 FDMDPEDYVIEEGDSEEQEHQCVIGIIPMDVPAPRGPLFVLGNNFIRKYYSIFDRDH 438
>gnl|CDD|140176 PTZ00147, PTZ00147, plasmepsin-1; Provisional.
Length = 453
Score = 62.6 bits (152), Expect = 8e-11
Identities = 34/100 (34%), Positives = 58/100 (58%), Gaps = 3/100 (3%)
Query: 69 QKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGF--TFVAAKFDGILG 126
+K+GT+ + Y SG VSG+ S+D + IG+L++ + F E GF + + FDGI G
Sbjct: 191 EKDGTKVEMNYVSGTVSGFFSKDLVTIGNLSVPYK-FIEVTDTNGFEPFYTESDFDGIFG 249
Query: 127 MGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYLNRYNTEK 166
+G+ ++++ V+P + Q +E+ VF FYL + K
Sbjct: 250 LGWKDLSIGSVDPYVVELKNQNKIEQAVFTFYLPPEDKHK 289
Score = 43.7 bits (103), Expect = 9e-05
Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 3/68 (4%)
Query: 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
+P ++ K +TL P Y+ + S++C+ +D+ +ILGD F+ K++
Sbjct: 378 LPTLEFRSPNKVYTLEPEYYLQPIEDIGSALCMLNIIPIDLEKNT---FILGDPFMRKYF 434
Query: 61 TVFDMDNN 68
TVFD DN+
Sbjct: 435 TVFDYDNH 442
>gnl|CDD|140051 PTZ00013, PTZ00013, plasmepsin 4 (PM4); Provisional.
Length = 450
Score = 60.0 bits (145), Expect = 4e-10
Identities = 31/95 (32%), Positives = 58/95 (61%), Gaps = 7/95 (7%)
Query: 69 QKNGTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFGEAIK----EPGFTFVAAKFDGI 124
+K+GT+ I YGSG V G+ S+D + +G L++ + F E EP ++ +++FDGI
Sbjct: 190 EKDGTKVDITYGSGTVKGFFSKDLVTLGHLSMPYK-FIEVTDTDDLEPIYS--SSEFDGI 246
Query: 125 LGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFGFYL 159
LG+G+ ++++ ++P + Q ++ +F FYL
Sbjct: 247 LGLGWKDLSIGSIDPIVVELKNQNKIDNALFTFYL 281
Score = 35.7 bits (82), Expect = 0.031
Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 6/77 (7%)
Query: 1 MPNVDIILGGKNFTLTPTDYVLKVSIASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
MP ++ +TL P Y+ + ++C+ +DI +ILGD F+ K++
Sbjct: 377 MPTLEFKSANNTYTLEPEYYMNPLLDVDDTLCMITMLPVDIDDNT---FILGDPFMRKYF 433
Query: 61 TVFDMDNNQKNGTQFAI 77
TVFD D K FAI
Sbjct: 434 TVFDYD---KESVGFAI 447
>gnl|CDD|133141 cd05474, SAP_like, SAPs, pepsin-like proteinases secreted from
pathogens to degrade host proteins. SAPs (Secreted
aspartic proteinases) are secreted from a group of
pathogenic fungi, predominantly Candida species. They
are secreted from the pathogen to degrade host proteins.
SAP is one of the most significant extracellular
hydrolytic enzymes produced by C. albicans. SAP
proteins, encoded by a family of 10 SAP genes. All 10
SAP genes of C. albicans encode preproenzymes,
approximately 60 amino acid longer than the mature
enzyme, which are processed when transported via the
secretory pathway. The mature enzymes contain sequence
motifs typical for all aspartyl proteinases, including
the two conserved aspartate residues other active site
and conserved cysteine residues implicated in the
maintenance of the three-dimensional structure. Most Sap
proteins contain putative N-glycosylation sites, but it
remains to be determined which Sap proteins are
glycosylated. This family of aspartate proteases is
classified by MEROPS as the peptidase family A1 (pepsin
A, clan AA). The overall structure of Sap protein
conforms to the classical aspartic proteinase fold
typified by pepsin. SAP is a bilobal enzyme, each lobe
contributing a catalytic Asp residue, with an extended
active site cleft localized between the two lobes of the
molecule. One lobe may be evolved from the other through
ancient gene-duplication event. More recently evolved
enzymes have similar three-dimensional structures,
however their amino acid sequences are more divergent
except for the conserved catalytic site motif. This
family of aspartate proteases is classified by MEROPS as
the peptidase family A1 (pepsin A, clan AA).
Length = 295
Score = 56.4 bits (137), Expect = 4e-09
Identities = 31/92 (33%), Positives = 41/92 (44%), Gaps = 13/92 (14%)
Query: 75 FAIRYGSG-AVSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGYD-NI 132
F+I YG G + SG DT+ IG T+KN F A G+LG+G N
Sbjct: 32 FSISYGDGTSASGTWGTDTVSIGGATVKNLQFAVANSTSSDV-------GVLGIGLPGNE 84
Query: 133 AVDGVEPPFYNIIQ----QKLLEKNVFGFYLN 160
A G + N Q L++KN + YLN
Sbjct: 85 ATYGTGYTYPNFPIALKKQGLIKKNAYSLYLN 116
Score = 39.5 bits (93), Expect = 0.002
Identities = 20/69 (28%), Positives = 29/69 (42%), Gaps = 7/69 (10%)
Query: 3 NVDIILGGKNFTLTPTDYVLKVSI--ASSSMCLSGFAGMDIPPPAGPIWILGDVFIGKFY 60
++ GG ++ +D VL S C G P ILGD F+ Y
Sbjct: 226 SLTFNFGGATISVPLSDLVLPASTDDGGDGACYLGIQ-----PSTSDYNILGDTFLRSAY 280
Query: 61 TVFDMDNNQ 69
V+D+DNN+
Sbjct: 281 VVYDLDNNE 289
>gnl|CDD|133161 cd06097, Aspergillopepsin_like, Aspergillopepsin_like, aspartic
proteases of fungal origin. The members of this family
are aspartic proteases of fungal origin, including
aspergillopepsin, rhizopuspepsin, endothiapepsin, and
rodosporapepsin. The various fungal species in this
family may be the most economically important genus of
fungi. They may serve as virulence factors or as
industrial aids. For example, Aspergillopepsin from A.
fumigatus is involved in invasive aspergillosis owing to
its elastolytic activity and Aspergillopepsins from the
mold A. saitoi are used in fermentation industry.
Aspartic proteinases are a group of proteolytic enzymes
in which the scissile peptide bond is attacked by a
nucleophilic water molecule activated by two aspartic
residues in a DT(S)G motif at the active site. They have
a similar fold composed of two beta-barrel domains.
Between the N-terminal and C-terminal domains, each of
which contributes one catalytic aspartic residue, there
is an extended active-site cleft capable of interacting
with multiple residues of a substrate. Although members
of the aspartic protease family of enzymes have very
similar three-dimensional structures and catalytic
mechanisms, each has unique substrate specificity. The
members of this family has an optimal acidic pH (5.5)
and cleaves protein substrates with similar specificity
to that of porcine pepsin A, preferring hydrophobic
residues at P1 and P1' in the cleave site. This family
of aspartate proteases is classified by MEROPS as the
peptidase family A1 (pepsin A, clan AA).
Length = 278
Score = 52.7 bits (127), Expect = 7e-08
Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 10/97 (10%)
Query: 71 NGTQFAIRYGSGA-VSGYLSQDTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGMGY 129
G ++I YG G+ SG + DT+ IG + + NQ A F DG+LG+ +
Sbjct: 55 PGATWSISYGDGSSASGIVYTDTVSIGGVEVPNQAIELATAVSASFFSDTASDGLLGLAF 114
Query: 130 DNIAVDGVEPP-----FYNIIQQKLLEKNVFGFYLNR 161
+I + V+PP F N + L+ +F L +
Sbjct: 115 SSI--NTVQPPKQKTFFENALSS--LDAPLFTADLRK 147
>gnl|CDD|133143 cd05476, pepsin_A_like_plant, Chroloplast Nucleoids DNA-binding
Protease and Nucellin, pepsin-like aspartic proteases
from plants. This family contains pepsin like aspartic
proteases from plants including Chloroplast Nucleoids
DNA-binding Protease and Nucellin. Chloroplast Nucleoids
DNA-binding Protease catalyzes the degradation of
ribulose-1,5-bisphosphate carboxylase/oxygenase
(Rubisco) in senescent leaves of tobacco and Nucellins
are important regulators of nucellar cell's progressive
degradation after ovule fertilization. Structurally,
aspartic proteases are bilobal enzymes, each lobe
contributing a catalytic Asp residue, with an extended
active site cleft localized between the two lobes of the
molecule. The N- and C-terminal domains, although
structurally related by a 2-fold axis, have only limited
sequence homology except the vicinity of the active
site. This suggests that the enzymes evolved by an
ancient duplication event. The enzymes specifically
cleave bonds in peptides which have at least six
residues in length with hydrophobic residues in both the
P1 and P1' positions. The active site is located at the
groove formed by the two lobes, with an extended loop
projecting over the cleft to form an 11-residue flap,
which encloses substrates and inhibitors in the active
site. Specificity is determined by nearest-neighbor
hydrophobic residues surrounding the catalytic
aspartates, and by three residues in the flap. The
enzymes are mostly secreted from cells as inactive
proenzymes that activate autocatalytically at acidic pH.
Length = 265
Score = 40.7 bits (96), Expect = 6e-04
Identities = 20/58 (34%), Positives = 30/58 (51%), Gaps = 5/58 (8%)
Query: 74 QFAIRYGSGAVS-GYLSQDTLRIGD--LTIKNQVFGEAIKEPGFTFVAAKFDGILGMG 128
+ YG G+ + G L+ +T GD +++ N FG G +F A DGILG+G
Sbjct: 32 SYEYSYGDGSSTSGVLATETFTFGDSSVSVPNVAFGCGTDNEGGSFGGA--DGILGLG 87
>gnl|CDD|133140 cd05473, beta_secretase_like, Beta-secretase, aspartic-acid
protease important in the pathogenesis of Alzheimer's
disease. Beta-secretase also called BACE (beta-site of
APP cleaving enzyme) or memapsin-2. Beta-secretase is an
aspartic-acid protease important in the pathogenesis of
Alzheimer's disease, and in the formation of myelin
sheaths in peripheral nerve cells. It cleaves amyloid
precursor protein (APP) to reveal the N-terminus of the
beta-amyloid peptides. The beta-amyloid peptides are the
major components of the amyloid plaques formed in the
brain of patients with Alzheimer's disease (AD). Since
BACE mediates one of the cleavages responsible for
generation of AD, it is regarded as a potential target
for pharmacological intervention in AD. Beta-secretase
is a member of pepsin family of aspartic proteases. Same
as other aspartic proteases, beta-secretase is a bilobal
enzyme, each lobe contributing a catalytic Asp residue,
with an extended active site cleft localized between the
two lobes of the molecule. The N- and C-terminal
domains, although structurally related by a 2-fold axis,
have only limited sequence homology except the vicinity
of the active site. This suggests that the enzymes
evolved by an ancient duplication event. The enzymes
specifically cleave bonds in peptides which have at
least six residues in length with hydrophobic residues
in both the P1 and P1' positions. The active site is
located at the groove formed by the two lobes, with an
extended loop projecting over the cleft to form an
11-residue flap, which encloses substrates and
inhibitors in the active site. Specificity is determined
by nearest-neighbor hydrophobic residues surrounding the
catalytic aspartates, and by three residues in the flap.
The enzymes are mostly secreted from cells as inactive
proenzymes that activate autocatalytically at acidic pH.
This family of aspartate proteases is classified by
MEROPS as the peptidase family A1 (pepsin A, clan AA).
Length = 364
Score = 35.1 bits (81), Expect = 0.042
Identities = 26/90 (28%), Positives = 39/90 (43%), Gaps = 9/90 (10%)
Query: 72 GTQFAIRYGSGAVSGYLSQDTLRIGDLTIKNQVFG---EAIKEP-GFTFVAAKFDGILGM 127
G + Y G+ G L D + I N F AI E F + ++GILG+
Sbjct: 54 GKGVTVPYTQGSWEGELGTDLVSIPKGP--NVTFRANIAAITESENFFLNGSNWEGILGL 111
Query: 128 GYDNIAV--DGVEPPFYNIIQQKLLEKNVF 155
Y +A VEP F ++++Q +VF
Sbjct: 112 AYAELARPDSSVEPFFDSLVKQ-TGIPDVF 140
>gnl|CDD|222288 pfam13650, Asp_protease_2, Aspartyl protease. This family consists
of predicted aspartic proteases, typically from 180 to
230 amino acids in length, in MEROPS clan AA. This model
describes the well-conserved 121-residue C-terminal
region. The poorly conserved, variable length N-terminal
region usually contains a predicted transmembrane helix.
Length = 89
Score = 28.8 bits (65), Expect = 1.1
Identities = 14/48 (29%), Positives = 23/48 (47%), Gaps = 8/48 (16%)
Query: 81 SGAVSGYLSQ-DTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGM 127
+G V L + D+L+IG +T++N + DG+LGM
Sbjct: 47 NGTVRAALVRLDSLKIGGITLRNVPA-------VVLDLGDLIDGLLGM 87
>gnl|CDD|188595 TIGR04080, rSAM_pep_cyc, KxxxW cyclic peptide radical SAM maturase.
Members of this family are radical SAM enzymes that
appear to perform a cyclization on an adjacent cognate
peptide from family TIGR04079. Genomes with the complete
system include Streptococcus thermophilus LMD-9 and
Lactococcus lactis subsp. cremoris MG1363, among others.
The gene symbol assigned is kwcM, for KxxxW Cyclic
peptide Maturase [Protein fate, Protein modification and
repair].
Length = 440
Score = 29.1 bits (65), Expect = 3.1
Identities = 17/74 (22%), Positives = 32/74 (43%), Gaps = 4/74 (5%)
Query: 142 YNIIQQKLLEKNVFGFYLNRYNTEKLANGEVPIPSAHDPKELESRIENKYQLPEYDHVGD 201
Y+I + LE+ V + L+ ++AN IE + E++ +
Sbjct: 291 YDIADLRKLEEKVRDYILDN----EIANFNFATEGCFLFTAFPELIEEGEEFSEFEEMYF 346
Query: 202 GCSASYTEMQQWTN 215
GC A +T+M+ +N
Sbjct: 347 GCRAKFTKMEILSN 360
>gnl|CDD|133150 cd05483, retropepsin_like_bacteria, Bacterial aspartate proteases,
retropepsin-like protease family. This family of
bacteria aspartate proteases is a subfamily of
retropepsin-like protease family, which includes enzymes
from retrovirus and retrotransposons. While fungal and
mammalian pepsin-like aspartate proteases are bilobal
proteins with structurally related N- and C-termini,
this family of bacteria aspartate proteases is half as
long as their fungal and mammalian counterparts. The
monomers are structurally related to one lobe of the
pepsin molecule and function as homodimers. The active
site aspartate occurs within a motif (Asp-Thr/Ser-Gly),
as it does in pepsin. In aspartate peptidases, Asp
residues are ligands of an activated water molecule in
all examples where catalytic residues have been
identified. This group of aspartate proteases is
classified by MEROPS as the peptidase family A2
(retropepsin family, clan AA), subfamily A2A.
Length = 96
Score = 27.6 bits (62), Expect = 3.7
Identities = 14/49 (28%), Positives = 23/49 (46%), Gaps = 7/49 (14%)
Query: 80 GSGAVSGYLSQ-DTLRIGDLTIKNQVFGEAIKEPGFTFVAAKFDGILGM 127
+G V + D+L+IG +T++N + V DG+LGM
Sbjct: 50 ANGRVRAARVRLDSLQIGGITLRNVPA--VVLPGDALGV----DGLLGM 92
>gnl|CDD|130293 TIGR01226, phe_am_lyase, phenylalanine ammonia-lyase. Members of
this family are found, so far, in plants and fungi. From
phenylalanine, this enzyme yields cinnaminic acid, a
precursor of many important plant compounds. This
protein shows extensive homology to histidine
ammonia-lyase, the first enzyme of a histidine
degradation pathway.
Length = 680
Score = 29.0 bits (65), Expect = 3.8
Identities = 23/106 (21%), Positives = 37/106 (34%), Gaps = 22/106 (20%)
Query: 139 PPFYNIIQQKLLEKNVFGFYLNRYNTEKLANGEVPIPSAHDPKELESRIENKYQLPEYDH 198
NI+ + K V G GE+ P K+L ++ + +D+
Sbjct: 509 STVKNIVSEVA--KKVLG---------MGLTGEL-HPKRFCEKDLLKVLDRETV---FDY 553
Query: 199 VGDGCSASYTEMQQWTNNIIEAINQGYAVTLEYPGVELKSLRILNK 244
+ D CSA+Y MQ+ ++E L E L K
Sbjct: 554 IDDPCSANYPLMQKLRQVLVEH-------ALSNGEREKDLLSSFQK 592
>gnl|CDD|99816 cd06220, DHOD_e_trans_like2, FAD/NAD binding domain in the electron
transfer subunit of dihydroorotate dehydrogenase-like
proteins. Dihydroorotate dehydrogenases (DHODs) catalyze
the only redox reaction in pyrimidine de novo
biosynthesis. They catalyze the oxidation of
(S)-dihydroorotate to orotate coupled with the reduction
of NAD+. In L. lactis, DHOD B (encoded by pyrDa) is
co-expressed with pyrK and both gene products are
required for full activity, as well as 3 cofactors: FMN,
FAD, and an [2Fe-2S] cluster.
Length = 233
Score = 28.4 bits (64), Expect = 5.2
Identities = 16/48 (33%), Positives = 21/48 (43%), Gaps = 2/48 (4%)
Query: 116 FVAAKFDGILGMGYDNIAVDGVEPPFYNIIQQKLLEKNVFG-FYLNRY 162
FV + YD I V G E Y ++ + L E+ V F L RY
Sbjct: 154 FVTDLLKELDLEEYDAIYVCGPEIMMYKVL-EILDERGVRAQFSLERY 200
>gnl|CDD|176543 cd08601, GDPD_SaGlpQ_like, Glycerophosphodiester phosphodiesterase
domain of Staphylococcus aureus and similar proteins.
This subfamily corresponds to the glycerophosphodiester
phosphodiesterase domain (GDPD) present in
uncharacterized glycerophosphodiester phosphodiesterase
(GP-GDE, EC 3.1.4.46) from Staphylococcus aureus,
Bacillus subtilis and similar proteins. Members in this
family show very high sequence similarity to Escherichia
coli periplasmic phosphodiesterase GlpQ, which catalyzes
the Ca2+-dependent degradation of periplasmic
glycerophosphodiesters to produce
sn-glycerol-3-phosphate (G3P) and the corresponding
alcohols.
Length = 256
Score = 28.1 bits (63), Expect = 5.7
Identities = 11/28 (39%), Positives = 14/28 (50%)
Query: 231 YPGVELKSLRILNKQGPTDQLETFWQLI 258
YPG+E K L L+K G Q+I
Sbjct: 124 YPGMEEKLLATLDKYGLLTDNLKNGQVI 151
>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
Length = 977
Score = 28.4 bits (63), Expect = 6.5
Identities = 13/28 (46%), Positives = 18/28 (64%), Gaps = 2/28 (7%)
Query: 294 NSNETKTGIMRIFLG--AKNDIEGNELP 319
N+ G++++FLG DIEGNELP
Sbjct: 392 NNTRDHPGMIQVFLGETGARDIEGNELP 419
>gnl|CDD|215881 pfam00372, Hemocyanin_M, Hemocyanin, copper containing domain.
This family includes arthropod hemocyanins and insect
larval storage proteins.
Length = 278
Score = 27.7 bits (62), Expect = 8.5
Identities = 19/82 (23%), Positives = 34/82 (41%), Gaps = 10/82 (12%)
Query: 159 LNRYNTEKLANGEVPIPSAHDPKELES------RIENKY---QLPEYDHVGD-GCSASYT 208
L RY+ E+L+NG IP ++ + + P+ H+ D
Sbjct: 113 LARYDLERLSNGLGRIPPFSWFDPIKEGYAPLTSLVSGVEFPVRPDNYHIHDVDNVDVVR 172
Query: 209 EMQQWTNNIIEAINQGYAVTLE 230
+M +W I++AI +GY +
Sbjct: 173 DMDRWEERILDAIQKGYIEAFD 194
>gnl|CDD|223047 PHA03332, PHA03332, membrane glycoprotein; Provisional.
Length = 1328
Score = 28.0 bits (62), Expect = 8.9
Identities = 12/40 (30%), Positives = 17/40 (42%), Gaps = 1/40 (2%)
Query: 21 VLKVSIASSSMCLSGFAGMDIPPPAGPIWI-LGDVFIGKF 59
VL + +S CL + G+ P P P LG + G
Sbjct: 200 VLSILRGTSYTCLLSYVGISEPEPNSPCVTTLGPMCYGPL 239
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.139 0.410
Gapped
Lambda K H
0.267 0.0740 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,721,488
Number of extensions: 1748788
Number of successful extensions: 1469
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1452
Number of HSP's successfully gapped: 58
Length of query: 336
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 239
Effective length of database: 6,635,264
Effective search space: 1585828096
Effective search space used: 1585828096
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.5 bits)