RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3212
(177 letters)
>gnl|CDD|133152 cd05485, Cathepsin_D_like, Cathepsin_D_like, pepsin family of
proteinases. Cathepsin D is the major aspartic
proteinase of the lysosomal compartment where it
functions in protein catabolism. It is a member of the
pepsin family of proteinases. This enzyme is
distinguished from other members of the pepsin family by
two features that are characteristic of lysosomal
hydrolases. First, mature Cathepsin D is found
predominantly in a two-chain form due to a
posttranslational cleavage event. Second, it contains
phosphorylated, N-linked oligosaccharides that target
the enzyme to lysosomes via mannose-6-phosphate
receptors. Cathepsin D preferentially attacks peptide
bonds flanked by bulky hydrophobic amino acids and its
pH optimum is between pH 2.8 and 4.0. Two active site
aspartic acid residues are essential for the catalytic
activity of aspartic proteinases. Like other aspartic
proteinases, Cathepsin D is a bilobed molecule; the two
evolutionary related lobes are mostly made up of
beta-sheets and flank a deep active site cleft. Each of
the two related lobes contributes one active site
aspartic acid residue and contains a single carbohydrate
group. Cathepsin D is an essential enzyme. Mice
deficient for proteinase cathepsin D, generated by gene
targeting, develop normally during the first 2 weeks,
stop thriving in the third week and die in a state of
anorexia in the fourth week. The mice develop atrophy of
ileal mucosa followed by other degradation of intestinal
organs. In these knockout mice, lysosomal proteolysis
was normal. These results suggest that vital functions
of cathepsin D are exerted by limited proteolysis of
proteins regulating cell growth and/or tissue
homeostasis, while its contribution to bulk proteolysis
in lysosomes appears to be non-critical. This family of
aspartate proteases is classified by MEROPS as the
peptidase family A1 (pepsin A, clan AA).
Length = 329
Score = 141 bits (356), Expect = 2e-41
Identities = 55/115 (47%), Positives = 78/115 (67%), Gaps = 1/115 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ S + GGE++ GG D + G+ TY PV+RKGYWQF ++S+ + + +CS CQAIAD
Sbjct: 157 RDPSAKEGGELILGGSDPKHYTGNFTYLPVTRKGYWQFKMDSVSVGEGEFCSGGCQAIAD 216
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSLI GP I +LN IGA P+ G V+C + +P++ +LGGK+F+LT
Sbjct: 217 TGTSLIAGPVDEIEKLNNAIGAKPIIGGEYMVNCSAIPSLPDITFVLGGKSFSLT 271
Score = 80.3 bits (198), Expect = 2e-18
Identities = 31/41 (75%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I++GTPPQ FKV+FDTGSSNLW+PS+ CS NIAC
Sbjct: 10 AQYYGVITIGTPPQSFKVVFDTGSSNLWVPSKKCSWTNIAC 50
>gnl|CDD|133162 cd06098, phytepsin, Phytepsin, a plant homolog of mammalian
lysosomal pepsins. Phytepsin, a plant homolog of
mammalian lysosomal pepsins, resides in grains, roots,
stems, leaves and flowers. Phytepsin may participate in
metabolic turnover and in protein processing events. In
addition, it highly expressed in several plant tissues
undergoing apoptosis. Phytepsin contains an internal
region consisting of about 100 residues not present in
animal or microbial pepsins. This region is thus called
a plant specific insert. The insert is highly similar to
saponins, which are lysosomal sphingolipid-activating
proteins in mammalian cells. The saponin-like domain may
have a role in the vacuolar targeting of phytepsin.
Phytepsin, as its animal counterparts, possesses a
topology typical of all aspartic proteases. They are
bilobal enzymes, each lobe contributing a catalytic Asp
residue, with an extended active site cleft localized
between the two lobes of the molecule. One lobe has
probably evolved from the other through a gene
duplication event in the distant past. This family of
aspartate proteases is classified by MEROPS as the
peptidase family A1 (pepsin A, clan AA).
Length = 317
Score = 119 bits (299), Expect = 5e-33
Identities = 52/117 (44%), Positives = 72/117 (61%), Gaps = 15/117 (12%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
RN +E GGE++FGGVD F G+ TY PV+RKGYWQF + + I +C+ C AI
Sbjct: 155 RNPDEEEGGELVFGGVDPKHFKGEHTYVPVTRKGYWQFEMGDVLIGGKSTGFCAGGCAAI 214
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
AD+GTSL+ GP+ ++ ++N + VDC++L MPNV +GGK F LT
Sbjct: 215 ADSGTSLLAGPTTIVTQIN------------SAVDCNSLSSMPNVSFTIGGKTFELT 259
Score = 76.6 bits (189), Expect = 4e-17
Identities = 28/41 (68%), Positives = 33/41 (80%), Gaps = 1/41 (2%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G I +GTPPQ+F VIFDTGSSNLW+PS C +IAC
Sbjct: 9 AQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKC-YFSIAC 48
>gnl|CDD|215663 pfam00026, Asp, Eukaryotic aspartyl protease. Aspartyl (acid)
proteases include pepsins, cathepsins, and renins.
Two-domain structure, probably arising from ancestral
duplication. This family does not include the retroviral
nor retrotransposon proteases (pfam00077), which are
much smaller and appear to be homologous to a single
domain of the eukaryotic asp proteases.
Length = 316
Score = 115 bits (290), Expect = 1e-31
Identities = 52/128 (40%), Positives = 80/128 (62%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D F S + N+ D GGEI+FGGVD K+ G +T+ PV+ +GYWQ ++SI +
Sbjct: 137 IDSPAF---SVYLNSDDAGGGEIIFGGVDPSKYTGSLTWVPVTSQGYWQITLDSITVGGS 193
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+CS+ CQAI DTGTSL+ GP+ +++++ K +GA G VDCD++ +P+V +
Sbjct: 194 ATFCSSGCQAILDTGTSLLYGPTSIVSKIAKAVGASLSEYGGYVVDCDSISSLPDVTFFI 253
Query: 167 GGKNFTLT 174
GG T+
Sbjct: 254 GGAKITVP 261
Score = 71.5 bits (176), Expect = 2e-15
Identities = 26/41 (63%), Positives = 34/41 (82%), Gaps = 1/41 (2%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+YYGTIS+GTPPQ+F V+FDTGSS+LW+PS +C+ AC
Sbjct: 1 EYYGTISIGTPPQKFTVVFDTGSSDLWVPSVYCTSSY-ACK 40
>gnl|CDD|133145 cd05478, pepsin_A, Pepsin A, aspartic protease produced in gastric
mucosa of mammals. Pepsin, a well-known aspartic
protease, is produced by the human gastric mucosa in
seven different zymogen isoforms, subdivided into two
types: pepsinogen A and pepsinogen C. The prosequence of
the zymogens are self cleaved under acidic pH. The
mature enzymes are called pepsin A and pepsin C,
correspondingly. The well researched porcine pepsin is
also in this pepsin A family. Pepsins play an integral
role in the digestion process of vertebrates. Pepsins
are bilobal enzymes, each lobe contributing a catalytic
Asp residue, with an extended active site cleft
localized between the two lobes of the molecule. One
lobe may be evolved from the other through ancient
gene-duplication event. More recently evolved enzymes
have similar three-dimensional structures, however their
amino acid sequences are more divergent except for the
conserved catalytic site motif. Pepsins specifically
cleave bonds in peptides which have at least six
residues in length with hydrophobic residues in both the
P1 and P1' positions. The active site is located at the
groove formed by the two lobes, with an extended loop
projecting over the cleft to form an 11-residue flap,
which encloses substrates and inhibitors in the active
site. Specificity is determined by nearest-neighbor
hydrophobic residues surrounding the catalytic
aspartates, and by three residues in the flap. This
family of aspartate proteases is classified by MEROPS as
the peptidase family A1 (pepsin A, clan AA).
Length = 317
Score = 111 bits (279), Expect = 5e-30
Identities = 46/128 (35%), Positives = 70/128 (54%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
V +D F S + +++ + G + FGG+D + G + + PV+ + YWQ V+S+ I
Sbjct: 143 VSQDLF---SVYLSSNGQQGSVVTFGGIDPSYYTGSLNWVPVTAETYWQITVDSVTINGQ 199
Query: 109 VY--CSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V CQAI DTGTSL++GPS IA + IGA NG V+C ++ MP+V +
Sbjct: 200 VVACSGGCQAIVDTGTSLLVGPSSDIANIQSDIGASQNQNGEMVVNCSSISSMPDVVFTI 259
Query: 167 GGKNFTLT 174
G + L
Sbjct: 260 NGVQYPLP 267
Score = 79.4 bits (196), Expect = 4e-18
Identities = 31/48 (64%), Positives = 39/48 (81%), Gaps = 3/48 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+YYGTIS+GTPPQ+F VIFDTGSSNLW+PS +CS ACS + ++F
Sbjct: 9 MEYYGTISIGTPPQDFTVIFDTGSSNLWVPSVYCSSQ--ACS-NHNRF 53
>gnl|CDD|133157 cd05490, Cathepsin_D2, Cathepsin_D2, pepsin family of proteinases.
Cathepsin D is the major aspartic proteinase of the
lysosomal compartment where it functions in protein
catabolism. It is a member of the pepsin family of
proteinases. This enzyme is distinguished from other
members of the pepsin family by two features that are
characteristic of lysosomal hydrolases. First, mature
Cathepsin D is found predominantly in a two-chain form
due to a posttranslational cleavage event. Second, it
contains phosphorylated, N-linked oligosaccharides that
target the enzyme to lysosomes via mannose-6-phosphate
receptors. Cathepsin D preferentially attacks peptide
bonds flanked by bulky hydrophobic amino acids and its
pH optimum is between pH 2.8 and 4.0. Two active site
aspartic acid residues are essential for the catalytic
activity of aspartic proteinases. Like other aspartic
proteinases, Cathepsin D is a bilobed molecule; the two
evolutionary related lobes are mostly made up of
beta-sheets and flank a deep active site cleft. Each of
the two related lobes contributes one active site
aspartic acid residue and contains a single carbohydrate
group. Cathepsin D is an essential enzyme. Mice
deficient for proteinase cathepsin D, generated by gene
targeting, develop normally during the first 2 weeks,
stop thriving in the third week and die in a state of
anorexia in the fourth week. The mice develop atrophy of
ileal mucosa followed by other degradation of intestinal
organs. In these knockout mice, lysosomal proteolysis
was normal. These results suggest that vital functions
of cathepsin D are exerted by limited proteolysis of
proteins regulating cell growth and/or tissue
homeostasis, while its contribution to bulk proteolysis
in lysosomes appears to be non-critical. This family of
aspartate proteases is classified by MEROPS as the
peptidase family A1 (pepsin A, clan AA).
Length = 325
Score = 109 bits (275), Expect = 2e-29
Identities = 48/116 (41%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YC-SNCQAIA 118
R+ + GGE+M GG D + GD+ Y V+RK YWQ ++ + + + C C+AI
Sbjct: 152 RDPDAQPGGELMLGGTDPKYYTGDLHYVNVTRKAYWQIHMDQVDVGSGLTLCKGGCEAIV 211
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP + + L K IGAVPL G +DC+ + +P + LGGK + LT
Sbjct: 212 DTGTSLITGPVEEVRALQKAIGAVPLIQGEYMIDCEKIPTLPVISFSLGGKVYPLT 267
Score = 86.4 bits (214), Expect = 1e-20
Identities = 31/43 (72%), Positives = 36/43 (83%)
Query: 5 YQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +GTPPQ F V+FDTGSSNLW+PS HCS+L+IAC
Sbjct: 3 MDAQYYGEIGIGTPPQTFTVVFDTGSSNLWVPSVHCSLLDIAC 45
>gnl|CDD|133155 cd05488, Proteinase_A_fungi, Fungal Proteinase A , aspartic
proteinase superfamily. Fungal Proteinase A, a
proteolytic enzyme distributed among a variety of
organisms, is a member of the aspartic proteinase
superfamily. In Saccharomyces cerevisiae, targeted to
the vacuole as a zymogen, activation of proteinases A at
acidic pH can occur by two different pathways: a
one-step process to release mature proteinase A,
involving the intervention of proteinase B, or a
step-wise pathway via the auto-activation product known
as pseudo-proteinase A. Once active, S. cerevisiae
proteinase A is essential to the activities of other
yeast vacuolar hydrolases, including proteinase B and
carboxypeptidase Y. The mature enzyme is bilobal, with
each lobe providing one of the two catalytically
essential aspartic acid residues in the active site. The
crystal structure of free proteinase A shows that flap
loop is atypically pointing directly into the S(1)
pocket of the enzyme. Proteinase A preferentially
hydrolyzes hydrophobic residues such as Phe, Leu or Glu
at the P1 position and Phe, Ile, Leu or Ala at P1'.
Moreover, the enzyme is inhibited by IA3, a natural and
highly specific inhibitor produced by S. cerevisiae.
This family of aspartate proteases is classified by
MEROPS as the peptidase family A1 (pepsin A, clan AA).
Length = 320
Score = 109 bits (273), Expect = 3e-29
Identities = 52/127 (40%), Positives = 73/127 (57%), Gaps = 4/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ F S + +S+E+GGE FGG+D+ +F G IT+ PV RK YW+ +E I + ++
Sbjct: 143 LDEPVF---SFYLGSSEEDGGEATFGGIDESRFTGKITWLPVRRKAYWEVELEKIGLGDE 199
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
+ N A DTGTSLI PS + LN IGA NG VDC +D +P++
Sbjct: 200 ELELENTGAAIDTGTSLIALPSDLAEMLNAEIGAKKSWNGQYTVDCSKVDSLPDLTFNFD 259
Query: 168 GKNFTLT 174
G NFTL
Sbjct: 260 GYNFTLG 266
Score = 75.6 bits (186), Expect = 1e-16
Identities = 37/86 (43%), Positives = 52/86 (60%), Gaps = 8/86 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ I+LGTPPQ+FKVI DTGSSNLW+PS C +IAC + D ++ N +
Sbjct: 9 AQYFTDITLGTPPQKFKVILDTGSSNLWVPSVKCG--SIACFLHSKYDSSASSTYKANGT 66
Query: 65 D---ENGGEIMFGGVDKDKF-VGDIT 86
+ + G + G V +D +GD+T
Sbjct: 67 EFKIQYGSGSLEGFVSQDTLSIGDLT 92
>gnl|CDD|133153 cd05486, Cathespin_E, Cathepsin E, non-lysosomal aspartic protease.
Cathepsin E is an intracellular, non-lysosomal aspartic
protease expressed in a variety of cells and tissues.
The protease has proposed physiological roles in antigen
presentation by the MHC class II system, in the
biogenesis of the vasoconstrictor peptide endothelin,
and in neurodegeneration associated with brain ischemia
and aging. Cathepsin E is the only A1 aspartic protease
that exists as a homodimer with a disulfide bridge
linking the two monomers. Like many other aspartic
proteases, it is synthesized as a zymogen which is
catalytically inactive towards its natural substrates at
neutral pH and which auto-activates in an acidic
environment. The overall structure follows the general
fold of aspartic proteases of the A1 family, it is
composed of two structurally similar beta barrel lobes,
each lobe contributing an aspartic acid residue to form
a catalytic dyad that acts to cleave the substrate
peptide bond. The catalytic Asp residues are contained
in an Asp-Thr-Gly-Ser/thr motif in both N- and
C-terminal lobes of the enzyme. The aspartic acid
residues act together to allow a water molecule to
attack the peptide bond. One aspartic acid residue (in
its deprotonated form) activates the attacking water
molecule, whereas the other aspartic acid residue (in
its protonated form) polarizes the peptide carbonyl,
increasing its susceptibility to attack. This family of
aspartate proteases is classified by MEROPS as the
peptidase family A1 (pepsin A, clan AA).
Length = 316
Score = 107 bits (269), Expect = 1e-28
Identities = 50/116 (43%), Positives = 73/116 (62%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN + +GGE++FGG D +F G + + PV+ +GYWQ +++I++ V +CS+ CQAI
Sbjct: 144 RNPNSADGGELVFGGFDTSRFSGQLNWVPVTVQGYWQIQLDNIQVGGTVIFCSDGCQAIV 203
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS I +L IGA +G VDC L MP+V + G ++L+
Sbjct: 204 DTGTSLITGPSGDIKQLQNYIGATA-TDGEYGVDCSTLSLMPSVTFTINGIPYSLS 258
Score = 64.9 bits (158), Expect = 6e-13
Identities = 24/33 (72%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
Y+G IS+GTPPQ F VIFDTGSSNLW+PS +C+
Sbjct: 1 YFGQISIGTPPQNFTVIFDTGSSNLWVPSIYCT 33
>gnl|CDD|133138 cd05471, pepsin_like, Pepsin-like aspartic proteases, bilobal
enzymes that cleave bonds in peptides at acidic pH.
Pepsin-like aspartic proteases are found in mammals,
plants, fungi and bacteria. These well known and
extensively characterized enzymes include pepsins,
chymosin, renin, cathepsins, and fungal aspartic
proteases. Several have long been known to be medically
(renin, cathepsin D and E, pepsin) or commercially
(chymosin) important. Structurally, aspartic proteases
are bilobal enzymes, each lobe contributing a catalytic
Aspartate residue, with an extended active site cleft
localized between the two lobes of the molecule. The N-
and C-terminal domains, although structurally related by
a 2-fold axis, have only limited sequence homology
except the vicinity of the active site. This suggests
that the enzymes evolved by an ancient duplication
event. Most members of the pepsin family specifically
cleave bonds in peptides that are at least six residues
in length, with hydrophobic residues in both the P1 and
P1' positions. The active site is located at the groove
formed by the two lobes, with an extended loop
projecting over the cleft to form an 11-residue flap,
which encloses substrates and inhibitors in the active
site. Specificity is determined by nearest-neighbor
hydrophobic residues surrounding the catalytic
aspartates, and by three residues in the flap.The
enzymes are mostly secreted from cells as inactive
proenzymes that activate autocatalytically at acidic pH.
This family of aspartate proteases is classified by
MEROPS as the peptidase family A1 (pepsin A, clan AA).
Length = 283
Score = 97.5 bits (243), Expect = 4e-25
Identities = 45/117 (38%), Positives = 63/117 (53%), Gaps = 10/117 (8%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPV--SRKGYWQFGVESIKI---EKNVYCSNCQ 115
R+ NGGE+ FGG+D K+ GD+TY+PV + GYWQ ++ I +
Sbjct: 145 RDGDGGNGGELTFGGIDPSKYTGDLTYTPVVSNGPGYWQVPLDGISVGGKSVISSSGGGG 204
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGA-VPLANGPAKVDCDNLDKMPNVDI----ILG 167
AI D+GTSLI PS V + K +GA V ++G VDC D +P++ ILG
Sbjct: 205 AIVDSGTSLIYLPSSVYDAILKALGAAVSSSDGGYGVDCSPCDTLPDITFTFLWILG 261
Score = 68.2 bits (167), Expect = 3e-14
Identities = 23/40 (57%), Positives = 30/40 (75%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
YYG I++GTPPQ+F VIFDTGSS LW+PS +C+ +
Sbjct: 1 YYGEITIGTPPQKFSVIFDTGSSLLWVPSSNCTSCSCQKH 40
>gnl|CDD|133144 cd05477, gastricsin, Gastricsins, asparate proteases produced in
gastric mucosa. Gastricsin is also called pepsinogen C.
Gastricsins are produced in gastric mucosa of mammals.
It is synthesized by the chief cells in the stomach as
an inactive zymogen. It is self-converted to a mature
enzyme under acidic conditions. Human gastricsin is
distributed throughout all parts of the stomach.
Gastricsin is synthesized as an inactive progastricsin
that has an approximately 40 residue prosequence. It is
self-converting to a mature enzyme being triggered by a
drop in pH from neutrality to acidic conditions. Like
other aspartic proteases, gastricsin are characterized
by two catalytic aspartic residues at the active site,
and display optimal activity at acidic pH. Mature enzyme
has a pseudo-2-fold symmetry that passes through the
active site between the catalytic aspartate residues.
Structurally, aspartic proteases are bilobal enzymes,
each lobe contributing a catalytic aspartate residue,
with an extended active site cleft localized between the
two lobes of the molecule. One lobe may be evolved from
the other through ancient gene-duplication event.
Although the three-dimensional structures of the two
lobes are very similar, the amino acid sequences are
more divergent, except for the conserved catalytic site
motif. This family of aspartate proteases is classified
by MEROPS as the peptidase family A1 (pepsin A, clan
AA).
Length = 318
Score = 96.9 bits (241), Expect = 1e-24
Identities = 43/115 (37%), Positives = 69/115 (60%), Gaps = 3/115 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIA 118
+ GGE++FGGVD + + G I ++PV+ + YWQ G++ +I + +CS CQAI
Sbjct: 147 GQQGQQGGELVFGGVDNNLYTGQIYWTPVTSETYWQIGIQGFQINGQATGWCSQGCQAIV 206
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ P +V++ L + IGA G V+C+N+ +P + + G +F L
Sbjct: 207 DTGTSLLTAPQQVMSTLMQSIGAQQDQYGQYVVNCNNIQNLPTLTFTINGVSFPL 261
Score = 63.4 bits (154), Expect = 2e-12
Identities = 24/34 (70%), Positives = 27/34 (79%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F V+FDTGSSNLW+PS C
Sbjct: 2 MSYYGEISIGTPPQNFLVLFDTGSSNLWVPSVLC 35
>gnl|CDD|133154 cd05487, renin_like, Renin stimulates production of angiotensin and
thus affects blood pressure. Renin, also known as
angiotensinogenase, is a circulating enzyme that
participates in the renin-angiotensin system that
mediates extracellular volume, arterial
vasoconstriction, and consequently mean arterial blood
pressure. The enzyme is secreted by the kidneys from
specialized juxtaglomerular cells in response to
decreases in glomerular filtration rate (a consequence
of low blood volume), diminished filtered sodium
chloride and sympathetic nervous system innervation. The
enzyme circulates in the blood stream and hydrolyzes
angiotensinogen secreted from the liver into the peptide
angiotensin I. Angiotensin I is further cleaved in the
lungs by endothelial bound angiotensin converting enzyme
(ACE) into angiotensin II, the final active peptide.
Renin is a member of the aspartic protease family.
Structurally, aspartic proteases are bilobal enzymes,
each lobe contributing a catalytic Aspartate residue,
with an extended active site cleft localized between the
two lobes of the molecule. The N- and C-terminal
domains, although structurally related by a 2-fold axis,
have only limited sequence homology except the vicinity
of the active site. This suggests that the enzymes
evolved by an ancient duplication event. The active site
is located at the groove formed by the two lobes, with
an extended loop projecting over the cleft to form an
11-residue flap, which encloses substrates and
inhibitors in the active site. Specificity is determined
by nearest-neighbor hydrophobic residues surrounding the
catalytic aspartates, and by three residues in the flap.
The enzymes are mostly secreted from cells as inactive
proenzymes that activate autocatalytically at acidic pH.
This family of aspartate proteases is classified by
MEROPS as the peptidase family A1 (pepsin A, clan AA).
Length = 326
Score = 87.5 bits (217), Expect = 4e-21
Identities = 51/176 (28%), Positives = 85/176 (48%), Gaps = 21/176 (11%)
Query: 7 AQYYGTISLGTPPQEFKVI---FDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNA 63
A++ G + +G P Q + FD I SQ V K+ + R++
Sbjct: 111 AKFDGVLGMGYPKQAIGGVTPVFDN------IMSQG---------VLKEDVFSVYYSRDS 155
Query: 64 SDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIADTG 121
S GGEI+ GG D + GD Y S+ G+WQ ++ + + + C + C A+ DTG
Sbjct: 156 SHSLGGEIVLGGSDPQHYQGDFHYINTSKTGFWQIQMKGVSVGSSTLLCEDGCTAVVDTG 215
Query: 122 TSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
S I GP+ I++L + +GA V C+ + +P++ LGGK +TL+ ++
Sbjct: 216 ASFISGPTSSISKLMEALGAKERLGDYV-VKCNEVPTLPDISFHLGGKEYTLSSSD 270
Score = 76.7 bits (189), Expect = 4e-17
Identities = 38/70 (54%), Positives = 44/70 (62%), Gaps = 13/70 (18%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
QYYG I +GTPPQ FKV+FDTGSSNLW+PS CS L AC V H+ + +ASD
Sbjct: 7 TQYYGEIGIGTPPQTFKVVFDTGSSNLWVPSSKCSPLYTAC-------VTHNLY-DASDS 58
Query: 66 ----ENGGEI 71
ENG E
Sbjct: 59 STYKENGTEF 68
>gnl|CDD|240300 PTZ00165, PTZ00165, aspartyl protease; Provisional.
Length = 482
Score = 69.8 bits (171), Expect = 2e-14
Identities = 23/34 (67%), Positives = 28/34 (82%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
QY+G I +GTPP+ F V+FDTGSSNLWIPS+ C
Sbjct: 120 QYFGEIQVGTPPKSFVVVFDTGSSNLWIPSKECK 153
Score = 69.0 bits (169), Expect = 4e-14
Identities = 40/104 (38%), Positives = 54/104 (51%), Gaps = 15/104 (14%)
Query: 69 GEIMFGGVDKDKFV--GD-ITYSPVSRKGYWQFGVESIKIE-KNV-YCSN-CQAIADTGT 122
G I FG D K+ G I + PV YW+ V I I+ K++ +C C+A DTG+
Sbjct: 278 GSISFGSADP-KYTLEGHKIWWFPVISTDYWEIEVVDILIDGKSLGFCDRKCKAAIDTGS 336
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
SLI GPS VI N L+ +PL DC N D +P + +L
Sbjct: 337 SLITGPSSVI---NPLLEKIPLEE-----DCSNKDSLPRISFVL 372
>gnl|CDD|133161 cd06097, Aspergillopepsin_like, Aspergillopepsin_like, aspartic
proteases of fungal origin. The members of this family
are aspartic proteases of fungal origin, including
aspergillopepsin, rhizopuspepsin, endothiapepsin, and
rodosporapepsin. The various fungal species in this
family may be the most economically important genus of
fungi. They may serve as virulence factors or as
industrial aids. For example, Aspergillopepsin from A.
fumigatus is involved in invasive aspergillosis owing to
its elastolytic activity and Aspergillopepsins from the
mold A. saitoi are used in fermentation industry.
Aspartic proteinases are a group of proteolytic enzymes
in which the scissile peptide bond is attacked by a
nucleophilic water molecule activated by two aspartic
residues in a DT(S)G motif at the active site. They have
a similar fold composed of two beta-barrel domains.
Between the N-terminal and C-terminal domains, each of
which contributes one catalytic aspartic residue, there
is an extended active-site cleft capable of interacting
with multiple residues of a substrate. Although members
of the aspartic protease family of enzymes have very
similar three-dimensional structures and catalytic
mechanisms, each has unique substrate specificity. The
members of this family has an optimal acidic pH (5.5)
and cleaves protein substrates with similar specificity
to that of porcine pepsin A, preferring hydrophobic
residues at P1 and P1' in the cleave site. This family
of aspartate proteases is classified by MEROPS as the
peptidase family A1 (pepsin A, clan AA).
Length = 278
Score = 61.9 bits (151), Expect = 6e-12
Identities = 24/62 (38%), Positives = 37/62 (59%), Gaps = 3/62 (4%)
Query: 73 FGGVDKDKFVGDITYSPVSR-KGYWQFGVESIKI--EKNVYCSNCQAIADTGTSLIIGPS 129
FG +D+ K+ G+I+++PV G+WQF S + + S AIADTGT+LI+ P
Sbjct: 155 FGYIDESKYKGEISWTPVDNSSGFWQFTSTSYTVGGDAPWSRSGFSAIADTGTTLILLPD 214
Query: 130 KV 131
+
Sbjct: 215 AI 216
Score = 46.1 bits (110), Expect = 2e-06
Identities = 14/38 (36%), Positives = 19/38 (50%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIA 46
Y + +GTPPQ + DTGSS+LW+ S
Sbjct: 1 YLTPVKIGTPPQTLNLDLDTGSSDLWVFSSETPAAQQG 38
>gnl|CDD|133137 cd05470, pepsin_retropepsin_like, Cellular and retroviral
pepsin-like aspartate proteases. This family includes
both cellular and retroviral pepsin-like aspartate
proteases. The cellular pepsin and pepsin-like enzymes
are twice as long as their retroviral counterparts. The
cellular pepsin-like aspartic proteases are found in
mammals, plants, fungi and bacteria. These well known
and extensively characterized enzymes include pepsins,
chymosin, rennin, cathepsins, and fungal aspartic
proteases. Several have long been known to be medically
(rennin, cathepsin D and E, pepsin) or commercially
(chymosin) important. The eukaryotic pepsin-like
proteases contain two domains possessing similar
topological features. The N- and C-terminal domains,
although structurally related by a 2-fold axis, have
only limited sequence homology except in the vicinity
of the active site. This suggests that the enzymes
evolved by an ancient duplication event. The eukaryotic
pepsin-like proteases have two active site ASP residues
with each N- and C-terminal lobe contributing one
residue. While the fungal and mammalian pepsins are
bilobal proteins, retropepsins function as dimers and
the monomer resembles structure of the N- or C-terminal
domains of eukaryotic enzyme. The active site motif
(Asp-Thr/Ser-Gly-Ser) is conserved between the
retroviral and eukaryotic proteases and between the
N-and C-terminal of eukaryotic pepsin-like proteases.
The retropepsin-like family includes pepsin-like
aspartate proteases from retroviruses, retrotransposons
and retroelements; as well as eukaryotic
DNA-damage-inducible proteins (DDIs), and bacterial
aspartate peptidases. Retropepsin is synthesized as
part of the POL polyprotein that contains an
aspartyl-protease, a reverse transcriptase, RNase H,
and an integrase. The POL polyprotein undergoes
specific enzymatic cleavage to yield the mature
proteins. This family of aspartate proteases is
classified by MEROPS as the peptidase family A1 (pepsin
A) and A2 (retropepsin family).
Length = 109
Score = 57.4 bits (139), Expect = 2e-11
Identities = 24/55 (43%), Positives = 29/55 (52%), Gaps = 9/55 (16%)
Query: 11 GTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD 65
I +GTPPQ F V+ DTGSSNLW+PS C L I HS + + S
Sbjct: 1 IEIGIGTPPQTFNVLLDTGSSNLWVPSVDCQSLAIYS---------HSSYDDPSA 46
>gnl|CDD|133141 cd05474, SAP_like, SAPs, pepsin-like proteinases secreted from
pathogens to degrade host proteins. SAPs (Secreted
aspartic proteinases) are secreted from a group of
pathogenic fungi, predominantly Candida species. They
are secreted from the pathogen to degrade host
proteins. SAP is one of the most significant
extracellular hydrolytic enzymes produced by C.
albicans. SAP proteins, encoded by a family of 10 SAP
genes. All 10 SAP genes of C. albicans encode
preproenzymes, approximately 60 amino acid longer than
the mature enzyme, which are processed when transported
via the secretory pathway. The mature enzymes contain
sequence motifs typical for all aspartyl proteinases,
including the two conserved aspartate residues other
active site and conserved cysteine residues implicated
in the maintenance of the three-dimensional structure.
Most Sap proteins contain putative N-glycosylation
sites, but it remains to be determined which Sap
proteins are glycosylated. This family of aspartate
proteases is classified by MEROPS as the peptidase
family A1 (pepsin A, clan AA). The overall structure
of Sap protein conforms to the classical aspartic
proteinase fold typified by pepsin. SAP is a bilobal
enzyme, each lobe contributing a catalytic Asp residue,
with an extended active site cleft localized between
the two lobes of the molecule. One lobe may be evolved
from the other through ancient gene-duplication event.
More recently evolved enzymes have similar
three-dimensional structures, however their amino acid
sequences are more divergent except for the conserved
catalytic site motif. This family of aspartate
proteases is classified by MEROPS as the peptidase
family A1 (pepsin A, clan AA).
Length = 295
Score = 53.0 bits (128), Expect = 9e-09
Identities = 16/28 (57%), Positives = 22/28 (78%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIP 36
Y +S+GTPPQ+ V+ DTGSS+LW+P
Sbjct: 3 YSAELSVGTPPQKVTVLLDTGSSDLWVP 30
Score = 51.0 bits (123), Expect = 4e-08
Identities = 30/116 (25%), Positives = 47/116 (40%), Gaps = 13/116 (11%)
Query: 69 GEIMFGGVDKDKFVGDITYSPVSRKGYW-----------QFGVESIKIEKNVYCSNCQAI 117
G I+FGGVD K+ GD+ P+ V + N A+
Sbjct: 123 GSILFGGVDTAKYSGDLVTLPIVNDNGGSEPSELSVTLSSISVNGSSGNTTLLSKNLPAL 182
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK-VDCDNLDKMPNVDIILGGKNFT 172
D+GT+L PS ++ + K +GA ++ VDCD D ++ GG +
Sbjct: 183 LDSGTTLTYLPSDIVDAIAKQLGATYDSDEGLYVVDCDAKDDG-SLTFNFGGATIS 237
>gnl|CDD|140051 PTZ00013, PTZ00013, plasmepsin 4 (PM4); Provisional.
Length = 450
Score = 51.1 bits (122), Expect = 5e-08
Identities = 30/88 (34%), Positives = 45/88 (51%), Gaps = 17/88 (19%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENG 68
+YG +G Q+F +IFDTGS+NLW+PS+ C +I CS+ + ++S
Sbjct: 139 FYGEGEVGDNHQKFMLIFDTGSANLWVPSKKCD--SIGCSI--------KNLYDSSKSKS 188
Query: 69 GEIMFGGVDKDKFVGDITYSPVSRKGYW 96
E KD DITY + KG++
Sbjct: 189 YE-------KDGTKVDITYGSGTVKGFF 209
Score = 44.6 bits (105), Expect = 9e-06
Identities = 28/110 (25%), Positives = 52/110 (47%), Gaps = 4/110 (3%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADTGTSL 124
D + G + GG+++ + G+ITY ++ YWQ ++ + K I D+GT+
Sbjct: 285 DVHAGYLTIGGIEEKFYEGNITYEKLNHDLYWQIDLD-VHFGKQTM-QKANVIVDSGTTT 342
Query: 125 IIGPSKVIAELNKLIGAVPLANGPAKV-DCDNLDKMPNVDIILGGKNFTL 173
I PS+ + + + + + P V CDN +MP ++ +TL
Sbjct: 343 ITAPSEFLNKFFANLNVIKVPFLPFYVTTCDN-KEMPTLEFKSANNTYTL 391
>gnl|CDD|133140 cd05473, beta_secretase_like, Beta-secretase, aspartic-acid
protease important in the pathogenesis of Alzheimer's
disease. Beta-secretase also called BACE (beta-site of
APP cleaving enzyme) or memapsin-2. Beta-secretase is an
aspartic-acid protease important in the pathogenesis of
Alzheimer's disease, and in the formation of myelin
sheaths in peripheral nerve cells. It cleaves amyloid
precursor protein (APP) to reveal the N-terminus of the
beta-amyloid peptides. The beta-amyloid peptides are the
major components of the amyloid plaques formed in the
brain of patients with Alzheimer's disease (AD). Since
BACE mediates one of the cleavages responsible for
generation of AD, it is regarded as a potential target
for pharmacological intervention in AD. Beta-secretase
is a member of pepsin family of aspartic proteases. Same
as other aspartic proteases, beta-secretase is a bilobal
enzyme, each lobe contributing a catalytic Asp residue,
with an extended active site cleft localized between the
two lobes of the molecule. The N- and C-terminal
domains, although structurally related by a 2-fold axis,
have only limited sequence homology except the vicinity
of the active site. This suggests that the enzymes
evolved by an ancient duplication event. The enzymes
specifically cleave bonds in peptides which have at
least six residues in length with hydrophobic residues
in both the P1 and P1' positions. The active site is
located at the groove formed by the two lobes, with an
extended loop projecting over the cleft to form an
11-residue flap, which encloses substrates and
inhibitors in the active site. Specificity is determined
by nearest-neighbor hydrophobic residues surrounding the
catalytic aspartates, and by three residues in the flap.
The enzymes are mostly secreted from cells as inactive
proenzymes that activate autocatalytically at acidic pH.
This family of aspartate proteases is classified by
MEROPS as the peptidase family A1 (pepsin A, clan AA).
Length = 364
Score = 47.4 bits (113), Expect = 8e-07
Identities = 26/85 (30%), Positives = 43/85 (50%), Gaps = 12/85 (14%)
Query: 56 GHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN------- 108
G +AS GG ++ GG+D + GDI Y+P+ + Y++ V +K+E
Sbjct: 148 GLPVNGSASGTVGGSMVIGGIDPSLYKGDIWYTPIREEWYYE--VIILKLEVGGQSLNLD 205
Query: 109 --VYCSNCQAIADTGTSLIIGPSKV 131
Y + +AI D+GT+ + P KV
Sbjct: 206 CKEYNYD-KAIVDSGTTNLRLPVKV 229
Score = 37.4 bits (87), Expect = 0.002
Identities = 13/24 (54%), Positives = 18/24 (75%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSN 32
YY + +GTPPQ+ ++ DTGSSN
Sbjct: 4 YYIEMLIGTPPQKLNILVDTGSSN 27
>gnl|CDD|133143 cd05476, pepsin_A_like_plant, Chroloplast Nucleoids DNA-binding
Protease and Nucellin, pepsin-like aspartic proteases
from plants. This family contains pepsin like aspartic
proteases from plants including Chloroplast Nucleoids
DNA-binding Protease and Nucellin. Chloroplast
Nucleoids DNA-binding Protease catalyzes the
degradation of ribulose-1,5-bisphosphate
carboxylase/oxygenase (Rubisco) in senescent leaves of
tobacco and Nucellins are important regulators of
nucellar cell's progressive degradation after ovule
fertilization. Structurally, aspartic proteases are
bilobal enzymes, each lobe contributing a catalytic Asp
residue, with an extended active site cleft localized
between the two lobes of the molecule. The N- and
C-terminal domains, although structurally related by a
2-fold axis, have only limited sequence homology except
the vicinity of the active site. This suggests that the
enzymes evolved by an ancient duplication event. The
enzymes specifically cleave bonds in peptides which
have at least six residues in length with hydrophobic
residues in both the P1 and P1' positions. The active
site is located at the groove formed by the two lobes,
with an extended loop projecting over the cleft to form
an 11-residue flap, which encloses substrates and
inhibitors in the active site. Specificity is
determined by nearest-neighbor hydrophobic residues
surrounding the catalytic aspartates, and by three
residues in the flap. The enzymes are mostly secreted
from cells as inactive proenzymes that activate
autocatalytically at acidic pH.
Length = 265
Score = 45.7 bits (109), Expect = 3e-06
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIP 36
+Y T+S+GTPPQ F +I DTGS W
Sbjct: 1 EYLVTLSIGTPPQPFSLIVDTGSDLTWTQ 29
>gnl|CDD|140176 PTZ00147, PTZ00147, plasmepsin-1; Provisional.
Length = 453
Score = 45.2 bits (107), Expect = 5e-06
Identities = 18/33 (54%), Positives = 22/33 (66%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YG LG Q+F IFDTGS+NLW+PS C+
Sbjct: 140 SYGEAKLGDNGQKFNFIFDTGSANLWVPSIKCT 172
Score = 35.6 bits (82), Expect = 0.010
Identities = 29/115 (25%), Positives = 52/115 (45%), Gaps = 14/115 (12%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQ------FGVESIKIEKNVYCSNCQAIA 118
D++ G + GG+++ + G +TY ++ YWQ FG NV I
Sbjct: 286 DKHKGYLTIGGIEERFYEGPLTYEKLNHDLYWQVDLDVHFG--------NVSSEKANVIV 337
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
D+GTS+I P++ + + + + + P V N K+P ++ K +TL
Sbjct: 338 DSGTSVITVPTEFLNKFVESLDVFKVPFLPLYVTTCNNTKLPTLEFRSPNKVYTL 392
>gnl|CDD|133139 cd05472, cnd41_like, Chloroplast Nucleoids DNA-binding Protease,
catalyzes the degradation of ribulose-1,5-bisphosphate
carboxylase/oxygenase. Chloroplast Nucleoids
DNA-binding Protease catalyzes the degradation of
ribulose-1,5-bisphosphate carboxylase/oxygenase
(Rubisco) in senescent leaves of tobacco. Antisense
tobacco with reduced amount of CND41 maintained green
leaves and constant protein levels, especially Rubisco.
CND41 has DNA-binding as well as aspartic protease
activities. The pepsin-like aspartic protease domain is
located at the C-terminus of the protein. The enzyme is
characterized by having two aspartic protease catalytic
site motifs, the Asp-Thr-Gly-Ser in the N-terminal and
Asp-Ser-Gly-Ser in the C-terminal region. Aspartic
proteases are bilobal enzymes, each lobe contributing a
catalytic Asp residue, with an extended active site
cleft localized between the two lobes of the molecule.
One lobe may be evolved from the other through ancient
gene-duplication event. This family of aspartate
proteases is classified by MEROPS as the peptidase
family A1 (pepsin A, clan AA).
Length = 299
Score = 39.9 bits (94), Expect = 3e-04
Identities = 15/33 (45%), Positives = 20/33 (60%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+Y T+ LGTP ++ VI DTGS W+ Q C
Sbjct: 1 EYVVTVGLGTPARDQTVIVDTGSDLTWVQCQPC 33
>gnl|CDD|178691 PLN03146, PLN03146, aspartyl protease family protein; Provisional.
Length = 431
Score = 37.3 bits (87), Expect = 0.002
Identities = 16/41 (39%), Positives = 19/41 (46%), Gaps = 3/41 (7%)
Query: 3 SSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWI---PSQHC 40
S +Y IS+GTPP I DTGS +W P C
Sbjct: 79 ISNGGEYLMNISIGTPPVPILAIADTGSDLIWTQCKPCDDC 119
>gnl|CDD|133160 cd06096, Plasmepsin_5, Plasmepsins are a class of aspartic
proteinases produced by the plasmodium parasite. The
family contains a group of aspartic proteinases
homologous to plasmepsin 5. Plasmepsins are a class of
at least 10 enzymes produced by the plasmodium parasite.
Through their haemoglobin-degrading activity, they are
an important cause of symptoms in malaria sufferers.
This family of enzymes is a potential target for
anti-malarial drugs. Plasmepsins are aspartic acid
proteases, which means their active site contains two
aspartic acid residues. These two aspartic acid residue
act respectively as proton donor and proton acceptor,
catalyzing the hydrolysis of peptide bond in proteins.
Aspartic proteinases are composed of two structurally
similar beta barrel lobes, each lobe contributing an
aspartic acid residue to form a catalytic dyad that acts
to cleave the substrate peptide bond. The catalytic Asp
residues are contained in an Asp-Thr-Gly-Ser/thr motif
in both N- and C-terminal lobes of the enzyme. There
are four types of plasmepsins, closely related but
varying in the specificity of cleavage site. The name
plasmepsin may come from plasmodium (the organism) and
pepsin (a common aspartic acid protease with similar
molecular structure). This family of aspartate proteases
is classified by MEROPS as the peptidase family A1
(pepsin A, clan AA).
Length = 326
Score = 36.2 bits (84), Expect = 0.005
Identities = 21/88 (23%), Positives = 39/88 (44%), Gaps = 14/88 (15%)
Query: 66 ENGGEIMFGGVDKDKFVG----------DITYSPVSRKGYWQFGVESIKI--EKN--VYC 111
E+GGE+ GG DKD V I ++P++RK Y+ +E + + +
Sbjct: 170 EDGGELTIGGYDKDYTVRNSSIGNNKVSKIVWTPITRKYYYYVKLEGLSVYGTTSNSGNT 229
Query: 112 SNCQAIADTGTSLIIGPSKVIAELNKLI 139
+ D+G++L P + ++N
Sbjct: 230 KGLGMLVDSGSTLSHFPEDLYNKINNFF 257
Score = 35.4 bits (82), Expect = 0.009
Identities = 16/34 (47%), Positives = 21/34 (61%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+ I +G PPQ+ +I DTGSS+L P C
Sbjct: 2 AYYFIDIFIGNPPQKQSLILDTGSSSLSFPCSQC 35
>gnl|CDD|133142 cd05475, nucellin_like, Nucellins, plant aspartic proteases
specifically expressed in nucellar cells during
degradation. Nucellins are important regulators of
nucellar cell's progressive degradation after ovule
fertilization. This degradation is a characteristic of
programmed cell death. Nucellins are plant aspartic
proteases specifically expressed in nucellar cells
during degradation. The enzyme is characterized by
having two aspartic protease catalytic site motifs, the
Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser
in the C-terminal region, and two other regions nearly
identical to two regions of plant aspartic proteases.
Aspartic proteases are bilobal enzymes, each lobe
contributing a catalytic Asp residue, with an extended
active site cleft localized between the two lobes of
the molecule. One lobe may be evolved from the other
through ancient gene-duplication event. Although the
three-dimensional structures of the two lobes are very
similar, the amino acid sequences are more divergent,
except for the conserved catalytic site motif.
Length = 273
Score = 29.6 bits (67), Expect = 0.63
Identities = 12/28 (42%), Positives = 18/28 (64%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWI 35
YY TI++G PP+ + + DTGS W+
Sbjct: 2 YYYVTINIGNPPKPYFLDIDTGSDLTWL 29
Score = 26.2 bits (58), Expect = 8.9
Identities = 12/45 (26%), Positives = 20/45 (44%), Gaps = 7/45 (15%)
Query: 1 MRSSYQAQYYGT-------ISLGTPPQEFKVIFDTGSSNLWIPSQ 38
MR Q ++Y T + +V+FD+GSS + +Q
Sbjct: 150 MRRESQKKHYSPGPASLLFNGQPTGGKGLEVVFDSGSSYTYFNAQ 194
>gnl|CDD|212583 cd11688, THUMP, THUMP domain, predicted to bind RNA. The THUMP
domain is named after THioUridine synthases, RNA
Methyltransferases and Pseudo-uridine synthases. It is
predicted to be an RNA-binding domain and probably
functions by delivering a variety of RNA modification
enzymes to their targets.
Length = 148
Score = 28.6 bits (64), Expect = 1.0
Identities = 9/32 (28%), Positives = 16/32 (50%)
Query: 134 ELNKLIGAVPLANGPAKVDCDNLDKMPNVDII 165
E+ +G + +VD DN D + NV++
Sbjct: 110 EIAMKVGDAIVDAFNPEVDLDNPDIVVNVEVH 141
>gnl|CDD|235966 PRK07207, PRK07207, ribonucleotide-diphosphate reductase subunit
alpha; Validated.
Length = 965
Score = 28.4 bits (64), Expect = 2.3
Identities = 14/22 (63%), Positives = 15/22 (68%), Gaps = 1/22 (4%)
Query: 112 SNCQAIADTGT-SLIIGPSKVI 132
SNC AIA T T S IIG S+ I
Sbjct: 766 SNCMAIAPTATISNIIGVSQSI 787
>gnl|CDD|182788 PRK10862, PRK10862, SoxR reducing system protein RseC; Provisional.
Length = 154
Score = 27.6 bits (62), Expect = 2.5
Identities = 12/36 (33%), Positives = 20/36 (55%), Gaps = 2/36 (5%)
Query: 96 WQFGVESIKIEKNVYCSNCQAIADTGTSLI--IGPS 129
WQ G+ ++ E CS+C + A G+ L+ +GP
Sbjct: 11 WQNGIALLRCEVKAGCSSCASRAGCGSRLLNKLGPQ 46
>gnl|CDD|235624 PRK05850, PRK05850, acyl-CoA synthetase; Validated.
Length = 578
Score = 27.6 bits (62), Expect = 3.7
Identities = 10/42 (23%), Positives = 22/42 (52%)
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLD 157
+ DT S+++ S V+ ++ + + P + P ++ D LD
Sbjct: 104 VLRDTSPSVVLTTSAVVDDVTEYVAPQPGQSAPPVIEVDLLD 145
>gnl|CDD|188492 TIGR03977, rSAM_pair_HxsC, His-Xaa-Ser system radical SAM maturase
HxsC. This model describes the downstream member, HxsC,
of a pair of uncharacterized radical SAM proteins,
regularly found in the context of a small protein with
four or more repeats of the tripeptide His-Xaa-Ser
(HXS). This enzyme appears to be part of a peptide
modification system.
Length = 292
Score = 27.2 bits (61), Expect = 4.1
Identities = 8/26 (30%), Positives = 11/26 (42%), Gaps = 3/26 (11%)
Query: 93 KGYWQFGVESIKIEKNVY---CSNCQ 115
W + +SI KN Y C +C
Sbjct: 255 PSLWPYARQSISDWKNYYPEECEDCA 280
>gnl|CDD|182461 PRK10438, PRK10438, C-N hydrolase family amidase; Provisional.
Length = 256
Score = 27.4 bits (61), Expect = 4.3
Identities = 14/67 (20%), Positives = 27/67 (40%), Gaps = 2/67 (2%)
Query: 89 PVSRKGYWQFGVESIKIEKNVYCSNCQAIADTGTSLII-GPSKVIAELNKLIGAVPLANG 147
P R +WQ + + IE Y + C + G G S++I ++I +
Sbjct: 166 PAPRSLHWQTLLTARAIENQAYVAGCNRVGSDGNGHHYRGDSRIINPQGEIIATAE-PHQ 224
Query: 148 PAKVDCD 154
++D +
Sbjct: 225 ATRIDAE 231
>gnl|CDD|162204 TIGR01107, Na_K_ATPase_bet, Sodium Potassium ATPase beta subunit.
This model describes the Na+/K+ ATPase beta subunit in
eukaryotes. Na+/K+ ATPase(also called Sodium-Potassium
pump) is intimately associated with the plasma membrane.
It couples the energy released by the hydrolysis of ATP
to extrude 3 Na+ ions, with the concomitant uptake of
2K+ ions, against their ionic gradients [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 289
Score = 27.3 bits (61), Expect = 4.8
Identities = 14/52 (26%), Positives = 22/52 (42%)
Query: 113 NCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDI 164
NC + D G +I ++N++IG P G VDC + +I
Sbjct: 160 NCSGLNDPTFGYSEGKPCIIIKMNRIIGFKPKNPGVPPVDCTGKRDEDDENI 211
>gnl|CDD|227658 COG5360, COG5360, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 566
Score = 27.5 bits (61), Expect = 4.9
Identities = 9/27 (33%), Positives = 16/27 (59%), Gaps = 1/27 (3%)
Query: 136 NKLIGAVPLANGPAKVDCDNLDKMPNV 162
++L+G PL +GP+KV +D +
Sbjct: 409 SRLLGP-PLISGPSKVQAKRIDDVGGQ 434
>gnl|CDD|222288 pfam13650, Asp_protease_2, Aspartyl protease. This family
consists of predicted aspartic proteases, typically
from 180 to 230 amino acids in length, in MEROPS clan
AA. This model describes the well-conserved 121-residue
C-terminal region. The poorly conserved, variable
length N-terminal region usually contains a predicted
transmembrane helix.
Length = 89
Score = 25.7 bits (57), Expect = 6.8
Identities = 12/71 (16%), Positives = 19/71 (26%), Gaps = 8/71 (11%)
Query: 20 QEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKD 79
+ + + DTG+S + L + S NG
Sbjct: 8 KPVRFLLDTGASGTLLSRSLAERLGLK--------PTKLTIGRVSTANGTVRAALVRLDS 59
Query: 80 KFVGDITYSPV 90
+G IT V
Sbjct: 60 LKIGGITLRNV 70
>gnl|CDD|235227 PRK04149, sat, sulfate adenylyltransferase; Reviewed.
Length = 391
Score = 26.7 bits (60), Expect = 8.2
Identities = 8/26 (30%), Positives = 12/26 (46%)
Query: 98 FGVESIKIEKNVYCSNCQAIADTGTS 123
G+ +K E+ YC C +A T
Sbjct: 314 LGITPLKFEEAFYCPKCGGMASEKTC 339
>gnl|CDD|115814 pfam07184, CTV_P33, Citrus tristeza virus P33 protein. This family
consists of several Citrus tristeza virus (CTV) P33
proteins. The function of P33 is unclear although it is
known that the protein is not needed for virion
formation.
Length = 303
Score = 26.1 bits (57), Expect = 9.7
Identities = 11/41 (26%), Positives = 17/41 (41%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQF 98
+H N + G VDK +F I P +G++ F
Sbjct: 87 AHIDNQPNCEGNVTYIKTVDKSRFAILIKAMPAPMRGHYSF 127
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.136 0.414
Gapped
Lambda K H
0.267 0.0740 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,909,910
Number of extensions: 799677
Number of successful extensions: 647
Number of sequences better than 10.0: 1
Number of HSP's gapped: 624
Number of HSP's successfully gapped: 53
Length of query: 177
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 86
Effective length of database: 6,901,388
Effective search space: 593519368
Effective search space used: 593519368
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 56 (25.3 bits)