RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8704
(74 letters)
>gnl|CDD|239111 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B group; composed of
cathepsin B and similar proteins, including
tubulointerstitial nephritis antigen (TIN-Ag).
Cathepsin B is a lysosomal papain-like cysteine
peptidase which is expressed in all tissues and
functions primarily as an exopeptidase through its
carboxydipeptidyl activity. Together with other
cathepsins, it is involved in the degradation of
proteins, proenzyme activation, Ag processing,
metabolism and apoptosis. Cathepsin B has been
implicated in a number of human diseases such as
cancer, rheumatoid arthritis, osteoporosis and
Alzheimer's disease. The unique carboxydipeptidyl
activity of cathepsin B is attributed to the presence
of an occluding loop in its active site which favors
the binding of the C-termini of substrate proteins.
Some members of this group do not possess the occluding
loop. TIN-Ag is an extracellular matrix basement
protein which was originally identified as a target Ag
involved in anti-tubular basement membrane
antibody-mediated interstitial nephritis. It plays a
role in renal tubulogenesis and is defective in
hereditary tubulointerstitial disorders. TIN-Ag is
exclusively expressed in kidney tissues. .
Length = 236
Score = 100 bits (250), Expect = 5e-28
Identities = 35/59 (59%), Positives = 43/59 (72%)
Query: 11 AFGAVEAMSDRVCIASGGKKHVRLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNGINT 69
AF AVEA SDR+CI S GK++V LS+ DL+SCC CG GCNGG+P AW+Y G+ T
Sbjct: 30 AFSAVEAFSDRLCIQSNGKENVLLSAQDLLSCCSGCGDGCNGGYPDAAWKYLTTTGVVT 88
>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease.
Length = 213
Score = 73.3 bits (181), Expect = 6e-18
Identities = 27/56 (48%), Positives = 30/56 (53%), Gaps = 3/56 (5%)
Query: 11 AFGAVEAMSDRVCIASGGKKHVRLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNG 66
AF AV A+ R CI +G K V LS LV C GCNGG P A+ Y KNG
Sbjct: 27 AFSAVGALEGRYCIKTG--KLVSLSEQQLVDCDT-GNNGCNGGLPDNAFEYIKKNG 79
>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease.
Length = 175
Score = 71.5 bits (176), Expect = 1e-17
Identities = 25/56 (44%), Positives = 28/56 (50%), Gaps = 2/56 (3%)
Query: 11 AFGAVEAMSDRVCIASGGKKHVRLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNG 66
AF A A+ R CI +G V LS LV C GCNGG P A+ Y KNG
Sbjct: 27 AFSATGALEGRYCIKTGK--LVSLSEQQLVDCSGGGNCGCNGGLPDNAFEYIKKNG 80
>gnl|CDD|239068 cd02248, Peptidase_C1A, Peptidase C1A subfamily (MEROPS database
nomenclature); composed of cysteine peptidases (CPs)
similar to papain, including the mammalian CPs
(cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain
is an endopeptidase with specific substrate
preferences, primarily for bulky hydrophobic or
aromatic residues at the S2 subsite, a hydrophobic
pocket in papain that accommodates the P2 sidechain of
the substrate (the second residue away from the
scissile bond). Most members of the papain subfamily
are endopeptidases. Some exceptions to this rule can be
explained by specific details of the catalytic domains
like the occluding loop in cathepsin B which confers an
additional carboxydipeptidyl activity and the
mini-chain of cathepsin H resulting in an N-terminal
exopeptidase activity. Papain-like CPs have different
functions in various organisms. Plant CPs are used to
mobilize storage proteins in seeds. Parasitic CPs act
extracellularly to help invade tissues and cells, to
hatch or to evade the host immune system. Mammalian CPs
are primarily lysosomal enzymes with the exception of
cathepsin W, which is retained in the endoplasmic
reticulum. They are responsible for protein degradation
in the lysosome. Papain-like CPs are synthesized as
inactive proenzymes with N-terminal propeptide regions,
which are removed upon activation. In addition to its
inhibitory role, the propeptide is required for proper
folding of the newly synthesized enzyme and its
stabilization in denaturing pH conditions. Residues
within the propeptide region also play a role in the
transport of the proenzyme to lysosomes or acidified
vesicles. Also included in this subfamily are proteins
classified as non-peptidase homologs, which lack
peptidase activity or have missing active site
residues.
Length = 210
Score = 54.6 bits (132), Expect = 7e-11
Identities = 22/59 (37%), Positives = 27/59 (45%), Gaps = 2/59 (3%)
Query: 11 AFGAVEAMSDRVCIASGGKKHVRLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNGINT 69
AF V A+ I +G K V LS LV C GCNGG P A+ Y G+ +
Sbjct: 26 AFSTVGALEGAYAIKTG--KLVSLSEQQLVDCSTSGNNGCNGGNPDNAFEYVKNGGLAS 82
>gnl|CDD|239149 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; the only
papain-like lysosomal cysteine peptidase exhibiting
carboxymonopeptidase activity. It can also act as a
carboxydipeptidase, like cathepsin B, but has been
shown to preferentially cleave substrates through a
monopeptidyl carboxypeptidase pathway. The propeptide
region of cathepsin X, the shortest among papain-like
peptidases, is covalently attached to the active site
cysteine in the inactive form of the enzyme. Little is
known about the biological function of cathepsin X.
Some studies point to a role in early tumorigenesis. A
more recent study indicates that cathepsin X expression
is restricted to immune cells suggesting a role in
phagocytosis and the regulation of the immune response.
Length = 239
Score = 39.3 bits (92), Expect = 3e-05
Identities = 22/58 (37%), Positives = 31/58 (53%), Gaps = 3/58 (5%)
Query: 11 AFGAVEAMSDRVCIASGGK-KHVRLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNGI 67
A G+ A++DR+ IA G V LS ++ C G C+GG PG + Y K+GI
Sbjct: 33 AHGSTSALADRINIARKGAWPSVYLSVQVVIDCAG--GGSCHGGDPGGVYEYAHKHGI 88
>gnl|CDD|240381 PTZ00364, PTZ00364, dipeptidyl-peptidase I precursor; Provisional.
Length = 548
Score = 35.3 bits (81), Expect = 9e-04
Identities = 21/63 (33%), Positives = 31/63 (49%), Gaps = 7/63 (11%)
Query: 12 FGAVEAMSDRVCIASG-----GKKHVRLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNG 66
A+ AM RV +AS G++ LS+ ++ C + G GC GGFP ++ G
Sbjct: 238 EAALAAMMARVMVASNRTDPLGQQ-TFLSARHVLDCSQY-GQGCAGGFPEEVGKFAETFG 295
Query: 67 INT 69
I T
Sbjct: 296 ILT 298
>gnl|CDD|240310 PTZ00200, PTZ00200, cysteine proteinase; Provisional.
Length = 448
Score = 34.7 bits (80), Expect = 0.002
Identities = 15/39 (38%), Positives = 20/39 (51%), Gaps = 1/39 (2%)
Query: 29 KKHVRLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNGI 67
K V LS +LV+C GC+GG+P A Y G+
Sbjct: 277 DKSVDLSEQELVNCDTKSQ-GCSGGYPDTALEYVKNKGL 314
>gnl|CDD|239112 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; also known as
Dipeptidyl Peptidase I (DPPI), an atypical papain-like
cysteine peptidase with chloride dependency and
dipeptidyl aminopeptidase activity, resulting from its
tetrameric structure which limits substrate access.
Each subunit of the tetramer is composed of three
peptides: the heavy and light chains, which together
adopts the papain fold and forms the catalytic domain;
and the residual propeptide region, which forms a beta
barrel and points towards the substrate's N-terminus.
The subunit composition is the result of the unique
characteristic of procathepsin C maturation involving
the cleavage of the catalytic domain and the
non-autocatalytic excision of an activation peptide
within its propeptide region. By removing N-terminal
dipeptide extensions, cathepsin C activates granule
serine peptidases (granzymes) involved in cell-mediated
apoptosis, inflammation and tissue remodelling.
Loss-of-function mutations in cathepsin C are
associated with Papillon-Lefevre and Haim-Munk
syndromes, rare diseases characterized by
hyperkeratosis and early-onset periodontitis. Cathepsin
C is widely expressed in many tissues with high levels
in lung, kidney and placenta. It is also highly
expressed in cytotoxic lymphocytes and mature myeloid
cells.
Length = 243
Score = 33.9 bits (78), Expect = 0.003
Identities = 20/61 (32%), Positives = 31/61 (50%), Gaps = 5/61 (8%)
Query: 11 AFGAVEAMSDRVCIASGGKKHV----RLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNG 66
AF +V A+ R+ IAS + LS ++SC + GC+GGFP + ++ G
Sbjct: 31 AFASVYALEARIMIASNKTDPLGQQPILSPQHVLSCSQY-SQGCDGGFPFLVGKFAEDFG 89
Query: 67 I 67
I
Sbjct: 90 I 90
>gnl|CDD|239110 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database
nomenclature), also referred to as the papain family;
composed of two subfamilies of cysteine peptidases
(CPs), C1A (papain) and C1B (bleomycin hydrolase).
Papain-like enzymes are mostly endopeptidases with some
exceptions like cathepsins B, C, H and X, which are
exopeptidases. Papain-like CPs have different functions
in various organisms. Plant CPs are used to mobilize
storage proteins in seeds while mammalian CPs are
primarily lysosomal enzymes responsible for protein
degradation in the lysosome. Papain-like CPs are
synthesized as inactive proenzymes with N-terminal
propeptide regions, which are removed upon activation.
Bleomycin hydrolase (BH) is a CP that detoxifies
bleomycin by hydrolysis of an amide group. It acts as a
carboxypeptidase on its C-terminus to convert itself
into an aminopeptidase and peptide ligase. BH is found
in all tissues in mammals as well as in many other
eukaryotes. It forms a hexameric ring barrel structure
with the active sites imbedded in the central channel.
Some members of the C1 family are proteins classified
as non-peptidase homologs which lack peptidase activity
or have missing active site residues.
Length = 223
Score = 32.1 bits (73), Expect = 0.011
Identities = 18/62 (29%), Positives = 23/62 (37%), Gaps = 5/62 (8%)
Query: 11 AFGAVEAMSDRVCIASGGKKHVRLSSDDLVSC----CRLCGFGCNGGFPGMAWRY-WVKN 65
AF + A+ I G ++V LS L C C C+GG P A
Sbjct: 23 AFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGGPLSALLKLVALK 82
Query: 66 GI 67
GI
Sbjct: 83 GI 84
>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional.
Length = 489
Score = 30.1 bits (68), Expect = 0.055
Identities = 14/44 (31%), Positives = 18/44 (40%), Gaps = 3/44 (6%)
Query: 11 AFGAVEAMSDRVCIASGGKKHVRLSSDDLVSCCRLCGFGCNGGF 54
AF V + + I + V LS +LV C GC GG
Sbjct: 292 AFSTVGVVESQYAIRK--NELVSLSEQELVDC-SFKNNGCYGGL 332
>gnl|CDD|147845 pfam05914, RIB43A, RIB43A. This family consists of several
RIB43A-like eukaryotic proteins. Ciliary and flagellar
microtubules contain a specialised set of
protofilaments, termed ribbons, that are composed of
tubulin and several associated proteins. RIB43A was
first characterized in the unicellular biflagellate,
Chlamydomonas reinhardtii although highly related
sequences are present in several higher eukaryotes
including humans. The function of this protein is
unknown although the structure of RIB43A and its
association with the specialised protofilament ribbons
and with basal bodies is relevant to the proposed role
of ribbons in forming and stabilising doublet and
triplet microtubules and in organising their
three-dimensional structure. Human RIB43A homologues
could represent a structural requirement in centriole
replication in dividing cells.
Length = 379
Score = 27.7 bits (62), Expect = 0.46
Identities = 9/21 (42%), Positives = 12/21 (57%)
Query: 3 RAEMSSDEAFGAVEAMSDRVC 23
AE DEA+ A ++DRV
Sbjct: 57 AAEKERDEAYAAQMLLNDRVA 77
>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional.
Length = 348
Score = 27.4 bits (60), Expect = 0.64
Identities = 21/55 (38%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
Query: 11 AFGAVEAMSDRVCIASGGKKHVRLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKN 65
AF AV + + +A G K VRLS LVSC + GC GG A+ + ++N
Sbjct: 152 AFSAVGNIESQWAVA--GHKLVRLSEQQLVSCDHV-DNGCGGGLMLQAFEWVLRN 203
>gnl|CDD|218379 pfam05009, EBV-NA3, Epstein-Barr virus nuclear antigen 3 (EBNA-3).
This family contains EBNA-3A, -3B, and -3C which are
latent infection nuclear proteins important for
Epstein-Barr virus (EBV)-induced B-cell immortalisation
and the immune response to EBV infection.
Length = 254
Score = 26.6 bits (59), Expect = 1.1
Identities = 9/40 (22%), Positives = 15/40 (37%), Gaps = 2/40 (5%)
Query: 33 RLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNGINTYKP 72
L+ + V G+ P + +YW + T KP
Sbjct: 129 VLTPEQRVDFVEFLGWLQKTDNPYI--KYWFHKCLGTTKP 166
>gnl|CDD|221325 pfam11934, DUF3452, Domain of unknown function (DUF3452). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria and eukaryotes. This domain
is typically between 124 to 150 amino acids in length.
This domain is found associated with pfam01858,
pfam01857. This domain has a single completely conserved
residue W that may be functionally important.
Length = 138
Score = 26.1 bits (58), Expect = 1.5
Identities = 7/18 (38%), Positives = 7/18 (38%)
Query: 29 KKHVRLSSDDLVSCCRLC 46
K DDLVS L
Sbjct: 111 KGEALSMFDDLVSSYHLL 128
>gnl|CDD|237903 PRK15078, PRK15078, polysaccharide export protein Wza; Provisional.
Length = 379
Score = 25.8 bits (57), Expect = 2.2
Identities = 9/27 (33%), Positives = 14/27 (51%)
Query: 1 MNRAEMSSDEAFGAVEAMSDRVCIASG 27
M+R+ M+ EA G E + A+G
Sbjct: 271 MDRSGMTLTEALGNAEGIDQTTADATG 297
>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
Length = 568
Score = 25.7 bits (56), Expect = 2.6
Identities = 11/27 (40%), Positives = 14/27 (51%), Gaps = 3/27 (11%)
Query: 37 DDLVSCCRL---CGFGCNGGFPGMAWR 60
DDL C R+ CG C+ GF + W
Sbjct: 20 DDLRICSRVIDSCGIACSSGFVAVPWE 46
>gnl|CDD|237575 PRK13977, PRK13977, myosin-cross-reactive antigen; Provisional.
Length = 576
Score = 25.2 bits (56), Expect = 3.3
Identities = 6/19 (31%), Positives = 11/19 (57%)
Query: 22 VCIASGGKKHVRLSSDDLV 40
+G ++ + L+ DDLV
Sbjct: 267 HLTRNGKEETIDLTEDDLV 285
>gnl|CDD|240058 cd04703, Asparaginase_2_like, A subfamily of the L-Asparaginase
type 2-like enzymes. The wider family, a member of the
Ntn-hydrolase superfamily, includes
Glycosylasparaginase, Taspase 1 and L-Asparaginase type
2 enzymes. The proenzymes undergo autoproteolytic
cleavage before a threonine to generate alpha and beta
subunits. The threonine becomes the N-terminal residue
of the beta subunit and is the catalytic residue.
Length = 246
Score = 25.1 bits (55), Expect = 3.9
Identities = 17/56 (30%), Positives = 21/56 (37%), Gaps = 10/56 (17%)
Query: 1 MNRAEMSSDEAFGAVEAMSD--------RVCIASGGKKHVRLSSDDLVSCCRLCGF 48
+ M+SD FGAV AM R + HV L+ D V L G
Sbjct: 68 TDAGVMTSDGDFGAVAAMQGVEHPVLVARAVMEET--PHVLLAGDGAVKFAALTGV 121
>gnl|CDD|240244 PTZ00049, PTZ00049, cathepsin C-like protein; Provisional.
Length = 693
Score = 24.1 bits (52), Expect = 7.8
Identities = 13/24 (54%), Positives = 15/24 (62%), Gaps = 5/24 (20%)
Query: 37 DDLVSCCRL--CGF---GCNGGFP 55
DDL+S + C F GCNGGFP
Sbjct: 439 DDLLSIQTVLSCSFYDQGCNGGFP 462
>gnl|CDD|238218 cd00368, Molybdopterin-Binding, Molybdopterin-Binding (MopB)
domain of the MopB superfamily of proteins, a large,
diverse, heterogeneous superfamily of enzymes that, in
general, bind molybdopterin as a cofactor. The MopB
domain is found in a wide variety of molybdenum- and
tungsten-containing enzymes, including formate
dehydrogenase-H (Fdh-H) and -N (Fdh-N), several forms
of nitrate reductase (Nap, Nas, NarG),
dimethylsulfoxide reductase (DMSOR), thiosulfate
reductase, formylmethanofuran dehydrogenase, and
arsenite oxidase. Molybdenum is present in most of
these enzymes in the form of molybdopterin, a modified
pterin ring with a dithiolene side chain, which is
responsible for ligating the Mo. In many bacterial and
archaeal species, molybdopterin is in the form of a
dinucleotide, with two molybdopterin dinucleotide units
per molybdenum. These proteins can function as
monomers, heterodimers, or heterotrimers, depending on
the protein and organism. Also included in the MopB
superfamily is the eukaryotic/eubacterial protein
domain family of the 75-kDa subunit/Nad11/NuoG (second
domain) of respiratory complex 1/NADH-quinone
oxidoreductase which is postulated to have lost an
ancestral formate dehydrogenase activity and only
vestigial sequence evidence remains of a molybdopterin
binding site.
Length = 374
Score = 24.2 bits (53), Expect = 7.8
Identities = 9/27 (33%), Positives = 11/27 (40%), Gaps = 7/27 (25%)
Query: 41 SCCRLCGFGCNGGFPGMAWRYWVKNGI 67
S C CG GC +VK+G
Sbjct: 2 SVCPFCGVGCG-------ILVYVKDGK 21
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.136 0.456
Gapped
Lambda K H
0.267 0.0761 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 3,676,437
Number of extensions: 257912
Number of successful extensions: 254
Number of sequences better than 10.0: 1
Number of HSP's gapped: 246
Number of HSP's successfully gapped: 21
Length of query: 74
Length of database: 10,937,602
Length adjustment: 44
Effective length of query: 30
Effective length of database: 8,986,026
Effective search space: 269580780
Effective search space used: 269580780
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 53 (24.1 bits)