RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8704
         (74 letters)



>gnl|CDD|239111 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B group; composed of
          cathepsin B and similar proteins, including
          tubulointerstitial nephritis antigen (TIN-Ag).
          Cathepsin B is a lysosomal papain-like cysteine
          peptidase which is expressed in all tissues and
          functions primarily as an exopeptidase through its
          carboxydipeptidyl activity. Together with other
          cathepsins, it is involved in the degradation of
          proteins, proenzyme activation, Ag processing,
          metabolism and apoptosis. Cathepsin B has been
          implicated in a number of human diseases such as
          cancer, rheumatoid arthritis, osteoporosis and
          Alzheimer's disease. The unique carboxydipeptidyl
          activity of cathepsin B is attributed to the presence
          of an occluding loop in its active site which favors
          the binding of the C-termini of substrate proteins.
          Some members of this group do not possess the occluding
          loop. TIN-Ag is an extracellular matrix basement
          protein which was originally identified as a target Ag
          involved in anti-tubular basement membrane
          antibody-mediated interstitial nephritis. It plays a
          role in renal tubulogenesis and is defective in
          hereditary tubulointerstitial disorders. TIN-Ag is
          exclusively expressed in kidney tissues. .
          Length = 236

 Score =  100 bits (250), Expect = 5e-28
 Identities = 35/59 (59%), Positives = 43/59 (72%)

Query: 11 AFGAVEAMSDRVCIASGGKKHVRLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNGINT 69
          AF AVEA SDR+CI S GK++V LS+ DL+SCC  CG GCNGG+P  AW+Y    G+ T
Sbjct: 30 AFSAVEAFSDRLCIQSNGKENVLLSAQDLLSCCSGCGDGCNGGYPDAAWKYLTTTGVVT 88


>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease. 
          Length = 213

 Score = 73.3 bits (181), Expect = 6e-18
 Identities = 27/56 (48%), Positives = 30/56 (53%), Gaps = 3/56 (5%)

Query: 11 AFGAVEAMSDRVCIASGGKKHVRLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNG 66
          AF AV A+  R CI +G  K V LS   LV C      GCNGG P  A+ Y  KNG
Sbjct: 27 AFSAVGALEGRYCIKTG--KLVSLSEQQLVDCDT-GNNGCNGGLPDNAFEYIKKNG 79


>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease. 
          Length = 175

 Score = 71.5 bits (176), Expect = 1e-17
 Identities = 25/56 (44%), Positives = 28/56 (50%), Gaps = 2/56 (3%)

Query: 11 AFGAVEAMSDRVCIASGGKKHVRLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNG 66
          AF A  A+  R CI +G    V LS   LV C      GCNGG P  A+ Y  KNG
Sbjct: 27 AFSATGALEGRYCIKTGK--LVSLSEQQLVDCSGGGNCGCNGGLPDNAFEYIKKNG 80


>gnl|CDD|239068 cd02248, Peptidase_C1A, Peptidase C1A subfamily (MEROPS database
          nomenclature); composed of cysteine peptidases (CPs)
          similar to papain, including the mammalian CPs
          (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain
          is an endopeptidase with specific substrate
          preferences, primarily for bulky hydrophobic or
          aromatic residues at the S2 subsite, a hydrophobic
          pocket in papain that accommodates the P2 sidechain of
          the substrate (the second residue away from the
          scissile bond). Most members of the papain subfamily
          are endopeptidases. Some exceptions to this rule can be
          explained by specific details of the catalytic domains
          like the occluding loop in cathepsin B which confers an
          additional carboxydipeptidyl activity and the
          mini-chain of cathepsin H resulting in an N-terminal
          exopeptidase activity. Papain-like CPs have different
          functions in various organisms. Plant CPs are used to
          mobilize storage proteins in seeds. Parasitic CPs act
          extracellularly to help invade tissues and cells, to
          hatch or to evade the host immune system. Mammalian CPs
          are primarily lysosomal enzymes with the exception of
          cathepsin W, which is retained in the endoplasmic
          reticulum. They are responsible for protein degradation
          in the lysosome. Papain-like CPs are synthesized as
          inactive proenzymes with N-terminal propeptide regions,
          which are removed upon activation. In addition to its
          inhibitory role, the propeptide is required for proper
          folding of the newly synthesized enzyme and its
          stabilization in denaturing pH conditions. Residues
          within the propeptide region also play a role in the
          transport of the proenzyme to lysosomes or acidified
          vesicles. Also included in this subfamily are proteins
          classified as non-peptidase homologs, which lack
          peptidase activity or have missing active site
          residues.
          Length = 210

 Score = 54.6 bits (132), Expect = 7e-11
 Identities = 22/59 (37%), Positives = 27/59 (45%), Gaps = 2/59 (3%)

Query: 11 AFGAVEAMSDRVCIASGGKKHVRLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNGINT 69
          AF  V A+     I +G  K V LS   LV C      GCNGG P  A+ Y    G+ +
Sbjct: 26 AFSTVGALEGAYAIKTG--KLVSLSEQQLVDCSTSGNNGCNGGNPDNAFEYVKNGGLAS 82


>gnl|CDD|239149 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; the only
          papain-like lysosomal cysteine peptidase exhibiting
          carboxymonopeptidase activity. It can also act as a
          carboxydipeptidase, like cathepsin B, but has been
          shown to preferentially cleave substrates through a
          monopeptidyl carboxypeptidase pathway. The propeptide
          region of cathepsin X, the shortest among papain-like
          peptidases, is covalently attached to the active site
          cysteine in the inactive form of the enzyme. Little is
          known about the biological function of cathepsin X.
          Some studies point to a role in early tumorigenesis. A
          more recent study indicates that cathepsin X expression
          is restricted to immune cells suggesting a role in
          phagocytosis and the regulation of the immune response.
          Length = 239

 Score = 39.3 bits (92), Expect = 3e-05
 Identities = 22/58 (37%), Positives = 31/58 (53%), Gaps = 3/58 (5%)

Query: 11 AFGAVEAMSDRVCIASGGK-KHVRLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNGI 67
          A G+  A++DR+ IA  G    V LS   ++ C    G  C+GG PG  + Y  K+GI
Sbjct: 33 AHGSTSALADRINIARKGAWPSVYLSVQVVIDCAG--GGSCHGGDPGGVYEYAHKHGI 88


>gnl|CDD|240381 PTZ00364, PTZ00364, dipeptidyl-peptidase I precursor; Provisional.
          Length = 548

 Score = 35.3 bits (81), Expect = 9e-04
 Identities = 21/63 (33%), Positives = 31/63 (49%), Gaps = 7/63 (11%)

Query: 12  FGAVEAMSDRVCIASG-----GKKHVRLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNG 66
             A+ AM  RV +AS      G++   LS+  ++ C +  G GC GGFP    ++    G
Sbjct: 238 EAALAAMMARVMVASNRTDPLGQQ-TFLSARHVLDCSQY-GQGCAGGFPEEVGKFAETFG 295

Query: 67  INT 69
           I T
Sbjct: 296 ILT 298


>gnl|CDD|240310 PTZ00200, PTZ00200, cysteine proteinase; Provisional.
          Length = 448

 Score = 34.7 bits (80), Expect = 0.002
 Identities = 15/39 (38%), Positives = 20/39 (51%), Gaps = 1/39 (2%)

Query: 29  KKHVRLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNGI 67
            K V LS  +LV+C      GC+GG+P  A  Y    G+
Sbjct: 277 DKSVDLSEQELVNCDTKSQ-GCSGGYPDTALEYVKNKGL 314


>gnl|CDD|239112 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; also known as
          Dipeptidyl Peptidase I (DPPI), an atypical papain-like
          cysteine peptidase with chloride dependency and
          dipeptidyl aminopeptidase activity, resulting from its
          tetrameric structure which limits substrate access.
          Each subunit of the tetramer is composed of three
          peptides: the heavy and light chains, which together
          adopts the papain fold and forms the catalytic domain;
          and the residual propeptide region, which forms a beta
          barrel and points towards the substrate's N-terminus.
          The subunit composition is the result of the unique
          characteristic of procathepsin C maturation involving
          the cleavage of the catalytic domain and the
          non-autocatalytic excision of an activation peptide
          within its propeptide region. By removing N-terminal
          dipeptide extensions, cathepsin C activates granule
          serine peptidases (granzymes) involved in cell-mediated
          apoptosis, inflammation and tissue remodelling.
          Loss-of-function mutations in cathepsin C are
          associated with Papillon-Lefevre and Haim-Munk
          syndromes, rare diseases characterized by
          hyperkeratosis and early-onset periodontitis. Cathepsin
          C is widely expressed in many tissues with high levels
          in lung, kidney and placenta. It is also highly
          expressed in cytotoxic lymphocytes and mature myeloid
          cells.
          Length = 243

 Score = 33.9 bits (78), Expect = 0.003
 Identities = 20/61 (32%), Positives = 31/61 (50%), Gaps = 5/61 (8%)

Query: 11 AFGAVEAMSDRVCIASGGKKHV----RLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNG 66
          AF +V A+  R+ IAS     +     LS   ++SC +    GC+GGFP +  ++    G
Sbjct: 31 AFASVYALEARIMIASNKTDPLGQQPILSPQHVLSCSQY-SQGCDGGFPFLVGKFAEDFG 89

Query: 67 I 67
          I
Sbjct: 90 I 90


>gnl|CDD|239110 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database
          nomenclature), also referred to as the papain family;
          composed of two subfamilies of cysteine peptidases
          (CPs), C1A (papain) and C1B (bleomycin hydrolase).
          Papain-like enzymes are mostly endopeptidases with some
          exceptions like cathepsins B, C, H and X, which are
          exopeptidases. Papain-like CPs have different functions
          in various organisms. Plant CPs are used to mobilize
          storage proteins in seeds while mammalian CPs are
          primarily lysosomal enzymes responsible for protein
          degradation in the lysosome. Papain-like CPs are
          synthesized as inactive proenzymes with N-terminal
          propeptide regions, which are removed upon activation.
          Bleomycin hydrolase (BH) is a CP that detoxifies
          bleomycin by hydrolysis of an amide group. It acts as a
          carboxypeptidase on its C-terminus to convert itself
          into an aminopeptidase and peptide ligase. BH is found
          in all tissues in mammals as well as in many other
          eukaryotes. It forms a hexameric ring barrel structure
          with the active sites imbedded in the central channel.
          Some members of the C1 family are proteins classified
          as non-peptidase homologs which lack peptidase activity
          or have missing active site residues.
          Length = 223

 Score = 32.1 bits (73), Expect = 0.011
 Identities = 18/62 (29%), Positives = 23/62 (37%), Gaps = 5/62 (8%)

Query: 11 AFGAVEAMSDRVCIASGGKKHVRLSSDDLVSC----CRLCGFGCNGGFPGMAWRY-WVKN 65
          AF +  A+     I  G  ++V LS   L  C    C      C+GG P  A        
Sbjct: 23 AFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGGPLSALLKLVALK 82

Query: 66 GI 67
          GI
Sbjct: 83 GI 84


>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional.
          Length = 489

 Score = 30.1 bits (68), Expect = 0.055
 Identities = 14/44 (31%), Positives = 18/44 (40%), Gaps = 3/44 (6%)

Query: 11  AFGAVEAMSDRVCIASGGKKHVRLSSDDLVSCCRLCGFGCNGGF 54
           AF  V  +  +  I     + V LS  +LV C      GC GG 
Sbjct: 292 AFSTVGVVESQYAIRK--NELVSLSEQELVDC-SFKNNGCYGGL 332


>gnl|CDD|147845 pfam05914, RIB43A, RIB43A.  This family consists of several
          RIB43A-like eukaryotic proteins. Ciliary and flagellar
          microtubules contain a specialised set of
          protofilaments, termed ribbons, that are composed of
          tubulin and several associated proteins. RIB43A was
          first characterized in the unicellular biflagellate,
          Chlamydomonas reinhardtii although highly related
          sequences are present in several higher eukaryotes
          including humans. The function of this protein is
          unknown although the structure of RIB43A and its
          association with the specialised protofilament ribbons
          and with basal bodies is relevant to the proposed role
          of ribbons in forming and stabilising doublet and
          triplet microtubules and in organising their
          three-dimensional structure. Human RIB43A homologues
          could represent a structural requirement in centriole
          replication in dividing cells.
          Length = 379

 Score = 27.7 bits (62), Expect = 0.46
 Identities = 9/21 (42%), Positives = 12/21 (57%)

Query: 3  RAEMSSDEAFGAVEAMSDRVC 23
           AE   DEA+ A   ++DRV 
Sbjct: 57 AAEKERDEAYAAQMLLNDRVA 77


>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional.
          Length = 348

 Score = 27.4 bits (60), Expect = 0.64
 Identities = 21/55 (38%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 11  AFGAVEAMSDRVCIASGGKKHVRLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKN 65
           AF AV  +  +  +A  G K VRLS   LVSC  +   GC GG    A+ + ++N
Sbjct: 152 AFSAVGNIESQWAVA--GHKLVRLSEQQLVSCDHV-DNGCGGGLMLQAFEWVLRN 203


>gnl|CDD|218379 pfam05009, EBV-NA3, Epstein-Barr virus nuclear antigen 3 (EBNA-3). 
           This family contains EBNA-3A, -3B, and -3C which are
           latent infection nuclear proteins important for
           Epstein-Barr virus (EBV)-induced B-cell immortalisation
           and the immune response to EBV infection.
          Length = 254

 Score = 26.6 bits (59), Expect = 1.1
 Identities = 9/40 (22%), Positives = 15/40 (37%), Gaps = 2/40 (5%)

Query: 33  RLSSDDLVSCCRLCGFGCNGGFPGMAWRYWVKNGINTYKP 72
            L+ +  V      G+      P +  +YW    + T KP
Sbjct: 129 VLTPEQRVDFVEFLGWLQKTDNPYI--KYWFHKCLGTTKP 166


>gnl|CDD|221325 pfam11934, DUF3452, Domain of unknown function (DUF3452).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria and eukaryotes. This domain
           is typically between 124 to 150 amino acids in length.
           This domain is found associated with pfam01858,
           pfam01857. This domain has a single completely conserved
           residue W that may be functionally important.
          Length = 138

 Score = 26.1 bits (58), Expect = 1.5
 Identities = 7/18 (38%), Positives = 7/18 (38%)

Query: 29  KKHVRLSSDDLVSCCRLC 46
           K       DDLVS   L 
Sbjct: 111 KGEALSMFDDLVSSYHLL 128


>gnl|CDD|237903 PRK15078, PRK15078, polysaccharide export protein Wza; Provisional.
          Length = 379

 Score = 25.8 bits (57), Expect = 2.2
 Identities = 9/27 (33%), Positives = 14/27 (51%)

Query: 1   MNRAEMSSDEAFGAVEAMSDRVCIASG 27
           M+R+ M+  EA G  E +      A+G
Sbjct: 271 MDRSGMTLTEALGNAEGIDQTTADATG 297


>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
          Length = 568

 Score = 25.7 bits (56), Expect = 2.6
 Identities = 11/27 (40%), Positives = 14/27 (51%), Gaps = 3/27 (11%)

Query: 37 DDLVSCCRL---CGFGCNGGFPGMAWR 60
          DDL  C R+   CG  C+ GF  + W 
Sbjct: 20 DDLRICSRVIDSCGIACSSGFVAVPWE 46


>gnl|CDD|237575 PRK13977, PRK13977, myosin-cross-reactive antigen; Provisional.
          Length = 576

 Score = 25.2 bits (56), Expect = 3.3
 Identities = 6/19 (31%), Positives = 11/19 (57%)

Query: 22  VCIASGGKKHVRLSSDDLV 40
               +G ++ + L+ DDLV
Sbjct: 267 HLTRNGKEETIDLTEDDLV 285


>gnl|CDD|240058 cd04703, Asparaginase_2_like, A subfamily of the L-Asparaginase
           type 2-like enzymes. The wider family, a member of the
           Ntn-hydrolase superfamily, includes
           Glycosylasparaginase, Taspase 1 and  L-Asparaginase type
           2 enzymes. The proenzymes undergo autoproteolytic
           cleavage before a threonine to generate alpha and beta
           subunits. The threonine becomes the N-terminal residue
           of the beta subunit and is the catalytic residue.
          Length = 246

 Score = 25.1 bits (55), Expect = 3.9
 Identities = 17/56 (30%), Positives = 21/56 (37%), Gaps = 10/56 (17%)

Query: 1   MNRAEMSSDEAFGAVEAMSD--------RVCIASGGKKHVRLSSDDLVSCCRLCGF 48
            +   M+SD  FGAV AM          R  +      HV L+ D  V    L G 
Sbjct: 68  TDAGVMTSDGDFGAVAAMQGVEHPVLVARAVMEET--PHVLLAGDGAVKFAALTGV 121


>gnl|CDD|240244 PTZ00049, PTZ00049, cathepsin C-like protein; Provisional.
          Length = 693

 Score = 24.1 bits (52), Expect = 7.8
 Identities = 13/24 (54%), Positives = 15/24 (62%), Gaps = 5/24 (20%)

Query: 37  DDLVSCCRL--CGF---GCNGGFP 55
           DDL+S   +  C F   GCNGGFP
Sbjct: 439 DDLLSIQTVLSCSFYDQGCNGGFP 462


>gnl|CDD|238218 cd00368, Molybdopterin-Binding, Molybdopterin-Binding (MopB)
          domain of the MopB superfamily of proteins, a  large,
          diverse, heterogeneous superfamily of enzymes that, in
          general, bind molybdopterin as a cofactor. The MopB
          domain is found in a wide variety of molybdenum- and
          tungsten-containing enzymes, including formate
          dehydrogenase-H (Fdh-H) and -N (Fdh-N), several forms
          of nitrate reductase (Nap, Nas, NarG),
          dimethylsulfoxide reductase (DMSOR), thiosulfate
          reductase, formylmethanofuran dehydrogenase, and
          arsenite oxidase. Molybdenum is present in most of
          these enzymes in the form of molybdopterin, a modified
          pterin ring with a dithiolene side chain, which is
          responsible for ligating the Mo. In many bacterial and
          archaeal species, molybdopterin is in the form of a
          dinucleotide, with two molybdopterin dinucleotide units
          per molybdenum. These proteins can function as
          monomers, heterodimers, or heterotrimers, depending on
          the protein and organism. Also included in the MopB
          superfamily is the eukaryotic/eubacterial protein
          domain family of the 75-kDa subunit/Nad11/NuoG (second
          domain) of respiratory complex 1/NADH-quinone
          oxidoreductase which is postulated to have lost an
          ancestral formate dehydrogenase activity and only
          vestigial sequence evidence remains of a molybdopterin
          binding site.
          Length = 374

 Score = 24.2 bits (53), Expect = 7.8
 Identities = 9/27 (33%), Positives = 11/27 (40%), Gaps = 7/27 (25%)

Query: 41 SCCRLCGFGCNGGFPGMAWRYWVKNGI 67
          S C  CG GC           +VK+G 
Sbjct: 2  SVCPFCGVGCG-------ILVYVKDGK 21


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.136    0.456 

Gapped
Lambda     K      H
   0.267   0.0761    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 3,676,437
Number of extensions: 257912
Number of successful extensions: 254
Number of sequences better than 10.0: 1
Number of HSP's gapped: 246
Number of HSP's successfully gapped: 21
Length of query: 74
Length of database: 10,937,602
Length adjustment: 44
Effective length of query: 30
Effective length of database: 8,986,026
Effective search space: 269580780
Effective search space used: 269580780
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 53 (24.1 bits)