RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7201
         (65 letters)



>gnl|CDD|173889 cd06901, lectin_VIP36_VIPL, VIP36 and VIPL type 1 transmembrane
           proteins, lectin domain.  The vesicular integral protein
           of 36 kDa (VIP36) is a type 1 transmembrane protein of
           the mammalian early secretory pathway that acts as a
           cargo receptor transporting high mannose type
           glycoproteins between the Golgi and the endoplasmic
           reticulum (ER).  Lectins of the early secretory pathway
           are involved in the selective transport of newly
           synthesized glycoproteins from the ER to the ER-Golgi
           intermediate compartment (ERGIC). The most prominent
           cycling lectin is the mannose-binding type1 membrane
           protein ERGIC-53, which functions as a cargo receptor to
           facilitate export of glycoproteins from the ER. L-type
           lectins have a dome-shaped beta-barrel carbohydrate
           recognition domain with a curved seven-stranded
           beta-sheet referred to as the "front face" and a flat
           six-stranded beta-sheet referred to as the "back face". 
           This domain homodimerizes so that adjacent back sheets
           form a contiguous 12-stranded sheet and homotetramers
           occur by a back-to-back association of these homodimers.
            Though L-type lectins exhibit both sequence and
           structural similarity to one another, their carbohydrate
           binding specificities differ widely.
          Length = 248

 Score = 90.9 bits (226), Expect = 1e-24
 Identities = 34/53 (64%), Positives = 42/53 (79%)

Query: 1   STDFENKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYELEFP 53
            TD + K  WKECF V+GV+LPTGYYFG SAATGDLSDNHD++ ++ YEL+  
Sbjct: 174 MTDIDGKNEWKECFDVTGVRLPTGYYFGASAATGDLSDNHDIISMKLYELDVE 226


>gnl|CDD|217528 pfam03388, Lectin_leg-like, Legume-like lectin family.  Lectins are
           structurally diverse proteins that bind to specific
           carbohydrates. This family includes the VIP36 and
           ERGIC-53 lectins. These two proteins were the first
           recognised members of a family of animal lectins similar
           (19-24%) to the leguminous plant lectins. The alignment
           for this family aligns residues lying towards the
           N-terminus, where the similarity of VIP36 and ERGIC-53
           is greatest. However, while Fiedler and Simons
           identified these proteins as a new family of animal
           lectins, our alignment also includes yeast sequences.
           ERGIC-53 is a 53kD protein, localised to the
           intermediate region between the endoplasmic reticulum
           and the Golgi apparatus (ER-Golgi-Intermediate
           Compartment, ERGIC). It was identified as a
           calcium-dependent, mannose-specific lectin. Its
           dysfunction has been associated with combined factors V
           and VIII deficiency OMIM:227300 OMIM:601567, suggesting
           an important and substrate-specific role for ERGIC-53 in
           the glycoprotein- secreting pathway.
          Length = 226

 Score = 72.8 bits (179), Expect = 7e-18
 Identities = 31/48 (64%), Positives = 37/48 (77%)

Query: 5   ENKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYELEF 52
           E+K  +K CF+V  V+LPTGYYFGVSAATGDLSDNHDV    T++L  
Sbjct: 179 EDKVDYKLCFQVENVRLPTGYYFGVSAATGDLSDNHDVFSFLTFQLFN 226


>gnl|CDD|173892 cd07308, lectin_leg-like, legume-like lectins: ERGIC-53, ERGL,
           VIP36, VIPL, EMP46, and EMP47.  The legume-like
           (leg-like) lectins are eukaryotic intracellular sugar
           transport proteins with a carbohydrate recognition
           domain similar to that of the legume lectins.  This
           domain binds high-mannose-type oligosaccharides for
           transport from the endoplasmic reticulum to the Golgi
           complex.  These leg-like lectins include ERGIC-53, ERGL,
           VIP36, VIPL, EMP46, EMP47, and the UIP5
           (ULP1-interacting protein 5) precursor protein.
           Leg-like lectins have different intracellular
           distributions and dynamics in the endoplasmic
           reticulum-Golgi system of the secretory pathway and
           interact with N-glycans of glycoproteins in a
           calcium-dependent manner, suggesting a role in
           glycoprotein sorting and trafficking.  L-type lectins
           have a dome-shaped beta-barrel carbohydrate recognition
           domain with a curved seven-stranded beta-sheet referred
           to as the "front face" and a flat six-stranded
           beta-sheet referred to as the "back face".  This domain
           homodimerizes so that adjacent back sheets form a
           contiguous 12-stranded sheet and homotetramers occur by
           a back-to-back association of these homodimers.  Though
           L-type lectins exhibit both sequence and structural
           similarity to one another, their carbohydrate binding
           specificities differ widely.
          Length = 218

 Score = 65.8 bits (161), Expect = 3e-15
 Identities = 28/48 (58%), Positives = 32/48 (66%)

Query: 3   DFENKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYEL 50
            +     WKECF V  V LP+  YFG SA TGDLSDNHD+L + TYEL
Sbjct: 171 TYSEGNNWKECFTVEDVILPSQGYFGFSAQTGDLSDNHDILSVHTYEL 218


>gnl|CDD|173890 cd06902, lectin_ERGIC-53_ERGL, ERGIC-53 and ERGL type 1
           transmembrane proteins, N-terminal lectin domain.
           ERGIC-53 and ERGL, N-terminal carbohydrate recognition
           domain. ERGIC-53 and ERGL are eukaryotic mannose-binding
           type 1 transmembrane proteins of the early secretory
           pathway that transport newly synthesized glycoproteins
           from the endoplasmic reticulum (ER) to the ER-Golgi
           intermediate compartment (ERGIC).  ERGIC-53 and ERGL
           have an N-terminal lectin-like carbohydrate recognition
           domain (represented by this alignment model) as well as
           a C-terminal transmembrane domain.  ERGIC-53 functions
           as a 'cargo receptor' to facilitate the export of
           glycoproteins with different characteristics from the
           ER, while the ERGIC-53-like protein (ERGL) which may act
           as a regulator of ERGIC-53.  In mammals, ERGIC-53 forms
           a complex with MCFD2 (multi-coagulation factor
           deficiency 2) which then recruits blood coagulation
           factors V and VIII.  Mutations in either MCFD2 or
           ERGIC-53 cause a mild form of inherited hemophilia known
           as combined deficiency of factors V and VIII (F5F8D). In
           addition to the lectin and transmembrane domains,
           ERGIC-53 and ERGL have a short N-terminal cytoplasmic
           region of about 12 amino acids. ERGIC-53 forms
           disulphide-linked homodimers and homohexamers. ERGIC-53
           and ERGL are sequence-similar to the lectins of
           leguminous plants.  L-type lectins have a dome-shaped
           beta-barrel carbohydrate recognition domain with a
           curved seven-stranded beta-sheet referred to as the
           "front face" and a flat six-stranded beta-sheet referred
           to as the "back face".  This domain homodimerizes so
           that adjacent back sheets form a contiguous 12-stranded
           sheet and homotetramers occur by a back-to-back
           association of these homodimers.  Though L-type lectins
           exhibit both sequence and structural similarity to one
           another, their carbohydrate binding specificities differ
           widely.
          Length = 225

 Score = 56.9 bits (138), Expect = 9e-12
 Identities = 23/45 (51%), Positives = 30/45 (66%)

Query: 6   NKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYEL 50
           NK  ++ C +V  + LP   YFGVSAATG L+D+HDVL   T+ L
Sbjct: 180 NKDDYELCTRVENMVLPPNGYFGVSAATGGLADDHDVLSFLTFSL 224


>gnl|CDD|173886 cd01951, lectin_L-type, legume lectins.  The L-type (legume-type)
           lectins are a highly diverse family of carbohydrate
           binding proteins that generally display no enzymatic
           activity toward the sugars they bind.  This family
           includes arcelin, concanavalinA, the lectin-like
           receptor kinases, the ERGIC-53/VIP36/EMP46 type1
           transmembrane proteins, and an alpha-amylase inhibitor. 
           L-type lectins have a dome-shaped beta-barrel
           carbohydrate recognition domain with a curved
           seven-stranded beta-sheet referred to as the "front
           face" and a flat six-stranded beta-sheet referred to as
           the "back face".  This domain homodimerizes so that
           adjacent back sheets form a contiguous 12-stranded sheet
           and homotetramers occur by a back-to-back association of
           these homodimers.  Though L-type lectins exhibit both
           sequence and structural similarity to one another, their
           carbohydrate binding specificities differ widely.
          Length = 223

 Score = 37.0 bits (86), Expect = 2e-04
 Identities = 12/32 (37%), Positives = 17/32 (53%)

Query: 15  KVSGVKLPTGYYFGVSAATGDLSDNHDVLGIR 46
                  PT  YFG +A+TG L++ HD+L   
Sbjct: 189 VDLIQLGPTKAYFGFTASTGGLTNLHDILNWS 220


>gnl|CDD|173891 cd06903, lectin_EMP46_EMP47, EMP46 and EMP47 type 1 transmembrane
           proteins, N-terminal lectin domain.  EMP46 and EMP47,
           N-terminal carbohydrate recognition domain. EMP46 and
           EMP47 are fungal type-I transmembrane proteins that
           cycle between the endoplasmic reticulum and the golgi
           apparatus and are thought to function as cargo receptors
           that transport newly synthesized glycoproteins.  EMP47
           is a receptor for EMP46 responsible for the selective
           transport of EMP46 by forming hetero-oligomerization
           between the two proteins. EMP46 and EMP47 have an
           N-terminal lectin-like carbohydrate recognition domain
           (represented by this alignment model) as well as a
           C-terminal transmembrane domain. EMP46 and EMP47 are 45%
           sequence-identical to one another and have sequence
           homology to a class of intracellular lectins defined by
           ERGIC-53 and VIP36.  L-type lectins have a dome-shaped
           beta-barrel carbohydrate recognition domain with a
           curved seven-stranded beta-sheet referred to as the
           "front face" and a flat six-stranded beta-sheet referred
           to as the "back face".  This domain homodimerizes so
           that adjacent back sheets form a contiguous 12-stranded
           sheet and homotetramers occur by a back-to-back
           association of these homodimers.  Though L-type lectins
           exhibit both sequence and structural similarity to one
           another, their carbohydrate binding specificities differ
           widely.
          Length = 215

 Score = 32.6 bits (75), Expect = 0.006
 Identities = 13/38 (34%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 13  CFKVSGVKLP-TGYYFGVSAATGDLSDNHDVLGIRTYE 49
           CF+   V+LP  GY FG++AA  D  ++ ++L ++ + 
Sbjct: 176 CFQTDKVQLPQGGYRFGITAANADNPESFEILKLKVWN 213


>gnl|CDD|173887 cd06899, lectin_legume_LecRK_Arcelin_ConA, legume lectins,
           lectin-like receptor kinases, arcelin, concanavalinA,
           and alpha-amylase inhibitor.  This alignment model
           includes the legume lectins (also known as agglutinins),
           the arcelin (also known as phytohemagglutinin-L) family
           of lectin-like defense proteins, the LecRK family of
           lectin-like receptor kinases, concanavalinA (ConA), and
           an alpha-amylase inhibitor.  Arcelin is a major seed
           glycoprotein discovered in kidney beans (Phaseolus
           vulgaris) that has insecticidal properties and protects
           the seeds from predation by larvae of various bruchids. 
           Arcelin is devoid of monosaccharide binding properties
           and lacks a key metal-binding loop that is present in
           other members of this family.  Phytohaemagglutinin (PHA)
           is a lectin found in plants, especially beans, that
           affects cell metabolism by inducing mitosis and by
           altering the permeability of the cell membrane to
           various proteins.  PHA agglutinates most mammalian red
           blood cell types by binding glycans on the cell surface.
            Medically, PHA is used as a mitogen to trigger cell
           division in T-lymphocytes and to activate latent HIV-1
           from human peripheral lymphocytes.  Plant L-type lectins
           are primarily found in the seeds of leguminous plants
           where they constitute about 10% of the total soluble
           protein of the seed extracts. They are synthesized
           during seed development several weeks after flowering
           and transported to the vacuole where they become
           condensed into specialized vesicles called protein
           bodies. L-type lectins have a dome-shaped beta-barrel
           carbohydrate recognition domain with a curved
           seven-stranded beta-sheet referred to as the "front
           face" and a flat six-stranded beta-sheet referred to as
           the "back face".  This domain homodimerizes so that
           adjacent back sheets form a contiguous 12-stranded sheet
           and homotetramers occur by a back-to-back association of
           these homodimers.  Though L-type lectins exhibit both
           sequence and structural similarity to one another, their
           carbohydrate binding specificities differ widely.
          Length = 236

 Score = 29.5 bits (67), Expect = 0.074
 Identities = 9/19 (47%), Positives = 13/19 (68%)

Query: 26  YFGVSAATGDLSDNHDVLG 44
           Y G SA+TG L++ H +L 
Sbjct: 211 YVGFSASTGLLTELHYILS 229


>gnl|CDD|215744 pfam00139, Lectin_legB, Legume lectin domain. 
          Length = 231

 Score = 28.0 bits (63), Expect = 0.26
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 20  KLPTGYYFGVSAATGDLSDNHDVLG 44
            LP   Y G SA+TG  +++H VL 
Sbjct: 203 VLPEWVYVGFSASTGGATESHYVLS 227


>gnl|CDD|182189 PRK10001, PRK10001, D-alanyl-D-alanine carboxypeptidase fraction C;
           Provisional.
          Length = 400

 Score = 26.5 bits (58), Expect = 0.97
 Identities = 20/57 (35%), Positives = 25/57 (43%), Gaps = 5/57 (8%)

Query: 6   NKAAWKECFKVSGVKLPT----GYYFGVSAATGDLSDNHDVLGIRTYELEF-PGEKL 57
           N+  W     V G+K  T    GY    SA  GD+     VLG +T  + F   EKL
Sbjct: 221 NRLLWSSNLNVDGMKTGTTAGAGYNLVASATQGDMRLISVVLGAKTDRIRFNESEKL 277


>gnl|CDD|241116 cd12672, RRM_DAZL, RNA recognition motif in vertebrate deleted in
          azoospermia-like (DAZL) proteins.  This subgroup
          corresponds to the RRM of DAZL, also termed
          SPGY-like-autosomal, encoded by the autosomal homolog
          of DAZ gene, DAZL. It is ancestral to the deleted in
          azoospermia (DAZ) protein. DAZL is germ-cell-specific
          RNA-binding protein that contains a RNA recognition
          motif (RRM), also known as RBD (RNA binding domain) or
          RNP (ribonucleoprotein domain), and a DAZ motif, a
          protein-protein interaction domain. Although their
          specific biochemical functions remain to be
          investigated, DAZL proteins may interact with
          poly(A)-binding proteins (PABPs), and act as
          translational activators of specific mRNAs during
          gametogenesis. .
          Length = 82

 Score = 24.4 bits (53), Expect = 2.8
 Identities = 19/51 (37%), Positives = 24/51 (47%), Gaps = 3/51 (5%)

Query: 15 KVSGVKLPTGYYFGVSAATG--DLSDNHDVLGIRTYELEFPGEKLSKGPFI 63
           V  VK+ T    GVS   G     D+ DV  I   ++ F G+KL  GP I
Sbjct: 31 SVKEVKIIT-DRTGVSKGYGFVSFYDDVDVQKIVESQINFHGKKLKLGPAI 80


>gnl|CDD|215524 PLN02970, PLN02970, serine racemase.
          Length = 328

 Score = 24.6 bits (54), Expect = 3.6
 Identities = 10/24 (41%), Positives = 12/24 (50%), Gaps = 1/24 (4%)

Query: 1   STDFENKAAWKECFKVSGVKLPTG 24
           S  F +  AWK C  V G+ L  G
Sbjct: 293 SDSFRSNPAWKGCKNV-GIVLSGG 315


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.137    0.420 

Gapped
Lambda     K      H
   0.267   0.0681    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 3,281,027
Number of extensions: 226616
Number of successful extensions: 141
Number of sequences better than 10.0: 1
Number of HSP's gapped: 140
Number of HSP's successfully gapped: 11
Length of query: 65
Length of database: 10,937,602
Length adjustment: 36
Effective length of query: 29
Effective length of database: 9,340,858
Effective search space: 270884882
Effective search space used: 270884882
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.5 bits)
S2: 53 (24.3 bits)