RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7201
(65 letters)
>gnl|CDD|173889 cd06901, lectin_VIP36_VIPL, VIP36 and VIPL type 1 transmembrane
proteins, lectin domain. The vesicular integral protein
of 36 kDa (VIP36) is a type 1 transmembrane protein of
the mammalian early secretory pathway that acts as a
cargo receptor transporting high mannose type
glycoproteins between the Golgi and the endoplasmic
reticulum (ER). Lectins of the early secretory pathway
are involved in the selective transport of newly
synthesized glycoproteins from the ER to the ER-Golgi
intermediate compartment (ERGIC). The most prominent
cycling lectin is the mannose-binding type 1 membrane
protein ERGIC-53, which functions as a cargo receptor to
facilitate export of glycoproteins from the ER. L-type
lectins have a dome-shaped beta-barrel carbohydrate
recognition domain with a curved seven-stranded
beta-sheet referred to as the "front face" and a flat
six-stranded beta-sheet referred to as the "back face".
This domain homodimerizes so that adjacent back sheets
form a contiguous 12-stranded sheet and homotetramers
occur by a back-to-back association of these homodimers.
Though L-type lectins exhibit both sequence and
structural similarity to one another, their carbohydrate
binding specificities differ widely.
Length = 248
Score = 90.9 bits (226), Expect = 1e-24
Identities = 34/53 (64%), Positives = 42/53 (79%)
Query: 1 STDFENKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYELEFP 53
TD + K WKECF V+GV+LPTGYYFG SAATGDLSDNHD++ ++ YEL+
Sbjct: 174 MTDIDGKNEWKECFDVTGVRLPTGYYFGASAATGDLSDNHDIISMKLYELDVE 226
>gnl|CDD|217528 pfam03388, Lectin_leg-like, Legume-like lectin family. Lectins are
structurally diverse proteins that bind to specific
carbohydrates. This family includes the VIP36 and
ERGIC-53 lectins. These two proteins were the first
recognised members of a family of animal lectins similar
(19-24%) to the leguminous plant lectins. The alignment
for this family aligns residues lying towards the
N-terminus, where the similarity of VIP36 and ERGIC-53
is greatest. However, while Fiedler and Simons
identified these proteins as a new family of animal
lectins, our alignment also includes yeast sequences.
ERGIC-53 is a 53kD protein, localised to the
intermediate region between the endoplasmic reticulum
and the Golgi apparatus (ER-Golgi-Intermediate
Compartment, ERGIC). It was identified as a
calcium-dependent, mannose-specific lectin. Its
dysfunction has been associated with combined factors V
and VIII deficiency OMIM:227300 OMIM:601567, suggesting
an important and substrate-specific role for ERGIC-53 in
the glycoprotein-secreting pathway.
Length = 226
Score = 72.8 bits (179), Expect = 7e-18
Identities = 31/48 (64%), Positives = 37/48 (77%)
Query: 5 ENKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYELEF 52
E+K +K CF+V V+LPTGYYFGVSAATGDLSDNHDV T++L
Sbjct: 179 EDKVDYKLCFQVENVRLPTGYYFGVSAATGDLSDNHDVFSFLTFQLFN 226
>gnl|CDD|173892 cd07308, lectin_leg-like, legume-like lectins: ERGIC-53, ERGL,
VIP36, VIPL, EMP46, and EMP47. The legume-like
(leg-like) lectins are eukaryotic intracellular sugar
transport proteins with a carbohydrate recognition
domain similar to that of the legume lectins. This
domain binds high-mannose-type oligosaccharides for
transport from the endoplasmic reticulum to the Golgi
complex. These leg-like lectins include ERGIC-53, ERGL,
VIP36, VIPL, EMP46, EMP47, and the UIP5
(ULP1-interacting protein 5) precursor protein.
Leg-like lectins have different intracellular
distributions and dynamics in the endoplasmic
reticulum-Golgi system of the secretory pathway and
interact with N-glycans of glycoproteins in a
calcium-dependent manner, suggesting a role in
glycoprotein sorting and trafficking. L-type lectins
have a dome-shaped beta-barrel carbohydrate recognition
domain with a curved seven-stranded beta-sheet referred
to as the "front face" and a flat six-stranded
beta-sheet referred to as the "back face". This domain
homodimerizes so that adjacent back sheets form a
contiguous 12-stranded sheet and homotetramers occur by
a back-to-back association of these homodimers. Though
L-type lectins exhibit both sequence and structural
similarity to one another, their carbohydrate binding
specificities differ widely.
Length = 218
Score = 65.8 bits (161), Expect = 3e-15
Identities = 28/48 (58%), Positives = 32/48 (66%)
Query: 3 DFENKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYEL 50
+ WKECF V V LP+ YFG SA TGDLSDNHD+L + TYEL
Sbjct: 171 TYSEGNNWKECFTVEDVILPSQGYFGFSAQTGDLSDNHDILSVHTYEL 218
>gnl|CDD|173890 cd06902, lectin_ERGIC-53_ERGL, ERGIC-53 and ERGL type 1
transmembrane proteins, N-terminal lectin domain.
ERGIC-53 and ERGL, N-terminal carbohydrate recognition
domain. ERGIC-53 and ERGL are eukaryotic mannose-binding
type 1 transmembrane proteins of the early secretory
pathway that transport newly synthesized glycoproteins
from the endoplasmic reticulum (ER) to the ER-Golgi
intermediate compartment (ERGIC). ERGIC-53 and ERGL
have an N-terminal lectin-like carbohydrate recognition
domain (represented by this alignment model) as well as
a C-terminal transmembrane domain. ERGIC-53 functions
as a 'cargo receptor' to facilitate the export of
glycoproteins with different characteristics from the
ER, while the ERGIC-53-like protein (ERGL) may act
as a regulator of ERGIC-53. In mammals, ERGIC-53 forms
a complex with MCFD2 (multi-coagulation factor
deficiency 2) which then recruits blood coagulation
factors V and VIII. Mutations in either MCFD2 or
ERGIC-53 cause a mild form of inherited hemophilia known
as combined deficiency of factors V and VIII (F5F8D). In
addition to the lectin and transmembrane domains,
ERGIC-53 and ERGL have a short N-terminal cytoplasmic
region of about 12 amino acids. ERGIC-53 forms
disulphide-linked homodimers and homohexamers. ERGIC-53
and ERGL are sequence-similar to the lectins of
leguminous plants. L-type lectins have a dome-shaped
beta-barrel carbohydrate recognition domain with a
curved seven-stranded beta-sheet referred to as the
"front face" and a flat six-stranded beta-sheet referred
to as the "back face". This domain homodimerizes so
that adjacent back sheets form a contiguous 12-stranded
sheet and homotetramers occur by a back-to-back
association of these homodimers. Though L-type lectins
exhibit both sequence and structural similarity to one
another, their carbohydrate binding specificities differ
widely.
Length = 225
Score = 56.9 bits (138), Expect = 9e-12
Identities = 23/45 (51%), Positives = 30/45 (66%)
Query: 6 NKAAWKECFKVSGVKLPTGYYFGVSAATGDLSDNHDVLGIRTYEL 50
NK ++ C +V + LP YFGVSAATG L+D+HDVL T+ L
Sbjct: 180 NKDDYELCTRVENMVLPPNGYFGVSAATGGLADDHDVLSFLTFSL 224
>gnl|CDD|173886 cd01951, lectin_L-type, legume lectins. The L-type (legume-type)
lectins are a highly diverse family of carbohydrate
binding proteins that generally display no enzymatic
activity toward the sugars they bind. This family
includes arcelin, concanavalinA, the lectin-like
receptor kinases, the ERGIC-53/VIP36/EMP46 type1
transmembrane proteins, and an alpha-amylase inhibitor.
L-type lectins have a dome-shaped beta-barrel
carbohydrate recognition domain with a curved
seven-stranded beta-sheet referred to as the "front
face" and a flat six-stranded beta-sheet referred to as
the "back face". This domain homodimerizes so that
adjacent back sheets form a contiguous 12-stranded sheet
and homotetramers occur by a back-to-back association of
these homodimers. Though L-type lectins exhibit both
sequence and structural similarity to one another, their
carbohydrate binding specificities differ widely.
Length = 223
Score = 37.0 bits (86), Expect = 2e-04
Identities = 12/32 (37%), Positives = 17/32 (53%)
Query: 15 KVSGVKLPTGYYFGVSAATGDLSDNHDVLGIR 46
PT YFG +A+TG L++ HD+L
Sbjct: 189 VDLIQLGPTKAYFGFTASTGGLTNLHDILNWS 220
>gnl|CDD|173891 cd06903, lectin_EMP46_EMP47, EMP46 and EMP47 type 1 transmembrane
proteins, N-terminal lectin domain. EMP46 and EMP47,
N-terminal carbohydrate recognition domain. EMP46 and
EMP47 are fungal type-I transmembrane proteins that
cycle between the endoplasmic reticulum and the golgi
apparatus and are thought to function as cargo receptors
that transport newly synthesized glycoproteins. EMP47
is a receptor for EMP46 responsible for the selective
transport of EMP46 by hetero-oligomerization
between the two proteins. EMP46 and EMP47 have an
N-terminal lectin-like carbohydrate recognition domain
(represented by this alignment model) as well as a
C-terminal transmembrane domain. EMP46 and EMP47 are 45%
sequence-identical to one another and have sequence
homology to a class of intracellular lectins defined by
ERGIC-53 and VIP36. L-type lectins have a dome-shaped
beta-barrel carbohydrate recognition domain with a
curved seven-stranded beta-sheet referred to as the
"front face" and a flat six-stranded beta-sheet referred
to as the "back face". This domain homodimerizes so
that adjacent back sheets form a contiguous 12-stranded
sheet and homotetramers occur by a back-to-back
association of these homodimers. Though L-type lectins
exhibit both sequence and structural similarity to one
another, their carbohydrate binding specificities differ
widely.
Length = 215
Score = 32.6 bits (75), Expect = 0.006
Identities = 13/38 (34%), Positives = 24/38 (63%), Gaps = 1/38 (2%)
Query: 13 CFKVSGVKLP-TGYYFGVSAATGDLSDNHDVLGIRTYE 49
CF+ V+LP GY FG++AA D ++ ++L ++ +
Sbjct: 176 CFQTDKVQLPQGGYRFGITAANADNPESFEILKLKVWN 213
>gnl|CDD|173887 cd06899, lectin_legume_LecRK_Arcelin_ConA, legume lectins,
lectin-like receptor kinases, arcelin, concanavalinA,
and alpha-amylase inhibitor. This alignment model
includes the legume lectins (also known as agglutinins),
the arcelin (also known as phytohemagglutinin-L) family
of lectin-like defense proteins, the LecRK family of
lectin-like receptor kinases, concanavalinA (ConA), and
an alpha-amylase inhibitor. Arcelin is a major seed
glycoprotein discovered in kidney beans (Phaseolus
vulgaris) that has insecticidal properties and protects
the seeds from predation by larvae of various bruchids.
Arcelin is devoid of monosaccharide binding properties
and lacks a key metal-binding loop that is present in
other members of this family. Phytohaemagglutinin (PHA)
is a lectin found in plants, especially beans, that
affects cell metabolism by inducing mitosis and by
altering the permeability of the cell membrane to
various proteins. PHA agglutinates most mammalian red
blood cell types by binding glycans on the cell surface.
Medically, PHA is used as a mitogen to trigger cell
division in T-lymphocytes and to activate latent HIV-1
from human peripheral lymphocytes. Plant L-type lectins
are primarily found in the seeds of leguminous plants
where they constitute about 10% of the total soluble
protein of the seed extracts. They are synthesized
during seed development several weeks after flowering
and transported to the vacuole where they become
condensed into specialized vesicles called protein
bodies. L-type lectins have a dome-shaped beta-barrel
carbohydrate recognition domain with a curved
seven-stranded beta-sheet referred to as the "front
face" and a flat six-stranded beta-sheet referred to as
the "back face". This domain homodimerizes so that
adjacent back sheets form a contiguous 12-stranded sheet
and homotetramers occur by a back-to-back association of
these homodimers. Though L-type lectins exhibit both
sequence and structural similarity to one another, their
carbohydrate binding specificities differ widely.
Length = 236
Score = 29.5 bits (67), Expect = 0.074
Identities = 9/19 (47%), Positives = 13/19 (68%)
Query: 26 YFGVSAATGDLSDNHDVLG 44
Y G SA+TG L++ H +L
Sbjct: 211 YVGFSASTGLLTELHYILS 229
>gnl|CDD|215744 pfam00139, Lectin_legB, Legume lectin domain.
Length = 231
Score = 28.0 bits (63), Expect = 0.26
Identities = 11/25 (44%), Positives = 15/25 (60%)
Query: 20 KLPTGYYFGVSAATGDLSDNHDVLG 44
LP Y G SA+TG +++H VL
Sbjct: 203 VLPEWVYVGFSASTGGATESHYVLS 227
>gnl|CDD|182189 PRK10001, PRK10001, D-alanyl-D-alanine carboxypeptidase fraction C;
Provisional.
Length = 400
Score = 26.5 bits (58), Expect = 0.97
Identities = 20/57 (35%), Positives = 25/57 (43%), Gaps = 5/57 (8%)
Query: 6 NKAAWKECFKVSGVKLPT----GYYFGVSAATGDLSDNHDVLGIRTYELEF-PGEKL 57
N+ W V G+K T GY SA GD+ VLG +T + F EKL
Sbjct: 221 NRLLWSSNLNVDGMKTGTTAGAGYNLVASATQGDMRLISVVLGAKTDRIRFNESEKL 277
>gnl|CDD|241116 cd12672, RRM_DAZL, RNA recognition motif in vertebrate deleted in
azoospermia-like (DAZL) proteins. This subgroup
corresponds to the RRM of DAZL, also termed
SPGY-like-autosomal, encoded by the autosomal homolog
of DAZ gene, DAZL. It is ancestral to the deleted in
azoospermia (DAZ) protein. DAZL is a germ-cell-specific
RNA-binding protein that contains an RNA recognition
motif (RRM), also known as RBD (RNA binding domain) or
RNP (ribonucleoprotein domain), and a DAZ motif, a
protein-protein interaction domain. Although their
specific biochemical functions remain to be
investigated, DAZL proteins may interact with
poly(A)-binding proteins (PABPs), and act as
translational activators of specific mRNAs during
gametogenesis.
Length = 82
Score = 24.4 bits (53), Expect = 2.8
Identities = 19/51 (37%), Positives = 24/51 (47%), Gaps = 3/51 (5%)
Query: 15 KVSGVKLPTGYYFGVSAATG--DLSDNHDVLGIRTYELEFPGEKLSKGPFI 63
V VK+ T GVS G D+ DV I ++ F G+KL GP I
Sbjct: 31 SVKEVKIIT-DRTGVSKGYGFVSFYDDVDVQKIVESQINFHGKKLKLGPAI 80
>gnl|CDD|215524 PLN02970, PLN02970, serine racemase.
Length = 328
Score = 24.6 bits (54), Expect = 3.6
Identities = 10/24 (41%), Positives = 12/24 (50%), Gaps = 1/24 (4%)
Query: 1 STDFENKAAWKECFKVSGVKLPTG 24
S F + AWK C V G+ L G
Sbjct: 293 SDSFRSNPAWKGCKNV-GIVLSGG 315
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.137 0.420
Gapped
Lambda K H
0.267 0.0681 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 3,281,027
Number of extensions: 226616
Number of successful extensions: 141
Number of sequences better than 10.0: 1
Number of HSP's gapped: 140
Number of HSP's successfully gapped: 11
Length of query: 65
Length of database: 10,937,602
Length adjustment: 36
Effective length of query: 29
Effective length of database: 9,340,858
Effective search space: 270884882
Effective search space used: 270884882
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.5 bits)
S2: 53 (24.3 bits)
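
The figures in the statistics block above can be cross-checked with a short calculation. The sketch below is not part of the RPS-BLAST output. It assumes the usual BLAST conventions: the length adjustment is subtracted once from the query and once per database sequence, and a raw score S is converted to bits as (Lambda * S - ln K) / ln 2 using the gapped Lambda and K. The function names are hypothetical. Because the report prints Lambda and K rounded, and RPS-BLAST may rescale scores per profile, the bit scores of individual hits should be expected to agree only approximately.

```python
import math


def effective_search_space(query_len, db_len, n_seqs, length_adj):
    """Effective search space under the assumed BLAST convention.

    The length adjustment is removed from the query once and from the
    database once per sequence.
    """
    eff_query = query_len - length_adj
    eff_db = db_len - n_seqs * length_adj
    return eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Values copied from the report footer.
space = effective_search_space(
    query_len=65, db_len=10_937_602, n_seqs=44_354, length_adj=36
)
print(space)  # 270884882, matching "Effective search space"

# Top hit (raw score 226) and the S2 cutoff (raw score 53),
# using the rounded gapped Lambda and K printed in the footer.
print(round(bit_score(226, 0.267, 0.0681), 2))
print(round(bit_score(53, 0.267, 0.0681), 2))
```

With the footer values, the effective query length is 65 - 36 = 29 and the effective database length is 10,937,602 - 44,354 * 36 = 9,340,858, which reproduces the reported search space exactly.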