RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9821
(78 letters)
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 45.2 bits (108), Expect = 2e-08
Identities = 18/35 (51%), Positives = 22/35 (62%)
Query: 39 CGLGLHDCHKDAKCTNTHGSYSCQCKRGFHGDGKT 73
          C      CH +A CTNT GS++C CK G+ GDG T
Sbjct: 1  CAENNGGCHPNATCTNTGGSFTCTCKSGYTGDGVT 35
>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain.
Length = 42
Score = 43.9 bits (104), Expect = 6e-08
Identities = 18/42 (42%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 35 DVDECGLGLHDCHKDAKCTNTHGSYSCQCKRGF-HGDGKTSC 75
          DVDEC  G H+C  +  C NT GS+ C C  G+ + +  T+C
Sbjct: 1  DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular
(mostly animal) proteins. Many of these proteins
require calcium for their biological function and
calcium-binding sites have been found to be located at
the N-terminus of particular EGF-like domains;
calcium-binding may be crucial for numerous
protein-protein interactions. Six conserved core
cysteines form three disulfide bridges as in non
calcium-binding EGF domains, whose structures are very
similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 40.7 bits (96), Expect = 9e-07
Identities = 16/36 (44%), Positives = 19/36 (52%), Gaps = 1/36 (2%)
Query: 35 DVDECGLGLHDCHKDAKCTNTHGSYSCQCKRGFHGD 70
          D+DEC  G + C     C NT GSY C C  G+ G
Sbjct: 1  DIDECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGR 35
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 39.5 bits (93), Expect = 3e-06
Identities = 15/34 (44%), Positives = 19/34 (55%), Gaps = 1/34 (2%)
Query: 35 DVDECGLGLHDCHKDAKCTNTHGSYSCQCKRGFH 68
          D+DEC  G + C     C NT GSY C+C  G+
Sbjct: 1  DIDECASG-NPCQNGGTCVNTVGSYRCECPPGYT 33
>gnl|CDD|238752 cd01475, vWA_Matrilin, VWA_Matrilin: In cartilaginous plate,
extracellular matrix molecules mediate cell-matrix and
matrix-matrix interactions thereby providing tissue
integrity. Some members of the matrilin family are
expressed specifically in developing cartilage
rudiments. The matrilin family consists of at least four
members. All the members of the matrilin family contain
VWA domains, EGF-like domains and a heptad repeat
coiled-coil domain at the carboxy terminus which is
responsible for the oligomerization of the matrilins.
The VWA domains have been shown to be essential for
matrilin network formation by interacting with matrix
ligands.
Length = 224
Score = 40.8 bits (96), Expect = 1e-05
Identities = 13/37 (35%), Positives = 17/37 (45%), Gaps = 2/37 (5%)
Query: 31  SLCPDVDECGLGLHDCHKDAKCTNTHGSYSCQCKRGF 67
            +C   D C    H C +   C +T GSY C C  G+
Sbjct: 182 KICVVPDLCATLSHVCQQ--VCISTPGSYLCACTEGY 216
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) and present in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional
significance of EGF-like domains in what appear to be
unrelated proteins is not yet clear; a common feature
is that these repeats are found in the extracellular
domain of membrane-bound proteins or in proteins known
to be secreted (exception: prostaglandin G/H synthase);
the domain includes six cysteine residues which have
been shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 32.8 bits (75), Expect = 0.001
Identities = 12/28 (42%), Positives = 14/28 (50%)
Query: 44 HDCHKDAKCTNTHGSYSCQCKRGFHGDG 71
          + C     C NT GSY C C  G+ GD
Sbjct: 6  NPCSNGGTCVNTPGSYRCVCPPGYTGDR 33
>gnl|CDD|215652 pfam00008, EGF, EGF-like domain. There is no clear separation
between noise and signal. pfam00053 is very similar,
but has 8 instead of 6 conserved cysteines. Includes
some cytokine receptors. The EGF domain misses the
N-terminus regions of the Ca2+ binding EGF domains
(this is the main reason of discrepancy between
swiss-prot domain start/end and Pfam). The family is
hard to model due to many similar but different
sub-types of EGF domains. Pfam certainly misses a
number of EGF domains.
Length = 32
Score = 29.7 bits (67), Expect = 0.019
Identities = 9/25 (36%), Positives = 13/25 (52%)
Query: 46 CHKDAKCTNTHGSYSCQCKRGFHGD 70
          C     C +T G Y+C+C  G+ G
Sbjct: 7  CSNGGTCVDTPGGYTCECPEGYTGK 31
>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain.
Length = 35
Score = 27.5 bits (61), Expect = 0.16
Identities = 12/27 (44%), Positives = 14/27 (51%), Gaps = 1/27 (3%)
Query: 44 HDCHKDAKCTNTHGSYSCQCKRGFHGD 70
            C     C NT GSY+C C  G+ GD
Sbjct: 6  GPCSNG-TCINTPGSYTCSCPPGYTGD 31
>gnl|CDD|119287 pfam10767, DUF2593, Protein of unknown function (DUF2593). This
family of proteins appears to be restricted to
Enterobacteriaceae. Some members in the family are
annotated as YbjO however currently there is no known
function.
Length = 144
Score = 27.8 bits (62), Expect = 0.36
Identities = 10/33 (30%), Positives = 12/33 (36%), Gaps = 6/33 (18%)
Query: 1  MLCFISDSCAVAVAARHNISVYEEDARWSYSLC 33
          +L  +   C  AV    N        RW Y LC
Sbjct: 56 VLLCLEIRCGFAVLKGRNW------GRWGYLLC 82
>gnl|CDD|225400 COG2844, GlnD, UTP:GlnB (protein PII) uridylyltransferase
[Posttranslational modification, protein turnover,
chaperones].
Length = 867
Score = 28.1 bits (63), Expect = 0.40
Identities = 12/33 (36%), Positives = 15/33 (45%), Gaps = 6/33 (18%)
Query: 21  VYEEDARW------SYSLCPDVDECGLGLHDCH 47
           V E+D R        Y+L PD+     GL D H
Sbjct: 178 VEEQDERHARYGDTRYNLEPDIKSGPGGLRDLH 210
>gnl|CDD|218955 pfam06247, Plasmod_Pvs28, Plasmodium ookinete surface protein
Pvs28. This family consists of several ookinete surface
proteins (Pvs28) from several species of Plasmodium.
Pvs25 and Pvs28 are expressed on the surface of
ookinetes. These proteins are potential candidates for
vaccine and induce antibodies that block the infectivity
of Plasmodium vivax in immunised animals.
Length = 196
Score = 27.4 bits (61), Expect = 0.56
Identities = 12/26 (46%), Positives = 15/26 (57%)
Query: 46  CHKDAKCTNTHGSYSCQCKRGFHGDG 71
           C ++ +C    G Y C CK GF GDG
Sbjct: 138 CKENEECKLVGGYYECVCKEGFPGDG 163
>gnl|CDD|225459 COG2907, COG2907, Predicted NAD/FAD-binding protein [General
function prediction only].
Length = 447
Score = 26.3 bits (58), Expect = 1.7
Identities = 7/20 (35%), Positives = 14/20 (70%)
Query: 8  SCAVAVAARHNISVYEEDAR 27
          S A  ++ RH+++++E D R
Sbjct: 22 SAAWLLSRRHDVTLFEADRR 41
>gnl|CDD|182256 PRK10126, PRK10126, tyrosine phosphatase; Provisional.
Length = 147
Score = 25.3 bits (55), Expect = 3.3
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 2/48 (4%)
Query: 6  SDSCAVAVAARHNISVYEEDARW-SYSLCPDVDECGLGLHDCHKDAKC 52
          +D  A++VAA H +S+    AR  S  LC + D   L +   H +  C
Sbjct: 45 ADPTAISVAAEHQLSLEGHCARQISRRLCRNYDLI-LTMEKRHIERLC 91
>gnl|CDD|221695 pfam12662, cEGF, Complement C1r-like EGF-like. cEGF, or
complement C1r-like EGF, domains have six conserved
cysteine residues disulfide-bonded into the
characteristic pattern 'ababcc'. They are found in
blood coagulation proteins such as fibrillin, C1r and
C1s, thrombomodulin, and the LDL receptor. The core
fold of the EGF domain consists of two small
beta-hairpins packed against each other. Two major
structural variants have been identified based on the
structural context of the C-terminal cysteine residue
of disulfide 'c' in the C-terminal hairpin: hEGFs and
cEGFs. In cEGFs the C-terminal thiol resides on the
C-terminal beta-sheet, resulting in long loop-lengths
between the cysteine residues of disulfide 'c',
typically C[10+]XC. These longer loop-lengths may have
arisen by selective cysteine loss from a four-disulfide
EGF template such as laminin or integrin. Tandem cEGF
domains have five linking residues between terminal
cysteines of adjacent domains. cEGF domains may or may
not bind calcium in the linker region. cEGF domains
with the consensus motif CXN4X[F,Y]XCXC are
hydroxylated exclusively on the asparagine residue.
Length = 24
Score = 22.8 bits (50), Expect = 6.8
Identities = 7/20 (35%), Positives = 10/20 (50%), Gaps = 1/20 (5%)
Query: 58 SYSCQCKRGFHGDG-KTSCT 76
          SY+C C  G+   G   +C
Sbjct: 1  SYTCSCPPGYQLSGDGRTCE 20
>gnl|CDD|233157 TIGR00863, P2X, cation transporter protein. ATP-gated Cation
Channel (ACC) Family (TC 1.A.7). Members of the ACC family
(also called P2X receptors) respond to ATP, a functional
neurotransmitter released by exocytosis from many types
of neurons. These channels, which function at
neuron-neuron and neuron-smooth muscle junctions, may
play roles in the control of blood pressure and pain
sensation. They may also function in lymphocyte and
platelet physiology. They are found only in animals. ACC
channels are probably hetero- or homomultimers and
transport small monovalent cations (Me+). Some also
transport Ca2+; a few also transport small metabolites
[Transport and binding proteins, Cations and iron
carrying compounds].
Length = 372
Score = 24.3 bits (53), Expect = 8.7
Identities = 13/49 (26%), Positives = 20/49 (40%), Gaps = 9/49 (18%)
Query: 31  SLCPDVDECGLGLHDCHKDAKC------TNTHGSYSCQCKRGFHGDGKT 73
             CP+     L    C  D+ C      T+ +G  + +C   F+G  KT
Sbjct: 116 GRCPEHPSVPLA--ICWSDSDCTAGEAGTHGNGIKTGRCVP-FNGTVKT 161
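The hit list above follows a regular shape: a `>gnl|CDD|...` header, a `Length` line, and a `Score = ... bits (...), Expect = ...` line. A minimal sketch of pulling those fields into records follows. The function names `parse_hits` and `parse_evalue` and the two regular expressions are hypothetical helpers written for this illustration; they are not part of BLAST and assume only the line shapes seen in this report.

```python
import re

# Hypothetical helpers, not part of BLAST. The patterns assume the header and
# score line shapes that appear in the report above.
HIT_RE = re.compile(r"^>gnl\|CDD\|(\d+)\s+([^,\s]+),\s*([^,]+),")
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([0-9.eE+-]+)"
)


def parse_evalue(text):
    """Convert an Expect field to float, tolerating a bare 'e-100' form."""
    return float("1" + text if text.lower().startswith("e") else text)


def parse_hits(report):
    """Return one dict per hit, keeping only the first score line of each hit."""
    hits, current = [], None
    for raw_line in report.splitlines():
        line = raw_line.strip()
        header = HIT_RE.match(line)
        if header:
            current = {
                "pssm_id": int(header.group(1)),
                "accession": header.group(2),
                "short_name": header.group(3).strip(),
            }
            continue
        score = SCORE_RE.search(line)
        if score and current is not None:
            current["bits"] = float(score.group(1))
            current["raw_score"] = int(score.group(2))
            current["evalue"] = parse_evalue(score.group(3))
            hits.append(current)
            current = None  # ignore further score lines until the next header
    return hits


# Two hits copied from the report above.
sample = """\
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
Length = 36
Score = 45.2 bits (108), Expect = 2e-08
>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain.
Length = 35
Score = 27.5 bits (61), Expect = 0.16
"""

hits = parse_hits(sample)
significant = [h["accession"] for h in hits if h["evalue"] <= 0.01]
print(significant)
```

The 0.01 cutoff in the usage lines is an arbitrary illustration, not a threshold taken from the report.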
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.134 0.464
Gapped
Lambda K H
0.267 0.0632 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 3,586,744
Number of extensions: 247839
Number of successful extensions: 186
Number of sequences better than 10.0: 1
Number of HSP's gapped: 184
Number of HSP's successfully gapped: 23
Length of query: 78
Length of database: 10,937,602
Length adjustment: 47
Effective length of query: 31
Effective length of database: 8,852,964
Effective search space: 274441884
Effective search space used: 274441884
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 53 (24.4 bits)
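The footer figures can be cross-checked with a little arithmetic. The sketch below assumes the usual Karlin-Altschul relations: effective lengths are the raw lengths minus the length adjustment (once per database sequence), a threshold score converts to bits as (lambda * S - ln K) / ln 2, and an X-drop value converts as lambda * X / ln 2. The function names are hypothetical. Because the footer prints lambda and K rounded to three digits, the bit values agree with the report only to within about 0.1 bit. The per-hit bit scores are not reproduced here; for example, raw score 108 gives roughly 45.6 bits with the gapped footer parameters rather than the reported 45.2, so they evidently rely on statistics not shown in the footer.

```python
import math

# Values copied from the report footer above.
QUERY_LEN = 78
DB_LETTERS = 10_937_602
DB_SEQS = 44_354
LENGTH_ADJUSTMENT = 47
UNGAPPED = (0.322, 0.134)   # (lambda, K), as printed (rounded)
GAPPED = (0.267, 0.0632)    # (lambda, K), as printed (rounded)


def effective_search_space(query_len, db_letters, db_seqs, adjustment):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - adjustment
    eff_db = db_letters - db_seqs * adjustment
    return eff_query, eff_db, eff_query * eff_db


def threshold_bits(raw, lam, k):
    """Bit score of a raw score: (lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def xdrop_bits(raw, lam):
    """Bit equivalent of an X-drop value: lambda * X / ln 2."""
    return lam * raw / math.log(2)


eff_q, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, LENGTH_ADJUSTMENT
)
print(eff_q, eff_db, space)  # 31 8852964 274441884

# (label, computed bits, bits printed in the footer)
checks = [
    ("X1", xdrop_bits(16, UNGAPPED[0]), 7.4),
    ("X2", xdrop_bits(38, GAPPED[0]), 14.6),
    ("X3", xdrop_bits(64, GAPPED[0]), 24.7),
    ("S1", threshold_bits(41, *UNGAPPED), 22.0),
    ("S2", threshold_bits(53, *GAPPED), 24.4),
]
for label, computed, reported in checks:
    print(f"{label}: computed {computed:.2f} bits, reported {reported}")
```

X1 and S1 use the ungapped parameters and X2, X3 and S2 the gapped ones; that pairing is inferred from which combination reproduces the printed values.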