RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6349
(145 letters)
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular
(mostly animal) proteins. Many of these proteins
require calcium for their biological function and
calcium-binding sites have been found to be located at
the N-terminus of particular EGF-like domains;
calcium-binding may be crucial for numerous
protein-protein interactions. Six conserved core
cysteines form three disulfide bridges as in non
calcium-binding EGF domains, whose structures are very
similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 45.3 bits (108), Expect = 8e-08
Identities = 20/37 (54%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 46 INECES-DPCQNGATCQDLIGGYSCQCARGFQGQNCE 81
I+EC S +PCQNG TC + +G Y C C G+ G+NCE
Sbjct: 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
Score = 35.3 bits (82), Expect = 6e-04
Identities = 15/31 (48%), Positives = 21/31 (67%), Gaps = 1/31 (3%)
Query: 83 NVDDCK-PNPCQNGGICHDLINHFSCSCPSG 112
++D+C NPCQNGG C + + + CSCP G
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPG 31
Score = 28.8 bits (65), Expect = 0.13
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 18 HNGTCEDFGNSHRCHCMEGYTGSYC 42
+ GTC + S+RC C GYTG C
Sbjct: 13 NGGTCVNTVGSYRCSCPPGYTGRNC 37
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 42.6 bits (101), Expect = 1e-06
Identities = 20/39 (51%), Positives = 28/39 (71%), Gaps = 2/39 (5%)
Query: 45 EINECES-DPCQNGATCQDLIGGYSCQCARGFQ-GQNCE 81
+I+EC S +PCQNG TC + +G Y C+C G+ G+NCE
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39
Score = 35.3 bits (82), Expect = 6e-04
Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 1/38 (2%)
Query: 83 NVDDCK-PNPCQNGGICHDLINHFSCSCPSGLKREALC 119
++D+C NPCQNGG C + + + C CP G C
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC 38
Score = 25.7 bits (57), Expect = 2.3
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 18 HNGTCEDFGNSHRCHCMEGYT 38
+ GTC + S+RC C GYT
Sbjct: 13 NGGTCVNTVGSYRCECPPGYT 33
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional
significance of EGF-like domains in what appear to be
unrelated proteins is not yet clear; a common feature
is that these repeats are found in the extracellular
domain of membrane-bound proteins or in proteins known
to be secreted (exception: prostaglandin G/H synthase);
the domain includes six cysteine residues which have
been shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 33.2 bits (76), Expect = 0.003
Identities = 13/30 (43%), Positives = 16/30 (53%)
Query: 50 ESDPCQNGATCQDLIGGYSCQCARGFQGQN 79
S+PC NG TC + G Y C C G+ G
Sbjct: 4 ASNPCSNGGTCVNTPGSYRCVCPPGYTGDR 33
Score = 31.7 bits (72), Expect = 0.012
Identities = 12/31 (38%), Positives = 15/31 (48%)
Query: 89 PNPCQNGGICHDLINHFSCSCPSGLKREALC 119
NPC NGG C + + C CP G + C
Sbjct: 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSC 35
Score = 24.7 bits (54), Expect = 4.8
Identities = 11/23 (47%), Positives = 14/23 (60%)
Query: 18 HNGTCEDFGNSHRCHCMEGYTGS 40
+ GTC + S+RC C GYTG
Sbjct: 10 NGGTCVNTPGSYRCVCPPGYTGD 32
>gnl|CDD|215652 pfam00008, EGF, EGF-like domain. There is no clear separation
between noise and signal. pfam00053 is very similar,
but has 8 instead of 6 conserved cysteines. Includes
some cytokine receptors. The EGF domain misses the
N-terminus regions of the Ca2+ binding EGF domains
(this is the main reason of discrepancy between
swiss-prot domain start/end and Pfam). The family is
hard to model due to many similar but different
sub-types of EGF domains. Pfam certainly misses a
number of EGF domains.
Length = 32
Score = 31.3 bits (71), Expect = 0.014
Identities = 14/27 (51%), Positives = 18/27 (66%)
Query: 53 PCQNGATCQDLIGGYSCQCARGFQGQN 79
PC NG TC D GGY+C+C G+ G+
Sbjct: 6 PCSNGGTCVDTPGGYTCECPEGYTGKR 32
Score = 28.2 bits (63), Expect = 0.22
Identities = 12/23 (52%), Positives = 14/23 (60%)
Query: 90 NPCQNGGICHDLINHFSCSCPSG 112
NPC NGG C D ++C CP G
Sbjct: 5 NPCSNGGTCVDTPGGYTCECPEG 27
Score = 24.7 bits (54), Expect = 4.6
Identities = 11/24 (45%), Positives = 13/24 (54%)
Query: 18 HNGTCEDFGNSHRCHCMEGYTGSY 41
+ GTC D + C C EGYTG
Sbjct: 9 NGGTCVDTPGGYTCECPEGYTGKR 32
>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain.
Length = 42
Score = 27.7 bits (62), Expect = 0.35
Identities = 9/33 (27%), Positives = 15/33 (45%), Gaps = 2/33 (6%)
Query: 46 INECES--DPCQNGATCQDLIGGYSCQCARGFQ 76
++EC C C + IG + C C G++
Sbjct: 2 VDECADGTHNCPANTVCVNTIGSFECVCPDGYE 34
>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain.
Length = 35
Score = 27.5 bits (61), Expect = 0.43
Identities = 12/24 (50%), Positives = 14/24 (58%)
Query: 17 CHNGTCEDFGNSHRCHCMEGYTGS 40
C NGTC + S+ C C GYTG
Sbjct: 8 CSNGTCINTPGSYTCSCPPGYTGD 31
Score = 26.7 bits (59), Expect = 0.84
Identities = 12/30 (40%), Positives = 15/30 (50%), Gaps = 1/30 (3%)
Query: 50 ESDPCQNGATCQDLIGGYSCQCARGFQGQN 79
PC NG TC + G Y+C C G+ G
Sbjct: 4 SGGPCSNG-TCINTPGSYTCSCPPGYTGDK 32
Score = 23.6 bits (51), Expect = 8.9
Identities = 11/31 (35%), Positives = 15/31 (48%), Gaps = 1/31 (3%)
Query: 89 PNPCQNGGICHDLINHFSCSCPSGLKREALC 119
PC NG C + ++CSCP G + C
Sbjct: 5 GGPCSNG-TCINTPGSYTCSCPPGYTGDKRC 34
>gnl|CDD|215604 PLN03151, PLN03151, cation/calcium exchanger; Provisional.
Length = 650
Score = 27.0 bits (60), Expect = 4.7
Identities = 27/113 (23%), Positives = 39/113 (34%), Gaps = 22/113 (19%)
Query: 1 MRN-LIVSSPGLKREALCHNGTCEDFGNSHRCHCMEGYTGSYCSIEINECESDPCQNGAT 59
+RN L+ +S + + +G F R SY S + P
Sbjct: 38 LRNPLLRNSSFVNDGSGSTSGGFNSFTVIRRRLSDIDVNSSYFSDNSSLSGDSP----VL 93
Query: 60 CQDLIG--GYSCQCARGFQGQNCELNVDDCKPNP-CQNGGICHDLINHFSCSC 109
C L GY+ QC + K +P C +GG D + F CSC
Sbjct: 94 CSGLHEHEGYADQC-------------EFLKAHPICSSGGF-FDYLKFFYCSC 132
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.323 0.140 0.503
Gapped
Lambda K H
0.267 0.0730 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,745,489
Number of extensions: 537526
Number of successful extensions: 332
Number of sequences better than 10.0: 1
Number of HSP's gapped: 327
Number of HSP's successfully gapped: 64
Length of query: 145
Length of database: 10,937,602
Length adjustment: 88
Effective length of query: 57
Effective length of database: 7,034,450
Effective search space: 400963650
Effective search space used: 400963650
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 54 (24.6 bits)