RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7015
(284 letters)
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular
(mostly animal) proteins. Many of these proteins
require calcium for their biological function and
calcium-binding sites have been found to be located at
the N-terminus of particular EGF-like domains;
calcium-binding may be crucial for numerous
protein-protein interactions. Six conserved core
cysteines form three disulfide bridges as in non
calcium-binding EGF domains, whose structures are very
similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 43.8 bits (104), Expect = 2e-06
Identities = 22/36 (61%), Positives = 28/36 (77%), Gaps = 1/36 (2%)
Query: 65 DEC-WSNPCHNGGSCIDGIAAYNCSCPPGYTGPSCE 99
DEC NPC NGG+C++ + +Y CSCPPGYTG +CE
Sbjct: 3 DECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
Score = 43.4 bits (103), Expect = 2e-06
Identities = 19/34 (55%), Positives = 25/34 (73%), Gaps = 1/34 (2%)
Query: 101 NVDECGS-NPCQNNGTCHDLLNGFVCSCHPGFTG 133
++DEC S NPCQN GTC + + + CSC PG+TG
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34
Score = 43.4 bits (103), Expect = 2e-06
Identities = 19/34 (55%), Positives = 25/34 (73%), Gaps = 1/34 (2%)
Query: 158 NVDECGS-NPCQNNGTCHDLLNGFVCSCHPGFTG 190
++DEC S NPCQN GTC + + + CSC PG+TG
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34
Score = 40.3 bits (95), Expect = 2e-05
Identities = 20/35 (57%), Positives = 26/35 (74%), Gaps = 1/35 (2%)
Query: 201 NECES-SPCQNGGVCVDLHAAYTCACLFGFTGRNC 234
+EC S +PCQNGG CV+ +Y C+C G+TGRNC
Sbjct: 3 DECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 41.8 bits (99), Expect = 6e-06
Identities = 17/33 (51%), Positives = 23/33 (69%), Gaps = 1/33 (3%)
Query: 101 NVDECGS-NPCQNNGTCHDLLNGFVCSCHPGFT 132
++DEC S NPCQN GTC + + + C C PG+T
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYT 33
Score = 41.8 bits (99), Expect = 6e-06
Identities = 17/33 (51%), Positives = 23/33 (69%), Gaps = 1/33 (3%)
Query: 158 NVDECGS-NPCQNNGTCHDLLNGFVCSCHPGFT 189
++DEC S NPCQN GTC + + + C C PG+T
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYT 33
Score = 38.8 bits (91), Expect = 9e-05
Identities = 22/37 (59%), Positives = 28/37 (75%), Gaps = 2/37 (5%)
Query: 65 DECWS-NPCHNGGSCIDGIAAYNCSCPPGYT-GPSCE 99
DEC S NPC NGG+C++ + +Y C CPPGYT G +CE
Sbjct: 3 DECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39
Score = 35.7 bits (83), Expect = 0.001
Identities = 20/37 (54%), Positives = 26/37 (70%), Gaps = 2/37 (5%)
Query: 201 NECES-SPCQNGGVCVDLHAAYTCACLFGFT-GRNCD 235
+EC S +PCQNGG CV+ +Y C C G+T GRNC+
Sbjct: 3 DECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional significance
of EGF-like domains in what appear to be unrelated
proteins is not yet clear; a common feature is that
these repeats are found in the extracellular domain of
membrane-bound proteins or in proteins known to be
secreted (exception: prostaglandin G/H synthase); the
domain includes six cysteine residues which have been
shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 36.7 bits (85), Expect = 4e-04
Identities = 14/28 (50%), Positives = 18/28 (64%)
Query: 107 SNPCQNNGTCHDLLNGFVCSCHPGFTGN 134
SNPC N GTC + + C C PG+TG+
Sbjct: 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32
Score = 36.3 bits (84), Expect = 6e-04
Identities = 14/27 (51%), Positives = 17/27 (62%)
Query: 164 SNPCQNNGTCHDLLNGFVCSCHPGFTG 190
SNPC N GTC + + C C PG+TG
Sbjct: 5 SNPCSNGGTCVNTPGSYRCVCPPGYTG 31
Score = 35.5 bits (82), Expect = 0.001
Identities = 17/28 (60%), Positives = 21/28 (75%)
Query: 69 SNPCHNGGSCIDGIAAYNCSCPPGYTGP 96
SNPC NGG+C++ +Y C CPPGYTG
Sbjct: 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32
Score = 31.3 bits (71), Expect = 0.039
Identities = 14/30 (46%), Positives = 18/30 (60%)
Query: 204 ESSPCQNGGVCVDLHAAYTCACLFGFTGRN 233
S+PC NGG CV+ +Y C C G+TG
Sbjct: 4 ASNPCSNGGTCVNTPGSYRCVCPPGYTGDR 33
>gnl|CDD|215652 pfam00008, EGF, EGF-like domain. There is no clear separation
between noise and signal. pfam00053 is very similar,
but has 8 instead of 6 conserved cysteines. Includes
some cytokine receptors. The EGF domain misses the
N-terminus regions of the Ca2+ binding EGF domains
(this is the main reason of discrepancy between
swiss-prot domain start/end and Pfam). The family is
hard to model due to many similar but different
sub-types of EGF domains. Pfam certainly misses a
number of EGF domains.
Length = 32
Score = 34.3 bits (79), Expect = 0.003
Identities = 16/27 (59%), Positives = 18/27 (66%)
Query: 70 NPCHNGGSCIDGIAAYNCSCPPGYTGP 96
NPC NGG+C+D Y C CP GYTG
Sbjct: 5 NPCSNGGTCVDTPGGYTCECPEGYTGK 31
Score = 32.8 bits (75), Expect = 0.010
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 108 NPCQNNGTCHDLLNGFVCSCHPGFTG 133
NPC N GTC D G+ C C G+TG
Sbjct: 5 NPCSNGGTCVDTPGGYTCECPEGYTG 30
Score = 32.8 bits (75), Expect = 0.010
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 165 NPCQNNGTCHDLLNGFVCSCHPGFTG 190
NPC N GTC D G+ C C G+TG
Sbjct: 5 NPCSNGGTCVDTPGGYTCECPEGYTG 30
Score = 29.3 bits (66), Expect = 0.16
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 207 PCQNGGVCVDLHAAYTCACLFGFTGRN 233
PC NGG CVD YTC C G+TG+
Sbjct: 6 PCSNGGTCVDTPGGYTCECPEGYTGKR 32
>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain.
Length = 35
Score = 29.4 bits (66), Expect = 0.15
Identities = 13/28 (46%), Positives = 17/28 (60%), Gaps = 1/28 (3%)
Query: 107 SNPCQNNGTCHDLLNGFVCSCHPGFTGN 134
PC N GTC + + CSC PG+TG+
Sbjct: 5 GGPCSN-GTCINTPGSYTCSCPPGYTGD 31
Score = 29.0 bits (65), Expect = 0.22
Identities = 13/27 (48%), Positives = 16/27 (59%), Gaps = 1/27 (3%)
Query: 164 SNPCQNNGTCHDLLNGFVCSCHPGFTG 190
PC N GTC + + CSC PG+TG
Sbjct: 5 GGPCSN-GTCINTPGSYTCSCPPGYTG 30
Score = 26.7 bits (59), Expect = 1.4
Identities = 19/31 (61%), Positives = 22/31 (70%), Gaps = 2/31 (6%)
Query: 66 ECWS-NPCHNGGSCIDGIAAYNCSCPPGYTG 95
EC S PC NG +CI+ +Y CSCPPGYTG
Sbjct: 1 ECASGGPCSNG-TCINTPGSYTCSCPPGYTG 30
>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain.
Length = 42
Score = 29.6 bits (67), Expect = 0.19
Identities = 13/39 (33%), Positives = 18/39 (46%), Gaps = 2/39 (5%)
Query: 102 VDEC--GSNPCQNNGTCHDLLNGFVCSCHPGFTGNCIDG 138
VDEC G++ C N C + + F C C G+ N
Sbjct: 2 VDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGT 40
Score = 27.7 bits (62), Expect = 0.87
Identities = 11/32 (34%), Positives = 17/32 (53%), Gaps = 2/32 (6%)
Query: 65 DEC--WSNPCHNGGSCIDGIAAYNCSCPPGYT 94
DEC ++ C C++ I ++ C CP GY
Sbjct: 3 DECADGTHNCPANTVCVNTIGSFECVCPDGYE 34
Score = 27.3 bits (61), Expect = 1.3
Identities = 12/33 (36%), Positives = 17/33 (51%), Gaps = 2/33 (6%)
Query: 159 VDEC--GSNPCQNNGTCHDLLNGFVCSCHPGFT 189
VDEC G++ C N C + + F C C G+
Sbjct: 2 VDECADGTHNCPANTVCVNTIGSFECVCPDGYE 34
>gnl|CDD|236049 PRK07562, PRK07562, ribonucleotide-diphosphate reductase subunit
alpha; Validated.
Length = 1220
Score = 31.1 bits (71), Expect = 0.76
Identities = 12/25 (48%), Positives = 14/25 (56%), Gaps = 4/25 (16%)
Query: 92 GYTGPSCESNVDECGSNPCQNNGTC 116
GYTG +C ECG+ NGTC
Sbjct: 1187 GYTGEACS----ECGNFTLVRNGTC 1207
Score = 31.1 bits (71), Expect = 0.76
Identities = 12/25 (48%), Positives = 14/25 (56%), Gaps = 4/25 (16%)
Query: 149 GYTGPSCESNVDECGSNPCQNNGTC 173
GYTG +C ECG+ NGTC
Sbjct: 1187 GYTGEACS----ECGNFTLVRNGTC 1207
>gnl|CDD|226947 COG4581, COG4581, Superfamily II RNA helicase [DNA replication,
recombination, and repair].
Length = 1041
Score = 30.8 bits (70), Expect = 0.86
Identities = 10/33 (30%), Positives = 15/33 (45%), Gaps = 5/33 (15%)
Query: 227 FGFTGRNCDIELKICENSPCLNEALCLEEEEEQ 259
F F+ R C+ +I L L EE+E+
Sbjct: 385 FSFSRRGCEEAAQILSTLD-----LVLTEEKER 412
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 26.7 bits (60), Expect = 1.3
Identities = 10/29 (34%), Positives = 15/29 (51%)
Query: 106 GSNPCQNNGTCHDLLNGFVCSCHPGFTGN 134
+ C N TC + F C+C G+TG+
Sbjct: 4 NNGGCHPNATCTNTGGSFTCTCKSGYTGD 32
Score = 26.0 bits (58), Expect = 3.3
Identities = 10/28 (35%), Positives = 14/28 (50%)
Query: 163 GSNPCQNNGTCHDLLNGFVCSCHPGFTG 190
+ C N TC + F C+C G+TG
Sbjct: 4 NNGGCHPNATCTNTGGSFTCTCKSGYTG 31
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.140 0.511
Gapped
Lambda K H
0.267 0.0612 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,752,585
Number of extensions: 1193658
Number of successful extensions: 959
Number of sequences better than 10.0: 1
Number of HSP's gapped: 936
Number of HSP's successfully gapped: 122
Length of query: 284
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 188
Effective length of database: 6,679,618
Effective search space: 1255768184
Effective search space used: 1255768184
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 58 (26.4 bits)