RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9424
(535 letters)
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular (mostly
animal) proteins. Many of these proteins require calcium
for their biological function and calcium-binding sites
have been found to be located at the N-terminus of
particular EGF-like domains; calcium-binding may be
crucial for numerous protein-protein interactions. Six
conserved core cysteines form three disulfide bridges as
in non calcium-binding EGF domains, whose structures are
very similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 32.2 bits (74), Expect = 0.035
Identities = 17/34 (50%), Positives = 20/34 (58%), Gaps = 1/34 (2%)
Query: 280 IDFCAAK-PCGPGARCDNSRGSYKCLCPLGLVGD 312
ID CA+ PC G C N+ GSY+C CP G G
Sbjct: 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 30.7 bits (70), Expect = 0.12
Identities = 16/30 (53%), Positives = 19/30 (63%), Gaps = 1/30 (3%)
Query: 280 IDFCAAK-PCGPGARCDNSRGSYKCLCPLG 308
ID CA+ PC G C N+ GSY+C CP G
Sbjct: 2 IDECASGNPCQNGGTCVNTVGSYRCECPPG 31
>gnl|CDD|214709 smart00532, LIGANc, Ligase N family.
Length = 441
Score = 32.2 bits (74), Expect = 0.76
Identities = 14/52 (26%), Positives = 19/52 (36%), Gaps = 15/52 (28%)
Query: 213 PVGCHRVECNSHADCSGDKVCEDHRCKISCLANNPCGPNALCSAEKHKQICY 264
P C C S +V D RC PN LC A+ ++I +
Sbjct: 399 PTHC--PSCGSELVREEGEV--DIRC-----------PNPLCPAQLIERIIH 435
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) and present in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional significance
of EGF-like domains in what appear to be unrelated
proteins is not yet clear; a common feature is that
these repeats are found in the extracellular domain of
membrane-bound proteins or in proteins known to be
secreted (exception: prostaglandin G/H synthase); the
domain includes six cysteine residues which have been
shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 28.2 bits (63), Expect = 0.80
Identities = 14/28 (50%), Positives = 17/28 (60%)
Query: 287 PCGPGARCDNSRGSYKCLCPLGLVGDPY 314
PC G C N+ GSY+C+CP G GD
Sbjct: 7 PCSNGGTCVNTPGSYRCVCPPGYTGDRS 34
Score = 26.7 bits (59), Expect = 3.6
Identities = 14/31 (45%), Positives = 15/31 (48%)
Query: 242 CLANNPCGPNALCSAEKHKQICYCQPGYTGD 272
C A+NPC C C C PGYTGD
Sbjct: 2 CAASNPCSNGGTCVNTPGSYRCVCPPGYTGD 32
>gnl|CDD|236571 PRK09565, PRK09565, hypothetical protein; Reviewed.
Length = 533
Score = 30.9 bits (70), Expect = 2.1
Identities = 11/49 (22%), Positives = 13/49 (26%)
Query: 164 ATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSHSGG 212
+ G G H GH GG+ G S H G
Sbjct: 266 PEDAADGGTGGTHDAEEFGEHGHHGGHPGGEDGEHPHGHEDSGGHHGSG 314
>gnl|CDD|106181 PRK13213, araD, L-ribulose-5-phosphate 4-epimerase; Reviewed.
Length = 231
Score = 30.1 bits (67), Expect = 2.4
Identities = 16/47 (34%), Positives = 26/47 (55%), Gaps = 4/47 (8%)
Query: 186 SHSGHAG--GKHGPGLSPGATSHSSHSGGPVGCHRVECNSHADCSGD 230
+HS HA + G LS T+H+ + GP+ C R+ + A+ +GD
Sbjct: 96 THSRHATIWAQAGKSLSALGTTHADYFYGPIPCTRLM--TEAEITGD 140
>gnl|CDD|215822 pfam00257, Dehydrin, Dehydrin.
Length = 137
Score = 29.0 bits (65), Expect = 3.1
Identities = 14/43 (32%), Positives = 15/43 (34%)
Query: 170 PGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSHSGG 212
G G GP G H H G+HG S SS S
Sbjct: 25 KGEGGGTGPGGHGGGGEHGTHGHGEHGKLGGLLRRSGSSSSSS 67
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 25.2 bits (56), Expect = 10.0
Identities = 12/24 (50%), Positives = 14/24 (58%)
Query: 151 CGRNALCTASDHHATCSCKPGYVG 174
C NA CT + TC+CK GY G
Sbjct: 8 CHPNATCTNTGGSFTCTCKSGYTG 31
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.135 0.476
Gapped
Lambda K H
0.267 0.0580 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,079,522
Number of extensions: 2076356
Number of successful extensions: 1671
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1617
Number of HSP's successfully gapped: 123
Length of query: 535
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 433
Effective length of database: 6,413,494
Effective search space: 2777042902
Effective search space used: 2777042902
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (27.6 bits)
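
Note on the length statistics above: the reported effective lengths and search space can be reproduced from the raw counts. The sketch below assumes the usual BLAST convention that the length adjustment is subtracted once from the query and once per database sequence. The function name and parameter names are hypothetical and are not part of RPS-BLAST.

```python
def effective_lengths(query_len, db_letters, db_seqs, length_adjustment):
    """Recompute BLAST-style effective lengths (illustrative helper, assumed convention)."""
    # The query loses one length adjustment.
    eff_query = query_len - length_adjustment
    # The database loses one length adjustment per sequence.
    eff_db = db_letters - db_seqs * length_adjustment
    # The effective search space is the product of the two effective lengths.
    return eff_query, eff_db, eff_query * eff_db


# Values taken from the report: query length, database letters,
# database sequences, and length adjustment.
eff_query, eff_db, search_space = effective_lengths(535, 10_937_602, 44_354, 102)
print(eff_query, eff_db, search_space)
```

With the values from this report, the result agrees with the "Effective length of query", "Effective length of database", and "Effective search space" lines.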