RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5752
(222 letters)
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 36.0 bits (84), Expect = 6e-04
Identities = 14/32 (43%), Positives = 16/32 (50%), Gaps = 2/32 (6%)
Query: 64 NCHANAECVYNATSAGYRCQCAQGYVGNGVEC 95
CH NA C S + C C GY G+GV C
Sbjct: 7 GCHPNATCTNTGGS--FTCTCKSGYTGDGVTC 36
Score = 35.2 bits (82), Expect = 0.001
Identities = 15/37 (40%), Positives = 22/37 (59%), Gaps = 1/37 (2%)
Query: 128 CLEDRSLCGKDASCVVASQGHFHCECNEGFTGNGITC 164
C E+ C +A+C + G F C C G+TG+G+TC
Sbjct: 1 CAENNGGCHPNATCT-NTGGSFTCTCKSGYTGDGVTC 36
Score = 31.7 bits (73), Expect = 0.018
Identities = 17/35 (48%), Positives = 18/35 (51%), Gaps = 6/35 (17%)
Query: 16 PCHPNAQCTSPDEFNESREQAKCICNFGYTGDGVT 50
CHPNA CT+ C C GYTGDGVT
Sbjct: 7 GCHPNATCTNTGG------SFTCTCKSGYTGDGVT 35
>gnl|CDD|215652 pfam00008, EGF, EGF-like domain. There is no clear separation
between noise and signal. pfam00053 is very similar,
but has 8 instead of 6 conserved cysteines. Includes
some cytokine receptors. The EGF domain misses the
N-terminus regions of the Ca2+ binding EGF domains
(this is the main reason of discrepancy between
swiss-prot domain start/end and Pfam). The family is
hard to model due to many similar but different
sub-types of EGF domains. Pfam certainly misses a
number of EGF domains.
Length = 32
Score = 27.0 bits (60), Expect = 0.94
Identities = 12/32 (37%), Positives = 15/32 (46%), Gaps = 2/32 (6%)
Query: 59 CNVVKNCHANAECVYNATSAGYRCQCAQGYVG 90
C+ C CV T GY C+C +GY G
Sbjct: 1 CSPNNPCSNGGTCVD--TPGGYTCECPEGYTG 30
>gnl|CDD|234792 PRK00558, uvrC, excinuclease ABC subunit C; Validated.
Length = 598
Score = 30.1 bits (69), Expect = 1.1
Identities = 20/52 (38%), Positives = 26/52 (50%), Gaps = 16/52 (30%)
Query: 170 KESDFLLVNQGMFMLRVPYQPTRTD-----RGRPIINHPNQMLIGLCLSPCV 216
+E+ LL Q +F LR T D R RP + + Q IG CL+PCV
Sbjct: 137 RETLDLL--QKLFPLR-----TCEDSVFKNRKRPCLLY--Q--IGRCLAPCV 177
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional
significance of EGF-like domains in what appear to be
unrelated proteins is not yet clear; a common feature
is that these repeats are found in the extracellular
domain of membrane-bound proteins or in proteins known
to be secreted (exception: prostaglandin G/H synthase);
the domain includes six cysteine residues which have
been shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 26.7 bits (59), Expect = 1.3
Identities = 12/34 (35%), Positives = 13/34 (38%), Gaps = 2/34 (5%)
Query: 59 CNVVKNCHANAECVYNATSAGYRCQCAQGYVGNG 92
C C CV T YRC C GY G+
Sbjct: 2 CAASNPCSNGGTCVN--TPGSYRCVCPPGYTGDR 33
>gnl|CDD|235502 PRK05560, PRK05560, DNA gyrase subunit A; Validated.
Length = 805
Score = 28.8 bits (66), Expect = 2.6
Identities = 17/35 (48%), Positives = 24/35 (68%), Gaps = 7/35 (20%)
Query: 173 DFLLV--NQG-MFMLRVPYQ-P--TRTDRGRPIIN 201
D LL N+G ++ L+V Y+ P +RT RGRPI+N
Sbjct: 549 DTLLFFTNRGRVYRLKV-YEIPEASRTARGRPIVN 582
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular
(mostly animal) proteins. Many of these proteins
require calcium for their biological function and
calcium-binding sites have been found to be located at
the N-terminus of particular EGF-like domains;
calcium-binding may be crucial for numerous
protein-protein interactions. Six conserved core
cysteines form three disulfide bridges as in non
calcium-binding EGF domains, whose structures are very
similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 24.9 bits (55), Expect = 6.1
Identities = 12/32 (37%), Positives = 12/32 (37%), Gaps = 2/32 (6%)
Query: 59 CNVVKNCHANAECVYNATSAGYRCQCAQGYVG 90
C C CV T YRC C GY G
Sbjct: 5 CASGNPCQNGGTCVN--TVGSYRCSCPPGYTG 34
Score = 24.5 bits (54), Expect = 7.1
Identities = 9/30 (30%), Positives = 14/30 (46%), Gaps = 1/30 (3%)
Query: 130 EDRSLCGKDASCVVASQGHFHCECNEGFTG 159
+ C +CV + G + C C G+TG
Sbjct: 6 ASGNPCQNGGTCVN-TVGSYRCSCPPGYTG 34
>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain.
Length = 42
Score = 24.6 bits (54), Expect = 7.2
Identities = 12/32 (37%), Positives = 13/32 (40%), Gaps = 2/32 (6%)
Query: 63 KNCHANAECVYNATSAGYRCQCAQGYVGNGVE 94
NC AN CV T + C C GY N
Sbjct: 10 HNCPANTVCV--NTIGSFECVCPDGYENNEDG 39
>gnl|CDD|216976 pfam02329, HDC, Histidine carboxylase PI chain. Histidine
carboxylase catalyzes the formation of histamine from
histidine. Cleavage of the proenzyme PI chain yields two
subunits, alpha and beta, which arrange as a hexamer
(alpha beta)6.
Length = 306
Score = 26.8 bits (59), Expect = 9.3
Identities = 19/81 (23%), Positives = 30/81 (37%), Gaps = 14/81 (17%)
Query: 100 SCLEDRSLCG----KDSQGYVGNGVECHPLKSCLEDR--------SLCGKDASCVVASQ- 146
+ L+DR+ +D+ + LK LE R +LCG+D
Sbjct: 186 AILKDRTKGSSLFVEDAGVWGDENPTEEELKEFLEGRLRKVTKSIALCGQDQHVSYERIY 245
Query: 147 -GHFHCECNEGFTGNGITCKP 166
G+ + G G +TC P
Sbjct: 246 IGYKYTMMEPGQVGCALTCAP 266
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.138 0.463
Gapped
Lambda K H
0.267 0.0637 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,299,931
Number of extensions: 865798
Number of successful extensions: 606
Number of sequences better than 10.0: 1
Number of HSP's gapped: 597
Number of HSP's successfully gapped: 48
Length of query: 222
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 129
Effective length of database: 6,812,680
Effective search space: 878835720
Effective search space used: 878835720
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 57 (25.9 bits)