RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5614
(98 letters)
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 29.4 bits (66), Expect = 0.27
Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 9/35 (25%)
Query: 17 CSCLPNYIGAPPNCRPE---CVMNSECASNLACIN 48
C+CL CR E C+ N +CA+N+AC+
Sbjct: 114 CACLLK------ECRLELAKCIANPKCAANVACLQ 142
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 24.8 bits (55), Expect = 2.0
Identities = 12/28 (42%), Positives = 13/28 (46%)
Query: 56 PGSCGYNAQCKVINHTPICTCPEGYIGD 83
G C NA C + CTC GY GD
Sbjct: 5 NGGCHPNATCTNTGGSFTCTCKSGYTGD 32
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional
significance of EGF-like domains in what appear to be
unrelated proteins is not yet clear; a common feature
is that these repeats are found in the extracellular
domain of membrane-bound proteins or in proteins known
to be secreted (exception: prostaglandin G/H synthase);
the domain includes six cysteine residues which have
been shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 24.7 bits (54), Expect = 2.6
Identities = 9/28 (32%), Positives = 10/28 (35%)
Query: 56 PGSCGYNAQCKVINHTPICTCPEGYIGD 83
C C + C CP GY GD
Sbjct: 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32
>gnl|CDD|215663 pfam00026, Asp, Eukaryotic aspartyl protease. Aspartyl (acid)
proteases include pepsins, cathepsins, and renins.
Two-domain structure, probably arising from ancestral
duplication. This family does not include the
retroviral nor retrotransposon proteases (pfam00077),
which are much smaller and appear to be homologous to a
single domain of the eukaryotic asp proteases.
Length = 316
Score = 26.1 bits (58), Expect = 2.8
Identities = 7/20 (35%), Positives = 10/20 (50%)
Query: 35 VMNSECASNLACINEKCRDP 54
V + C S+ AC + DP
Sbjct: 28 VPSVYCTSSYACKSHGTFDP 47
>gnl|CDD|129465 TIGR00368, TIGR00368, Mg chelatase-related protein. The N-terminal
end matches very strongly a pfam Mg_chelatase domain
[Unknown function, General].
Length = 499
Score = 26.0 bits (57), Expect = 4.0
Identities = 6/20 (30%), Positives = 9/20 (45%)
Query: 76 CPEGYIGDPFSSCFPKPPER 95
CP G+ G + C P +
Sbjct: 348 CPCGHYGGKNTHCRCSPQQI 367
>gnl|CDD|215652 pfam00008, EGF, EGF-like domain. There is no clear separation
between noise and signal. pfam00053 is very similar,
but has 8 instead of 6 conserved cysteines. Includes
some cytokine receptors. The EGF domain misses the
N-terminus regions of the Ca2+ binding EGF domains
(this is the main reason of discrepancy between
swiss-prot domain start/end and Pfam). The family is
hard to model due to many similar but different
sub-types of EGF domains. Pfam certainly misses a
number of EGF domains.
Length = 32
Score = 23.9 bits (52), Expect = 4.3
Identities = 9/25 (36%), Positives = 9/25 (36%)
Query: 59 CGYNAQCKVINHTPICTCPEGYIGD 83
C C C CPEGY G
Sbjct: 7 CSNGGTCVDTPGGYTCECPEGYTGK 31
>gnl|CDD|165621 PLN00043, PLN00043, elongation factor 1-alpha; Provisional.
Length = 447
Score = 25.8 bits (56), Expect = 4.3
Identities = 11/51 (21%), Positives = 19/51 (37%), Gaps = 8/51 (15%)
Query: 53 DPCPGSCGYNAQCKVINH--------TPICTCPEGYIGDPFSSCFPKPPER 95
DP + + +Q ++NH P+ C +I F+ K R
Sbjct: 320 DPAKEAANFTSQVIIMNHPGQIGNGYAPVLDCHTSHIAVKFAEILTKIDRR 370
>gnl|CDD|188457 TIGR03942, sulfatase_rSAM, anaerobic sulfatase-maturating enzyme.
Members of this protein family are radical SAM family
enzymes, maturases that prepare the oxygen-sensitive
radical required in the active site of anaerobic
sulfatases. This maturase role has led to many
misleading legacy annotations suggesting that this
enzyme maturase is instead a sulfatase regulatory
protein. All members of the seed alignment are radical
SAM enzymes encoded next to or near an anaerobic
sulfatase. Note that a single genome may encode more
than one sulfatase/maturase pair [Protein fate, Protein
modification and repair].
Length = 363
Score = 24.9 bits (55), Expect = 9.8
Identities = 8/28 (28%), Positives = 12/28 (42%), Gaps = 3/28 (10%)
Query: 18 SCLPNYIGAPPNCRPECVMNSECASNLA 45
+ L ++G P CV + C NL
Sbjct: 235 NALAIWLGNPS---QSCVHSPTCGQNLV 259
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.137 0.515
Gapped
Lambda K H
0.267 0.0705 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 4,562,252
Number of extensions: 314419
Number of successful extensions: 274
Number of sequences better than 10.0: 1
Number of HSP's gapped: 269
Number of HSP's successfully gapped: 48
Length of query: 98
Length of database: 10,937,602
Length adjustment: 64
Effective length of query: 34
Effective length of database: 8,098,946
Effective search space: 275364164
Effective search space used: 275364164
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 53 (24.2 bits)