RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy590
(257 letters)
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 31.0 bits (71), Expect = 0.039
Identities = 11/29 (37%), Positives = 15/29 (51%)
Query: 110 PCAPNAVCINKNHATDCSCLEGFQGNGFV 138
C PNA C N + C+C G+ G+G
Sbjct: 7 GCHPNATCTNTGGSFTCTCKSGYTGDGVT 35
Score = 26.0 bits (58), Expect = 2.3
Identities = 8/25 (32%), Positives = 12/25 (48%)
Query: 178 CGEHAICIPKNHGIDCQCQSGHQGN 202
C +A C C C+SG+ G+
Sbjct: 8 CHPNATCTNTGGSFTCTCKSGYTGD 32
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional significance
of EGF-like domains in what appear to be unrelated
proteins is not yet clear; a common feature is that
these repeats are found in the extracellular domain of
membrane-bound proteins or in proteins known to be
secreted (exception: prostaglandin G/H synthase); the
domain includes six cysteine residues which have been
shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 29.8 bits (67), Expect = 0.14
Identities = 10/33 (30%), Positives = 16/33 (48%)
Query: 104 PCAVHSPCAPNAVCINKNHATDCSCLEGFQGNG 136
CA +PC+ C+N + C C G+ G+
Sbjct: 1 ECAASNPCSNGGTCVNTPGSYRCVCPPGYTGDR 33
Score = 26.3 bits (58), Expect = 2.2
Identities = 10/31 (32%), Positives = 14/31 (45%)
Query: 41 CAIASLCAPSAKCHVVKHRPVCTCPQGYEGN 71
CA ++ C+ C C CP GY G+
Sbjct: 2 CAASNPCSNGGTCVNTPGSYRCVCPPGYTGD 32
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular (mostly
animal) proteins. Many of these proteins require calcium
for their biological function and calcium-binding sites
have been found to be located at the N-terminus of
particular EGF-like domains; calcium-binding may be
crucial for numerous protein-protein interactions. Six
conserved core cysteines form three disulfide bridges as
in non calcium-binding EGF domains, whose structures are
very similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 27.6 bits (62), Expect = 0.69
Identities = 11/30 (36%), Positives = 15/30 (50%)
Query: 105 CAVHSPCAPNAVCINKNHATDCSCLEGFQG 134
CA +PC C+N + CSC G+ G
Sbjct: 5 CASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34
Score = 25.7 bits (57), Expect = 3.3
Identities = 11/32 (34%), Positives = 13/32 (40%)
Query: 39 DLCAIASLCAPSAKCHVVKHRPVCTCPQGYEG 70
D CA + C C C+CP GY G
Sbjct: 3 DECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34
>gnl|CDD|201524 pfam00954, S_locus_glycop, S-locus glycoprotein family. In
Brassicaceae, self-incompatible plants have a
self/non-self recognition system. This is
sporophytically controlled by multiple alleles at a
single locus (S). S-locus glycoproteins, as well as
S-receptor kinases, are in linkage with the S-alleles.
Length = 110
Score = 28.4 bits (64), Expect = 1.4
Identities = 10/32 (31%), Positives = 14/32 (43%), Gaps = 1/32 (3%)
Query: 38 EDLCAIASLCAPSAKCHVVKHRPVCTCPQGYE 69
+D C + C P C V P C C +G+
Sbjct: 77 KDQCDVYGRCGPYGYC-DVNTSPKCNCIKGFV 107
>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
Length = 1052
Score = 30.0 bits (67), Expect = 1.5
Identities = 12/34 (35%), Positives = 13/34 (38%), Gaps = 4/34 (11%)
Query: 207 CLSELICLCHLY----TEEPWHRLPMPERASGQP 236
CL L H + TEE WH L AS
Sbjct: 690 CLQTSYRLLHPFMPFITEELWHVLRAQVGASAWA 723
>gnl|CDD|177468 PHA02686, PHA02686, hypothetical protein; Provisional.
Length = 138
Score = 28.4 bits (63), Expect = 2.1
Identities = 11/24 (45%), Positives = 14/24 (58%)
Query: 212 ICLCHLYTEEPWHRLPMPERASGQ 235
+CLC L E P R P + ASG+
Sbjct: 22 VCLCQLRGEAPERRFPALDEASGE 45
>gnl|CDD|200537 cd11276, Sema_plexin_B2, The Sema domain, a protein interacting
module, of Plexin B2. Plexin B2 serves as the receptor
of Sema4C and Sema4G. By signaling the effect of Sema4C
and Sema4G, the plexin B2 receptor plays important roles
in neural tube closure and cerebellar granule cell
development. Mice lacking Plexin B2 demonstrated defects
in closure of the neural tube and disorganization of the
embryonic brain. In developing kidney, Sema4C-Plexin B2
signaling modulates ureteric branching. Plexin B2 is
expressed both in the pretubular aggregates and the
ureteric epithelium in the developing kidney. Deletion
of Plexin B2 results in renal hypoplasia and occasional
double ureters. The Sema domain is located at the
N-terminus and contains four disulfide bonds formed by
eight conserved cysteine residues. It serves as a
ligand-recognition and -binding module.
Length = 449
Score = 29.0 bits (65), Expect = 3.0
Identities = 17/66 (25%), Positives = 24/66 (36%), Gaps = 7/66 (10%)
Query: 30 EACYSGQCEDLCAIASLCAPSAKCHVVKHRPVCTCPQGYEGNPATKCYLPNPLGCVSNEE 89
EACY+G +D H+ + + P +LP PLG S +E
Sbjct: 309 EACYTGTIDDRDVFYKPFHSQKDIICGSHQ-----QKNSKSFPCGSEHLPYPLG--SRDE 361
Query: 90 CASTEA 95
A T
Sbjct: 362 LALTAP 367
>gnl|CDD|181145 PRK07850, PRK07850, acetyl-CoA acetyltransferase; Provisional.
Length = 387
Score = 28.1 bits (63), Expect = 5.0
Identities = 10/18 (55%), Positives = 11/18 (61%)
Query: 191 IDCQCQSGHQGNPYIACL 208
IDCQC S Q N +A L
Sbjct: 85 IDCQCGSAQQANHLVAGL 102
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 27.9 bits (62), Expect = 6.4
Identities = 11/43 (25%), Positives = 18/43 (41%), Gaps = 7/43 (16%)
Query: 91 ASTEACINSQCQKP---CAVHSPCAPNAVCI----NKNHATDC 126
T AC+ +C+ C + CA N C+ + T+C
Sbjct: 111 LKTCACLLKECRLELAKCIANPKCAANVACLQTCNGRPDETEC 153
>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain.
Length = 35
Score = 24.8 bits (54), Expect = 7.7
Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 1/32 (3%)
Query: 104 PCAVHSPCAPNAVCINKNHATDCSCLEGFQGN 135
CA PC N CIN + CSC G+ G+
Sbjct: 1 ECASGGPC-SNGTCINTPGSYTCSCPPGYTGD 31
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 24.9 bits (55), Expect = 7.8
Identities = 9/29 (31%), Positives = 13/29 (44%)
Query: 105 CAVHSPCAPNAVCINKNHATDCSCLEGFQ 133
CA +PC C+N + C C G+
Sbjct: 5 CASGNPCQNGGTCVNTVGSYRCECPPGYT 33
>gnl|CDD|215505 PLN02935, PLN02935, Bifunctional NADH kinase/NAD(+) kinase.
Length = 508
Score = 27.5 bits (61), Expect = 9.2
Identities = 11/62 (17%), Positives = 23/62 (37%), Gaps = 4/62 (6%)
Query: 141 LPVRNYQPICQYNEDCPPDKLCDRLNRICINPCQEDSCGEHAIC---IPKNHGIDCQCQS 197
L + + NE+ +L + N+ + + CG IC + ++ D +S
Sbjct: 81 LEHKELSS-GECNEESNDQRLENLANQPMLYNEAINCCGMEGICSHEVLQDGSTDSDNRS 139
Query: 198 GH 199
Sbjct: 140 VL 141
>gnl|CDD|239590 cd03513, CrtW_beta-carotene-ketolase, Beta-carotene
ketolase/oxygenase (CrtW, also known as CrtO), the
carotenoid astaxanthin biosynthetic enzyme, initially
catalyzes the addition of two keto groups to carbons C4
and C4' of beta-carotene. Carotenoids are important
natural pigments produced by many microorganisms and
plants. Astaxanthin is reported to be an antioxidant, an
anti-cancer agent, and an immune system stimulant. A
number of bacteria and green algae can convert
beta-carotene into astaxanthin by using several
ketocarotenoids as intermediates and CrtW and a
beta-carotene hydroxylase (CrtZ). CrtW initially
converts beta-carotene to canthaxanthin via echinenone,
and CrtZ initially mediates the conversion of
beta-carotene to zeaxanthin via beta-cryptoxanthin.
After a few more intermediates are formed, CrtW and CrtZ
act in combination to produce astaxanthin. Sequences of
this domain family appear to be structurally related to
membrane fatty acid desaturases and alkane hydroxylases.
They all share in common extensive hydrophobic regions
that are capable of spanning the membrane bilayer at
least twice. Comparison of these sequences also reveals
three regions of conserved histidine cluster motifs that
contain eight histidine residues: HXXXH, HXXHH, and
HXXHH. These histidine residues are reported to be
catalytically essential and proposed to be the ligands
for the iron atoms contained within homologs, stearoyl
CoA desaturase and alkane hydroxylase.
Length = 225
Score = 26.9 bits (60), Expect = 9.9
Identities = 7/12 (58%), Positives = 7/12 (58%)
Query: 216 HLYTEEPWHRLP 227
HL PW RLP
Sbjct: 211 HLSPSTPWWRLP 222
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.323 0.137 0.480
Gapped
Lambda K H
0.267 0.0632 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,797,190
Number of extensions: 991470
Number of successful extensions: 692
Number of sequences better than 10.0: 1
Number of HSP's gapped: 676
Number of HSP's successfully gapped: 58
Length of query: 257
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 162
Effective length of database: 6,723,972
Effective search space: 1089283464
Effective search space used: 1089283464
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 58 (26.3 bits)