RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7854
(153 letters)
>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain.
Length = 35
Score = 40.6 bits (95), Expect = 7e-06
Identities = 15/31 (48%), Positives = 18/31 (58%)
Query: 81 ENPGICTNGACENLMGTYRCICNPGYEGDPT 111
+ G C+NG C N G+Y C C PGY GD
Sbjct: 3 ASGGPCSNGTCINTPGSYTCSCPPGYTGDKR 33
Score = 24.4 bits (53), Expect = 5.6
Identities = 10/15 (66%), Positives = 11/15 (73%)
Query: 127 ICSGGTCKNTPGSFQ 141
CS GTC NTPGS+
Sbjct: 7 PCSNGTCINTPGSYT 21
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular (mostly
animal) proteins. Many of these proteins require calcium
for their biological function and calcium-binding sites
have been found to be located at the N-terminus of
particular EGF-like domains; calcium-binding may be
crucial for numerous protein-protein interactions. Six
conserved core cysteines form three disulfide bridges as
in non calcium-binding EGF domains, whose structures are
very similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 38.0 bits (89), Expect = 5e-05
Identities = 14/30 (46%), Positives = 17/30 (56%), Gaps = 1/30 (3%)
Query: 81 ENPGICTNGA-CENLMGTYRCICNPGYEGD 109
+ C NG C N +G+YRC C PGY G
Sbjct: 6 ASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35
Score = 29.5 bits (67), Expect = 0.089
Identities = 11/25 (44%), Positives = 15/25 (60%)
Query: 117 DQNECLMDVMICSGGTCKNTPGSFQ 141
D +EC +GGTC NT GS++
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYR 25
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 38.0 bits (89), Expect = 6e-05
Identities = 15/36 (41%), Positives = 19/36 (52%), Gaps = 4/36 (11%)
Query: 81 ENPGICTNGA-CENLMGTYRCICNPGYEGDPTGKLC 115
+ C NG C N +G+YRC C PGY G+ C
Sbjct: 6 ASGNPCQNGGTCVNTVGSYRCECPPGYT---DGRNC 38
Score = 29.1 bits (66), Expect = 0.11
Identities = 12/26 (46%), Positives = 15/26 (57%), Gaps = 2/26 (7%)
Query: 117 DQNECLMDVMICS-GGTCKNTPGSFQ 141
D +EC C GGTC NT GS++
Sbjct: 1 DIDECASGN-PCQNGGTCVNTVGSYR 25
>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like. cEGF, or complement
Clr-like EGF, domains have six conserved cysteine
residues disulfide-bonded into the characteristic
pattern 'ababcc'. They are found in blood coagulation
proteins such as fibrillin, Clr and Cls, thrombomodulin,
and the LDL receptor. The core fold of the EGF domain
consists of two small beta-hairpins packed against each
other. Two major structural variants have been
identified based on the structural context of the
C-terminal cysteine residue of disulfide 'c' in the
C-terminal hairpin: hEGFs and cEGFs. In cEGFs the
C-terminal thiol resides on the C-terminal beta-sheet,
resulting in long loop-lengths between the cysteine
residues of disulfide 'c', typically C[10+]XC. These
longer loop-lengths may have arisen by selective
cysteine loss from a four-disulfide EGF template such as
laminin or integrin. Tandem cEGF domains have five
linking residues between terminal cysteines of adjacent
domains. cEGF domains may or may not bind calcium in the
linker region. cEGF domains with the consensus motif
CXN4X[F,Y]XCXC are hydroxylated exclusively on the
asparagine residue.
Length = 24
Score = 36.3 bits (85), Expect = 2e-04
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 98 YRCICNPGYEGDPTGKLCSDQNE 120
Y C C PGY+ G+ C D +E
Sbjct: 2 YTCSCPPGYQLSGDGRTCEDIDE 24
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional significance
of EGF-like domains in what appear to be unrelated
proteins is not yet clear; a common feature is that
these repeats are found in the extracellular domain of
membrane-bound proteins or in proteins known to be
secreted (exception: prostaglandin G/H synthase); the
domain includes six cysteine residues which have been
shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 36.3 bits (84), Expect = 2e-04
Identities = 15/32 (46%), Positives = 19/32 (59%), Gaps = 1/32 (3%)
Query: 81 ENPGICTNGA-CENLMGTYRCICNPGYEGDPT 111
C+NG C N G+YRC+C PGY GD +
Sbjct: 3 AASNPCSNGGTCVNTPGSYRCVCPPGYTGDRS 34
Score = 24.7 bits (54), Expect = 3.8
Identities = 12/27 (44%), Positives = 16/27 (59%), Gaps = 6/27 (22%)
Query: 115 CSDQNECLMDVMICSGGTCKNTPGSFQ 141
C+ N C +GGTC NTPGS++
Sbjct: 2 CAASNPCS------NGGTCVNTPGSYR 22
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 34.8 bits (81), Expect = 8e-04
Identities = 16/37 (43%), Positives = 18/37 (48%), Gaps = 3/37 (8%)
Query: 80 TENPGIC-TNGACENLMGTYRCICNPGYEGDPTGKLC 115
EN G C N C N G++ C C GY GD G C
Sbjct: 2 AENNGGCHPNATCTNTGGSFTCTCKSGYTGD--GVTC 36
>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain.
Length = 42
Score = 33.9 bits (78), Expect = 0.003
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 88 NGACENLMGTYRCICNPGYEGDPTGKLC 115
N C N +G++ C+C GYE + G C
Sbjct: 15 NTVCVNTIGSFECVCPDGYENNEDGTNC 42
>gnl|CDD|215652 pfam00008, EGF, EGF-like domain. There is no clear separation
between noise and signal. pfam00053 is very similar, but
has 8 instead of 6 conserved cysteines. Includes some
cytokine receptors. The EGF domain misses the N-terminus
regions of the Ca2+ binding EGF domains (this is the
main reason of discrepancy between swiss-prot domain
start/end and Pfam). The family is hard to model due to
many similar but different sub-types of EGF domains.
Pfam certainly misses a number of EGF domains.
Length = 32
Score = 28.6 bits (64), Expect = 0.15
Identities = 11/25 (44%), Positives = 13/25 (52%), Gaps = 1/25 (4%)
Query: 86 CTNGA-CENLMGTYRCICNPGYEGD 109
C+NG C + G Y C C GY G
Sbjct: 7 CSNGGTCVDTPGGYTCECPEGYTGK 31
>gnl|CDD|224454 COG1537, PelA, Predicted RNA-binding proteins [General function
prediction only].
Length = 352
Score = 31.1 bits (71), Expect = 0.18
Identities = 19/73 (26%), Positives = 26/73 (35%), Gaps = 25/73 (34%)
Query: 12 EGRLRIKGV----------SKHSRLAVALRRSISI--DEFTVHSSTLIALSQESQDLERL 59
RLRIKG + + V + I I +E+ LERL
Sbjct: 75 ANRLRIKGPIVEGPEEVVKGSYHTINVTIGTEIEIEKEEW------------NKDQLERL 122
Query: 60 KFGLEPPTSRSEV 72
K +E + R EV
Sbjct: 123 KEAVE-ASKRPEV 134
>gnl|CDD|213325 cd12117, A_NRPS_Srf_like, The adenylation domain of nonribosomal
peptide synthetases (NRPS), including Bacillus subtilis
termination module Surfactin (SrfA-C). The adenylation
(A) domain of NRPS recognizes a specific amino acid or
hydroxy acid and activates it as an (amino) acyl
adenylate by hydrolysis of ATP. The activated acyl
moiety then forms a thioester to the enzyme-bound
cofactor phosphopantetheine of a peptidyl carrier
protein domain. NRPSs are large multifunctional enzymes
which synthesize many therapeutically useful peptides
in bacteria and fungi via a template-directed, nucleic
acid independent nonribosomal mechanism. These natural
products include antibiotics, immunosuppressants, plant
and animal toxins, and enzyme inhibitors. NRPS has a
distinct modular structure in which each module is
responsible for the recognition, activation, and, in
some cases, modification of a single amino acid residue
of the final peptide product. The modules can be
subdivided into domains that catalyze specific
biochemical reactions. This family includes the
adenylation domain of the Bacillus subtilis termination
module (Surfactin domain, SrfA-C) which recognizes a
specific amino acid building block, which is then
activated and transferred to the terminal thiol of the
4'-phosphopantetheine (Ppan) arm of the downstream
peptidyl carrier protein (PCP) domain.
Length = 474
Score = 30.2 bits (69), Expect = 0.39
Identities = 13/30 (43%), Positives = 16/30 (53%), Gaps = 1/30 (3%)
Query: 4 NANANMLTEGRLRIKGVSKHSRLAVALRRS 33
N AN L RLR +GV +A+ L RS
Sbjct: 19 NERANRLA-RRLRARGVGPGDVVALLLERS 47
>gnl|CDD|236075 PRK07683, PRK07683, aminotransferase A; Validated.
Length = 387
Score = 29.7 bits (67), Expect = 0.49
Identities = 20/76 (26%), Positives = 35/76 (46%), Gaps = 16/76 (21%)
Query: 35 SIDE-FT--VHSSTLIALSQESQDLERLKFGLEPPTSRSEVHGANHHTTENPGICTNGAC 91
+I E +T H++ L+ L + + + + K+ L H++ E+ I T GA
Sbjct: 53 AITENYTSYTHNAGLLELRKAACNFVKDKYDL-------------HYSPESEIIVTIGAS 99
Query: 92 ENLMGTYRCICNPGYE 107
E + +R I PG E
Sbjct: 100 EAIDIAFRTILEPGTE 115
>gnl|CDD|237054 PRK12316, PRK12316, peptide synthase; Provisional.
Length = 5163
Score = 29.2 bits (65), Expect = 1.2
Identities = 13/33 (39%), Positives = 19/33 (57%), Gaps = 1/33 (3%)
Query: 4 NANANMLTEGRLRIKGVSKHSRLAVALRRSISI 36
++ AN L RLR +GV R+A+A RS +
Sbjct: 2035 DSRANRLAH-RLRARGVGPEVRVAIAAERSFEL 2066
>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein. This
family consists of REP proteins from Dictyostelium
(Slime molds). REP protein is likely involved in
transcription regulation and control of DNA replication,
specifically amplification of plasmid at low copy
numbers. The formation of homomultimers may be required
for their regulatory activity.
Length = 910
Score = 28.6 bits (64), Expect = 1.3
Identities = 18/60 (30%), Positives = 30/60 (50%), Gaps = 12/60 (20%)
Query: 16 RIKGVSKHSRLAVALR--------RSISIDEFTVHSSTLIALSQE----SQDLERLKFGL 63
+IK + K +L ++++ RSIS D F S+ L+ +SQ D+ +LK L
Sbjct: 481 KIKSILKFIQLCISMKDINGFISIRSISFDTFKSISNELLGMSQRLLNLEADVRKLKDLL 540
>gnl|CDD|165214 PHA02887, PHA02887, EGF-like protein; Provisional.
Length = 126
Score = 27.2 bits (60), Expect = 2.3
Identities = 12/25 (48%), Positives = 14/25 (56%), Gaps = 2/25 (8%)
Query: 86 CTNGACENLMGTYR--CICNPGYEG 108
C NG C N++ CICN GY G
Sbjct: 94 CINGECMNIIDLDEKFCICNKGYTG 118
>gnl|CDD|213296 cd05930, A_NRPS, The adenylation domain of nonribosomal peptide
synthetases (NRPS). The adenylation (A) domain of NRPS
recognizes a specific amino acid or hydroxy acid and
activates it as an (amino) acyl adenylate by hydrolysis
of ATP. The activated acyl moiety then forms a
thioester bond to the enzyme-bound cofactor
phosphopantetheine of a peptidyl carrier protein
domain. NRPSs are large multifunctional enzymes which
synthesize many therapeutically useful peptides in
bacteria and fungi via a template-directed, nucleic
acid independent nonribosomal mechanism. These natural
products include antibiotics, immunosuppressants, plant
and animal toxins, and enzyme inhibitors. NRPS has a
distinct modular structure in which each module is
responsible for the recognition, activation, and in
some cases, modification of a single amino acid residue
of the final peptide product. The modules can be
subdivided into domains that catalyze specific
biochemical reactions.
Length = 445
Score = 27.0 bits (61), Expect = 4.7
Identities = 12/31 (38%), Positives = 15/31 (48%), Gaps = 1/31 (3%)
Query: 4 NANANMLTEGRLRIKGVSKHSRLAVALRRSI 34
N AN L LR +GV +A+ L RS
Sbjct: 19 NERANRLA-HYLRARGVGPGDLVAICLERSP 48
>gnl|CDD|188999 cd06461, M2_ACE, Peptidase family M2 Angiotensin converting enzyme
(ACE). Peptidase family M2 Angiotensin converting
enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc
dependent dipeptidase that catalyzes the conversion of
the decapeptide angiotensin I to the potent vasopressor
octapeptide angiotensin II, by removing two C-terminal
amino acids. There are two forms of the enzyme in
humans, the ubiquitous somatic ACE and the
sperm-specific germinal ACE, both encoded by the same
gene through transcription from alternative promoters.
Somatic ACE has two tandem active sites with distinct
catalytic properties, whereas germinal ACE, the function
of which is largely unknown, has just a single active
site. Recently, an ACE homolog, ACE2, has been
identified in humans that differs from ACE; it
preferentially removes carboxy-terminal hydrophobic or
basic amino acids and appears to be important in cardiac
function. ACE homologs (also known as members of the M2
gluzincin family) have been found in a wide variety of
species, including those that neither have a
cardiovascular system nor synthesize angiotensin. ACE is
well-known as a key part of the renin-angiotensin system
that regulates blood pressure and ACE inhibitors are
important for the treatment of hypertension.
Length = 562
Score = 26.8 bits (60), Expect = 5.7
Identities = 11/25 (44%), Positives = 15/25 (60%), Gaps = 4/25 (16%)
Query: 58 RLKF-GLEPPTSRSEVH---GANHH 78
R K+ G++PP RSE GA +H
Sbjct: 444 REKYQGVKPPVPRSETDFDPGAKYH 468
>gnl|CDD|204999 pfam12661, hEGF, Human growth factor-like EGF. hEGF, or human
growth factor-like EGF, domains have six conserved
residues disulfide-bonded into the characteristic
'ababcc' pattern. They are involved in growth and
proliferation of cells, in proteins of the Notch/Delta
pathway, neurogulin and selectins. hEGFs are also found
in mosaic proteins with four-disulfide laminin EGFs such
as aggrecan and perlecan. The core fold of the EGF
domain consists of two small beta-hairpins packed
against each other. Two major structural variants have
been identified based on the structural context of the
C-terminal Cys residue of disulfide 'c' in the
C-terminal hairpin: hEGFs and cEGFs. In hEGFs the
C-terminal thiol resides in the beta-turn, resulting in
shorter loop-lengths between the Cys residues of
disulfide 'c', typically C[8-9]XC. These shorter
loop-lengths are also typical of the four-disulfide EGF
domains, laminin ad integrin. Tandem hEGF domains have
six linking residues between terminal cysteines of
adjacent domains. hEGF domains may or may not bind
calcium in the linker region. hEGF domains with the
consensus motif CXD4X[F,Y]XCXC are hydroxylated
exclusively in the Asp residue.
Length = 13
Score = 24.2 bits (54), Expect = 5.8
Identities = 6/10 (60%), Positives = 6/10 (60%)
Query: 100 CICNPGYEGD 109
C C PGY G
Sbjct: 2 CQCPPGYTGP 11
>gnl|CDD|181036 PRK07568, PRK07568, aspartate aminotransferase; Provisional.
Length = 397
Score = 26.7 bits (60), Expect = 5.9
Identities = 9/32 (28%), Positives = 14/32 (43%)
Query: 76 NHHTTENPGICTNGACENLMGTYRCICNPGYE 107
+ + TNG E ++ IC+PG E
Sbjct: 83 GIDVEPDEILITNGGSEAILFAMMAICDPGDE 114
>gnl|CDD|213324 cd12116, A_NRPS_Ta1_like, The adenylation domain of nonribosomal
peptide synthetases (NRPS), including salinosporamide A
polyketide synthase. The adenylation (A) domain of
NRPS recognizes a specific amino acid or hydroxy acid
and activates it as an (amino) acyl adenylate by
hydrolysis of ATP. The activated acyl moiety then forms
a thioester to the enzyme-bound cofactor
phosphopantetheine of a peptidyl carrier protein
domain. NRPSs are large multifunctional enzymes which
synthesize many therapeutically useful peptides in
bacteria and fungi via a template-directed, nucleic
acid independent nonribosomal mechanism. These natural
products include antibiotics, immunosuppressants, plant
and animal toxins, and enzyme inhibitors. NRPS has a
distinct modular structure in which each module is
responsible for the recognition, activation, and in
some cases, modification of a single amino acid residue
of the final peptide product. The modules can be
subdivided into domains that catalyze specific
biochemical reactions. This family includes the
myxovirescin (TA) antibiotic biosynthetic gene in
Myxococcus xanthus; TA production plays a role in
predation. It also includes the salinosporamide A
polyketide synthase which is involved in the
biosynthesis of salinosporamide A, a marine microbial
metabolite whose chlorine atom is crucial for potent
proteasome inhibition and anticancer activity.
Length = 438
Score = 26.4 bits (59), Expect = 6.7
Identities = 10/23 (43%), Positives = 12/23 (52%)
Query: 14 RLRIKGVSKHSRLAVALRRSISI 36
RLR GV R+ V L RS +
Sbjct: 28 RLRALGVGPGDRVGVLLERSADL 50
>gnl|CDD|225281 COG2425, COG2425, Uncharacterized protein containing a von
Willebrand factor type A (vWA) domain [General function
prediction only].
Length = 437
Score = 26.6 bits (59), Expect = 6.8
Identities = 11/53 (20%), Positives = 16/53 (30%)
Query: 12 EGRLRIKGVSKHSRLAVALRRSISIDEFTVHSSTLIALSQESQDLERLKFGLE 64
G ++ R LR ++ D F H + QES L
Sbjct: 24 FGIVKYILKKLIPRWREILRERLAADAFYPHLLEEVFRYQESNSEPEFLVQLP 76
>gnl|CDD|213262 cd03295, ABC_OpuCA_Osmoprotection, ATP-binding cassette domain of
the osmoprotectant transporter. OpuCA is a the ATP
binding component of a bacterial solute transporter
that serves a protective role to cells growing in a
hyperosmolar environment. ABC (ATP-binding cassette)
transporter nucleotide-binding domain; ABC transporters
are a large family of proteins involved in the
transport of a wide variety of different compounds,
like sugars, ions, peptides, and more complex organic
molecules. The nucleotide binding domain shows the
highest similarity between all members of the family.
ABC transporters are a subset of nucleotide hydrolases
that contain a signature motif, Q-loop, and
H-loop/switch region, in addition, to the Walker A
motif/P-loop and Walker B motif commonly found in a
number of ATP- and GTP-binding and hydrolyzing
proteins.
Length = 242
Score = 26.1 bits (58), Expect = 7.1
Identities = 9/24 (37%), Positives = 11/24 (45%)
Query: 11 TEGRLRIKGVSKHSRLAVALRRSI 34
T G + I G + V LRR I
Sbjct: 54 TSGEIFIDGEDIREQDPVELRRKI 77
>gnl|CDD|218955 pfam06247, Plasmod_Pvs28, Plasmodium ookinete surface protein
Pvs28. This family consists of several ookinete surface
protein (Pvs28) from several species of Plasmodium.
Pvs25 and Pvs28 are expressed on the surface of
ookinetes. These proteins are potential candidates for
vaccine and induce antibodies that block the infectivity
of Plasmodium vivax in immunised animals.
Length = 196
Score = 25.9 bits (57), Expect = 9.4
Identities = 9/22 (40%), Positives = 11/22 (50%)
Query: 85 ICTNGACENLMGTYRCICNPGY 106
IC NG + + C CN GY
Sbjct: 7 ICKNGYLIQMSNHFECKCNEGY 28
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.131 0.394
Gapped
Lambda K H
0.267 0.0632 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,182,725
Number of extensions: 590918
Number of successful extensions: 473
Number of sequences better than 10.0: 1
Number of HSP's gapped: 473
Number of HSP's successfully gapped: 36
Length of query: 153
Length of database: 10,937,602
Length adjustment: 89
Effective length of query: 64
Effective length of database: 6,990,096
Effective search space: 447366144
Effective search space used: 447366144
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 55 (25.2 bits)