RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy13158
(289 letters)
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular
(mostly animal) proteins. Many of these proteins
require calcium for their biological function and
calcium-binding sites have been found to be located at
the N-terminus of particular EGF-like domains;
calcium-binding may be crucial for numerous
protein-protein interactions. Six conserved core
cysteines form three disulfide bridges as in non
calcium-binding EGF domains, whose structures are very
similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 29.9 bits (68), Expect = 0.12
Identities = 12/27 (44%), Positives = 13/27 (48%)
Query: 17 PSPCGPYSECRNINGGPSCSCRPGYIG 43
+PC C N G CSC PGY G
Sbjct: 8 GNPCQNGGTCVNTVGSYRCSCPPGYTG 34
Score = 26.1 bits (58), Expect = 2.8
Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 1/33 (3%)
Query: 167 VNPCV-PSPCGLYSQCRDIGGSPSCSCLPNYIG 198
++ C +PC C + GS CSC P Y G
Sbjct: 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional
significance of EGF-like domains in what appear to be
unrelated proteins is not yet clear; a common feature
is that these repeats are found in the extracellular
domain of membrane-bound proteins or in proteins known
to be secreted (exception: prostaglandin G/H synthase);
the domain includes six cysteine residues which have
been shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 28.6 bits (64), Expect = 0.30
Identities = 11/28 (39%), Positives = 12/28 (42%)
Query: 17 PSPCGPYSECRNINGGPSCSCRPGYIGS 44
+PC C N G C C PGY G
Sbjct: 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32
Score = 25.5 bits (56), Expect = 4.0
Identities = 11/33 (33%), Positives = 13/33 (39%), Gaps = 2/33 (6%)
Query: 224 CADPCPGSCGYNAQCKVINHTPTCTCPDGFIGD 256
CA P C C + C CP G+ GD
Sbjct: 2 CAASNP--CSNGGTCVNTPGSYRCVCPPGYTGD 32
Score = 24.7 bits (54), Expect = 9.0
Identities = 10/27 (37%), Positives = 12/27 (44%)
Query: 172 PSPCGLYSQCRDIGGSPSCSCLPNYIG 198
+PC C + GS C C P Y G
Sbjct: 5 SNPCSNGGTCVNTPGSYRCVCPPGYTG 31
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 27.6 bits (62), Expect = 0.95
Identities = 10/25 (40%), Positives = 11/25 (44%)
Query: 17 PSPCGPYSECRNINGGPSCSCRPGY 41
+PC C N G C C PGY
Sbjct: 8 GNPCQNGGTCVNTVGSYRCECPPGY 32
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 26.7 bits (60), Expect = 1.7
Identities = 12/31 (38%), Positives = 15/31 (48%)
Query: 116 PGSCGYNAECKVINHNPICSCSQGYIGDGFS 146
G C NA C + C+C GY GDG +
Sbjct: 5 NGGCHPNATCTNTGGSFTCTCKSGYTGDGVT 35
Score = 26.7 bits (60), Expect = 1.8
Identities = 10/25 (40%), Positives = 14/25 (56%)
Query: 19 PCGPYSECRNINGGPSCSCRPGYIG 43
C P + C N G +C+C+ GY G
Sbjct: 7 GCHPNATCTNTGGSFTCTCKSGYTG 31
Score = 26.0 bits (58), Expect = 3.1
Identities = 12/28 (42%), Positives = 14/28 (50%)
Query: 229 PGSCGYNAQCKVINHTPTCTCPDGFIGD 256
G C NA C + TCTC G+ GD
Sbjct: 5 NGGCHPNATCTNTGGSFTCTCKSGYTGD 32
>gnl|CDD|218955 pfam06247, Plasmod_Pvs28, Plasmodium ookinete surface protein
Pvs28. This family consists of several ookinete surface
protein (Pvs28) from several species of Plasmodium.
Pvs25 and Pvs28 are expressed on the surface of
ookinetes. These proteins are potential candidates for
vaccine and induce antibodies that block the infectivity
of Plasmodium vivax in immunised animals.
Length = 196
Score = 28.6 bits (64), Expect = 3.3
Identities = 35/131 (26%), Positives = 45/131 (34%), Gaps = 23/131 (17%)
Query: 19 PCGPYSECRNINGGPS-----CSCRPGYIGSPPNCRPECVMNSECPSHEACIKIPECIQN 73
CG Y+ C N C C GY S C P N C S +CI +
Sbjct: 51 VCGEYATCINQANKAEEKALKCGCINGYTLSQGVCVPNKCNNKVCGSG-------KCIVD 103
Query: 74 SECPYDKACIREKCVDPCPGSCGYGAVCTVINHSNEKCQDPCPGSCGYNAECKVINHNPI 133
P + C SC G V + + C C N ECK++
Sbjct: 104 PANPNNTTC-----------SCNIGKVPDQNGKCTKTGETKCSLKCKENEECKLVGGYYE 152
Query: 134 CSCSQGYIGDG 144
C C +G+ GDG
Sbjct: 153 CVCKEGFPGDG 163
Score = 27.8 bits (62), Expect = 6.2
Identities = 27/113 (23%), Positives = 37/113 (32%), Gaps = 31/113 (27%)
Query: 174 PCGLYSQCRDIGGSPS-----CSCLPNYIGAPPNCRPECLQNSECPNDKACIRE------ 222
CG Y+ C + C C+ Y + C P N C + K CI +
Sbjct: 51 VCGEYATCINQANKAEEKALKCGCINGYTLSQGVCVPNKCNNKVCGSGK-CIVDPANPNN 109
Query: 223 ---------------KCADP----CPGSCGYNAQCKVINHTPTCTCPDGFIGD 256
KC C C N +CK++ C C +GF GD
Sbjct: 110 TTCSCNIGKVPDQNGKCTKTGETKCSLKCKENEECKLVGGYYECVCKEGFPGD 162
>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
Length = 463
Score = 28.2 bits (62), Expect = 5.1
Identities = 9/16 (56%), Positives = 11/16 (68%)
Query: 150 PKPPEVPPPPQQDVQE 165
PK P PPPP+Q +E
Sbjct: 210 PKAPSSPPPPKQSAKE 225
>gnl|CDD|201524 pfam00954, S_locus_glycop, S-locus glycoprotein family. In
Brassicaceae, self-incompatible plants have a
self/non-self recognition system. This is
sporophytically controlled by multiple alleles at a
single locus (S). S-locus glycoproteins, as well as
S-receptor kinases, are in linkage with the S-alleles.
Length = 110
Score = 26.9 bits (60), Expect = 6.9
Identities = 10/24 (41%), Positives = 12/24 (50%), Gaps = 1/24 (4%)
Query: 230 GSCGYNAQCKVINHTPTCTCPDGF 253
G CG C +N +P C C GF
Sbjct: 84 GRCGPYGYC-DVNTSPKCNCIKGF 106
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.142 0.513
Gapped
Lambda K H
0.267 0.0656 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 14,064,013
Number of extensions: 1222572
Number of successful extensions: 1184
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1161
Number of HSP's successfully gapped: 110
Length of query: 289
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 193
Effective length of database: 6,679,618
Effective search space: 1289166274
Effective search space used: 1289166274
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 58 (26.3 bits)