RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy11187
(115 letters)
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular
(mostly animal) proteins. Many of these proteins
require calcium for their biological function and
calcium-binding sites have been found to be located at
the N-terminus of particular EGF-like domains;
calcium-binding may be crucial for numerous
protein-protein interactions. Six conserved core
cysteines form three disulfide bridges as in non
calcium-binding EGF domains, whose structures are very
similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 27.2 bits (61), Expect = 0.43
Identities = 7/16 (43%), Positives = 10/16 (62%)
Query: 61 ECDESSPCGVGSSCIN 76
EC +PC G +C+N
Sbjct: 4 ECASGNPCQNGGTCVN 19
>gnl|CDD|236107 PRK07823, PRK07823, 5'-methylthioadenosine phosphorylase;
Validated.
Length = 264
Score = 28.5 bits (64), Expect = 0.61
Identities = 8/17 (47%), Positives = 11/17 (64%)
Query: 7 AAEAFREFQRNRDRIKT 23
A + F EF RN +R+K
Sbjct: 217 AVDVFAEFGRNIERLKR 233
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 26.4 bits (59), Expect = 0.75
Identities = 7/16 (43%), Positives = 10/16 (62%)
Query: 61 ECDESSPCGVGSSCIN 76
EC +PC G +C+N
Sbjct: 4 ECASGNPCQNGGTCVN 19
>gnl|CDD|221784 pfam12810, Gly_rich, Glycine rich protein. This family of proteins
is greatly expanded in Trichomonas vaginalis. The
proteins are composed of several glycine rich motifs
interspersed through the sequence. Although many
proteins have been annotated by similarity in the family
these annotations given the biased composition of the
sequences these are unlikely to be functionally
relevant.
Length = 248
Score = 27.9 bits (63), Expect = 1.1
Identities = 14/54 (25%), Positives = 18/54 (33%), Gaps = 3/54 (5%)
Query: 63 DESSPCGVGSSCINSSIGQSAESCIAISSIIINNSPSVSSTSNNSSD---EITI 113
+ G GSS I+ G I S N + NS + ITI
Sbjct: 185 HYAGGGGGGSSYISGKDGCIPNGSIHGSGYFFENPGMIDGNFTNSGNGYARITI 238
>gnl|CDD|234934 PRK01286, PRK01286, deoxyguanosinetriphosphate
triphosphohydrolase-like protein; Provisional.
Length = 336
Score = 27.4 bits (62), Expect = 1.7
Identities = 8/9 (88%), Positives = 9/9 (100%)
Query: 13 EFQRNRDRI 21
EFQR+RDRI
Sbjct: 28 EFQRDRDRI 36
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional
significance of EGF-like domains in what appear to be
unrelated proteins is not yet clear; a common feature
is that these repeats are found in the extracellular
domain of membrane-bound proteins or in proteins known
to be secreted (exception: prostaglandin G/H synthase);
the domain includes six cysteine residues which have
been shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 25.5 bits (56), Expect = 1.8
Identities = 8/17 (47%), Positives = 12/17 (70%)
Query: 61 ECDESSPCGVGSSCINS 77
EC S+PC G +C+N+
Sbjct: 1 ECAASNPCSNGGTCVNT 17
>gnl|CDD|214601 smart00289, WR1, Worm-specific repeat type 1. Worm-specific
repeat type 1. Cysteine-rich domain apparently unique
(so far) to C. elegans. Often appears with KU domains.
About 3 dozen worm proteins contain this domain.
Length = 38
Score = 24.9 bits (54), Expect = 3.1
Identities = 8/27 (29%), Positives = 9/27 (33%)
Query: 55 SSTPVCECDESSPCGVGSSCINSSIGQ 81
C + C G SC NS G
Sbjct: 9 LGGSPVRCSPNGSCPSGYSCQNSKQGI 35
>gnl|CDD|215216 PLN02385, PLN02385, hydrolase; alpha/beta fold family protein.
Length = 349
Score = 26.6 bits (59), Expect = 3.5
Identities = 14/37 (37%), Positives = 18/37 (48%), Gaps = 9/37 (24%)
Query: 5 REAAEAFREFQRNRDR---------IKTEETNERASR 32
R A +AF++ Q D IKTEE+ E SR
Sbjct: 34 RRARDAFKDIQLQLDHCLFKTPPSGIKTEESYEVNSR 70
>gnl|CDD|233748 TIGR02149, glgA_Coryne, glycogen synthase, Corynebacterium family.
This model describes Corynebacterium glutamicum GlgA and
closely related proteins in several other species. This
enzyme is required for glycogen biosynthesis and appears
to replace the distantly related TIGR02095 family of
ADP-glucose type glycogen synthase in Corynebacterium
glutamicum, Mycobacterium tuberculosis, Bifidobacterium
longum, and Streptomyces coelicolor [Energy metabolism,
Biosynthesis and degradation of polysaccharides].
Length = 388
Score = 26.3 bits (58), Expect = 4.1
Identities = 18/63 (28%), Positives = 23/63 (36%), Gaps = 7/63 (11%)
Query: 4 LREAAEAFREFQRNR-------DRIKTEETNERASRPSKYVKIKTNKPVGTVRLETDKSS 56
E +A RNR + EE E S +V +P+G V LE
Sbjct: 245 AEEVRQAVALLDRNRTGIIWINKMLPKEELVELLSNAEVFVCPSIYEPLGIVNLEAMACG 304
Query: 57 TPV 59
TPV
Sbjct: 305 TPV 307
>gnl|CDD|128486 smart00189, IL2, Interleukin-2 family. Interleukin-2 is a cytokine
produced by T-helper cells in response to antigenic or
mitogenic stimulation. This protein is required for
T-cell proliferation and other activities crucial to the
regulation of the immune response.
Length = 154
Score = 25.6 bits (56), Expect = 5.5
Identities = 8/27 (29%), Positives = 17/27 (62%)
Query: 85 SCIAISSIIINNSPSVSSTSNNSSDEI 111
SCIA++ ++ NS SS++ ++ +
Sbjct: 8 SCIALTLALLTNSAPTSSSTKSTQQHL 34
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 25.9 bits (57), Expect = 5.6
Identities = 10/29 (34%), Positives = 15/29 (51%)
Query: 4 LREAAEAFREFQRNRDRIKTEETNERASR 32
L + A+A + RD ++ E ERA R
Sbjct: 222 LTDVADALEQLDEYRDELERLEALERALR 250
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.306 0.120 0.324
Gapped
Lambda K H
0.267 0.0593 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,063,644
Number of extensions: 381128
Number of successful extensions: 368
Number of sequences better than 10.0: 1
Number of HSP's gapped: 368
Number of HSP's successfully gapped: 21
Length of query: 115
Length of database: 10,937,602
Length adjustment: 79
Effective length of query: 36
Effective length of database: 7,433,636
Effective search space: 267610896
Effective search space used: 267610896
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 53 (24.5 bits)