RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5613
(1010 letters)
>gnl|CDD|201524 pfam00954, S_locus_glycop, S-locus glycoprotein family. In
Brassicaceae, self-incompatible plants have a
self/non-self recognition system. This is
sporophytically controlled by multiple alleles at a
single locus (S). S-locus glycoproteins, as well as
S-receptor kinases, are in linkage with the S-alleles.
Length = 110
Score = 30.7 bits (70), Expect = 1.0
Identities = 11/20 (55%), Positives = 14/20 (70%), Gaps = 1/20 (5%)
Query: 637 CGPYSQCRDINGSPSCSCLP 656
CGPY C D+N SP C+C+
Sbjct: 86 CGPYGYC-DVNTSPKCNCIK 104
Score = 28.8 bits (65), Expect = 5.7
Identities = 11/24 (45%), Positives = 12/24 (50%), Gaps = 1/24 (4%)
Query: 900 GSCGQNANCRVINHSPICTCRPGF 923
G CG C +N SP C C GF
Sbjct: 84 GRCGPYGYC-DVNTSPKCNCIKGF 106
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 27.9 bits (63), Expect = 2.3
Identities = 13/28 (46%), Positives = 17/28 (60%)
Query: 581 PGTCGQNANCRVINHNPSCTCKAGFTGD 608
G C NA C + +CTCK+G+TGD
Sbjct: 5 NGGCHPNATCTNTGGSFTCTCKSGYTGD 32
Score = 26.7 bits (60), Expect = 5.9
Identities = 12/27 (44%), Positives = 14/27 (51%)
Query: 899 PGSCGQNANCRVINHSPICTCRPGFTG 925
G C NA C S CTC+ G+TG
Sbjct: 5 NGGCHPNATCTNTGGSFTCTCKSGYTG 31
Score = 26.3 bits (59), Expect = 8.1
Identities = 12/28 (42%), Positives = 13/28 (46%), Gaps = 4/28 (14%)
Query: 743 CAPNAVCRDN----VCVCLPDYYGDGYT 766
C PNA C + C C Y GDG T
Sbjct: 8 CHPNATCTNTGGSFTCTCKSGYTGDGVT 35
>gnl|CDD|177553 PHA03185, PHA03185, UL14 tegument protein; Provisional.
Length = 214
Score = 31.1 bits (70), Expect = 2.4
Identities = 16/44 (36%), Positives = 19/44 (43%), Gaps = 8/44 (18%)
Query: 213 PVCSCPPGYTGNPFSQCLLPPTPTPTQATPTDPCFPSPCGSNAR 256
PV PG P PTP + PTDP S CG ++R
Sbjct: 176 PVAGGLPGGGA--------PAPPTPPRPGPTDPASESDCGRDSR 211
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular (mostly
animal) proteins. Many of these proteins require calcium
for their biological function and calcium-binding sites
have been found to be located at the N-terminus of
particular EGF-like domains; calcium-binding may be
crucial for numerous protein-protein interactions. Six
conserved core cysteines form three disulfide bridges as
in non calcium-binding EGF domains, whose structures are
very similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 27.6 bits (62), Expect = 2.8
Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 1/33 (3%)
Query: 629 VNPC-IPSPCGPYSQCRDINGSPSCSCLPNYIG 660
++ C +PC C + GS CSC P Y G
Sbjct: 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34
>gnl|CDD|163554 TIGR03842, F420_CPS_4043, F420-dependent oxidoreductase, CPS_4043
family. This model represents a family of putative
F420-dependent oxidoreductases, fairly closely related
to 5,10-methylenetetrahydromethanopterin reductase (mer,
TIGR03555), both within the bacterial luciferase-like
monoxygenase (LLM) family. A fairly deep split (to about
40 % sequence identity) in the present family separates
a strictly Actinobacterial clade from an
alpha/beta/gamma-proteobacterial clade, in which the
member is often the only apparent F420-dependent LLM
family member. The specific function, and whether
Actinobacterial and Proteobacterial clades differ in
function, are unknown [Unknown function, Enzymes of
unknown specificity].
Length = 330
Score = 31.2 bits (71), Expect = 3.0
Identities = 17/50 (34%), Positives = 18/50 (36%), Gaps = 13/50 (26%)
Query: 736 QQADPCIC---------APNAVCRD----NVCVCLPDYYGDGYTVCRPEC 772
Q ADP I A RD VCV P Y GD R +C
Sbjct: 170 QLADPDIAKWMIDAVREAAEDAGRDPDSVKVCVAAPAYVGDDLEHQREQC 219
>gnl|CDD|221000 pfam11145, DUF2921, Protein of unknown function (DUF2921). This
eukaryotic family of proteins has no known function.
Length = 871
Score = 31.3 bits (71), Expect = 3.2
Identities = 13/60 (21%), Positives = 22/60 (36%), Gaps = 8/60 (13%)
Query: 347 EYRDPCSTTQCGLNAICTVINGAAQCACLLLLQHHIHKNQDMDQYIS--------LGYML 398
+ S + L I V++ C L H+ + D+ ++S LGYM
Sbjct: 582 RRQAKDSIWRMDLEGIMVVVSLTLSCVFAGLQLFHVKSHPDVLPFVSLVMLGVQALGYMT 641
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.139 0.496
Gapped
Lambda K H
0.267 0.0656 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 47,829,603
Number of extensions: 4328586
Number of successful extensions: 3830
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3741
Number of HSP's successfully gapped: 220
Length of query: 1010
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 904
Effective length of database: 6,236,078
Effective search space: 5637414512
Effective search space used: 5637414512
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 64 (28.6 bits)