RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy13157
(1434 letters)
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 29.4 bits (67), Expect = 0.86
Identities = 15/28 (53%), Positives = 15/28 (53%)
Query: 1132 PGTCGQNANCKVINHSPICTCKPGYTGD 1159
G C NA C S CTCK GYTGD
Sbjct: 5 NGGCHPNATCTNTGGSFTCTCKSGYTGD 32
Score = 29.4 bits (67), Expect = 0.96
Identities = 14/28 (50%), Positives = 15/28 (53%)
Query: 257 PGTCGQNANCRVINHSPICTCKPGFTGD 284
G C NA C S CTCK G+TGD
Sbjct: 5 NGGCHPNATCTNTGGSFTCTCKSGYTGD 32
Score = 26.7 bits (60), Expect = 8.4
Identities = 12/27 (44%), Positives = 14/27 (51%)
Query: 72 PGSCGQNANCRVINHSPVCSCKPGFTG 98
G C NA C S C+CK G+TG
Sbjct: 5 NGGCHPNATCTNTGGSFTCTCKSGYTG 31
Score = 26.7 bits (60), Expect = 8.4
Identities = 12/27 (44%), Positives = 14/27 (51%)
Query: 574 PGSCGQNANCRVINHSPVCSCKPGFTG 600
G C NA C S C+CK G+TG
Sbjct: 5 NGGCHPNATCTNTGGSFTCTCKSGYTG 31
Score = 26.7 bits (60), Expect = 8.4
Identities = 12/27 (44%), Positives = 14/27 (51%)
Query: 1006 PGSCGQNANCRVINHSPVCSCKPGFTG 1032
G C NA C S C+CK G+TG
Sbjct: 5 NGGCHPNATCTNTGGSFTCTCKSGYTG 31
>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
modification].
Length = 2365
Score = 33.0 bits (75), Expect = 1.4
Identities = 15/43 (34%), Positives = 20/43 (46%), Gaps = 12/43 (27%)
Query: 1168 PPPPPPQEPICTCKPGYTGDALSYCNRIPPPPPPQDDVPEPVN 1210
PPPPPP PG+ + + PPPPPP +V +
Sbjct: 9 PPPPPP-------PPGFEPPS-----QPPPPPPPGVNVKKRSR 39
Score = 31.1 bits (70), Expect = 6.3
Identities = 12/29 (41%), Positives = 14/29 (48%)
Query: 1196 PPPPPPQDDVPEPVNPCYPSPCGLYSECR 1224
PPPPPP P P P P G+ + R
Sbjct: 9 PPPPPPPPGFEPPSQPPPPPPPGVNVKKR 37
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 32.7 bits (75), Expect = 1.7
Identities = 15/46 (32%), Positives = 19/46 (41%), Gaps = 17/46 (36%)
Query: 1166 RIPPPPPPQEPICTCKPGYTGDALSYCNRIPPPPPPQDDVPEPVNP 1211
R+P PP P E +P PPPPPP +D +P
Sbjct: 257 RLPAPPQPPEE----EP-------------PPPPPPPEDDDDPPED 285
>gnl|CDD|201524 pfam00954, S_locus_glycop, S-locus glycoprotein family. In
Brassicaceae, self-incompatible plants have a
self/non-self recognition system. This is
sporophytically controlled by multiple alleles at a
single locus (S). S-locus glycoproteins, as well as
S-receptor kinases, are in linkage with the S-alleles.
Length = 110
Score = 30.7 bits (70), Expect = 1.8
Identities = 11/20 (55%), Positives = 14/20 (70%), Gaps = 1/20 (5%)
Query: 313 CGPYAQCRDINGSPSCSCLP 332
CGPY C D+N SP C+C+
Sbjct: 86 CGPYGYC-DVNTSPKCNCIK 104
Score = 28.8 bits (65), Expect = 6.8
Identities = 11/24 (45%), Positives = 12/24 (50%), Gaps = 1/24 (4%)
Query: 258 GTCGQNANCRVINHSPICTCKPGF 281
G CG C +N SP C C GF
Sbjct: 84 GRCGPYGYC-DVNTSPKCNCIKGF 106
Score = 28.8 bits (65), Expect = 7.2
Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 3/32 (9%)
Query: 67 PEHPCP--GSCGQNANCRVINHSPVCSCKPGF 96
P+ C G CG C +N SP C+C GF
Sbjct: 76 PKDQCDVYGRCGPYGYC-DVNTSPKCNCIKGF 106
Score = 28.4 bits (64), Expect = 9.0
Identities = 11/24 (45%), Positives = 13/24 (54%), Gaps = 1/24 (4%)
Query: 575 GSCGQNANCRVINHSPVCSCKPGF 598
G CG C +N SP C+C GF
Sbjct: 84 GRCGPYGYC-DVNTSPKCNCIKGF 106
Score = 28.4 bits (64), Expect = 9.0
Identities = 11/24 (45%), Positives = 13/24 (54%), Gaps = 1/24 (4%)
Query: 1007 GSCGQNANCRVINHSPVCSCKPGF 1030
G CG C +N SP C+C GF
Sbjct: 84 GRCGPYGYC-DVNTSPKCNCIKGF 106
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion of
magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 32.5 bits (74), Expect = 1.8
Identities = 13/43 (30%), Positives = 14/43 (32%), Gaps = 19/43 (44%)
Query: 1169 PPPPPQEPICTCKPGYTGDALSYCNRIPPPPPPQDDVPEPVNP 1211
P P PQ P PPPPPP+ PE
Sbjct: 267 PEPEPQPP-------------------PPPPPPEPPEPEEEPD 290
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 32.2 bits (73), Expect = 3.0
Identities = 25/105 (23%), Positives = 30/105 (28%), Gaps = 15/105 (14%)
Query: 610 PPRPPPQEDVPEPVNPC--YPSPCGPYSQCRDIGGSPSCS--------CLPNYIGSPPN- 658
P PPP P P P P + RD S SPP
Sbjct: 2623 APDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQR 2682
Query: 659 CRPECVMNSECPSHEASRPPPQEDVPEP----VNPCYPSPCGPYS 699
R + + PPP PEP + P P GP +
Sbjct: 2683 PRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAA 2727
Score = 30.3 bits (68), Expect = 9.4
Identities = 19/93 (20%), Positives = 27/93 (29%), Gaps = 1/93 (1%)
Query: 600 GEPRIRCNKIPPRPPPQEDVPEPVNPCYPSPCGPYSQCRDIGGSPSCSCLPNYIGSPPNC 659
R + PP P PQ P P P P P P Q + + + G+ P
Sbjct: 2904 QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP-LAPTTDPAGAGEPSGAVPQP 2962
Query: 660 RPECVMNSECPSHEASRPPPQEDVPEPVNPCYP 692
++ P P P + P
Sbjct: 2963 WLGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular (mostly
animal) proteins. Many of these proteins require calcium
for their biological function and calcium-binding sites
have been found to be located at the N-terminus of
particular EGF-like domains; calcium-binding may be
crucial for numerous protein-protein interactions. Six
conserved core cysteines form three disulfide bridges as
in non calcium-binding EGF domains, whose structures are
very similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 28.0 bits (63), Expect = 3.1
Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 1/33 (3%)
Query: 305 VNPCV-PSPCGPYAQCRDINGSPSCSCLPNYIG 336
++ C +PC C + GS CSC P Y G
Sbjct: 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34
Score = 26.8 bits (60), Expect = 8.1
Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 1/33 (3%)
Query: 623 VNPCY-PSPCGPYSQCRDIGGSPSCSCLPNYIG 654
++ C +PC C + GS CSC P Y G
Sbjct: 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34
Score = 26.8 bits (60), Expect = 8.1
Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 1/33 (3%)
Query: 687 VNPCY-PSPCGPYSQCRDIGGSPSCSCLPNYIG 718
++ C +PC C + GS CSC P Y G
Sbjct: 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 30.9 bits (70), Expect = 6.6
Identities = 10/55 (18%), Positives = 13/55 (23%)
Query: 1165 NRIPPPPPPQEPICTCKPGYTGDALSYCNRIPPPPPPQDDVPEPVNPCYPSPCGL 1219
+ P P+ G P P + P P P P G
Sbjct: 690 AALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGA 744
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.140 0.504
Gapped
Lambda K H
0.267 0.0656 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 68,333,304
Number of extensions: 6243832
Number of successful extensions: 6236
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5899
Number of HSP's successfully gapped: 363
Length of query: 1434
Length of database: 10,937,602
Length adjustment: 109
Effective length of query: 1325
Effective length of database: 6,103,016
Effective search space: 8086496200
Effective search space used: 8086496200
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 65 (29.0 bits)