RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy13159
(660 letters)
>gnl|CDD|234824 PRK00725, glgC, glucose-1-phosphate adenylyltransferase;
Provisional.
Length = 425
Score = 34.8 bits (81), Expect = 0.12
Identities = 17/49 (34%), Positives = 25/49 (51%), Gaps = 4/49 (8%)
Query: 283 PECTVNTDCPLNKACVNQKCVDPCPGS-CGENRELDAQRFLVSS--VCL 328
P+ V C L + +++ CV P G GE+ E DA+RF S + L
Sbjct: 365 PDVNVGRSCRLRRCVIDRGCVIP-EGMVIGEDPEEDAKRFRRSEEGIVL 412
>gnl|CDD|237940 PRK15313, PRK15313, autotransport protein MisL; Provisional.
Length = 955
Score = 35.2 bits (80), Expect = 0.14
Identities = 23/58 (39%), Positives = 25/58 (43%), Gaps = 8/58 (13%)
Query: 474 PICTCPDGFIGDPFTLCSPKP--PEPRPPPQEDVPEPVNPCYPSPCGPYSQCRDIGGS 529
P PD DP P P P+P P D PEPV+P P P P DIG S
Sbjct: 574 PDPVIPDPVDPDPVDPVIPDPVIPDPVDPDPVD-PEPVDPVIPDPTIP-----DIGQS 625
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 31.3 bits (71), Expect = 1.7
Identities = 9/20 (45%), Positives = 11/20 (55%)
Query: 492 PKPPEPRPPPQEDVPEPVNP 511
P+PP P PPP+ PE
Sbjct: 271 PQPPPPPPPPEPPEPEEEPD 290
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular
(mostly animal) proteins. Many of these proteins
require calcium for their biological function and
calcium-binding sites have been found to be located at
the N-terminus of particular EGF-like domains;
calcium-binding may be crucial for numerous
protein-protein interactions. Six conserved core
cysteines form three disulfide bridges as in non
calcium-binding EGF domains, whose structures are very
similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 27.6 bits (62), Expect = 2.0
Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 1/33 (3%)
Query: 38 VNPCV-PSPCGPYSQCRDIGGSPSCSCLPNYIG 69
++ C +PC C + GS CSC P Y G
Sbjct: 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34
Score = 27.2 bits (61), Expect = 2.2
Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 1/33 (3%)
Query: 396 VNPC-IPSPCGPYSQCRDIGGSPSCSCLPNYIG 427
++ C +PC C + GS CSC P Y G
Sbjct: 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34
Score = 27.2 bits (61), Expect = 2.5
Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 1/33 (3%)
Query: 509 VNPCY-PSPCGPYSQCRDIGGSPSCSCLPNYIG 540
++ C +PC C + GS CSC P Y G
Sbjct: 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34
>gnl|CDD|172358 PRK13831, PRK13831, conjugal transfer protein TrbI; Provisional.
Length = 432
Score = 30.9 bits (70), Expect = 2.5
Identities = 12/38 (31%), Positives = 17/38 (44%), Gaps = 3/38 (7%)
Query: 479 PDGFIGDPFTLCSPKPPEPRPPPQEDVPEPVNPCYPSP 516
DG IG+P + +P P + + VNP P P
Sbjct: 78 TDGIIGEP---QQQQTFQPTPVETQQEEKAVNPFTPQP 112
>gnl|CDD|233184 TIGR00917, 2A060601, Niemann-Pick C type protein family. The model
describes Niemann-Pick C type protein in eukaryotes. The
defective protein has been associated with Niemann-Pick
disease which is described in humans as autosomal
recessive lipidosis. It is characterized by the
lysosomal accumulation of unestrified cholesterol. It is
an integral membrane protein, which indicates that this
protein is most likely involved in cholesterol transport
or acts as some component of cholesterol homeostasis
[Transport and binding proteins, Other].
Length = 1204
Score = 31.0 bits (70), Expect = 2.6
Identities = 15/50 (30%), Positives = 19/50 (38%), Gaps = 5/50 (10%)
Query: 97 DPCPGSCGY----NAQCKVINHTPICTCPDGFIGDAFLSCHPKPPEPVQP 142
PCP S G + + C+C D SC KPP P +P
Sbjct: 181 TPCPVSSGMRPMNVSTYSCGDAVLGCSCGCS-CSDCDCSCKAKPPTPPKP 229
>gnl|CDD|184519 PRK14120, gpmA, phosphoglyceromutase; Provisional.
Length = 249
Score = 30.4 bits (69), Expect = 2.9
Identities = 16/56 (28%), Positives = 18/56 (32%), Gaps = 22/56 (39%)
Query: 381 SFTAPPPPLESPPEYVNPCIPSPCGPYSQCRDIGGSPSCSCLPNYIGSPPNCRPEC 436
S+ PPPP+E E YSQ D P Y R EC
Sbjct: 119 SYDTPPPPIEDGSE------------YSQDND----------PRYADLGVGPRTEC 152
>gnl|CDD|201524 pfam00954, S_locus_glycop, S-locus glycoprotein family. In
Brassicaceae, self-incompatible plants have a
self/non-self recognition system. This is
sporophytically controlled by multiple alleles at a
single locus (S). S-locus glycoproteins, as well as
S-receptor kinases, are in linkage with the S-alleles.
Length = 110
Score = 28.0 bits (63), Expect = 5.4
Identities = 10/20 (50%), Positives = 13/20 (65%), Gaps = 1/20 (5%)
Query: 46 CGPYSQCRDIGGSPSCSCLP 65
CGPY C D+ SP C+C+
Sbjct: 86 CGPYGYC-DVNTSPKCNCIK 104
Score = 28.0 bits (63), Expect = 5.4
Identities = 10/20 (50%), Positives = 13/20 (65%), Gaps = 1/20 (5%)
Query: 404 CGPYSQCRDIGGSPSCSCLP 423
CGPY C D+ SP C+C+
Sbjct: 86 CGPYGYC-DVNTSPKCNCIK 104
Score = 28.0 bits (63), Expect = 5.4
Identities = 10/20 (50%), Positives = 13/20 (65%), Gaps = 1/20 (5%)
Query: 517 CGPYSQCRDIGGSPSCSCLP 536
CGPY C D+ SP C+C+
Sbjct: 86 CGPYGYC-DVNTSPKCNCIK 104
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional significance
of EGF-like domains in what appear to be unrelated
proteins is not yet clear; a common feature is that
these repeats are found in the extracellular domain of
membrane-bound proteins or in proteins known to be
secreted (exception: prostaglandin G/H synthase); the
domain includes six cysteine residues which have been
shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 25.9 bits (57), Expect = 6.4
Identities = 10/28 (35%), Positives = 12/28 (42%)
Query: 401 PSPCGPYSQCRDIGGSPSCSCLPNYIGS 428
+PC C + GS C C P Y G
Sbjct: 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32
Score = 25.9 bits (57), Expect = 8.0
Identities = 10/27 (37%), Positives = 12/27 (44%)
Query: 43 PSPCGPYSQCRDIGGSPSCSCLPNYIG 69
+PC C + GS C C P Y G
Sbjct: 5 SNPCSNGGTCVNTPGSYRCVCPPGYTG 31
Score = 25.9 bits (57), Expect = 8.0
Identities = 10/27 (37%), Positives = 12/27 (44%)
Query: 514 PSPCGPYSQCRDIGGSPSCSCLPNYIG 540
+PC C + GS C C P Y G
Sbjct: 5 SNPCSNGGTCVNTPGSYRCVCPPGYTG 31
>gnl|CDD|218955 pfam06247, Plasmod_Pvs28, Plasmodium ookinete surface protein
Pvs28. This family consists of several ookinete surface
protein (Pvs28) from several species of Plasmodium.
Pvs25 and Pvs28 are expressed on the surface of
ookinetes. These proteins are potential candidates for
vaccine and induce antibodies that block the infectivity
of Plasmodium vivax in immunised animals.
Length = 196
Score = 28.9 bits (65), Expect = 7.0
Identities = 28/112 (25%), Positives = 40/112 (35%), Gaps = 29/112 (25%)
Query: 403 PCGPYSQCRDIGGSPS-----CSCLPNYIGSPPNCRP-----------ECVMNSECPSNE 446
CG Y+ C + C C+ Y S C P +C+++ P+N
Sbjct: 51 VCGEYATCINQANKAEEKALKCGCINGYTLSQGVCVPNKCNNKVCGSGKCIVDPANPNNT 110
Query: 447 ACI---------NEKCGDP----CPGSCGYNAQCKVINHTPICTCPDGFIGD 485
C N KC C C N +CK++ C C +GF GD
Sbjct: 111 TCSCNIGKVPDQNGKCTKTGETKCSLKCKENEECKLVGGYYECVCKEGFPGD 162
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 29.6 bits (66), Expect = 7.2
Identities = 16/48 (33%), Positives = 19/48 (39%), Gaps = 3/48 (6%)
Query: 480 DGFIGDPFTLCSPKPPEPRPPPQEDVPEPVNPCYPSPCGPYSQCRDIG 527
F GDP + P P+ P PEPV P P P PY +
Sbjct: 407 PQFCGDPGLV---SPYNPQSPGTSYGPEPVGPVPPQPTNPYVMPISMA 451
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 25.6 bits (57), Expect = 9.7
Identities = 13/28 (46%), Positives = 14/28 (50%)
Query: 571 PGSCGYNALCKVINHTPICTCPDGYTGD 598
G C NA C + CTC GYTGD
Sbjct: 5 NGGCHPNATCTNTGGSFTCTCKSGYTGD 32
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.141 0.505
Gapped
Lambda K H
0.267 0.0656 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 32,206,964
Number of extensions: 2934637
Number of successful extensions: 3163
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3054
Number of HSP's successfully gapped: 189
Length of query: 660
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 557
Effective length of database: 6,369,140
Effective search space: 3547610980
Effective search space used: 3547610980
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 62 (27.8 bits)