RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15398
(88 letters)
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 40.4 bits (94), Expect = 3e-05
Identities = 20/26 (76%), Positives = 22/26 (84%)
Query: 54 VRGGGGVEGGPGKEEEEEEEEEEEEE 79
V GGGG +GG +EEEEEEEEEEEEE
Sbjct: 852 VDGGGGSDGGDSEEEEEEEEEEEEEE 877
Score = 27.7 bits (61), Expect = 0.82
Identities = 13/21 (61%), Positives = 16/21 (76%)
Query: 59 GVEGGPGKEEEEEEEEEEEEE 79
GV+GG G + + EEEEEEEE
Sbjct: 851 GVDGGGGSDGGDSEEEEEEEE 871
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 34.3 bits (79), Expect = 0.004
Identities = 13/26 (50%), Positives = 17/26 (65%)
Query: 58 GGVEGGPGKEEEEEEEEEEEEENGPQ 83
E +EE+EEEEEEE+E+ GP
Sbjct: 384 ANTERDDEEEEDEEEEEEEDEDEGPS 409
Score = 25.1 bits (55), Expect = 6.9
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 52 YIVRGGGGVEGGPGKEEEEEEEEEEEEE 79
YI+ +E E ++EEEE+EEE
Sbjct: 371 YILDAFSALEIEDANTERDDEEEEDEEE 398
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 31.4 bits (72), Expect = 0.014
Identities = 11/24 (45%), Positives = 13/24 (54%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
EEE++EEEEEEEE
Sbjct: 56 AAAAAAAAAAAEEEKKEEEEEEEE 79
Score = 28.0 bits (63), Expect = 0.28
Identities = 10/24 (41%), Positives = 14/24 (58%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
+EE++EEEEEEEE+
Sbjct: 57 AAAAAAAAAAEEEKKEEEEEEEED 80
Score = 26.0 bits (58), Expect = 1.3
Identities = 9/16 (56%), Positives = 15/16 (93%)
Query: 66 KEEEEEEEEEEEEENG 81
+++EEEEEEEE+++ G
Sbjct: 69 EKKEEEEEEEEDDDMG 84
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 31.1 bits (70), Expect = 0.047
Identities = 13/21 (61%), Positives = 16/21 (76%)
Query: 59 GVEGGPGKEEEEEEEEEEEEE 79
G + ++EEEEEEEEEEEE
Sbjct: 223 GSDSESEEDEEEEEEEEEEEE 243
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 30.2 bits (68), Expect = 0.086
Identities = 15/23 (65%), Positives = 16/23 (69%)
Query: 57 GGGVEGGPGKEEEEEEEEEEEEE 79
G VE KE+EEEEE EEEEE
Sbjct: 257 GDYVEEKELKEDEEEEETEEEEE 279
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 29.9 bits (67), Expect = 0.11
Identities = 6/30 (20%), Positives = 7/30 (23%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEENGPQTT 85
E E+E E E E
Sbjct: 221 QAPSPNTQQAVEHEDEPTEPEREGPPFPGH 250
Score = 24.9 bits (54), Expect = 6.9
Identities = 5/29 (17%), Positives = 10/29 (34%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEENGPQT 84
+ ++E E E E G ++
Sbjct: 224 SPNTQQAVEHEDEPTEPEREGPPFPGHRS 252
Score = 24.5 bits (53), Expect = 9.3
Identities = 5/26 (19%), Positives = 5/26 (19%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEENG 81
G P EE E
Sbjct: 99 SVGSPTPSPSGSAEELASGLSPENTS 124
Score = 24.5 bits (53), Expect = 9.3
Identities = 8/31 (25%), Positives = 10/31 (32%), Gaps = 4/31 (12%)
Query: 56 GGGGVEGGPGKEE----EEEEEEEEEEENGP 82
+ P E E+E E E GP
Sbjct: 215 QSPTPQQAPSPNTQQAVEHEDEPTEPEREGP 245
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 30.0 bits (68), Expect = 0.12
Identities = 14/24 (58%), Positives = 16/24 (66%)
Query: 64 PGKEEEEEEEEEEEEENGPQTTFE 87
+EEEE EEEEEEE + TFE
Sbjct: 308 EEEEEEEVPEEEEEEEEEEERTFE 331
Score = 28.5 bits (64), Expect = 0.40
Identities = 12/19 (63%), Positives = 13/19 (68%)
Query: 61 EGGPGKEEEEEEEEEEEEE 79
E +EE EEEEEEEEE
Sbjct: 307 EEEEEEEEVPEEEEEEEEE 325
Score = 26.9 bits (60), Expect = 1.4
Identities = 12/19 (63%), Positives = 13/19 (68%)
Query: 61 EGGPGKEEEEEEEEEEEEE 79
E + EEEEEEEEEEE
Sbjct: 309 EEEEEEVPEEEEEEEEEEE 327
Score = 25.0 bits (55), Expect = 7.0
Identities = 12/19 (63%), Positives = 12/19 (63%)
Query: 61 EGGPGKEEEEEEEEEEEEE 79
E EEEEEEEEEEE
Sbjct: 310 EEEEEVPEEEEEEEEEEER 328
Score = 24.6 bits (54), Expect = 8.9
Identities = 12/19 (63%), Positives = 13/19 (68%)
Query: 61 EGGPGKEEEEEEEEEEEEE 79
E +E EEEEEEEEEE
Sbjct: 308 EEEEEEEVPEEEEEEEEEE 326
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 30.0 bits (67), Expect = 0.14
Identities = 5/20 (25%), Positives = 15/20 (75%)
Query: 65 GKEEEEEEEEEEEEENGPQT 84
+E+E+E++++EE++ +
Sbjct: 155 DDDEDEDEDDDDEEDDEEEE 174
Score = 29.2 bits (65), Expect = 0.23
Identities = 7/29 (24%), Positives = 18/29 (62%)
Query: 60 VEGGPGKEEEEEEEEEEEEENGPQTTFEV 88
V ++E+E++++EE++E + E+
Sbjct: 152 VIDDDDEDEDEDDDDEEDDEEEEEEEEEI 180
Score = 25.8 bits (56), Expect = 3.3
Identities = 7/25 (28%), Positives = 16/25 (64%)
Query: 55 RGGGGVEGGPGKEEEEEEEEEEEEE 79
R ++ E+E++++EE++EE
Sbjct: 148 RDNFVIDDDDEDEDEDDDDEEDDEE 172
Score = 25.4 bits (55), Expect = 4.6
Identities = 10/32 (31%), Positives = 15/32 (46%), Gaps = 7/32 (21%)
Query: 58 GGVEGGPGKEEEEEEEEEE-------EEENGP 82
+ +++EEEEEEEE E+E
Sbjct: 160 EDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more
internally with limited reactivity in the C-terminal
domains, while P2 proteins seem to be more externally
located and are more likely to interact with other
cellular components. In lower eukaryotes, P1 and P2 are
further subdivided into P1A, P1B, P2A, and P2B, which
form P1A/P2B and P1B/P2A heterodimers. Some plant
species have a third P-protein, called P3, which is not
homologous to P1 and P2. In humans, P1 and P2 are
strongly autoimmunogenic. They play a significant role
in the etiology and pathogenesis of systemic lupus
erythema (SLE). In addition, the ribosome-inactivating
protein trichosanthin (TCS) interacts with human P0,
P1, and P2, with its primary binding site located in
the C-terminal region of P2. TCS inactivates the
ribosome by depurinating a specific adenine in the
sarcin-ricin loop of 28S rRNA.
Length = 103
Score = 28.8 bits (65), Expect = 0.15
Identities = 9/25 (36%), Positives = 14/25 (56%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEEN 80
K+EE++EEEEEE ++
Sbjct: 72 AAAAAAAAEAKKEEKKEEEEEESDD 96
Score = 27.3 bits (61), Expect = 0.69
Identities = 9/25 (36%), Positives = 14/25 (56%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEEN 80
KEE++EEEEEE +++
Sbjct: 73 AAAAAAAEAKKEEKKEEEEEESDDD 97
Score = 26.9 bits (60), Expect = 0.74
Identities = 8/24 (33%), Positives = 13/24 (54%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
+ ++EE++EEEEEE
Sbjct: 70 AAAAAAAAAAEAKKEEKKEEEEEE 93
Score = 26.9 bits (60), Expect = 0.76
Identities = 8/25 (32%), Positives = 13/25 (52%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEEN 80
++EE++EEEEEE +
Sbjct: 71 AAAAAAAAAEAKKEEKKEEEEEESD 95
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 29.0 bits (65), Expect = 0.18
Identities = 13/24 (54%), Positives = 14/24 (58%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
G G G G +EEEEEEEE E
Sbjct: 83 GPAGTTSGTGHTRQEEEEEEEENE 106
Score = 27.9 bits (62), Expect = 0.48
Identities = 11/20 (55%), Positives = 12/20 (60%)
Query: 64 PGKEEEEEEEEEEEEENGPQ 83
G +EEEEEEEEN Q
Sbjct: 89 SGTGHTRQEEEEEEEENEKQ 108
Score = 27.9 bits (62), Expect = 0.52
Identities = 10/23 (43%), Positives = 15/23 (65%)
Query: 62 GGPGKEEEEEEEEEEEEENGPQT 84
G +EEEEEEEE E+++ +
Sbjct: 92 GHTRQEEEEEEEENEKQQQSDEA 114
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 28.9 bits (65), Expect = 0.23
Identities = 13/33 (39%), Positives = 16/33 (48%)
Query: 50 VIYIVRGGGGVEGGPGKEEEEEEEEEEEEENGP 82
V + + +EEEEEE EE E E GP
Sbjct: 342 VNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGP 374
Score = 28.1 bits (63), Expect = 0.48
Identities = 12/19 (63%), Positives = 15/19 (78%)
Query: 66 KEEEEEEEEEEEEENGPQT 84
++EEEEEEEEE EE P+
Sbjct: 354 EDEEEEEEEEEPEEPEPEE 372
Score = 26.6 bits (59), Expect = 1.6
Identities = 11/17 (64%), Positives = 14/17 (82%)
Query: 67 EEEEEEEEEEEEENGPQ 83
+E+EEEEEEEEE P+
Sbjct: 353 QEDEEEEEEEEEPEEPE 369
Score = 25.8 bits (57), Expect = 2.9
Identities = 12/22 (54%), Positives = 13/22 (59%)
Query: 66 KEEEEEEEEEEEEENGPQTTFE 87
+ EEEEEEEEEEE E
Sbjct: 286 QLEEEEEEEEEEEPAERDELEE 307
Score = 24.6 bits (54), Expect = 7.7
Identities = 11/16 (68%), Positives = 11/16 (68%)
Query: 67 EEEEEEEEEEEEENGP 82
EEEEE EE E EE P
Sbjct: 360 EEEEEPEEPEPEEGPP 375
>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
Length = 310
Score = 28.8 bits (65), Expect = 0.25
Identities = 8/18 (44%), Positives = 10/18 (55%)
Query: 64 PGKEEEEEEEEEEEEENG 81
EEEEEEE++ G
Sbjct: 288 AAAAAPAEEEEEEEDDMG 305
Score = 24.6 bits (54), Expect = 7.3
Identities = 6/16 (37%), Positives = 6/16 (37%)
Query: 66 KEEEEEEEEEEEEENG 81
EEEEEE
Sbjct: 287 AAAAAAPAEEEEEEED 302
>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
represents the eukaryotic large ribosomal protein P2.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P2 is located in the L12 stalk, with proteins
P1, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers). Bacteria may have four or six copies
of L7/L12 (two or three homodimers) depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plants have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site in the C-terminal region
of P2. TCS inactivates the ribosome by depurinating a
specific adenine in the sarcin-ricin loop of 28S rRNA.
Length = 109
Score = 28.0 bits (63), Expect = 0.30
Identities = 8/24 (33%), Positives = 12/24 (50%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
K+EE++EE EEE +
Sbjct: 77 AAAAAAAAAAKKEEKKEESEEESD 100
Score = 26.8 bits (60), Expect = 1.0
Identities = 7/24 (29%), Positives = 11/24 (45%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
++EE++EE EEE
Sbjct: 76 AAAAAAAAAAAKKEEKKEESEEES 99
Score = 25.7 bits (57), Expect = 2.3
Identities = 8/24 (33%), Positives = 12/24 (50%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
KEE++EE EEE ++
Sbjct: 78 AAAAAAAAAKKEEKKEESEEESDD 101
Score = 25.7 bits (57), Expect = 2.7
Identities = 7/24 (29%), Positives = 11/24 (45%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
++EE++EE EEE
Sbjct: 75 AAAAAAAAAAAAKKEEKKEESEEE 98
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 28.5 bits (64), Expect = 0.33
Identities = 14/18 (77%), Positives = 14/18 (77%)
Query: 65 GKEEEEEEEEEEEEENGP 82
EEEEEEEEEEEEE P
Sbjct: 403 AAEEEEEEEEEEEEEEEP 420
Score = 28.5 bits (64), Expect = 0.41
Identities = 15/20 (75%), Positives = 16/20 (80%)
Query: 60 VEGGPGKEEEEEEEEEEEEE 79
VE +EEEEEEEEEEEEE
Sbjct: 399 VERWAAEEEEEEEEEEEEEE 418
Score = 28.1 bits (63), Expect = 0.52
Identities = 13/22 (59%), Positives = 14/22 (63%)
Query: 67 EEEEEEEEEEEEENGPQTTFEV 88
EEEEEEEEEEE + EV
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEV 424
Score = 26.6 bits (59), Expect = 1.8
Identities = 13/20 (65%), Positives = 14/20 (70%)
Query: 67 EEEEEEEEEEEEENGPQTTF 86
EEEEEEEEEEEEE +
Sbjct: 407 EEEEEEEEEEEEEPVAEVMM 426
Score = 25.8 bits (57), Expect = 3.5
Identities = 12/22 (54%), Positives = 12/22 (54%)
Query: 67 EEEEEEEEEEEEENGPQTTFEV 88
EEEEEEEEEEEE
Sbjct: 408 EEEEEEEEEEEEPVAEVMMMPA 429
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 28.3 bits (64), Expect = 0.41
Identities = 11/18 (61%), Positives = 12/18 (66%)
Query: 64 PGKEEEEEEEEEEEEENG 81
+EEEEEEEEE EE
Sbjct: 304 EEEEEEEEEEEEPSEEEA 321
Score = 27.1 bits (61), Expect = 0.94
Identities = 13/14 (92%), Positives = 14/14 (100%)
Query: 66 KEEEEEEEEEEEEE 79
+EEEEEEEEEEEEE
Sbjct: 302 EEEEEEEEEEEEEE 315
Score = 26.8 bits (60), Expect = 1.5
Identities = 13/17 (76%), Positives = 13/17 (76%)
Query: 66 KEEEEEEEEEEEEENGP 82
EEEEEEEEEEEE P
Sbjct: 300 AAEEEEEEEEEEEEEEP 316
Score = 26.4 bits (59), Expect = 2.0
Identities = 13/16 (81%), Positives = 13/16 (81%)
Query: 64 PGKEEEEEEEEEEEEE 79
EEEEEEEEEEEEE
Sbjct: 299 AAAEEEEEEEEEEEEE 314
Score = 26.0 bits (58), Expect = 2.9
Identities = 12/20 (60%), Positives = 13/20 (65%)
Query: 60 VEGGPGKEEEEEEEEEEEEE 79
+EEEEEEEEEEEE
Sbjct: 297 QAAAAEEEEEEEEEEEEEEP 316
Score = 25.2 bits (56), Expect = 5.2
Identities = 10/18 (55%), Positives = 11/18 (61%)
Query: 60 VEGGPGKEEEEEEEEEEE 77
E +EEEEEE EEE
Sbjct: 303 EEEEEEEEEEEEEPSEEE 320
>gnl|CDD|178307 PLN02705, PLN02705, beta-amylase.
Length = 681
Score = 28.4 bits (63), Expect = 0.47
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 53 IVRGGGGVEGGPGKEEEEEEEE 74
I GGGG GG GK E E+E+E
Sbjct: 65 ISSGGGGGGGGKGKREREKEKE 86
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 27.4 bits (61), Expect = 0.56
Identities = 11/22 (50%), Positives = 12/22 (54%)
Query: 58 GGVEGGPGKEEEEEEEEEEEEE 79
G E EEEE+EEE EE
Sbjct: 76 GAEAAAEADEAEEEEKEEEAEE 97
Score = 25.4 bits (56), Expect = 2.8
Identities = 10/26 (38%), Positives = 14/26 (53%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEENG 81
E +EEE+EEE EEE ++
Sbjct: 77 AEAAAEADEAEEEEKEEEAEEESDDD 102
Score = 25.4 bits (56), Expect = 2.9
Identities = 11/24 (45%), Positives = 12/24 (50%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
G EEEE+EEE EEE
Sbjct: 76 GAEAAAEADEAEEEEKEEEAEEES 99
Score = 25.4 bits (56), Expect = 3.1
Identities = 9/19 (47%), Positives = 11/19 (57%)
Query: 62 GGPGKEEEEEEEEEEEEEN 80
G E +E EEEE+EE
Sbjct: 76 GAEAAAEADEAEEEEKEEE 94
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 27.4 bits (61), Expect = 0.84
Identities = 15/63 (23%), Positives = 27/63 (42%), Gaps = 4/63 (6%)
Query: 21 NLRKEVIRNKIRAIGKMARVFSV---LRPLLSVIYIVRGGGGVE-GGPGKEEEEEEEEEE 76
LR+ + + IR + + + LR + +R EE +E+E+EE
Sbjct: 318 ELRRRRVNDVIRPLVREHNNDQLNVKLRNPSTKESKMRDKRRARLDPIDFEEVDEDEDEE 377
Query: 77 EEE 79
EE+
Sbjct: 378 EEQ 380
Score = 25.5 bits (56), Expect = 3.9
Identities = 7/28 (25%), Positives = 11/28 (39%)
Query: 57 GGGVEGGPGKEEEEEEEEEEEEENGPQT 84
E + E+ EEE + E+G
Sbjct: 380 QRSDEHEEEEGEDSEEEGSQSREDGSSE 407
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 27.4 bits (60), Expect = 0.88
Identities = 13/24 (54%), Positives = 15/24 (62%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
G + E+EEEEEEEEEEE
Sbjct: 126 GKKATKEDESDEDEEEEEEEEEEE 149
Score = 27.4 bits (60), Expect = 0.99
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEEN 80
G G + E +E+EEEEEEEE
Sbjct: 123 GNAGKKATKEDESDEDEEEEEEEEE 147
Score = 27.4 bits (60), Expect = 1.0
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 48 LSVIYIVRGGGGVEGGPGKEEEEEEEEEEEEEN 80
L+ + + + G KE+E +E+EEEEEE
Sbjct: 113 LAALQLPKKAGNAGKKATKEDESDEDEEEEEEE 145
Score = 27.0 bits (59), Expect = 1.4
Identities = 13/28 (46%), Positives = 13/28 (46%)
Query: 54 VRGGGGVEGGPGKEEEEEEEEEEEEENG 81
V G G GG EE EEE E E G
Sbjct: 169 VDHGNGSSGGDNGEEGEEESVTEAEAEG 196
Score = 25.4 bits (55), Expect = 4.7
Identities = 12/13 (92%), Positives = 12/13 (92%)
Query: 67 EEEEEEEEEEEEE 79
EEEEEEEEEEE E
Sbjct: 139 EEEEEEEEEEEAE 151
Score = 25.4 bits (55), Expect = 4.8
Identities = 12/24 (50%), Positives = 14/24 (58%)
Query: 55 RGGGGVEGGPGKEEEEEEEEEEEE 78
+ E +EEEEEEEEEE E
Sbjct: 128 KATKEDESDEDEEEEEEEEEEEAE 151
Score = 25.0 bits (54), Expect = 6.6
Identities = 11/14 (78%), Positives = 13/14 (92%)
Query: 66 KEEEEEEEEEEEEE 79
++EEEEEEEEEEE
Sbjct: 137 EDEEEEEEEEEEEA 150
Score = 24.7 bits (53), Expect = 8.6
Identities = 13/19 (68%), Positives = 13/19 (68%)
Query: 67 EEEEEEEEEEEEENGPQTT 85
EEEEEEEEE E E Q T
Sbjct: 141 EEEEEEEEEAEVEENEQGT 159
>gnl|CDD|235323 PRK04960, PRK04960, universal stress protein UspB; Provisional.
Length = 111
Score = 26.9 bits (60), Expect = 0.98
Identities = 17/44 (38%), Positives = 20/44 (45%), Gaps = 16/44 (36%)
Query: 37 MARVFSVLRPLLSVIYIVRG----------GGG---VEGGPGKE 67
MAR FS LR LL ++RG GGG G P K+
Sbjct: 19 MARYFSSLRALL---VVLRGCDPLLYQYVDGGGFFTSHGQPNKQ 59
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 26.8 bits (59), Expect = 1.0
Identities = 15/40 (37%), Positives = 19/40 (47%), Gaps = 15/40 (37%)
Query: 57 GGGVEGGPG---------------KEEEEEEEEEEEEENG 81
GGGV KEE++EEEEEEE++ G
Sbjct: 68 GGGVAAAAAPAAGAATAGAKAEAKKEEKKEEEEEEEDDLG 107
>gnl|CDD|218576 pfam05395, DARPP-32, Protein phosphatase inhibitor 1/DARPP-32.
This family consists of several mammalian protein
phosphatase inhibitor 1 (IPP-1) and dopamine- and
cAMP-regulated neuronal phosphoprotein (DARPP-32)
proteins. Protein phosphatase inhibitor-1 is involved in
signal transduction and is an endogenous inhibitor of
protein phosphatase-1. It has been demonstrated that
DARPP-32, if phosphorylated, can inhibit
protein-phosphatase-1. DARPP-32 has a key role in many
neurotransmitter pathways throughout the brain and has
been shown to be involved in controlling receptors, ion
channels and other physiological factors including the
brain's response to drugs of abuse, such as cocaine,
opiates and nicotine. DARPP-32 is reciprocally regulated
by the two neurotransmitters that are most often
implicated in schizophrenia - dopamine and glutamate.
Dopamine activates DARPP-32 through the D1 receptor
pathway and disables DARPP-32 through the D2 receptor.
Glutamate, acting through the N-methyl-d-aspartate
receptor, renders DARPP-32 inactive. A mutant form of
DARPP-32 has been linked with gastric cancers.
Length = 170
Score = 26.8 bits (59), Expect = 1.1
Identities = 9/30 (30%), Positives = 11/30 (36%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEENGPQTT 85
E G E E E ++ EE G T
Sbjct: 109 LDSPDELGYPTGETESTEAQDSEEPGSCPT 138
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 27.0 bits (60), Expect = 1.1
Identities = 13/20 (65%), Positives = 16/20 (80%)
Query: 60 VEGGPGKEEEEEEEEEEEEE 79
V+ K+EEEEEEEEEE+E
Sbjct: 168 VDEEDEKDEEEEEEEEEEDE 187
Score = 26.6 bits (59), Expect = 1.5
Identities = 12/20 (60%), Positives = 14/20 (70%)
Query: 60 VEGGPGKEEEEEEEEEEEEE 79
E EEEEEEEEEE+E+
Sbjct: 169 DEEDEKDEEEEEEEEEEDED 188
Score = 25.1 bits (55), Expect = 5.6
Identities = 11/16 (68%), Positives = 14/16 (87%)
Query: 65 GKEEEEEEEEEEEEEN 80
E++EEEEEEEEEE+
Sbjct: 171 EDEKDEEEEEEEEEED 186
>gnl|CDD|220587 pfam10127, Nuc-transf, Predicted nucleotidyltransferase. Members
of this family of bacterial proteins catalyze the
transfer of nucleotide residues from nucleoside
diphosphates or triphosphates into dimer or polymer
forms.
Length = 247
Score = 26.9 bits (60), Expect = 1.2
Identities = 26/85 (30%), Positives = 35/85 (41%), Gaps = 27/85 (31%)
Query: 18 AAANLRK----EVIRNKIRAIGKMARVFSVLRPLLSVIYIVRGGG-----------GVEG 62
A RK E +R K + F VLRPLL+ +I+R G GV
Sbjct: 133 ARKQFRKYLKGETVRLK--------KYFYVLRPLLAGRWILRTGTPPPMEFQVLLAGVLD 184
Query: 63 GPGKEEEEE----EEEEEEEENGPQ 83
G E +E + EE+E GP+
Sbjct: 185 GSLLAEIDELLARKRGGEEKEEGPR 209
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 27.0 bits (60), Expect = 1.2
Identities = 7/19 (36%), Positives = 12/19 (63%)
Query: 64 PGKEEEEEEEEEEEEENGP 82
K+ E ++++ EE EN P
Sbjct: 79 EEKKPESDDDKTEENENDP 97
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 27.0 bits (60), Expect = 1.2
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 57 GGGVEGGPGKEEEEEEEEEEEEENGP 82
G V+ P +EEEEE EEEEE P
Sbjct: 154 GSTVDLEPPEEEEEEIAEEEEEVKEP 179
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 26.1 bits (58), Expect = 1.5
Identities = 11/16 (68%), Positives = 13/16 (81%)
Query: 66 KEEEEEEEEEEEEENG 81
+EEEEEEE+EE EE
Sbjct: 82 EEEEEEEEKEESEEEA 97
Score = 25.7 bits (57), Expect = 2.0
Identities = 12/14 (85%), Positives = 13/14 (92%)
Query: 66 KEEEEEEEEEEEEE 79
KEEEEEEEE+EE E
Sbjct: 81 KEEEEEEEEKEESE 94
Score = 25.7 bits (57), Expect = 2.2
Identities = 10/22 (45%), Positives = 12/22 (54%)
Query: 61 EGGPGKEEEEEEEEEEEEENGP 82
EE++EEEEEEEE
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEE 92
Score = 25.7 bits (57), Expect = 2.2
Identities = 11/16 (68%), Positives = 13/16 (81%)
Query: 64 PGKEEEEEEEEEEEEE 79
+EEEEEE+EE EEE
Sbjct: 81 KEEEEEEEEKEESEEE 96
Score = 25.3 bits (56), Expect = 2.8
Identities = 11/19 (57%), Positives = 13/19 (68%)
Query: 61 EGGPGKEEEEEEEEEEEEE 79
K+EEEEEEEE+EE
Sbjct: 75 AAAEEKKEEEEEEEEKEES 93
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 26.7 bits (59), Expect = 1.6
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 61 EGGPGKEEEEEEEEEEEEENGPQT 84
E +EEE +EEEE+EE +T
Sbjct: 51 EEKTTDKEEEVDEEEEKEEKKKKT 74
>gnl|CDD|215967 pfam00519, PPV_E1_C, Papillomavirus helicase. This protein is a
DNA helicase that is required for initiation of viral
DNA replication. This protein forms a complex with the
E2 protein pfam00508.
Length = 432
Score = 26.7 bits (60), Expect = 1.6
Identities = 8/16 (50%), Positives = 12/16 (75%)
Query: 71 EEEEEEEEENGPQTTF 86
++E+EEE+ PQ TF
Sbjct: 415 SDQEDEEEDGEPQRTF 430
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 26.7 bits (59), Expect = 1.7
Identities = 13/36 (36%), Positives = 15/36 (41%)
Query: 53 IVRGGGGVEGGPGKEEEEEEEEEEEEENGPQTTFEV 88
+V E +EEEEE E E P FEV
Sbjct: 397 VVERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGFEV 432
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The
domain is found in the primary vegetative sigma factor.
The function of this domain is unclear and can be
removed without loss of function.
Length = 211
Score = 26.4 bits (59), Expect = 1.7
Identities = 8/16 (50%), Positives = 14/16 (87%)
Query: 67 EEEEEEEEEEEEENGP 82
+E+E+E++EEE + GP
Sbjct: 60 DEDEDEDDEEEADLGP 75
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 26.3 bits (59), Expect = 2.3
Identities = 15/35 (42%), Positives = 18/35 (51%)
Query: 53 IVRGGGGVEGGPGKEEEEEEEEEEEEENGPQTTFE 87
I+ G G +EEEEE EEEEEEE +
Sbjct: 221 ILEGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEA 255
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 26.1 bits (58), Expect = 2.4
Identities = 25/80 (31%), Positives = 35/80 (43%), Gaps = 22/80 (27%)
Query: 23 RKEVIRNKIRA--------IGKMARVFSVLRPLLSVIYIVRGGGGVEGGPG-------KE 67
+ IR+ IR+ I MAR+FS+L + + Y+ R P KE
Sbjct: 176 QNSSIRSNIRSNKYLVSELINSMARIFSILLFITCLYYLGRI-------PSPFFTKKLKE 228
Query: 68 EEEEEEEEEEEENGPQTTFE 87
E EE EEE + +TT E
Sbjct: 229 TSETEEREEETDVEIETTSE 248
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 26.2 bits (57), Expect = 2.6
Identities = 10/18 (55%), Positives = 13/18 (72%)
Query: 66 KEEEEEEEEEEEEENGPQ 83
K+ +EE+EEEE EE Q
Sbjct: 218 KQPQEEQEEEEVEEEAKQ 235
Score = 25.8 bits (56), Expect = 2.8
Identities = 12/25 (48%), Positives = 16/25 (64%)
Query: 61 EGGPGKEEEEEEEEEEEEENGPQTT 85
E P +E+EEEE EEE ++ Q T
Sbjct: 217 EKQPQEEQEEEEVEEEAKQEEGQGT 241
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 25.7 bits (57), Expect = 2.9
Identities = 9/21 (42%), Positives = 14/21 (66%)
Query: 59 GVEGGPGKEEEEEEEEEEEEE 79
E ++EE+EEEE++EE
Sbjct: 108 ASEEDESDDDEEDEEEEDDEE 128
Score = 25.7 bits (57), Expect = 3.0
Identities = 8/18 (44%), Positives = 13/18 (72%)
Query: 67 EEEEEEEEEEEEENGPQT 84
+EE+++E+E EEE P
Sbjct: 126 DEEDDDEDESEEEESPVK 143
Score = 25.0 bits (55), Expect = 5.1
Identities = 8/13 (61%), Positives = 13/13 (100%)
Query: 67 EEEEEEEEEEEEE 79
EE+EEEE++EE++
Sbjct: 118 EEDEEEEDDEEDD 130
Score = 24.6 bits (54), Expect = 6.9
Identities = 7/13 (53%), Positives = 13/13 (100%)
Query: 67 EEEEEEEEEEEEE 79
EEE++EE+++E+E
Sbjct: 122 EEEDDEEDDDEDE 134
Score = 24.6 bits (54), Expect = 7.2
Identities = 8/13 (61%), Positives = 13/13 (100%)
Query: 67 EEEEEEEEEEEEE 79
+EE+EEEE++EE+
Sbjct: 117 DEEDEEEEDDEED 129
Score = 24.6 bits (54), Expect = 7.7
Identities = 7/13 (53%), Positives = 13/13 (100%)
Query: 67 EEEEEEEEEEEEE 79
E+EEEE++EE+++
Sbjct: 119 EDEEEEDDEEDDD 131
Score = 24.6 bits (54), Expect = 8.2
Identities = 7/13 (53%), Positives = 13/13 (100%)
Query: 67 EEEEEEEEEEEEE 79
EEEE++EE+++E+
Sbjct: 121 EEEEDDEEDDDED 133
Score = 24.2 bits (53), Expect = 9.4
Identities = 7/13 (53%), Positives = 13/13 (100%)
Query: 67 EEEEEEEEEEEEE 79
+EEEE++EE+++E
Sbjct: 120 DEEEEDDEEDDDE 132
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 26.0 bits (57), Expect = 2.9
Identities = 14/29 (48%), Positives = 17/29 (58%)
Query: 51 IYIVRGGGGVEGGPGKEEEEEEEEEEEEE 79
+ I G E G+E+EEEE EEE EE
Sbjct: 73 VRISEDGEEEEVEEGEEDEEEEGEEESEE 101
Score = 24.8 bits (54), Expect = 6.8
Identities = 13/28 (46%), Positives = 15/28 (53%)
Query: 60 VEGGPGKEEEEEEEEEEEEENGPQTTFE 87
E G +E EE EE+EEEE FE
Sbjct: 76 SEDGEEEEVEEGEEDEEEEGEEESEEFE 103
>gnl|CDD|233307 TIGR01195, oadG_fam, sodium pump decarboxylases, gamma subunit.
This model finds the subfamily of distantly related,
low complexity, hydrophobic small subunits of several
related sodium ion-pumping decarboxylases. These
include oxaloacetate decarboxylase gamma subunit and
methylmalonyl-CoA decarboxylase delta subunit. Most
sequences scoring between the noise and trusted cutoffs
are eukaryotic sodium channel proteins.
Length = 82
Score = 25.1 bits (55), Expect = 3.0
Identities = 14/43 (32%), Positives = 18/43 (41%), Gaps = 3/43 (6%)
Query: 37 MARVFSVLRPLLSVIYIVRGGGGVEGGPGKEEEEEEEEEEEEE 79
M VF L L IY VRG G V G E ++ ++
Sbjct: 15 MGIVFLFLSLL---IYAVRGMGKVVGRKKVPEPPASAKKSKKA 54
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 25.4 bits (56), Expect = 3.0
Identities = 13/26 (50%), Positives = 14/26 (53%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEENG 81
E +EEEEEEEEEE EE
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEEA 96
Score = 25.4 bits (56), Expect = 3.2
Identities = 13/26 (50%), Positives = 13/26 (50%)
Query: 57 GGGVEGGPGKEEEEEEEEEEEEENGP 82
EEEEEEEEEEEEE
Sbjct: 66 AAAAAAAAAAEEEEEEEEEEEEEEEE 91
Score = 25.0 bits (55), Expect = 3.7
Identities = 13/24 (54%), Positives = 14/24 (58%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
+EEEEEEEEEEEEE
Sbjct: 66 AAAAAAAAAAEEEEEEEEEEEEEE 89
Score = 24.6 bits (54), Expect = 5.8
Identities = 12/16 (75%), Positives = 13/16 (81%)
Query: 66 KEEEEEEEEEEEEENG 81
+EEEEEEEEE EEE
Sbjct: 82 EEEEEEEEEESEEEAM 97
Score = 24.2 bits (53), Expect = 7.6
Identities = 12/23 (52%), Positives = 13/23 (56%)
Query: 57 GGGVEGGPGKEEEEEEEEEEEEE 79
+EEEEEEEEEEEE
Sbjct: 70 AAAAAAEEEEEEEEEEEEEEEES 92
>gnl|CDD|212001 TIGR04278, viperin, antiviral radical SAM protein viperin.
Viperin (Virus Inhibitory Protein, ER-associated,
Iterferon-inducible) is a radical SAM enzyme found in
human and other vertebrates. It is both induced by
interferon and demonstrably active in blocking
replication by several types of virus, apparently by
modifying lipid chemistries in lipid droplets and
membrane rafts.
Length = 347
Score = 25.8 bits (57), Expect = 3.1
Identities = 12/47 (25%), Positives = 19/47 (40%)
Query: 42 SVLRPLLSVIYIVRGGGGVEGGPGKEEEEEEEEEEEEENGPQTTFEV 88
S+L L + + G + E +EEEE+ + P T V
Sbjct: 15 SLLSLLCWLRAALWLAGSEKSRQQLRGEPTRKEEEEDPDQPTTPTSV 61
>gnl|CDD|185747 cd09234, V_HD-PTP_like, Protein-interacting V-domain of mammalian
His-Domain type N23 protein tyrosine phosphatase and
related domains. This family contains the V-shaped (V)
domain of mammalian His-Domain type N23 protein tyrosine
phosphatase (HD-PTP, also known as PTPN23) and related
domains. It belongs to the V_Alix_like superfamily which
includes the V domains of Bro1 and Rim20 (also known as
PalA) from Saccharomyces cerevisiae, mammalian Alix
(apoptosis-linked gene-2 interacting protein X/ also
known as apoptosis-linked gene-2 interacting protein 1,
AIP1), and related domains. HD_PTP interacts with the
ESCRT (Endosomal Sorting Complexes Required for
Transport) system, and participates in cell migration
and endosomal trafficking. The related Alix V-domain
(belonging to a different family in this superfamily)
contains a binding site, partially conserved in the
superfamily, for the retroviral late assembly (L) domain
YPXnL motif. The Alix V-domain is also a dimerization
domain. In addition to the V-domain, HD_PTP also has an
N-terminal Bro1-like domain, a proline-rich region
(PRR), a catalytically inactive tyrosine phosphatase
domain, and a region containing a PEST motif. Bro1-like
domains bind components of the ESCRT-III complex,
specifically to CHMP4 in the case of HD-PTP. The
Bro1-like domain of HD-PTP can also bind human
immunodeficiency virus type 1 (HIV-1) nucleocapsid.
HD-PTP is encoded by the PTPN23 gene, a tumor suppressor
gene candidate frequently absent in human kidney,
breast, lung, and cervical tumors. This family also
contains Drosophila Myopic, which promotes epidermal
growth factor receptor (EGFR) signaling, and
Caenorhabditis elegans (enhancer of glp-1) EGO-2 which
promotes Notch signaling.
Length = 337
Score = 25.7 bits (57), Expect = 3.1
Identities = 22/80 (27%), Positives = 41/80 (51%), Gaps = 23/80 (28%)
Query: 8 DLVLNCLISGAAANLRKEVIRNKIRAIGKMARVF----SVLRPLLSVIYIVRGGGGVEGG 63
DLV C AA ++R + I+N + A+G+++ V+ ++L + S++
Sbjct: 59 DLVERC----AALSVRPDTIKNLVEAMGELSDVYQDVEAMLNEIESLL------------ 102
Query: 64 PGKEEEEEEEEEEEEENGPQ 83
EEEE +E+E +E G +
Sbjct: 103 ---EEEELQEKEFQEAVGKR 119
>gnl|CDD|165509 PHA03250, PHA03250, UL35; Provisional.
Length = 564
Score = 25.9 bits (57), Expect = 3.3
Identities = 10/37 (27%), Positives = 14/37 (37%)
Query: 47 LLSVIYIVRGGGGVEGGPGKEEEEEEEEEEEEENGPQ 83
LL +Y +R G E +E E E + P
Sbjct: 342 LLLRLYNLRDATHCAGLDDDLEGQESAELEVDPTSPY 378
>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
Length = 356
Score = 25.7 bits (56), Expect = 3.4
Identities = 11/15 (73%), Positives = 14/15 (93%)
Query: 66 KEEEEEEEEEEEEEN 80
+E++EEEE EEEEEN
Sbjct: 336 QEQQEEEENEEEEEN 350
Score = 24.9 bits (54), Expect = 7.1
Identities = 9/15 (60%), Positives = 12/15 (80%)
Query: 67 EEEEEEEEEEEEENG 81
+E++EEEE EEEE
Sbjct: 336 QEQQEEEENEEEEEN 350
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 25.4 bits (56), Expect = 3.5
Identities = 9/29 (31%), Positives = 10/29 (34%)
Query: 57 GGGVEGGPGKEEEEEEEEEEEEENGPQTT 85
G G GP E E E E+E
Sbjct: 117 GYGAPDGPPSELGSESETSEKETTAKVEK 145
>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin).
Length = 585
Score = 25.7 bits (56), Expect = 3.6
Identities = 10/28 (35%), Positives = 14/28 (50%)
Query: 55 RGGGGVEGGPGKEEEEEEEEEEEEENGP 82
RG G + +E EE EE+E + P
Sbjct: 297 RGRGRSQNRAERERSEESEEKELDRASP 324
>gnl|CDD|237798 PRK14714, PRK14714, DNA polymerase II large subunit; Provisional.
Length = 1337
Score = 25.4 bits (56), Expect = 4.2
Identities = 8/28 (28%), Positives = 14/28 (50%)
Query: 60 VEGGPGKEEEEEEEEEEEEENGPQTTFE 87
++G + + EE EEEE + +E
Sbjct: 277 IDGTIEGKSDAGEEGEEEELSTDADEYE 304
>gnl|CDD|182380 PRK10328, PRK10328, DNA binding protein, nucleoid-associated;
Provisional.
Length = 134
Score = 25.2 bits (55), Expect = 4.3
Identities = 18/57 (31%), Positives = 26/57 (45%), Gaps = 13/57 (22%)
Query: 29 NKIRAIGKMARVFS--VLRPLLSVIYIVRGGGGVEGGPGKEEEEEEEEEEEEENGPQ 83
N IR + MAR FS VL +L +V +E EEEE+++ E +
Sbjct: 9 NNIRTLRAMAREFSIDVLEEMLEKFRVV-----------TKERREEEEQQQRELAER 54
>gnl|CDD|217135 pfam02601, Exonuc_VII_L, Exonuclease VII, large subunit. This
family consist of exonuclease VII, large subunit
EC:3.1.11.6 This enzyme catalyzes exonucleolytic
cleavage in either 5'->3' or 3'->5' direction to yield
5'-phosphomononucleotides. This exonuclease VII enzyme
is composed of one large subunit and 4 small ones.
Length = 295
Score = 25.3 bits (56), Expect = 4.5
Identities = 12/60 (20%), Positives = 22/60 (36%), Gaps = 9/60 (15%)
Query: 4 FYLTDLVL-NCLISGAAANLRKEVIRNKIRAIGKMARVFSVLRPLLSVIYIVRGGGGVEG 62
L ++ + L+ G A + ++ + + VI I+RGGG E
Sbjct: 39 GPLVEIEIYPTLVQGDGA--AESIVSA-LERANERETAKDY-----DVIVIIRGGGSKED 90
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 25.3 bits (56), Expect = 4.5
Identities = 9/30 (30%), Positives = 16/30 (53%)
Query: 52 YIVRGGGGVEGGPGKEEEEEEEEEEEEENG 81
+ G G+E+EEEEE+ ++E+
Sbjct: 306 FEPDDDDNFGLGQGEEDEEEEEDGVDDEDE 335
>gnl|CDD|216868 pfam02084, Bindin, Bindin.
Length = 239
Score = 25.2 bits (55), Expect = 4.6
Identities = 9/14 (64%), Positives = 12/14 (85%)
Query: 68 EEEEEEEEEEEENG 81
+EEEEEEE++ NG
Sbjct: 159 QEEEEEEEQDAANG 172
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 25.1 bits (56), Expect = 5.0
Identities = 9/21 (42%), Positives = 14/21 (66%)
Query: 61 EGGPGKEEEEEEEEEEEEENG 81
G EE +++E+EEEEE+
Sbjct: 184 HVGSELEELDDDEDEEEEEDE 204
Score = 24.4 bits (54), Expect = 9.5
Identities = 8/21 (38%), Positives = 14/21 (66%)
Query: 61 EGGPGKEEEEEEEEEEEEENG 81
G +E +++E+EEEEE+
Sbjct: 185 VGSELEELDDDEDEEEEEDEN 205
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel,
four-helix bundle structure with a large hydrophobic
patch in which 23 residues of one monomer form van der
Waals contacts with the other monomer. This CENP-B
dimer configuration may be suitable for capturing two
distant CENP-B boxes during centromeric heterochromatin
formation.
Length = 101
Score = 24.7 bits (53), Expect = 5.0
Identities = 6/21 (28%), Positives = 15/21 (71%)
Query: 60 VEGGPGKEEEEEEEEEEEEEN 80
+EG + + +EEE++++E+
Sbjct: 6 LEGEEDSDSDSDEEEDDDDED 26
>gnl|CDD|218187 pfam04637, Herpes_pp85, Herpesvirus phosphoprotein 85 (HHV6-7
U14/HCMV UL25). This family includes UL25 proteins from
HCMV, as well as U14 proteins from HHV 6 and HHV7. These
85 kD phosphoproteins appear to act as structural
antigens, but their precise function is otherwise
unknown.
Length = 502
Score = 25.4 bits (56), Expect = 5.0
Identities = 11/44 (25%), Positives = 19/44 (43%), Gaps = 5/44 (11%)
Query: 40 VFSVLRPLLSVIYIVRGGGGVEGGPGKEEEEEEEEEEEEENGPQ 83
V LR LL+ + +R G + + E ++ E + GP
Sbjct: 318 VRFALRLLLNRLSRLRDAEA-----GLDSDSESDDLELDPTGPY 356
>gnl|CDD|234635 PRK00104, scpA, segregation and condensation protein A; Reviewed.
Length = 242
Score = 25.2 bits (56), Expect = 5.1
Identities = 9/16 (56%), Positives = 11/16 (68%)
Query: 64 PGKEEEEEEEEEEEEE 79
P EE EEEEE+ +E
Sbjct: 75 PKVEEFEEEEEDPRQE 90
>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
Length = 613
Score = 25.2 bits (56), Expect = 5.3
Identities = 8/16 (50%), Positives = 12/16 (75%), Gaps = 1/16 (6%)
Query: 72 EEEEEEEENG-PQTTF 86
++E+E E+G PQ TF
Sbjct: 587 SDQEDEGEDGEPQRTF 602
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The
CAF-1 or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression
in addition to DNA-based checkpoints.
Length = 76
Score = 24.5 bits (54), Expect = 5.3
Identities = 6/13 (46%), Positives = 13/13 (100%)
Query: 67 EEEEEEEEEEEEE 79
E+EE+EEE+++++
Sbjct: 60 EDEEDEEEDDDDD 72
Score = 24.1 bits (53), Expect = 7.0
Identities = 6/15 (40%), Positives = 14/15 (93%)
Query: 67 EEEEEEEEEEEEENG 81
+EE+EEE+++++ +G
Sbjct: 61 DEEDEEEDDDDDMDG 75
>gnl|CDD|218652 pfam05602, CLPTM1, Cleft lip and palate transmembrane protein 1
(CLPTM1). This family consists of several eukaryotic
cleft lip and palate transmembrane protein 1 sequences.
Cleft lip with or without cleft palate is a common birth
defect that is genetically complex. The nonsyndromic
forms have been studied genetically using linkage and
candidate-gene association studies with only partial
success in defining the loci responsible for orofacial
clefting. CLPTM1 encodes a transmembrane protein and has
strong homology to two Caenorhabditis elegans genes,
suggesting that CLPTM1 may belong to a new gene family.
This family also contains the human cisplatin resistance
related protein CRR9p which is associated with
CDDP-induced apoptosis.
Length = 437
Score = 25.3 bits (56), Expect = 5.5
Identities = 7/17 (41%), Positives = 10/17 (58%)
Query: 67 EEEEEEEEEEEEENGPQ 83
+ E+EE EEE+ P
Sbjct: 161 GKSEKEEPEEEKTPAPD 177
>gnl|CDD|226744 COG4294, Uve, UV damage repair endonuclease [DNA replication,
recombination, and repair].
Length = 347
Score = 25.2 bits (55), Expect = 5.5
Identities = 12/44 (27%), Positives = 20/44 (45%), Gaps = 3/44 (6%)
Query: 23 RKEVIRNKIRAIGKMARVFSVLR-PLLSVIYIVRGG--GGVEGG 63
R+EV+ + IR + R+ + SV + GG GG +
Sbjct: 146 REEVVDSSIRDLAYHYRILDGMGLAERSVWNLHLGGTHGGKKER 189
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 25.3 bits (55), Expect = 5.6
Identities = 8/15 (53%), Positives = 10/15 (66%)
Query: 66 KEEEEEEEEEEEEEN 80
EE E EEE E+ E+
Sbjct: 380 PEESEHEEEVEDYED 394
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 24.9 bits (54), Expect = 5.7
Identities = 9/19 (47%), Positives = 10/19 (52%)
Query: 67 EEEEEEEEEEEEENGPQTT 85
EE+EE EE EEE
Sbjct: 47 EEDEEAEEVEEETAASSKA 65
Score = 24.5 bits (53), Expect = 7.5
Identities = 9/13 (69%), Positives = 11/13 (84%)
Query: 67 EEEEEEEEEEEEE 79
+EE+EE EE EEE
Sbjct: 46 DEEDEEAEEVEEE 58
>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
Length = 653
Score = 25.1 bits (55), Expect = 6.0
Identities = 12/30 (40%), Positives = 16/30 (53%)
Query: 36 KMARVFSVLRPLLSVIYIVRGGGGVEGGPG 65
K V SV P+++ +Y GGG G PG
Sbjct: 597 KQKEVESVCNPIMTKMYQAAGGGMPGGMPG 626
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 25.0 bits (54), Expect = 6.9
Identities = 6/25 (24%), Positives = 15/25 (60%)
Query: 61 EGGPGKEEEEEEEEEEEEENGPQTT 85
+ G + + EE++++ EE ++T
Sbjct: 107 QDTKGSKTDSEEDDDDSEEEDNKST 131
>gnl|CDD|114257 pfam05525, Branch_AA_trans, Branched-chain amino acid transport
protein. This family consists of several bacterial
branched-chain amino acid transport proteins which are
responsible for the transport of leucine, isoleucine
and valine via proton motive force.
Length = 428
Score = 24.9 bits (55), Expect = 7.1
Identities = 9/20 (45%), Positives = 13/20 (65%)
Query: 43 VLRPLLSVIYIVRGGGGVEG 62
V PLL +I + + GGG+E
Sbjct: 48 VGLPLLGIIAVAKSGGGLES 67
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 24.9 bits (54), Expect = 7.2
Identities = 11/14 (78%), Positives = 12/14 (85%)
Query: 66 KEEEEEEEEEEEEE 79
EEEE E+EEEEEE
Sbjct: 462 SEEEEGEDEEEEEE 475
Score = 24.5 bits (53), Expect = 9.8
Identities = 10/15 (66%), Positives = 12/15 (80%)
Query: 66 KEEEEEEEEEEEEEN 80
+E EEEE E+EEEE
Sbjct: 460 QESEEEEGEDEEEEE 474
>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
Length = 499
Score = 25.0 bits (54), Expect = 7.2
Identities = 11/20 (55%), Positives = 13/20 (65%)
Query: 64 PGKEEEEEEEEEEEEENGPQ 83
P E EEE+EEEE E P+
Sbjct: 96 PDTEAEEEDEEEEIEAPDPE 115
>gnl|CDD|219956 pfam08658, Rad54_N, Rad54 N terminal. This is the N terminal of
the DNA repair protein Rad54.
Length = 191
Score = 24.6 bits (54), Expect = 7.3
Identities = 7/19 (36%), Positives = 13/19 (68%)
Query: 66 KEEEEEEEEEEEEENGPQT 84
++++E EE E + NGP+
Sbjct: 153 EKDQEPEESETKLSNGPKV 171
>gnl|CDD|180650 PRK06665, flgK, flagellar hook-associated protein FlgK; Validated.
Length = 627
Score = 25.0 bits (55), Expect = 7.4
Identities = 8/22 (36%), Positives = 12/22 (54%)
Query: 17 GAAANLRKEVIRNKIRAIGKMA 38
GA LR IR++I + +A
Sbjct: 283 GALIELRDTDIRDEINELDTLA 304
>gnl|CDD|215973 pfam00527, E7, E7 protein, Early protein.
Length = 92
Score = 24.1 bits (53), Expect = 7.7
Identities = 8/19 (42%), Positives = 11/19 (57%)
Query: 67 EEEEEEEEEEEEENGPQTT 85
+ EEEEE+EE+ P
Sbjct: 29 DSEEEEEDEEDRGQPPAQQ 47
>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
Length = 200
Score = 24.5 bits (53), Expect = 7.8
Identities = 10/28 (35%), Positives = 14/28 (50%)
Query: 58 GGVEGGPGKEEEEEEEEEEEEENGPQTT 85
GG + G E + E E E E+ P+ T
Sbjct: 108 GGEKPDQGPEAKGEGEGHEPEDPPPEDT 135
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 24.7 bits (54), Expect = 8.1
Identities = 10/15 (66%), Positives = 12/15 (80%)
Query: 65 GKEEEEEEEEEEEEE 79
K+EEEEEEEE+E
Sbjct: 180 EKKEEEEEEEEDEAL 194
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This
is a family of uncharacterized proteins.
Length = 449
Score = 24.6 bits (53), Expect = 8.8
Identities = 9/15 (60%), Positives = 12/15 (80%)
Query: 65 GKEEEEEEEEEEEEE 79
GKE EEE EEE+++
Sbjct: 42 GKEAEEEAMEEEDDD 56
Score = 24.6 bits (53), Expect = 8.9
Identities = 9/19 (47%), Positives = 13/19 (68%)
Query: 63 GPGKEEEEEEEEEEEEENG 81
G EEE EEE+++EE+
Sbjct: 42 GKEAEEEAMEEEDDDEEDD 60
>gnl|CDD|215153 PLN02271, PLN02271, serine hydroxymethyltransferase.
Length = 586
Score = 24.4 bits (53), Expect = 9.4
Identities = 7/19 (36%), Positives = 12/19 (63%)
Query: 61 EGGPGKEEEEEEEEEEEEE 79
E + +E+EEE+ E+E
Sbjct: 60 EDAGEEGDEDEEEQGEDEH 78
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 24.2 bits (53), Expect = 9.4
Identities = 8/29 (27%), Positives = 13/29 (44%)
Query: 52 YIVRGGGGVEGGPGKEEEEEEEEEEEEEN 80
+ E +E E EE +EEE+ +
Sbjct: 93 LLGLDKKEKEEEEEEEVEVEELDEEEQID 121
>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and
L12p. Ribosomal proteins P1 and P2 are the eukaryotic
proteins that are functionally equivalent to bacterial
L7/L12. L12p is the archaeal homolog. Unlike other
ribosomal proteins, the archaeal L12p and eukaryotic P1
and P2 do not share sequence similarity with their
bacterial counterparts. They are part of the ribosomal
stalk (called the L7/L12 stalk in bacteria), along with
28S rRNA and the proteins L11 and P0 in eukaryotes (23S
rRNA, L11, and L10e in archaea). In bacterial
ribosomes, L7/L12 homodimers bind the extended
C-terminal helix of L10 to anchor the L7/L12 molecules
to the ribosome. Eukaryotic P1/P2 heterodimers and
archaeal L12p homodimers are believed to bind the L10
equivalent proteins, eukaryotic P0 and archaeal L10e,
in a similar fashion. P1 and P2 (L12p, L7/L12) are the
only proteins in the ribosome to occur as multimers,
always appearing as sets of dimers. Recent data
indicate that most archaeal species contain six copies
of L12p (three homodimers), while eukaryotes have two
copies each of P1 and P2 (two heterodimers). Bacteria
may have four or six copies (two or three homodimers),
depending on the species. As in bacteria, the stalk is
crucial for binding of initiation, elongation, and
release factors in eukaryotes and archaea.
Length = 105
Score = 24.1 bits (52), Expect = 9.8
Identities = 10/24 (41%), Positives = 12/24 (50%)
Query: 56 GGGGVEGGPGKEEEEEEEEEEEEE 79
K EE +EEEEEEE+
Sbjct: 74 TAAATAEPAEKAEEAKEEEEEEED 97
>gnl|CDD|220077 pfam08938, DUF1916, Domain of unknown function (DUF1916). This
domain is found in various eukaryotic HBS1-like
proteins.
Length = 148
Score = 24.3 bits (53), Expect = 9.9
Identities = 8/13 (61%), Positives = 11/13 (84%)
Query: 67 EEEEEEEEEEEEE 79
EEE+E+E E+E E
Sbjct: 57 EEEDEDEYEDENE 69
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.137 0.386
Gapped
Lambda K H
0.267 0.0696 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 4,826,499
Number of extensions: 426621
Number of successful extensions: 2817
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2215
Number of HSP's successfully gapped: 336
Length of query: 88
Length of database: 10,937,602
Length adjustment: 56
Effective length of query: 32
Effective length of database: 8,453,778
Effective search space: 270520896
Effective search space used: 270520896
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 53 (24.3 bits)