RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17102
(412 letters)
>gnl|CDD|144376 pfam00751, DM, DM DNA binding domain. The DM domain is named after
dsx and mab-3. dsx contains a single amino-terminal DM
domain, whereas mab-3 contains two amino-terminal
domains. The DM domain has a pattern of conserved zinc
chelating residues C2H2C4. The dsx DM domain has been
shown to dimerise and bind palindromic DNA.
Length = 46
Score = 103 bits (258), Expect = 1e-27
Identities = 36/46 (78%), Positives = 41/46 (89%)
Query: 56 TPKCARCRNHGVVSALKGHKRYCRWRDCVCAKCTLIAERQRVMAAQ 101
+PKCARCRNHGV+S LKGHKRYC +RDC C KC L+AERQR+MAAQ
Sbjct: 1 SPKCARCRNHGVLSPLKGHKRYCPYRDCTCEKCALVAERQRLMAAQ 46
>gnl|CDD|214606 smart00301, DM, Doublesex DNA-binding motif.
Length = 54
Score = 96.7 bits (241), Expect = 3e-25
Identities = 29/54 (53%), Positives = 38/54 (70%)
Query: 55 RTPKCARCRNHGVVSALKGHKRYCRWRDCVCAKCTLIAERQRVMAAQVALRRQQ 108
R P C +C NHGV LKGHK C +RDC C KCTL+ +R+ +MA Q+ L+R+Q
Sbjct: 1 RIPYCQKCENHGVKVPLKGHKPECPFRDCECEKCTLVEKRRALMALQIKLKREQ 54
>gnl|CDD|112299 pfam03474, DMA, DMRTA motif. This region is found to the
C-terminus of the pfam00751. DM-domain proteins with
this motif are known as DMRTA proteins. The function of
this region is unknown.
Length = 39
Score = 60.1 bits (146), Expect = 3e-12
Identities = 25/37 (67%), Positives = 32/37 (86%)
Query: 293 RSPVDVLRRVFPNRRRGEVETVLQRCKGDVLQAIEMM 329
RSP+DVL ++FPNR+R +E VL+RCKGDVLQAIE+
Sbjct: 2 RSPIDVLMKIFPNRKRSVLELVLKRCKGDVLQAIELA 38
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 34.9 bits (81), Expect = 0.013
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEE 24
A KK+EEEEEEE+E+ EEE
Sbjct: 76 AAEEKKEEEEEEEEKEESEEEAAA 99
Score = 33.4 bits (77), Expect = 0.050
Identities = 13/21 (61%), Positives = 17/21 (80%)
Query: 1 EARRKKKEEEEEEEEEKQEEE 21
A +KKEEEEEEEE+++ EE
Sbjct: 75 AAAEEKKEEEEEEEEKEESEE 95
Score = 31.8 bits (73), Expect = 0.14
Identities = 12/24 (50%), Positives = 18/24 (75%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEE 24
A ++++EEEEEEE++EE EE
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEEE 96
Score = 31.5 bits (72), Expect = 0.24
Identities = 9/23 (39%), Positives = 17/23 (73%)
Query: 6 KKEEEEEEEEEKQEEEGEEKKDG 28
EE++EEEE++EE+ E +++
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEEA 97
Score = 28.8 bits (65), Expect = 1.7
Identities = 9/22 (40%), Positives = 16/22 (72%)
Query: 1 EARRKKKEEEEEEEEEKQEEEG 22
E + +++EEEE+EE E++ G
Sbjct: 79 EKKEEEEEEEEKEESEEEAAAG 100
Score = 28.8 bits (65), Expect = 1.8
Identities = 7/20 (35%), Positives = 14/20 (70%)
Query: 8 EEEEEEEEEKQEEEGEEKKD 27
EE++E++EEE E+++
Sbjct: 74 AAAAEEKKEEEEEEEEKEES 93
Score = 28.4 bits (64), Expect = 2.3
Identities = 9/20 (45%), Positives = 13/20 (65%)
Query: 8 EEEEEEEEEKQEEEGEEKKD 27
EE+K+EEE EE+K+
Sbjct: 72 AAAAAAEEKKEEEEEEEEKE 91
Score = 28.0 bits (63), Expect = 3.4
Identities = 8/23 (34%), Positives = 12/23 (52%)
Query: 8 EEEEEEEEEKQEEEGEEKKDGPT 30
EEK+EEE EE++ +
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEES 93
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 36.1 bits (83), Expect = 0.035
Identities = 17/31 (54%), Positives = 24/31 (77%)
Query: 6 KKEEEEEEEEEKQEEEGEEKKDGPTTPKLSL 36
++EEEEEEEEE++EEE EE+++ LSL
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEENEEPLSL 896
Score = 35.7 bits (82), Expect = 0.053
Identities = 16/29 (55%), Positives = 23/29 (79%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGP 29
E +++EEEEEEEEE++EEE EE+ + P
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEEENEEP 893
Score = 35.0 bits (80), Expect = 0.073
Identities = 15/27 (55%), Positives = 23/27 (85%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKD 27
E +++EEEEEEEEE++EEE EE+++
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEEN 890
Score = 29.6 bits (66), Expect = 3.5
Identities = 14/31 (45%), Positives = 20/31 (64%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPTT 31
E +++EEEEEEEEE++E E + P T
Sbjct: 871 EEEEEEEEEEEEEEEEEEENEEPLSLEWPET 901
Score = 28.8 bits (64), Expect = 6.2
Identities = 12/20 (60%), Positives = 18/20 (90%)
Query: 8 EEEEEEEEEKQEEEGEEKKD 27
+ EEEEEEE++EEE EE+++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEE 881
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 33.5 bits (77), Expect = 0.044
Identities = 14/24 (58%), Positives = 19/24 (79%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEE 24
A +++EEEEEEEEE++EE EE
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEEE 95
Score = 33.1 bits (76), Expect = 0.062
Identities = 12/22 (54%), Positives = 16/22 (72%)
Query: 1 EARRKKKEEEEEEEEEKQEEEG 22
+++EEEEEEEEE+ EEE
Sbjct: 75 AEEEEEEEEEEEEEEEESEEEA 96
Score = 31.6 bits (72), Expect = 0.18
Identities = 12/21 (57%), Positives = 16/21 (76%)
Query: 7 KEEEEEEEEEKQEEEGEEKKD 27
EEEEEEE++EEE EE+ +
Sbjct: 73 AAAEEEEEEEEEEEEEEEESE 93
Score = 30.4 bits (69), Expect = 0.54
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGE 23
E +++EEEEEEEEE +EE
Sbjct: 76 EEEEEEEEEEEEEEEESEEEAMA 98
Score = 30.0 bits (68), Expect = 0.68
Identities = 13/24 (54%), Positives = 18/24 (75%)
Query: 5 KKKEEEEEEEEEKQEEEGEEKKDG 28
EEEEEEEEE++EEE E +++
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEA 96
Score = 29.6 bits (67), Expect = 0.81
Identities = 9/20 (45%), Positives = 14/20 (70%)
Query: 8 EEEEEEEEEKQEEEGEEKKD 27
EEEE++EEE EE+++
Sbjct: 71 AAAAAEEEEEEEEEEEEEEE 90
Score = 29.2 bits (66), Expect = 1.4
Identities = 13/22 (59%), Positives = 18/22 (81%)
Query: 7 KEEEEEEEEEKQEEEGEEKKDG 28
+EEEEEEEEE++EEE E++
Sbjct: 76 EEEEEEEEEEEEEEEESEEEAM 97
Score = 28.9 bits (65), Expect = 1.6
Identities = 8/21 (38%), Positives = 13/21 (61%)
Query: 9 EEEEEEEEKQEEEGEEKKDGP 29
EEE++EEE EE+++
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEE 91
Score = 28.1 bits (63), Expect = 3.5
Identities = 10/20 (50%), Positives = 15/20 (75%)
Query: 8 EEEEEEEEEKQEEEGEEKKD 27
EEEEE++EEE EE+++
Sbjct: 72 AAAAEEEEEEEEEEEEEEEE 91
Score = 27.7 bits (62), Expect = 4.6
Identities = 10/22 (45%), Positives = 15/22 (68%)
Query: 1 EARRKKKEEEEEEEEEKQEEEG 22
E +++EEEEEEE E++ G
Sbjct: 78 EEEEEEEEEEEEEESEEEAMAG 99
>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
Provisional.
Length = 130
Score = 33.6 bits (77), Expect = 0.061
Identities = 13/27 (48%), Positives = 17/27 (62%), Gaps = 2/27 (7%)
Query: 3 RRKKKEEEEEEEEE--KQEEEGEEKKD 27
KK+EE+EEE E K+EE E +D
Sbjct: 94 TSKKEEEKEEEIPEPTKEEELLGEIRD 120
>gnl|CDD|202430 pfam02845, CUE, CUE domain. CUE domains have been shown to bind
ubiquitin. It has been suggested that CUE domains are
related to pfam00627 and this has been confirmed by the
structure of the domain. CUE domains also occur in two
protein of the IL-1 signal transduction pathway, tollip
and TAB2.
Length = 42
Score = 31.3 bits (72), Expect = 0.061
Identities = 10/33 (30%), Positives = 18/33 (54%)
Query: 296 VDVLRRVFPNRRRGEVETVLQRCKGDVLQAIEM 328
++ L+ +FP+ + VLQ G+V AI+
Sbjct: 5 LETLKEMFPDLDEEVIRAVLQASNGNVEAAIDA 37
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast)
and GNL3L (human) are putative GTPases which are
required for growth and play a role in processing of
nucleolar pre-rRNA. This family contains a potential
nuclear localisation signal.
Length = 80
Score = 32.2 bits (74), Expect = 0.062
Identities = 11/28 (39%), Positives = 19/28 (67%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDG 28
E +RK++EE+E +E ++ E E +K G
Sbjct: 53 EKKRKQEEEKERRKEARKAERAEARKRG 80
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 33.1 bits (76), Expect = 0.063
Identities = 12/24 (50%), Positives = 14/24 (58%)
Query: 5 KKKEEEEEEEEEKQEEEGEEKKDG 28
E +E EEEEK+EE EE D
Sbjct: 79 AAAEADEAEEEEKEEEAEEESDDD 102
Score = 29.3 bits (66), Expect = 1.2
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEE 24
A + EEEE+EEE +EE ++
Sbjct: 79 AAAEADEAEEEEKEEEAEEESDDD 102
Score = 28.9 bits (65), Expect = 1.7
Identities = 10/28 (35%), Positives = 15/28 (53%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDG 28
A + +E EEEE+++E E E D
Sbjct: 76 GAEAAAEADEAEEEEKEEEAEEESDDDM 103
Score = 27.4 bits (61), Expect = 6.6
Identities = 8/28 (28%), Positives = 14/28 (50%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDG 28
+ +E EEEE+E++ EE +
Sbjct: 77 AEAAAEADEAEEEEKEEEAEEESDDDML 104
Score = 27.0 bits (60), Expect = 7.9
Identities = 9/26 (34%), Positives = 13/26 (50%)
Query: 2 ARRKKKEEEEEEEEEKQEEEGEEKKD 27
A E E +E +EEE EE+ +
Sbjct: 71 AAAAAGAEAAAEADEAEEEEKEEEAE 96
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 31.2 bits (71), Expect = 0.076
Identities = 11/22 (50%), Positives = 18/22 (81%)
Query: 5 KKKEEEEEEEEEKQEEEGEEKK 26
K+++EEE E+EE+ EE+ E K+
Sbjct: 25 KREDEEENEDEEEGEEQSEVKR 46
Score = 30.4 bits (69), Expect = 0.15
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 5 KKKEEEEEEEEEKQEEEGEEKK 26
+K+E+EEE E+E++ EE E K
Sbjct: 24 EKREDEEENEDEEEGEEQSEVK 45
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 33.7 bits (78), Expect = 0.13
Identities = 13/22 (59%), Positives = 16/22 (72%)
Query: 1 EARRKKKEEEEEEEEEKQEEEG 22
A +++EEEEEEEEE EEE
Sbjct: 300 AAEEEEEEEEEEEEEEPSEEEA 321
Score = 32.1 bits (74), Expect = 0.45
Identities = 14/28 (50%), Positives = 19/28 (67%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDG 28
+A ++EEEEEEEEE++E EE G
Sbjct: 297 QAAAAEEEEEEEEEEEEEEPSEEEAAAG 324
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 34.0 bits (78), Expect = 0.13
Identities = 12/33 (36%), Positives = 21/33 (63%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
E +++K+EEEE+ +K+EE EE++ K
Sbjct: 41 EEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKK 73
Score = 29.7 bits (67), Expect = 3.4
Identities = 13/31 (41%), Positives = 20/31 (64%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPTT 31
E +K+ +EEEEEEK+E++ EE+K
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKE 58
Score = 29.7 bits (67), Expect = 3.6
Identities = 11/33 (33%), Positives = 21/33 (63%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
E K++++EEEE+ +EEE +E+++ K
Sbjct: 40 EEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKK 72
Score = 28.2 bits (63), Expect = 10.0
Identities = 10/30 (33%), Positives = 21/30 (70%)
Query: 5 KKKEEEEEEEEEKQEEEGEEKKDGPTTPKL 34
K++EE+ ++EE+ +EE E+++ T K+
Sbjct: 48 KEEEEKTTDKEEEVDEEEEKEEKKKKTKKV 77
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is
functionally equivalent to L7/L12 in bacteria and the
P1 and P2 proteins in eukaryotes. L12p is homologous to
P1 and P2 but is not homologous to bacterial L7/L12. It
is located in the L12 stalk, with proteins L10, L11,
and 23S rRNA. L12p is the only protein in the ribosome
to occur as multimers, always appearing as sets of
dimers. Recent data indicate that most archaeal species
contain six copies of L12p (three homodimers), while
eukaryotes have four copies (two heterodimers), and
bacteria may have four or six copies (two or three
homodimers), depending on the species. The organization
of proteins within the stalk has been characterized
primarily in bacteria, where L7/L12 forms either two or
three homodimers and each homodimer binds to the
extended C-terminal helix of L10. L7/L12 is attached to
the ribosome through L10 and is the only ribosomal
protein that does not directly interact with rRNA.
Archaeal L12p is believed to function in a similar
fashion. However, hybrid ribosomes containing the large
subunit from E. coli with an archaeal stalk are able to
bind archaeal and eukaryotic elongation factors but not
bacterial elongation factors. In several mesophilic and
thermophilic archaeal species, the binding of 23S rRNA
to protein L11 and to the L10/L12p pentameric complex
was found to be temperature-dependent and cooperative.
Length = 106
Score = 31.7 bits (72), Expect = 0.18
Identities = 13/23 (56%), Positives = 20/23 (86%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGE 23
E + ++KEEE+++EEEK+EEE E
Sbjct: 75 EEKAEEKEEEKKKEEEKEEEEEE 97
Score = 31.7 bits (72), Expect = 0.19
Identities = 11/24 (45%), Positives = 19/24 (79%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEE 24
A K +E+EEE+++E+++EE EE
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEE 96
Score = 31.7 bits (72), Expect = 0.21
Identities = 11/28 (39%), Positives = 20/28 (71%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDG 28
+ +++EEE+++EE++EEE EE G
Sbjct: 74 AEEKAEEKEEEKKKEEEKEEEEEEALAG 101
Score = 30.9 bits (70), Expect = 0.33
Identities = 13/37 (35%), Positives = 22/37 (59%)
Query: 2 ARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPKLSLWF 38
A +E+ EE+EEEK++EE +E+++ L F
Sbjct: 70 AAAAAEEKAEEKEEEKKKEEEKEEEEEEALAGLGALF 106
Score = 29.0 bits (65), Expect = 1.4
Identities = 12/28 (42%), Positives = 22/28 (78%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDG 28
A ++K EE+EEE++K+EE+ EE+++
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEEA 98
Score = 29.0 bits (65), Expect = 1.6
Identities = 11/28 (39%), Positives = 21/28 (75%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDG 28
A +K EE+EEE+++++E+E EE++
Sbjct: 72 AAAEEKAEEKEEEKKKEEEKEEEEEEAL 99
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 31.1 bits (71), Expect = 0.19
Identities = 9/18 (50%), Positives = 14/18 (77%)
Query: 5 KKKEEEEEEEEEKQEEEG 22
EEE++EEEE++EE+
Sbjct: 64 AAAEEEKKEEEEEEEEDD 81
Score = 28.4 bits (64), Expect = 1.9
Identities = 8/18 (44%), Positives = 16/18 (88%)
Query: 5 KKKEEEEEEEEEKQEEEG 22
+++++EEEEEEE+ ++ G
Sbjct: 67 EEEKKEEEEEEEEDDDMG 84
Score = 28.0 bits (63), Expect = 2.3
Identities = 8/18 (44%), Positives = 14/18 (77%)
Query: 7 KEEEEEEEEEKQEEEGEE 24
EEE++EE++EEE ++
Sbjct: 64 AAAEEEKKEEEEEEEEDD 81
Score = 28.0 bits (63), Expect = 2.5
Identities = 10/16 (62%), Positives = 13/16 (81%)
Query: 1 EARRKKKEEEEEEEEE 16
A ++K+EEEEEEEE
Sbjct: 64 AAAEEEKKEEEEEEEE 79
Score = 27.2 bits (61), Expect = 4.1
Identities = 10/22 (45%), Positives = 13/22 (59%)
Query: 6 KKEEEEEEEEEKQEEEGEEKKD 27
EEE+K+EEE EE+ D
Sbjct: 60 AAAAAAAEEEKKEEEEEEEEDD 81
Score = 27.2 bits (61), Expect = 4.5
Identities = 9/23 (39%), Positives = 13/23 (56%)
Query: 5 KKKEEEEEEEEEKQEEEGEEKKD 27
EEE++++EEE EE D
Sbjct: 60 AAAAAAAEEEKKEEEEEEEEDDD 82
Score = 27.2 bits (61), Expect = 5.0
Identities = 10/22 (45%), Positives = 13/22 (59%)
Query: 7 KEEEEEEEEEKQEEEGEEKKDG 28
EEEK+EEE EE++D
Sbjct: 60 AAAAAAAEEEKKEEEEEEEEDD 81
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 31.9 bits (73), Expect = 0.26
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKK 26
E +K+ E+E E+ K+EEE E K
Sbjct: 85 ERELRKRAEKEALEQAKKEEELREAK 110
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 32.7 bits (75), Expect = 0.39
Identities = 13/25 (52%), Positives = 20/25 (80%)
Query: 5 KKKEEEEEEEEEKQEEEGEEKKDGP 29
+K EEEE+E+EE++EEE E ++ P
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEPEP 370
Score = 29.6 bits (67), Expect = 2.9
Identities = 12/20 (60%), Positives = 19/20 (95%)
Query: 5 KKKEEEEEEEEEKQEEEGEE 24
++K+EEEE+E+E++EEE EE
Sbjct: 345 EQKDEEEEQEDEEEEEEEEE 364
Score = 28.1 bits (63), Expect = 8.8
Identities = 11/27 (40%), Positives = 21/27 (77%)
Query: 7 KEEEEEEEEEKQEEEGEEKKDGPTTPK 33
E+++EEEE+++EE EE+++ P P+
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPE 369
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 32.0 bits (73), Expect = 0.41
Identities = 10/25 (40%), Positives = 19/25 (76%)
Query: 3 RRKKKEEEEEEEEEKQEEEGEEKKD 27
++++ EE+E EEEEK EE + +++
Sbjct: 155 KKERAEEKEREEEEKAAEEEKAREE 179
Score = 27.8 bits (62), Expect = 9.6
Identities = 9/21 (42%), Positives = 16/21 (76%)
Query: 1 EARRKKKEEEEEEEEEKQEEE 21
E + +++EE+ EEE+ +EEE
Sbjct: 160 EEKEREEEEKAAEEEKAREEE 180
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 32.0 bits (73), Expect = 0.59
Identities = 15/35 (42%), Positives = 21/35 (60%)
Query: 2 ARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPKLSL 36
R +EEEEEEEEE++EEE + P++ L
Sbjct: 400 ERWAAEEEEEEEEEEEEEEEPVAEVMMMPAPEMQL 434
>gnl|CDD|234251 TIGR03544, DivI1A_domain, DivIVA domain. This model describes a
domain found in Bacillus subtilis cell division
initiation protein DivIVA, and homologs, toward the
N-terminus. It is also found as a repeated domain in
certain other proteins, including family TIGR03543.
Length = 34
Score = 28.2 bits (64), Expect = 0.60
Identities = 10/32 (31%), Positives = 15/32 (46%), Gaps = 4/32 (12%)
Query: 294 SPVDVLRRVFPNRRRG----EVETVLQRCKGD 321
+P D+ + F + RG EV+ L R D
Sbjct: 2 TPEDIRNKRFKKKLRGYDAAEVDAFLDRVADD 33
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 31.9 bits (72), Expect = 0.66
Identities = 15/30 (50%), Positives = 18/30 (60%)
Query: 5 KKKEEEEEEEEEKQEEEGEEKKDGPTTPKL 34
++ E+ EE EEEK EEEG K G KL
Sbjct: 315 EQDEDSEESEEEKNEEEGGLSKKGKKLKKL 344
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 31.8 bits (73), Expect = 0.75
Identities = 10/33 (30%), Positives = 21/33 (63%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
+ +KK EEE++E++K+ G++K++ K
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEK 448
Score = 31.0 bits (71), Expect = 1.2
Identities = 15/33 (45%), Positives = 25/33 (75%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
A +KK+EEEEEE+E+K+EE+ EE+++ +
Sbjct: 435 FAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKE 467
Score = 30.3 bits (69), Expect = 1.9
Identities = 10/33 (30%), Positives = 20/33 (60%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
E +K+K+++ ++K+EEE EEK+ +
Sbjct: 424 EEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456
Score = 29.1 bits (66), Expect = 4.9
Identities = 8/33 (24%), Positives = 21/33 (63%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
E +++KK++ +++++EEE E++K +
Sbjct: 425 EEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457
Score = 29.1 bits (66), Expect = 5.1
Identities = 16/29 (55%), Positives = 23/29 (79%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGP 29
E ++KKEEE+EEEEE+ EEE EE+++
Sbjct: 445 EEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473
Score = 29.1 bits (66), Expect = 5.3
Identities = 14/33 (42%), Positives = 23/33 (69%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
E ++KE++EEE+EE++EE EEK++ K
Sbjct: 442 EEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKK 474
Score = 28.7 bits (65), Expect = 5.8
Identities = 9/33 (27%), Positives = 21/33 (63%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
+ ++ +++EEEEE+++E+ EE+K+
Sbjct: 430 KKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Score = 28.7 bits (65), Expect = 6.2
Identities = 10/35 (28%), Positives = 22/35 (62%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPKLS 35
+ + +++EEEEEE++E++ EEK++ +
Sbjct: 431 KKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465
Score = 28.3 bits (64), Expect = 7.7
Identities = 11/31 (35%), Positives = 23/31 (74%)
Query: 3 RRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
+ +++EEE+E++EE++EEE EE ++ +
Sbjct: 440 KEEEEEEEKEKKEEEKEEEEEEAEEEKEEEE 470
Score = 28.3 bits (64), Expect = 7.7
Identities = 18/33 (54%), Positives = 23/33 (69%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
KKKEEEEEEE+EK+EEE EE+++ K
Sbjct: 434 AFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466
>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal. Members
of this family are found in the N terminal region of
cytochrome B561, as well as in various other putative
uncharacterized proteins.
Length = 559
Score = 31.7 bits (72), Expect = 0.83
Identities = 35/165 (21%), Positives = 51/165 (30%), Gaps = 34/165 (20%)
Query: 124 SGAPLPPGSANPSSPGGQDNNNGALNLQGNRVGGEQNHSYPDSPSGNRSPDEKKIKINVD 183
A +PS +L G ++ + SP RS +KK I D
Sbjct: 173 KPASPSSSYQSPSYSS---------SLGPVNSSGNRS-NLRSSPWALRSSGDKK-DITTD 221
Query: 184 EDSEDSLDDNMNAE-HRTRTSTNNNNTTMPK-------------SSRTPSPNQLVDPTGE 229
E ++ ++ E H +S N T SR S
Sbjct: 222 EKYLETFLAEVDEEQHMITSSAGKNATPPETINSFGSSSPSFWNYSRNASDAARSLKKRS 281
Query: 230 VTKSPAPVPSPRIQEESSDESETVLPENLSLPKNTPMSADSTSPM 274
SP+PVPS + S + E P MS +S S +
Sbjct: 282 YQLSPSPVPSKQKASTSPKKGEGEPP---------NMSLESASEV 317
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 30.8 bits (70), Expect = 0.96
Identities = 13/24 (54%), Positives = 17/24 (70%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEE 24
E R KKKEEEE +E E+Q + +E
Sbjct: 50 EEREKKKEEEERKEREEQARKEQE 73
Score = 29.3 bits (66), Expect = 2.6
Identities = 13/26 (50%), Positives = 20/26 (76%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKK 26
E RK++EE EEE E+K+EEE +++
Sbjct: 39 EGERKEEEELEEEREKKKEEEERKER 64
Score = 29.3 bits (66), Expect = 3.0
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEK 25
E +++E+++EEEE K+ EE K
Sbjct: 46 EELEEEREKKKEEEERKEREEQARK 70
Score = 28.9 bits (65), Expect = 3.3
Identities = 11/27 (40%), Positives = 22/27 (81%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKD 27
R++++E EEE E++K+EEE +E+++
Sbjct: 40 GERKEEEELEEEREKKKEEEERKEREE 66
Score = 28.5 bits (64), Expect = 5.2
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEE 24
E +KK+EEE +E EE+ +E EE
Sbjct: 51 EREKKKEEEERKEREEQARKEQEE 74
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 29.7 bits (67), Expect = 1.1
Identities = 14/28 (50%), Positives = 19/28 (67%), Gaps = 1/28 (3%)
Query: 1 EARRKKKEEE-EEEEEEKQEEEGEEKKD 27
A +K+ EE + EEEEK+ EE EE+ D
Sbjct: 93 SAYKKQLEENRKLEEEEKEREELEEEND 120
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 30.4 bits (69), Expect = 1.2
Identities = 11/26 (42%), Positives = 20/26 (76%)
Query: 2 ARRKKKEEEEEEEEEKQEEEGEEKKD 27
AR+K+K +E +++E+ Q+ E EE K+
Sbjct: 20 ARKKQKRKEAKKKEDAQKSEAEEVKN 45
Score = 28.8 bits (65), Expect = 4.1
Identities = 7/28 (25%), Positives = 19/28 (67%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDG 28
E RR+K+E+ + +++K++E +++
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQ 36
Score = 28.4 bits (64), Expect = 5.0
Identities = 12/30 (40%), Positives = 18/30 (60%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPT 30
EA++K+ ++ E EE K EE +KK P
Sbjct: 28 EAKKKEDAQKSEAEEVKNEENKSKKKAAPI 57
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 30.9 bits (69), Expect = 1.3
Identities = 44/238 (18%), Positives = 73/238 (30%), Gaps = 10/238 (4%)
Query: 84 VCAKCTLIAERQRVMAAQVALRRQQAQEENEARELGLIYTSGAPLP-PGSANPSSPGGQD 142
V TLI R A + + + + + +YT P +A P++P
Sbjct: 398 VADAKTLIITRTATNATTTTHKVVFHKAPDTTKSVIFVYTLVHVEPHKTTAVPTTPSLPP 457
Query: 143 NNNGALNLQGNRVGGEQNHSYPDSPSGNRSPDEKKIKINVDEDSEDSLDDNMNAEHRTRT 202
+ G + G + + + SP + + S NA T
Sbjct: 458 ASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQ 517
Query: 203 STNNNNTTMPKSS--RTPSPNQLVDPTGEVTKSPAPVPSPRIQEES-----SDESETVLP 255
T++ + + PTG T S SP++ EES + T P
Sbjct: 518 KTSDTPNATSPTPIVIGVTTTATSPPTG--TTSVPNATSPQVTEESPVNNTNTPVVTSAP 575
Query: 256 ENLSLPKNTPMSADSTSPMSAEHAYLYTHPYSPYQNGRSPVDVLRRVFPNRRRGEVET 313
L+ T +SP S + + +P N S +L P E
Sbjct: 576 SVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEE 633
>gnl|CDD|225995 COG3464, COG3464, Transposase and inactivated derivatives [DNA
replication, recombination, and repair].
Length = 402
Score = 30.9 bits (70), Expect = 1.4
Identities = 13/69 (18%), Positives = 20/69 (28%), Gaps = 2/69 (2%)
Query: 40 HLPNCRFLTTVHRYQRTPKCARCRNHGV--VSALKGHKRYCRWRDCVCAKCTLIAERQRV 97
LP + +R KC RC V + +R R A + +
Sbjct: 61 DLPLFEVPVYLFLRKRRYKCCRCGKRFAEKVDFVDRAQRITRRWAQAAAIALIKGKAFTD 120
Query: 98 MAAQVALRR 106
A + R
Sbjct: 121 TAELNLISR 129
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 30.9 bits (70), Expect = 1.4
Identities = 16/102 (15%), Positives = 28/102 (27%), Gaps = 4/102 (3%)
Query: 176 KKIKINVDEDSEDSLDDNMNAEHRTRTSTNNNNTTMPKSSRTPSPNQLVDPTGEVTKSPA 235
++I ED E++ ++N N TS + P S A
Sbjct: 370 RQIAEERSEDEEEAAENNDNGIVIDETSEFVRSLQKEPLEEKPENKDESVE----EISDA 425
Query: 236 PVPSPRIQEESSDESETVLPENLSLPKNTPMSADSTSPMSAE 277
++E D + + K S + E
Sbjct: 426 EEDDEDEEDEDGDGDVEMSAVDNDEEKEEEDKEAIPSTILEE 467
>gnl|CDD|191179 pfam05053, Menin, Menin. MEN1, the gene responsible for multiple
endocrine neoplasia type 1, is a tumour suppressor gene
that encodes a protein called Menin which may be an
atypical GTPase stimulated by nm23.
Length = 618
Score = 30.7 bits (69), Expect = 1.4
Identities = 23/90 (25%), Positives = 34/90 (37%), Gaps = 17/90 (18%)
Query: 151 QGNRVGGEQNHSYPDSPSGNRSPDEKKIKINVDEDSEDSLDDNMNAEHRTRTSTNNNNTT 210
+G R G + S PSG SP+ + + ++N N ++ NNNN
Sbjct: 477 EGRRRGPRR-ESKSQEPSGGESPNPEL----------PANNNNSN------SNNNNNNGA 519
Query: 211 MPKSSRTPSPNQLVDPTGEVTKSPAPVPSP 240
K + + N G T P PV S
Sbjct: 520 DRKEAAATTGNATTTSNGSGTSVPLPVSSE 549
>gnl|CDD|183147 PRK11465, PRK11465, putative mechanosensitive channel protein;
Provisional.
Length = 741
Score = 30.5 bits (69), Expect = 1.7
Identities = 12/41 (29%), Positives = 21/41 (51%), Gaps = 1/41 (2%)
Query: 219 SPNQLVDPTGEVTKSPAPVPSPRIQEESSDESETVLPENLS 259
S +L+D +P P P+I + E +TVL +N++
Sbjct: 61 SRKELIDQLRTAAATPPAEPVPKIVPPTLVEEKTVL-QNVT 100
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).
This family consists of several Plasmodium falciparum
SPAM (secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the
alanine heptad-repeat domain. Heptad repeats in which
the a and d position contain hydrophobic residues
generate amphipathic alpha-helices which give rise to
helical bundles or coiled-coil structures in proteins.
SPAM is an example of a P. falciparum antigen in which
a repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 29.8 bits (67), Expect = 1.7
Identities = 14/27 (51%), Positives = 21/27 (77%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKD 27
E + +K+E++EEEEE+ EEE EE +D
Sbjct: 41 EDVKDEKQEDDEEEEEEDEEEIEEPED 67
Score = 28.3 bits (63), Expect = 5.4
Identities = 10/23 (43%), Positives = 16/23 (69%)
Query: 7 KEEEEEEEEEKQEEEGEEKKDGP 29
+EEEEE+EE+ + + EKK+
Sbjct: 78 EEEEEEDEEDNVDLKDIEKKNIN 100
>gnl|CDD|234696 PRK00232, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase;
Reviewed.
Length = 332
Score = 30.1 bits (69), Expect = 2.2
Identities = 9/56 (16%), Positives = 18/56 (32%), Gaps = 4/56 (7%)
Query: 84 VCAKCTLIAERQRVMAAQVALRRQQAQEENEARELGLIY----TSGAPLPPGSANP 135
V A L+ R ++ + LR + +E G + + G +
Sbjct: 39 VIADRALLERRAAILGLPLDLRPYSPDAPAQPQEAGTLTVLPVPLAPAVVAGQLDA 94
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called
The RES complex, pre-mRNA retention and splicing
complex. Subunits of this complex are not essential for
viability of yeasts but they are required for efficient
splicing in vitro and in vivo. Furthermore,
inactivation of this complex causes pre-mRNA leakage
from the nucleus. Bud13 contains a unique,
phylogenetically conserved C-terminal region of unknown
function.
Length = 141
Score = 29.2 bits (66), Expect = 2.3
Identities = 11/21 (52%), Positives = 18/21 (85%)
Query: 1 EARRKKKEEEEEEEEEKQEEE 21
E +R++KE E+EE+E K+E+E
Sbjct: 15 EEKREEKEREKEEKERKEEKE 35
Score = 28.0 bits (63), Expect = 4.4
Identities = 11/26 (42%), Positives = 19/26 (73%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKK 26
E R+K+E+E +EE+EK+ +G +K
Sbjct: 20 EKEREKEEKERKEEKEKEWGKGLVQK 45
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized.
This family of proteins is found in bacteria. Proteins
in this family are approximately 90 amino acids in
length. There are two completely conserved L residues
that may be functionally important.
Length = 70
Score = 27.7 bits (62), Expect = 2.4
Identities = 8/21 (38%), Positives = 15/21 (71%)
Query: 1 EARRKKKEEEEEEEEEKQEEE 21
E +R+++EE + EE K+ E+
Sbjct: 33 EEKREEEEEARKREERKEREK 53
Score = 26.1 bits (58), Expect = 7.3
Identities = 10/21 (47%), Positives = 16/21 (76%)
Query: 6 KKEEEEEEEEEKQEEEGEEKK 26
KK+E + EEE+++EEE K+
Sbjct: 25 KKKELKAEEEKREEEEEARKR 45
Score = 26.1 bits (58), Expect = 9.2
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKD 27
+A +K+EEEEE + ++ +E E+ K
Sbjct: 30 KAEEEKREEEEEARKREERKEREKNKS 56
>gnl|CDD|189037 cd09867, PIN_FEN1, PIN domain of Flap Endonuclease-1, a
structure-specific, divalent-metal-ion dependent, 5'
nuclease and homologs. Flap endonuclease-1 (FEN1) is
involved in multiple DNA metabolic pathways, including
DNA replication processes (5' flap DNA endonuclease
activity and double stranded DNA 5'-exonuclease
activity) and DNA repair processes (long-patch base
excision repair) in eukaryotes and archaea. Interaction
between FEN1 and PCNA (Proliferating cell nuclear
antigen) is an essential prerequisite to FEN1's DNA
replication functionality and stimulates FEN1 nuclease
activity by 10-50 fold. FEN1 belongs to the
FEN1-EXO1-like family of structure-specific, 5'
nucleases. These nucleases contain a PIN (PilT N
terminus) domain with a helical arch/clamp region (I
domain) of variable length (approximately 45 residues in
FEN1 PIN domains) and a H3TH (helix-3-turn-helix)
domain, an atypical helix-hairpin-helix-2-like region.
Both the H3TH domain (not included here) and the helical
arch/clamp region are involved in DNA binding. Nucleases
within this group also have a carboxylate-rich active
site that is involved in binding essential divalent
metal ion cofactors (Mg2+/Mn2+). FEN1 has a C-terminal
extension containing residues forming the consensus
PIP-box - Qxx(M/L/I)xxF(Y/F) which serves to anchor FEN1
to PCNA.
Length = 261
Score = 29.8 bits (68), Expect = 2.7
Identities = 12/25 (48%), Positives = 18/25 (72%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEK 25
E RR+++EE EE+ EE +EE E+
Sbjct: 93 EKRRERREEAEEKLEEAKEEGDAEE 117
>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
(LMP2) protein. This family consists of several
Gammaherpesvirus latent membrane protein (LMP2)
proteins. Epstein-Barr virus is a human Gammaherpesvirus
that infects and establishes latency in B lymphocytes in
vivo. The latent membrane protein 2 (LMP2) gene is
expressed in latently infected B cells and encodes two
protein isoforms, LMP2A and LMP2B, that are identical
except for an additional N-terminal 119 aa cytoplasmic
domain which is present in the LMP2A isoform. LMP2A is
thought to play a key role in either the establishment
or the maintenance of latency and/or the reactivation of
productive infection from the latent state. The
significance of LMP2B and its role in pathogenesis
remain unclear.
Length = 489
Score = 29.8 bits (67), Expect = 2.7
Identities = 14/52 (26%), Positives = 17/52 (32%), Gaps = 7/52 (13%)
Query: 129 PPGSANPSSPGGQDNNNGALNLQGNRVGGEQ-------NHSYPDSPSGNRSP 173
P G+ P S GG D + G N G D+PS P
Sbjct: 8 PLGAGGPRSHGGPDGDEGDSNPYYPSSFGSSWDRPGPPVPEDYDAPSHRPPP 59
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 30.0 bits (68), Expect = 2.7
Identities = 9/27 (33%), Positives = 14/27 (51%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKD 27
E + E E + KQE+EG ++D
Sbjct: 237 EETDVEIETTSETKGTKQEQEGSTEED 263
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 30.0 bits (67), Expect = 2.8
Identities = 6/22 (27%), Positives = 18/22 (81%)
Query: 8 EEEEEEEEEKQEEEGEEKKDGP 29
+E+E+E+++ +E++ EE+++
Sbjct: 157 DEDEDEDDDDEEDDEEEEEEEE 178
Score = 28.5 bits (63), Expect = 7.9
Identities = 7/21 (33%), Positives = 19/21 (90%)
Query: 7 KEEEEEEEEEKQEEEGEEKKD 27
++E+E++++E+ +EE EE+++
Sbjct: 158 EDEDEDDDDEEDDEEEEEEEE 178
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 29.8 bits (67), Expect = 2.9
Identities = 12/36 (33%), Positives = 19/36 (52%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPKLSL 36
+++EEEEEEE + E EE G P++ +
Sbjct: 402 AEEEEEEEEEEEEEAAEAEAPMEEPVPGFEVPEMPM 437
Score = 29.0 bits (65), Expect = 5.7
Identities = 12/22 (54%), Positives = 17/22 (77%)
Query: 5 KKKEEEEEEEEEKQEEEGEEKK 26
++ EEEEEEEE++EEE E +
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAE 420
>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein.
Members of this family are the S-layer-forming
halobacterial major cell surface glycoprotein. The
highest scores below model cutoffs are fragmentary
paralogs to actual members of the family. Modifications
include at N-linked and O-linked glycosylation, a
C-terminal diphytanylglyceryl modification, and probable
cleavage of the PGF-CTERM tail.
Length = 782
Score = 29.8 bits (67), Expect = 2.9
Identities = 10/66 (15%), Positives = 20/66 (30%), Gaps = 1/66 (1%)
Query: 191 DDNMNAEHRTRTSTNNNNTTMPKSSRTPSPNQLVDPTGEVTKSPAPVPSPRIQEESSD-E 249
DD N + +S P+ TG + P+ +E + +
Sbjct: 695 DDGDNTDRVNVEVVEETERPDTTTSEDPTTTTTPTTTGPEETTETAEPTTTTEEPTEETT 754
Query: 250 SETVLP 255
+ + P
Sbjct: 755 TGSSTP 760
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 28.3 bits (63), Expect = 3.1
Identities = 10/17 (58%), Positives = 16/17 (94%)
Query: 6 KKEEEEEEEEEKQEEEG 22
KKEE++EEEEE++++ G
Sbjct: 91 KKEEKKEEEEEEEDDLG 107
Score = 28.0 bits (62), Expect = 4.6
Identities = 9/21 (42%), Positives = 18/21 (85%)
Query: 1 EARRKKKEEEEEEEEEKQEEE 21
A+ + K+EE++EEEE++E++
Sbjct: 85 GAKAEAKKEEKKEEEEEEEDD 105
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 29.7 bits (67), Expect = 3.2
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 6 KKEEEEEEEEEKQEEEGEE 24
EEEE+EEEE++E+E E
Sbjct: 389 DDEEEEDEEEEEEEDEDEG 407
Score = 29.3 bits (66), Expect = 4.2
Identities = 13/25 (52%), Positives = 18/25 (72%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEK 25
R ++EE+EEEEEE+ E+EG K
Sbjct: 386 TERDDEEEEDEEEEEEEDEDEGPSK 410
Score = 28.9 bits (65), Expect = 6.0
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 4 RKKKEEEEEEEEEKQEEEGEEKKDG 28
R +EEE+EEEEE+++E+ K+
Sbjct: 388 RDDEEEEDEEEEEEEDEDEGPSKEH 412
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 28.8 bits (65), Expect = 3.3
Identities = 8/26 (30%), Positives = 20/26 (76%)
Query: 8 EEEEEEEEEKQEEEGEEKKDGPTTPK 33
EE+EEEE+++++++ +E ++ + K
Sbjct: 118 EEDEEEEDDEEDDDEDESEEEESPVK 143
Score = 28.4 bits (64), Expect = 3.8
Identities = 9/26 (34%), Positives = 19/26 (73%)
Query: 8 EEEEEEEEEKQEEEGEEKKDGPTTPK 33
+EE+EEEE+ +E++ E++ + +P
Sbjct: 117 DEEDEEEEDDEEDDDEDESEEEESPV 142
Score = 27.3 bits (61), Expect = 9.2
Identities = 9/19 (47%), Positives = 15/19 (78%)
Query: 8 EEEEEEEEEKQEEEGEEKK 26
+EE+++E+E +EEE KK
Sbjct: 126 DEEDDDEDESEEEESPVKK 144
>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
represents the eukaryotic large ribosomal protein P2.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P2 is located in the L12 stalk, with proteins
P1, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers). Bacteria may have four or six copies
of L7/L12 (two or three homodimers) depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plants have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site in the C-terminal region
of P2. TCS inactivates the ribosome by depurinating a
specific adenine in the sarcin-ricin loop of 28S rRNA.
Length = 109
Score = 28.0 bits (63), Expect = 3.3
Identities = 7/16 (43%), Positives = 11/16 (68%)
Query: 1 EARRKKKEEEEEEEEE 16
+ +KKEE EEE ++
Sbjct: 86 AKKEEKKEESEEESDD 101
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 29.2 bits (66), Expect = 3.9
Identities = 11/23 (47%), Positives = 17/23 (73%)
Query: 2 ARRKKKEEEEEEEEEKQEEEGEE 24
+R+++ EE EEE EK EEG++
Sbjct: 100 EKRRERREEAEEELEKAIEEGDD 122
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 29.3 bits (65), Expect = 3.9
Identities = 22/118 (18%), Positives = 37/118 (31%), Gaps = 16/118 (13%)
Query: 175 EKKIKINVDEDSEDSLDDNMNAEH------RTRTSTNNNNTTMPKSSRTPSPNQLVDPTG 228
E+ IN+ + + N +TN T+ K + P
Sbjct: 47 ERLAMINITAGANSATTQAANTRQERTPKLEKAPNTNEEKTSASKIEKISQP-----KQE 101
Query: 229 EVTKSPAPVPSPRIQEESSDESETVLPENLSLPK-NTPMSADSTSPMSAEHAYLYTHP 285
E QE+S +E+ P+ K TP S ++ PM + + P
Sbjct: 102 EQKSLNISATPAPKQEQSQTTTESTTPK----TKVTTPPSTNTPQPMQSTKSDTPQSP 155
>gnl|CDD|182059 PRK09752, PRK09752, adhesin; Provisional.
Length = 1250
Score = 29.6 bits (66), Expect = 4.0
Identities = 18/56 (32%), Positives = 22/56 (39%), Gaps = 5/56 (8%)
Query: 193 NMNAEHRTRTSTNNN-----NTTMPKSSRTPSPNQLVDPTGEVTKSPAPVPSPRIQ 243
NM A T NN+ P S P P DPT + +P P P+P Q
Sbjct: 896 NMGAYDYTLVEDNNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPDPTPDPEPTPAYQ 951
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 28.9 bits (64), Expect = 4.1
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKD 27
E + ++++EEEE EEE ++EEG+ D
Sbjct: 217 EKQPQEEQEEEEVEEEAKQEEGQGTDD 243
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 28.0 bits (63), Expect = 4.2
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEK 25
EA+R + EE++ +EEE + + E K
Sbjct: 94 EAQRARDEEKKLDEEEAKRQHEEAK 118
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 29.1 bits (66), Expect = 4.3
Identities = 9/29 (31%), Positives = 16/29 (55%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGP 29
++ +KEE E+ +K+ E +K GP
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGP 449
Score = 28.7 bits (65), Expect = 5.6
Identities = 8/33 (24%), Positives = 16/33 (48%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
E + +K+E E+ ++K E ++ K K
Sbjct: 422 EKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein
7 homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 28.0 bits (63), Expect = 4.3
Identities = 12/27 (44%), Positives = 21/27 (77%)
Query: 5 KKKEEEEEEEEEKQEEEGEEKKDGPTT 31
++ ++EEEEE+E++E+E E +DG T
Sbjct: 28 EEYDKEEEEEKEEKEKEAEPDEDGWVT 54
>gnl|CDD|180523 PRK06305, PRK06305, DNA polymerase III subunits gamma and tau;
Validated.
Length = 451
Score = 29.4 bits (66), Expect = 4.4
Identities = 13/67 (19%), Positives = 26/67 (38%), Gaps = 2/67 (2%)
Query: 215 SRTPSPNQLVDPTGEV--TKSPAPVPSPRIQEESSDESETVLPENLSLPKNTPMSADSTS 272
S + +L ++ + S V SP+ Q + E ++L K + + S
Sbjct: 364 SPAFTQLRLESCPPQIKSSNSQQTVSSPQPQPVAKLEQGSLLTAASPQTKKETIHQEVPS 423
Query: 273 PMSAEHA 279
P ++ A
Sbjct: 424 PSLSKSA 430
>gnl|CDD|225372 COG2815, COG2815, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 303
Score = 28.9 bits (65), Expect = 4.4
Identities = 16/63 (25%), Positives = 25/63 (39%), Gaps = 5/63 (7%)
Query: 215 SRTPSPNQLVDPTGEVT----KSPAPVPSPRIQEESSDESETVLPENLSLPKNTPMSADS 270
S++PS V P V K P + P + + DE+ + L + L N+ S
Sbjct: 136 SQSPSAGTEVKPGETVKLTVSKGPETITVPDLVGMTYDEASSNL-KAAGLTVNSKEYVSS 194
Query: 271 TSP 273
P
Sbjct: 195 DRP 197
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 28.4 bits (64), Expect = 4.4
Identities = 10/23 (43%), Positives = 16/23 (69%)
Query: 4 RKKKEEEEEEEEEKQEEEGEEKK 26
++++EEEE E EE EEE ++
Sbjct: 101 KEEEEEEEVEVEELDEEEQIDEL 123
Score = 27.6 bits (62), Expect = 7.4
Identities = 8/23 (34%), Positives = 15/23 (65%)
Query: 5 KKKEEEEEEEEEKQEEEGEEKKD 27
++ E EE +EEE+ +E E++
Sbjct: 107 EEVEVEELDEEEQIDELLEKELA 129
Score = 27.6 bits (62), Expect = 7.4
Identities = 14/23 (60%), Positives = 18/23 (78%)
Query: 5 KKKEEEEEEEEEKQEEEGEEKKD 27
K+KEEEEEEE E +E + EE+ D
Sbjct: 99 KEKEEEEEEEVEVEELDEEEQID 121
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.
Trichoplein or mitostatin, was first defined as a
meiosis-specific nuclear structural protein. It has
since been linked with mitochondrial movement. It is
associated with the mitochondrial outer membrane, and
over-expression leads to reduction in mitochondrial
motility whereas lack of it enhances mitochondrial
movement. The activity appears to be mediated through
binding the mitochondria to the actin intermediate
filaments (IFs).
Length = 349
Score = 29.1 bits (66), Expect = 4.4
Identities = 11/21 (52%), Positives = 13/21 (61%)
Query: 1 EARRKKKEEEEEEEEEKQEEE 21
E R K EEEE E K++EE
Sbjct: 48 EEERLKALAEEEERERKRKEE 68
>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein. This entry includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 106
Score = 27.7 bits (62), Expect = 4.5
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 2 ARRKKKEEEEEEEEEKQEEEGEEKK 26
AR + +EEE E+EE++E+ ++K
Sbjct: 82 ARLEDREEERLEKEEEREKRARKRK 106
>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and L12p.
Ribosomal proteins P1 and P2 are the eukaryotic proteins
that are functionally equivalent to bacterial L7/L12.
L12p is the archaeal homolog. Unlike other ribosomal
proteins, the archaeal L12p and eukaryotic P1 and P2 do
not share sequence similarity with their bacterial
counterparts. They are part of the ribosomal stalk
(called the L7/L12 stalk in bacteria), along with 28S
rRNA and the proteins L11 and P0 in eukaryotes (23S
rRNA, L11, and L10e in archaea). In bacterial ribosomes,
L7/L12 homodimers bind the extended C-terminal helix of
L10 to anchor the L7/L12 molecules to the ribosome.
Eukaryotic P1/P2 heterodimers and archaeal L12p
homodimers are believed to bind the L10 equivalent
proteins, eukaryotic P0 and archaeal L10e, in a similar
fashion. P1 and P2 (L12p, L7/L12) are the only proteins
in the ribosome to occur as multimers, always appearing
as sets of dimers. Recent data indicate that most
archaeal species contain six copies of L12p (three
homodimers), while eukaryotes have two copies each of P1
and P2 (two heterodimers). Bacteria may have four or six
copies (two or three homodimers), depending on the
species. As in bacteria, the stalk is crucial for
binding of initiation, elongation, and release factors
in eukaryotes and archaea.
Length = 105
Score = 27.6 bits (61), Expect = 4.5
Identities = 11/22 (50%), Positives = 15/22 (68%)
Query: 1 EARRKKKEEEEEEEEEKQEEEG 22
E K +E +EEEEEE+ E+ G
Sbjct: 80 EPAEKAEEAKEEEEEEEDEDFG 101
>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
Provisional.
Length = 291
Score = 29.0 bits (64), Expect = 4.5
Identities = 23/73 (31%), Positives = 37/73 (50%), Gaps = 9/73 (12%)
Query: 125 GAPLPPGSANPSSPGGQDNNNGALNLQGNRVGGEQNHSYPDSPSGNRSPDEKKIKINVDE 184
GA G + P S G + ++ G +L+G G ++N SP D++ +VD+
Sbjct: 33 GAQQDGGKSPPESKGLETSSQGTQDLKGGAAGAKEN-----SPPLPTEEDDE----DVDD 83
Query: 185 DSEDSLDDNMNAE 197
DSE+ DD+ AE
Sbjct: 84 DSEEGDDDDGGAE 96
>gnl|CDD|219086 pfam06553, BNIP3, BNIP3. This family consists of several mammalian
specific BCL2/adenovirus E1B 19-kDa protein-interacting
protein 3 or BNIP3 sequences. BNIP3 belongs to the Bcl-2
homology 3 (BH3)-only family, a Bcl-2-related family
possessing an atypical Bcl-2 homology 3 (BH3) domain,
which regulates PCD from mitochondrial sites by
selective Bcl-2/Bcl-XL interactions. BNIP3 family
members contain a C-terminal transmembrane domain that
is required for their mitochondrial localisation,
homodimerisation, as well as regulation of their
pro-apoptotic activities. BNIP3-mediated apoptosis has
been reported to be independent of caspase activation
and cytochrome c release and is characterized by early
plasma membrane and mitochondrial damage, prior to the
appearance of chromatin condensation or DNA
fragmentation.
Length = 197
Score = 28.7 bits (64), Expect = 5.0
Identities = 26/134 (19%), Positives = 44/134 (32%), Gaps = 20/134 (14%)
Query: 132 SANPSSPGGQDNNNGA-LNLQGNRVGGEQNHSYPDSPSGNRSPDEKKIKINVDEDSEDSL 190
SA P +D NG+ + L+ + + + S + K+ ++ E S
Sbjct: 2 SAQQVPPSSEDGLNGSWVELEMSGNNASSSSPGRLEHVPSSSSNGDMEKLLLEAQRESS- 60
Query: 191 DDNMNAEHRTRTSTNNNNTTMPKSSRTPSPNQLVDPTGEVTKSPAPVPSPRIQEESSDES 250
++ SSR SP + P E ++ V + +S
Sbjct: 61 -----------------RSSSRTSSRCDSPPRSPTPPQESPQTELDVEMDS-LRDVYIQS 102
Query: 251 ETVLPENLSLPKNT 264
E E L KNT
Sbjct: 103 EEEAGEKLKEVKNT 116
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 28.6 bits (64), Expect = 5.0
Identities = 10/27 (37%), Positives = 21/27 (77%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKD 27
E + K+ E E+ +EE++++EE EE+++
Sbjct: 157 EKKLKELEAEDVDEEDEKDEEEEEEEE 183
Score = 27.8 bits (62), Expect = 9.2
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKD 27
+ +K EEEEEEEEE+ E+ ++ D
Sbjct: 169 DEEDEKDEEEEEEEEEEDEDFDDDDDD 195
>gnl|CDD|227615 COG5296, COG5296, Transcription factor involved in TATA site
selection and in elongation by RNA polymerase II
[Transcription].
Length = 521
Score = 28.9 bits (64), Expect = 5.2
Identities = 18/73 (24%), Positives = 27/73 (36%), Gaps = 2/73 (2%)
Query: 194 MNAEHRTRTSTNNNNTTMPKSSRTPSPNQLVDPTGEVTKSPAPVPSPRIQ-EESSDESET 252
+N +R R T + + + PT V P+P S I + + ES
Sbjct: 417 VNERNRRRNHTEIRLAEQRMNEEAATATPMSAPT-SVLTQPSPSLSTSIMPKLNPSESVV 475
Query: 253 VLPENLSLPKNTP 265
V E S P +P
Sbjct: 476 VASEKASSPDLSP 488
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 28.9 bits (65), Expect = 5.3
Identities = 21/104 (20%), Positives = 40/104 (38%), Gaps = 6/104 (5%)
Query: 183 DEDSEDSLDDNMNAEHRTRTSTNNNNTTM---PKSSRTPSPNQLVDPTGEVTKSPAP--- 236
DEDSED +DN N S++ T +++ P P
Sbjct: 213 DEDSEDDEEDNDNTPSPKSGSSSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDA 272
Query: 237 VPSPRIQEESSDESETVLPENLSLPKNTPMSADSTSPMSAEHAY 280
+ S ++++++ ES + P + +P +A ++S S
Sbjct: 273 LSSLVVRKKAAPESTSQSPSSAEPTSESPQTAGNSSLSSLGDYS 316
>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
TopJ. This model describes TopJ (MG_200, CbpA), a DnaJ
homolog and probable assembly protein of the Mycoplasma
terminal organelle. The terminal organelle is involved
in both cytadherence and gliding motility [Cellular
processes, Chemotaxis and motility].
Length = 871
Score = 29.0 bits (64), Expect = 5.5
Identities = 41/197 (20%), Positives = 69/197 (35%), Gaps = 13/197 (6%)
Query: 174 DEKKIKINVDEDSEDSLDDNMNAEHRTRTSTNNNNTTMPKSSRTPSPNQLVDPTGEVTKS 233
D + E +DS DD + + S+ T PK P + VD
Sbjct: 231 DGNWVWTRELEPQDDSEDDYVIPDAEI-ISSPTLEVTAPKEVEQPLQPEPVDEETVAETK 289
Query: 234 PAPVPSPRIQEESSDESETVLPENLSLPKNTPMSADSTSPMSAEHAYLYTHPYSPY-QNG 292
P P E+ S PE+ ++ +N P T + + T P P Q
Sbjct: 290 AEEEPQPTQTVETKPTSA---PES-TVEENLPEINQPTQAVQPTSETISTTPVEPTDQLK 345
Query: 293 RSPVDVLRRVFPNRRRGEVETV------LQRCKGDVLQAIEM-MVYSEQLEEPRSAFSPL 345
VD ++ + EVE + L D L+ ++ +V + Q +EP A
Sbjct: 346 PKEVDQIQEELKKTKEIEVEELPTKKNDLVEINFDDLEELKFELVQTNQEKEPEKAVENW 405
Query: 346 VSAFQRNFASQQANRRF 362
+ +Q + +Q +
Sbjct: 406 ATDYQLDEPTQSNIDWY 422
>gnl|CDD|133058 cd06436, GlcNAc-1-P_transferase, N-acetyl-glucosamine transferase
is involved in the synthesis of
Poly-beta-1,6-N-acetyl-D-glucosamine.
N-acetyl-glucosamine transferase is responsible for the
synthesis of bacteria
Poly-beta-1,6-N-acetyl-D-glucosamine (PGA).
Poly-beta-1,6-N-acetyl-D-glucosamine is a homopolymer
that serves as an adhesion for the maintenance of
biofilm structural stability in diverse eubacteria.
N-acetyl-glucosamine transferase is the product of gene
pgaC. Genetic analysis indicated that all four genes of
the pgaABCD locus were required for the PGA production,
pgaC being a glycosyltransferase.
Length = 191
Score = 28.1 bits (63), Expect = 5.8
Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 11/36 (30%)
Query: 290 QNGRSPVDVLRRVFPNRRRGEVETVLQRCKGDVLQA 325
+ R V +LRR PN R G KGD L A
Sbjct: 48 TDSR--VHLLRRHLPNARTG---------KGDALNA 72
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 28.6 bits (64), Expect = 6.0
Identities = 13/27 (48%), Positives = 19/27 (70%), Gaps = 1/27 (3%)
Query: 1 EARRKKKEEEEEEE-EEKQEEEGEEKK 26
E +RKKKE+++ EE ++KQ E E K
Sbjct: 78 EEQRKKKEQQQAEELQQKQAAEQERLK 104
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 28.2 bits (63), Expect = 7.0
Identities = 11/33 (33%), Positives = 19/33 (57%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPK 33
E ++ EEE EEE+E+ +E E++ + T
Sbjct: 85 EKEDEESEEENEEEDEESSDENEKETEEKTESN 117
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 28.7 bits (64), Expect = 7.2
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKD 27
E + KK+E++EE+ EE + EE +E+ +
Sbjct: 278 ELKEKKEEKDEEKSEEVKTEEVDEEFE 304
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 28.2 bits (63), Expect = 8.3
Identities = 14/29 (48%), Positives = 21/29 (72%), Gaps = 2/29 (6%)
Query: 1 EARRKKKEEE--EEEEEEKQEEEGEEKKD 27
EA K KEE+ +EE E+KQ+E+ +E K+
Sbjct: 397 EAEAKAKEEKLKQEENEKKQKEQADEDKE 425
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 28.4 bits (63), Expect = 8.4
Identities = 39/187 (20%), Positives = 61/187 (32%), Gaps = 15/187 (8%)
Query: 100 AQVALRRQQAQEENEARELGLIYTSGAPLPPGSANPSSPGGQDNNNGALNLQGNRVGGEQ 159
Q+ LR + Q++ + ++ A GGQ G LN+QGN+ G
Sbjct: 341 KQMKLRNMRGQQQTQQQQQQQGGNHPAAHQQQMNQQVGQGGQMVALGYLNIQGNQGGLGA 400
Query: 160 NHSYPDSPSGNRSPDEKKIKINVDEDSEDSLDDNMNAEHRTRTSTNNNNTTMPKSSRTPS 219
N P SP + S+ + + PS
Sbjct: 401 NPMQQGQPGMMSSPSPVP-----QVQTNQSMPQPPQPSVPSPGGPGSQPPQSVSGGMIPS 455
Query: 220 PNQLVDPTGEVTKSPAPVPSPRIQEESSDESETVLPENLSLPKNTPMSADSTSPMSAEHA 279
P L+ SP+P S + + + + V P P NTP + SP + +
Sbjct: 456 PPALM-------PSPSPQMSQSPASQRTIQQDMVSP---GGPLNTPGQSSVNSPANPQEE 505
Query: 280 YLYTHPY 286
LY Y
Sbjct: 506 QLYREKY 512
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 28.0 bits (63), Expect = 8.7
Identities = 12/26 (46%), Positives = 17/26 (65%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKK 26
E + K EEE +EE QE++ E+KK
Sbjct: 268 EEEKILKAAEEERQEEAQEKKEEKKK 293
>gnl|CDD|198151 smart01083, Cir_N, N-terminal domain of CBF1 interacting
co-repressor CIR. This is a 45 residue conserved
region at the N-terminal end of a family of proteins
referred to as CIRs (CBF1-interacting co-repressors).
CBF1 (centromere-binding factor 1) acts as a
transcription factor that causes repression by binding
specifically to GTGGGAA motifs in responsive promoters,
and it requires CIR as a co-repressor. CIR binds to
histone deacetylase and to SAP30 and serves as a linker
between CBF1 and the histone deacetylase complex.
Length = 37
Score = 25.2 bits (56), Expect = 8.9
Identities = 10/23 (43%), Positives = 15/23 (65%)
Query: 2 ARRKKKEEEEEEEEEKQEEEGEE 24
R K E++ EEE++K EE +E
Sbjct: 11 KRVWKAEQKAEEEKKKIEERRKE 33
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 28.1 bits (63), Expect = 9.3
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 8 EEEEEEEEEKQEEEGEEKKDGPTT 31
EEEEEEEEE EEE EE+++ T
Sbjct: 306 EEEEEEEEEVPEEEEEEEEEEERT 329
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel,
four-helix bundle structure with a large hydrophobic
patch in which 23 residues of one monomer form van der
Waals contacts with the other monomer. This CENP-B
dimer configuration may be suitable for capturing two
distant CENP-B boxes during centromeric heterochromatin
formation.
Length = 101
Score = 26.7 bits (58), Expect = 9.4
Identities = 8/34 (23%), Positives = 20/34 (58%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKKDGPTTPKL 34
++ ++E++++E+EE +E+ +E D P
Sbjct: 13 DSDSDEEEDDDDEDEEDDDEDDDEDDDEVPVPSF 46
>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
synthase complex, beta subunit. Nomenclature follows
the description for Methanosarcina thermophila. The
CO-methylating acetyl-CoA synthase is considered the
defining enzyme of the Wood-Ljungdahl pathway, used for
acetate catabolism by sulfate reducing bacteria but for
acetate biosynthesis by acetogenic bacteria such as
oorella thermoacetica (f. Clostridium thermoaceticum)
[Energy metabolism, Chemoautotrophy].
Length = 458
Score = 28.3 bits (63), Expect = 9.5
Identities = 9/31 (29%), Positives = 16/31 (51%)
Query: 2 ARRKKKEEEEEEEEEKQEEEGEEKKDGPTTP 32
+R +E +EEE EE++E E+ +
Sbjct: 396 VKRVVREVDEEEIEEEEEAMQPEEMEMEGFE 426
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 28.4 bits (63), Expect = 9.5
Identities = 10/25 (40%), Positives = 13/25 (52%)
Query: 4 RKKKEEEEEEEEEKQEEEGEEKKDG 28
R +E E EEE E E+E + K
Sbjct: 378 RNPEESEHEEEVEDYEDENDHSKRI 402
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 27.9 bits (62), Expect = 9.5
Identities = 7/26 (26%), Positives = 17/26 (65%)
Query: 1 EARRKKKEEEEEEEEEKQEEEGEEKK 26
+A ++ +E++++ EE K ++ E K
Sbjct: 109 QAAKQAEEKQKQAEEAKAKQAAEAKA 134
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.127 0.371
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,951,719
Number of extensions: 1978423
Number of successful extensions: 6637
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5986
Number of HSP's successfully gapped: 324
Length of query: 412
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 313
Effective length of database: 6,546,556
Effective search space: 2049072028
Effective search space used: 2049072028
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 60 (26.7 bits)