RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3598
(175 letters)
>gnl|CDD|176263 cd08049, TAF8, TATA Binding Protein (TBP) Associated Factor 8.
The TATA Binding Protein (TBP) Associated Factor 8
(TAF8) is one of several TAFs that bind TBP, and is
involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID,
TFIIE, TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryotes. TFIID plays an important role in the
recognition of promoter DNA and the assembly of the
preinitiation complex. The TFIID complex is composed of
the TBP and at least 13 TAFs. TAFs from various species
were originally named by their predicted molecular
weight or their electrophoretic mobility in
polyacrylamide gels. A new, unified nomenclature for
the pol II TAFs has been suggested to show the
relationship between TAF orthologs and paralogs.
Several hypotheses are proposed for TAFs' functions,
such as serving as activator-binding sites, involvement
in the core-promoter recognition, or a role in the
essential catalytic activity of the complex. The mouse
ortholog of TAF8 is called taube nuss protein (TBN),
and is required for early embryonic development. TBN
mutant mice exhibit disturbances in the balance between
cell death and cell survival in the early embryo. TAF8
plays a role in the differentiation of preadipocyte
fibroblasts to adipocytes; it is also required for the
integration of TAF10 into the TAF complex. In yeast and
human cells, TAFs have been found as components of
other complexes besides TFIID. TAF8 is also a
component of a small TAF complex (SMAT), which contains
TAF8, TAF10 and SUPT7L. Several TAFs interact via
histone-fold motifs. The histone fold (HFD) is the
interaction motif involved in heterodimerization of the
core histones and their assembly into nucleosome
octamer. TAF8 contains an H4 related histone fold
motif, and interacts with several subunits of TFIID,
including TBP and the histone-fold protein TAF10.
Currently, five HF-containing TAF pairs have been
described or suggested to exist in TFIID: TAF6-TAF9,
TAF4-TAF12, TAF11-TAF13, TAF8-TAF10 and TAF3-TAF10.
Length = 54
Score = 36.1 bits (84), Expect = 7e-04
Identities = 11/35 (31%), Positives = 21/35 (60%)
Query: 22 ETHKQPVTEYEAIREKAATQKRDLARALNKFIIKT 56
T+ + T+ + +RE+ A+Q+R +AL + KT
Sbjct: 20 PTYSERETDPKKLREELASQRRLAEKALLRLQQKT 54
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 37.7 bits (88), Expect = 0.001
Identities = 16/39 (41%), Positives = 25/39 (64%)
Query: 105 KKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEE 143
KKK+ ++K K+ E+D++D +ED+ E GEG EE
Sbjct: 183 KKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEGGEE 221
Score = 32.3 bits (74), Expect = 0.11
Identities = 14/46 (30%), Positives = 24/46 (52%)
Query: 100 QASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPE 145
S +KK K++ K+ K +++D +EDE ++ EG G E
Sbjct: 176 GHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEGGEE 221
>gnl|CDD|192573 pfam10406, TAF8_C, Transcription factor TFIID complex subunit 8
C-term. This is the C-terminal, Delta, part of the
TAF8 protein. The N-terminal is generally the histone
fold domain, Bromo_TP (pfam07524). TAF8 is one of the
key subunits of the transcription factor for pol II,
TFIID. TAF8 is one of the several general cofactors
which are typically involved in gene activation to
bring about the communication between gene-specific
transcription factors and components of the general
transcription machinery.
Length = 51
Score = 34.5 bits (80), Expect = 0.002
Identities = 11/32 (34%), Positives = 17/32 (53%)
Query: 22 ETHKQPVTEYEAIREKAATQKRDLARALNKFI 53
+ +PVT+ IREK + R +AL K +
Sbjct: 20 PVYTEPVTDPRKIREKLVEESRLAEKALRKLL 51
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 34.6 bits (79), Expect = 0.013
Identities = 14/40 (35%), Positives = 25/40 (62%)
Query: 104 QKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEE 143
K ++ K +K+ EE EE+ E+EA++EEG+G ++
Sbjct: 204 SKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 35.0 bits (80), Expect = 0.018
Identities = 15/73 (20%), Positives = 33/73 (45%), Gaps = 6/73 (8%)
Query: 88 KDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDT 147
+Q +E ++ + K ++ K+ + E ++ G +DE + + E N +P
Sbjct: 3924 NEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQP--- 3980
Query: 148 SNMVPDSEAIDNP 160
P++E +D P
Sbjct: 3981 ---PPENEDLDLP 3990
Score = 28.8 bits (64), Expect = 1.7
Identities = 16/42 (38%), Positives = 23/42 (54%), Gaps = 8/42 (19%)
Query: 118 KKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDN 159
++ V+E+EE E+ +EE E E PED +AIDN
Sbjct: 4068 EENVQENEESTEDGVKSDEELEQGEVPED--------QAIDN 4101
Score = 26.9 bits (59), Expect = 7.0
Identities = 19/80 (23%), Positives = 38/80 (47%), Gaps = 13/80 (16%)
Query: 93 EEDEDNIQASPQKKKK--PTPVKKKSKKKKVEED-----------EEDGEEDEAEEEEGE 139
+ E+N Q P+ + P +K K+ V +D +E+ EE +AE++E
Sbjct: 3972 DIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPM 4031
Query: 140 GNEEPEDTSNMVPDSEAIDN 159
+E+P + +N + + D+
Sbjct: 4032 QDEDPLEENNTLDEDIQQDD 4051
Score = 26.5 bits (58), Expect = 9.9
Identities = 13/37 (35%), Positives = 21/37 (56%)
Query: 117 KKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPD 153
K V E+E E+E ++ + E N+ PED SN + +
Sbjct: 3869 KDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNE 3905
>gnl|CDD|215299 PLN02543, PLN02543, pfkB-type carbohydrate kinase family protein.
Length = 496
Score = 34.5 bits (79), Expect = 0.023
Identities = 18/55 (32%), Positives = 27/55 (49%), Gaps = 7/55 (12%)
Query: 105 KKKKPTP-------VKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVP 152
+KKKPT ++++KK E D E EED+ E+GE ++ D P
Sbjct: 71 RKKKPTSSPPKAKTTRRRTKKTDQELDPEGAEEDQEAAEDGEDYDDGIDFPYDDP 125
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 34.6 bits (79), Expect = 0.025
Identities = 17/32 (53%), Positives = 21/32 (65%)
Query: 123 EDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDS 154
E+EE+ EE+E EEEE E EE E+ N P S
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEEPLS 895
Score = 34.2 bits (78), Expect = 0.026
Identities = 21/55 (38%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 116 SKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDNPY--LRPVKYP 168
S++++ EE+EE+ EE+E EEEE E E E S P++ Y L P+ +P
Sbjct: 863 SEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQAIYLFLLPIVFP 917
Score = 29.2 bits (65), Expect = 1.1
Identities = 20/54 (37%), Positives = 26/54 (48%), Gaps = 3/54 (5%)
Query: 93 EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
+E E I+A K T + + + E E +G EDE E E GE EE ED
Sbjct: 693 QEGEGEIEAKEADHKGET---EAEEVEHEGETEAEGTEDEGEIETGEEGEEVED 743
Score = 28.0 bits (62), Expect = 3.3
Identities = 13/23 (56%), Positives = 16/23 (69%)
Query: 124 DEEDGEEDEAEEEEGEGNEEPED 146
D E+ EE+E EEEE E EE E+
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEE 884
Score = 27.7 bits (61), Expect = 4.1
Identities = 13/24 (54%), Positives = 17/24 (70%)
Query: 123 EDEEDGEEDEAEEEEGEGNEEPED 146
+ EE+ EE+E EEEE E EE E+
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEE 885
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 32.9 bits (75), Expect = 0.047
Identities = 30/91 (32%), Positives = 38/91 (41%), Gaps = 14/91 (15%)
Query: 89 DQVFEEDEDNIQASPQKKKKPTPVKKK---SKKKKVEEDEEDGEEDEAEEEEGEGNEEPE 145
D +F ED + KKKK TP KKK KKK +E EE E E + E NEE +
Sbjct: 36 DDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEEND 95
Query: 146 D-----------TSNMVPDSEAIDNPYLRPV 165
T+ V + ID L+
Sbjct: 96 KKVDYELPKVQNTAAEVNHEDVIDLSDLKLA 126
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 32.8 bits (75), Expect = 0.049
Identities = 17/74 (22%), Positives = 38/74 (51%), Gaps = 1/74 (1%)
Query: 87 PKDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
+D +F ED + KKK+PT +K + KK ++ +++ +E ++ E +++ E+
Sbjct: 34 VRDDIFPEDIEIPSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPE-SDDDKTEE 92
Query: 147 TSNMVPDSEAIDNP 160
N ++E +
Sbjct: 93 NENDPDNNEESGDS 106
>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
Length = 155
Score = 32.4 bits (74), Expect = 0.061
Identities = 9/44 (20%), Positives = 12/44 (27%)
Query: 100 QASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEE 143
+A + T KKK + E EA E
Sbjct: 112 EADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAEEEA 155
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 32.6 bits (75), Expect = 0.083
Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 4/53 (7%)
Query: 99 IQASPQKKKKPTPVKKKSKK----KKVEEDEEDGEEDEAEEEEGEGNEEPEDT 147
I +KK++ +KK K KK EE+EE+ E+ E E+EE E E E
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKE 467
Score = 29.5 bits (67), Expect = 1.0
Identities = 18/57 (31%), Positives = 29/57 (50%)
Query: 99 IQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSE 155
I+ +K +K +KK KKKK ++ EE+E E+E+ E +E E+ E
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468
Score = 28.3 bits (64), Expect = 2.0
Identities = 18/83 (21%), Positives = 32/83 (38%), Gaps = 14/83 (16%)
Query: 78 EMLPYLCLIPKDQV-----------FEEDEDNIQASPQKKKKPTPVKKKSKKKKVEE--- 123
E+LP+L +I K E+E +K K + +KK EE
Sbjct: 369 EVLPFLSIIFKHNPELAARLAAFLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKK 428
Query: 124 DEEDGEEDEAEEEEGEGNEEPED 146
+++ ++EE E E+ +
Sbjct: 429 EKKKKAFAGKKKEEEEEEEKEKK 451
Score = 28.3 bits (64), Expect = 2.5
Identities = 17/44 (38%), Positives = 29/44 (65%)
Query: 105 KKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTS 148
KKKK KKK ++++ E+++++ E++E EEE E EE E+
Sbjct: 430 KKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473
Score = 26.8 bits (60), Expect = 7.2
Identities = 18/45 (40%), Positives = 27/45 (60%)
Query: 103 PQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDT 147
++KKK KK ++++ EE E+ EE E EEEE E +E E+
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471
Score = 26.8 bits (60), Expect = 7.4
Identities = 17/57 (29%), Positives = 31/57 (54%)
Query: 90 QVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
+ E + + +K+KK K K+++ EE++E EE++ EEEE E+ E+
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469
>gnl|CDD|219009 pfam06404, PSK, Phytosulfokine precursor protein (PSK). This
family consists of several plant specific phytosulfokine
precursor proteins. Phytosulfokines, are active as
either a pentapeptide or a C-terminally truncated
tetrapeptide. These compounds were first isolated
because of their ability to stimulate cell division in
somatic embryo cultures of Asparagus officinalis.
Length = 74
Score = 29.6 bits (67), Expect = 0.16
Identities = 13/35 (37%), Positives = 18/35 (51%)
Query: 109 PTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEE 143
PT +S K + E + E++ E EGEG EE
Sbjct: 19 PTDTGHESSKIEAAESSKVDEDEVVSECEGEGEEE 53
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 31.5 bits (71), Expect = 0.16
Identities = 9/35 (25%), Positives = 18/35 (51%)
Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEA 156
+E + DE E+ + + NE+ +DT+ D +
Sbjct: 276 DEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDC 310
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 31.9 bits (73), Expect = 0.17
Identities = 11/62 (17%), Positives = 29/62 (46%)
Query: 87 PKDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
+ + + + + + + + ++ +KKK + ED ++ E ++ E E +E+ ED
Sbjct: 1327 SEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDED 1386
Query: 147 TS 148
Sbjct: 1387 DD 1388
Score = 28.9 bits (65), Expect = 1.6
Identities = 11/104 (10%), Positives = 30/104 (28%), Gaps = 7/104 (6%)
Query: 36 EKAATQKRDLARALNKFIIKTEPIEHIFDEADDRFTLIVPKPEMLPYLCLIPKDQVFEED 95
K + + + K +L K + K +
Sbjct: 1289 SKRPDGESNGGSKPSSPTKKKVK-------KRLEGSLAALKKKKKSEKKTARKKKSKTRV 1341
Query: 96 EDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGE 139
+ + + + KK + ++D E + ++ ++E+ E
Sbjct: 1342 KQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDE 1385
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 31.4 bits (71), Expect = 0.24
Identities = 18/48 (37%), Positives = 30/48 (62%)
Query: 101 ASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTS 148
SP + + ++ ++++ EE+EE+ EE E+EEEEGE EE E+
Sbjct: 430 ESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVE 477
>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
Length = 852
Score = 30.9 bits (70), Expect = 0.32
Identities = 12/48 (25%), Positives = 22/48 (45%), Gaps = 1/48 (2%)
Query: 127 DGEEDEAEEEEGEGNEEPEDTSNMVPDSEA-IDNPYLRPVKYPQRIKS 173
DG ED++E+ +++ + S E N R ++ P R+ S
Sbjct: 31 DGSEDDSEDPASLDDDDDDRDSGGGMLEEEEDGNMRRRLIRTPPRVDS 78
>gnl|CDD|217093 pfam02540, NAD_synthase, NAD synthase. NAD synthase (EC:6.3.5.1)
is involved in the de novo synthesis of NAD and is
induced by stress factors such as heat shock and glucose
limitation.
Length = 242
Score = 30.4 bits (69), Expect = 0.38
Identities = 20/83 (24%), Positives = 40/83 (48%), Gaps = 2/83 (2%)
Query: 40 TQKRDLARALN--KFIIKTEPIEHIFDEADDRFTLIVPKPEMLPYLCLIPKDQVFEEDED 97
TQ +LA+ LN + IIK P ++ D L +P E+ L + + + EE+
Sbjct: 157 TQVYELAKRLNVPERIIKKPPSADLWPGQTDEDELGIPYDELDQILKGLEELKQSEEEII 216
Query: 98 NIQASPQKKKKPTPVKKKSKKKK 120
++ + ++ + +KS+ K+
Sbjct: 217 DLGLPAEVVERVENLIQKSEHKR 239
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 30.4 bits (69), Expect = 0.41
Identities = 15/24 (62%), Positives = 19/24 (79%)
Query: 122 EEDEEDGEEDEAEEEEGEGNEEPE 145
++DEE+ +EDE EEEE E EEPE
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEPE 369
Score = 30.4 bits (69), Expect = 0.47
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 119 KKVEEDEEDGEEDEAEEEEGEGNEEPE 145
++ +E+EE +E+E EEEE EPE
Sbjct: 345 EQKDEEEEQEDEEEEEEEEEPEEPEPE 371
Score = 28.1 bits (63), Expect = 2.8
Identities = 13/34 (38%), Positives = 21/34 (61%)
Query: 112 VKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPE 145
V + K ++ E+++E+ EE+E E EE E E P
Sbjct: 342 VNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPP 375
Score = 26.9 bits (60), Expect = 5.6
Identities = 14/31 (45%), Positives = 18/31 (58%)
Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTSNMVP 152
E+EE+ EE +EE EG EE E+ VP
Sbjct: 170 GEEEEEEEEVGEADEEDEGEEEEEEEPEEVP 200
Score = 26.9 bits (60), Expect = 7.0
Identities = 14/37 (37%), Positives = 20/37 (54%)
Query: 121 VEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAI 157
E + +GEE+E EEE GE +EE E + E +
Sbjct: 163 AEVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEV 199
Score = 26.6 bits (59), Expect = 8.5
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 119 KKVEEDEEDGEEDEAEEEEGEGNEEPE 145
V +++D EE++ +EEE E EEPE
Sbjct: 340 TWVNPEQKDEEEEQEDEEEEEEEEEPE 366
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 30.4 bits (68), Expect = 0.44
Identities = 19/43 (44%), Positives = 26/43 (60%)
Query: 101 ASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEE 143
A KK KK +K+ + +EDEE+ EE+E EE E E NE+
Sbjct: 115 ALQLPKKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQ 157
Score = 27.4 bits (60), Expect = 4.4
Identities = 20/59 (33%), Positives = 33/59 (55%), Gaps = 1/59 (1%)
Query: 101 ASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDN 159
A+ Q KK KK+ K+ +++E+ EE+E EEEE E E + T+ +S +D+
Sbjct: 114 AALQLPKKAGNAGKKATKEDESDEDEEEEEEE-EEEEAEVEENEQGTNGTSTNSTEVDH 171
>gnl|CDD|220440 pfam09848, DUF2075, Uncharacterized conserved protein (DUF2075).
This domain, found in various prokaryotic proteins
(including putative ATP/GTP binding proteins), has no
known function.
Length = 348
Score = 30.3 bits (69), Expect = 0.44
Identities = 15/41 (36%), Positives = 18/41 (43%), Gaps = 5/41 (12%)
Query: 31 YEAIREKAATQKRDLARALNKFI---IKTEPIEH--IFDEA 66
YEA+ +K+ L R FI K P E I DEA
Sbjct: 48 YEALAGDLKVRKKKLFRKPTSFINNLHKAPPHEDVVIVDEA 88
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 30.5 bits (69), Expect = 0.45
Identities = 15/41 (36%), Positives = 21/41 (51%)
Query: 100 QASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEG 140
Q Q K+K T K K KK V++D+ E +E +E E
Sbjct: 12 QQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAEN 52
Score = 28.1 bits (63), Expect = 2.9
Identities = 14/53 (26%), Positives = 24/53 (45%)
Query: 93 EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPE 145
E+ +N +++ K KKK+KK K + D++D E E E +
Sbjct: 3 NENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQN 55
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 30.0 bits (68), Expect = 0.51
Identities = 16/56 (28%), Positives = 25/56 (44%)
Query: 93 EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTS 148
+E+ED + + K KK+ K K+EE E+ E E + + PED
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDEL 93
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 30.3 bits (69), Expect = 0.51
Identities = 13/36 (36%), Positives = 22/36 (61%)
Query: 120 KVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSE 155
+EED++D EE + E+EE +E+ ED + D +
Sbjct: 122 DLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDD 157
Score = 27.6 bits (62), Expect = 3.9
Identities = 9/38 (23%), Positives = 19/38 (50%)
Query: 123 EDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDNP 160
E+++D +E+ EE+E E ++ + D + I
Sbjct: 124 EEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATR 161
Score = 27.6 bits (62), Expect = 4.1
Identities = 10/36 (27%), Positives = 20/36 (55%)
Query: 111 PVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
P + + E+D++D E DE +EE + ++ +D
Sbjct: 114 PRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDD 149
>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
Reviewed.
Length = 830
Score = 30.5 bits (69), Expect = 0.52
Identities = 11/34 (32%), Positives = 22/34 (64%)
Query: 104 QKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEE 137
+K K PVK+K +++ +D++ + D+ E+EE
Sbjct: 11 RKGKPSRPVKQKVSRRRYRDDDDYDDYDDYEDEE 44
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 30.3 bits (68), Expect = 0.53
Identities = 20/71 (28%), Positives = 38/71 (53%), Gaps = 4/71 (5%)
Query: 92 FEEDEDNIQASPQKKKKPT----PVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDT 147
ED+D ++ S K+K T K+ + +K +E EE+ E+E ++EE + +++ ED
Sbjct: 9 ELEDDDWVRGSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDE 68
Query: 148 SNMVPDSEAID 158
+ D + D
Sbjct: 69 DDDDDDDDEDD 79
>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 227 and
387 amino acids in length.
Length = 213
Score = 30.0 bits (68), Expect = 0.53
Identities = 10/34 (29%), Positives = 18/34 (52%)
Query: 112 VKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPE 145
K +++K D++ E D +E+E E +E P
Sbjct: 180 FYKPKQQEKGAPDDDLDEYDYGDEDEEEDDEPPW 213
Score = 26.5 bits (59), Expect = 7.2
Identities = 17/56 (30%), Positives = 22/56 (39%), Gaps = 3/56 (5%)
Query: 95 DEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNM 150
DE ++ A +++ P V E G E E E E EEPE S M
Sbjct: 106 DEADLLALMEEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEP---EEEPEKKSGM 158
>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
Tim54. Mitochondrial function depends on the import of
hundreds of different proteins synthesised in the
cytosol. Protein import is a multi-step pathway which
includes the binding of precursor proteins to surface
receptors, translocation of the precursor across one or
both mitochondrial membranes, and folding and assembly
of the imported protein inside the mitochondrion. Most
precursor proteins carry amino-terminal targeting
signals, called pre-sequences, and are imported into
mitochondria via import complexes located in both the
outer and the inner membrane (IM). The IM complex, TIM,
is made up of at least two proteins which mediate
translocation of proteins into the matrix by removing
their signal peptide and another pair of proteins, Tim54
and Tim22, that insert the polytopic proteins, that
carry internal targetting information, into the inner
membrane.
Length = 377
Score = 30.1 bits (68), Expect = 0.60
Identities = 21/84 (25%), Positives = 32/84 (38%), Gaps = 10/84 (11%)
Query: 85 LIPKDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEP 144
L P D + +A+P+ + + TP + + E E EE E EE
Sbjct: 192 LGPLDPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEE------- 244
Query: 145 EDTSNMVPDSEAIDNPYLRPVKYP 168
N P + PY+ P +YP
Sbjct: 245 ---ENNKPVKPPVPKPYISPDEYP 265
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 29.9 bits (67), Expect = 0.67
Identities = 8/49 (16%), Positives = 17/49 (34%)
Query: 108 KPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEA 156
+ P + + + + EE +E ++T VP+ A
Sbjct: 373 REQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAA 421
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 29.6 bits (67), Expect = 0.69
Identities = 10/32 (31%), Positives = 18/32 (56%)
Query: 89 DQVFEEDEDNIQASPQKKKKPTPVKKKSKKKK 120
+ F +E A+ ++++ KK+SKKKK
Sbjct: 165 LEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 30.0 bits (67), Expect = 0.79
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 123 EDEEDGEEDEAEEEEGEGNEEPEDTSNM 150
+DE++ E+D+ EE++ E EE E+
Sbjct: 156 DDEDEDEDDDDEEDDEEEEEEEEEIKGF 183
Score = 27.3 bits (60), Expect = 5.1
Identities = 3/21 (14%), Positives = 14/21 (66%)
Query: 127 DGEEDEAEEEEGEGNEEPEDT 147
D ++++ +E++ + ++ E+
Sbjct: 154 DDDDEDEDEDDDDEEDDEEEE 174
Score = 27.3 bits (60), Expect = 5.6
Identities = 10/22 (45%), Positives = 17/22 (77%)
Query: 122 EEDEEDGEEDEAEEEEGEGNEE 143
+EDE++ ++DE ++EE E EE
Sbjct: 157 DEDEDEDDDDEEDDEEEEEEEE 178
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 28.8 bits (65), Expect = 0.84
Identities = 10/27 (37%), Positives = 19/27 (70%)
Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTS 148
+D+E+ EE+E +EE+ + +E E+ S
Sbjct: 114 SDDDEEDEEEEDDEEDDDEDESEEEES 140
Score = 26.1 bits (58), Expect = 8.0
Identities = 12/35 (34%), Positives = 20/35 (57%)
Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEA 156
E++ +D EEDE EE++ E ++E E P +
Sbjct: 111 EDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKV 145
>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
Length = 613
Score = 29.5 bits (67), Expect = 0.85
Identities = 21/81 (25%), Positives = 29/81 (35%), Gaps = 21/81 (25%)
Query: 90 QVFEEDEDNIQA-----------------SPQKKKKPTPVKKKSKKKKVEED----EEDG 128
Q EEDE IQA SP+ + +KK+K++ EE
Sbjct: 64 QEAEEDEQQIQALKRKYLSSPEKSPVADLSPRLEAISLSPRKKAKRRLFEEQDSGLGNSL 123
Query: 129 EEDEAEEEEGEGNEEPEDTSN 149
EE+ + E EG E
Sbjct: 124 EEESTDVVEEEGVESSGGGEG 144
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 28.9 bits (65), Expect = 1.0
Identities = 11/32 (34%), Positives = 16/32 (50%)
Query: 89 DQVFEEDEDNIQASPQKKKKPTPVKKKSKKKK 120
+ F +E + KK TP K ++KKKK
Sbjct: 165 KEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
Members of this family are bacterial proteins with a
conserved motif [KR]FYDLN, sometimes flanked by a pair
of CXXC motifs, followed by a long region of low
complexity sequence in which roughly half the residues
are Asp and Glu, including multiple runs of five or more
acidic residues. The function of members of this family
is unknown.
Length = 104
Score = 28.0 bits (63), Expect = 1.3
Identities = 9/50 (18%), Positives = 25/50 (50%)
Query: 102 SPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMV 151
K + P + + KK +E++ED + ++++ + ++ P+ + V
Sbjct: 39 EVAKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDV 88
Score = 25.7 bits (57), Expect = 6.6
Identities = 9/47 (19%), Positives = 24/47 (51%)
Query: 109 PTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSE 155
P K ++ E+ + E++E E++ +++ +D + +PD +
Sbjct: 38 PEVAKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLD 84
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 28.6 bits (64), Expect = 1.4
Identities = 10/51 (19%), Positives = 29/51 (56%)
Query: 93 EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEE 143
E+ ++ + +K K+ + +K EE+EE+ EE++ + ++ + +++
Sbjct: 147 EDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 28.9 bits (65), Expect = 1.5
Identities = 18/66 (27%), Positives = 26/66 (39%), Gaps = 9/66 (13%)
Query: 104 QKKKKPTPVKKKSKKKKVE-------EDEEDGEEDEAEEEEGEGNEE--PEDTSNMVPDS 154
K+K T K+ + ++ ED +DE EE+E E EE E S D
Sbjct: 356 LNKQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDD 415
Query: 155 EAIDNP 160
E +
Sbjct: 416 EEFEED 421
Score = 26.6 bits (59), Expect = 8.5
Identities = 13/41 (31%), Positives = 21/41 (51%)
Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDNPYL 162
EE+EE+ +EDE +E +EE E+ ++ N L
Sbjct: 396 EEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGNSSL 436
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 28.8 bits (64), Expect = 1.5
Identities = 20/74 (27%), Positives = 32/74 (43%), Gaps = 5/74 (6%)
Query: 93 EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVP 152
++DE+N+ +S + +K V E EE+E E EE E E E +
Sbjct: 111 DDDEENLPSSIAPPEIDPEPRKPIVPDLVLE-----EEEEVEMEEDEEYYEKEPGKVVDE 165
Query: 153 DSEAIDNPYLRPVK 166
SE + L+ +K
Sbjct: 166 KSEEEEEEELKTMK 179
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 28.5 bits (64), Expect = 1.6
Identities = 10/60 (16%), Positives = 30/60 (50%)
Query: 88 KDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDT 147
++V EE+++ + ++ K + + +++ EE++E+ ++ +E E + E
Sbjct: 62 IEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKE 121
Score = 26.6 bits (59), Expect = 6.6
Identities = 13/71 (18%), Positives = 31/71 (43%)
Query: 94 EDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPD 153
+D++ + K+++ + K+ K + ++ED E +E EEE E + + +
Sbjct: 55 DDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKT 114
Query: 154 SEAIDNPYLRP 164
++ P
Sbjct: 115 ESNVEKEITNP 125
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 28.6 bits (64), Expect = 1.7
Identities = 10/54 (18%), Positives = 28/54 (51%), Gaps = 5/54 (9%)
Query: 107 KKPTPVKKKSKKKKVEE-----DEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSE 155
K+P K SK++ + + +++G+ + +++E + +D+ + V D +
Sbjct: 227 KQPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDSDDDVSDED 280
>gnl|CDD|223052 PHA03346, PHA03346, US22 family homolog; Provisional.
Length = 520
Score = 28.6 bits (64), Expect = 1.8
Identities = 13/69 (18%), Positives = 31/69 (44%), Gaps = 2/69 (2%)
Query: 88 KDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDT 147
+ + E+ED +A P P + +D+++G++++ ++ E G + ED
Sbjct: 394 RPRGPPENED--EADGGGGLAPPPEPPRPPPPPFPDDDDEGDDEDDDDWEDLGFDLDEDD 451
Query: 148 SNMVPDSEA 156
+ D +
Sbjct: 452 VYDLKDVDE 460
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 28.8 bits (65), Expect = 1.8
Identities = 15/70 (21%), Positives = 29/70 (41%)
Query: 91 VFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNM 150
+D D+ S KKK K + K + + E D +++ E + + +D N
Sbjct: 61 GMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNY 120
Query: 151 VPDSEAIDNP 160
V D + ++
Sbjct: 121 VKDIDVLNQA 130
Score = 26.9 bits (60), Expect = 7.4
Identities = 18/166 (10%), Positives = 48/166 (28%)
Query: 2 ATSKDTNRSLLKSSVVALLAETHKQPVTEYEAIREKAATQKRDLARALNKFIIKTEPIEH 61
+ + + V + + E +K+ A +
Sbjct: 37 EIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKL 96
Query: 62 IFDEADDRFTLIVPKPEMLPYLCLIPKDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKV 121
+ + + L + V + +D+ + +
Sbjct: 97 KDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDD 156
Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDNPYLRPVKY 167
+ED++D + D+ +EE+ E E + + + + D+ LR +
Sbjct: 157 DEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARK 202
>gnl|CDD|216090 pfam00733, Asn_synthase, Asparagine synthase. This family is
always found associated with pfam00310. Members of this
family catalyze the conversion of aspartate to
asparagine.
Length = 195
Score = 28.4 bits (64), Expect = 1.9
Identities = 11/54 (20%), Positives = 20/54 (37%), Gaps = 1/54 (1%)
Query: 14 SSVVALLAETHKQPVTEYEAIREKAATQKRDLARALNKFIIKTEPIEHIFDEAD 67
S + AL A P+ + E + + A + + T+ E I E +
Sbjct: 31 SLIAALAARQSDPPLKTFSVGFEGSDYDEAPYAELVADH-LGTDHHEIIVTEEE 83
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 28.7 bits (64), Expect = 2.1
Identities = 15/34 (44%), Positives = 19/34 (55%)
Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSE 155
E ++ EEDEAE +GEG E D + DSE
Sbjct: 254 SEGSDESEEDEAEATDGEGEEGEMDAAEASEDSE 287
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 27.9 bits (63), Expect = 2.4
Identities = 10/32 (31%), Positives = 21/32 (65%)
Query: 116 SKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDT 147
++ ++D++D +EDE +EEE + +PE+
Sbjct: 49 DEEDLEDDDDDDEDEDEDDEEEADLGPDPEEA 80
Score = 26.4 bits (59), Expect = 7.1
Identities = 6/35 (17%), Positives = 24/35 (68%)
Query: 121 VEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSE 155
+E + ++ + ++ ++++ + +E+ E+ +++ PD E
Sbjct: 44 IESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPE 78
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 28.4 bits (63), Expect = 2.5
Identities = 11/53 (20%), Positives = 25/53 (47%)
Query: 94 EDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
+ + +++ K +P + + +ED E+ EE++ EEE G + +
Sbjct: 289 DSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKL 341
Score = 26.8 bits (59), Expect = 6.4
Identities = 19/66 (28%), Positives = 31/66 (46%), Gaps = 3/66 (4%)
Query: 94 EDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPD 153
++ + + S +KKKK KKK K + D + DE + ++G+ ED + D
Sbjct: 233 GEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREED---YISD 289
Query: 154 SEAIDN 159
S A N
Sbjct: 290 SSASGN 295
>gnl|CDD|165416 PHA03145, PHA03145, helicase-primase primase subunit; Provisional.
Length = 1058
Score = 28.4 bits (63), Expect = 2.6
Identities = 10/53 (18%), Positives = 18/53 (33%), Gaps = 4/53 (7%)
Query: 108 KPTPVKKKSKKKKVEEDEEDGE----EDEAEEEEGEGNEEPEDTSNMVPDSEA 156
+P E + DG +EE + + + E ++ D EA
Sbjct: 676 RPRAAAVDPGLGPDGEYDGDGPWFPAGGGPGDEEDDEDPDAEALAHGGVDEEA 728
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 27.9 bits (63), Expect = 2.6
Identities = 12/46 (26%), Positives = 25/46 (54%), Gaps = 1/46 (2%)
Query: 95 DEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEG 140
D+D + ++ +++++ EE+EE+ EE+ +EEE G
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEE-EEEPSEEEAAAG 324
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 28.0 bits (62), Expect = 2.7
Identities = 13/60 (21%), Positives = 26/60 (43%)
Query: 100 QASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDN 159
+ Q++ KP KK+ +K+K E E +G+ +E + + D ++ N
Sbjct: 58 KGQQQRESKPKISKKERRKRKQAEKETEGKTEEKKSTAPKNAPVEPADELPEIDESSVAN 117
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 28.1 bits (62), Expect = 2.7
Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 104 QKKKKPTPVKKKSKKKKVEEDEED-GEEDE-AEEEEGEGNEEPEDTSNMVPDSEAID 158
+++K V+K+ K KK++ED ED G E E ++ EE E E+ S D E D
Sbjct: 129 EEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDD 185
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 28.0 bits (63), Expect = 2.7
Identities = 13/69 (18%), Positives = 26/69 (37%)
Query: 87 PKDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
P EE+E ++K K++ +KK E++E + + + E + D
Sbjct: 391 PLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPD 450
Query: 147 TSNMVPDSE 155
D +
Sbjct: 451 GETKKVDPD 459
>gnl|CDD|237628 PRK14156, PRK14156, heat shock protein GrpE; Provisional.
Length = 177
Score = 27.5 bits (61), Expect = 2.9
Identities = 15/52 (28%), Positives = 26/52 (50%)
Query: 112 VKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDNPYLR 163
+ ++ K ++VEE+ E E +E EE E E + ++ +N YLR
Sbjct: 1 MSEEIKNEEVEEEVEATETEETVEEVVEETPEKSELELANERADEFENKYLR 52
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 27.7 bits (61), Expect = 3.0
Identities = 14/57 (24%), Positives = 32/57 (56%)
Query: 93 EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSN 149
E++ ++ A+ + K+ + K+ ++++VE E EE +++ + G E+ ED N
Sbjct: 273 SEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDEN 329
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 27.5 bits (61), Expect = 3.4
Identities = 23/76 (30%), Positives = 30/76 (39%), Gaps = 16/76 (21%)
Query: 67 DDRFTLIVPKPEMLPYLCLIPKDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEE 126
DD L P+ PY P E+DE I+ + + E+ E
Sbjct: 42 DDAIELAEPETSDDPYGNPDPFG---EDDEGRIEVRISEDGEE-------------EEVE 85
Query: 127 DGEEDEAEEEEGEGNE 142
+GEEDE EE E E E
Sbjct: 86 EGEEDEEEEGEEESEE 101
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 27.8 bits (61), Expect = 3.8
Identities = 17/45 (37%), Positives = 25/45 (55%)
Query: 93 EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEE 137
E+E+ I+A+ + KK KK + KK EEDE+ E +E E
Sbjct: 1655 AEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAE 1699
Score = 27.0 bits (59), Expect = 7.8
Identities = 14/51 (27%), Positives = 25/51 (49%)
Query: 93 EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEE 143
E +E +K ++ +K + KK EED++ EE + EE+ + E
Sbjct: 1642 EAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAE 1692
>gnl|CDD|223068 PHA03384, PHA03384, early DNA-binding protein E2A; Provisional.
Length = 445
Score = 27.7 bits (62), Expect = 3.9
Identities = 15/78 (19%), Positives = 27/78 (34%), Gaps = 18/78 (23%)
Query: 109 PTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGN------------------EEPEDTSNM 150
P KKK +++ +EE+ EE+ G +E +D+
Sbjct: 24 ELPPKKKGRRRVSPVEEEEEEEEAEVVAVGFSYPPVRISRGKDGKRPVRPLKEEKDSEKK 83
Query: 151 VPDSEAIDNPYLRPVKYP 168
A+ NP P++
Sbjct: 84 ASTEAAVRNPLSDPLESK 101
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 27.3 bits (61), Expect = 4.1
Identities = 8/29 (27%), Positives = 17/29 (58%)
Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTSNM 150
EE+++ E ++ +EE+ + EE +D
Sbjct: 290 EEEDDYSESEDEDEEDEDEEEEEDDDEGD 318
>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
synthase complex, beta subunit. Nomenclature follows
the description for Methanosarcina thermophila. The
CO-methylating acetyl-CoA synthase is considered the
defining enzyme of the Wood-Ljungdahl pathway, used for
acetate catabolism by sulfate reducing bacteria but for
acetate biosynthesis by acetogenic bacteria such as
oorella thermoacetica (f. Clostridium thermoaceticum)
[Energy metabolism, Chemoautotrophy].
Length = 458
Score = 27.5 bits (61), Expect = 4.1
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 105 KKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPE 145
K+K VK+ ++ EE EE+ E + EE E EG E P
Sbjct: 389 KEKGHPVVKRVVREVDEEEIEEEEEAMQPEEMEMEGFEVPA 429
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 26.2 bits (58), Expect = 4.6
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 114 KKSKKKKVEEDEEDGEEDEAEEEEGEG 140
+++++ EE+EE+ EE+E+EEE G
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEAMAG 99
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 26.2 bits (58), Expect = 4.7
Identities = 19/55 (34%), Positives = 24/55 (43%)
Query: 106 KKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDNP 160
K K VKKK KKKK + ++ E EEEE E D+E I+
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQE 67
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 27.3 bits (61), Expect = 4.7
Identities = 14/43 (32%), Positives = 27/43 (62%)
Query: 116 SKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAID 158
+ ++ EE+EEDG +DE EE++ + EE E+ ++ + E +
Sbjct: 317 GQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEE 359
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 27.3 bits (61), Expect = 4.9
Identities = 10/41 (24%), Positives = 20/41 (48%)
Query: 109 PTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSN 149
P KK + + EE EE+ + + E +G ++ ++ S
Sbjct: 220 PFFTKKLKETSETEEREEETDVEIETTSETKGTKQEQEGST 260
>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
Provisional.
Length = 719
Score = 27.5 bits (61), Expect = 5.0
Identities = 18/71 (25%), Positives = 29/71 (40%), Gaps = 12/71 (16%)
Query: 98 NIQASPQKKKKPTPVKKKSKKKK--------VEEDEEDGEEDEAEEEEGEGNEEPEDTSN 149
+ S +K +KP + ++ED +D EE + E E +EPE T N
Sbjct: 410 SASVSSKKNQKPQESSPSLDQTNAASRNSVMMDEDSDDDEEFQIPESE----QEPETTKN 465
Query: 150 MVPDSEAIDNP 160
D+ + P
Sbjct: 466 ETKDTAMEEEP 476
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 27.1 bits (60), Expect = 5.4
Identities = 18/45 (40%), Positives = 24/45 (53%), Gaps = 4/45 (8%)
Query: 92 FEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEE 136
+E E N QA PQ K T K+ +VE DEE+ + D+ E E
Sbjct: 72 YERREGN-QACPQCK---TRYKRIKGSPRVEGDEEEDDIDDLENE 112
>gnl|CDD|218652 pfam05602, CLPTM1, Cleft lip and palate transmembrane protein 1
(CLPTM1). This family consists of several eukaryotic
cleft lip and palate transmembrane protein 1 sequences.
Cleft lip with or without cleft palate is a common birth
defect that is genetically complex. The nonsyndromic
forms have been studied genetically using linkage and
candidate-gene association studies with only partial
success in defining the loci responsible for orofacial
clefting. CLPTM1 encodes a transmembrane protein and has
strong homology to two Caenorhabditis elegans genes,
suggesting that CLPTM1 may belong to a new gene family.
This family also contains the human cisplatin resistance
related protein CRR9p which is associated with
CDDP-induced apoptosis.
Length = 437
Score = 26.9 bits (60), Expect = 6.2
Identities = 10/25 (40%), Positives = 14/25 (56%)
Query: 112 VKKKSKKKKVEEDEEDGEEDEAEEE 136
KKK KKK + + + EE E E+
Sbjct: 149 PKKKVKKKNLLGGKSEKEEPEEEKT 173
>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator. This
family of trans-activating transcriptional regulator
(TATR), also known as intermediate early protein 1, are
common to the Nucleopolyhedroviruses.
Length = 571
Score = 26.8 bits (59), Expect = 6.7
Identities = 12/45 (26%), Positives = 19/45 (42%)
Query: 93 EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEE 137
+ D D+ +S KK P K++ KK + E+E E
Sbjct: 133 DSDSDSSDSSKGKKLVNKPKKRQKYKKATIQSPTSLTEEENYNTE 177
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The CAF-1
or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression in
addition to DNA-based checkpoints.
Length = 76
Score = 25.3 bits (56), Expect = 6.9
Identities = 11/24 (45%), Positives = 19/24 (79%)
Query: 123 EDEEDGEEDEAEEEEGEGNEEPED 146
E+EE+GE+ E+E+EE E ++ +D
Sbjct: 49 EEEEEGEDLESEDEEDEEEDDDDD 72
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel, four-helix
bundle structure with a large hydrophobic patch in which
23 residues of one monomer form van der Waals contacts
with the other monomer. This CENP-B dimer configuration
may be suitable for capturing two distant CENP-B boxes
during centromeric heterochromatin formation.
Length = 101
Score = 25.9 bits (56), Expect = 7.0
Identities = 9/30 (30%), Positives = 20/30 (66%)
Query: 115 KSKKKKVEEDEEDGEEDEAEEEEGEGNEEP 144
S + E+D+++ EED+ E+++ + +E P
Sbjct: 13 DSDSDEEEDDDDEDEEDDDEDDDEDDDEVP 42
>gnl|CDD|178137 PLN02522, PLN02522, ATP citrate (pro-S)-lyase.
Length = 608
Score = 26.7 bits (59), Expect = 7.1
Identities = 16/46 (34%), Positives = 20/46 (43%)
Query: 9 RSLLKSSVVALLAETHKQPVTEYEAIREKAATQKRDLARALNKFII 54
RS SS+ AL T + E + E Q ARA NK +I
Sbjct: 89 RSAAASSMEALKQPTIRVVAIIAEGVPESDTKQLIAYARANNKVVI 134
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 26.8 bits (59), Expect = 7.3
Identities = 11/40 (27%), Positives = 21/40 (52%)
Query: 119 KKVEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAID 158
+ +E+ E + E D + EE E ++ E +VP +A+
Sbjct: 666 EAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVR 705
>gnl|CDD|107059 PHA01746, PHA01746, hypothetical protein.
Length = 131
Score = 26.0 bits (57), Expect = 7.5
Identities = 11/36 (30%), Positives = 24/36 (66%), Gaps = 3/36 (8%)
Query: 89 DQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEED 124
+ +E+ED+++ +K++K ++K +KK VEE+
Sbjct: 98 NNAIQEEEDDMEQQEEKEEKG---REKGRKKNVEEE 130
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 26.9 bits (60), Expect = 7.5
Identities = 14/64 (21%), Positives = 22/64 (34%), Gaps = 13/64 (20%)
Query: 83 LCLIPKDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNE 142
L L P+ + P P ED++D EDE E+++ E
Sbjct: 249 LVLAPRATRLPAPPQPPEEEPPPPPPP------------PEDDDDPPEDEEEQDDAED-R 295
Query: 143 EPED 146
E+
Sbjct: 296 ALEE 299
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 26.7 bits (59), Expect = 7.6
Identities = 15/41 (36%), Positives = 20/41 (48%), Gaps = 3/41 (7%)
Query: 121 VEEDEEDGEEDEA---EEEEGEGNEEPEDTSNMVPDSEAID 158
E+D D +E++ EE+E E EE ED V S D
Sbjct: 40 AEDDVVDEDEEDEAVVEEDENELTEEEEDEEGEVKASPDAD 80
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein is
found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 26.5 bits (59), Expect = 7.9
Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 6/54 (11%)
Query: 113 KKKSKKKK-----VEEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAI-DNP 160
KKK KKKK ++DEE+ E++ +E++ D +N+ P+ + I NP
Sbjct: 1 KKKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNP 54
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 26.7 bits (59), Expect = 8.7
Identities = 8/54 (14%), Positives = 27/54 (50%)
Query: 93 EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
++ + + + K+K++K++++E + ++ E +E+ E ++ E
Sbjct: 379 QDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432
Score = 26.3 bits (58), Expect = 9.5
Identities = 10/60 (16%), Positives = 30/60 (50%)
Query: 88 KDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDT 147
+ ++ +E +Q +K + S++ + + EE +++E E+++ E +E ++
Sbjct: 367 RQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEK 426
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 26.2 bits (58), Expect = 8.7
Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 1/51 (1%)
Query: 96 EDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPED 146
E++ + S + K S+ V+ E EE+E EE E +EPED
Sbjct: 132 EESKKGSAETLKLEEKDSSSSEGSTVDL-EPPEEEEEEIAEEEEEVKEPED 181
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 25.7 bits (57), Expect = 8.7
Identities = 11/32 (34%), Positives = 19/32 (59%)
Query: 109 PTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEG 140
+ ++K EE+EE+ E++E+EEE G
Sbjct: 69 AAAAAAAAAEEKKEEEEEEEEKEESEEEAAAG 100
>gnl|CDD|215121 PLN02189, PLN02189, cellulose synthase.
Length = 1040
Score = 26.5 bits (58), Expect = 8.8
Identities = 21/104 (20%), Positives = 42/104 (40%), Gaps = 2/104 (1%)
Query: 64 DEADDRFTLIVPKPEMLPYLCLIPKDQVFEEDEDNIQASPQKKKKPTPVKKKSKKKKVEE 123
D+ +++F ++ P P + E + +S K+ P PV + K E+
Sbjct: 140 DDENNQFPPVITGVRSRPVSGEFPIGSGYGHGEQMLSSSLHKRVHPYPVSEPGSAKWDEK 199
Query: 124 DEEDGEE--DEAEEEEGEGNEEPEDTSNMVPDSEAIDNPYLRPV 165
E +E D+ + ++G +P+D + + P R V
Sbjct: 200 KEGGWKERMDDWKMQQGNLGPDPDDYDADMALIDEARQPLSRKV 243
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 26.6 bits (59), Expect = 9.0
Identities = 14/40 (35%), Positives = 22/40 (55%)
Query: 106 KKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPE 145
K+K PV ++ ++ EE+EE+ EE+E E PE
Sbjct: 392 KEKEHPVVERWAAEEEEEEEEEEEEEEEPVAEVMMMPAPE 431
>gnl|CDD|218379 pfam05009, EBV-NA3, Epstein-Barr virus nuclear antigen 3 (EBNA-3).
This family contains EBNA-3A, -3B, and -3C which are
latent infection nuclear proteins important for
Epstein-Barr virus (EBV)-induced B-cell immortalisation
and the immune response to EBV infection.
Length = 254
Score = 26.2 bits (58), Expect = 9.5
Identities = 13/49 (26%), Positives = 22/49 (44%), Gaps = 3/49 (6%)
Query: 122 EEDEEDGEEDEAEEEEGEGNEEPEDTSNMVPDSEAIDNPYLRPVKYPQR 170
E D++ E +E E+ E + E+ +VP E RP + +R
Sbjct: 209 EPDDDATVETSSESEDSESESDDEELPYIVPRMEP---VQGRPPVFLRR 254
>gnl|CDD|191365 pfam05756, S-antigen, S-antigen protein. S-antigens are heat
stable proteins that are found in the blood of
individuals infected with malaria.
Length = 94
Score = 25.3 bits (55), Expect = 9.6
Identities = 12/37 (32%), Positives = 19/37 (51%)
Query: 113 KKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEPEDTSN 149
K K +K E+ +E+GE + +EE+ E D N
Sbjct: 49 SKNGKGQKYEDLQEEGEGENDDEEDSNSEESNNDEEN 85
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 26.3 bits (58), Expect = 9.8
Identities = 14/52 (26%), Positives = 29/52 (55%)
Query: 93 EEDEDNIQASPQKKKKPTPVKKKSKKKKVEEDEEDGEEDEAEEEEGEGNEEP 144
EE++ N QA+ ++ ++ +S ++DE E+++ +EE+ E E P
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESP 52
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.306 0.127 0.342
Gapped
Lambda K H
0.267 0.0857 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,000,509
Number of extensions: 861483
Number of successful extensions: 4484
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3114
Number of HSP's successfully gapped: 673
Length of query: 175
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 85
Effective length of database: 6,945,742
Effective search space: 590388070
Effective search space used: 590388070
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 56 (25.1 bits)