RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12225
(396 letters)
>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
modification].
Length = 429
Score = 158 bits (401), Expect = 3e-44
Identities = 90/289 (31%), Positives = 130/289 (44%), Gaps = 76/289 (26%)
Query: 2 EMRASLQDKEEQKTLKSKMRERVRPKLGKIDIDYQKLHDAFFKWQTKPRLSIHGDLYYEG 61
+MR ++++E +L+ ++RERV+PK+G +D+DY+KLHDAFF KP LS G+ Y E
Sbjct: 209 QMRRMMREREAGMSLRERIRERVQPKMGGLDVDYRKLHDAFFSLGPKPYLSKFGEFYEEV 268
Query: 62 KEFETKLKEKKPGDLSAELRTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYPN 121
+K+K+PG +SAELR ALG+ G PPPWL MQ++G PPSYP+
Sbjct: 269 DNDYRFVKKKRPGAISAELREALGIDSGT------------PPPWLFNMQKHGMPPSYPD 316
Query: 122 LKIPGLN-APIP-QGCSFGYHAGGWGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLY 179
LKIPGLN APIP +G +GY GW +P +
Sbjct: 317 LKIPGLNWAPIPLEGDVYGYQPPGWHEP----LFEV------------------------ 348
Query: 180 GDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEE--EEGEDESGLITP 237
+TA GEL EE D+E+ E + E +
Sbjct: 349 -------------------GPETAEGELLQFEAEEIYNDEEQYTRPVYENEDFEERVDDV 389
Query: 238 AEGLMTPSGITSVPAGLETPDSIELRKKKIESDMETNEAP---QLFQVL 283
+ G P ++E+ K+ + +A L+ VL
Sbjct: 390 SNG----------GLAEFAPAAVEVAPKEDAKVTKRGKAKREKLLYSVL 428
>gnl|CDD|112836 pfam04037, DUF382, Domain of unknown function (DUF382). This
domain is specific to the human splicing factor 3b
subunit 2 and it's orthologues. Splicing factor 3b
subunit 2 or SAP145 is a suppressor of U2 snRNA
mutations. Pre-mRNA splicing is catalyzed by a large
ribonucleoprotein complex called the spliceosome.
Spliceosomes are multi-component enzymes that catalyze
pre-mRNA splicing and form step-wise by the ordered
interaction of UsnRNPs and non-snRNP proteins with short
conserved regions of the pre-mRNA at the 5' and 3'
splice sites and branch site.
Length = 129
Score = 124 bits (314), Expect = 8e-35
Identities = 46/64 (71%), Positives = 56/64 (87%)
Query: 1 MEMRASLQDKEEQKTLKSKMRERVRPKLGKIDIDYQKLHDAFFKWQTKPRLSIHGDLYYE 60
EMR +L +KE +KTLK K RERV+PK+GK+DIDYQKLHDAFFK+QTKP L+ +GD+YYE
Sbjct: 66 AEMRDALLEKEAEKTLKQKQRERVQPKMGKLDIDYQKLHDAFFKYQTKPDLTKYGDVYYE 125
Query: 61 GKEF 64
GKEF
Sbjct: 126 GKEF 129
>gnl|CDD|202864 pfam04046, PSP, PSP. Proline rich domain found in numerous
spliceosome associated proteins.
Length = 48
Score = 84.1 bits (209), Expect = 9e-21
Identities = 37/60 (61%), Positives = 39/60 (65%), Gaps = 12/60 (20%)
Query: 72 KPGDLSAELRTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYPNLKIPGLNAPI 131
KPG LS ELR ALGMP G PPPWL M+R G PPSYP+LKIPGLNAPI
Sbjct: 1 KPGVLSDELREALGMPEGD------------PPPWLYRMRRLGYPPSYPDLKIPGLNAPI 48
>gnl|CDD|128850 smart00581, PSP, proline-rich domain in spliceosome associated
proteins.
Length = 54
Score = 82.8 bits (205), Expect = 4e-20
Identities = 36/65 (55%), Positives = 39/65 (60%), Gaps = 12/65 (18%)
Query: 68 LKEKKPGDLSAELRTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYPNLKIPGL 127
K KPG +S ELR ALG+P G PPPWL M+R G PP YP LKIPGL
Sbjct: 1 FKHFKPGRISDELREALGLPPG------------QPPPWLYRMRRLGYPPGYPRLKIPGL 48
Query: 128 NAPIP 132
NAPIP
Sbjct: 49 NAPIP 53
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 44.6 bits (105), Expect = 8e-05
Identities = 33/106 (31%), Positives = 43/106 (40%), Gaps = 14/106 (13%)
Query: 140 HAGGWGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLYGDVFGTSAETKTAEVNEEDI 199
AG G+ DE G + G G+ E D ET E+N E+
Sbjct: 784 QAGEDGEMKGDE-GAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQ 842
Query: 200 EKTA-------------WGELESESEEESEEDDEEEEEEEEGEDES 232
+ G+ E E EEE EE++EEEEEEEE E+E
Sbjct: 843 GEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEE 888
Score = 34.6 bits (79), Expect = 0.11
Identities = 34/105 (32%), Positives = 44/105 (41%), Gaps = 9/105 (8%)
Query: 163 TGGWGKPPVDETGKPLYGDVFGTSA------ETKTAEVNEEDIEKTAWGELESESEEESE 216
TG G+ P + G+ G+ G A ETK +E +I GE E E E E++
Sbjct: 645 TGEEGERPTEAEGEN--GEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAK 702
Query: 217 EDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDSIE 261
E D + E E E E T AEG I + G E D E
Sbjct: 703 EADHKGETEAEEV-EHEGETEAEGTEDEGEIETGEEGEEVEDEGE 746
Score = 30.3 bits (68), Expect = 2.1
Identities = 15/31 (48%), Positives = 20/31 (64%)
Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEE 225
+ E+ E+ E E E EEE EE++EEE EE
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 40.0 bits (94), Expect = 0.002
Identities = 12/41 (29%), Positives = 24/41 (58%)
Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
E+ +E A+ +LE + +++ E D+E+EE + +DE
Sbjct: 109 ELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDD 149
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 38.1 bits (88), Expect = 0.010
Identities = 15/38 (39%), Positives = 26/38 (68%)
Query: 194 VNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
+ EED+ ++ + E+E E+DD+EE++EEE E+E
Sbjct: 140 LAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEE 177
Score = 30.8 bits (69), Expect = 1.5
Identities = 12/22 (54%), Positives = 18/22 (81%)
Query: 206 ELESESEEESEEDDEEEEEEEE 227
+ + + +++ EEDDEEEEEEEE
Sbjct: 157 DEDEDEDDDDEEDDEEEEEEEE 178
Score = 30.0 bits (67), Expect = 3.2
Identities = 16/33 (48%), Positives = 21/33 (63%), Gaps = 5/33 (15%)
Query: 206 ELESESEEESEEDDEEEEEEEE-----GEDESG 233
E E E +++ E+D+EEEEEEEE EDE
Sbjct: 158 EDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEED 190
Score = 29.6 bits (66), Expect = 3.5
Identities = 18/53 (33%), Positives = 28/53 (52%), Gaps = 9/53 (16%)
Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEED---DEEEEEEEEGED 230
D F + + + +++D E + E EEE EE D+E+EE+E GED
Sbjct: 149 DNFVIDDDDEDEDEDDDD------EEDDEEEEEEEEEIKGFDDEDEEDEGGED 195
Score = 29.2 bits (65), Expect = 4.8
Identities = 14/34 (41%), Positives = 22/34 (64%), Gaps = 5/34 (14%)
Query: 205 GELESESEEESEEDDEEEEE-----EEEGEDESG 233
+ + + EE+ EE++EEEEE +E+ EDE G
Sbjct: 160 EDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGG 193
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 36.0 bits (84), Expect = 0.018
Identities = 11/44 (25%), Positives = 20/44 (45%)
Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAE 239
A E + E ++DD+E+E+E++ E+ P E
Sbjct: 36 AAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEE 79
Score = 33.7 bits (78), Expect = 0.095
Identities = 15/44 (34%), Positives = 26/44 (59%), Gaps = 5/44 (11%)
Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGED 230
A T A +E D E +LE + +++ +ED+++EEE + G D
Sbjct: 38 AATAAAIESELDEE-----DLEDDDDDDEDEDEDDEEEADLGPD 76
Score = 33.3 bits (77), Expect = 0.12
Identities = 12/39 (30%), Positives = 21/39 (53%)
Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
N TA EE+ E+DD+++E+E+E ++E
Sbjct: 33 NAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEA 71
Score = 30.6 bits (70), Expect = 0.95
Identities = 6/37 (16%), Positives = 21/37 (56%)
Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
+ + A ESE + E+ +++++++E+ +++
Sbjct: 31 DPNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDD 67
Score = 30.6 bits (70), Expect = 1.0
Identities = 9/37 (24%), Positives = 20/37 (54%)
Query: 191 TAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEE 227
A ++ E + E +++ +ED++E++EEE
Sbjct: 35 AAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEA 71
Score = 29.8 bits (68), Expect = 1.8
Identities = 7/42 (16%), Positives = 20/42 (47%)
Query: 191 TAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
+ ++ A +E + +D+++++E+E ED+
Sbjct: 27 SGFIDPNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDE 68
Score = 28.3 bits (64), Expect = 5.7
Identities = 6/35 (17%), Positives = 19/35 (54%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
+ E E +E+D E++++++ +++ A+
Sbjct: 39 ATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADL 73
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 35.9 bits (83), Expect = 0.019
Identities = 13/43 (30%), Positives = 28/43 (65%)
Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGED 230
E K E+ ED+++ + E E EEE E++D +++++++ +D
Sbjct: 157 EKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDDD 199
Score = 34.0 bits (78), Expect = 0.098
Identities = 17/63 (26%), Positives = 29/63 (46%)
Query: 170 PVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGE 229
+ GK L F T E + D + + + E E E ++++E++EEEE E
Sbjct: 122 INKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEE 181
Query: 230 DES 232
+E
Sbjct: 182 EEE 184
Score = 30.1 bits (68), Expect = 1.8
Identities = 12/40 (30%), Positives = 22/40 (55%)
Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEE 226
+ K E + D E E E E EEE E+ D+++++++
Sbjct: 158 KKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 36.9 bits (86), Expect = 0.020
Identities = 16/66 (24%), Positives = 31/66 (46%)
Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLE 255
++D + E S+EE +E+DE+ ++E++ E+E + S + +P
Sbjct: 337 DDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSELPFTFP 396
Query: 256 TPDSIE 261
P S E
Sbjct: 397 CPKSHE 402
Score = 30.7 bits (70), Expect = 1.7
Identities = 11/37 (29%), Positives = 19/37 (51%)
Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
E D + E EEE E+ ++E+EE++ +D
Sbjct: 306 FEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDL 342
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 36.2 bits (84), Expect = 0.025
Identities = 18/42 (42%), Positives = 26/42 (61%)
Query: 192 AEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
E EE +EK E E + E +ED+E+E+EEEE +D+ G
Sbjct: 276 EEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEG 317
Score = 33.9 bits (78), Expect = 0.12
Identities = 12/37 (32%), Positives = 22/37 (59%)
Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
+E+ + + E E EE+ + E+E+EE+E E+E
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEE 310
Score = 29.6 bits (67), Expect = 3.1
Identities = 11/34 (32%), Positives = 17/34 (50%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDESGLITPAE 239
E E EE EE+D+ E E+E E++ +
Sbjct: 280 EELLEKAEEEEEEDDYSESEDEDEEDEDEEEEED 313
Score = 28.9 bits (65), Expect = 4.6
Identities = 11/38 (28%), Positives = 21/38 (55%)
Query: 194 VNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
V ++ + E E EEE + + E+E+EE+ ++E
Sbjct: 272 VGGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEE 309
Score = 28.1 bits (63), Expect = 8.0
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 205 GELESESEEESEEDDEEEEEEEEGEDESG 233
G E + EE E+ +EEEEE++ E E
Sbjct: 273 GGDEEDLEELLEKAEEEEEEDDYSESEDE 301
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 34.6 bits (80), Expect = 0.029
Identities = 13/29 (44%), Positives = 22/29 (75%)
Query: 203 AWGELESESEEESEEDDEEEEEEEEGEDE 231
A E ES+ +EE EE++++EE+++E E E
Sbjct: 108 ASEEDESDDDEEDEEEEDDEEDDDEDESE 136
Score = 34.6 bits (80), Expect = 0.030
Identities = 13/28 (46%), Positives = 21/28 (75%)
Query: 205 GELESESEEESEEDDEEEEEEEEGEDES 232
E + + E+E EEDDEE+++E+E E+E
Sbjct: 112 DESDDDEEDEEEEDDEEDDDEDESEEEE 139
Score = 33.5 bits (77), Expect = 0.076
Identities = 12/28 (42%), Positives = 22/28 (78%)
Query: 205 GELESESEEESEEDDEEEEEEEEGEDES 232
E ES+++ E+++EE++EE++ EDES
Sbjct: 108 ASEEDESDDDEEDEEEEDDEEDDDEDES 135
Score = 33.5 bits (77), Expect = 0.079
Identities = 11/23 (47%), Positives = 19/23 (82%)
Query: 210 ESEEESEEDDEEEEEEEEGEDES 232
E +EE E+D+E+++E+E E+ES
Sbjct: 118 EEDEEEEDDEEDDDEDESEEEES 140
Score = 30.0 bits (68), Expect = 1.2
Identities = 12/27 (44%), Positives = 21/27 (77%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDES 232
L + E+ES++D+E+EEEE++ ED+
Sbjct: 105 HLVASEEDESDDDEEDEEEEDDEEDDD 131
Score = 29.6 bits (67), Expect = 1.6
Identities = 13/34 (38%), Positives = 20/34 (58%)
Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEE 226
V E+ E E E E ++E ++D++E EEEE
Sbjct: 106 LVASEEDESDDDEEDEEEEDDEEDDDEDESEEEE 139
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 35.2 bits (81), Expect = 0.056
Identities = 20/71 (28%), Positives = 35/71 (49%), Gaps = 2/71 (2%)
Query: 172 DETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESE-EDDEEEEEEEEGED 230
DE G + E ++D ++ E +SESE ES+ ED EE+E+E++ E
Sbjct: 635 DENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIE-DSESESESDGEDGEEDEQEDDAEA 693
Query: 231 ESGLITPAEGL 241
G++ + +
Sbjct: 694 NEGVVPIDKAV 704
Score = 29.1 bits (65), Expect = 5.7
Identities = 16/42 (38%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 192 AEVNEEDIEKTAWGELESESEEESEEDDEEEEEE-EEGEDES 232
A+ N+ ++ GE E E + E +DDE+E E E+ E ES
Sbjct: 634 ADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESES 675
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 34.8 bits (80), Expect = 0.073
Identities = 16/45 (35%), Positives = 22/45 (48%)
Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
AE + EE +++ A LE+E E +EED E EDE
Sbjct: 386 AELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDE 430
Score = 32.1 bits (73), Expect = 0.59
Identities = 17/44 (38%), Positives = 22/44 (50%)
Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
E AE E E EL E+E E++DE EE + E E+E
Sbjct: 404 ERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEE 447
Score = 28.6 bits (64), Expect = 7.2
Identities = 14/42 (33%), Positives = 20/42 (47%)
Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGE 229
E + AE EED L + E+ +E +E + EEEE
Sbjct: 407 EAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEEA 448
>gnl|CDD|215971 pfam00524, PPV_E1_N, E1 Protein, N terminal domain.
Length = 131
Score = 33.0 bits (76), Expect = 0.079
Identities = 10/41 (24%), Positives = 20/41 (48%)
Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLI 235
+ E W +E+ +++ +DE+E+ + G D S I
Sbjct: 8 DGEGGGCGGWFFVEAICSDDNISEDEDEDLFDTGSDLSDFI 48
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 31.2 bits (71), Expect = 0.080
Identities = 14/21 (66%), Positives = 18/21 (85%)
Query: 212 EEESEEDDEEEEEEEEGEDES 232
EEE ED+EE E+EEEGE++S
Sbjct: 22 EEEKREDEEENEDEEEGEEQS 42
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region
of the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association with
mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is
dispensable for binding to Spt4.
Length = 92
Score = 32.4 bits (74), Expect = 0.085
Identities = 15/31 (48%), Positives = 19/31 (61%)
Query: 212 EEESEEDDEEEEEEEEGEDESGLITPAEGLM 242
+ E+E DDEEEEEEEE +D L E +
Sbjct: 3 DTEAEVDDEEEEEEEEEDDLEDLSDEDEFID 33
Score = 28.6 bits (64), Expect = 1.9
Identities = 14/32 (43%), Positives = 22/32 (68%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESGLITPAE 239
E+E ++E EE++EEE++ E+ DE I AE
Sbjct: 5 EAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAE 36
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel, four-helix
bundle structure with a large hydrophobic patch in which
23 residues of one monomer form van der Waals contacts
with the other monomer. This CENP-B dimer configuration
may be suitable for capturing two distant CENP-B boxes
during centromeric heterochromatin formation.
Length = 101
Score = 32.4 bits (73), Expect = 0.093
Identities = 13/27 (48%), Positives = 23/27 (85%)
Query: 205 GELESESEEESEEDDEEEEEEEEGEDE 231
GE +S+S+ + EEDD++E+EE++ ED+
Sbjct: 8 GEEDSDSDSDEEEDDDDEDEEDDDEDD 34
Score = 32.0 bits (72), Expect = 0.12
Identities = 13/30 (43%), Positives = 23/30 (76%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESGLITP 237
+S+S+EE ++DDE+EE+++E +DE P
Sbjct: 13 DSDSDEEEDDDDEDEEDDDEDDDEDDDEVP 42
Score = 30.9 bits (69), Expect = 0.27
Identities = 11/35 (31%), Positives = 26/35 (74%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
+ +S+ EE+ +++DEE+++E++ ED+ + P+ G
Sbjct: 13 DSDSDEEEDDDDEDEEDDDEDDDEDDDEVPVPSFG 47
Score = 29.4 bits (65), Expect = 1.1
Identities = 10/26 (38%), Positives = 22/26 (84%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDE 231
E E +S+ +S+E++++++E+EE +DE
Sbjct: 7 EGEEDSDSDSDEEEDDDDEDEEDDDE 32
Score = 27.4 bits (60), Expect = 5.6
Identities = 10/25 (40%), Positives = 20/25 (80%)
Query: 207 LESESEEESEEDDEEEEEEEEGEDE 231
LE E + +S+ D+EE++++E+ ED+
Sbjct: 6 LEGEEDSDSDSDEEEDDDDEDEEDD 30
Score = 27.4 bits (60), Expect = 5.9
Identities = 9/32 (28%), Positives = 23/32 (71%)
Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEE 227
EED + + E + + E+E ++D++++E+++E
Sbjct: 9 EEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 34.4 bits (79), Expect = 0.096
Identities = 15/42 (35%), Positives = 23/42 (54%)
Query: 189 TKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGED 230
K E D E+ E + E EE++ + +EE +EEEE E+
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEE 69
Score = 28.6 bits (64), Expect = 7.8
Identities = 13/29 (44%), Positives = 21/29 (72%)
Query: 204 WGELESESEEESEEDDEEEEEEEEGEDES 232
W E E E E EE++EE+EE++E E+++
Sbjct: 26 WVEKEVEKEVPDEEEEEEKEEKKEEEEKT 54
Score = 28.2 bits (63), Expect = 8.8
Identities = 14/28 (50%), Positives = 16/28 (57%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDESG 233
L E E E E DEEEEEE+E + E
Sbjct: 24 YLWVEKEVEKEVPDEEEEEEKEEKKEEE 51
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 34.1 bits (79), Expect = 0.10
Identities = 15/34 (44%), Positives = 19/34 (55%), Gaps = 4/34 (11%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESGLITPAEGL 241
+ E EE++EEEEEEEE +E A GL
Sbjct: 296 AQAAAAEEEEEEEEEEEEEEPSEEEA----AAGL 325
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The CAF-1
or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression in
addition to DNA-based checkpoints.
Length = 76
Score = 31.0 bits (71), Expect = 0.13
Identities = 13/28 (46%), Positives = 20/28 (71%), Gaps = 2/28 (7%)
Query: 206 ELESESEEESE--EDDEEEEEEEEGEDE 231
+ E E EEE E E ++EE+EEE+ +D+
Sbjct: 45 DAEWEEEEEGEDLESEDEEDEEEDDDDD 72
Score = 27.6 bits (62), Expect = 3.1
Identities = 11/28 (39%), Positives = 18/28 (64%), Gaps = 6/28 (21%)
Query: 211 SEEESEEDDE------EEEEEEEGEDES 232
S+ E EE++E E+EE+EE +D+
Sbjct: 44 SDAEWEEEEEGEDLESEDEEDEEEDDDD 71
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 34.1 bits (78), Expect = 0.15
Identities = 18/78 (23%), Positives = 32/78 (41%)
Query: 167 GKPPVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEE 226
P + + +E + + EE+ E+ E + EEE E+++EEEE E
Sbjct: 419 SDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEA 478
Query: 227 EGEDESGLITPAEGLMTP 244
+ E + +EG
Sbjct: 479 DNGSEEEMEGSSEGDGDG 496
Score = 29.9 bits (67), Expect = 2.7
Identities = 18/50 (36%), Positives = 26/50 (52%)
Query: 184 GTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
+ ++ E + E E EEE EE++EEEEEE+E E+E G
Sbjct: 418 SSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEG 467
Score = 29.5 bits (66), Expect = 3.7
Identities = 24/112 (21%), Positives = 33/112 (29%), Gaps = 4/112 (3%)
Query: 171 VDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEE----EEEE 226
P E+ E EE+ E+ E E E E+ EE++E E EEE
Sbjct: 426 STSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEE 485
Query: 227 EGEDESGLITPAEGLMTPSGITSVPAGLETPDSIELRKKKIESDMETNEAPQ 278
G E S AG+ + + E P
Sbjct: 486 MEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPRGSSVQPESPQEEPL 537
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 34.0 bits (78), Expect = 0.16
Identities = 22/49 (44%), Positives = 26/49 (53%), Gaps = 6/49 (12%)
Query: 189 TKTAEVNEEDIEKTAWGELESESEE-----ESEEDDEEEEEEEEGEDES 232
TK E E+D A+ LE E E EED+EEEEEE+E E S
Sbjct: 362 TKAKE-TEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPS 409
Score = 32.8 bits (75), Expect = 0.32
Identities = 21/67 (31%), Positives = 25/67 (37%), Gaps = 11/67 (16%)
Query: 166 WGKPPVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEE 225
W K E L SA E D E E E EEE EE+DE+E
Sbjct: 361 WTKAKETEQDYILDA----FSALEIEDANTERDDE-------EEEDEEEEEEEDEDEGPS 409
Query: 226 EEGEDES 232
+E D+
Sbjct: 410 KEHSDDE 416
Score = 32.8 bits (75), Expect = 0.33
Identities = 11/28 (39%), Positives = 19/28 (67%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDESG 233
E + E EE+ EE++EE+E+E ++ S
Sbjct: 387 ERDDEEEEDEEEEEEEDEDEGPSKEHSD 414
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 33.9 bits (78), Expect = 0.17
Identities = 15/22 (68%), Positives = 16/22 (72%)
Query: 208 ESESEEESEEDDEEEEEEEEGE 229
E E EEE E +EEEEEEEE E
Sbjct: 306 EEEEEEEEEVPEEEEEEEEEEE 327
Score = 32.7 bits (75), Expect = 0.37
Identities = 16/27 (59%), Positives = 18/27 (66%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESGL 234
E E EEE EE+ EEEEEEE E+E
Sbjct: 304 EPEEEEEEEEEVPEEEEEEEEEEERTF 330
Score = 32.3 bits (74), Expect = 0.42
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDES 232
E EEE EE EEEEEEEE E+ +
Sbjct: 303 PEPEEEEEEEEEVPEEEEEEEEEEERT 329
Score = 31.5 bits (72), Expect = 0.78
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDE 231
E E E E EE++EEEEEE E+E
Sbjct: 308 EEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 31.5 bits (72), Expect = 0.87
Identities = 14/25 (56%), Positives = 17/25 (68%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDES 232
E EEE EE++E EEEEE E+E
Sbjct: 302 PPEPEEEEEEEEEVPEEEEEEEEEE 326
Score = 30.8 bits (70), Expect = 1.4
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDE 231
S E EE++EEEEE E E+E
Sbjct: 298 PPPSPPEPEEEEEEEEEVPEEEEE 321
Score = 30.0 bits (68), Expect = 2.2
Identities = 13/26 (50%), Positives = 16/26 (61%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
EEE EE++ EEEEEE E+E
Sbjct: 303 PEPEEEEEEEEEVPEEEEEEEEEEER 328
Score = 30.0 bits (68), Expect = 2.5
Identities = 13/25 (52%), Positives = 16/25 (64%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDES 232
S E E EE++EEE EEE E+E
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEE 324
Score = 28.1 bits (63), Expect = 8.9
Identities = 13/29 (44%), Positives = 16/29 (55%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESGLIT 236
EE EE++EE EEEE E+E T
Sbjct: 301 SPPEPEEEEEEEEEVPEEEEEEEEEEERT 329
>gnl|CDD|197548 smart00157, PRP, Major prion protein. The prion protein is a major
component of scrapie-associated fibrils in
Creutzfeldt-Jakob disease, kuru, Gerstmann-Straussler
syndrome and bovine spongiform encephalopathy.
Length = 218
Score = 32.5 bits (74), Expect = 0.25
Identities = 23/64 (35%), Positives = 26/64 (40%), Gaps = 13/64 (20%)
Query: 119 YPNLKIPGLNAPIPQGCSFGY---------HAGGWGKPPVDETGKPLYGDVFGTGG---- 165
YP PG N PQG +G H GGWG+P G+P G GG
Sbjct: 17 YPGQGSPGGNRYPPQGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQGGGTHNQ 76
Query: 166 WGKP 169
W KP
Sbjct: 77 WNKP 80
>gnl|CDD|221620 pfam12527, DUF3727, Protein of unknown function (DUF3727). This
domain family is found in bacteria and eukaryotes, and
is approximately 100 amino acids in length.
Length = 100
Score = 31.0 bits (71), Expect = 0.26
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 202 TAWGELESESEEESEEDDEEEEEEEEGEDE 231
T GEL E+E E +EE++++E+ E+E
Sbjct: 13 TVRGELPEPEEDEILELEEEDDDDEDEEEE 42
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 32.2 bits (73), Expect = 0.26
Identities = 19/39 (48%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 196 EEDIEKTAWGELESESEEESEED--DEEEEEEEEGEDES 232
EED + A E E E E ED DEE+EE EE E+E+
Sbjct: 21 EEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEET 59
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 32.7 bits (75), Expect = 0.26
Identities = 14/41 (34%), Positives = 24/41 (58%)
Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
E +E+ K E ES+++ +E++E E EE+E D+ G
Sbjct: 76 EKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEG 116
Score = 32.7 bits (75), Expect = 0.28
Identities = 13/41 (31%), Positives = 23/41 (56%)
Query: 192 AEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
E+ E+ E+ + + E ++DDEEEE E E +++S
Sbjct: 72 LELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDS 112
Score = 30.8 bits (70), Expect = 1.2
Identities = 20/55 (36%), Positives = 29/55 (52%), Gaps = 2/55 (3%)
Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAE--GLMTPS 245
+EE+ ++ A E EE SEED+EE EEEE E E + ++TP+
Sbjct: 132 SEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILTPA 186
Score = 29.6 bits (67), Expect = 2.7
Identities = 13/45 (28%), Positives = 25/45 (55%), Gaps = 3/45 (6%)
Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
+ E +++D E+ W E E +E+S+++ E + E + E ES
Sbjct: 88 AEQGLESDDDDDEEEEW---EVEEDEDSDDEGEWIDVESDKEIES 129
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 33.1 bits (75), Expect = 0.27
Identities = 14/45 (31%), Positives = 26/45 (57%)
Query: 189 TKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
T+ E +ED+E+ E EEE ++D++ E+++E+ D G
Sbjct: 105 TQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEG 149
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 33.0 bits (75), Expect = 0.27
Identities = 13/43 (30%), Positives = 23/43 (53%)
Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLI 235
E +DI E+E+ + EESE ++E E+ E+E + +
Sbjct: 361 EKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRIC 403
Score = 30.0 bits (67), Expect = 2.5
Identities = 17/73 (23%), Positives = 28/73 (38%), Gaps = 3/73 (4%)
Query: 166 WGKPPVDETGKPLYGDVFGTSAETKTAEVNE-EDIEKTAWGELESESEEESEEDDEEEEE 224
WG + K D E + ++ E +E ESE EEE E+ ++E +
Sbjct: 341 WGN--EEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDH 398
Query: 225 EEEGEDESGLITP 237
+ D+ L
Sbjct: 399 SKRICDDDELENH 411
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 32.7 bits (75), Expect = 0.29
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 210 ESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSV 250
+EEE EE++EEEEEEE + + P L GI +
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVMMMPAPEMQLPASGGIKII 443
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 32.4 bits (74), Expect = 0.31
Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 1/49 (2%)
Query: 185 TSAETKTAEVNEEDIE-KTAWGELESESEEESEEDDEEEEEEEEGEDES 232
S + +TAE+ E E K A + E + ++E++DEE EEE E EDE
Sbjct: 53 KSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEE 101
Score = 30.9 bits (70), Expect = 0.95
Identities = 19/46 (41%), Positives = 27/46 (58%)
Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
E K A +E+ +K + + ESEEE+EE+DEE +E E E E
Sbjct: 68 EEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEK 113
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 32.0 bits (73), Expect = 0.32
Identities = 15/40 (37%), Positives = 18/40 (45%)
Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
E EED K E E +EE ED EE E + E +
Sbjct: 22 ENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKD 61
Score = 32.0 bits (73), Expect = 0.35
Identities = 12/45 (26%), Positives = 25/45 (55%), Gaps = 2/45 (4%)
Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
E + + E++E+ E+E++EE + +E+ E EE ++E
Sbjct: 2 EKECKDAKHENMEEDC--CKENENKEEDKGKEEDLEFEEIEKEEI 44
Score = 29.3 bits (66), Expect = 2.4
Identities = 14/44 (31%), Positives = 17/44 (38%)
Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
E + NE E E E E EE E+ EE E + E
Sbjct: 14 EEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIE 57
Score = 28.6 bits (64), Expect = 4.7
Identities = 16/55 (29%), Positives = 24/55 (43%), Gaps = 1/55 (1%)
Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGL 241
+ K + EED K + E + +EE E +E E+EE + E E L
Sbjct: 6 KDAKHENM-EEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEEL 59
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 32.5 bits (74), Expect = 0.34
Identities = 18/36 (50%), Positives = 24/36 (66%)
Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
EED ++ E E +E+ EE++ EEEEEEE EDE
Sbjct: 250 EEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285
Score = 29.4 bits (66), Expect = 3.3
Identities = 16/47 (34%), Positives = 29/47 (61%)
Query: 184 GTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGED 230
G+ + K ++ E+D + + E + E+E EE+ EEEEEEE+ ++
Sbjct: 239 GSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285
Score = 28.3 bits (63), Expect = 7.9
Identities = 14/38 (36%), Positives = 24/38 (63%)
Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
++++ + G+ E E + +E++EE EEEEE EDE
Sbjct: 247 SDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDED 284
>gnl|CDD|219311 pfam07149, Pes-10, Pes-10. This family consists of several
Caenorhabditis elegans pes-10 and related proteins.
Members of this family are typically around 400 residues
in length. The function of this family is unknown.
Length = 370
Score = 32.5 bits (74), Expect = 0.37
Identities = 14/42 (33%), Positives = 21/42 (50%)
Query: 198 DIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAE 239
+++ E+ E E+E E D + EE E ESG+ T E
Sbjct: 157 KVDQLLKDEVLEEDEQEEEADKLLDNLEEHSETESGIFTDEE 198
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 32.2 bits (73), Expect = 0.40
Identities = 13/20 (65%), Positives = 17/20 (85%)
Query: 208 ESESEEESEEDDEEEEEEEE 227
S+SE E +E++EEEEEEEE
Sbjct: 223 GSDSESEEDEEEEEEEEEEE 242
Score = 31.1 bits (70), Expect = 1.0
Identities = 14/19 (73%), Positives = 18/19 (94%)
Query: 208 ESESEEESEEDDEEEEEEE 226
+SESEE+ EE++EEEEEEE
Sbjct: 225 DSESEEDEEEEEEEEEEEE 243
Score = 30.7 bits (69), Expect = 1.2
Identities = 16/22 (72%), Positives = 19/22 (86%)
Query: 209 SESEEESEEDDEEEEEEEEGED 230
S S+ ESEED+EEEEEEEE E+
Sbjct: 222 SGSDSESEEDEEEEEEEEEEEE 243
Score = 30.3 bits (68), Expect = 1.8
Identities = 13/23 (56%), Positives = 18/23 (78%)
Query: 209 SESEEESEEDDEEEEEEEEGEDE 231
S S +SE +++EEEEEEE E+E
Sbjct: 220 SSSGSDSESEEDEEEEEEEEEEE 242
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 32.5 bits (74), Expect = 0.40
Identities = 15/41 (36%), Positives = 22/41 (53%)
Query: 192 AEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
E ++ IE+ A E + + ++ ED E EE EG DES
Sbjct: 220 EETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDES 260
Score = 30.2 bits (68), Expect = 2.1
Identities = 19/63 (30%), Positives = 31/63 (49%), Gaps = 3/63 (4%)
Query: 172 DETGKPLYGDVFGTSAETKTAEVNEEDI-EKTAWGELESESEEESEEDDEE--EEEEEEG 228
+ + D A+ + + ++ D E + G ESE +ESEED+ E + E EEG
Sbjct: 216 MDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEG 275
Query: 229 EDE 231
E +
Sbjct: 276 EMD 278
Score = 29.4 bits (66), Expect = 4.1
Identities = 16/54 (29%), Positives = 27/54 (50%)
Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
AE + EED ++ + + ++ E+SE EE E +E E++ T EG
Sbjct: 219 AEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEG 272
>gnl|CDD|222440 pfam13897, GOLD_2, Golgi-dynamics membrane-trafficking. Sec14-like
Golgi-trafficking domain The GOLD domain is always found
combined with lipid- or membrane-association domains.
Length = 136
Score = 31.2 bits (71), Expect = 0.42
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 209 SESEEESEEDDEEEEEEEEGEDESG 233
SES +E EE++ EEEE E G+ E+G
Sbjct: 52 SESSDEEEEEEAEEEEAETGDVEAG 76
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 30.3 bits (69), Expect = 0.44
Identities = 12/27 (44%), Positives = 19/27 (70%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESGL 234
+ E++ EE++EEE+EE E E +GL
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEEAAAGL 101
Score = 29.2 bits (66), Expect = 1.4
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESGL 234
E + EEE EE+++EE EEE L
Sbjct: 78 EEKKEEEEEEEEKEESEEEAAAGLGAL 104
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 32.3 bits (74), Expect = 0.50
Identities = 18/34 (52%), Positives = 24/34 (70%), Gaps = 2/34 (5%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDESG--LITP 237
+ + E E+E EE++EEEEE EE E E G L+TP
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEPEPEEGPPLLTP 379
Score = 31.6 bits (72), Expect = 0.71
Identities = 17/39 (43%), Positives = 23/39 (58%), Gaps = 3/39 (7%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTP 244
+ + EEE E+++EEEEEEE E E P L+TP
Sbjct: 344 PEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPP---LLTP 379
>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal. The family represents
the N-terminal region of RXT2-like proteins. In S.
cerevisiae, RXT2 has been demonstrated to be involved in
conjugation with cellular fusion (mating) and invasive
growth. A high throughput localisation study has
localised RXT2 to the nucleus.
Length = 141
Score = 30.8 bits (70), Expect = 0.56
Identities = 26/88 (29%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDESGLITPAE---GLMTPSGITSVPAGLETPDSIEL 262
+ + ++E +EDDEE + E++ E+ LI E L PS + + PA T S L
Sbjct: 51 GGDIDDDDEDDEDDEEADAEDDDENPYKLIRLEEILAPLTHPSDLPTHPAISRTFKSKTL 110
Query: 263 RK------KKIESDMET-NEAPQLFQVL 283
+ IE + N+ +L QVL
Sbjct: 111 SELALEAIALIEKEQNNLNKLNKLLQVL 138
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 32.1 bits (73), Expect = 0.63
Identities = 14/47 (29%), Positives = 24/47 (51%)
Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
E +EE++E A ++E +DE+E+EE++ E ES
Sbjct: 6 NTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESP 52
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 31.8 bits (72), Expect = 0.68
Identities = 35/151 (23%), Positives = 59/151 (39%), Gaps = 18/151 (11%)
Query: 127 LNAPIPQGCSFGYHAGGWGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLYGDVFGTS 186
++ P+G + A + V+ + K + + G + T L S
Sbjct: 1 MSTSKPKGSAPRGFAP---RTTVESSQKRIQQN----NGDKEDSSTSTSS-LSV-----S 47
Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSG 246
A KT+ +E+I+ ES EE ED+ E E+ + +S I E + T
Sbjct: 48 AVEKTSNA-KEEIQVDFQHNSESAVEEVEAEDEIEVEQNQSDVLKSSSIVKEESISTDMD 106
Query: 247 IT---SVPAGLETPDSIELRKKKIESDMETN 274
S+ L+ + LRK++IE E N
Sbjct: 107 GIDDDSLDRKLKL-ERENLRKREIEELAEEN 136
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 31.5 bits (71), Expect = 0.77
Identities = 20/107 (18%), Positives = 37/107 (34%), Gaps = 9/107 (8%)
Query: 165 GWGKPPVDETG-----KPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDD 219
G P V+ET + GD+ G + + L SE+ +E+E +
Sbjct: 357 GESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEV 416
Query: 220 EEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDSIELRKKK 266
E+ + + A P+ P G+ D R+++
Sbjct: 417 PEKAAPIPDPAKPDELAVAGPGDDPAE----PDGIRRQDDFSSRRRR 459
Score = 31.5 bits (71), Expect = 0.80
Identities = 14/52 (26%), Positives = 20/52 (38%), Gaps = 2/52 (3%)
Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEE-EEEEEGEDE 231
D G S E +E DIE+ G+L ++ + D E EE
Sbjct: 354 DRDGEST-PAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAAL 404
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 31.4 bits (72), Expect = 0.86
Identities = 17/53 (32%), Positives = 27/53 (50%)
Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAE 239
+ + E E+ + A + E E EEE E+ +EE+EEEEE +E +
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472
Score = 30.3 bits (69), Expect = 1.8
Identities = 22/54 (40%), Positives = 28/54 (51%)
Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
AE K E +E +K G+ + E EEE +E EEE+EEEE E E E
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472
Score = 30.3 bits (69), Expect = 2.1
Identities = 20/66 (30%), Positives = 27/66 (40%)
Query: 174 TGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
TG E + EE EK + EEE EE+ E++EEE+E E+E
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Query: 234 LITPAE 239
E
Sbjct: 463 EEEKEE 468
>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
factor RRN3. This family consists of several eukaryotic
proteins which are homologous to the yeast RRN3 protein.
RRN3 is one of the RRN genes specifically required for
the transcription of rDNA by RNA polymerase I (Pol I) in
Saccharomyces cerevisiae.
Length = 554
Score = 31.5 bits (72), Expect = 0.90
Identities = 8/40 (20%), Positives = 23/40 (57%)
Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
+ +++ E+ + + + E++ + D+++EEE + E E
Sbjct: 223 DDIDDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVER 262
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 29.7 bits (67), Expect = 0.90
Identities = 13/31 (41%), Positives = 18/31 (58%)
Query: 200 EKTAWGELESESEEESEEDDEEEEEEEEGED 230
A E +E++E EE+ EEE EEE +D
Sbjct: 72 AAAAGAEAAAEADEAEEEEKEEEAEEESDDD 102
Score = 27.8 bits (62), Expect = 3.7
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 205 GELESESEEESEEDDEEEEEEEEGEDESG 233
G + +E+EE+++EEE EEE +D+
Sbjct: 76 GAEAAAEADEAEEEEKEEEAEEESDDDML 104
Score = 27.4 bits (61), Expect = 5.6
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESGL 234
E+ +E + E++E+EEE EE D+ L
Sbjct: 78 EAAAEADEAEEEEKEEEAEEESDDDML 104
Score = 27.4 bits (61), Expect = 6.3
Identities = 11/27 (40%), Positives = 14/27 (51%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESGL 234
E E EE +EE EEE + + GL
Sbjct: 81 AEADEAEEEEKEEEAEEESDDDMLFGL 107
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 29.6 bits (67), Expect = 0.91
Identities = 16/30 (53%), Positives = 20/30 (66%)
Query: 205 GELESESEEESEEDDEEEEEEEEGEDESGL 234
+E EEE EE++EEEEEE E E +GL
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEEAMAGL 100
Score = 27.7 bits (62), Expect = 3.8
Identities = 14/27 (51%), Positives = 16/27 (59%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESGL 234
E E EEE EE++EEE EEE L
Sbjct: 77 EEEEEEEEEEEEEEESEEEAMAGLGAL 103
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
fusion protein; Provisional.
Length = 131
Score = 29.9 bits (67), Expect = 0.92
Identities = 14/57 (24%), Positives = 24/57 (42%)
Query: 184 GTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
A E E+ + + +++E +E D +E +EEE DE+ T E
Sbjct: 58 AEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEA 114
Score = 29.5 bits (66), Expect = 1.3
Identities = 13/46 (28%), Positives = 23/46 (50%)
Query: 184 GTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGE 229
A A ++ E+T E+ E ++E ++ +EEE+EE E
Sbjct: 85 ADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEAE 130
Score = 28.8 bits (64), Expect = 2.3
Identities = 13/46 (28%), Positives = 20/46 (43%)
Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
+ AE + D E+ E E D E EE +EE ++E+
Sbjct: 84 DADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEA 129
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts
harbouring premature signals for translation termination
are recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 30.4 bits (69), Expect = 0.94
Identities = 14/51 (27%), Positives = 29/51 (56%)
Query: 186 SAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLIT 236
+E+++ + E++ + ES EEE + D+E++EE + E+E +T
Sbjct: 2 GSESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVT 52
>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
kDa subunit. This family contains several DNA-directed
RNA polymerase 19 kDa polypeptides. The Poxvirus
DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
DNA-template-directed extension of the 3'-end of an RNA
strand by one nucleotide at a time.
Length = 167
Score = 30.4 bits (69), Expect = 0.96
Identities = 16/40 (40%), Positives = 21/40 (52%), Gaps = 1/40 (2%)
Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGL 234
+ +DI + +SE EE EE DEE+ E E D S L
Sbjct: 3 DSDDIIDYESDDDDSEEYEEEEE-DEEDAESLESSDVSSL 41
Score = 29.3 bits (66), Expect = 2.4
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 211 SEEESEEDDEEEEEEEEGEDESGLITPAE 239
+ ES++DD EE EEEE ++E +
Sbjct: 8 IDYESDDDDSEEYEEEEEDEEDAESLESS 36
>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2. This
domain family is found in eukaryotes, and is typically
between 138 and 153 amino acids in length. The family is
found in association with pfam00493. Mini-chromosome
maintenance (MCM) proteins are essential for DNA
replication. These proteins use ATPase activity to
perform this function.
Length = 145
Score = 30.0 bits (68), Expect = 0.97
Identities = 17/29 (58%), Positives = 20/29 (68%), Gaps = 4/29 (13%)
Query: 206 ELESESEEESEED----DEEEEEEEEGED 230
E E + EEE E+D DEE EEEE+GED
Sbjct: 1 EDEPDEEEELEDDIDDLDEEAEEEEDGED 29
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 31.0 bits (70), Expect = 1.0
Identities = 16/48 (33%), Positives = 31/48 (64%)
Query: 184 GTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
S + + E +++D E+ + E ES+SEE +E+D ++EEEE+ ++
Sbjct: 156 KKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSED 203
>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
Length = 1560
Score = 31.3 bits (71), Expect = 1.0
Identities = 16/67 (23%), Positives = 25/67 (37%), Gaps = 1/67 (1%)
Query: 177 PLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLIT 236
LY + E K ++ + D E + + E ++E EG D + L
Sbjct: 872 SLYATLAAELKENKDSKEGDADAEVAE-VDAAPGGAVDHEPPVAKQEGNAEGGDGAALDA 930
Query: 237 PAEGLMT 243
A LMT
Sbjct: 931 AAGRLMT 937
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 30.4 bits (69), Expect = 1.0
Identities = 14/55 (25%), Positives = 25/55 (45%)
Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLM 242
E EV E ++E+ E+E E+ EE E + +EE+ + + + E
Sbjct: 6 EQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDELEERY 60
Score = 27.7 bits (62), Expect = 9.8
Identities = 11/47 (23%), Positives = 17/47 (36%)
Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGL 234
E E + + + EE EE+ E E +EE + L
Sbjct: 3 EKNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAEL 49
>gnl|CDD|191365 pfam05756, S-antigen, S-antigen protein. S-antigens are heat
stable proteins that are found in the blood of
individuals infected with malaria.
Length = 94
Score = 29.1 bits (65), Expect = 1.1
Identities = 13/60 (21%), Positives = 33/60 (55%)
Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
+++GT ++ N ++ + + +L+ E E E++++++ EE ++E+GLI
Sbjct: 34 NIYGTKYYLRSGFFNSKNGKGQKYEDLQEEGEGENDDEEDSNSEESNNDEENGLIEGQGE 93
Score = 27.6 bits (61), Expect = 3.5
Identities = 14/37 (37%), Positives = 22/37 (59%), Gaps = 1/37 (2%)
Query: 197 EDIEKTAWGELESESEEESEEDDEEEEEE-EEGEDES 232
ED+++ GE + E + SEE + +EE EG+ ES
Sbjct: 58 EDLQEEGEGENDDEEDSNSEESNNDEENGLIEGQGES 94
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 30.2 bits (68), Expect = 1.1
Identities = 15/44 (34%), Positives = 27/44 (61%)
Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
+ + E +EE+IE+ E E E E+ EE++E+EE+ + +D
Sbjct: 51 DEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDI 94
Score = 28.7 bits (64), Expect = 3.9
Identities = 21/83 (25%), Positives = 37/83 (44%)
Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPA 252
+ ++E+ E+ E+E + E EE+ E+EEEEE ++E + + I +
Sbjct: 48 QEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQ 107
Query: 253 GLETPDSIELRKKKIESDMETNE 275
+ I KK E +T E
Sbjct: 108 DDNAQNLISKNYKKNEKSKKTAE 130
>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22. The
mitochondrial protein translocase family, which is
responsible for movement of nuclear encoded pre-proteins
into mitochondria, is very complex with at least 19
components. These proteins include several chaperone
proteins, four proteins of the outer membrane
translocase (Tom) import receptor, five proteins of the
Tom channel complex, five proteins of the inner membrane
translocase (Tim) and three "motor" proteins. This
family represents the Tom22 proteins. The N terminal
region of Tom22 has been shown to have chaperone-like
activity, and the C terminal region faces the
intermembrane face.
Length = 136
Score = 29.9 bits (68), Expect = 1.1
Identities = 8/40 (20%), Positives = 20/40 (50%)
Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
EV +E ++ EE ++D+++ + + + D+S
Sbjct: 6 EVEDETFQEKPAAPKNLAQEESDDDDEDDTDTDSDISDDS 45
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 29.4 bits (66), Expect = 1.1
Identities = 14/27 (51%), Positives = 18/27 (66%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESGL 234
E E+E E+ EEE+EEEE E +GL
Sbjct: 76 EKAEEKEEEKKKEEEKEEEEEEALAGL 102
Score = 27.1 bits (60), Expect = 7.9
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
E EE EE+ ++EEE+EE E+E+
Sbjct: 74 AEEKAEEKEEEKKKEEEKEEEEEEAL 99
Score = 26.7 bits (59), Expect = 9.7
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESGL 234
E + EE+ +E+++EEEEEE L
Sbjct: 79 EEKEEEKKKEEEKEEEEEEALAGLGAL 105
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 30.6 bits (69), Expect = 1.2
Identities = 21/88 (23%), Positives = 35/88 (39%), Gaps = 6/88 (6%)
Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGL-ITPAEGLMTPSGITSVPAG 253
++E + E E E E E+DEEEE EEE E+ L TP E + G
Sbjct: 66 DDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEELTEASEQLEEHEEG 125
Query: 254 LETPDSIELRKKKIESDMETNEAPQLFQ 281
+ + ++ +E + ++
Sbjct: 126 FQ-----AMVEQAVERGLSAETITRIQA 148
Score = 29.8 bits (67), Expect = 2.4
Identities = 19/60 (31%), Positives = 26/60 (43%), Gaps = 3/60 (5%)
Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLE 255
ED E + + EEE E+ EE+EEEE E+ + TP +T LE
Sbjct: 64 GEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGD---TPEELTEASEQLE 120
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 30.9 bits (70), Expect = 1.2
Identities = 28/90 (31%), Positives = 41/90 (45%), Gaps = 3/90 (3%)
Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
D T + T + E +++ +E +EEE EE EEEEEEE E E+ + P G
Sbjct: 372 DKIATEEDATTIDELREFLKEKGHPVVERWAEEEEEE--EEEEEEEAAEAEAPMEEPVPG 429
Query: 241 LMTPSGITSVPAGLETPDSIELRKKKIESD 270
P + AG I L+ KI ++
Sbjct: 430 FEVPE-MPMPAAGGGGGIKIVLKNAKITAE 458
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 31.1 bits (70), Expect = 1.2
Identities = 12/53 (22%), Positives = 26/53 (49%), Gaps = 3/53 (5%)
Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
D + E + ++ + ED+ + E E++S +E + +D +E E + D
Sbjct: 947 DESDETDEDEESDESSEDLSED---ESENDSSDEEDGEDWDELESKAAYDSRP 996
Score = 30.0 bits (67), Expect = 2.8
Identities = 18/76 (23%), Positives = 32/76 (42%), Gaps = 13/76 (17%)
Query: 161 FGTGGWG---KPPVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEE 217
F GGW DE +E + +E +++ + + ES+E SE+
Sbjct: 915 FEDGGWSFLMVGSDDE----------SDESEEEVSEYEASSDDESDETDEDEESDESSED 964
Query: 218 DDEEEEEEEEGEDESG 233
E+E E + ++E G
Sbjct: 965 LSEDESENDSSDEEDG 980
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 31.2 bits (71), Expect = 1.3
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDES 232
+ SE +++SE DD E+E++E+ ED+
Sbjct: 1362 DSSSEDDDDSEVDDSEDEDDEDDEDDD 1388
Score = 28.9 bits (65), Expect = 7.1
Identities = 6/26 (23%), Positives = 16/26 (61%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
S +++ E D+ E+E++E +++
Sbjct: 1363 SSSEDDDDSEVDDSEDEDDEDDEDDD 1388
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 30.7 bits (70), Expect = 1.4
Identities = 12/46 (26%), Positives = 27/46 (58%)
Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
+ +++++DI+ E + E +++ + DDE+EE++E E E
Sbjct: 135 DDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEK 180
Score = 28.4 bits (64), Expect = 8.4
Identities = 7/48 (14%), Positives = 29/48 (60%)
Query: 185 TSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
A+ + +++D++ + + + +++ ++DD++ ++E+E + E+
Sbjct: 128 NQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEA 175
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 31.1 bits (70), Expect = 1.4
Identities = 18/85 (21%), Positives = 38/85 (44%), Gaps = 3/85 (3%)
Query: 190 KTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITS 249
++A NE D+ E ++ E++DEEE ++ G D+ I P
Sbjct: 3926 QSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDE--IQPDIQENNSQPPPE 3983
Query: 250 VPAGLETPDSIELRKKKIESDMETN 274
L+ P+ ++L +K+ + +++
Sbjct: 3984 NED-LDLPEDLKLDEKEGDVSKDSD 4007
Score = 29.2 bits (65), Expect = 5.8
Identities = 13/49 (26%), Positives = 21/49 (42%), Gaps = 6/49 (12%)
Query: 188 ETKTAEVNEED------IEKTAWGELESESEEESEEDDEEEEEEEEGED 230
E TA ++ D +E G + E+E D EEE ++ E+
Sbjct: 3843 EEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEE 3891
Score = 28.8 bits (64), Expect = 7.5
Identities = 16/49 (32%), Positives = 25/49 (51%), Gaps = 5/49 (10%)
Query: 198 DIEKTAWGELESE-----SEEESEEDDEEEEEEEEGEDESGLITPAEGL 241
DI++T L +E E+ E E+E+ E+E D + ITPA +
Sbjct: 4268 DIKQTGEDTLPTEFGSINQSEKVFELSEDEDIEDELPDYNVKITPAMPI 4316
Score = 28.4 bits (63), Expect = 8.2
Identities = 27/106 (25%), Positives = 44/106 (41%), Gaps = 12/106 (11%)
Query: 185 TSAETKTAEVNEEDIEKTAWGELESESEEESEEDD----EEEEEEEEGEDESGLITPAEG 240
++A ++ V++ED K + E E+E E D ++E + + E+ S E
Sbjct: 3927 SAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENED 3986
Query: 241 LMTPS--------GITSVPAGLETPDSIELRKKKIESDMETNEAPQ 278
L P G S + LE D + K E+D E +E Q
Sbjct: 3987 LDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQ 4032
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 30.5 bits (69), Expect = 1.5
Identities = 19/72 (26%), Positives = 36/72 (50%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDSIELRKK 265
E E E EEE E+DD++ ++E+ +++ T + SG+ V E + +L KK
Sbjct: 136 ESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDEDADLSKK 195
Query: 266 KIESDMETNEAP 277
+ + + + P
Sbjct: 196 DVLEEPKMFKKP 207
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 30.6 bits (69), Expect = 1.6
Identities = 27/155 (17%), Positives = 51/155 (32%), Gaps = 26/155 (16%)
Query: 94 KTFSRETVIPPPWLIAMQRYGPPPSYP--------NLKIPGLNAPI--PQGCSFGYHAGG 143
KT R+ P + + + P P +I P+ P S Y A
Sbjct: 258 KTVWRQKNTPAQTPMPERSWQTPAQTPARRISTPMTEEIKSWQTPLQTPAMYSSDYQAP- 316
Query: 144 WGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLYGDVFGTSAETKTA----EVNEEDI 199
KP T + L + F + + + ++ + + E + ++
Sbjct: 317 --KPEPIYTWEELLRERFPSDLFAISSLPDSDSEASDSGPTRKRKRRRVPPLPEYSSDED 374
Query: 200 EKTAWGELESESEEESEEDDEEEEEEEEGEDESGL 234
E + +E E D E+E + ED++ L
Sbjct: 375 E---------DDSDEDEVDYEKERKRRREEDKNFL 400
>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family. This
family includes the HABP4 family of hyaluronan-binding
proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
HABP4 has been observed to bind hyaluronan (a
glucosaminoglycan), but it is not known whether this is
its primary role in vivo. It has also been observed to
bind RNA, but with a lower affinity than that for
hyaluronan. PAI-1 mRNA-binding protein specifically
binds the mRNA of type-1 plasminogen activator inhibitor
(PAI-1), and is thought to be involved in regulation of
mRNA stability. However, in both cases, the sequence
motifs predicted to be important for ligand binding are
not conserved throughout the family, so it is not known
whether members of this family share a common function.
Length = 106
Score = 28.9 bits (65), Expect = 1.7
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
E E+ EK E ++ EE ++E EEEE EE ED+
Sbjct: 39 EEQGEEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKE 78
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 30.5 bits (69), Expect = 1.7
Identities = 13/41 (31%), Positives = 22/41 (53%)
Query: 192 AEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
EV+E++ E+ E E EE + ++E + E+G ES
Sbjct: 368 EEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSES 408
Score = 30.1 bits (68), Expect = 2.2
Identities = 18/62 (29%), Positives = 29/62 (46%)
Query: 209 SESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDSIELRKKKIE 268
E + +ED+EEE+ +E E+E G + EG + +S + DS K+
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESA 425
Query: 269 SD 270
SD
Sbjct: 426 SD 427
Score = 30.1 bits (68), Expect = 2.5
Identities = 13/26 (50%), Positives = 15/26 (57%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
E E+E EE +E EEEEGED
Sbjct: 370 VDEDEDEEEEQRSDEHEEEEGEDSEE 395
Score = 28.9 bits (65), Expect = 4.6
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
E E EEE + DE EEEE E +E G
Sbjct: 372 EDEDEEEEQRSDEHEEEEGEDSEEEG 397
Score = 28.9 bits (65), Expect = 5.5
Identities = 12/39 (30%), Positives = 18/39 (46%)
Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
E + L+ EE +ED++EEEE+ E E
Sbjct: 350 KESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEE 388
Score = 28.5 bits (64), Expect = 6.5
Identities = 13/37 (35%), Positives = 20/37 (54%)
Query: 197 EDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
+ I+ E E E EE+ ++ EEEE E+ E+ S
Sbjct: 363 DPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQ 399
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 30.0 bits (67), Expect = 1.8
Identities = 18/55 (32%), Positives = 25/55 (45%)
Query: 198 DIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPA 252
+ K A E ES+ +EE EE++EEEE E E ++ T G S
Sbjct: 124 NAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGG 178
Score = 29.3 bits (65), Expect = 3.2
Identities = 16/31 (51%), Positives = 21/31 (67%)
Query: 201 KTAWGELESESEEESEEDDEEEEEEEEGEDE 231
K ++ E+ES+ED+EEEEEEEE E E
Sbjct: 121 KAGNAGKKATKEDESDEDEEEEEEEEEEEAE 151
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 30.5 bits (70), Expect = 1.9
Identities = 11/38 (28%), Positives = 17/38 (44%)
Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
D EE ++D++EEEEE+E +D
Sbjct: 172 GFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSL 209
Score = 29.8 bits (68), Expect = 3.0
Identities = 7/26 (26%), Positives = 16/26 (61%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
E + SE ++ +++E+EE E++
Sbjct: 180 EDPAHVGSELEELDDDEDEEEEEDEN 205
Score = 29.4 bits (67), Expect = 3.8
Identities = 13/41 (31%), Positives = 22/41 (53%)
Query: 192 AEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
+ N E+ ELE ++E EE++E+E ++ DES
Sbjct: 174 VDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADES 214
Score = 29.4 bits (67), Expect = 4.2
Identities = 11/42 (26%), Positives = 21/42 (50%)
Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITP 237
+ + + SE EE +++DEEEEE+E + + +
Sbjct: 174 VDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESE 215
Score = 28.6 bits (65), Expect = 7.7
Identities = 10/37 (27%), Positives = 17/37 (45%)
Query: 198 DIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGL 234
D E + E+ +++E+EEE EDE+
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDD 207
>gnl|CDD|178439 PLN02847, PLN02847, triacylglycerol lipase.
Length = 633
Score = 30.2 bits (68), Expect = 2.0
Identities = 29/96 (30%), Positives = 40/96 (41%), Gaps = 18/96 (18%)
Query: 185 TSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTP 244
TS+ + V E K+ + ES S + S DDEEEEE ED +T
Sbjct: 438 TSSVNSESLVTEVKTTKSVEHKSESSSSDGSGHDDEEEEEPLLSEDRV---------ITS 488
Query: 245 SGITSVPAG-----LETPDSIELRKKKIESDMETNE 275
S V G LE EL++++ E D + E
Sbjct: 489 SVEEEVTEGELWYELEK----ELQRQETEVDAQAQE 520
Score = 29.8 bits (67), Expect = 2.9
Identities = 21/64 (32%), Positives = 30/64 (46%), Gaps = 6/64 (9%)
Query: 188 ETKTAEVNEEDIEKTAWGELESE-SEEESEEDDEEEEEE-----EEGEDESGLITPAEGL 241
T+ V EE E W ELE E +E+E D + +EEE E E+E+ L E
Sbjct: 484 RVITSSVEEEVTEGELWYELEKELQRQETEVDAQAQEEEAAAAKEITEEENVLAKAVESK 543
Query: 242 MTPS 245
+ +
Sbjct: 544 TSIT 547
>gnl|CDD|129574 TIGR00483, EF-1_alpha, translation elongation factor EF-1 alpha.
This model represents the counterpart of bacterial EF-Tu
for the Archaea (aEF-1 alpha) and Eukaryotes (eEF-1
alpha). The trusted cutoff is set fairly high so that
incomplete sequences will score between suggested and
trusted cutoff levels [Protein synthesis, Translation
factors].
Length = 426
Score = 30.2 bits (68), Expect = 2.0
Identities = 26/71 (36%), Positives = 33/71 (46%), Gaps = 10/71 (14%)
Query: 146 KPPVDETGKPL---YGDVFGTGGWGKPPVD--ETGKPLYGDVF-----GTSAETKTAEVN 195
+PP T KPL DV+ G G PV ETG GD G S E K+ E++
Sbjct: 218 EPPEKPTDKPLRIPIQDVYSITGVGTVPVGRVETGVLKPGDKVVFEPAGVSGEVKSIEMH 277
Query: 196 EEDIEKTAWGE 206
E IE+ G+
Sbjct: 278 HEQIEQAEPGD 288
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 30.1 bits (67), Expect = 2.2
Identities = 17/54 (31%), Positives = 28/54 (51%), Gaps = 2/54 (3%)
Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGEL--ESESEEESEEDDEEEEEEEEGEDES 232
D+ +AE + EV+E D E+ E+ EE + +E EE+E +DE+
Sbjct: 276 DLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDEN 329
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 30.1 bits (68), Expect = 2.2
Identities = 16/82 (19%), Positives = 35/82 (42%), Gaps = 6/82 (7%)
Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGI 247
T+ ++ +K + +SE++ ++ E++E + EGE + G TP + T
Sbjct: 49 VATTSTKKDKKEDKNNE--SKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKT---- 102
Query: 248 TSVPAGLETPDSIELRKKKIES 269
P + + ++ KI
Sbjct: 103 KKKPPKPKPNEDVDNAFNKIAE 124
>gnl|CDD|238836 cd01673, dNK, Deoxyribonucleoside kinase (dNK) catalyzes the
phosphorylation of deoxyribonucleosides to yield
corresponding monophosphates (dNMPs). This family
consists of various deoxynucleoside kinases including
deoxyribo- cytidine (EC 2.7.1.74), guanosine (EC
2.7.1.113), adenosine (EC 2.7.1.76), and thymidine (EC
2.7.1.21) kinases. They are key enzymes in the salvage
of deoxyribonucleosides originating from extra- or
intracellular breakdown of DNA.
Length = 193
Score = 29.5 bits (67), Expect = 2.4
Identities = 12/32 (37%), Positives = 21/32 (65%), Gaps = 1/32 (3%)
Query: 15 TLKSKMRERVRPKLGKIDIDY-QKLHDAFFKW 45
T ++++R RP+ I +DY + LH+A+ KW
Sbjct: 137 TCLKRIKKRGRPEEQGIPLDYLEDLHEAYEKW 168
>gnl|CDD|218397 pfam05044, Prox1, Homeobox prospero-like protein (PROX1). The
homeobox gene Prox1 is expressed in a subpopulation of
endothelial cells that, after budding from veins, gives
rise to the mammalian lymphatic system. Prox1 has been
found to be an early specific marker for the developing
liver and pancreas in the mammalian foregut endoderm.
This family contains an atypical homeobox domain.
Length = 908
Score = 30.0 bits (67), Expect = 2.6
Identities = 36/182 (19%), Positives = 57/182 (31%), Gaps = 22/182 (12%)
Query: 220 EEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDSIELRKKKIESDMETNEAPQL 279
+ + G EG S PA + E+ ++ I S+ + P+
Sbjct: 130 ISQLLKNNMAKNGGGPDRGEGDRGDREAPSFPASALSNTKPEVPQEDICSNSSRDSPPE- 188
Query: 280 FQVLTEKRPDKLGPGMMASTH--------------MYEVPGAGASAPGAPV----KRTLP 321
+ RP + H M P AG S A V KR L
Sbjct: 189 -CMSPFGRPPSSSTERLNDEHMRAKRARVENIVRGMSHSPSAGISEGQAQVNGCKKRKLY 247
Query: 322 SSVELALDPSELDLVDTDAMAARYEQQMREQTSGLQKEDLSDMLAEHVARQKNKRKRQQT 381
+ P+E + +R + R+Q Q++ D L QK + +RQ
Sbjct: 248 QPQQHETSPAESYGAAAAGLNSR--ENKRKQMLPQQQQQSFDQLVSPRKEQKREERRQLK 305
Query: 382 TQ 383
Q
Sbjct: 306 QQ 307
>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
(DUF2052). This entry is of sequences of two conserved
domains separated by a region of low complexity,
spanning some 200 residues. The function is unknown.
Length = 178
Score = 28.9 bits (65), Expect = 2.8
Identities = 22/73 (30%), Positives = 33/73 (45%), Gaps = 1/73 (1%)
Query: 200 EKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDS 259
E+ LE + EEES E++EE+ EEE E+E G L TP+
Sbjct: 67 EEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEE-DGDSYDELPTPEE 125
Query: 260 IELRKKKIESDME 272
E +++ S+M
Sbjct: 126 REELREEFISEMH 138
>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex. This Mediator
complex subunit was formerly known as Srb4 in yeasts or
Trap80 in Drosophila and human. The Med17 subunit is
located within the head domain and is essential for cell
viability to the extent that a mutant strain of
cerevisiae lacking it shows all RNA polymerase
II-dependent transcription ceasing at non-permissive
temperatures.
Length = 454
Score = 29.6 bits (67), Expect = 3.1
Identities = 11/48 (22%), Positives = 24/48 (50%), Gaps = 8/48 (16%)
Query: 192 AEVNEEDIE--------KTAWGELESESEEESEEDDEEEEEEEEGEDE 231
++ EE + K + E E E++ E+D+++ EE ++ +E
Sbjct: 47 RDLTEESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEE 94
>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
Length = 499
Score = 29.6 bits (66), Expect = 3.2
Identities = 21/74 (28%), Positives = 35/74 (47%), Gaps = 10/74 (13%)
Query: 179 YGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEE----DDEEEEEEEEGEDESGL 234
+G + G ++ET+ D+ + E E+EEE + D E EEE+EE E E+
Sbjct: 58 HGVIDGENSETERPR----DLTAALFAEAGEEAEEEDNDRECPDTEAEEEDEEEEIEAP- 112
Query: 235 ITPAEGLMTPSGIT 248
P + G++
Sbjct: 113 -DPEVNPLDAEGLS 125
>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
Length = 310
Score = 29.2 bits (66), Expect = 3.3
Identities = 9/26 (34%), Positives = 13/26 (50%)
Query: 209 SESEEESEEDDEEEEEEEEGEDESGL 234
+ + + EEEEEEE + GL
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDDMGFGL 308
>gnl|CDD|220777 pfam10486, PI3K_1B_p101, Phosphoinositide 3-kinase gamma adapter
protein p101 subunit. Class I PI3Ks are dual-specific
lipid and protein kinases involved in numerous
intracellular signaling pathways. Class IB PI3K,
p110gamma, is mainly activated by seven-transmembrane
G-protein-coupled receptors (GPCRs), through its
regulatory subunit p101 and G-protein beta-gamma
subunits.
Length = 856
Score = 29.6 bits (66), Expect = 3.4
Identities = 17/49 (34%), Positives = 25/49 (51%)
Query: 212 EEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDSI 260
E ED+E EEEEEE ++ GL + L++ S + S + DS
Sbjct: 309 EPVLSEDEEVEEEEEEDDETDGLSPERDSLLSNSSVYSNDSSDSKEDSS 357
>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region. The precise
function of this eukaryotic protein family is unknown.
The yeast orthologues have been implicated in cell cycle
progression and biogenesis of 60S ribosomal subunits.
The Schistosoma mansoni Mak16 has been shown to target
protein transport to the nucleolus.
Length = 97
Score = 27.9 bits (62), Expect = 3.5
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDE 231
E E EEE EE++EE+E E E +
Sbjct: 59 ESEENDEEEEEEEEEEDEGEIEYVSD 84
Score = 27.1 bits (60), Expect = 5.4
Identities = 12/42 (28%), Positives = 20/42 (47%)
Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGL 234
+ E+ E+ E E E EE+ E + ++EE E+ L
Sbjct: 54 ALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDL 95
>gnl|CDD|219009 pfam06404, PSK, Phytosulfokine precursor protein (PSK). This
family consists of several plant specific phytosulfokine
precursor proteins. Phytosulfokines, are active as
either a pentapeptide or a C-terminally truncated
tetrapeptide. These compounds were first isolated
because of their ability to stimulate cell division in
somatic embryo cultures of Asparagus officinalis.
Length = 74
Score = 27.3 bits (61), Expect = 3.6
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 205 GELESESEEESEEDDEEEEEEEEGEDE 231
++E+ + +ED+ E E EGE+E
Sbjct: 27 SKIEAAESSKVDEDEVVSECEGEGEEE 53
>gnl|CDD|235030 PRK02315, PRK02315, adaptor protein; Provisional.
Length = 233
Score = 29.0 bits (66), Expect = 3.9
Identities = 10/52 (19%), Positives = 25/52 (48%)
Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
+VF T ++ E+ I+ + ++E E+ + E+ E+++ D +
Sbjct: 72 EVFVTKSKLDLPLNFEDLIDLPSDEDIEDLFPEDFFKLLEQSLEDKDDLDAT 123
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 28.9 bits (65), Expect = 4.2
Identities = 7/35 (20%), Positives = 16/35 (45%)
Query: 192 AEVNEEDIEKTAWGELESESEEESEEDDEEEEEEE 226
+ D + + S+++S++DD E+E
Sbjct: 112 EDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAA 146
>gnl|CDD|206170 pfam14000, Packaging_FI, DNA packaging protein FI. This family
includes the lambda phage DNA-packaging protein FI.
Proteins in this family are typically between 124 and
140 amino acids in length. There is a conserved EEE
sequence motif.
Length = 125
Score = 28.1 bits (63), Expect = 4.2
Identities = 16/60 (26%), Positives = 26/60 (43%), Gaps = 1/60 (1%)
Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLE 255
E D+ +A EL E EE D+ E + DES L + +++ G T + +
Sbjct: 19 EPDVSGSA-AELAQRVAEWEEELDQVSEADTASSDESALTGHEDEVVSDQGETVIGDEGD 77
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 27.2 bits (61), Expect = 4.2
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
+ + +EE+ +EEEEEEE +D+ G
Sbjct: 59 AAAAAAAAEEEKKEEEEEEEEDDDMG 84
>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
[Transcription].
Length = 175
Score = 28.6 bits (64), Expect = 4.4
Identities = 14/53 (26%), Positives = 24/53 (45%)
Query: 188 ETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
E A ++DI+ E +E E D +E++EEEE ++ L +
Sbjct: 89 EEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDD 141
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 28.7 bits (64), Expect = 4.5
Identities = 15/32 (46%), Positives = 19/32 (59%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDESGLITP 237
E E ES EE EE+ EEEEE + +D + P
Sbjct: 8 EEEDESGEEEEEESEEEEETDSEDDMEPRLKP 39
>gnl|CDD|234493 TIGR04189, surface_SprA, cell surface protein SprA. SprA is a cell
surface protein widely distributed in the Bacteroidetes
lineage. In Flavobacterium johnsoniae, a species that
shows gliding motility, mutation disrupts gliding.
Length = 2315
Score = 29.3 bits (66), Expect = 4.6
Identities = 25/137 (18%), Positives = 41/137 (29%), Gaps = 13/137 (9%)
Query: 230 DESGLITPAEGLMTPSGITSVPAGLETPDSIELRKKKIESDMETNEAPQLFQVLTEKRP- 288
++ G + + P + PD + K I +F+ LT R
Sbjct: 1816 NKLGFLKQRSQDNQSARKKPDPRTEKPPDKPKQNKTIIPGP-RPKILSAVFRFLTMLRNI 1874
Query: 289 --------DKLGPGMMASTHMYEVPGAGASAPGAPVKRTLPSSVELALDPSELDLVDTDA 340
+ PG + S + V G APG L S L +
Sbjct: 1875 DVNYSEQKGTVLPGFLPSVKFFGVDVLGGLAPGWGFVFGLQSDDFRDEAARNGWLTTFQS 1934
Query: 341 MAARYEQQMREQTSGLQ 357
++Y Q+ + LQ
Sbjct: 1935 FNSQYTQR---FSQNLQ 1948
>gnl|CDD|183731 PRK12766, PRK12766, 50S ribosomal protein L32e; Provisional.
Length = 232
Score = 28.7 bits (64), Expect = 4.8
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 205 GELESESEEESEEDDEEEEEEEEGEDE 231
G LE E E+E +DE EEEEE + +
Sbjct: 57 GGLEVSEETEAEVEDEGGEEEEEEDAD 83
>gnl|CDD|239523 cd03431, DNA_Glycosylase_C, DNA glycosylase (MutY in bacteria and
hMYH in humans) is responsible for repairing misread
A*oxoG residues to C*G by removing the inappropriately
paired adenine base from the DNA backbone. It belongs to
the Nudix hydrolase superfamily and is important for the
repair of various genotoxic lesions. Enzymes belonging
to this superfamily requires a divalent cation, such as
Mg2+ or Mn2+ for their activity. They are also
recognized by a highly conserved 23-residue nudix motif
(GX5EX7REUXEEXGU, where U = I, L or V). However, DNA
glycosylase does not seem to contain this signature
motif. DNA glycosylase consists of 2 domains: the
N-terminal domain contains the catalytic properties of
the enzyme and the C-terminal domain affects substrate
(oxoG) binding and enzymatic turnover. The C-terminal
domain is highly similar to MutT, based on secondary
structure and topology, despite low sequence identity.
MutT sanitizes the nucleotide precursor pool by
hydrolyzing oxo-dGTP to oxo-dGMO and inorganic
pyrophosphate. The similarity strongly suggests that the
two proteins share a common evolutionary origin.
Length = 118
Score = 27.7 bits (62), Expect = 5.0
Identities = 11/35 (31%), Positives = 17/35 (48%), Gaps = 5/35 (14%)
Query: 281 QVLTEKRPDKLGPGMMASTHMYEVPGAGASAPGAP 315
+VL EKRP+K G++A ++E P
Sbjct: 15 RVLLEKRPEK---GLLAG--LWEFPSVEWEEEADG 44
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 28.5 bits (64), Expect = 5.1
Identities = 9/24 (37%), Positives = 16/24 (66%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDE 231
+SE +E +D+EE E+E + E+
Sbjct: 60 DSEDDEPESDDEEEGEKELQREER 83
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 29.0 bits (65), Expect = 5.1
Identities = 13/26 (50%), Positives = 17/26 (65%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
E + EE+SEE EE +EE E+E G
Sbjct: 284 EEKDEEKSEEVKTEEVDEEFEEEEKG 309
Score = 28.3 bits (63), Expect = 10.0
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDE 231
E + E +EE E+ + EE +EE E+E
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFEEE 306
>gnl|CDD|206355 pfam14187, DUF4310, Domain of unknown function (DUF4310). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 214 and 231 amino
acids in length.
Length = 210
Score = 28.5 bits (64), Expect = 5.2
Identities = 11/19 (57%), Positives = 12/19 (63%)
Query: 291 LGPGMMASTHMYEVPGAGA 309
L G+ A THMY V G GA
Sbjct: 20 LSAGIFAGTHMYVVYGVGA 38
>gnl|CDD|220098 pfam09057, Smac_DIABLO, Second Mitochondria-derived Activator of
Caspases. Second Mitochondria-derived Activator of
Caspases promotes apoptosis by activating caspases in
the cytochrome c/Apaf-1/caspase-9 pathway, and by
opposing the inhibitory activity of inhibitor of
apoptosis proteins (XIAP-BIR3). The protein assumes an
elongated three-helix bundle structure, and forms a
dimer in solution.
Length = 234
Score = 28.7 bits (64), Expect = 5.2
Identities = 12/44 (27%), Positives = 18/44 (40%), Gaps = 5/44 (11%)
Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGED 230
AE K AE E+I++ A + + E+ E ED
Sbjct: 196 AEKKLAESKAEEIQRMA-----EYASSIDLSELEDIPEAYLRED 234
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 28.8 bits (64), Expect = 5.3
Identities = 25/129 (19%), Positives = 45/129 (34%), Gaps = 10/129 (7%)
Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGL 254
N+ + + E +++D EE EEE+ E+E GL + L G +
Sbjct: 295 NDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354
Query: 255 ETPDSIELRKKKIESDMETNEAPQLFQVLTEKRPDKLGPGMMASTHMYEVPGAGASAPGA 314
++ +SD++ ++ L +K P K P + G S P
Sbjct: 355 DSDSG----DDSDDSDIDGEDSVSLVTAKKQKEPKKEEPVDSNPS------SPGNSGPAR 404
Query: 315 PVKRTLPSS 323
P +
Sbjct: 405 PSPESKDKG 413
>gnl|CDD|177537 PHA03134, PHA03134, thymidine kinase; Provisional.
Length = 340
Score = 28.6 bits (64), Expect = 5.3
Identities = 13/37 (35%), Positives = 21/37 (56%)
Query: 335 LVDTDAMAARYEQQMREQTSGLQKEDLSDMLAEHVAR 371
L +TD ++ Y+ Q R+Q L ED + + A + AR
Sbjct: 56 LFETDVVSGIYDAQNRKQQGSLAAEDAAGITAHYQAR 92
>gnl|CDD|235785 PRK06365, PRK06365, acetyl-CoA acetyltransferase; Provisional.
Length = 430
Score = 28.7 bits (64), Expect = 5.3
Identities = 22/67 (32%), Positives = 29/67 (43%), Gaps = 11/67 (16%)
Query: 32 DIDYQKLHDAFFKWQTKPRLSIHGDL----YYEGKEFETKLKEKKPGDLSAELRTAL--- 84
D+D +LHDA+ T + + DL Y EG +F K + PG L L
Sbjct: 314 DLDLIELHDAY----TSSEIQTYEDLGLCKYGEGGQFIESGKPELPGKLPVNPSGGLLAA 369
Query: 85 GMPVGPT 91
G VG T
Sbjct: 370 GHAVGAT 376
>gnl|CDD|215628 PLN03196, PLN03196, MOC1-like protein; Provisional.
Length = 487
Score = 28.9 bits (65), Expect = 5.4
Identities = 9/46 (19%), Positives = 21/46 (45%)
Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEE 226
D F E E + + G+LE E +++++++++E
Sbjct: 433 DKFEQRMSGDFIEGEEMEPLFSMGGKLEMPGSESVSDEEDDDDDDE 478
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 28.6 bits (64), Expect = 5.5
Identities = 7/28 (25%), Positives = 14/28 (50%), Gaps = 3/28 (10%)
Query: 208 ESESEEESEEDDEEEEEEE---EGEDES 232
+ E + ++DDE+ E + +D S
Sbjct: 250 DDEGDGSDDDDDEDAIESDLDDSDDDVS 277
>gnl|CDD|182789 PRK10863, PRK10863, anti-RNA polymerase sigma factor SigE;
Provisional.
Length = 216
Score = 28.5 bits (64), Expect = 5.5
Identities = 18/56 (32%), Positives = 23/56 (41%), Gaps = 12/56 (21%)
Query: 302 YEVPGAGASAPGAPVKRTLP-----SSVELALDPSELDLVDTDAMAARYEQQMREQ 352
Y + P PV TLP S V L + PSE A +QQ++EQ
Sbjct: 121 YNGQSETSQQPETPVFNTLPMMGKASPVSLGV-PSE------AAAGNGQQQQVQEQ 169
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 28.6 bits (64), Expect = 5.6
Identities = 11/38 (28%), Positives = 21/38 (55%)
Query: 187 AETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEE 224
E + AE + D ++ +E + E +EE+++EE E
Sbjct: 35 TEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72
Score = 28.2 bits (63), Expect = 6.6
Identities = 10/32 (31%), Positives = 19/32 (59%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDESGLITP 237
++ + E+E+ +++E E EE EDE G +
Sbjct: 44 VVDEDEEDEAVVEEDENELTEEEEDEEGEVKA 75
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 28.7 bits (64), Expect = 5.8
Identities = 16/57 (28%), Positives = 23/57 (40%), Gaps = 6/57 (10%)
Query: 178 LYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEE---EEEEEEGEDE 231
Y D+F + E + E T + ESE E E +E E E E E + +
Sbjct: 324 YYEDIFEVVEVVEKQEGDVVTEEST---DEESEDEVEIDESVIEEVAEMELLEVQVD 377
>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
Reviewed.
Length = 129
Score = 27.7 bits (62), Expect = 6.2
Identities = 12/37 (32%), Positives = 23/37 (62%)
Query: 190 KTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEE 226
K + +++ I+ + + + E+ EEDDE+EE+EE
Sbjct: 93 KFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129
Score = 27.3 bits (61), Expect = 7.2
Identities = 16/58 (27%), Positives = 28/58 (48%)
Query: 170 PVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEE 227
PVDE + + + K ++DI + EE+ +E+D+E+EE+EE
Sbjct: 72 PVDEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129
>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295). This
family is conserved in fungi but the function is not
known.
Length = 509
Score = 28.8 bits (64), Expect = 6.3
Identities = 23/90 (25%), Positives = 37/90 (41%), Gaps = 2/90 (2%)
Query: 154 KPLYGDVFGTGGWGKPPVDETGKPLYGDVFGTSAETKT-AEVNEEDIEKTAWGELESESE 212
KP + G+ GK F T+T E +D E E E + +
Sbjct: 241 KPKMFQLGGSDELGKSLPSLMSPRKKTASFKEQVVTRTFPERTSDDDEDAIETE-EDDVD 299
Query: 213 EESEEDDEEEEEEEEGEDESGLITPAEGLM 242
E + EDD+++ + E+ +ESG + E M
Sbjct: 300 ESAIEDDDDDSDWEDSVEESGRSSVDEKTM 329
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 29.0 bits (64), Expect = 6.3
Identities = 14/34 (41%), Positives = 22/34 (64%)
Query: 198 DIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
D+EK + ++ SE+ES ED+EEE + E+E
Sbjct: 418 DLEKQKYMDMLDGSEDESVEDNEEEHSGDANEEE 451
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 27.7 bits (62), Expect = 6.3
Identities = 12/45 (26%), Positives = 19/45 (42%)
Query: 192 AEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLIT 236
E ++E+ +T E + E E + + EEE DE T
Sbjct: 81 EENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYIDEDKYKT 125
>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
Length = 869
Score = 28.8 bits (65), Expect = 6.4
Identities = 13/28 (46%), Positives = 15/28 (53%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDESG 233
ELE EE EE+ EE EE + ED
Sbjct: 827 ELEPLPYEEPEEETAEEPEEVDPEDVKS 854
>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein. This
family consists of REP proteins from Dictyostelium
(Slime molds). REP protein is likely involved in
transcription regulation and control of DNA replication,
specifically amplification of plasmid at low copy
numbers. The formation of homomultimers may be required
for their regulatory activity.
Length = 910
Score = 28.6 bits (64), Expect = 6.6
Identities = 14/36 (38%), Positives = 24/36 (66%), Gaps = 2/36 (5%)
Query: 196 EEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDE 231
EE + + + E E + +E E+ DE+E+E+E+ EDE
Sbjct: 877 EEYL-ISQFQENEDDDADEDEDQDEDEDEDED-EDE 910
>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
Provisional.
Length = 719
Score = 28.6 bits (64), Expect = 6.7
Identities = 13/60 (21%), Positives = 27/60 (45%), Gaps = 4/60 (6%)
Query: 177 PLYGDVFGTSAETKTAEVNEEDIEKTAWGELESES----EEESEEDDEEEEEEEEGEDES 232
P+ ++ S +K + +E S + +E+S++D+E + E E E E+
Sbjct: 403 PVMVNMASASVSSKKNQKPQESSPSLDQTNAASRNSVMMDEDSDDDEEFQIPESEQEPET 462
>gnl|CDD|226077 COG3547, COG3547, Transposase and inactivated derivatives [DNA
replication, recombination, and repair].
Length = 303
Score = 28.4 bits (63), Expect = 6.7
Identities = 18/101 (17%), Positives = 33/101 (32%), Gaps = 6/101 (5%)
Query: 295 MMASTHMYEVPGAGASAPGAPVKRTLPSSVELALDPSELDLVDTDAMAARYEQQMREQTS 354
+ +T E A G PV+ P V+ DA+A +
Sbjct: 40 GIEATGYSENLARYLRAEGYPVRLINPRQVKKKARGKRAKTDKADALAIAVAALRPK--- 96
Query: 355 GLQKEDLSDMLAEHVARQKNKRKR--QQTTQDNKQAKKYKE 393
L+ + + K +R+R + TQ + + + E
Sbjct: 97 -LEPVSVPSEEQRALRELKRRRRRLLRLRTQLDNRIRGLLE 136
>gnl|CDD|181506 PRK08622, PRK08622, galactose-6-phosphate isomerase subunit LacB;
Reviewed.
Length = 171
Score = 27.7 bits (62), Expect = 6.9
Identities = 17/51 (33%), Positives = 26/51 (50%), Gaps = 5/51 (9%)
Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEED--DEEEEEEEEGE 229
D F +AE K E N++ IEK LE + +++ DE E+ + GE
Sbjct: 121 DAF-INAEYKPTEENKKLIEKID--HLEKPNADQANPHFFDEFLEKWDRGE 168
>gnl|CDD|215313 PLN02575, PLN02575, haloacid dehalogenase-like hydrolase.
Length = 381
Score = 28.3 bits (63), Expect = 6.9
Identities = 13/26 (50%), Positives = 14/26 (53%)
Query: 195 NEEDIEKTAWGELESESEEESEEDDE 220
N DIE +G E E E E EED E
Sbjct: 344 NLADIESPEFGPPEPELEMEKEEDRE 369
>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
Length = 613
Score = 28.7 bits (65), Expect = 6.9
Identities = 10/26 (38%), Positives = 14/26 (53%)
Query: 210 ESEEESEEDDEEEEEEEEGEDESGLI 235
E S+ +DE E+ + G D S LI
Sbjct: 21 AEAECSDGEDELEDLFDTGSDISDLI 46
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 27.6 bits (62), Expect = 7.0
Identities = 14/27 (51%), Positives = 16/27 (59%)
Query: 206 ELESESEEESEEDDEEEEEEEEGEDES 232
E E E E E EE DEEE+ +E E E
Sbjct: 102 EEEEEEEVEVEELDEEEQIDELLEKEL 128
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 28.2 bits (63), Expect = 7.2
Identities = 12/39 (30%), Positives = 18/39 (46%)
Query: 195 NEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESG 233
E++ + E S+ SE+ D EE E GE E+
Sbjct: 231 KEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAA 269
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 28.4 bits (63), Expect = 7.5
Identities = 12/40 (30%), Positives = 27/40 (67%)
Query: 193 EVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
E EE +E+ E + + +++ +EDD++++++E+ EDE
Sbjct: 44 EAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDED 83
>gnl|CDD|216868 pfam02084, Bindin, Bindin.
Length = 239
Score = 27.9 bits (62), Expect = 7.9
Identities = 27/96 (28%), Positives = 32/96 (33%), Gaps = 10/96 (10%)
Query: 113 YGPPPSYPNLKIP---GLNAPIP-QGCSFGYHAGGWGKP------PVDETGKPLYGDVFG 162
YG +YP P G N P P Q GY G G P P G P G
Sbjct: 1 YGNMQNYPQAMNPQMGGGNYPAPGQPAQQGYANQGMGGPVGGGGGPGAGGGAPGGPVGGG 60
Query: 163 TGGWGKPPVDETGKPLYGDVFGTSAETKTAEVNEED 198
GG G PP D + ++ + E D
Sbjct: 61 GGGSGGPPGGGEVAGEAEDAMSEFDDYSSSSIEEGD 96
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 27.7 bits (62), Expect = 8.4
Identities = 18/58 (31%), Positives = 25/58 (43%), Gaps = 6/58 (10%)
Query: 188 ETKTAEVNEEDIEKTAWG----ELESESEEESEED--DEEEEEEEEGEDESGLITPAE 239
+ EE + +A E +S S E S D EEEEEE E+E + P +
Sbjct: 124 SESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIAEEEEEVKEPED 181
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 28.0 bits (62), Expect = 9.1
Identities = 12/26 (46%), Positives = 14/26 (53%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESG 233
E E E + EEEE+EE G D G
Sbjct: 177 EEEERLEESDGREEEEDEEVGSDSYG 202
>gnl|CDD|235378 PRK05256, PRK05256, condesin subunit E; Provisional.
Length = 238
Score = 27.6 bits (62), Expect = 9.1
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 207 LESESEEESEEDDEEEEEEEEGEDE 231
+E+ ++ES +DE EEE+ + E+E
Sbjct: 213 IENHLQQESSLNDETEEEQPDEEEE 237
>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 227 and
387 amino acids in length.
Length = 213
Score = 27.7 bits (62), Expect = 9.2
Identities = 15/39 (38%), Positives = 18/39 (46%), Gaps = 1/39 (2%)
Query: 205 GELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMT 243
GE E E + D E E E+E G +TP EG T
Sbjct: 25 GEPEPEVPQTEATDPSEVVVETVTEEEGGPLTP-EGNAT 62
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 28.0 bits (62), Expect = 9.3
Identities = 9/35 (25%), Positives = 18/35 (51%)
Query: 198 DIEKTAWGELESESEEESEEDDEEEEEEEEGEDES 232
D + +G + E S D +EEE+ ++ ++E
Sbjct: 263 DFKGVNYGYESDDDEGSSSNDYDEEEDGDDDDNED 297
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 28.1 bits (62), Expect = 9.7
Identities = 15/52 (28%), Positives = 27/52 (51%)
Query: 208 ESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDS 259
+++SEE+ ++ +EE+ + +D G G+ T SG T+ L T S
Sbjct: 113 KTDSEEDDDDSEEEDNKSTSSKDGKGSKKTQPGVSTSSGSTTSGTDLNTKQS 164
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 28.0 bits (62), Expect = 10.0
Identities = 14/76 (18%), Positives = 22/76 (28%)
Query: 162 GTGGWGKPPVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEE 221
+P D G P + + E T +++ E +DE
Sbjct: 178 PEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEP 237
Query: 222 EEEEEEGEDESGLITP 237
E E EG G +
Sbjct: 238 TEPEREGPPFPGHRSH 253
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.308 0.130 0.376
Gapped
Lambda K H
0.267 0.0685 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,668,806
Number of extensions: 2059186
Number of successful extensions: 7490
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5337
Number of HSP's successfully gapped: 567
Length of query: 396
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 297
Effective length of database: 6,546,556
Effective search space: 1944327132
Effective search space used: 1944327132
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 60 (27.0 bits)