RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9536
(287 letters)
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 37.8 bits (88), Expect = 0.005
Identities = 17/49 (34%), Positives = 30/49 (61%)
Query: 236 EREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
E+EV + E +EE +E+ E E+ + +E+E ++KKKK KK ++
Sbjct: 32 EKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80
Score = 34.7 bits (80), Expect = 0.049
Identities = 22/59 (37%), Positives = 36/59 (61%), Gaps = 9/59 (15%)
Query: 233 VWGEREVELDLEWIRREEDEEKTETRKKDEDVLGK------EKENKKKKKKKKKKEKKN 285
+W E+EVE + + EE+EE+ E +K++E+ E+E K++KKKK KK K+
Sbjct: 25 LWVEKEVEKE---VPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 36.5 bits (85), Expect = 0.012
Identities = 13/31 (41%), Positives = 21/31 (67%)
Query: 256 ETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
+T K + VL K E K+K+K+K K +K++R
Sbjct: 387 KTGKPSKKVLAKRAEKKEKEKEKPKVKKRHR 417
Score = 32.6 bits (75), Expect = 0.21
Identities = 11/32 (34%), Positives = 19/32 (59%), Gaps = 4/32 (12%)
Query: 254 KTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
K ++ ++ KEKE +K K KK+ ++ KN
Sbjct: 394 KVLAKRAEK----KEKEKEKPKVKKRHRDTKN 421
Score = 31.5 bits (72), Expect = 0.48
Identities = 12/40 (30%), Positives = 20/40 (50%)
Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
+ E+KT K EK+ K+K+K K KK ++ +
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTK 420
Score = 28.4 bits (64), Expect = 4.8
Identities = 17/53 (32%), Positives = 30/53 (56%), Gaps = 4/53 (7%)
Query: 236 EREVELDLEWIRREEDEEKTETRKKDEDVLGK--EKENKKKKKKKKKKEKKNR 286
ER +E L+ R DE + +T+ E GK +K K+ +KK+K+++K +
Sbjct: 361 ERYIEEPLK--ARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPK 411
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 34.3 bits (79), Expect = 0.022
Identities = 22/55 (40%), Positives = 33/55 (60%), Gaps = 9/55 (16%)
Query: 236 EREVELDLEWIRREEDEE----KTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
ER +D +W + EDEE + E ++KDE EK KK+ K++KKK+KK +
Sbjct: 50 ERLELMDEKWKKETEDEEFQQKREEKKRKDE-----EKTAKKRAKRQKKKQKKKK 99
Score = 33.5 bits (77), Expect = 0.042
Identities = 17/38 (44%), Positives = 24/38 (63%), Gaps = 3/38 (7%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
++ +DEEKT ++ K+K+ KKKKKK KK KK
Sbjct: 75 KKRKDEEKTAKKRAKRQ---KKKQKKKKKKKAKKGNKK 109
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 33.8 bits (78), Expect = 0.066
Identities = 22/43 (51%), Positives = 28/43 (65%), Gaps = 2/43 (4%)
Query: 244 EWIRREEDEEKT--ETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
EW R+E EK E +KK E+ L K K++KK KKKKK K+K
Sbjct: 161 EWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKKG 203
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 33.2 bits (76), Expect = 0.082
Identities = 14/41 (34%), Positives = 25/41 (60%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
+ E EE+ + KK + + KEK+ KK KK+K + K +++
Sbjct: 145 KEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185
Score = 32.8 bits (75), Expect = 0.13
Identities = 16/38 (42%), Positives = 25/38 (65%), Gaps = 2/38 (5%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
+++++ +K + KKD+ K E K KKKKKKK+KK
Sbjct: 158 KKKKEVKKEKKEKKDKKE--KMVEPKGSKKKKKKKKKK 193
Score = 32.4 bits (74), Expect = 0.19
Identities = 18/42 (42%), Positives = 29/42 (69%), Gaps = 2/42 (4%)
Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKK--KKKKKEKKNRR 287
++E T +K+ +V +EK+ KKKKK KK+KKEKK+++
Sbjct: 133 ETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKK 174
Score = 30.1 bits (68), Expect = 1.1
Identities = 14/38 (36%), Positives = 20/38 (52%), Gaps = 1/38 (2%)
Query: 248 REEDEEKTETRKKDEDVL-GKEKENKKKKKKKKKKEKK 284
E +T ++ V E E ++KK+KKKKKE K
Sbjct: 127 ELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVK 164
Score = 28.1 bits (63), Expect = 4.0
Identities = 21/45 (46%), Positives = 29/45 (64%), Gaps = 8/45 (17%)
Query: 250 EDEEKTETRKKDEDVLGKEKENKKKKKK-------KKKKEKKNRR 287
E+EEK E +KK E V ++KE K KK+K KKKK+KK ++
Sbjct: 150 EEEEKKEKKKKKE-VKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 27.4 bits (61), Expect = 8.0
Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 2/40 (5%)
Query: 248 REEDEEKTETRKKDEDVLGKEKENKKK-KKKKK-KKEKKN 285
+E K E + E+ KEK+ KK+ KK+KK KK+KK
Sbjct: 136 EKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKE 175
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 33.1 bits (76), Expect = 0.14
Identities = 10/38 (26%), Positives = 20/38 (52%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
+DE E + + + + K K+K K ++ KEK+ +
Sbjct: 253 SDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRK 290
Score = 28.9 bits (65), Expect = 2.9
Identities = 19/49 (38%), Positives = 22/49 (44%), Gaps = 5/49 (10%)
Query: 238 EVELDLEWIRREEDEEKTETRKKDE-----DVLGKEKENKKKKKKKKKK 281
+VE LE R E K DE D G E KKK++KK KK
Sbjct: 16 DVEQFLEEQRDERITGGVLAEKPDEDLFVVDTTGDEATIKKKRRKKLKK 64
Score = 27.4 bits (61), Expect = 9.6
Identities = 12/50 (24%), Positives = 24/50 (48%), Gaps = 3/50 (6%)
Query: 235 GEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
GE E + + W E + E + + K K + K+K++K+ E++
Sbjct: 249 GEEESDDESAWEGFESEYEPINKPVRPKR---KTKAQRNKEKRRKELERE 295
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 33.0 bits (76), Expect = 0.16
Identities = 14/30 (46%), Positives = 22/30 (73%)
Query: 258 RKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
+K++E K K+ KK+KKK KK++KK R+
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 31.5 bits (72), Expect = 0.56
Identities = 14/31 (45%), Positives = 21/31 (67%)
Query: 257 TRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
+KK E+ ++++ KKK+KKK KK KK R
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 28.0 bits (63), Expect = 6.5
Identities = 15/37 (40%), Positives = 23/37 (62%), Gaps = 6/37 (16%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEK 283
+++ +E+K + RKK K+K KK KK+KKK K
Sbjct: 384 KKKREEKKPQKRKK------KKKRKKKGKKRKKKGRK 414
Score = 27.6 bits (62), Expect = 7.6
Identities = 12/31 (38%), Positives = 19/31 (61%), Gaps = 2/31 (6%)
Query: 254 KTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
K KK + K+K+ +KKK KK+KK+ +
Sbjct: 385 KKREEKKPQK--RKKKKKRKKKGKKRKKKGR 413
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 33.1 bits (75), Expect = 0.18
Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 10/56 (17%)
Query: 241 LDLEWIRREEDEEK--------TETRK--KDEDVLGKEKENKKKKKKKKKKEKKNR 286
LD++ + D EK ET K + DV EK++KK KKK+KK+++K R
Sbjct: 159 LDIDLDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKER 214
Score = 31.2 bits (70), Expect = 0.75
Identities = 21/78 (26%), Positives = 35/78 (44%)
Query: 210 KTAIELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRREEDEEKTETRKKDEDVLGKEK 269
K +E K L ++ A V E L D E E + + + K
Sbjct: 219 KKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSP 278
Query: 270 ENKKKKKKKKKKEKKNRR 287
++KKKK++K+K+EKK ++
Sbjct: 279 KHKKKKQRKEKEEKKKKK 296
Score = 28.9 bits (64), Expect = 4.1
Identities = 12/30 (40%), Positives = 22/30 (73%), Gaps = 1/30 (3%)
Query: 259 KKDEDVLGKEK-ENKKKKKKKKKKEKKNRR 287
K +E+ +++ E K++KKK++KEK+ RR
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKRGRR 112
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The function
is unclear.
Length = 151
Score = 31.9 bits (72), Expect = 0.21
Identities = 16/39 (41%), Positives = 27/39 (69%), Gaps = 2/39 (5%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
++ ++ KT+++K D+ KEK ++KK K KKK+EK N
Sbjct: 12 KQRKESGKTQSQKSDKK--KKEKVSEKKGKSKKKEEKPN 48
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 31.7 bits (73), Expect = 0.22
Identities = 10/19 (52%), Positives = 11/19 (57%)
Query: 268 EKENKKKKKKKKKKEKKNR 286
EK KK +K K K EKK
Sbjct: 72 EKAEKKAEKAKAKAEKKKA 90
Score = 28.3 bits (64), Expect = 3.3
Identities = 12/35 (34%), Positives = 18/35 (51%)
Query: 252 EEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
EE + ++ E L +KE K +K +KK EK
Sbjct: 49 EEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKA 83
Score = 27.1 bits (61), Expect = 7.9
Identities = 10/21 (47%), Positives = 12/21 (57%)
Query: 267 KEKENKKKKKKKKKKEKKNRR 287
EK K +KKK KKE+ R
Sbjct: 78 AEKAKAKAEKKKAKKEEPKPR 98
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 31.6 bits (72), Expect = 0.26
Identities = 13/49 (26%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 238 EVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
E E +W +++ +K + + KD+ +K++K +KK +K+ E K
Sbjct: 77 EYEEKQKWKWKKKKSKKKKDKDKDKK--DDKKDDKSEKKDEKEAEDKLE 123
Score = 29.7 bits (67), Expect = 1.1
Identities = 18/41 (43%), Positives = 24/41 (58%), Gaps = 6/41 (14%)
Query: 251 DEEKTETRKKDEDV------LGKEKENKKKKKKKKKKEKKN 285
D E TE +KK +++ + KE E K+K K KKKK KK
Sbjct: 54 DAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKK 94
Score = 27.7 bits (62), Expect = 5.9
Identities = 16/38 (42%), Positives = 23/38 (60%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
+++E E+ E KK+ + K K KKK KKKK K+K
Sbjct: 63 KKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKD 100
Score = 27.0 bits (60), Expect = 8.4
Identities = 15/48 (31%), Positives = 24/48 (50%)
Query: 238 EVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
E + + + E ++ K E +K + K+K KKK K K KK+ K
Sbjct: 59 EAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKK 106
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 31.7 bits (72), Expect = 0.27
Identities = 18/38 (47%), Positives = 25/38 (65%), Gaps = 4/38 (10%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
+ EK +KK ED +KE KKKKK+KKKK+K++
Sbjct: 137 LKGHEKKHKKKKHED----DKERKKKKKEKKKKKKRHS 170
Score = 28.6 bits (64), Expect = 2.7
Identities = 20/46 (43%), Positives = 26/46 (56%), Gaps = 6/46 (13%)
Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKK------KKKKKEKKNRR 287
R +D ET E + G EK++KKKK KKKKKEKK ++
Sbjct: 121 RTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKK 166
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 32.6 bits (74), Expect = 0.31
Identities = 15/38 (39%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
++ EE+ E RKK ++ KEKE KK K +K+ + K
Sbjct: 10 KKILTEEELE-RKKKKEEKAKEKELKKLKAAQKEAKAK 46
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 30.0 bits (68), Expect = 0.32
Identities = 9/27 (33%), Positives = 15/27 (55%)
Query: 259 KKDEDVLGKEKENKKKKKKKKKKEKKN 285
K + D++ K+K+ KKKK + K
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKA 43
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 31.6 bits (72), Expect = 0.38
Identities = 12/36 (33%), Positives = 20/36 (55%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
EE++E+ E K K K+ K K ++K+K K+
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKR 74
Score = 31.2 bits (71), Expect = 0.48
Identities = 13/37 (35%), Positives = 19/37 (51%)
Query: 250 EDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
++EE E ++ V K K K K K ++KEK R
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKR 74
Score = 29.2 bits (66), Expect = 2.4
Identities = 12/40 (30%), Positives = 22/40 (55%)
Query: 245 WIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
W +ED++ ++ ++ED +E++ K K K KK K
Sbjct: 24 WDDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALK 63
Score = 28.9 bits (65), Expect = 2.5
Identities = 17/44 (38%), Positives = 23/44 (52%), Gaps = 5/44 (11%)
Query: 249 EEDEEKTETR-----KKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
EEDEEK E + K K K +K+K K++K+EK R
Sbjct: 40 EEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRE 83
Score = 28.5 bits (64), Expect = 4.0
Identities = 11/52 (21%), Positives = 25/52 (48%)
Query: 236 EREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
+ + ++ W E++E++ E K K+ K ++K+K K +K +
Sbjct: 28 DEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEK 79
Score = 28.5 bits (64), Expect = 4.0
Identities = 9/18 (50%), Positives = 10/18 (55%)
Query: 267 KEKENKKKKKKKKKKEKK 284
EK K KKKK K + K
Sbjct: 197 AEKAAKGGKKKKGKAKAK 214
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 30.9 bits (70), Expect = 0.40
Identities = 11/41 (26%), Positives = 19/41 (46%)
Query: 246 IRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
I RE E + +E+ + ++ + K + KEKK R
Sbjct: 87 IFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKR 127
Score = 30.5 bits (69), Expect = 0.51
Identities = 12/38 (31%), Positives = 20/38 (52%)
Query: 250 EDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
E+E ET ++D E + KKK++ K K +K +
Sbjct: 102 EEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKP 139
Score = 27.0 bits (60), Expect = 6.8
Identities = 9/39 (23%), Positives = 20/39 (51%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
EE ++TE E +++ K++ K K +++K +
Sbjct: 103 EESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKT 141
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 30.3 bits (69), Expect = 0.41
Identities = 15/36 (41%), Positives = 24/36 (66%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
E +EEKT T K+ + + KEK +KK+++KK E +
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAE 36
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 31.3 bits (72), Expect = 0.48
Identities = 7/21 (33%), Positives = 11/21 (52%)
Query: 267 KEKENKKKKKKKKKKEKKNRR 287
K KKK+K++ K K +
Sbjct: 69 WHKAQKKKEKQEAKAAKAKSK 89
Score = 29.0 bits (66), Expect = 2.5
Identities = 12/36 (33%), Positives = 21/36 (58%)
Query: 252 EEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
E+ E +++ + L +KE K K +KKKEK+ +
Sbjct: 47 EQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAK 82
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 30.7 bits (70), Expect = 0.49
Identities = 10/39 (25%), Positives = 23/39 (58%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
EE E + ++ D L +++ K K++K+++ E+K +
Sbjct: 107 EEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKE 145
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region covers
both the Brf homology II and III regions. This region is
involved in binding TATA binding protein.
Length = 95
Score = 29.5 bits (67), Expect = 0.52
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 13/55 (23%)
Query: 232 IVWGEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
+W EL+ +++ EE EEK +K DE N KKKKK+K KK R
Sbjct: 19 RIW----DELNKDYL--EEQEEKELKQKADE-------GNNSGKKKKKRKAKKKR 60
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 30.8 bits (70), Expect = 0.57
Identities = 13/43 (30%), Positives = 16/43 (37%), Gaps = 1/43 (2%)
Query: 233 VWGEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKK 275
+ ER+ E EE +E K K E KKKK
Sbjct: 155 WYAERKDAKQKEEFAAEERKEALAAAAKKSATPQK-VETKKKK 196
>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
This family consists of several eukaryotic ribosome
biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
protein that is essential for the maturation of 25 S
rRNA and the 60 S ribosomal subunit assembly in
Saccharomyces cerevisiae.
Length = 164
Score = 30.3 bits (69), Expect = 0.67
Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 8/57 (14%)
Query: 234 WGEREVELDLE--WIRREEDEEKTETRKKDEDVLGKEKENKKKKKKK-KKKEKKNRR 287
WG + DLE WI E ED K++ KK++ K +K+E KN++
Sbjct: 113 WGYKRANKDLEKDWII-----EVKPNDDPKEDPFAKKRREKKERVAKNEKRELKNKK 164
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 30.5 bits (69), Expect = 0.67
Identities = 12/39 (30%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 248 REEDEEKTETRKKDEDVLGKEKE--NKKKKKKKKKKEKK 284
+++ EK + K + +K +KK++KKK K+KK
Sbjct: 39 QDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 30.9 bits (70), Expect = 0.79
Identities = 13/48 (27%), Positives = 24/48 (50%), Gaps = 3/48 (6%)
Query: 243 LEWIRREEDEEKTETRKKDEDVLG---KEKENKKKKKKKKKKEKKNRR 287
L ++ E D + K E+ L EK+ K++ + K+K +K+ R
Sbjct: 385 LGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432
Score = 30.5 bits (69), Expect = 1.00
Identities = 11/36 (30%), Positives = 23/36 (63%), Gaps = 4/36 (11%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
+E++ K E +K + KE+ ++ K+K++K + KK
Sbjct: 403 KEEKLKQEENEKKQ----KEQADEDKEKRQKDERKK 434
Score = 30.5 bits (69), Expect = 1.1
Identities = 15/58 (25%), Positives = 28/58 (48%), Gaps = 8/58 (13%)
Query: 237 REVELDLEWIRREEDEEKTETRK--------KDEDVLGKEKENKKKKKKKKKKEKKNR 286
+E L+ ++ E K ET K+E + +E E K+K++ + KEK+ +
Sbjct: 372 KEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQK 429
Score = 27.5 bits (61), Expect = 9.5
Identities = 8/33 (24%), Positives = 20/33 (60%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKK 281
+ EEK + + ++ + E+K+K++K ++K
Sbjct: 401 KAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 31.3 bits (70), Expect = 0.79
Identities = 13/42 (30%), Positives = 23/42 (54%)
Query: 246 IRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
I+ E+ +K E KK + K +E++KK + KKE + +
Sbjct: 1661 IKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAK 1702
Score = 30.5 bits (68), Expect = 1.2
Identities = 16/61 (26%), Positives = 34/61 (55%), Gaps = 9/61 (14%)
Query: 236 EREVELDLEWIRREEDEEKT---------ETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
E ++ E +++ E+E+K E +KK E++ E+ENK K ++ KK ++++
Sbjct: 1615 AEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDK 1674
Query: 287 R 287
+
Sbjct: 1675 K 1675
Score = 30.1 bits (67), Expect = 1.6
Identities = 15/41 (36%), Positives = 24/41 (58%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
+ E E+K E KK D K+ E KKK + KKK +++++
Sbjct: 1365 KAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKK 1405
Score = 29.7 bits (66), Expect = 2.4
Identities = 18/81 (22%), Positives = 34/81 (41%), Gaps = 1/81 (1%)
Query: 208 QSKTAIELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRR-EEDEEKTETRKKDEDVLG 266
+++ A EL K + + + + E + E ++ +E ++K E KK D
Sbjct: 1276 EARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAK 1335
Query: 267 KEKENKKKKKKKKKKEKKNRR 287
K+ E KK + K E +
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAA 1356
Score = 29.3 bits (65), Expect = 2.9
Identities = 20/77 (25%), Positives = 36/77 (46%), Gaps = 1/77 (1%)
Query: 208 QSKTAIELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRREEDEEKTETRKKDEDVLGK 267
++ A E+ K + R E + + E E E R+ ED +K E KK E+
Sbjct: 1180 AARKAEEVRKAEELRKAEDARKAEAA-RKAEEERKAEEARKAEDAKKAEAVKKAEEAKKD 1238
Query: 268 EKENKKKKKKKKKKEKK 284
+E KK ++++ +E +
Sbjct: 1239 AEEAKKAEEERNNEEIR 1255
Score = 29.3 bits (65), Expect = 3.2
Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 2/84 (2%)
Query: 204 VALIQSKTAIELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRREEDEEKTETRKKDED 263
V ++ K A E K + + E ++ E D + + EE ++ E KK +
Sbjct: 1635 VEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKK--KAEEAKKAEEDEKKAAE 1692
Query: 264 VLGKEKENKKKKKKKKKKEKKNRR 287
L KE E KK ++ KKKE + ++
Sbjct: 1693 ALKKEAEEAKKAEELKKKEAEEKK 1716
Score = 28.6 bits (63), Expect = 5.7
Identities = 17/52 (32%), Positives = 32/52 (61%), Gaps = 4/52 (7%)
Query: 236 EREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
E E E ++++E EEK KK E++ E+ENK K ++ KK+ +++++
Sbjct: 1697 EAEEAKKAEELKKKEAEEK----KKAEELKKAEEENKIKAEEAKKEAEEDKK 1744
Score = 28.2 bits (62), Expect = 6.5
Identities = 6/38 (15%), Positives = 17/38 (44%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
+ + ++K E KK + E E + + +++ +
Sbjct: 1330 KADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367
Score = 28.2 bits (62), Expect = 6.7
Identities = 14/40 (35%), Positives = 23/40 (57%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
+ +E ++K E KK E+ K +E KK + KKK E+ +
Sbjct: 1445 KADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK 1484
Score = 27.8 bits (61), Expect = 7.9
Identities = 14/41 (34%), Positives = 23/41 (56%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
+ +E ++K E KK D L K KKK + KKK ++ ++
Sbjct: 1392 KADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432
Score = 27.8 bits (61), Expect = 8.5
Identities = 14/41 (34%), Positives = 24/41 (58%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
+ EED++K + KK K E KKK ++KKK ++ ++
Sbjct: 1399 KAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKK 1439
Score = 27.8 bits (61), Expect = 8.9
Identities = 14/40 (35%), Positives = 21/40 (52%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
+ +E ++K E KK D K E KKK + KK E+ +
Sbjct: 1484 KADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKK 1523
Score = 27.8 bits (61), Expect = 9.4
Identities = 12/41 (29%), Positives = 23/41 (56%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
+ +E ++K E KK ++ K +E KKK + KK + ++
Sbjct: 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 30.6 bits (69), Expect = 0.90
Identities = 21/95 (22%), Positives = 29/95 (30%), Gaps = 6/95 (6%)
Query: 193 IYHPTTSQYGSVALIQSKTAIELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRREEDE 252
+ G L +K L L + G+ T RE+
Sbjct: 89 QPVENSLPPGCGILPGTKGIHHLQMLHHPQLGDNPYGT------REMHHLEVPPISTAPP 142
Query: 253 EKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
E E +K + K + K KK KKK NR
Sbjct: 143 EAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRS 177
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 30.4 bits (69), Expect = 0.95
Identities = 12/36 (33%), Positives = 19/36 (52%)
Query: 251 DEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
+EE + + E+ K+K+ KKKKK KK +
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSS 201
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association with
pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 28.4 bits (64), Expect = 0.97
Identities = 15/38 (39%), Positives = 24/38 (63%), Gaps = 4/38 (10%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
EED+E+ + K + N++KKKKKKK +KK++
Sbjct: 21 EEDDEQIDASKVRRI----SQRNRRKKKKKKKLKKKSK 54
Score = 26.5 bits (59), Expect = 5.5
Identities = 14/36 (38%), Positives = 19/36 (52%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
EE EE E + ++ +KKKKKKK K+K
Sbjct: 18 EEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKS 53
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 29.7 bits (67), Expect = 1.1
Identities = 11/34 (32%), Positives = 21/34 (61%)
Query: 250 EDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEK 283
+E + E ++ ++ KE KK+KK+KK++K
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125
Score = 29.0 bits (65), Expect = 1.6
Identities = 12/32 (37%), Positives = 22/32 (68%)
Query: 256 ETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
E+ K+ ++V + +E + K KK+KKEKK ++
Sbjct: 94 ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125
Score = 27.8 bits (62), Expect = 4.1
Identities = 12/36 (33%), Positives = 23/36 (63%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
E ++E+ E K+ E+ + + +KK+KK+KK +K
Sbjct: 94 ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEK 129
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 28.5 bits (64), Expect = 1.1
Identities = 13/19 (68%), Positives = 15/19 (78%)
Query: 266 GKEKENKKKKKKKKKKEKK 284
GK+ + KKKKKKKKKK K
Sbjct: 14 GKKIDVKKKKKKKKKKNKS 32
Score = 26.6 bits (59), Expect = 4.8
Identities = 11/19 (57%), Positives = 13/19 (68%)
Query: 267 KEKENKKKKKKKKKKEKKN 285
K K+ KKKKKKKK+K
Sbjct: 13 KGKKIDVKKKKKKKKKKNK 31
Score = 26.6 bits (59), Expect = 5.2
Identities = 11/20 (55%), Positives = 12/20 (60%)
Query: 266 GKEKENKKKKKKKKKKEKKN 285
K KKKKKKKKK + K
Sbjct: 15 KKIDVKKKKKKKKKKNKSKE 34
Score = 25.8 bits (57), Expect = 9.9
Identities = 12/20 (60%), Positives = 15/20 (75%)
Query: 265 LGKEKENKKKKKKKKKKEKK 284
L +K + KKKKKKKKK+ K
Sbjct: 12 LKGKKIDVKKKKKKKKKKNK 31
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 30.6 bits (69), Expect = 1.1
Identities = 13/38 (34%), Positives = 23/38 (60%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
+++ +EK + KD + KE + K+K+K+KEKK
Sbjct: 116 EKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153
Score = 29.5 bits (66), Expect = 2.4
Identities = 10/39 (25%), Positives = 20/39 (51%)
Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
+E+++EK +D + K + + K + KK +KK
Sbjct: 146 KEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPP 184
Score = 28.7 bits (64), Expect = 4.2
Identities = 13/41 (31%), Positives = 25/41 (60%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
++E+ +E+ + RK E+ K +K+K+K+KK E+ R
Sbjct: 120 KKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDR 160
Score = 28.3 bits (63), Expect = 5.4
Identities = 14/41 (34%), Positives = 24/41 (58%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
++EEK + + K+E KEK ++ K +K K+E K +R
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKR 141
Score = 28.3 bits (63), Expect = 5.8
Identities = 8/40 (20%), Positives = 23/40 (57%)
Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
++ + ++++++ ++E KKKK+K K++ K +
Sbjct: 94 PAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKP 133
Score = 27.9 bits (62), Expect = 6.2
Identities = 9/41 (21%), Positives = 26/41 (63%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
+++E ++ +K ++ +++ K+K+K+K+KK ++ R
Sbjct: 119 KKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRD 159
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 30.0 bits (68), Expect = 1.1
Identities = 10/44 (22%), Positives = 20/44 (45%)
Query: 244 EWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
E +R+E+E + R + K K+ +K++ E+ R
Sbjct: 126 EEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHR 169
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 30.3 bits (69), Expect = 1.1
Identities = 10/48 (20%), Positives = 26/48 (54%), Gaps = 1/48 (2%)
Query: 240 ELDLEWIR-REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
E ++E++ ++ +K + + + +E++ +KKKK K+K+
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEE 443
Score = 30.3 bits (69), Expect = 1.2
Identities = 12/38 (31%), Positives = 27/38 (71%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
++ +EEK E +KK KE+E +++K+KK++++++
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457
Score = 29.9 bits (68), Expect = 1.7
Identities = 12/36 (33%), Positives = 25/36 (69%)
Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEK 283
EE+EEK + ++ E+ + +E K+++++KKKK+
Sbjct: 442 EEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477
Score = 29.5 bits (67), Expect = 2.0
Identities = 14/39 (35%), Positives = 23/39 (58%), Gaps = 1/39 (2%)
Query: 249 EEDEEKTET-RKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
E+E + T KK + K E +KK++++KKEKK +
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKK 433
Score = 29.5 bits (67), Expect = 2.4
Identities = 10/39 (25%), Positives = 29/39 (74%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
++++EE+ E ++K E+ +E+E +++K++++++KK
Sbjct: 437 GKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 28.7 bits (65), Expect = 4.0
Identities = 10/38 (26%), Positives = 27/38 (71%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
+REE++++ + + +E+E +K+KK+++K+E++
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 30.5 bits (70), Expect = 1.2
Identities = 19/54 (35%), Positives = 35/54 (64%), Gaps = 5/54 (9%)
Query: 236 EREVEL-DLE--WIRREE--DEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
ER EL LE +++EE D + K++E++ KEKE ++K+++ +KKE++
Sbjct: 79 ERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEE 132
>gnl|CDD|180007 PRK05309, PRK05309, 30S ribosomal protein S11; Validated.
Length = 128
Score = 29.0 bits (66), Expect = 1.2
Identities = 10/14 (71%), Positives = 10/14 (71%)
Query: 272 KKKKKKKKKKEKKN 285
KKKK KKK KKN
Sbjct: 1 AKKKKTVKKKVKKN 14
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 30.5 bits (69), Expect = 1.2
Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 14/52 (26%)
Query: 248 REEDEEKTETRKKDEDVL--------------GKEKENKKKKKKKKKKEKKN 285
++ ++K + KKD D ++ NKKKKKKKKKK+KKN
Sbjct: 19 QKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKN 70
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 30.1 bits (68), Expect = 1.2
Identities = 14/48 (29%), Positives = 22/48 (45%)
Query: 240 ELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
E +ED+E + + E+V K+K KK +K+K N R
Sbjct: 330 EFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVSNNFR 377
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 30.0 bits (68), Expect = 1.3
Identities = 20/78 (25%), Positives = 31/78 (39%), Gaps = 10/78 (12%)
Query: 206 LIQSKTAIELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRREEDEEKTETRKKDEDVL 265
L++S E L ++ +RE E D+E E + + + K+E
Sbjct: 211 LLKSLLIPEFKPLDKYLKESKKK-------KRETEEDVE---AAESRAEKKRKSKEEIKK 260
Query: 266 GKEKENKKKKKKKKKKEK 283
K KE+K K KK K
Sbjct: 261 KKPKESKGVKALKKVVAK 278
Score = 28.8 bits (65), Expect = 2.7
Identities = 13/38 (34%), Positives = 21/38 (55%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
E K + R+ +EDV E +KK+K K++ +KK
Sbjct: 226 YLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKP 263
Score = 27.3 bits (61), Expect = 8.2
Identities = 14/37 (37%), Positives = 20/37 (54%), Gaps = 2/37 (5%)
Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
EED E E+R + + ++E KKKK K+ K K
Sbjct: 237 TEEDVEAAESRAEKK--RKSKEEIKKKKPKESKGVKA 271
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 30.4 bits (69), Expect = 1.3
Identities = 15/48 (31%), Positives = 25/48 (52%)
Query: 238 EVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
+ E LE E++E + ++ GK + +K K KKK+K+KK
Sbjct: 1136 KFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKK 1183
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 28.9 bits (65), Expect = 1.4
Identities = 13/19 (68%), Positives = 16/19 (84%)
Query: 267 KEKENKKKKKKKKKKEKKN 285
K K+ K+KKKKKKKK+KK
Sbjct: 108 KAKQKKQKKKKKKKKKKKT 126
Score = 28.5 bits (64), Expect = 2.0
Identities = 12/18 (66%), Positives = 15/18 (83%)
Query: 267 KEKENKKKKKKKKKKEKK 284
++K+ KKKKKKKKKK K
Sbjct: 111 QKKQKKKKKKKKKKKTSK 128
Score = 28.1 bits (63), Expect = 2.6
Identities = 13/18 (72%), Positives = 16/18 (88%)
Query: 267 KEKENKKKKKKKKKKEKK 284
KEK +KK+KKKKKK+KK
Sbjct: 106 KEKAKQKKQKKKKKKKKK 123
Score = 27.7 bits (62), Expect = 4.2
Identities = 15/34 (44%), Positives = 21/34 (61%)
Query: 251 DEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
+E+K + E K+++ KKKKKKKKK KK
Sbjct: 96 EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKK 129
Score = 27.7 bits (62), Expect = 4.2
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 256 ETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
E K+ + K+K+ KKK KK K+KK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 27.0 bits (60), Expect = 6.2
Identities = 13/33 (39%), Positives = 19/33 (57%), Gaps = 4/33 (12%)
Query: 253 EKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
++ +KK + K+K+ KKKKK KK KK
Sbjct: 106 KEKAKQKKQK----KKKKKKKKKKTSKKAAKKK 134
Score = 26.6 bits (59), Expect = 8.9
Identities = 11/30 (36%), Positives = 18/30 (60%)
Query: 252 EEKTETRKKDEDVLGKEKENKKKKKKKKKK 281
+EK + +K+ + K+K+ KK KKKK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 30.0 bits (68), Expect = 1.5
Identities = 8/38 (21%), Positives = 24/38 (63%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
ED+E + D+D+ +E+ ++++++++ +EK+
Sbjct: 143 SEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAE 180
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 29.9 bits (67), Expect = 1.5
Identities = 14/37 (37%), Positives = 21/37 (56%), Gaps = 1/37 (2%)
Query: 249 EEDEEKTETRKKDEDVLGKE-KENKKKKKKKKKKEKK 284
E+ EE E + ++E L K+ K+ KK K KK +K
Sbjct: 318 EDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354
Score = 28.8 bits (64), Expect = 4.0
Identities = 16/52 (30%), Positives = 26/52 (50%)
Query: 236 EREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
ERE +L E + E E+ ++ + +E+ +E KK KK KK + K
Sbjct: 299 EREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNG 350
Score = 28.0 bits (62), Expect = 7.3
Identities = 14/40 (35%), Positives = 22/40 (55%), Gaps = 2/40 (5%)
Query: 246 IRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
I ++ED E++E K +E+ G + KK KK K K+
Sbjct: 314 IEQDEDSEESEEEKNEEE--GGLSKKGKKLKKLKGKKNGL 351
Score = 28.0 bits (62), Expect = 7.4
Identities = 18/51 (35%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 235 GEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
G R +L ++ + +++++ E+ K ED G E+++KKKKKK K +KK
Sbjct: 207 GGRGKDLKIKDLEGDDEDDGDESDKGGED--GDEEKSKKKKKKLAKNKKKL 255
>gnl|CDD|153289 cd07605, I-BAR_IMD, Inverse (I)-BAR, also known as the IRSp53/MIM
homology Domain (IMD), a dimerization module that binds
and bends membranes. Inverse (I)-BAR (or IMD) is a
member of the Bin/Amphiphysin/Rvs (BAR) domain family.
It is a dimerization and lipid-binding module that bends
membranes and induces membrane protrusions in the
opposite direction compared to classical BAR and F-BAR
domains, which produce membrane invaginations. IMD
domains are found in Insulin Receptor tyrosine kinase
Substrate p53 (IRSp53), Missing in Metastasis (MIM), and
Brain-specific Angiogenesis Inhibitor 1-Associated
Protein 2-like (BAIAP2L) proteins. These are
multi-domain proteins that act as scaffolding proteins
and transducers of a variety of signaling pathways that
link membrane dynamics and the underlying actin
cytoskeleton. Most members contain an N-terminal IMD, an
SH3 domain, and a WASP homology 2 (WH2) actin-binding
motif at the C-terminus, exccept for MIM which does not
carry an SH3 domain. Some members contain additional
domains and motifs. The IMD domain binds and bundles
actin filaments, binds membranes and produces membrane
protrusions, and interacts with the small GTPase Rac.
Length = 223
Score = 29.6 bits (67), Expect = 1.5
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 236 EREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKK 281
E+++ELD + I + E + K E ++K ED+ E KK +KK +K
Sbjct: 100 EKKLELDQKVINKFEKDYKKEYKQKREDLDKARSELKKLQKKSQKS 145
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 29.4 bits (66), Expect = 1.5
Identities = 14/29 (48%), Positives = 20/29 (68%)
Query: 258 RKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
+K+ ED +ENK+KKKKK KK+K +
Sbjct: 76 KKETEDWFSPNQENKQKKKKKDKKKKSPK 104
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 29.9 bits (68), Expect = 1.5
Identities = 17/54 (31%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 236 EREVELDLEWIRREEDEEKTETRKK--DEDVLGKEKENKKKKKKKKKKEKKNRR 287
E ++ E R+EE +EK E +KK E L K +++K ++K+++K+ R+
Sbjct: 269 EEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
Score = 28.4 bits (64), Expect = 4.3
Identities = 12/42 (28%), Positives = 26/42 (61%)
Query: 246 IRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
+ + +EE+ + K E+ +E + KK++KKK+++E K +
Sbjct: 261 VDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAK 302
>gnl|CDD|220274 pfam09510, Rtt102p, Rtt102p-like transcription regulator protein.
This protein is found in fungi. The family includes
Rtt102p, a transcription regulator protein which appears
to be integrally associated with both the Swi-Snf and
the RSC chromatin remodelling complexes.
Length = 130
Score = 28.8 bits (64), Expect = 1.7
Identities = 15/41 (36%), Positives = 21/41 (51%), Gaps = 1/41 (2%)
Query: 245 WIRREEDEEKTETRKKDEDVLG-KEKENKKKKKKKKKKEKK 284
W + +DE + +DVL E + KK KKKKK + K
Sbjct: 60 WQKDPDDEPLDFKDESTDDVLDLDEFDRTKKSKKKKKVQVK 100
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 29.3 bits (66), Expect = 1.7
Identities = 16/54 (29%), Positives = 38/54 (70%), Gaps = 1/54 (1%)
Query: 234 WGEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
+RE E + R++ +E++ RK++E+ L +E+E KK+++++K++E++ R+
Sbjct: 18 RQQREAEEEEREERKKLEEKREGERKEEEE-LEEEREKKKEEEERKEREEQARK 70
>gnl|CDD|182080 PRK09795, PRK09795, aminopeptidase; Provisional.
Length = 361
Score = 29.5 bits (66), Expect = 1.8
Identities = 19/58 (32%), Positives = 26/58 (44%), Gaps = 8/58 (13%)
Query: 205 ALIQSKTAIELSKL-LQFRNGETSAPTHIV------WGEREVELDLEWIRREEDEEKT 255
L Q KT E+ K+ L + A HI ERE+ +LEW R++ EK
Sbjct: 123 VLRQIKTPEEVEKIRLACGIADRGA-EHIRRFIQAGMSEREIAAELEWFMRQQGAEKA 179
>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11. The WW domain is a
small protein module with a triple-stranded beta-sheet
fold. This is a family of WW domain binding proteins.
Length = 78
Score = 27.6 bits (62), Expect = 1.8
Identities = 11/19 (57%), Positives = 14/19 (73%)
Query: 269 KENKKKKKKKKKKEKKNRR 287
KE KKK+ KK K E++ RR
Sbjct: 12 KEQKKKELKKNKAERQARR 30
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 29.3 bits (66), Expect = 2.0
Identities = 23/85 (27%), Positives = 38/85 (44%), Gaps = 11/85 (12%)
Query: 214 ELSKLLQFRNGETSAPTHIVWGEREVEL-----------DLEWIRREEDEEKTETRKKDE 262
+L + L + ++ P HIV+ + E E E + R E+ + +K
Sbjct: 107 KLKEELHLTDSGSAGPKHIVFVDDEEEQKSFDPAEYFDTTPELLDRRENRPRISQLEKTS 166
Query: 263 DVLGKEKENKKKKKKKKKKEKKNRR 287
V K+K+ KKK+K KE K R+
Sbjct: 167 LVDEKQKKKSAKKKRKLYKELKERK 191
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 29.3 bits (66), Expect = 2.2
Identities = 15/54 (27%), Positives = 27/54 (50%), Gaps = 4/54 (7%)
Query: 236 EREVELDLEWIRREEDEEKTETRKKDEDVLGK--EKENKKKKKKKKKKEKKNRR 287
E+E +E ++ E K T+ K E + + +KKK+KK K E++ +
Sbjct: 343 REELEKRIEKLK--EKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 29.2 bits (66), Expect = 2.7
Identities = 14/41 (34%), Positives = 26/41 (63%)
Query: 241 LDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKK 281
D E ++RE++E++ +K L K++E KKK+ +K +K
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 29.2 bits (65), Expect = 2.8
Identities = 12/48 (25%), Positives = 24/48 (50%)
Query: 235 GEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKE 282
GE E E ++ R+ E E + E K+ D G+ + + + + + + E
Sbjct: 677 GENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAE 724
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 28.9 bits (65), Expect = 2.9
Identities = 10/41 (24%), Positives = 17/41 (41%), Gaps = 5/41 (12%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
RR + K + ++++ K K KK KK R+
Sbjct: 179 RRAKKAAKLSSTAS-----SGDEKSPKSKAAPKKAGKKMRK 214
>gnl|CDD|227550 COG5225, RRS1, Uncharacterized protein involved in ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 172
Score = 28.5 bits (63), Expect = 3.2
Identities = 12/48 (25%), Positives = 20/48 (41%), Gaps = 7/48 (14%)
Query: 240 ELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
L +W+ E + KT R+ + K N KK + ++ RR
Sbjct: 132 RLLKQWLVEGEVKLKTLLRR-------QRKNNIKKNEANRRGNISRRR 172
>gnl|CDD|131269 TIGR02214, spoVD_pbp, stage V sporulation protein D. This model
describes the spoVD subfamily of homologs of the cell
division protein FtsI, a penicillin binding protein.
This subfamily is restricted to Bacillus subtilis and
related Gram-positive species with known or suspected
endospore formation capability. In these species, the
functional equivalent of FtsI is desginated PBP-2B, a
paralog of spoVD [Cell envelope, Biosynthesis and
degradation of murein sacculus and peptidoglycan,
Cellular processes, Sporulation and germination].
Length = 636
Score = 29.0 bits (65), Expect = 3.2
Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 3/58 (5%)
Query: 222 RNGETSAPTHIVWGEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKK 279
RNG A + V V+LD IR+ ++ ++ E + VL +KE K+ KK
Sbjct: 55 RNGNELAVSVSVD---RVDLDPATIRKNKEPKQREIANELARVLEMDKEKVFKRITKK 109
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 29.0 bits (65), Expect = 3.2
Identities = 6/39 (15%), Positives = 18/39 (46%)
Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
++ EE + R ++ + ++ +K K+ E+ +
Sbjct: 74 EQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK 112
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 28.8 bits (64), Expect = 3.3
Identities = 17/58 (29%), Positives = 32/58 (55%), Gaps = 7/58 (12%)
Query: 234 WGEREVELDLEWIRREED----EEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
E +EL+ +REE EE+ + RK++E K ++++K++ K+E + RR
Sbjct: 209 QQEAALELEELKKKREERRKVLEEEEQRRKQEE---ADRKSREEEEKRRLKEEIERRR 263
Score = 27.7 bits (61), Expect = 8.9
Identities = 12/47 (25%), Positives = 23/47 (48%)
Query: 237 REVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEK 283
E E + REE EE K ++ ++ E +K++K+ + E+
Sbjct: 112 EEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEE 158
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 28.1 bits (63), Expect = 3.4
Identities = 9/37 (24%), Positives = 18/37 (48%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
++D E ++E L KE +++KK ++ K
Sbjct: 33 KKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKE 69
>gnl|CDD|132849 cd07210, Pat_hypo_W_succinogenes_WS1459_like, Hypothetical patatin
similar to WS1459 of Wolinella succinogenes.
Patatin-like phospholipase. This family predominantly
consists of bacterial patatin glycoproteins. The patatin
protein accounts for up to 40% of the total soluble
protein in potato tubers. Patatin is a storage protein,
but it also has the enzymatic activity of a lipid acyl
hydrolase, catalyzing the cleavage of fatty acids from
membrane lipids. Members of this family have also been
found in vertebrates.
Length = 221
Score = 28.5 bits (64), Expect = 3.5
Identities = 9/56 (16%), Positives = 20/56 (35%), Gaps = 2/56 (3%)
Query: 145 IRCDDTPVVYTHVITTPEGEFFCHNHAGDELRHPFDPSKIYMSLDTGRIYHPTTSQ 200
+R + ++Y H + + N G R P + +++ +G I
Sbjct: 162 LRPEIERILY-HHVAPRRP-WERLNIIGKISRALIRPFRWNLAILSGVIEDVLEPL 215
>gnl|CDD|240334 PTZ00258, PTZ00258, GTP-binding protein; Provisional.
Length = 390
Score = 28.8 bits (65), Expect = 3.6
Identities = 15/33 (45%), Positives = 22/33 (66%), Gaps = 2/33 (6%)
Query: 255 TETRKKDEDVLGK--EKENKKKKKKKKKKEKKN 285
+E KD + + K ++ KK+KKKKKKKE+K
Sbjct: 149 SELILKDLEFVEKRLDELTKKRKKKKKKKEEKV 181
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 118
Score = 27.6 bits (62), Expect = 3.9
Identities = 10/19 (52%), Positives = 15/19 (78%)
Query: 267 KEKENKKKKKKKKKKEKKN 285
+ K +K+KKKKKKKE ++
Sbjct: 72 EAKAKEKEKKKKKKKELED 90
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 27.7 bits (62), Expect = 4.0
Identities = 20/66 (30%), Positives = 33/66 (50%), Gaps = 1/66 (1%)
Query: 213 IELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENK 272
EL+ + FR S P HI G+ V + + +E++E+ E ++D+D E+E
Sbjct: 82 FELTPPVTFRLKSGSGPVHIS-GQHLVASEEDESDDDEEDEEEEDDEEDDDEDESEEEES 140
Query: 273 KKKKKK 278
KK K
Sbjct: 141 PVKKVK 146
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 28.6 bits (63), Expect = 4.5
Identities = 7/34 (20%), Positives = 20/34 (58%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKE 282
EE E ++ E+ +GK+++ ++++ +K +
Sbjct: 1030 EEKERMESLQRAKEEEIGKKEKEREQRIRKTIHD 1063
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 28.6 bits (64), Expect = 4.5
Identities = 14/36 (38%), Positives = 20/36 (55%)
Query: 252 EEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
E+KT+ +KK GK + KKK K K K+ +R
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783
>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
structure and biogenesis].
Length = 107
Score = 27.2 bits (60), Expect = 4.6
Identities = 13/31 (41%), Positives = 16/31 (51%)
Query: 256 ETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
KK E ++KKK KKKKKE+ R
Sbjct: 8 SKEKKAEKAKAGTAKDKKKWSKKKKKEEARR 38
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 27.0 bits (60), Expect = 4.7
Identities = 10/36 (27%), Positives = 13/36 (36%)
Query: 250 EDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
E + K + + K K KK KK KK
Sbjct: 31 EGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKG 66
>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p) and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
gene from Saccharomyces cerevisiae. It is an essential
yeast protein involved in pre-rRNA processing and
ribosome assembly. Rrp7p contains an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 128
Score = 27.2 bits (61), Expect = 4.8
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 254 KTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
K ++ +E++ +K+KKKKKKKE ++
Sbjct: 62 KGPAAGEEAGKAAEEEKKEKEKKKKKKKELED 93
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 28.1 bits (63), Expect = 5.1
Identities = 10/42 (23%), Positives = 20/42 (47%)
Query: 244 EWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
EWI E D+E + +DE+ + + K+ ++ E+
Sbjct: 117 EWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDE 158
>gnl|CDD|227478 COG5149, TOA1, Transcription initiation factor IIA, large chain
[Transcription].
Length = 293
Score = 28.1 bits (62), Expect = 5.1
Identities = 16/92 (17%), Positives = 28/92 (30%), Gaps = 12/92 (13%)
Query: 190 TGRIYHPTTSQYGSVALIQSKTAIELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRRE 249
G I S + ELS + + W ER ++ + E
Sbjct: 137 YGDIIAEVISL----PNRLEQVEDELS----IGKSAITTLRNTDWRER----LIDDTQSE 184
Query: 250 EDEEKTETRKKDEDVLGKEKENKKKKKKKKKK 281
D E+ R + + E+ ++ K K
Sbjct: 185 WDGERMRRRDGKQGIHQYERLSEGPAHAFKGK 216
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 28.1 bits (63), Expect = 5.2
Identities = 11/33 (33%), Positives = 18/33 (54%)
Query: 254 KTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
K + R++ + K+ + KKK KKK+K R
Sbjct: 6 KRKARQELAVQVAKQAKAKKKANKKKRKIYFKR 38
>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e.
Length = 108
Score = 26.8 bits (60), Expect = 5.2
Identities = 11/33 (33%), Positives = 20/33 (60%)
Query: 253 EKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
EK + +KK E EK+ KK+ +++K ++K
Sbjct: 40 EKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 28.1 bits (63), Expect = 5.5
Identities = 13/36 (36%), Positives = 19/36 (52%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
E ++K + KK+ G+ K K KK KK +KK
Sbjct: 70 SEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKK 105
Score = 28.1 bits (63), Expect = 6.4
Identities = 16/40 (40%), Positives = 21/40 (52%)
Query: 246 IRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
E T T+K ++ E + K +KKKKKKKEKK
Sbjct: 43 SEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKE 82
>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3. The SLD3 DNA
replication regulator is required for loading and
maintenance of Cdc45 on chromatin during DNA
replication.
Length = 437
Score = 28.2 bits (63), Expect = 5.6
Identities = 10/44 (22%), Positives = 19/44 (43%)
Query: 242 DLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
+ ++E + K L + K K++K+KK KK+
Sbjct: 118 LELDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLP 161
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 28.2 bits (63), Expect = 5.8
Identities = 12/21 (57%), Positives = 14/21 (66%)
Query: 267 KEKENKKKKKKKKKKEKKNRR 287
E KK KKKKKK+KK R+
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRK 290
Score = 27.4 bits (61), Expect = 10.0
Identities = 11/22 (50%), Positives = 13/22 (59%)
Query: 266 GKEKENKKKKKKKKKKEKKNRR 287
E K KK KKKK+KK +R
Sbjct: 267 YDVSEMVKFKKPKKKKKKKKKR 288
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 28.1 bits (63), Expect = 6.0
Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 8/50 (16%)
Query: 244 EWIRREEDEEKTETRKKDEDVLGKEKEN--------KKKKKKKKKKEKKN 285
E + E E E + E VL +++N KK+K KK+ K N
Sbjct: 1529 EPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSN 1578
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 27.6 bits (62), Expect = 6.1
Identities = 11/41 (26%), Positives = 21/41 (51%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
R E+ +K E + E +E++ K K +KK+K + +
Sbjct: 114 RAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 27.2 bits (61), Expect = 6.1
Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 2/40 (5%)
Query: 246 IRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
+E + T K + L E E K++KKK+ KEKK
Sbjct: 74 AAKEVKKPGIST--KAQQALKLEHERNKQEKKKRSKEKKE 111
>gnl|CDD|233371 TIGR01354, cyt_deam_tetra, cytidine deaminase, homotetrameric.
This small, homotetrameric zinc metalloprotein is found
in humans and most bacteria. A related, homodimeric form
with a much larger subunit is found in E. coli and in
Arabidopsis. Both types may act on deoxycytidine as well
as cytidine [Purines, pyrimidines, nucleosides, and
nucleotides, Salvage of nucleosides and nucleotides].
Length = 127
Score = 27.2 bits (61), Expect = 6.1
Identities = 15/59 (25%), Positives = 25/59 (42%), Gaps = 8/59 (13%)
Query: 127 LYEDEFPFISPCGRERNFIR---CDDTPVVYTHVITTPEGEFFCHNHAGDELRHPFDPS 182
+ + +SPCG R + DTP+ T+ +G + + G+ L F PS
Sbjct: 72 VADSADDSVSPCGACRQVLAEFCGPDTPIYMTN----NDGTYKVYT-VGELLPFGFGPS 125
>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
subunit. CENP-Q is one of the components that assembles
onto the CENP-A-nucleosome distal (CAD) centromere. The
centromere, which is the basic element of chromosome
inheritance, is epigenetically determined in mammals.
CENP-A, the centromere-specific histone H3 variant,
assembles an array of nucleosomes and it is this that
seems to be the prime candidate for specifying
centromere identity. CENP-A nucleosomes directly recruit
a proximal CENP-A nucleosome associated complex (NAC)
comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
CENP-C and CENP-H. Assembly of the CENP-A NAC at
centromeres is dependent on CENP-M, CENP-N and CENP-T.
Additionally, there are seven other subunits which make
up the CENP-A-nucleosome distal (CAD) centromere,
CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
CENP-S, also assembling on the CENP-A NAC. Fta7 is the
equivalent component of the fission yeast Sim4 complex.
Length = 159
Score = 27.3 bits (61), Expect = 6.3
Identities = 13/54 (24%), Positives = 30/54 (55%), Gaps = 2/54 (3%)
Query: 236 EREVELDLEWIR--REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
ER++ +L+ + EE + +KD + L + ++N K ++++++E KN
Sbjct: 33 ERQLAAELKQLELLEEEIRREEAELEKDLEYLQELEKNAKALEREREEESKNLH 86
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 28.1 bits (63), Expect = 6.4
Identities = 13/39 (33%), Positives = 21/39 (53%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNRR 287
+ K + + K E K K+KKK+KKKE+K ++
Sbjct: 305 MISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKK 343
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 26.9 bits (60), Expect = 6.4
Identities = 18/44 (40%), Positives = 22/44 (50%), Gaps = 7/44 (15%)
Query: 249 EEDEE--KTETRKK-----DEDVLGKEKENKKKKKKKKKKEKKN 285
E DE+ T T+K K +K+KKKKKKKE KN
Sbjct: 46 EPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKELKN 89
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 27.5 bits (61), Expect = 6.6
Identities = 12/21 (57%), Positives = 17/21 (80%)
Query: 267 KEKENKKKKKKKKKKEKKNRR 287
KEKE KKK K+ K++EK+ +R
Sbjct: 187 KEKEEKKKVKEAKRREKEEKR 207
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 28.0 bits (63), Expect = 6.6
Identities = 11/48 (22%), Positives = 24/48 (50%), Gaps = 6/48 (12%)
Query: 243 LEWIRREEDEEKTETR------KKDEDVLGKEKENKKKKKKKKKKEKK 284
++ + E+E + E + ++++ EK+ +KK KK K+ K
Sbjct: 60 MKPLEEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGK 107
>gnl|CDD|235855 PRK06707, PRK06707, amidase; Provisional.
Length = 536
Score = 28.0 bits (62), Expect = 6.8
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEK 283
E E+ T+ + E V+ +EN KK+ + EK
Sbjct: 432 EASEKSAITKDEFEKVVQTSQENAKKELDRYLVEK 466
>gnl|CDD|236254 PRK08384, PRK08384, thiamine biosynthesis protein ThiI;
Provisional.
Length = 381
Score = 28.0 bits (62), Expect = 7.0
Identities = 32/134 (23%), Positives = 59/134 (44%), Gaps = 15/134 (11%)
Query: 163 GEFFCHNHAGDELRHPFD--------PSKIYMSLDTGRIYH--PTTSQYGSVALIQSKTA 212
GE+ N + H +D K Y+ +D + + P +Q VAL+
Sbjct: 133 GEYILENEESEVDLHNYDIEVGVELMEGKAYVFVDKVKAWGGLPIGTQGKVVALLSGGID 192
Query: 213 IELSKLLQFRNGETSAPTHIVWGEREVELDLEWIRREEDE-EKTETRKKDEDVLGKEKEN 271
++ L + G P HI GE+ LE +R+ ++ +K K E ++ K +E
Sbjct: 193 SPVAAFLMMKRGVEVIPVHIYMGEKT----LEKVRKIWNQLKKYHYGGKAELIVVKPQER 248
Query: 272 KKKKKKKKKKEKKN 285
++ +K K+ +K+N
Sbjct: 249 ERIIQKLKELKKEN 262
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 27.9 bits (62), Expect = 7.1
Identities = 10/55 (18%), Positives = 19/55 (34%), Gaps = 5/55 (9%)
Query: 238 EVELDLEWIRREEDEEKTETRKKDEDVLGK-----EKENKKKKKKKKKKEKKNRR 287
++ D + ++ K ++ K EK KK K +KK +
Sbjct: 88 HIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKV 142
>gnl|CDD|177283 PHA00451, PHA00451, protein kinase.
Length = 362
Score = 27.8 bits (62), Expect = 7.1
Identities = 12/45 (26%), Positives = 23/45 (51%)
Query: 242 DLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
++E I + E+ + RK D G + +K K++K+ +K R
Sbjct: 236 EVEAIANQRMIERCKNRKAKRDPNGTFQVARKAAMKRRKRNRKLR 280
>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694). Family of
hypothetical bacterial proteins.
Length = 278
Score = 27.6 bits (62), Expect = 7.5
Identities = 13/38 (34%), Positives = 23/38 (60%)
Query: 247 RREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
+ ++ + R + L KEK+ KKKKKKKK++ ++
Sbjct: 67 KLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELRE 104
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 26.8 bits (60), Expect = 7.7
Identities = 18/46 (39%), Positives = 25/46 (54%), Gaps = 9/46 (19%)
Query: 249 EEDEE--KTETR--KKDEDVLGK-----EKENKKKKKKKKKKEKKN 285
E DE+ T TR +K + + E+ K+KKKKKKKE +N
Sbjct: 46 EPDEDGWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKKELEN 91
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 27.4 bits (61), Expect = 7.7
Identities = 10/44 (22%), Positives = 24/44 (54%)
Query: 242 DLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
++E ++ EE E K+D+ KE E +++ +++ +E +
Sbjct: 61 EIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSD 104
Score = 27.0 bits (60), Expect = 9.8
Identities = 10/37 (27%), Positives = 18/37 (48%), Gaps = 3/37 (8%)
Query: 249 EEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
++ E K+E+ KE N + K+ K EK++
Sbjct: 55 DDQETAEIEEVKEEE---KEAANSEDKEDKGDAEKED 88
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 27.3 bits (61), Expect = 7.9
Identities = 13/49 (26%), Positives = 23/49 (46%)
Query: 236 EREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
E E + + E + + EE+ + +K+ + KE KKKKK +
Sbjct: 65 EPESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSP 113
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 27.7 bits (62), Expect = 8.0
Identities = 16/41 (39%), Positives = 22/41 (53%), Gaps = 3/41 (7%)
Query: 247 RREEDEEKTETRKKDEDV---LGKEKENKKKKKKKKKKEKK 284
+ EK E +K E++ L +E K KKKK+KKKE
Sbjct: 398 SAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWF 438
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 26.9 bits (60), Expect = 8.2
Identities = 14/46 (30%), Positives = 27/46 (58%), Gaps = 2/46 (4%)
Query: 242 DLEWIRREEDEEKTETRKKDE--DVLGKEKENKKKKKKKKKKEKKN 285
D+E R E++ EK E +K+E GK K++++K+ ++ +K
Sbjct: 13 DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKA 58
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 27.7 bits (62), Expect = 8.2
Identities = 14/52 (26%), Positives = 26/52 (50%), Gaps = 4/52 (7%)
Query: 236 EREVELDLEWIRREEDEEKTETRKKD-EDVLGKEKENKKKKKKKKKKEKKNR 286
+ E EL+L EE EE E ++D D + + ++ K+++E + R
Sbjct: 142 KNEFELELPEEEEEEPEEMEEELEEDAADR---DARKRAAEEAKEQEELRRR 190
>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
Length = 819
Score = 27.7 bits (62), Expect = 8.3
Identities = 14/65 (21%), Positives = 27/65 (41%), Gaps = 2/65 (3%)
Query: 218 LLQFRNGETSAPTHIVWGEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKK 277
LL+ E + + E + + E +E + + ++ E ++E KK+ K
Sbjct: 438 LLKKFLEEEAILKSLE--EFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPK 495
Query: 278 KKKKE 282
K KK
Sbjct: 496 KVKKI 500
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 27.7 bits (61), Expect = 8.4
Identities = 12/41 (29%), Positives = 21/41 (51%), Gaps = 4/41 (9%)
Query: 249 EEDEEKTETRKK----DEDVLGKEKENKKKKKKKKKKEKKN 285
E + TR+K DV ++ E K +K++K KK ++
Sbjct: 109 ESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDED 149
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 27.4 bits (61), Expect = 8.6
Identities = 22/75 (29%), Positives = 38/75 (50%), Gaps = 7/75 (9%)
Query: 212 AIELSKLLQFRNGETSAPTHIVWGEREVEL-DLEWIRREEDEEKTETRKKDEDVLGKEKE 270
I LS+ R+G S P + + + +L L ++EE+E + E R+ LG E
Sbjct: 88 LIVLSR--HRRDGIKSDPNYEIRSDVRRQLRFLAQKQKEEEERRVERRR----ELGLEDP 141
Query: 271 NKKKKKKKKKKEKKN 285
+ + K+K K+E+K
Sbjct: 142 EQLRLKQKAKEEQKA 156
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 27.4 bits (61), Expect = 8.6
Identities = 11/37 (29%), Positives = 20/37 (54%), Gaps = 2/37 (5%)
Query: 252 EEKTETRKKDEDVLG--KEKENKKKKKKKKKKEKKNR 286
+ ++ + DED+L E E +K K KK ++ +R
Sbjct: 22 DASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASR 58
>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
Length = 613
Score = 27.8 bits (63), Expect = 8.8
Identities = 14/46 (30%), Positives = 22/46 (47%)
Query: 239 VELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
+LDL EE EEK E ++ + +L + KE K K + +
Sbjct: 472 GDLDLGKEDEEEKEEKEEAEEEFKPLLERLKEALGDKVKDVRLSHR 517
>gnl|CDD|204061 pfam08779, SARS_X4, SARS coronavirus X4 like. The structure of the
coronavirus X4 protein (also known as 7a and U122) shows
similarities to the immunoglobulin like fold and
suggests a binding activity to integrin I domains. In
SARS-CoV- infected cells, the X4 protein is expressed
and retained intra-cellularly within the Golgi network.
X4 has been implicated to function during the
replication cycle of SARS-CoV.
Length = 83
Score = 25.9 bits (56), Expect = 8.9
Identities = 10/31 (32%), Positives = 12/31 (38%)
Query: 125 TGLYEDEFPFISPCGRERNFIRCDDTPVVYT 155
+G YE PF PC I C T +
Sbjct: 22 SGTYEGNSPFAIPCADNAFAITCTSTHFAFA 52
>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein. All proteins in this family
for which functions are known are used for the
recognition of DNA damage as part of nucleotide excision
repair. This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 172
Score = 27.1 bits (60), Expect = 8.9
Identities = 15/52 (28%), Positives = 27/52 (51%), Gaps = 1/52 (1%)
Query: 233 VWGEREVELDLEWIRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKK 284
VWG E LD E RREE +E+ + +K ++ + + + + KK+ +
Sbjct: 89 VWGSEE-ALDEEKERREESKEEMKEKKFEKKLKELRRAVRSSEYTNKKEGRV 139
>gnl|CDD|153331 cd07647, F-BAR_PSTPIP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
Phosphatase-Interacting Proteins. F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Vetebrates contain two
Proline-Serine-Threonine Phosphatase-Interacting
Proteins (PSTPIPs), PSTPIP1 and PSTPIP2. PSTPIPs are
mainly expressed in hematopoietic cells and are involved
in the regulation of cell adhesion and motility.
Mutations in PSTPIPs have been shown to cause
autoinflammatory disorders. PSTPIP1 contains an
N-terminal F-BAR domain, PEST motifs, and a C-terminal
SH3 domain, while PSTPIP2 contains only the N-terminal
F-BAR domain. F-BAR domains form banana-shaped dimers
with a positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 239
Score = 27.1 bits (60), Expect = 9.3
Identities = 16/38 (42%), Positives = 25/38 (65%)
Query: 248 REEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKN 285
E E++ E RKK ED++ + ++NKK+ KK K KK+
Sbjct: 91 EEFREKQKEERKKTEDIMKRSQKNKKELYKKTMKAKKS 128
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 26.5 bits (59), Expect = 9.5
Identities = 12/41 (29%), Positives = 21/41 (51%), Gaps = 6/41 (14%)
Query: 246 IRREEDEEKTETRKKDEDVLGKEKENKKKKKKKKKKEKKNR 286
I E+T+ R + KEK +K ++KK K+ +K +
Sbjct: 94 IALRLRRERTKERAE------KEKRTRKNREKKFKRRQKEK 128
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 27.5 bits (61), Expect = 10.0
Identities = 12/18 (66%), Positives = 16/18 (88%)
Query: 269 KENKKKKKKKKKKEKKNR 286
++ KKKKK K+KK+KKNR
Sbjct: 684 RKKKKKKKSKEKKKKKNR 701
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.138 0.419
Gapped
Lambda K H
0.267 0.0694 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,162,478
Number of extensions: 1492431
Number of successful extensions: 5599
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4563
Number of HSP's successfully gapped: 432
Length of query: 287
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 191
Effective length of database: 6,679,618
Effective search space: 1275807038
Effective search space used: 1275807038
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 58 (26.2 bits)