RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16246
(118 letters)
>gnl|CDD|217293 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand
binding domain. This family is the extracellular ligand
binding domain of these ion channels. This domain forms
a pentameric arrangement in the known structure.
Length = 215
Score = 110 bits (278), Expect = 1e-31
Identities = 36/63 (57%), Positives = 44/63 (69%), Gaps = 1/63 (1%)
Query: 50 NADGNFEVTLA-TKATIYYQGLVEWKPPAIYKSSCEIDVEYFPFDEQTCVLKFGSWTYDG 108
ADG ++T T +Y G V W PPAIYKSSC +D+ YFPFD+Q C LKFGSWTY+G
Sbjct: 92 KADGIHDITTPNTNVRVYPDGTVLWSPPAIYKSSCPMDLTYFPFDQQNCSLKFGSWTYNG 151
Query: 109 FKV 111
+V
Sbjct: 152 EEV 154
>gnl|CDD|233155 TIGR00860, LIC, Cation transporter family protein. The
Ligand-gated Ion Channel (LIC) Family of
Neurotransmitter Receptors TC 1.A.9)Members of the LIC
family of ionotropic neurotransmitter receptors are
found only in vertebrate and invertebrate animals. They
exhibit receptor specificity for (1)acetylcholine, (2)
serotonin, (3) glycine, (4) glutamate and (5)
g-aminobutyric acid (GABA). All of these receptor
channels are probably hetero- orhomopentameric. The best
characterized are the nicotinic acetyl-choline receptors
which are pentameric channels of a2bgd subunit
composition. All subunits arehomologous. The three
dimensional structures of the protein complex in both
the open and closed configurations have been solved at
0.9 nm resolution.The channel protein complexes of the
LIC family preferentially transport cations or anions
depending on the channel (e.g., the acetylcholine
receptors are cationselective while glycine receptors
are anion selective) [Transport and binding proteins,
Cations and iron carrying compounds].
Length = 459
Score = 77.1 bits (190), Expect = 7e-18
Identities = 20/62 (32%), Positives = 26/62 (41%), Gaps = 2/62 (3%)
Query: 49 ENADGNFEVTLATKATI--YYQGLVEWKPPAIYKSSCEIDVEYFPFDEQTCVLKFGSWTY 106
D F T + + G V + P +C +D+ FPFD Q C LKF SW Y
Sbjct: 120 NEKDARFHGITMTNVLVRIHPNGSVLYSPRITLTLACPMDLRNFPFDVQNCSLKFESWGY 179
Query: 107 DG 108
Sbjct: 180 TT 181
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 39.9 bits (94), Expect = 8e-05
Identities = 21/48 (43%), Positives = 38/48 (79%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
K+ +K ++K E+E+K++++K G+KKE EEEEEK+KK+ E+++ +E
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461
Score = 39.5 bits (93), Expect = 1e-04
Identities = 24/51 (47%), Positives = 36/51 (70%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
K ++ EK+ +E KKE++KK KK+ EEEEE+K+KK EEK+ +E A+
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Score = 39.5 bits (93), Expect = 1e-04
Identities = 14/46 (30%), Positives = 34/46 (73%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
++K K++EE++++K+++ +K+E EEEE++KK++ +E++
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459
Score = 39.1 bits (92), Expect = 1e-04
Identities = 18/63 (28%), Positives = 38/63 (60%), Gaps = 1/63 (1%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLAT 61
K KK EK EK+R++E+++K+ + G+++EE+++++ E+K+ KE + E
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKK-KKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468
Query: 62 KAT 64
+
Sbjct: 469 EEE 471
Score = 33.7 bits (78), Expect = 0.011
Identities = 13/48 (27%), Positives = 29/48 (60%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
KK K+ ++ K E++++E ++K+ + KKK++ EE++ K+
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEE 453
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 39.7 bits (92), Expect = 1e-04
Identities = 17/44 (38%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
K+ KKR+E EK +++ EQK E++E E+E+EK++++ E++
Sbjct: 576 KLAKKREEAVEKAKREAEQK--AREEREREKEKEKERERERERE 617
Score = 32.0 bits (72), Expect = 0.060
Identities = 12/35 (34%), Positives = 25/35 (71%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKK 38
+K ++E E+K R++ E++KE +++E E E E ++
Sbjct: 586 EKAKREAEQKAREEREREKEKEKEREREREREAER 620
Score = 28.5 bits (63), Expect = 0.99
Identities = 14/38 (36%), Positives = 26/38 (68%), Gaps = 3/38 (7%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKK 39
K + ++K +EE+ER+KE++K+ ++E E E E+ K
Sbjct: 589 KREAEQKAREEREREKEKEKE---REREREREAERAAK 623
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 36.4 bits (85), Expect = 0.002
Identities = 18/45 (40%), Positives = 22/45 (48%), Gaps = 2/45 (4%)
Query: 12 EKERKKEEQKKEVGEKKEGEEEEEK--KKKKVEEKKNNKENADGN 54
ERKK +K+ EKK +EE EK KKK E + DG
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGE 452
Score = 34.1 bits (79), Expect = 0.010
Identities = 16/44 (36%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
K++KK++ K EK+ +KEE +K +KK ++ K E KK
Sbjct: 413 KLRKKQR-KAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKK 455
Score = 33.4 bits (77), Expect = 0.014
Identities = 15/59 (25%), Positives = 28/59 (47%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLATK 62
+KK ++K+ K KK E+++ + + E KK K + + K + D E T+
Sbjct: 411 RKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDPLGEKLARTE 469
Score = 33.4 bits (77), Expect = 0.017
Identities = 13/39 (33%), Positives = 20/39 (51%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKK 39
+ +K+ EKEE E+ ++K E KK + E KK
Sbjct: 418 QRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456
Score = 27.2 bits (61), Expect = 2.2
Identities = 12/41 (29%), Positives = 21/41 (51%), Gaps = 2/41 (4%)
Query: 2 KMKKKRKEKEEKE--RKKEEQKKEVGEKKEGEEEEEKKKKK 40
K ++K ++K EKE K +KK K+ + + + KK
Sbjct: 416 KKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 35.7 bits (82), Expect = 0.003
Identities = 13/51 (25%), Positives = 20/51 (39%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
K + K++ + K K+ E E E E+ K E+K A N
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILN 842
Score = 31.5 bits (71), Expect = 0.092
Identities = 9/42 (21%), Positives = 16/42 (38%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
K+ + E E + E + E+K + KK + K
Sbjct: 809 KEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYK 850
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 34.9 bits (81), Expect = 0.004
Identities = 14/43 (32%), Positives = 27/43 (62%)
Query: 8 KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
K K E+K + K+V K+ ++E+EK+K KV+++ + +N
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKN 421
Score = 31.1 bits (71), Expect = 0.11
Identities = 12/46 (26%), Positives = 21/46 (45%)
Query: 3 MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
++ K K EK+ K +K ++ E+E+EK K K +
Sbjct: 376 LRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKN 421
Score = 29.5 bits (67), Expect = 0.33
Identities = 11/51 (21%), Positives = 22/51 (43%), Gaps = 2/51 (3%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
+KK + +K K +KKE ++K ++ + K + K K +
Sbjct: 385 EKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNI--GKRRKPSGTSE 433
Score = 28.0 bits (63), Expect = 1.2
Identities = 11/44 (25%), Positives = 22/44 (50%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
K EK+ + K+ K +K++ +E+ + KK+ + K K
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGK 424
Score = 27.6 bits (62), Expect = 1.7
Identities = 10/48 (20%), Positives = 17/48 (35%), Gaps = 7/48 (14%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
K K E+KE++KE+ K + + K K +
Sbjct: 394 KVLAKRAEKKEKEKEKPKVKKRHRD-------TKNIGKRRKPSGTSEE 434
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 34.0 bits (79), Expect = 0.011
Identities = 16/48 (33%), Positives = 30/48 (62%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
++++K +E E ++ E+ K+E+ EKKE +EEE K + EK+ +
Sbjct: 530 RELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQ 577
Score = 28.6 bits (65), Expect = 0.64
Identities = 14/36 (38%), Positives = 22/36 (61%), Gaps = 3/36 (8%)
Query: 11 EEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
EE ER+ E++ +E + E EK K+++EEKK
Sbjct: 526 EELERELEQKAEEAEALLK---EAEKLKEELEEKKE 558
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 33.1 bits (76), Expect = 0.020
Identities = 11/38 (28%), Positives = 26/38 (68%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKV 41
+++++EKE + +K +K + E+K+ +E E+ +K K+
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKI 591
Score = 31.2 bits (71), Expect = 0.095
Identities = 14/40 (35%), Positives = 28/40 (70%)
Query: 6 KRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
K +KEE +R+KEE++ +K+ + +++++KKK E +K
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585
Score = 30.4 bits (69), Expect = 0.16
Identities = 13/34 (38%), Positives = 22/34 (64%)
Query: 17 KEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
KEE ++E EK+ +E++ +K K +E+K KE
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKEL 583
Score = 29.6 bits (67), Expect = 0.30
Identities = 11/41 (26%), Positives = 31/41 (75%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
K+ ++E++E++ +++K + + K+ EE+++K+ +K+E+ K
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
Score = 27.3 bits (61), Expect = 2.2
Identities = 11/37 (29%), Positives = 26/37 (70%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKK 37
+ K++++ +E++R ++ +K+E +KKE E+ E+ K
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 32.4 bits (74), Expect = 0.033
Identities = 18/45 (40%), Positives = 34/45 (75%), Gaps = 1/45 (2%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
++++EK+E+E K ++++EV E++E +EE++KK KKV+E E
Sbjct: 42 EEKEEKKEEEEKTTDKEEEVDEEEE-KEEKKKKTKKVKETTTEWE 85
Score = 25.9 bits (57), Expect = 5.7
Identities = 16/41 (39%), Positives = 27/41 (65%)
Query: 11 EEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
EKE EE+++E EKKE EE+ K+++V+E++ +E
Sbjct: 31 VEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKK 71
Score = 25.9 bits (57), Expect = 6.9
Identities = 16/41 (39%), Positives = 26/41 (63%)
Query: 12 EKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
EKE +KE +E E+KE ++EEE+K EE+ + +E +
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKE 68
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 32.5 bits (74), Expect = 0.034
Identities = 16/50 (32%), Positives = 31/50 (62%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
+++K KE+ E+ER+ EEQ++ EK E + + K +VE+++ +N
Sbjct: 257 ELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQNL 306
Score = 26.4 bits (58), Expect = 4.6
Identities = 10/38 (26%), Positives = 21/38 (55%), Gaps = 4/38 (10%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKK 39
K++++R+ +EE R K E+ + + E +E+ K
Sbjct: 438 KLQEERRLREEAIRAKAEKTA----RMKAEMKEKTLKM 471
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 31.8 bits (73), Expect = 0.039
Identities = 14/49 (28%), Positives = 34/49 (69%)
Query: 3 MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
++++ K+ EE++ + E++ E+ K E E+ E++++++EEK++ E A
Sbjct: 125 LEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIA 173
Score = 29.9 bits (68), Expect = 0.20
Identities = 18/53 (33%), Positives = 31/53 (58%), Gaps = 6/53 (11%)
Query: 2 KMKKKRKEKEEKERKKEE-QKKEVGEKKEGEEE-EEKKKKKVEE----KKNNK 48
++KK +EKEE E++ E + K +K EEE + ++K+ +E KK N+
Sbjct: 128 EIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQNQ 180
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 31.7 bits (73), Expect = 0.040
Identities = 14/36 (38%), Positives = 21/36 (58%)
Query: 3 MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKK 38
+ KK + EK KK E+ K EKK+ ++EE K +
Sbjct: 63 LDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 26.7 bits (60), Expect = 2.2
Identities = 9/21 (42%), Positives = 14/21 (66%)
Query: 2 KMKKKRKEKEEKERKKEEQKK 22
K +K K K EK++ K+E+ K
Sbjct: 76 KKAEKAKAKAEKKKAKKEEPK 96
Score = 25.2 bits (56), Expect = 7.4
Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 1/39 (2%)
Query: 7 RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
+K+ K +K E +K + + EKKK K EE K
Sbjct: 59 EAALLDKKELKAWEKAEKKAEKA-KAKAEKKKAKKEEPK 96
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 32.3 bits (74), Expect = 0.043
Identities = 14/62 (22%), Positives = 26/62 (41%), Gaps = 6/62 (9%)
Query: 1 MKMKKKRK-EKEEKERKKE-----EQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
K+K+ + E+ E+E E E K E++ EE+ + +E + E+ D
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283
Query: 55 FE 56
Sbjct: 284 EI 285
Score = 30.4 bits (69), Expect = 0.22
Identities = 12/50 (24%), Positives = 20/50 (40%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
+ + K+E+E EE E+KE ++ E K K+ E
Sbjct: 244 ETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDEE 293
Score = 27.3 bits (61), Expect = 2.0
Identities = 10/48 (20%), Positives = 17/48 (35%), Gaps = 1/48 (2%)
Query: 4 KKKRKEK-EEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
E E + K+EQ+ E EE++ K E+ +
Sbjct: 239 TDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEIL 286
Score = 27.3 bits (61), Expect = 2.3
Identities = 15/45 (33%), Positives = 24/45 (53%)
Query: 7 RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
K+KE K E+K + EKKE + EEK++ ++ E + A
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKERIEIAETWDTIPLA 692
Score = 26.1 bits (58), Expect = 5.7
Identities = 14/43 (32%), Positives = 27/43 (62%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEK 44
MK K+ + E+KKE + + E+K +EE+++K K+ E++
Sbjct: 636 LMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 31.9 bits (73), Expect = 0.050
Identities = 22/51 (43%), Positives = 34/51 (66%), Gaps = 7/51 (13%)
Query: 2 KMKKKRKEKEEKERKK---EEQ----KKEVGEKKEGEEEEEKKKKKVEEKK 45
K+ KK +EK EK +++ EQ KKE+G +K+ ++E EK K+K+EE K
Sbjct: 210 KITKKVEEKMEKTQREYYLREQLKAIKKELGIEKDDKDELEKLKEKLEELK 260
>gnl|CDD|182730 PRK10787, PRK10787, DNA-binding ATP-dependent protease La;
Provisional.
Length = 784
Score = 31.8 bits (72), Expect = 0.064
Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 5 KKRKEKEEKERKKEEQ----KKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
KK+ EK ++E EQ +KE+GE + +E E K+K++ K KE
Sbjct: 218 KKQMEKSQREYYLNEQMKAIQKELGEMDDAPDENEALKRKIDAAKMPKE 266
>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
[Posttranslational modification, protein turnover,
chaperones].
Length = 782
Score = 31.8 bits (73), Expect = 0.064
Identities = 19/56 (33%), Positives = 36/56 (64%), Gaps = 7/56 (12%)
Query: 1 MKMKKKRKEKEEKERKK---EEQ----KKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
+++KK KE+ EK +++ EQ +KE+GE + ++E E+ ++K+E+ K KE
Sbjct: 212 KRIRKKVKEQMEKSQREYYLREQLKAIQKELGEDDDDKDEVEELREKIEKLKLPKE 267
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 31.5 bits (72), Expect = 0.081
Identities = 12/49 (24%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
+K ++ +++E +E+K+E++K+E + + + ++++K+EEK+ K+
Sbjct: 273 LKAAEEERQEEAQEKKEEKKKEE--REAKLAKLSPEEQRKLEEKERKKQ 319
Score = 29.9 bits (68), Expect = 0.22
Identities = 16/47 (34%), Positives = 32/47 (68%), Gaps = 2/47 (4%)
Query: 1 MKMKKKRKEKEEKERKKEEQK--KEVGEKKEGEEEEEKKKKKVEEKK 45
+K+ + K +K R++EE+K K E+++ E +E+K++KK EE++
Sbjct: 251 LKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEERE 297
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 31.2 bits (70), Expect = 0.086
Identities = 12/45 (26%), Positives = 26/45 (57%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
K ++EKE E + +++E+G+K++ E+ +K K+ K+
Sbjct: 1027 KDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKK 1071
Score = 27.0 bits (59), Expect = 2.5
Identities = 12/44 (27%), Positives = 29/44 (65%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
+M+ ++ KEE+ KKE+++++ K + +E KK++++K+
Sbjct: 1034 RMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 30.8 bits (69), Expect = 0.11
Identities = 16/52 (30%), Positives = 39/52 (75%), Gaps = 3/52 (5%)
Query: 2 KMKKKRKEKEEKERKKEEQKKE--VGEKKEGEEEE-EKKKKKVEEKKNNKEN 50
+++K R+E+EE E + E +++E + E+++ EE+E E++++ +EE+ ++ E+
Sbjct: 127 QIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEH 178
Score = 25.4 bits (55), Expect = 9.1
Identities = 11/51 (21%), Positives = 33/51 (64%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
+M+ +R+E++ ER+ +E+++ E++ EE+ + + ++ E+ ++ +D
Sbjct: 141 RMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESD 191
Score = 25.0 bits (54), Expect = 10.0
Identities = 13/52 (25%), Positives = 32/52 (61%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFE 56
++R+EK ++ +EEQ++E E+ E+ ++ + + +E+ ++ E+ D E
Sbjct: 145 ERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDDDKTE 196
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 30.3 bits (69), Expect = 0.11
Identities = 13/35 (37%), Positives = 24/35 (68%)
Query: 6 KRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKK 40
K+ EKEE+E +K +K+ + + K+ EE E K+++
Sbjct: 79 KKNEKEERELRKRAEKEALEQAKKEEELREAKRQQ 113
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 30.4 bits (69), Expect = 0.14
Identities = 15/48 (31%), Positives = 32/48 (66%)
Query: 7 RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
+K+K+E + E+ KKE EK++ + +++K KKK ++ K+ K++ +
Sbjct: 61 KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDD 108
Score = 30.4 bits (69), Expect = 0.14
Identities = 14/42 (33%), Positives = 30/42 (71%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
K+K K K++K +KK+++ K+ + K+ ++ E+K +K+ E+K
Sbjct: 81 KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKL 122
Score = 30.0 bits (68), Expect = 0.21
Identities = 20/63 (31%), Positives = 39/63 (61%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLAT 61
+++K +KE EEK++ K ++KK +K + +++++ KK EKK+ KE D ++T +
Sbjct: 70 EIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSY 129
Query: 62 KAT 64
T
Sbjct: 130 SET 132
Score = 29.3 bits (66), Expect = 0.36
Identities = 13/49 (26%), Positives = 29/49 (59%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
KK+K K++K++ K+++ + +K E ++E+E + K + K+ E
Sbjct: 86 WKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLS 134
Score = 27.3 bits (61), Expect = 1.7
Identities = 14/48 (29%), Positives = 28/48 (58%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
K KK +K+K++ + KK+++K + EKK+ +E E+K + +
Sbjct: 87 KKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLS 134
Score = 27.0 bits (60), Expect = 2.0
Identities = 17/47 (36%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Query: 9 EKEEKERKKEEQKKEV-GEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
E E ++KK+E +E+ KKE EE+++ K KK + KK ++ D
Sbjct: 56 EYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKK 102
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 30.5 bits (69), Expect = 0.14
Identities = 15/50 (30%), Positives = 29/50 (58%)
Query: 3 MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
KK + K+E + +E + K EK + EE E+K+K++ +E K ++ +
Sbjct: 382 TKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431
Score = 28.2 bits (63), Expect = 0.95
Identities = 17/51 (33%), Positives = 29/51 (56%), Gaps = 1/51 (1%)
Query: 3 MKKKRKEKEEKERKKEEQKKEVGEKKEGEEE-EEKKKKKVEEKKNNKENAD 52
+K+ K+ ++ +K E K E +E E + +E+K K+ E +K KE AD
Sbjct: 371 LKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQAD 421
Score = 27.8 bits (62), Expect = 1.3
Identities = 15/51 (29%), Positives = 26/51 (50%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
KK E +++ EE + + E+K +EE EKK+K+ ++ K D
Sbjct: 380 DYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKD 430
Score = 27.5 bits (61), Expect = 1.7
Identities = 10/51 (19%), Positives = 29/51 (56%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
K+ + + E + E + + K+E +++E E++++++ + +EK+ E
Sbjct: 384 KLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 30.6 bits (69), Expect = 0.16
Identities = 13/47 (27%), Positives = 23/47 (48%), Gaps = 3/47 (6%)
Query: 2 KMKKKRKE--KEEKERKKEE-QKKEVGEKKEGEEEEEKKKKKVEEKK 45
++KK R E K RKKE QK+ E++ E +++ + +
Sbjct: 331 EIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARAR 377
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 30.2 bits (68), Expect = 0.18
Identities = 16/48 (33%), Positives = 27/48 (56%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
+K K+ E+ ++ EE++K+ E K + E K K + E +K KE A
Sbjct: 101 EKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEA 148
Score = 27.9 bits (62), Expect = 1.2
Identities = 17/48 (35%), Positives = 26/48 (54%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
K + E E +++ KEE KK+ E+ + + E KKK E KK + A
Sbjct: 132 AKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEA 179
Score = 26.7 bits (59), Expect = 3.2
Identities = 13/42 (30%), Positives = 23/42 (54%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
+ +++ EK K+ EQ + E+K+ + EE K K+ E K
Sbjct: 93 ELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKA 134
Score = 26.3 bits (58), Expect = 4.8
Identities = 14/50 (28%), Positives = 25/50 (50%), Gaps = 2/50 (4%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
K ++R E+ ++ E+ K+ EK+ ++ EE K K+ E K E
Sbjct: 92 KELEQRAAAEKAAKQAEQAAKQAEEKQ--KQAEEAKAKQAAEAKAKAEAE 139
Score = 26.3 bits (58), Expect = 4.9
Identities = 16/48 (33%), Positives = 24/48 (50%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
KK+ E+E K + E KK+ E K+ E E K K + + K +E
Sbjct: 149 KKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKA 196
Score = 26.0 bits (57), Expect = 5.5
Identities = 20/60 (33%), Positives = 26/60 (43%), Gaps = 1/60 (1%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLATKA 63
+ K K E ++K E KK+ E + + E K K K EE K E A A KA
Sbjct: 155 EAKAKAAAEAKKKAAEAKKK-AEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKA 213
Score = 25.6 bits (56), Expect = 8.7
Identities = 18/42 (42%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 5 KKRKEKEEKERKKEEQK-KEVGEKKEGEEEEEKKKKKVEEKK 45
+ K+ EEK+++ EE K K+ E K E E +KK K E KK
Sbjct: 109 QAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKK 150
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 30.3 bits (69), Expect = 0.18
Identities = 12/35 (34%), Positives = 26/35 (74%), Gaps = 3/35 (8%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKK 39
K+K +E+K +K++++KK KK+G++ ++K +K
Sbjct: 383 PKKKREEKKPQKRKKKKK---RKKKGKKRKKKGRK 414
Score = 26.5 bits (59), Expect = 3.7
Identities = 14/43 (32%), Positives = 25/43 (58%), Gaps = 4/43 (9%)
Query: 6 KRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
K K + ++K+EE+K +K ++++ KKK K +KK K
Sbjct: 376 KEKYPKPPKKKREEKKP----QKRKKKKKRKKKGKKRKKKGRK 414
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 30.1 bits (68), Expect = 0.19
Identities = 13/45 (28%), Positives = 25/45 (55%), Gaps = 6/45 (13%)
Query: 8 KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
K+KEE+ER+ E ++ G E+ E+ + K + K+ K ++
Sbjct: 121 KQKEEEERRVER------RRELGLEDPEQLRLKQKAKEEQKAESE 159
Score = 27.0 bits (60), Expect = 2.7
Identities = 8/42 (19%), Positives = 22/42 (52%), Gaps = 3/42 (7%)
Query: 2 KMKKKRKEKEEKER---KKEEQKKEVGEKKEGEEEEEKKKKK 40
K +++R+ + +E E+ + + K+E + E E+ + +
Sbjct: 123 KEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEETRHR 164
>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
Length = 819
Score = 30.4 bits (69), Expect = 0.20
Identities = 13/40 (32%), Positives = 20/40 (50%)
Query: 10 KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
KE EE +++V E+ E E +++ KK KK K
Sbjct: 461 KEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKI 500
Score = 26.2 bits (58), Expect = 5.0
Identities = 9/40 (22%), Positives = 18/40 (45%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEK 44
K E +E+ ++ + E +EE +K+ K +K
Sbjct: 460 VKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKK 499
Score = 25.8 bits (57), Expect = 7.5
Identities = 11/43 (25%), Positives = 22/43 (51%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEK 44
K +E +E+ E+ ++ ++E ++E KK KK+ E
Sbjct: 461 KEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEV 503
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 30.0 bits (68), Expect = 0.25
Identities = 20/43 (46%), Positives = 28/43 (65%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE 43
M KRK K + ER E+ EV EKK+ +++EEKKKK++E
Sbjct: 305 MISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIER 347
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 29.6 bits (67), Expect = 0.25
Identities = 10/37 (27%), Positives = 18/37 (48%)
Query: 9 EKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
E ++ + +E +E +EEE + +K E KK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKK 194
Score = 28.5 bits (64), Expect = 0.65
Identities = 12/35 (34%), Positives = 17/35 (48%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKK 39
K E+ E + + E EGE++E KKKK
Sbjct: 162 AKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 26.5 bits (59), Expect = 3.3
Identities = 9/36 (25%), Positives = 17/36 (47%)
Query: 10 KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
K+ K ++ ++ EEE + +KK +KK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 30.0 bits (67), Expect = 0.28
Identities = 15/43 (34%), Positives = 30/43 (69%)
Query: 9 EKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
E+EE+E ++EE+++E E++E EEEE ++ +E + ++ A
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQA 906
Score = 29.6 bits (66), Expect = 0.39
Identities = 13/31 (41%), Positives = 25/31 (80%)
Query: 9 EKEEKERKKEEQKKEVGEKKEGEEEEEKKKK 39
+ EE+E ++EE+++E E++E EEEEE+ ++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 29.2 bits (65), Expect = 0.52
Identities = 13/29 (44%), Positives = 24/29 (82%)
Query: 8 KEKEEKERKKEEQKKEVGEKKEGEEEEEK 36
+E+EE+E ++EE+++E E++E EEE E+
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 26.9 bits (59), Expect = 3.6
Identities = 12/31 (38%), Positives = 24/31 (77%)
Query: 12 EKERKKEEQKKEVGEKKEGEEEEEKKKKKVE 42
+ E ++EE+++E E++E EEEEE++++ E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 26.5 bits (58), Expect = 4.4
Identities = 12/31 (38%), Positives = 25/31 (80%)
Query: 19 EQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
+ ++E E++E EEEEE+++++ EE++ N+E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 26.1 bits (57), Expect = 5.8
Identities = 11/31 (35%), Positives = 25/31 (80%)
Query: 14 ERKKEEQKKEVGEKKEGEEEEEKKKKKVEEK 44
+ ++EE+++E E++E EEEEE+++++ E+
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 25.3 bits (55), Expect = 9.5
Identities = 13/47 (27%), Positives = 29/47 (61%)
Query: 10 KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFE 56
K++++ + G+ +E EEEEE+++++ EE++ +E + N E
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 29.5 bits (66), Expect = 0.34
Identities = 14/54 (25%), Positives = 31/54 (57%), Gaps = 6/54 (11%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKK------KVEEKKNNKENAD 52
+++K+KEEK ++KE +K + +K+ + + ++ K EKK+ K + +
Sbjct: 19 ERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVE 72
Score = 29.5 bits (66), Expect = 0.45
Identities = 15/55 (27%), Positives = 28/55 (50%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFE 56
K ++K KEKE K+ K +++ + + + + KK E+K ++ D N E
Sbjct: 23 KKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVEDENPE 77
Score = 26.4 bits (58), Expect = 4.4
Identities = 14/27 (51%), Positives = 16/27 (59%)
Query: 23 EVGEKKEGEEEEEKKKKKVEEKKNNKE 49
EKK EEE ++KKK EEK KE
Sbjct: 6 SEAEKKILTEEELERKKKKEEKAKEKE 32
Score = 25.6 bits (56), Expect = 7.4
Identities = 14/62 (22%), Positives = 31/62 (50%)
Query: 3 MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLATK 62
+ E E+K +EE +++ ++++ +E+E KK K +++ K A + T K
Sbjct: 1 GSRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPK 60
Query: 63 AT 64
+
Sbjct: 61 KS 62
>gnl|CDD|185768 cd09245, BRO1_UmRIM23-like, Protein-interacting, Bro1-like domain
of Ustilago maydis Rim23 (PalC), and related domains.
This family contains the Bro1-like domain of Ustilago
maydis Rim23 (also known as PalC), and related proteins.
It belongs to the BRO1_Alix_like superfamily which
includes the Bro1-like domains of mammalian Alix
(apoptosis-linked gene-2 interacting protein X),
His-Domain type N23 protein tyrosine phosphatase
(HD-PTP, also known as PTPN23), RhoA-binding proteins
Rhophilin-1 and -2, Brox, Bro1 and Rim20 (also known as
PalA) from Saccharomyces cerevisiae, and related
domains. Alix, HD-PTP, Brox, Bro1, Rim20, and Rim23
interact with the ESCRT (Endosomal Sorting Complexes
Required for Transport) system. Rim20 and Rim23
participate in the response to the external pH via the
Rim101 pathway. Through its Bro1-like domain, Rim23
allows the interaction between the endosomal and plasma
membrane complexes. Bro1-like domains are
boomerang-shape, and part of the domain is a
tetratricopeptide repeat (TPR)-like structure.
Intermediates in the Rim101 pathway may play roles in
the pathogenesis of fungal corneal infection during
Candida albicans keratitis. This family lacks the
V-shaped (V) domain found in many members of the
BRO1_Alix_like superfamily.
Length = 413
Score = 29.3 bits (66), Expect = 0.39
Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 4/54 (7%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEV 57
K K K+ + K+E++K E G EEE + ++ KK K N +F+
Sbjct: 336 ASKAKLKKSWKEKREDRKVEKGAGV----EEELRTLEMLLKKYKKMNDTVSFQP 385
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 29.0 bits (65), Expect = 0.40
Identities = 10/37 (27%), Positives = 18/37 (48%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKK 40
+ + K E + KK+ + + +KE + E KK K
Sbjct: 112 EDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 29.1 bits (66), Expect = 0.41
Identities = 12/37 (32%), Positives = 25/37 (67%)
Query: 16 KKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
K++++KK+ ++ E EE++KK EE+K +++ D
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVD 118
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 29.3 bits (66), Expect = 0.41
Identities = 15/64 (23%), Positives = 27/64 (42%), Gaps = 2/64 (3%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLAT 61
+ + K K+ KKE++ E KK+ E++++KKK+K E K +
Sbjct: 44 EEENKVATTSTKKDKKEDKNNE--SKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKK 101
Query: 62 KATI 65
Sbjct: 102 TKKK 105
Score = 27.4 bits (61), Expect = 1.7
Identities = 14/56 (25%), Positives = 22/56 (39%), Gaps = 1/56 (1%)
Query: 10 KEEKERKKEEQKKEVGEKK-EGEEEEEKKKKKVEEKKNNKENADGNFEVTLATKAT 64
+E K +KK + E +KK +K ++KK K+ E L K
Sbjct: 41 TFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTP 96
Score = 27.4 bits (61), Expect = 1.7
Identities = 15/49 (30%), Positives = 26/49 (53%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
K K + K++ E+KK+++K++ K EGE + K K +K K
Sbjct: 59 KEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPP 107
Score = 25.8 bits (57), Expect = 5.8
Identities = 14/48 (29%), Positives = 30/48 (62%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
K K ++K+K++KE+K+ + + E + ++ +K KKK + K N++
Sbjct: 67 KKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114
Score = 25.8 bits (57), Expect = 6.5
Identities = 17/49 (34%), Positives = 30/49 (61%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
KK+ EK++K++K++++ K GE K G + +K KK ++ K N D
Sbjct: 66 SKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114
Score = 25.8 bits (57), Expect = 7.0
Identities = 16/51 (31%), Positives = 30/51 (58%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
K +KK+K+K+EK+ K E + ++G K + ++ KKK + + +NA
Sbjct: 68 KKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNA 118
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 29.3 bits (66), Expect = 0.46
Identities = 19/66 (28%), Positives = 32/66 (48%), Gaps = 4/66 (6%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKK----EGEEEEEKKKKKVEEKKNNKENADGNFEV 57
+ KK E EE + K+ E K++G+ K + EE +K+ K+++E+ A E
Sbjct: 38 ERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKALEA 97
Query: 58 TLATKA 63
L K
Sbjct: 98 ELQDKL 103
>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
Length = 567
Score = 29.0 bits (66), Expect = 0.47
Identities = 11/28 (39%), Positives = 19/28 (67%), Gaps = 1/28 (3%)
Query: 18 EEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
EEQ++ + E+ E EE+++KK E+K
Sbjct: 64 EEQRERL-EELAPELLEEEEEKKEEKKG 90
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 28.9 bits (65), Expect = 0.63
Identities = 16/64 (25%), Positives = 25/64 (39%), Gaps = 10/64 (15%)
Query: 3 MKKKRKEKEEKERKKEEQKKEV----------GEKKEGEEEEEKKKKKVEEKKNNKENAD 52
+ ++EKE + + E KE EK E+ E KK + KK K N +
Sbjct: 1521 ENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTE 1580
Query: 53 GNFE 56
+
Sbjct: 1581 AELD 1584
Score = 25.4 bits (56), Expect = 9.7
Identities = 11/25 (44%), Positives = 19/25 (76%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGE 26
K KKK K+KEE+ +++E+ + E+ E
Sbjct: 747 KKKKKEKKKEEEYKREEKARIEIAE 771
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 28.8 bits (65), Expect = 0.65
Identities = 15/54 (27%), Positives = 31/54 (57%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
+K K K +E+++R +E ++ + E+KE + E+ +KK+E+ + E N
Sbjct: 318 LKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKLKSN 371
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 28.9 bits (64), Expect = 0.73
Identities = 14/35 (40%), Positives = 26/35 (74%)
Query: 9 EKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE 43
E++ K+ ++ ++ +V E KEG+E ++KK KKV+E
Sbjct: 220 EEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKE 254
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called
The RES complex, pre-mRNA retention and splicing
complex. Subunits of this complex are not essential for
viability of yeasts but they are required for efficient
splicing in vitro and in vivo. Furthermore,
inactivation of this complex causes pre-mRNA leakage
from the nucleus. Bud13 contains a unique,
phylogenetically conserved C-terminal region of unknown
function.
Length = 141
Score = 28.0 bits (63), Expect = 0.73
Identities = 19/56 (33%), Positives = 35/56 (62%), Gaps = 4/56 (7%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFE 56
K ++K +EKEEKERK+E++K E +G ++E+++K++EE + K +
Sbjct: 16 EKREEKEREKEEKERKEEKEK----EWGKGLVQKEEREKRLEELEKAKNKPLARYA 67
>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs, all of which are expressed
during embryonic and early development in the nervous
system but with different localization and timing.
srGAPs contain an N-terminal F-BAR domain, a Rho GAP
domain, and a C-terminal SH3 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 241
Score = 28.5 bits (64), Expect = 0.76
Identities = 16/45 (35%), Positives = 28/45 (62%), Gaps = 4/45 (8%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEK----KEGEEEEEKKKKKVEEKK 45
+K KE E++E K+E+ ++ E+ K+ E+E EK++ K E K
Sbjct: 152 RKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSEAK 196
Score = 25.4 bits (56), Expect = 8.9
Identities = 13/44 (29%), Positives = 24/44 (54%), Gaps = 3/44 (6%)
Query: 8 KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVE---EKKNNK 48
K E K ++ E+Q+++ + E + E + KK+E EK+ K
Sbjct: 148 KSAERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 28.1 bits (63), Expect = 0.89
Identities = 12/38 (31%), Positives = 19/38 (50%), Gaps = 2/38 (5%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEE--EEKKKK 39
+K K+KEE ++ ++ KK + E KKKK
Sbjct: 159 RKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 26.2 bits (58), Expect = 4.7
Identities = 10/39 (25%), Positives = 19/39 (48%), Gaps = 2/39 (5%)
Query: 7 RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
RK+ ++KE E++KE ++ +K +KK
Sbjct: 159 RKDAKQKEEFAAEERKE--ALAAAAKKSATPQKVETKKK 195
Score = 25.4 bits (56), Expect = 8.8
Identities = 8/37 (21%), Positives = 18/37 (48%)
Query: 12 EKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
E++ K++++ E+KE KK ++ + K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKK 194
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 28.5 bits (64), Expect = 0.90
Identities = 11/50 (22%), Positives = 25/50 (50%), Gaps = 3/50 (6%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEE---EKKKKKVEEKKNNK 48
+K+KR+E K ++ + E E ++ E++K+++ +K K
Sbjct: 193 ALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEK 242
>gnl|CDD|221268 pfam11853, DUF3373, Protein of unknown function (DUF3373). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria. Proteins in this
family are typically between 472 to 574 amino acids in
length.
Length = 485
Score = 28.3 bits (63), Expect = 0.91
Identities = 18/75 (24%), Positives = 28/75 (37%), Gaps = 4/75 (5%)
Query: 12 EKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE---NADGNFEV-TLATKATIYY 67
+ +K E KKE+ E K ++ +K+ K E+K D F V +L K
Sbjct: 28 DLLQKIEALKKELAELKAQLKDLQKRVDKTEKKSAGDRLTFGGDFRFRVDSLNYKTAALG 87
Query: 68 QGLVEWKPPAIYKSS 82
Q + K
Sbjct: 88 QLMGFNPMVPGQKYK 102
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 28.1 bits (63), Expect = 1.0
Identities = 9/46 (19%), Positives = 27/46 (58%), Gaps = 1/46 (2%)
Query: 7 RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
+ E+EE+E+ ++E+++E + E+ + K+ ++ +E+ +
Sbjct: 121 QLEREEEEK-RDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165
>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
protein 2. THO and TREX form a eukaryotic complex
which functions in messenger ribonucleoprotein
metabolism and plays a role in preventing the
transcription-associated genetic instability. Tho2,
along with four other subunits forms THO.
Length = 296
Score = 28.1 bits (63), Expect = 1.1
Identities = 17/47 (36%), Positives = 26/47 (55%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNN 47
K+K+EK+R K KK E K+ E EK KK++ E+K++
Sbjct: 45 SSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEEKSS 91
>gnl|CDD|199838 cd10280, PQQ_mGDH, Membrane-bound PQQ-dependent glucose
dehydrogenase. This bacterial subfamily of enzymes
belongs to the dehydrogenase family with
pyrroloquinoline quinone (PQQ) as cofactor, and is the
only subfamily that is bound to the membrane. Glucose
dehydrogenase converts D-glucose to
D-glucono-1,5-lactone in a reaction that is coupled with
the respiratory chain in the periplasmic oxidation of
sugars and alcohols in gram-negative bacteria.
Ubiquinone functions as the electron acceptor. The
alignment model contains an 8-bladed beta-propeller.
Length = 616
Score = 28.3 bits (64), Expect = 1.1
Identities = 11/30 (36%), Positives = 17/30 (56%)
Query: 89 YFPFDEQTCVLKFGSWTYDGFKVPGSVNGA 118
PFD+ C ++F S Y+G P S++G
Sbjct: 378 ATPFDQLACRIQFRSLRYEGLFTPPSLDGT 407
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 28.1 bits (64), Expect = 1.1
Identities = 13/52 (25%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Query: 4 KKKRKEKEEKERKKEEQK---KEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
+++R+ + E E + E+ KE+G+ K E+ E +V+E K + +
Sbjct: 35 EERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKALE 86
Score = 25.8 bits (58), Expect = 5.5
Identities = 11/43 (25%), Positives = 25/43 (58%), Gaps = 2/43 (4%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEK-KEGEEEEEKKKKKVEE 43
++ + +++ E+ E E KE+ E+ K E E ++ + ++EE
Sbjct: 57 EIGQAKRKGEDAEALIAEV-KELKEEIKALEAELDELEAELEE 98
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 28.3 bits (63), Expect = 1.1
Identities = 11/43 (25%), Positives = 21/43 (48%)
Query: 10 KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
K E+ R+K ++ GE ++ + KK KK K+ ++
Sbjct: 1174 KAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASE 1216
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 27.9 bits (62), Expect = 1.2
Identities = 20/45 (44%), Positives = 34/45 (75%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
+ KKK+KEK ++E K + K+E EK+ +E+E++K+KKVEE ++
Sbjct: 115 EEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRD 159
Score = 27.5 bits (61), Expect = 1.9
Identities = 15/51 (29%), Positives = 31/51 (60%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
+ KE+E+++ + +E+KK+ EK + E ++ K K++ +EK+ KE
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEK 150
Score = 27.2 bits (60), Expect = 2.1
Identities = 20/46 (43%), Positives = 33/46 (71%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
K+K + KEEK++KKE+ K+E ++K EE +EK+ K +EK+ K+
Sbjct: 108 KEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153
Score = 27.2 bits (60), Expect = 2.5
Identities = 13/48 (27%), Positives = 33/48 (68%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
++K+++K+K+EK +++ + +K E KE +EK+K+K ++ + ++
Sbjct: 112 QVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRD 159
Score = 27.2 bits (60), Expect = 2.5
Identities = 17/41 (41%), Positives = 27/41 (65%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
K + +E+E++KE+ K+E +KKE +EE K +K EE K
Sbjct: 98 PKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAK 138
Score = 25.6 bits (56), Expect = 7.3
Identities = 17/48 (35%), Positives = 33/48 (68%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
K +KK+K+++ KE K+ + KE ++K +E+EK+K+K E+ ++E
Sbjct: 114 KEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDRE 161
Score = 25.6 bits (56), Expect = 8.7
Identities = 16/45 (35%), Positives = 23/45 (51%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
KKR+ K R K+ KK+ KK+ EEEK+++ E K
Sbjct: 164 KKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKP 208
Score = 25.6 bits (56), Expect = 8.9
Identities = 13/46 (28%), Positives = 23/46 (50%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
K+K KEK+ +E + E++K+ + ++ KKK KK
Sbjct: 146 KEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPP 191
>gnl|CDD|220428 pfam09826, Beta_propel, Beta propeller domain. Members of this
family comprise secreted bacterial proteins containing
C-terminal beta-propeller domain distantly related to
WD-40 repeats.
Length = 521
Score = 28.0 bits (63), Expect = 1.2
Identities = 19/80 (23%), Positives = 27/80 (33%), Gaps = 11/80 (13%)
Query: 48 KENADGNFEVTLATKATIY------YQGLVEWKPPAIYKSSCEIDVEYFPFDEQTCVLKF 101
N L + +T+Y Y W P + E E+T + KF
Sbjct: 195 TSGEIVNSTSLLGSGSTVYMSENNLYVASSHWYPERALPLRLRAEAEPQEETEKTEIYKF 254
Query: 102 ----GSWTYDGF-KVPGSVN 116
G TY G +VPG +
Sbjct: 255 SLDDGKVTYVGSGEVPGRLL 274
>gnl|CDD|220615 pfam10175, MPP6, M-phase phosphoprotein 6. This is a family of
M-phase phosphoprotein 6s which is necessary for
generation of the 3' end of the 5.8S rRNA precursor. It
preferentially binds to poly(C) and poly(U).
Length = 104
Score = 27.1 bits (60), Expect = 1.3
Identities = 13/45 (28%), Positives = 26/45 (57%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
MK ++ K+KEEK+ EE++ + E+ + + +E + +EE
Sbjct: 8 MKFMQRTKDKEEKKADDEEERALISEEIDLKMLKETENFIIEEPS 52
>gnl|CDD|218652 pfam05602, CLPTM1, Cleft lip and palate transmembrane protein 1
(CLPTM1). This family consists of several eukaryotic
cleft lip and palate transmembrane protein 1 sequences.
Cleft lip with or without cleft palate is a common birth
defect that is genetically complex. The nonsyndromic
forms have been studied genetically using linkage and
candidate-gene association studies with only partial
success in defining the loci responsible for orofacial
clefting. CLPTM1 encodes a transmembrane protein and has
strong homology to two Caenorhabditis elegans genes,
suggesting that CLPTM1 may belong to a new gene family.
This family also contains the human cisplatin resistance
related protein CRR9p which is associated with
CDDP-induced apoptosis.
Length = 437
Score = 28.0 bits (63), Expect = 1.3
Identities = 17/82 (20%), Positives = 33/82 (40%), Gaps = 14/82 (17%)
Query: 13 KERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLATKATIYYQGLVE 72
+KK ++K +G K E EE EE+K ++ ++ N +++ ++ +
Sbjct: 148 LPKKKVKKKNLLGGKSEKEEPEEEKTPAPDKIVSHWHP---NLTLSVVVDFGVFSLNQI- 203
Query: 73 WKPPAIYKSSCEIDVEYFPFDE 94
PPAI Y +
Sbjct: 204 --PPAI--------ARYLRVEP 215
>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
Provisional.
Length = 693
Score = 27.7 bits (63), Expect = 1.4
Identities = 14/36 (38%), Positives = 24/36 (66%)
Query: 8 KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE 43
+K+E+E + K+EV E EEEEE+ +K+++E
Sbjct: 260 TDKQEREAALDAIKEEVLEALAAEEEEEEDEKEIKE 295
>gnl|CDD|234052 TIGR02900, spore_V_B, stage V sporulation protein B. SpoVB is the
stage V sporulation protein B of the bacterial endopore
formation program in Bacillus subtilis and various other
Firmcutes. It is nearly universal among
endospore-formers. Paralogs with rather high sequence
similarity to SpoVB exist, including YkvU in B. subtilis
and a number of proteins in the genus Clostridium.
Member sequences for the seed alignment were chosen to
select those proteins, no more than one to a genome,
closest to B. subtilis SpoVB in a neighbor joining tree
[Cellular processes, Sporulation and germination].
Length = 488
Score = 27.6 bits (62), Expect = 1.4
Identities = 7/13 (53%), Positives = 9/13 (69%)
Query: 66 YYQGLVEWKPPAI 78
Y+QG+ KPPA
Sbjct: 138 YFQGISNMKPPAY 150
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 27.4 bits (61), Expect = 1.4
Identities = 21/47 (44%), Positives = 34/47 (72%), Gaps = 2/47 (4%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
+K E EE+E+K++++KKEV KKE +E+++KK+K VE K + K
Sbjct: 141 AKVEKEAEVEEEEKKEKKKKKEV--KKEKKEKKDKKEKMVEPKGSKK 185
Score = 27.0 bits (60), Expect = 2.1
Identities = 15/36 (41%), Positives = 28/36 (77%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKK 39
KKK++ K+EK+ KK++++K V K +++++KKKK
Sbjct: 158 KKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
>gnl|CDD|223095 COG0016, PheS, Phenylalanyl-tRNA synthetase alpha subunit
[Translation, ribosomal structure and biogenesis].
Length = 335
Score = 27.6 bits (62), Expect = 1.5
Identities = 19/68 (27%), Positives = 29/68 (42%), Gaps = 5/68 (7%)
Query: 3 MKKKRKEKEEKERKKEEQKKEVGE-KKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLAT 61
+KK K +ERK E + E KKE E+ + ++E + A +VTL
Sbjct: 45 LKKLGKLSPLEERK--EVGALINELKKEVEDAITELTPELEAAGLWERLAFEKIDVTLPG 102
Query: 62 KATIYYQG 69
+ Y G
Sbjct: 103 RR--IYPG 108
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 27.8 bits (62), Expect = 1.5
Identities = 9/46 (19%), Positives = 21/46 (45%), Gaps = 2/46 (4%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
++ K + + + EE K E+ G + E+ K+ +E ++
Sbjct: 895 LQNHHKTEALSQNNKYNEE--KSAQERMPGADSPEELMKRAKEYQD 938
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 27.4 bits (61), Expect = 1.6
Identities = 12/45 (26%), Positives = 30/45 (66%), Gaps = 7/45 (15%)
Query: 3 MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNN 47
+KK +++++++E ++ +Q+K+ ++EEE++K+K EE
Sbjct: 189 LKKLKQQQQKREEERRKQRKK-------QQEEEERKQKAEEAWQK 226
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 27.6 bits (61), Expect = 1.7
Identities = 15/46 (32%), Positives = 32/46 (69%)
Query: 8 KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADG 53
+ EE+E ++EE+++E E +E E E+E+++++VE ++E +G
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEG 488
>gnl|CDD|223771 COG0699, COG0699, Predicted GTPases (dynamin-related) [General
function prediction only].
Length = 546
Score = 27.6 bits (61), Expect = 1.8
Identities = 10/43 (23%), Positives = 22/43 (51%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEK 44
+K KRK +K+R K +++ ++ EE + + + E+
Sbjct: 490 LLKNKRKRLAQKQRLKRLYLEQLEDELLRTAEEILELRLLLEQ 532
>gnl|CDD|131369 TIGR02316, propion_prpE, propionate--CoA ligase. This family
contains one of three readily separable clades of
proteins in the group of acetate and propionate--CoA
ligases. Characterized members of this family act on
propionate. From propionyl-CoA, there is a cyclic
degradation pathway: it is ligated by PrpC to the TCA
cycle intermediate oxaloacetate, acted upon further by
PrpD and an aconitase, then cleaved by PrpB to pyruvate
and the TCA cycle intermediate succinate.
Length = 628
Score = 27.6 bits (61), Expect = 1.9
Identities = 10/26 (38%), Positives = 14/26 (53%)
Query: 59 LATKATIYYQGLVEWKPPAIYKSSCE 84
LA AT+ Y+GL P ++ S E
Sbjct: 298 LAGAATVLYEGLPTNPDPGVWWSIVE 323
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 27.6 bits (62), Expect = 1.9
Identities = 11/54 (20%), Positives = 26/54 (48%), Gaps = 7/54 (12%)
Query: 4 KKKRKEKEEKERKKE------EQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
K + + E++++K E E ++ E+++ E + KK E + ++A
Sbjct: 435 KAEIRAIEQEKKKAEEAKARFEARQARLEREK-AAREARHKKAAEARAAKDKDA 487
Score = 26.4 bits (59), Expect = 4.3
Identities = 8/48 (16%), Positives = 19/48 (39%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
++ R E+E+ R+ +K + ++ +V+ KK
Sbjct: 458 RQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQP 505
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 27.4 bits (60), Expect = 1.9
Identities = 19/49 (38%), Positives = 33/49 (67%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
+K ++ +KE EE ++K EE KK+ EKK+ +++++KK EE + KE
Sbjct: 1730 IKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKE 1778
Score = 27.0 bits (59), Expect = 3.1
Identities = 18/50 (36%), Positives = 25/50 (50%), Gaps = 2/50 (4%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEE--EKKKKKVEEKKNNKE 49
+ K E E + E+K E EKK+ E ++ + KKK EEKK E
Sbjct: 1346 EAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADE 1395
Score = 27.0 bits (59), Expect = 3.4
Identities = 18/51 (35%), Positives = 33/51 (64%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
K K++ K+K + +KK E+KK+ E K+ EE++KK ++++ K+ AD
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAD 1421
Score = 26.6 bits (58), Expect = 4.0
Identities = 20/49 (40%), Positives = 26/49 (53%), Gaps = 3/49 (6%)
Query: 6 KRKEKEEKERKKEEQKKEVGEKKEGEE---EEEKKKKKVEEKKNNKENA 51
K+ E+E K + EE KK +KK+ EE EE +KK E K E A
Sbjct: 1653 KKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEA 1701
Score = 26.6 bits (58), Expect = 4.1
Identities = 21/53 (39%), Positives = 32/53 (60%), Gaps = 4/53 (7%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEE----EEEKKKKKVEEKKNNKEN 50
K ++ +K +E++++ E KKE E K+ EE E E+KKK E KK +EN
Sbjct: 1676 KAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEEN 1728
Score = 25.9 bits (56), Expect = 6.7
Identities = 16/52 (30%), Positives = 29/52 (55%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
K K + +K ++ +KK E+ K+ E K+ EE +KK + ++ K+ AD
Sbjct: 1462 AKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKAD 1513
Score = 25.9 bits (56), Expect = 6.7
Identities = 14/48 (29%), Positives = 21/48 (43%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
K K+K E+ +K E K E E E ++K + EKK +
Sbjct: 1329 KKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA 1376
Score = 25.9 bits (56), Expect = 7.0
Identities = 19/52 (36%), Positives = 28/52 (53%), Gaps = 4/52 (7%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEE----EEKKKKKVEEKKNNKE 49
+ + ++ E E+KKEE KK+ K+ EE +E KKK E+KK E
Sbjct: 1358 EAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE 1409
Score = 25.9 bits (56), Expect = 7.3
Identities = 13/44 (29%), Positives = 27/44 (61%)
Query: 9 EKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
+ E+ +K EE KK+ E K+ EE+E+K + ++++ + A+
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705
Score = 25.5 bits (55), Expect = 8.5
Identities = 17/52 (32%), Positives = 29/52 (55%), Gaps = 3/52 (5%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEE---EEKKKKKVEEKKNNKE 49
+K ++ K+ EE ++K EE KK ++K+ E E ++ KK EE K +
Sbjct: 1661 IKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEA 1712
>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
Toc86/159, G and M domains. The long precursor of the
86K protein originally described is proposed to have
three domains. The N-terminal A-domain is acidic,
repetitive, weakly conserved, readily removed by
proteolysis during chloroplast isolation, and not
required for protein translocation. The other domains
are designated G (GTPase) and M (membrane anchor); this
family includes most of the G domain and all of M
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 763
Score = 27.6 bits (61), Expect = 1.9
Identities = 14/36 (38%), Positives = 23/36 (63%), Gaps = 2/36 (5%)
Query: 2 KMKKKRKEKEEKERKKEEQK--KEVGEKKEGEEEEE 35
K+ +K++ +EE +R K +K KE+GE +G EE
Sbjct: 441 KLLQKKQWREELKRMKMMKKFGKEIGELPDGYSEEV 476
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 27.4 bits (62), Expect = 2.0
Identities = 13/51 (25%), Positives = 27/51 (52%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
+K + + KEE + + E +KE+ E++ ++ EK+ + EE + K
Sbjct: 54 IKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLEL 104
Score = 27.0 bits (61), Expect = 2.1
Identities = 17/50 (34%), Positives = 33/50 (66%), Gaps = 1/50 (2%)
Query: 2 KMKKKRKEKEEK-ERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
K++K+ +KEE +RK E +K E ++ E+E E+K++++E+K+ E
Sbjct: 86 KLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEE 135
Score = 27.0 bits (61), Expect = 2.6
Identities = 13/42 (30%), Positives = 27/42 (64%), Gaps = 2/42 (4%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
++ E EK ++ E+K++ E+K+ +E EKK++++EE
Sbjct: 99 DRKLELLEKREEELEKKEKELEQKQ--QELEKKEEELEELIE 138
Score = 26.7 bits (60), Expect = 3.5
Identities = 13/42 (30%), Positives = 32/42 (76%), Gaps = 2/42 (4%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE 43
++K+ +E E+KE++ E++++E+ +K+ EE EE +++++E
Sbjct: 104 LLEKREEELEKKEKELEQKQQELEKKE--EELEELIEEQLQE 143
Score = 26.3 bits (59), Expect = 4.7
Identities = 16/52 (30%), Positives = 26/52 (50%), Gaps = 5/52 (9%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKK-----EGEEEEEKKKKKVEEKKNNKEN 50
K ++ +EE +R EE KKE K E +EE K + + E++ + N
Sbjct: 31 AKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRN 82
Score = 25.5 bits (57), Expect = 9.7
Identities = 18/59 (30%), Positives = 32/59 (54%), Gaps = 12/59 (20%)
Query: 2 KMKKKRK--EKEEKERKKE--------EQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
++ K R EKE +ER+ E QK+E ++K E EK+++++E+K+ E
Sbjct: 65 EIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRK--LELLEKREEELEKKEKELEQ 121
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 27.0 bits (60), Expect = 2.1
Identities = 15/40 (37%), Positives = 26/40 (65%)
Query: 3 MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVE 42
++KEKE+KE+ K ++K+ + EK GEE +K K++
Sbjct: 159 ELLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELL 198
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 27.3 bits (61), Expect = 2.1
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 3/47 (6%)
Query: 2 KMKKKRKEKEEKERKKE---EQKKEVGEKKEGEEEEEKKKKKVEEKK 45
+ K KE ++K+R+ E E + EKK +EE KKKK E K
Sbjct: 222 PLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKG 268
Score = 25.4 bits (56), Expect = 9.9
Identities = 13/37 (35%), Positives = 20/37 (54%)
Query: 8 KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEK 44
+ E + KK + K+E+ +KK E + K KKV K
Sbjct: 242 EAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAK 278
>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
Length = 472
Score = 27.3 bits (61), Expect = 2.2
Identities = 15/38 (39%), Positives = 22/38 (57%), Gaps = 8/38 (21%)
Query: 8 KEKEEKERKKEEQKKEVGEKKEGEEEEEK-KKKKVEEK 44
KE+EEKE++K+ KKEG+EE + K E+
Sbjct: 391 KEREEKEKEKKA-------KKEGKEERRIHFQNKSIEE 421
Score = 26.9 bits (60), Expect = 2.5
Identities = 11/31 (35%), Positives = 21/31 (67%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEE 35
K+R+EKE++++ K+E K+E + + EE
Sbjct: 391 KEREEKEKEKKAKKEGKEERRIHFQNKSIEE 421
Score = 26.1 bits (58), Expect = 5.5
Identities = 11/33 (33%), Positives = 18/33 (54%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEE 33
+ ++K KEK+ K+ KEE++ K E E
Sbjct: 391 KEREEKEKEKKAKKEGKEERRIHFQNKSIEEAE 423
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 27.2 bits (61), Expect = 2.3
Identities = 15/53 (28%), Positives = 29/53 (54%), Gaps = 1/53 (1%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGE-KKEGEEEEEKKKKKVEEKKNNKENAD 52
+ +K +E EE + ++ E KE+G K GE++ E+ +V+E K + +
Sbjct: 36 EERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEKLKELE 88
Score = 25.6 bits (57), Expect = 6.6
Identities = 7/39 (17%), Positives = 17/39 (43%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKK 40
++ + K E+ + + KE+ EK + E + +
Sbjct: 58 EIGRALKRGEDDAEELIAEVKELKEKLKELEAALDELEA 96
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 27.1 bits (60), Expect = 2.3
Identities = 12/46 (26%), Positives = 29/46 (63%)
Query: 6 KRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
++K+ E+ER K+ +K+ + +++ ++ EE K+ ++K +E A
Sbjct: 93 QQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAA 138
Score = 26.7 bits (59), Expect = 3.5
Identities = 9/42 (21%), Positives = 21/42 (50%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
++++K+ EE ++ ++K+ E K K + E K+
Sbjct: 114 QEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKR 155
>gnl|CDD|182896 PRK11007, PRK11007, PTS system trehalose(maltose)-specific
transporter subunits IIBC; Provisional.
Length = 473
Score = 27.3 bits (61), Expect = 2.3
Identities = 12/44 (27%), Positives = 21/44 (47%), Gaps = 4/44 (9%)
Query: 32 EEEEKKKKKVEE---KKNNKENADGNFEVTLATKATIYYQGLVE 72
+ K++E+ K NA G F+V + T+ YY+ L+
Sbjct: 39 DPANADPKEIEQLPMVKGCFTNA-GQFQVVIGTEVGDYYKALIA 81
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 27.0 bits (60), Expect = 2.3
Identities = 15/53 (28%), Positives = 27/53 (50%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFE 56
KE E R+ ++ + + E +GEE+ KKK K + ++ + + NFE
Sbjct: 151 DSDNKEAPETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRKPESTLEKNFE 203
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 27.0 bits (60), Expect = 2.4
Identities = 17/39 (43%), Positives = 30/39 (76%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKK 40
+ +KK +EK E ERK+EE+ +E EKK+ EEE ++++++
Sbjct: 29 EERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQ 67
Score = 26.2 bits (58), Expect = 4.1
Identities = 21/41 (51%), Positives = 31/41 (75%)
Query: 9 EKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
E+E +ERKK E+K+E K+E E EEE++KKK EE++ +E
Sbjct: 25 EEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKERE 65
>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 145
Score = 26.7 bits (59), Expect = 2.4
Identities = 5/44 (11%), Positives = 14/44 (31%), Gaps = 1/44 (2%)
Query: 2 KMKKKRKEKEEKERKKEEQKKE-VGEKKEGEEEEEKKKKKVEEK 44
M +K ++ + ++ E+ +KK +
Sbjct: 101 YMYRKNDAHTHASILLSSNEQNSTEALQLRAKKREEHRKKWYQN 144
>gnl|CDD|129200 TIGR00092, TIGR00092, GTP-binding protein YchF. This predicted
GTP-binding protein is found in a single copy in every
complete bacterial genome, and is found in Eukaryotes. A
more distantly related protein, separated from this
model, is found in the archaea. It is known to bind GTP
and double-stranded nucleic acid. It is suggested to
belong to a nucleoprotein complex and act as a
translation factor [Unknown function, General].
Length = 368
Score = 27.0 bits (60), Expect = 2.4
Identities = 13/71 (18%), Positives = 22/71 (30%), Gaps = 7/71 (9%)
Query: 9 EKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK-------NNKENADGNFEVTLAT 61
EK + ++ E G+ K+ E + + +KE + L T
Sbjct: 139 FLVEKRIGRSKKSAEGGKDKKEELLLLEIILPLLNGGQMARHVDLSKEELILIKSLNLLT 198
Query: 62 KATIYYQGLVE 72
K I V
Sbjct: 199 KKPIILIANVS 209
>gnl|CDD|221283 pfam11875, DUF3395, Domain of unknown function (DUF3395). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 147 to 176 amino acids in length. This domain
is found associated with pfam00226.
Length = 144
Score = 26.5 bits (59), Expect = 2.5
Identities = 15/47 (31%), Positives = 27/47 (57%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
K ++K+E+ER++E+ K E+ +KK+ E V E+K +E
Sbjct: 2 FNKAQKKKEQERQREKNKTEIAKKKQEAESAVSLMGDVVERKLTREE 48
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 27.3 bits (61), Expect = 2.5
Identities = 15/66 (22%), Positives = 28/66 (42%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLA 60
++K K K K K RK + +KKE +KK ++ +K K+ + + +
Sbjct: 1156 QRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDN 1215
Query: 61 TKATIY 66
K+
Sbjct: 1216 KKSNSS 1221
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 26.6 bits (59), Expect = 2.6
Identities = 11/56 (19%), Positives = 31/56 (55%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFE 56
K K +E+ ++ E ++++ G++++ E EE +K++ +E+ + + E +
Sbjct: 5 CKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 26.8 bits (60), Expect = 2.6
Identities = 12/50 (24%), Positives = 24/50 (48%), Gaps = 3/50 (6%)
Query: 4 KKKRKE---KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
KK KEE + + E ++E+ E++ + E++ + EE + K
Sbjct: 48 LKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKME 97
Score = 26.4 bits (59), Expect = 3.9
Identities = 12/42 (28%), Positives = 26/42 (61%)
Query: 3 MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEK 44
+ KK + E+KE++ ++K + EK+E EE ++++ E+
Sbjct: 99 LDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELER 140
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 26.8 bits (60), Expect = 2.7
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 7 RKEKEEKERKKEEQKKEVGEKKEGEEEE 34
EKE+K+ +E+KK + E+K+ EE
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 26.5 bits (59), Expect = 2.7
Identities = 13/34 (38%), Positives = 20/34 (58%), Gaps = 2/34 (5%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKK 38
+R EEK+ +EE K++ ++ E E EKKK
Sbjct: 95 AQRARDEEKKLDEEEAKRQ--HEEAKEREREKKK 126
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 26.6 bits (59), Expect = 2.8
Identities = 19/53 (35%), Positives = 25/53 (47%), Gaps = 6/53 (11%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
K KKK KEK +E KK E + EE E EE +++E+A N
Sbjct: 67 KSKKKDKEKLTEEEKKPE------SDDDKTEENENDPDNNEESGDSQESASAN 113
>gnl|CDD|177954 PLN02320, PLN02320, seryl-tRNA synthetase.
Length = 502
Score = 26.8 bits (59), Expect = 2.8
Identities = 15/40 (37%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 10 KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
++E ER + E + V K +G+ E +++ VEE KN KE
Sbjct: 106 QKEVERLRAE-RNAVANKMKGKLEPSERQALVEEGKNLKE 144
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 26.9 bits (60), Expect = 2.9
Identities = 11/48 (22%), Positives = 24/48 (50%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
+ K E E+ + EE+ +E+ E+ E E E ++ + E+ ++
Sbjct: 329 ESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEE 376
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 27.1 bits (60), Expect = 2.9
Identities = 12/42 (28%), Positives = 21/42 (50%)
Query: 11 EEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
E+ E +K+E KK KK+ +++ KK K + + A
Sbjct: 193 EKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAAS 234
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid
layers. The innermost capsid (core) is made of VP2. The
genomic RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 26.8 bits (59), Expect = 3.3
Identities = 7/38 (18%), Positives = 22/38 (57%)
Query: 13 KERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
R+ + ++K+ E++++K + +++EK +K+
Sbjct: 5 NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKE 42
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 26.5 bits (59), Expect = 3.6
Identities = 11/51 (21%), Positives = 23/51 (45%)
Query: 3 MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADG 53
+K + +E+ K++KK+++KK G E V + ++ G
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVG 222
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 26.6 bits (59), Expect = 3.7
Identities = 11/42 (26%), Positives = 21/42 (50%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
K++ K + K E KKE + +++E+K K + +N
Sbjct: 354 KEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 26.7 bits (60), Expect = 3.7
Identities = 10/48 (20%), Positives = 24/48 (50%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
K+ EK E ++++E++K + ++ E +KKK+ + +
Sbjct: 168 SKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPV 215
>gnl|CDD|217127 pfam02590, SPOUT_MTase, Predicted SPOUT methyltransferase. This
family of proteins are predicted to be SPOUT
methyltransferases.
Length = 154
Score = 26.1 bits (58), Expect = 3.8
Identities = 14/68 (20%), Positives = 28/68 (41%), Gaps = 10/68 (14%)
Query: 6 KRKEKEEKERKKEEQKK----------EVGEKKEGEEEEEKKKKKVEEKKNNKENADGNF 55
K KEK K+ E K+ E+ ++K G + + K+ E ++ G++
Sbjct: 9 KLKEKYVKDGIAEYLKRLSRYCKLELIELPDEKRGSAADIEAIKEKEGERILAAIPPGSY 68
Query: 56 EVTLATKA 63
+ L +
Sbjct: 69 VIALDERG 76
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 26.2 bits (58), Expect = 3.8
Identities = 13/55 (23%), Positives = 30/55 (54%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVT 58
K +E E E KEE+K+ + + ++ + +K+ + E++N +E+ + + E
Sbjct: 53 KSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENE 107
Score = 25.5 bits (56), Expect = 8.3
Identities = 13/43 (30%), Positives = 27/43 (62%), Gaps = 2/43 (4%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
+++ K+ +++E + E+ KE E+KE E+K+ K EK++
Sbjct: 48 EQEAKKSDDQETAEIEEVKE--EEKEAANSEDKEDKGDAEKED 88
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 26.4 bits (59), Expect = 3.8
Identities = 10/34 (29%), Positives = 19/34 (55%)
Query: 10 KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE 43
EE + Q + ++E EEEEE+++++ E
Sbjct: 285 DEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSE 318
>gnl|CDD|234101 TIGR03074, PQQ_membr_DH, membrane-bound PQQ-dependent
dehydrogenase, glucose/quinate/shikimate family. This
protein family has a phylogenetic distribution very
similar to that coenzyme PQQ biosynthesis enzymes, as
shown by partial phylogenetic profiling. Members of this
family have several predicted transmembrane helices in
the N-terminal region, and include the quinoprotein
glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli
and the quinate/shikimate dehydrogenase of Acinetobacter
sp. ADP1 (EC 1.1.99.25). Sequences closely related
except for the absense of the N-terminal hydrophobic
region, scoring in the gray zone between the trusted and
noise cutoffs, include PQQ-dependent glycerol (EC
1.1.99.22) and and other polyol (sugar alcohol)
dehydrogenases.
Length = 764
Score = 26.5 bits (59), Expect = 3.9
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 91 PFDEQTCVLKFGSWTYDGFKVPGSVNGA 118
PFD+ C ++F S Y+G P S G+
Sbjct: 526 PFDQLACRIQFKSLRYEGLYTPPSEQGS 553
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 26.5 bits (59), Expect = 3.9
Identities = 15/42 (35%), Positives = 25/42 (59%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE 43
K+++ + + EE E + EEQ + V E E +EE E + + EE
Sbjct: 349 KIERYQADLEELEERLEEQNEVVEEADEQQEENEARAEAAEE 390
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.
Trichoplein or mitostatin, was first defined as a
meiosis-specific nuclear structural protein. It has
since been linked with mitochondrial movement. It is
associated with the mitochondrial outer membrane, and
over-expression leads to reduction in mitochondrial
motility whereas lack of it enhances mitochondrial
movement. The activity appears to be mediated through
binding the mitochondria to the actin intermediate
filaments (IFs).
Length = 349
Score = 26.4 bits (59), Expect = 4.2
Identities = 14/49 (28%), Positives = 31/49 (63%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
++ KK+ K +E++E ++ ++ E K EEEE+++K+ EE++ +
Sbjct: 26 IEEKKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRA 74
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 26.3 bits (59), Expect = 4.2
Identities = 9/36 (25%), Positives = 20/36 (55%)
Query: 7 RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVE 42
R + EE+ +EQ + K+ E+E++ +K ++
Sbjct: 509 RIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPEIT 544
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 26.5 bits (59), Expect = 4.2
Identities = 16/40 (40%), Positives = 28/40 (70%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKV 41
K+ K ++ EEK ++EE++ E++E EE+EE+KK+K
Sbjct: 8 KLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKP 47
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents
the N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 25.6 bits (57), Expect = 4.2
Identities = 11/43 (25%), Positives = 23/43 (53%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEK 44
++ K +K+KE+ E E K+ E K E E + + ++++
Sbjct: 58 EIGKAKKKKEDAEALIAEVKELKDELKALEAELRELEAELDKL 100
>gnl|CDD|200581 cd10958, CE4_NodB_like_2, Catalytic NodB homology domain of
uncharacterized chitin deacetylases and hypothetical
proteins. This family includes some uncharacterized
chitin deacetylases and hypothetical proteins, mainly
from eukaryotes. Although their biological function is
unknown, members in this family show high sequence
homology to the catalytic NodB homology domain of
Colletotrichum lindemuthianum chitin deacetylase
(endo-chitin de-N-acetylase, ClCDA, EC 3.5.1.41), which
catalyzes the hydrolysis of N-acetamido groups of
N-acetyl-D-glucosamine residues in chitin, converting it
to chitosan in fungal cell walls. Like ClCDA, this
family is a member the carbohydrate esterase 4 (CE4)
superfamily.
Length = 190
Score = 26.1 bits (58), Expect = 4.4
Identities = 9/26 (34%), Positives = 12/26 (46%)
Query: 90 FPFDEQTCVLKFGSWTYDGFKVPGSV 115
+PFD Q F S+ PGS+
Sbjct: 128 YPFDPQIPSPWFNSFFLRRRVSPGSI 153
>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain. FH proteins control
rearrangements of the actin cytoskeleton, especially in
the context of cytokinesis and cell polarisation.
Members of this family have been found to interact with
Rho-GTPases, profilin and other actin-assoziated
proteins. These interactions are mediated by the
proline-rich FH1 domain, usually located in front of FH2
(but not listed in SMART). Despite this cytosolic
function, vertebrate formins have been assigned
functions within the nucleus. A set of Formin-Binding
Proteins (FBPs) has been shown to bind FH1 with their WW
domain.
Length = 392
Score = 26.2 bits (58), Expect = 4.5
Identities = 13/42 (30%), Positives = 24/42 (57%), Gaps = 2/42 (4%)
Query: 1 MKMKKKRKEKEEKERKKEEQ--KKEVGEKKEGEEEEEKKKKK 40
+K K E+ K+ ++EE+ KK V E E E+ ++K++
Sbjct: 327 LKEFSKAAEENIKKEEEEEERRKKLVKETTEYEQSSSRQKER 368
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 25.7 bits (57), Expect = 4.7
Identities = 19/36 (52%), Positives = 26/36 (72%), Gaps = 1/36 (2%)
Query: 10 KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
K E ER K+E+KK EKKE EE+E K++ K ++KK
Sbjct: 92 KLEHERNKQEKKKRSKEKKE-EEKERKRQLKQQKKK 126
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 26.2 bits (58), Expect = 4.8
Identities = 17/57 (29%), Positives = 30/57 (52%), Gaps = 6/57 (10%)
Query: 4 KKKRKEKEEKERKKEEQK------KEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
K+K +K+ K+ KK+ + + E +E E ++ KKKK ++KK K+N
Sbjct: 18 KQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEA 74
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
Provisional.
Length = 330
Score = 25.9 bits (58), Expect = 4.8
Identities = 8/27 (29%), Positives = 15/27 (55%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKK 28
K + K K +K+++K+E K + K
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSK 89
Score = 25.9 bits (58), Expect = 6.0
Identities = 7/30 (23%), Positives = 16/30 (53%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGE 31
+ KK + K +KK+E+++ K + +
Sbjct: 60 LLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89
Score = 25.2 bits (56), Expect = 10.0
Identities = 8/29 (27%), Positives = 15/29 (51%)
Query: 17 KEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
K+E K +K+ E++E K K + +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106. This
archaebacterial protein family has no known function.
Members are predicted to be integral membrane proteins.
Length = 168
Score = 25.7 bits (57), Expect = 4.9
Identities = 15/45 (33%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 7 RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE--KKNNKE 49
RK ++ ++R+KE QK+ +K G++ KK +K +E ++ KE
Sbjct: 40 RKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELMEDQKE 84
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 26.1 bits (58), Expect = 4.9
Identities = 10/47 (21%), Positives = 26/47 (55%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
++K+ EKE E +K+ +K+ + + ++EE +++ E + +
Sbjct: 77 ELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 26.1 bits (57), Expect = 4.9
Identities = 20/60 (33%), Positives = 32/60 (53%), Gaps = 3/60 (5%)
Query: 7 RKEKEEKERKKEEQKKEVGEKKEGEEEEEKK---KKKVEEKKNNKENADGNFEVTLATKA 63
RK ++++E K + KKE ++K+ E+E E K KK K E AD E+ ++ A
Sbjct: 57 RKGQQQRESKPKISKKERRKRKQAEKETEGKTEEKKSTAPKNAPVEPADELPEIDESSVA 116
>gnl|CDD|217360 pfam03087, DUF241, Arabidopsis protein of unknown function. This
family represents a number of Arabidopsis proteins.
Their functions are unknown.
Length = 230
Score = 25.8 bits (57), Expect = 5.0
Identities = 14/49 (28%), Positives = 22/49 (44%), Gaps = 2/49 (4%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
K KK + R E + EK+ EE + K++EE +N+ E
Sbjct: 161 KKSKKGGSQSSLSIR--SELSRLDEEKRRDNEEVKDILKRLEELENSIE 207
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 26.2 bits (57), Expect = 5.0
Identities = 11/26 (42%), Positives = 20/26 (76%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKE 29
K K+KEK+EKE+++++ KK+ E +
Sbjct: 201 KPKKKEKKEKEKERDKDKKKEVEGFK 226
Score = 25.8 bits (56), Expect = 7.8
Identities = 16/42 (38%), Positives = 22/42 (52%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE 43
K KK+RKEKEEK++KK+ G E+ + EE
Sbjct: 281 KKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQNGAVEEE 322
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 25.8 bits (57), Expect = 5.2
Identities = 16/40 (40%), Positives = 28/40 (70%), Gaps = 2/40 (5%)
Query: 17 KEEQK--KEVGEKKEGEEEEEKKKKKVEEKKNNKENADGN 54
EEQK K+V ++K +++++KKKKK ++KK +K+ A
Sbjct: 95 VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
>gnl|CDD|179798 PRK04239, PRK04239, hypothetical protein; Provisional.
Length = 110
Score = 25.6 bits (57), Expect = 5.2
Identities = 12/35 (34%), Positives = 24/35 (68%)
Query: 11 EEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
E + RK EE +K+ E+++ +EE+E+ + + E +K
Sbjct: 5 EIRRRKLEELQKQAQEQQQAQEEQEEAQAQAEAQK 39
>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV.
Length = 336
Score = 25.9 bits (57), Expect = 5.2
Identities = 12/26 (46%), Positives = 17/26 (65%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKE 29
K RK K K+RKK E+K E+ ++ E
Sbjct: 28 AKPRKIKRVKKRKKREEKDELDDEVE 53
Score = 25.6 bits (56), Expect = 8.0
Identities = 16/58 (27%), Positives = 21/58 (36%), Gaps = 18/58 (31%)
Query: 6 KRKEKEE-----------KERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
KRK KEE K+E K + KK+KK EEK + +
Sbjct: 3 KRKIKEEMLQILAPEIYGPRPVKDEAKPRK-------IKRVKKRKKREEKDELDDEVE 53
>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3. The SLD3 DNA
replication regulator is required for loading and
maintenance of Cdc45 on chromatin during DNA
replication.
Length = 437
Score = 25.9 bits (57), Expect = 5.2
Identities = 11/41 (26%), Positives = 18/41 (43%)
Query: 6 KRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
R K EKE KK+ K+ + + K+KK ++
Sbjct: 121 DRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLP 161
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 25.9 bits (57), Expect = 5.4
Identities = 8/34 (23%), Positives = 19/34 (55%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEE 35
++ + + + ++ + E+ KE + +GEEEE
Sbjct: 211 ELGDEPESADSEDNEDEDDPKEDEDDDQGEEEES 244
>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
pleckstrin homology (PH) domain. The BCR gene is one of
the two genes in the BCR-ABL complex, which is
associated with the Philadelphia chromosome, a product
of a reciprocal translocation between chromosomes 22 and
9. BCR is a GTPase-activating protein (GAP) for RAC1
(primarily) and CDC42. The Dbl region of BCR has the
most RhoGEF activity for Cdc42, and less activity
towards Rac and Rho. Since BCR possesses both GAP and
GEF activities, it may function to temporally regulate
the activity of these GTPases. It also displays
serine/threonine kinase activity. The BCR protein
contains multiple domains including an N-terminal kinase
domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
domain, and a C-terminal RhoGAP domain. ABR, a related
smaller protein, is structurally similar to BCR, but
lacks the N-terminal kinase domain and has GAP activity
for both Rac and Cdc42. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 179
Score = 25.8 bits (57), Expect = 5.6
Identities = 9/42 (21%), Positives = 18/42 (42%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEE 43
++ K+ K++ K + + G EK +KK+ E
Sbjct: 91 LVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKKLAE 132
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 26.1 bits (57), Expect = 5.6
Identities = 11/51 (21%), Positives = 24/51 (47%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENA 51
+ K + + KE+++ +K+E E+ + + KK EE K++
Sbjct: 254 QQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHK 304
Score = 25.7 bits (56), Expect = 6.4
Identities = 12/50 (24%), Positives = 25/50 (50%), Gaps = 2/50 (4%)
Query: 5 KKRKEKEEK--ERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
K+R E +K E K + + KE ++ E +K+++E+ + + D
Sbjct: 245 KQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKND 294
>gnl|CDD|202558 pfam03159, XRN_N, XRN 5'-3' exonuclease N-terminus. This family
aligns residues towards the N-terminus of several
proteins with multiple functions. The members of this
family all appear to possess 5'-3' exonuclease activity
EC:3.1.11.-. Thus, the aligned region may be necessary
for 5' to 3' exonuclease function. The family also
contains several Xrn1 and Xrn2 proteins. The 5'-3'
exoribonucleases Xrn1p and Xrn2p/Rat1p function in the
degradation and processing of several classes of RNA in
Saccharomyces cerevisiae. Xrn1p is the main enzyme
catalyzing cytoplasmic mRNA degradation in multiple
decay pathways, whereas Xrn2p/Rat1p functions in the
processing of rRNAs and small nucleolar RNAs (snoRNAs)
in the nucleus.
Length = 237
Score = 25.8 bits (57), Expect = 6.0
Identities = 13/45 (28%), Positives = 25/45 (55%), Gaps = 7/45 (15%)
Query: 2 KMKKKR-------KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKK 39
KM ++R K+ +EKE + EE ++E+ + E+ +K+K
Sbjct: 92 KMNQQRSRRFRAAKDAKEKEAEAEENREELETEGIKLPEKVEKEK 136
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 26.1 bits (58), Expect = 6.1
Identities = 14/44 (31%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 17 KEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLA 60
+EE ++ V E + EE++K+K+++ E +NN ++ E TL
Sbjct: 500 EEEIERMVKEAEANAEEDKKRKEEI-EARNNADSLAYQAEKTLK 542
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 25.7 bits (57), Expect = 6.3
Identities = 12/23 (52%), Positives = 15/23 (65%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKE 23
K KKK KEEK+ KEE++K
Sbjct: 97 EKEKKKAMSKEEKKAIKEEKEKL 119
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 25.8 bits (57), Expect = 6.4
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEE 32
K +KE+ E++ ++EE+K EK EE
Sbjct: 153 KIKKERAEEKEREEEEKAAEEEKAREEE 180
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 25.7 bits (57), Expect = 6.4
Identities = 8/39 (20%), Positives = 21/39 (53%)
Query: 10 KEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
+E K + + ++ ++ +K+E E+EK ++ + K
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKA 142
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 26.1 bits (57), Expect = 6.6
Identities = 16/71 (22%), Positives = 30/71 (42%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLAT 61
+ K+K+KE+ +K ++ E E+ E + +E K K+ +K E+
Sbjct: 167 REKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEN 226
Query: 62 KATIYYQGLVE 72
+ Y L E
Sbjct: 227 LLYLDYLKLNE 237
>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
Length = 773
Score = 26.0 bits (58), Expect = 6.8
Identities = 19/84 (22%), Positives = 34/84 (40%), Gaps = 3/84 (3%)
Query: 6 KRKEKEEKE---RKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLATK 62
+RKEK+ KE K K++ E E ++ EE++K + + ++ E +
Sbjct: 490 RRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKE 549
Query: 63 ATIYYQGLVEWKPPAIYKSSCEID 86
E + P I S E+
Sbjct: 550 EKDKETEEDEPEGPKIIVDSRELR 573
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 25.8 bits (57), Expect = 7.0
Identities = 7/43 (16%), Positives = 18/43 (41%), Gaps = 3/43 (6%)
Query: 7 RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
+ E ++ E + E++ ++ K + +EE+K
Sbjct: 174 AAVRAEIAAEQAELTTLLSEQRA---QQAKLAQLLEERKKTLA 213
Score = 25.8 bits (57), Expect = 7.2
Identities = 15/55 (27%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFE 56
++KK +E++++ K E+Q K + E + E + + + KK K+ AD N
Sbjct: 49 ALEKKIREQQDQRAKLEKQLKSL-ETEIASLEAQLIETADDLKKLRKQIADLNAR 102
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 25.5 bits (56), Expect = 7.1
Identities = 12/41 (29%), Positives = 22/41 (53%)
Query: 8 KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNK 48
E E+ K++++ E +KKE EE + +KK + K +
Sbjct: 82 ASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQ 122
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 25.7 bits (56), Expect = 7.4
Identities = 15/45 (33%), Positives = 27/45 (60%)
Query: 8 KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
K + E++ EE ++E E++ G ++ KK KK++ KKN + D
Sbjct: 311 KPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDD 355
>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
Endonuclease that cleaves the 5'-overhanging flap
structure that is generated by displacement synthesis
when DNA polymerase encounters the 5'-end of a
downstream Okazaki fragment. Has 5'-endo-/exonuclease
and 5'-pseudo-Y-endonuclease activities. Cleaves the
junction between single and double-stranded regions of
flap DNA.
Length = 338
Score = 25.7 bits (57), Expect = 7.6
Identities = 10/33 (30%), Positives = 20/33 (60%), Gaps = 1/33 (3%)
Query: 6 KRKEKEEKERKKEEQKKEVGE-KKEGEEEEEKK 37
K + EE+ +EE +++ E ++G+ EE +K
Sbjct: 87 KAETLEERREIREEAEEKWEEALEKGDLEEARK 119
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 25.7 bits (57), Expect = 7.8
Identities = 12/61 (19%), Positives = 21/61 (34%), Gaps = 12/61 (19%)
Query: 2 KMKKKRKE---KEEKERKKEEQKKEVGEKKEGEEEEEK---------KKKKVEEKKNNKE 49
K K+ +E KEE + + +K ++E + KKVE +
Sbjct: 18 KAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRANRLKKVEALRAKGV 77
Query: 50 N 50
Sbjct: 78 E 78
>gnl|CDD|235040 PRK02463, PRK02463, OxaA-like protein precursor; Provisional.
Length = 307
Score = 25.5 bits (56), Expect = 7.9
Identities = 11/46 (23%), Positives = 19/46 (41%), Gaps = 6/46 (13%)
Query: 1 MKMKKKRKEKEE------KERKKEEQKKEVGEKKEGEEEEEKKKKK 40
+K + +++ EE K K +K+V + KKKK
Sbjct: 251 LKPRLRKQIAEEFAKNPPKAYKSPSGRKDVTPSQNQAIASLPKKKK 296
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 25.8 bits (56), Expect = 7.9
Identities = 19/67 (28%), Positives = 38/67 (56%), Gaps = 2/67 (2%)
Query: 10 KEEKERKKEEQKKEVGEKKEGEEEEEKKKKK--VEEKKNNKENADGNFEVTLATKATIYY 67
+E++E ++E K V EK E EEEEE K K ++ + K+ + + + + A+++
Sbjct: 149 EEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPSVKDNGKSFWSAASVFS 208
Query: 68 QGLVEWK 74
+ L +W+
Sbjct: 209 KKLQKWR 215
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 25.8 bits (56), Expect = 8.0
Identities = 9/44 (20%), Positives = 22/44 (50%)
Query: 9 EKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
+++ +K + + + EEEK +KKV +++ K+ +
Sbjct: 105 ALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDE 148
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 25.5 bits (56), Expect = 8.1
Identities = 10/41 (24%), Positives = 22/41 (53%)
Query: 5 KKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKK 45
+ + EK + + + KK +E++++KKK ++KK
Sbjct: 37 NSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 25.4 bits (56), Expect = 8.1
Identities = 21/73 (28%), Positives = 37/73 (50%), Gaps = 5/73 (6%)
Query: 1 MKMKKKRKEKEEK-ERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTL 59
++ K K E+E+K E +E+ K++ +K GE+ EE K E K E + ++ +
Sbjct: 89 LQRKLKELEREQKEEEVREKHNKKI-IEKFGEDLEEVYKFMKGEAK--VEEEEEK-QMEI 144
Query: 60 ATKATIYYQGLVE 72
KA Y +V+
Sbjct: 145 LEKALKSYLKIVK 157
>gnl|CDD|218010 pfam04293, SpoVR, SpoVR like protein. Bacillus subtilis stage V
sporulation protein R is involved in spore cortex
formation. Little is known about cortex biosynthesis,
except that it depends on several sigma E controlled
genes, including spoVR.
Length = 427
Score = 25.6 bits (57), Expect = 8.3
Identities = 12/49 (24%), Positives = 24/49 (48%), Gaps = 10/49 (20%)
Query: 2 KMKKKRKEKEEKERKKEEQK----------KEVGEKKEGEEEEEKKKKK 40
K K +EEK R++E ++ + + ++E EEE ++K+
Sbjct: 170 KRPIKLSLEEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARRKR 218
>gnl|CDD|220265 pfam09491, RE_AlwI, AlwI restriction endonuclease. This family
includes the AlwI (recognises GGATC), Bsp6I (recognises
GC^NGC), BstNBI (recognises GASTC), PleI(recognises
GAGTC) and MlyI (recognises GAGTC) restriction
endonucleases.
Length = 429
Score = 25.4 bits (56), Expect = 8.3
Identities = 17/55 (30%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 2 KMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNN-KENADGNF 55
++ K RKE+ + + ++ KE EK+ + KKK E ++N K+ AD F
Sbjct: 101 RILKFRKERLKNKDNIKKLSKEYFEKECYSISDSSSKKKFETSQSNLKDYADTFF 155
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 25.7 bits (56), Expect = 8.4
Identities = 14/47 (29%), Positives = 24/47 (51%), Gaps = 2/47 (4%)
Query: 3 MKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
M+K+ K + + E+Q E EKK +EEEE + + ++E
Sbjct: 481 MEKRLKSEADSRVNAEKQLAE--EKKRKKEEEETAARAAAQAAASRE 525
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 25.4 bits (56), Expect = 8.8
Identities = 7/40 (17%), Positives = 22/40 (55%)
Query: 7 RKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKN 46
++E EE +R+ E+ + E+ + ++ +K +++ +
Sbjct: 442 KRELEELKREIEKLESELERFRREVRDKVRKDREIRARDR 481
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex
together with SecB and SecD on the surface of
Salmonella. All these proteins are secreted by the type
III secretion system. Many mucosal pathogens use type
III secretion systems for the injection of effector
proteins into target cells. SecB, SseC and SecD are
inserted into the target cell membrane. where they form
a small pore or translocon. In addition to SseC, this
family includes the bacterial secreted proteins PopB,
PepB, YopB and EspD which are thought to be directly
involved in pore formation, and type III secretion
system translocon.
Length = 303
Score = 25.5 bits (56), Expect = 8.8
Identities = 8/39 (20%), Positives = 25/39 (64%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVE 42
K+ + + ++ ER +++Q+K+ E +E ++ +K ++ +
Sbjct: 15 KQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAK 53
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 25.4 bits (56), Expect = 9.0
Identities = 15/47 (31%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 6 KRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENAD 52
KR+ E ++ E+ E E+K E E E + ++EE + +KE A+
Sbjct: 614 KREALAELNDERRERLAEKRERKR-ELEAEFDEARIEEAREDKERAE 659
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 25.2 bits (55), Expect = 9.0
Identities = 10/50 (20%), Positives = 26/50 (52%)
Query: 1 MKMKKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKEN 50
++ + +KE+E + EE +KE + E +++K+++E ++
Sbjct: 70 SELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLR 119
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 25.5 bits (56), Expect = 9.3
Identities = 19/55 (34%), Positives = 33/55 (60%), Gaps = 3/55 (5%)
Query: 8 KEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLATK 62
E+EE E ++EE+ ++ G KE ++EE ++ VE K E++DGN + + K
Sbjct: 390 DEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESK---YEDSDGNSSLAVGYK 441
>gnl|CDD|235665 PRK05996, motB, flagellar motor protein MotB; Validated.
Length = 423
Score = 25.4 bits (56), Expect = 9.3
Identities = 7/32 (21%), Positives = 15/32 (46%)
Query: 22 KEVGEKKEGEEEEEKKKKKVEEKKNNKENADG 53
K+ + EGE++ K K + +++ G
Sbjct: 84 KDPVDGAEGEQKPGKSKFEEDQRVEGSSAVTG 115
>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage [Cellular processes, DNA transformation].
Length = 410
Score = 25.5 bits (56), Expect = 9.4
Identities = 9/46 (19%), Positives = 24/46 (52%), Gaps = 2/46 (4%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKE 49
K K E+E E+K++++K+ + + + E+ K+ + ++
Sbjct: 58 KMKAYEQEALEQKQKDKKRTLQDL--ADSFEKSDKESQDYSSSDSG 101
>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein. PACS-1 is a
cytosolic sorting protein that directs the localisation
of membrane proteins in the trans-Golgi network
(TGN)/endosomal system. PACS-1 connects the clathrin
adaptor AP-1 to acidic cluster sorting motifs contained
in the cytoplasmic domain of cargo proteins such as
furin, the cation-independent mannose-6-phosphate
receptor and in viral proteins such as human
immunodeficiency virus type 1 Nef.
Length = 413
Score = 25.2 bits (55), Expect = 9.6
Identities = 11/30 (36%), Positives = 17/30 (56%), Gaps = 1/30 (3%)
Query: 13 KERKKEEQKKEVGEKKEGEEEEEKKKKKVE 42
KE+ K+ +G KK E+E E K + +E
Sbjct: 332 KEKNKKVPVMFLG-KKPKEKEVESKSQCIE 360
>gnl|CDD|235185 PRK03980, PRK03980, flap endonuclease-1; Provisional.
Length = 292
Score = 25.2 bits (56), Expect = 9.6
Identities = 13/33 (39%), Positives = 22/33 (66%), Gaps = 1/33 (3%)
Query: 6 KRKEKEEKERKKEEQKKEVGE-KKEGEEEEEKK 37
K +E EE+ +EE +++ E K+EG+ EE +K
Sbjct: 40 KAEEIEERREVREEAEEKYEEAKEEGDLEEARK 72
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 25.1 bits (55), Expect = 9.6
Identities = 14/60 (23%), Positives = 29/60 (48%)
Query: 4 KKKRKEKEEKERKKEEQKKEVGEKKEGEEEEEKKKKKVEEKKNNKENADGNFEVTLATKA 63
R+EK+E+E ++EE + + G E EE +++ +E + E + N + +
Sbjct: 175 ALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGSS 234
>gnl|CDD|178354 PLN02753, PLN02753, triacylglycerol lipase.
Length = 531
Score = 25.4 bits (55), Expect = 9.7
Identities = 12/41 (29%), Positives = 23/41 (56%), Gaps = 2/41 (4%)
Query: 7 RKEKEEKERKKE--EQKKEVGEKKEGEEEEEKKKKKVEEKK 45
R E+E +ER+ ++ + GE E+ ++ KK EE++
Sbjct: 61 RLERERRERQGLLIDEAEGAGELWLTAEDIRRRDKKTEEER 101
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.305 0.127 0.357
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,894,001
Number of extensions: 521446
Number of successful extensions: 8199
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5567
Number of HSP's successfully gapped: 1891
Length of query: 118
Length of database: 10,937,602
Length adjustment: 81
Effective length of query: 37
Effective length of database: 7,344,928
Effective search space: 271762336
Effective search space used: 271762336
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 53 (24.3 bits)