RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16486
(150 letters)
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 42.0 bits (98), Expect = 4e-05
Identities = 15/43 (34%), Positives = 28/43 (65%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
+KR+ +K E E++ +++ E EK+K++E+ERE E+ E
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERA 621
Score = 40.1 bits (93), Expect = 2e-04
Identities = 19/37 (51%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
KK EE E K K E E+K +EE+ERE+EK +E E
Sbjct: 579 KKREEAVE--KAKREAEQKAREEREREKEKEKERERE 613
Score = 37.4 bits (86), Expect = 0.002
Identities = 17/42 (40%), Positives = 30/42 (71%), Gaps = 1/42 (2%)
Query: 97 KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
K+E + KR+ E+K + EE+E EK+K++E E++++ E ER
Sbjct: 580 KREEAVEKAKREAEQKAR-EEREREKEKEKEREREREREAER 620
Score = 33.1 bits (75), Expect = 0.046
Identities = 15/31 (48%), Positives = 22/31 (70%), Gaps = 2/31 (6%)
Query: 122 KKKKEEEEKKKK--EEKEREEEKREEENVEE 150
KK++E EK K+ E+K REE +RE+E +E
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKE 609
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 39.9 bits (94), Expect = 2e-04
Identities = 23/46 (50%), Positives = 35/46 (76%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K++ E+KK+ ++K KKKEEEE+++KE+KE E+E+ EEE EE
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465
Score = 39.5 bits (93), Expect = 2e-04
Identities = 24/45 (53%), Positives = 33/45 (73%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K + +KK EE+++EKKKK KKK+EE+E E+EK+EEE EE
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEE 458
Score = 39.1 bits (92), Expect = 3e-04
Identities = 24/61 (39%), Positives = 40/61 (65%)
Query: 90 LEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
+EF + +KK + + +KK+ EEK+E+KKK +KK++EE+E +E+K EE+ E
Sbjct: 399 IEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEE 458
Query: 150 E 150
E
Sbjct: 459 E 459
Score = 39.1 bits (92), Expect = 3e-04
Identities = 18/59 (30%), Positives = 36/59 (61%)
Query: 92 FRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K + + EKK++ E+KE++KK ++K+++EE+E+E+++ E+E EE
Sbjct: 402 LTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE 460
Score = 39.1 bits (92), Expect = 4e-04
Identities = 19/57 (33%), Positives = 39/57 (68%)
Query: 94 SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ K + + +++ E++KK ++K+ KK+EEE+++++EK+ EE++ EEE EE
Sbjct: 408 ATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464
Score = 38.4 bits (90), Expect = 6e-04
Identities = 22/51 (43%), Positives = 35/51 (68%)
Query: 97 KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
KK K++ K +E+EEE++K+++EE+K++EE+E EEEK EEE
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471
Score = 38.0 bits (89), Expect = 8e-04
Identities = 21/46 (45%), Positives = 34/46 (73%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
R+ + ++KKK ++K+++EEEEK+KKEE++ EEE+ EE EE
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468
Score = 36.8 bits (86), Expect = 0.002
Identities = 22/41 (53%), Positives = 33/41 (80%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
EKKKK +++++++EEE++KK+EEKE EEE+ EEE EE
Sbjct: 429 EKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469
Score = 36.4 bits (85), Expect = 0.003
Identities = 20/46 (43%), Positives = 35/46 (76%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+KR+ EKK+K ++ KKK+EEEE++K++++E +EE+ EE E+
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466
Score = 35.7 bits (83), Expect = 0.005
Identities = 19/46 (41%), Positives = 35/46 (76%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++ + +K+KK + +KK++EEEE+K+K+E+E+EEE+ E E +E
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKE 467
Score = 35.3 bits (82), Expect = 0.007
Identities = 16/46 (34%), Positives = 35/46 (76%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+K + +K ++KEEE+++++E+++++KEE+E E E+ +EE E+
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472
Score = 35.3 bits (82), Expect = 0.008
Identities = 16/51 (31%), Positives = 32/51 (62%)
Query: 96 TKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
T++E ++ KK K ++ EKK++EE+++KKK+ ++++ EEE
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEE 445
Score = 34.5 bits (80), Expect = 0.012
Identities = 19/62 (30%), Positives = 39/62 (62%)
Query: 89 LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
L + TKK + + + +++K E+K++ K++EE++++E++++EEEK EEE
Sbjct: 402 LTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461
Query: 149 EE 150
E
Sbjct: 462 AE 463
Score = 34.1 bits (79), Expect = 0.018
Identities = 19/45 (42%), Positives = 35/45 (77%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+++ +KKK K++E++++EE+EKK++E++E EEE EE+ EE
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEE 470
Score = 33.0 bits (76), Expect = 0.043
Identities = 17/43 (39%), Positives = 36/43 (83%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
+ G+KK++ EE+E+EKK++E+EE++++ E+E+EEE+ +++
Sbjct: 433 KAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 39.6 bits (92), Expect = 3e-04
Identities = 20/52 (38%), Positives = 34/52 (65%)
Query: 98 KEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
K+ G G +EE+EEE++++EEEE++++EE+E EEE E ++E
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897
Score = 38.4 bits (89), Expect = 6e-04
Identities = 19/56 (33%), Positives = 33/56 (58%)
Query: 91 EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
E +E + E + G + EEE++++EEEE++++EE+E EEE+ E E
Sbjct: 836 ELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891
Score = 37.7 bits (87), Expect = 0.001
Identities = 18/44 (40%), Positives = 27/44 (61%)
Query: 107 RQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+Q EK + +EEEE++++EE+E EEE+ EEE EE
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889
Score = 33.8 bits (77), Expect = 0.029
Identities = 18/49 (36%), Positives = 29/49 (59%), Gaps = 6/49 (12%)
Query: 108 QGEKKKKNEEKEEE------KKKKEEEEKKKKEEKEREEEKREEENVEE 150
QGE K+ + + ++EEEE++++EE+E EEE+ EEE E
Sbjct: 842 QGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEEN 890
Score = 27.7 bits (61), Expect = 2.9
Identities = 12/36 (33%), Positives = 27/36 (75%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
E++++ EE+EEE++++EEEE ++ E E ++++
Sbjct: 870 EEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905
Score = 26.9 bits (59), Expect = 6.0
Identities = 13/42 (30%), Positives = 21/42 (50%)
Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
GE+ E+E E + K E E + + ER+ E+ E +E
Sbjct: 660 GEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEA 701
Score = 26.5 bits (58), Expect = 6.9
Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 1/60 (1%)
Query: 91 EFRSETKKEGNL-GFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
E + E + EG + RK + E + + E KE + K + E E+ + E + E +E +E
Sbjct: 674 ETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIE 733
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 38.9 bits (91), Expect = 4e-04
Identities = 14/37 (37%), Positives = 27/37 (72%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
E +++ EEKE K++K + KK+EEK+++E ++ E+
Sbjct: 552 ELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 38.1 bits (89), Expect = 9e-04
Identities = 14/35 (40%), Positives = 27/35 (77%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
+R+ E+K+ +E++ +K K++EEKKKKE ++ E+
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 32.7 bits (75), Expect = 0.062
Identities = 20/53 (37%), Positives = 35/53 (66%), Gaps = 3/53 (5%)
Query: 101 NLGFR---KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
NLG R K G K ++KEE +++KEE+E K++++ R+ +K+EE+ +E
Sbjct: 530 NLGIRIEDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKE 582
Score = 31.9 bits (73), Expect = 0.095
Identities = 15/58 (25%), Positives = 34/58 (58%)
Query: 90 LEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
L R E K +G ++ E+ ++ +E++E K+++ K KK+E+++++E + E
Sbjct: 531 LGIRIEDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 30.8 bits (70), Expect = 0.23
Identities = 13/39 (33%), Positives = 28/39 (71%), Gaps = 2/39 (5%)
Query: 114 KNEEKEEEKKKKEEEEKK--KKEEKEREEEKREEENVEE 150
K E + E+++K+ +E+K +K +K+ E++K+E E +E+
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 35.7 bits (83), Expect = 0.005
Identities = 22/72 (30%), Positives = 37/72 (51%), Gaps = 19/72 (26%)
Query: 90 LEFRSETKKEGNLGFRKRQGEKKKKN----------EEKEEEKKKKEE--------EEKK 131
L+ E ++ + R+ + EK K E+KEE+KK++ E EE++
Sbjct: 251 LKLSPEVLRKVD-KTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQR 309
Query: 132 KKEEKEREEEKR 143
K EEKER+++ R
Sbjct: 310 KLEEKERKKQAR 321
Score = 34.9 bits (81), Expect = 0.008
Identities = 19/47 (40%), Positives = 29/47 (61%)
Query: 100 GNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
NL K K E+EEEK K EE++++E +E++EEK++EE
Sbjct: 249 ANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEE 295
Score = 31.1 bits (71), Expect = 0.17
Identities = 15/46 (32%), Positives = 26/46 (56%), Gaps = 5/46 (10%)
Query: 91 EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK 136
E E K+E +K++ + K + EE++K EE+E+KK+ K
Sbjct: 282 EEAQEKKEE-----KKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 35.2 bits (81), Expect = 0.008
Identities = 15/49 (30%), Positives = 29/49 (59%), Gaps = 3/49 (6%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEE---EKKKKEEKEREEEKREEENVEE 150
+K K + + +E E K KEE+ E+ +K++KE+ +E +E+ +E
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431
Score = 33.2 bits (76), Expect = 0.040
Identities = 17/62 (27%), Positives = 31/62 (50%)
Query: 89 LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
L + E K +L KRQ K+ N++ ++ KK E + + +E E + +EE+
Sbjct: 349 LTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLK 408
Query: 149 EE 150
+E
Sbjct: 409 QE 410
Score = 29.0 bits (65), Expect = 0.98
Identities = 10/26 (38%), Positives = 20/26 (76%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKK 133
Q E +KK +E+ +E K+K +++++KK
Sbjct: 409 QEENEKKQKEQADEDKEKRQKDERKK 434
Score = 29.0 bits (65), Expect = 0.99
Identities = 12/29 (41%), Positives = 24/29 (82%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
E+K K EE E+++K++ +E+K+K+++ ER
Sbjct: 404 EEKLKQEENEKKQKEQADEDKEKRQKDER 432
Score = 26.3 bits (58), Expect = 6.9
Identities = 12/55 (21%), Positives = 33/55 (60%), Gaps = 3/55 (5%)
Query: 89 LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
+ E K E + + E K K E+ ++E+ +K+++E+ +++++R++++R
Sbjct: 381 YTKKLGEVKDETD---ASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 34.7 bits (79), Expect = 0.008
Identities = 21/60 (35%), Positives = 39/60 (65%), Gaps = 2/60 (3%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE--EENVEE 150
+ + KK+ +L R+ Q EK ++ E+ EE+ + E E+K E +++EE++RE E+ +EE
Sbjct: 112 KDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEE 171
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 35.2 bits (81), Expect = 0.009
Identities = 25/64 (39%), Positives = 40/64 (62%), Gaps = 4/64 (6%)
Query: 91 EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKK----KKEEEEKKKKEEKEREEEKREEE 146
E ++E+ KE + + EKKKK E+ +EE K K+E +EK+ +EKE+E+EK+ EE
Sbjct: 97 EPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE 156
Query: 147 NVEE 150
+
Sbjct: 157 PRDR 160
Score = 33.3 bits (76), Expect = 0.032
Identities = 14/57 (24%), Positives = 34/57 (59%)
Query: 94 SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+K ++ + + EE++E+++ KEE++KKK++ KE ++++ +E +E
Sbjct: 83 GGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKE 139
Score = 29.9 bits (67), Expect = 0.45
Identities = 17/57 (29%), Positives = 34/57 (59%)
Query: 94 SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E +KE +K++ EK K+ + + K++ +E+ K++EKE+E++ E + EE
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREE 162
Score = 29.9 bits (67), Expect = 0.54
Identities = 18/58 (31%), Positives = 35/58 (60%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ + K+E K + ++K+ +EKE+EK+KK EE + ++EEK+RE + + +
Sbjct: 121 KEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKP 178
Score = 29.5 bits (66), Expect = 0.64
Identities = 18/57 (31%), Positives = 32/57 (56%)
Query: 90 LEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
L K+ G + + K E K E K++E+E+++ KEEK++++EK +EE
Sbjct: 71 LSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEE 127
Score = 28.3 bits (63), Expect = 1.9
Identities = 14/54 (25%), Positives = 29/54 (53%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
+ +K KR ++K+K +EK+ E+ + EEEKK++ + + K+ +
Sbjct: 127 EPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPK 180
Score = 26.4 bits (58), Expect = 7.8
Identities = 14/53 (26%), Positives = 25/53 (47%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
E KE K + ++KK E ++ E++KK E + K K+ ++K
Sbjct: 133 PKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPN 185
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 34.9 bits (81), Expect = 0.009
Identities = 10/41 (24%), Positives = 24/41 (58%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
KK + ++++ EKK ++EE +K +K+ E ++ + +
Sbjct: 411 RKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDG 451
Score = 32.6 bits (75), Expect = 0.060
Identities = 15/43 (34%), Positives = 23/43 (53%), Gaps = 2/43 (4%)
Query: 110 EKKK--KNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E+KK K + K E+K +KEE EK ++K K+ + E
Sbjct: 410 ERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGE 452
Score = 32.2 bits (74), Expect = 0.075
Identities = 14/41 (34%), Positives = 22/41 (53%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
+R+ EKK + EE E+ KK+ E KK + E K+ +
Sbjct: 418 QRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDP 458
Score = 32.2 bits (74), Expect = 0.081
Identities = 12/36 (33%), Positives = 21/36 (58%)
Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
E K+ KK+++ E+K +KEE E+ K++ E
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAA 443
Score = 29.9 bits (68), Expect = 0.45
Identities = 10/46 (21%), Positives = 24/46 (52%), Gaps = 2/46 (4%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK--EREEEKREEENVE 149
+GE+++ ++K+ +K++K EK E+EE ++ +
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKK 438
Score = 29.5 bits (67), Expect = 0.70
Identities = 12/46 (26%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 105 RKRQGEKKKKNEEKEE-EKKKKEEEEKKKKEEKEREEEKREEENVE 149
RK+ +K++K E+K E E+ +K +KK + ++ + E
Sbjct: 411 RKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456
Score = 28.4 bits (64), Expect = 1.7
Identities = 8/35 (22%), Positives = 20/35 (57%)
Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
+K +K++K E++ +K+E ++ +K+ E
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAA 442
Score = 28.0 bits (63), Expect = 2.5
Identities = 13/39 (33%), Positives = 21/39 (53%)
Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ EE+E E E+KK +K+R+ EK+ E+ E
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAE 431
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 34.7 bits (80), Expect = 0.009
Identities = 16/46 (34%), Positives = 33/46 (71%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+K EK+K +K++ KKKK++++ KK ++K+ + EK++E+ E+
Sbjct: 75 KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAED 120
Score = 33.9 bits (78), Expect = 0.016
Identities = 11/44 (25%), Positives = 32/44 (72%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
K++ + KKK +K+++K K ++++KK + ++++E++ E++ +
Sbjct: 81 KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124
Score = 31.6 bits (72), Expect = 0.088
Identities = 14/45 (31%), Positives = 32/45 (71%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++Q K KK + K+++ K K++++ KK ++ E+++EK E+ +E+
Sbjct: 80 EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124
Score = 28.9 bits (65), Expect = 0.73
Identities = 12/52 (23%), Positives = 29/52 (55%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
E K++ +K + +K K ++K+++K K E++ +K+ E + E+ +
Sbjct: 79 EEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYS 130
Score = 28.5 bits (64), Expect = 0.91
Identities = 11/39 (28%), Positives = 27/39 (69%)
Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
+KK K+++K+++KK + E+K +KE +++ E+ + +
Sbjct: 92 KKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYS 130
Score = 27.3 bits (61), Expect = 2.7
Identities = 16/41 (39%), Positives = 31/41 (75%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+KKKK +E EK KKE EEK+K + K+++ +K+++++ ++
Sbjct: 61 KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDK 101
Score = 27.3 bits (61), Expect = 2.8
Identities = 11/33 (33%), Positives = 18/33 (54%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEK 142
+K K ++K E+K +KE E+K + K E
Sbjct: 101 KKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETL 133
Score = 26.2 bits (58), Expect = 7.2
Identities = 9/51 (17%), Positives = 29/51 (56%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
+ K +K++ + K K ++K+++K +K++E++ + + ++ + E
Sbjct: 82 QKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSET 132
Score = 25.8 bits (57), Expect = 8.7
Identities = 11/36 (30%), Positives = 26/36 (72%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
E +K +E EE++K K +++K KK++ + +++K ++
Sbjct: 70 EIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDK 105
Score = 25.8 bits (57), Expect = 9.8
Identities = 12/45 (26%), Positives = 31/45 (68%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ EK KK E++++ K K+++ KKKK++ + +++ ++++ E+
Sbjct: 68 AEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEK 112
>gnl|CDD|212071 cd11502, SLC6sbd_NTT5, Neurotransmitter transporter 5;
solute-binding domain. Human NTT5 is encoded by the
SLC6A16 gene. NTT5 is expressed in testis, pancreas, and
prostate; its expression is predominantly intracellular,
indicative of a vesicular location. Its substrates are
unknown. This subgroup belongs to the solute carrier 6
(SLC6) transporter family.
Length = 543
Score = 34.7 bits (80), Expect = 0.011
Identities = 10/34 (29%), Positives = 18/34 (52%), Gaps = 4/34 (11%)
Query: 53 YFIGDSFETSVPWDKTVLLCINVKKRLTRECTVS 86
+++G SF+ +PW++ C VK EC +
Sbjct: 107 FYLGHSFQFPLPWEQ----CPLVKNSSDPECART 136
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 34.8 bits (80), Expect = 0.012
Identities = 15/37 (40%), Positives = 22/37 (59%), Gaps = 1/37 (2%)
Query: 110 EKKKKNEEKEEEK-KKKEEEEKKKKEEKEREEEKREE 145
KK + EK E K +KKE +K++ E+E E R+E
Sbjct: 332 IKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQE 368
Score = 29.4 bits (66), Expect = 0.74
Identities = 10/34 (29%), Positives = 16/34 (47%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
E K+ +K E + +KKE ++ E N E
Sbjct: 331 EIKKTRTAEKNEAKARKKEIAQKRRAAEREINRE 364
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 34.3 bits (79), Expect = 0.015
Identities = 17/37 (45%), Positives = 25/37 (67%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
KK K E+K E KKK E++KKKK+EK+ + + E +
Sbjct: 55 KKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKL 91
Score = 31.6 bits (72), Expect = 0.11
Identities = 16/38 (42%), Positives = 24/38 (63%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
K KK ++ E KKK E+++KKKKE+KE + E +
Sbjct: 56 KDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGF 93
Score = 31.6 bits (72), Expect = 0.11
Identities = 11/35 (31%), Positives = 24/35 (68%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
E KKK+E+K+++KK+K+E + + + + + K+
Sbjct: 65 ESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKS 99
Score = 27.4 bits (61), Expect = 3.2
Identities = 14/58 (24%), Positives = 32/58 (55%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
SE + + K+ ++ K NE K++ +KKK+++++KK+ + E E + + +
Sbjct: 42 FSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKS 99
Score = 27.0 bits (60), Expect = 4.5
Identities = 16/66 (24%), Positives = 28/66 (42%)
Query: 82 ECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
L F E K +K + E K +K+ EKKKK+++EKK+ + + +
Sbjct: 32 LVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKL 91
Query: 142 KREEEN 147
+
Sbjct: 92 GFKTPK 97
Score = 27.0 bits (60), Expect = 5.0
Identities = 11/59 (18%), Positives = 27/59 (45%)
Query: 90 LEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
E ++ +K + K + ++++KKKKE++E K + E + + ++
Sbjct: 43 SEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKK 101
Score = 26.2 bits (58), Expect = 7.8
Identities = 12/42 (28%), Positives = 20/42 (47%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
+K + E K + E K K K+ ++ KKK K + E +
Sbjct: 77 KKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNA 118
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 34.3 bits (79), Expect = 0.020
Identities = 17/38 (44%), Positives = 28/38 (73%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
+ + K ++ EE+ KKK+++EKKK+EE +REE+ R E
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768
Score = 32.0 bits (73), Expect = 0.11
Identities = 13/34 (38%), Positives = 22/34 (64%)
Query: 117 EKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K+ E K + E+K K++K++E++K EE EE
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREE 763
Score = 31.6 bits (72), Expect = 0.13
Identities = 15/34 (44%), Positives = 24/34 (70%)
Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
K E K + +++ ++KKKKE+K+ EE KREE+
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEK 764
Score = 30.8 bits (70), Expect = 0.23
Identities = 16/42 (38%), Positives = 26/42 (61%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
+ + K + EEK KK+++++KKKEE+ + EEK E E
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771
Score = 30.5 bits (69), Expect = 0.33
Identities = 15/65 (23%), Positives = 30/65 (46%), Gaps = 3/65 (4%)
Query: 77 KRLTRECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK 136
+ +E G E S+ +K+GNL ++KN E++ + ++ + KK+ +
Sbjct: 1521 ENRNQEEKEPAGQGELESDKEKKGNLESVL---SNQEKNIEEDYAESDIKKRKNKKQYKS 1577
Query: 137 EREEE 141
E E
Sbjct: 1578 NTEAE 1582
Score = 28.1 bits (63), Expect = 2.2
Identities = 15/56 (26%), Positives = 27/56 (48%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E K+ G + EKK E ++K EE+ + + K+R+ +K+ + N E
Sbjct: 1526 EEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEA 1581
Score = 28.1 bits (63), Expect = 2.3
Identities = 18/49 (36%), Positives = 26/49 (53%), Gaps = 5/49 (10%)
Query: 92 FRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
FR+ K+ EK KK ++KE K++EE+ K+EEK R E
Sbjct: 725 FRNWNGKDAEFKISDSVEEKTKKKKKKE-----KKKEEEYKREEKARIE 768
Score = 27.8 bits (62), Expect = 3.1
Identities = 15/61 (24%), Positives = 29/61 (47%)
Query: 89 LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
LL + N +++ NE+K E + + +EEK+ + E E +K ++ N+
Sbjct: 1487 LLLLNLNGNENVNKKINQKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNL 1546
Query: 149 E 149
E
Sbjct: 1547 E 1547
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 34.1 bits (78), Expect = 0.021
Identities = 16/44 (36%), Positives = 27/44 (61%)
Query: 107 RQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ EKK EE+ E KKKKEE+ K+K+ +K + +K + ++
Sbjct: 6 SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQA 49
Score = 32.2 bits (73), Expect = 0.10
Identities = 17/44 (38%), Positives = 28/44 (63%)
Query: 103 GFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
G R +KK E+E E+KKK+EE+ K+KE K+ + ++E +
Sbjct: 1 GSRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAK 44
Score = 30.6 bits (69), Expect = 0.35
Identities = 12/43 (27%), Positives = 28/43 (65%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
+K E++ + ++K+EEK K++E +K K +KE + + + ++
Sbjct: 10 KKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQA 52
Score = 27.9 bits (62), Expect = 2.4
Identities = 21/72 (29%), Positives = 36/72 (50%), Gaps = 14/72 (19%)
Query: 91 EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEK------------KKKEEKER 138
E + T++E L +K++ EK K+ E K+ + +KE + K KK EK+
Sbjct: 9 EKKILTEEE--LERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKS 66
Query: 139 EEEKREEENVEE 150
+ E+EN E+
Sbjct: 67 RKRDVEDENPED 78
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 33.0 bits (76), Expect = 0.031
Identities = 18/38 (47%), Positives = 25/38 (65%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
E KK EEKEE +K+ E E K + ++REEE+R+ E
Sbjct: 128 EIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEE 165
Score = 30.3 bits (69), Expect = 0.31
Identities = 12/47 (25%), Positives = 27/47 (57%), Gaps = 4/47 (8%)
Query: 104 FRKRQGEKKKKNEEKEEE----KKKKEEEEKKKKEEKEREEEKREEE 146
+ + +++ EE E+ + K E EK+++EE++ EE++ +E
Sbjct: 125 LEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADE 171
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 32.9 bits (76), Expect = 0.031
Identities = 12/35 (34%), Positives = 22/35 (62%)
Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
+K+ K EK E+K +K + + +KK+ K+ E + R
Sbjct: 64 DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 28.3 bits (64), Expect = 1.2
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKK 132
++ +K +K + K E+KK K+EE K +
Sbjct: 71 WEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 27.9 bits (63), Expect = 1.4
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
K E K EK +K+ E+ K K EK++ +++ +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96
Score = 27.1 bits (61), Expect = 2.7
Identities = 9/32 (28%), Positives = 20/32 (62%)
Query: 119 EEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++E K E+ EKK ++ K + E+K+ ++ +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96
Score = 26.3 bits (59), Expect = 5.7
Identities = 13/32 (40%), Positives = 22/32 (68%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKE 137
K++ + +K E+K E+ K K E++K KKEE +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 32.6 bits (75), Expect = 0.041
Identities = 13/27 (48%), Positives = 19/27 (70%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEK 142
EKE++K +EE+K KEEK++ EE
Sbjct: 97 AEKEKKKAMSKEEKKAIKEEKDKLEEP 123
Score = 27.5 bits (62), Expect = 2.6
Identities = 14/25 (56%), Positives = 15/25 (60%), Gaps = 2/25 (8%)
Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEK 136
KKK KEE KK +EEK K EE
Sbjct: 101 KKKAMSKEE--KKAIKEEKDKLEEP 123
Score = 26.0 bits (58), Expect = 7.0
Identities = 10/28 (35%), Positives = 19/28 (67%)
Query: 117 EKEEEKKKKEEEEKKKKEEKEREEEKRE 144
E E+EKKK +E+KK ++E+++ +
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
Score = 26.0 bits (58), Expect = 8.1
Identities = 10/28 (35%), Positives = 18/28 (64%), Gaps = 2/28 (7%)
Query: 106 KRQGEKKK--KNEEKEEEKKKKEEEEKK 131
+ + EKKK EEK+ K++K++ E+
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 32.9 bits (75), Expect = 0.050
Identities = 15/48 (31%), Positives = 33/48 (68%), Gaps = 2/48 (4%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKE--EEEKKKKEEKEREEEKREEENVEE 150
++ + ++KKK +++ EE ++K+ E+E+ K+ EKER + +++ EE
Sbjct: 75 KRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEE 122
Score = 29.4 bits (66), Expect = 0.71
Identities = 11/56 (19%), Positives = 30/56 (53%)
Query: 91 EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
+ ++E ++ ++R + +K+ +E+KK+ EE K+ ++++ EE +
Sbjct: 86 QQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKA 141
Score = 28.6 bits (64), Expect = 1.2
Identities = 10/44 (22%), Positives = 26/44 (59%)
Query: 107 RQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+Q + ++ K+ EK++ +E+KK+ E+ ++ +++ EE
Sbjct: 93 QQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEE 136
Score = 27.5 bits (61), Expect = 3.2
Identities = 11/36 (30%), Positives = 27/36 (75%)
Query: 115 NEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
N +++++K K EE++KK+E+++ EE ++++ E+
Sbjct: 65 NRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQ 100
Score = 27.1 bits (60), Expect = 4.5
Identities = 10/41 (24%), Positives = 18/41 (43%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
++ + K + + E KK E KKK E + + E +
Sbjct: 160 AKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAK 200
Score = 26.3 bits (58), Expect = 8.5
Identities = 15/53 (28%), Positives = 22/53 (41%)
Query: 94 SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
+E KK+ + + KK E E +KK E +KK E + K E
Sbjct: 181 AEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAE 233
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 32.0 bits (73), Expect = 0.053
Identities = 11/37 (29%), Positives = 22/37 (59%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
+ E + + + +EKKK+E + K ++EK + E K+
Sbjct: 110 EQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP 146
Score = 30.5 bits (69), Expect = 0.23
Identities = 12/48 (25%), Positives = 24/48 (50%), Gaps = 7/48 (14%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKERE-------EEKREEENVEE 150
E+ ++ E E++ E + + KE+K+RE +EK + E +
Sbjct: 99 EETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP 146
Score = 29.3 bits (66), Expect = 0.49
Identities = 9/40 (22%), Positives = 23/40 (57%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+EE+E E++ +E E++ E + E +++++ V +
Sbjct: 92 SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPK 131
Score = 27.8 bits (62), Expect = 1.5
Identities = 9/38 (23%), Positives = 20/38 (52%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
++ + E ++KKK E K K E+++ + E ++
Sbjct: 109 TEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP 146
Score = 25.5 bits (56), Expect = 9.0
Identities = 14/39 (35%), Positives = 20/39 (51%), Gaps = 1/39 (2%)
Query: 94 SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKK 132
ETK E +KR+ K K +EK + + KK + K K
Sbjct: 115 PETKTESKEK-KKREVPKPKTEKEKPKTEPKKPKPSKPK 152
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 32.2 bits (74), Expect = 0.057
Identities = 15/27 (55%), Positives = 20/27 (74%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEK 142
EEKE++K +EE+K KEEKE+ EE
Sbjct: 96 EEKEKKKAMSKEEKKAIKEEKEKLEEP 122
Score = 28.0 bits (63), Expect = 1.4
Identities = 11/28 (39%), Positives = 18/28 (64%), Gaps = 2/28 (7%)
Query: 106 KRQGEKKKKN--EEKEEEKKKKEEEEKK 131
+ + EKKK EEK+ K++KE+ E+
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
Score = 26.8 bits (60), Expect = 4.4
Identities = 13/28 (46%), Positives = 21/28 (75%), Gaps = 3/28 (10%)
Query: 117 EKEEEKKK---KEEEEKKKKEEKEREEE 141
E+E+EKKK KEE++ K+E+++ EE
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 32.4 bits (74), Expect = 0.062
Identities = 16/54 (29%), Positives = 28/54 (51%)
Query: 97 KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K + +G + E + E+K K+ E E +E+++ EEE+ EEE +E
Sbjct: 134 KFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDE 187
Score = 30.5 bits (69), Expect = 0.29
Identities = 14/41 (34%), Positives = 26/41 (63%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ +K E++ K+ E E+ +++EK+ EEE+ EEE E+
Sbjct: 148 DIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDED 188
Score = 27.4 bits (61), Expect = 3.1
Identities = 12/59 (20%), Positives = 28/59 (47%)
Query: 92 FRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++ + + ++ EE++ ++K E+K K+ E E +E+ E++ EE
Sbjct: 121 GINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEE 179
Score = 26.3 bits (58), Expect = 6.9
Identities = 12/41 (29%), Positives = 30/41 (73%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
EKK K E E+ ++ E++E++++EE+E +E+ ++++ ++
Sbjct: 157 EKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 31.5 bits (72), Expect = 0.063
Identities = 12/32 (37%), Positives = 21/32 (65%)
Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
+ EE+K +EE +++ +E KERE EK++
Sbjct: 95 AQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126
Score = 27.6 bits (62), Expect = 1.4
Identities = 11/35 (31%), Positives = 24/35 (68%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+E+ E ++ ++EE+K +EE +R+ E+ +E E+
Sbjct: 90 DERAEAQRARDEEKKLDEEEAKRQHEEAKEREREK 124
Score = 25.7 bits (57), Expect = 7.3
Identities = 10/32 (31%), Positives = 20/32 (62%)
Query: 115 NEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
+E E ++ + EE++ ++E K + EE +E E
Sbjct: 90 DERAEAQRARDEEKKLDEEEAKRQHEEAKERE 121
Score = 25.7 bits (57), Expect = 8.0
Identities = 11/30 (36%), Positives = 19/30 (63%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKE 134
R R EKK EE + + ++ +E E++KK+
Sbjct: 97 RARDEEKKLDEEEAKRQHEEAKEREREKKK 126
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 32.5 bits (74), Expect = 0.068
Identities = 15/45 (33%), Positives = 25/45 (55%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K+Q E++ K + E KKK E +KK + E + + E + + EE
Sbjct: 149 KKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEE 193
Score = 32.1 bits (73), Expect = 0.090
Identities = 16/44 (36%), Positives = 24/44 (54%), Gaps = 4/44 (9%)
Query: 106 KRQGEKKKKNEEKE----EEKKKKEEEEKKKKEEKEREEEKREE 145
+Q E+ K E++ EE K K+ E K K E E E++ +EE
Sbjct: 104 AKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEE 147
Score = 31.7 bits (72), Expect = 0.11
Identities = 7/45 (15%), Positives = 25/45 (55%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++Q E+ +K E+ ++K+ E+ ++ ++ E+ ++ ++
Sbjct: 74 EQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQ 118
Score = 30.6 bits (69), Expect = 0.24
Identities = 16/44 (36%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEE--KEREEEKREEE 146
+++Q E+ K + E + K + E EKK KEE K+ EEE + +
Sbjct: 117 KQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKA 160
Score = 29.8 bits (67), Expect = 0.49
Identities = 17/58 (29%), Positives = 28/58 (48%), Gaps = 5/58 (8%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEE--KEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E ++ L R + K+ E+ K+ E+K+K+ EE K K+ E K + E E
Sbjct: 87 EQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEA---KAKAEAEAE 141
Score = 29.8 bits (67), Expect = 0.53
Identities = 8/42 (19%), Positives = 26/42 (61%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
++RQ + +++ EE E+++ ++ +K+ ++ E+ ++ E
Sbjct: 67 QERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAE 108
Score = 29.0 bits (65), Expect = 1.0
Identities = 12/40 (30%), Positives = 22/40 (55%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
K + + K + + E +KK +EE KK+ EE+ + + E
Sbjct: 124 AKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAE 163
Score = 28.6 bits (64), Expect = 1.2
Identities = 15/46 (32%), Positives = 22/46 (47%), Gaps = 5/46 (10%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEE-----EEKKKKEEKEREEEKREE 145
+ +Q + K E E EKK KEE EE+ K + ++K E
Sbjct: 125 KAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAE 170
Score = 27.5 bits (61), Expect = 2.9
Identities = 15/46 (32%), Positives = 24/46 (52%), Gaps = 3/46 (6%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+K + E KK+ EEE K K E KKK + +++ + E + E
Sbjct: 142 KKAKEEAKKQ---AEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAE 184
Score = 27.1 bits (60), Expect = 3.5
Identities = 13/47 (27%), Positives = 28/47 (59%), Gaps = 5/47 (10%)
Query: 105 RKRQGEKKKKNEE---KEEEKKKKEEEEKKKKEE--KEREEEKREEE 146
+ + EK++ E+ KE E++ E+ K+ E+ K+ EE++++ E
Sbjct: 76 QAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAE 122
Score = 27.1 bits (60), Expect = 4.5
Identities = 15/56 (26%), Positives = 28/56 (50%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E + +K+ E K K + + K + E E+K K+E K++ EE+ + + E
Sbjct: 108 EQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAE 163
Score = 26.7 bits (59), Expect = 5.8
Identities = 17/50 (34%), Positives = 20/50 (40%), Gaps = 9/50 (18%)
Query: 106 KRQGEKKKKNEEK----EEEKKKKEEEEKKKKEEK-----EREEEKREEE 146
K E KKK E E E K K E + K K E+ E + K E
Sbjct: 159 KAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAE 208
Score = 26.3 bits (58), Expect = 6.3
Identities = 10/41 (24%), Positives = 27/41 (65%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
++++ KK+ E +++ +++ EE EK++ E+ R++E +
Sbjct: 57 QQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQR 97
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 32.1 bits (74), Expect = 0.087
Identities = 10/30 (33%), Positives = 18/30 (60%)
Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
+K+ K K ++KK+K+E + K + K R
Sbjct: 62 DKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 27.9 bits (63), Expect = 1.8
Identities = 5/30 (16%), Positives = 18/30 (60%)
Query: 119 EEEKKKKEEEEKKKKEEKEREEEKREEENV 148
++E K + +KKK++++ + + + + +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPRL 92
Score = 27.9 bits (63), Expect = 2.3
Identities = 7/26 (26%), Positives = 14/26 (53%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKK 131
+ K +K +EK+E K K + + +
Sbjct: 66 LKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 27.1 bits (61), Expect = 3.8
Identities = 8/27 (29%), Positives = 16/27 (59%)
Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREE 140
K E K K +K++E+++ K K + +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSK 89
Score = 26.3 bits (59), Expect = 6.0
Identities = 7/36 (19%), Positives = 19/36 (52%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
K++ ++K+ K + +KK+EK+ + + +
Sbjct: 53 KEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKS 88
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 30.9 bits (70), Expect = 0.089
Identities = 16/26 (61%), Positives = 20/26 (76%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEE 141
EEK +++EEEKKK+EEKE EEE
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEE 96
Score = 28.2 bits (63), Expect = 0.70
Identities = 13/26 (50%), Positives = 21/26 (80%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEE 141
EE+ ++KEEE+KK++E++E EEE
Sbjct: 72 AAAEEKAEEKEEEKKKEEEKEEEEEE 97
Score = 27.1 bits (60), Expect = 1.9
Identities = 14/25 (56%), Positives = 22/25 (88%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKE 134
++K EEKEEEKKK+EE+E++++E
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEE 97
Score = 27.1 bits (60), Expect = 2.3
Identities = 10/24 (41%), Positives = 19/24 (79%)
Query: 123 KKKEEEEKKKKEEKEREEEKREEE 146
+E+ E+K++E+K+ EE++ EEE
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEE 96
Score = 26.3 bits (58), Expect = 3.8
Identities = 15/31 (48%), Positives = 22/31 (70%), Gaps = 3/31 (9%)
Query: 119 EEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
E+K +E+EE+KKKEE E+E+ EEE +
Sbjct: 73 AAEEKAEEKEEEKKKEE---EKEEEEEEALA 100
Score = 25.2 bits (55), Expect = 9.6
Identities = 10/23 (43%), Positives = 20/23 (86%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEE 128
+ + E+K++ ++KEEEK+++EEE
Sbjct: 75 EEKAEEKEEEKKKEEEKEEEEEE 97
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 32.0 bits (73), Expect = 0.099
Identities = 17/37 (45%), Positives = 30/37 (81%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
EK+ +EE+EEEK++K+EEE+K +++E +E+ E+E
Sbjct: 32 EKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKE 68
Score = 30.5 bits (69), Expect = 0.30
Identities = 16/41 (39%), Positives = 34/41 (82%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E++++ EEK+EE++K ++E++ EE+E+EE+K++ + V+E
Sbjct: 39 EEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKE 79
Score = 30.1 bits (68), Expect = 0.37
Identities = 14/41 (34%), Positives = 28/41 (68%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
+++ E++K +++EE +++E+EEKKKK +K +E E
Sbjct: 46 EKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86
Score = 30.1 bits (68), Expect = 0.43
Identities = 17/37 (45%), Positives = 27/37 (72%)
Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K EKE +++EEE+++KKEE+E+ +K EE + EE
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEE 65
Score = 29.7 bits (67), Expect = 0.48
Identities = 17/34 (50%), Positives = 25/34 (73%)
Query: 117 EKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
EKE EK+ +EEE+++KEEK+ EEEK ++ E
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEV 61
Score = 29.7 bits (67), Expect = 0.54
Identities = 14/41 (34%), Positives = 30/41 (73%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+++KK EE++ K++E +E+++KEEK+++ +K +E E
Sbjct: 44 KEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEW 84
Score = 29.4 bits (66), Expect = 0.67
Identities = 13/41 (31%), Positives = 31/41 (75%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
EK++K EE+E+ K+EE ++++++E+++++ K+ +E E
Sbjct: 43 EKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTE 83
Score = 28.2 bits (63), Expect = 1.8
Identities = 14/37 (37%), Positives = 29/37 (78%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
+K+ +E+EEE+K++++EE++K +KE E ++ EE+
Sbjct: 31 VEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEK 67
Score = 27.4 bits (61), Expect = 2.9
Identities = 15/40 (37%), Positives = 28/40 (70%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
E +K+ ++EEE++K+E++E+++K + EE EEE E
Sbjct: 30 EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEE 69
Score = 27.0 bits (60), Expect = 4.2
Identities = 14/41 (34%), Positives = 30/41 (73%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E+KK+ EEK +K+++ +EE++K+E+K++ ++ +E E
Sbjct: 45 EEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWE 85
Score = 27.0 bits (60), Expect = 4.9
Identities = 14/41 (34%), Positives = 31/41 (75%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
EK+ + E +EE+++++EE+K+++E+ +EE+ +EE +E
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKE 68
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 31.6 bits (72), Expect = 0.11
Identities = 16/64 (25%), Positives = 35/64 (54%), Gaps = 10/64 (15%)
Query: 87 NGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
G+ +F S+ K E ++K+ + E ++ K++EE + K++KE EE K ++
Sbjct: 63 TGVKDFLSDQKPE----------DEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQK 112
Query: 147 NVEE 150
+++
Sbjct: 113 KIDQ 116
Score = 29.7 bits (67), Expect = 0.55
Identities = 10/51 (19%), Positives = 24/51 (47%), Gaps = 1/51 (1%)
Query: 92 FRSE-TKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
F SE G F Q + +K + ++ +++++ + +K++E E
Sbjct: 55 FNSEFAGGTGVKDFLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEME 105
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 31.7 bits (73), Expect = 0.12
Identities = 16/50 (32%), Positives = 26/50 (52%), Gaps = 1/50 (2%)
Query: 102 LGFRKRQGEKKKKNEEK-EEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
L K G + K + EEE+ +KEE +++ +EE + E E+ EE
Sbjct: 21 LEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEE 70
Score = 27.9 bits (63), Expect = 2.6
Identities = 16/57 (28%), Positives = 26/57 (45%)
Query: 94 SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E KE + ++ + K+ E + EEEE +K+E K EE+ + E E
Sbjct: 7 YELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEA 63
Score = 26.4 bits (59), Expect = 8.9
Identities = 12/52 (23%), Positives = 24/52 (46%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
+ K+ + +G K+ K + ++ + KKKK++K E+ RE
Sbjct: 124 KGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPRE 175
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 30.7 bits (70), Expect = 0.13
Identities = 19/55 (34%), Positives = 33/55 (60%), Gaps = 8/55 (14%)
Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKE--------REEEKREEENVEE 150
+R + G E++EE++++KEE+E+K+++EKE E EKR EE +
Sbjct: 4 YRDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKA 58
Score = 25.7 bits (57), Expect = 7.3
Identities = 15/58 (25%), Positives = 37/58 (63%), Gaps = 7/58 (12%)
Query: 92 FRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKK-------EEEEKKKKEEKEREEEK 142
+R ++ + ++ ++ + E++K+ +E++EEK+K+ +EE +K+ EE E+ + K
Sbjct: 4 YRDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNK 61
Score = 25.7 bits (57), Expect = 9.0
Identities = 13/43 (30%), Positives = 28/43 (65%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
R+ + +K++ E KEE++K+ + +K+E ++R EE + +N
Sbjct: 18 REEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKN 60
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 31.6 bits (71), Expect = 0.13
Identities = 17/56 (30%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 97 KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEE--KEREEEKREEENVEE 150
K E + + KK + ++K+ E+ KK EE++KK E K+ EE ++ E +++
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709
Score = 30.5 bits (68), Expect = 0.33
Identities = 21/48 (43%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 105 RKRQGEKKKKNEE--KEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+K+ E KKK EE K +E KKK EE KK E K++ EE ++ E ++
Sbjct: 1417 KKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKK 1464
Score = 30.5 bits (68), Expect = 0.40
Identities = 15/54 (27%), Positives = 28/54 (51%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
++E K+ +K+ E KK E + E + +E + +E+ E E+K+EE
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA 1376
Score = 29.7 bits (66), Expect = 0.64
Identities = 17/54 (31%), Positives = 33/54 (61%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
++E K+ + +K + KKK +E K+ + KK+ +E KK EE ++ +E ++ E
Sbjct: 1478 KAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAE 1531
Score = 29.7 bits (66), Expect = 0.70
Identities = 18/44 (40%), Positives = 28/44 (63%), Gaps = 3/44 (6%)
Query: 106 KRQGEKKKKNEE---KEEEKKKKEEEEKKKKEEKEREEEKREEE 146
K++ E+ KK +E K EE KKK +E KK E K++ +E ++ E
Sbjct: 1476 KKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAE 1519
Score = 29.3 bits (65), Expect = 0.89
Identities = 16/40 (40%), Positives = 27/40 (67%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
+ EK + E+K+EE KKK + KKK EEK++ +E +++
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399
Score = 29.0 bits (64), Expect = 1.0
Identities = 19/56 (33%), Positives = 31/56 (55%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E KK + + KK+ E K+ E+ KK+E E+KKK E+ ++ E+ + EE
Sbjct: 1679 EAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE 1734
Score = 29.0 bits (64), Expect = 1.0
Identities = 23/59 (38%), Positives = 33/59 (55%), Gaps = 3/59 (5%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNE---EKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E KK+ +K EKKK E +EE K K EE KK +E+K++ EE ++ E E+
Sbjct: 1630 EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK 1688
Score = 29.0 bits (64), Expect = 1.1
Identities = 19/56 (33%), Positives = 32/56 (57%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E KK+ + + +KK + +K EE KKK EE KK E K++ EE ++ + ++
Sbjct: 1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKK 1490
Score = 29.0 bits (64), Expect = 1.2
Identities = 16/53 (30%), Positives = 33/53 (62%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
++E K+ + +K + KK + +K+ E+ KK +E KKK EE ++ +E +++
Sbjct: 1439 KAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKK 1491
Score = 28.6 bits (63), Expect = 1.4
Identities = 15/56 (26%), Positives = 32/56 (57%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E KK+ + + +KK + +K+ ++ KK E KKK +E ++ EE ++ + ++
Sbjct: 1474 EAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKK 1529
Score = 28.6 bits (63), Expect = 1.5
Identities = 14/46 (30%), Positives = 25/46 (54%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E + E+ E +KKKEE +KK K++ EEK++ + ++
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKK 1398
Score = 28.6 bits (63), Expect = 1.5
Identities = 19/60 (31%), Positives = 35/60 (58%), Gaps = 4/60 (6%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEE----EEKKKKEEKEREEEKREEENVEE 150
E KK+ + + + +KK +K EE KK +E EE KK +E ++ EEK++ + +++
Sbjct: 1494 EAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK 1553
Score = 28.6 bits (63), Expect = 1.6
Identities = 16/41 (39%), Positives = 28/41 (68%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
++ ++ KK EEK++ + K+ EE KK EEK++ EE ++ E
Sbjct: 1533 AKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAE 1573
Score = 28.6 bits (63), Expect = 1.6
Identities = 18/40 (45%), Positives = 29/40 (72%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
K+ EKKK +E K+ E+KKK +E KKK EE ++ +E +++
Sbjct: 1284 KKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKK 1323
Score = 28.2 bits (62), Expect = 1.9
Identities = 18/56 (32%), Positives = 33/56 (58%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E KK+ + + +KK + +K +E KKK EE KK +E K++ EE ++ + ++
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKK 1477
Score = 28.2 bits (62), Expect = 2.0
Identities = 17/51 (33%), Positives = 28/51 (54%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
E KK + + +KK +K KKK +E KKK EEK++ +E +++
Sbjct: 1389 EKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKK 1439
Score = 28.2 bits (62), Expect = 2.2
Identities = 18/41 (43%), Positives = 28/41 (68%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
K+ E KK +E K+ E+ KK +E KK +E+K+ +E K+ EE
Sbjct: 1516 KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEE 1556
Score = 28.2 bits (62), Expect = 2.3
Identities = 20/54 (37%), Positives = 34/54 (62%)
Query: 97 KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K E + + +K+ + ++K+ E+ KK+EEEKKK ++EEEK+ EE +E
Sbjct: 1723 KAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKE 1776
Score = 28.2 bits (62), Expect = 2.4
Identities = 18/49 (36%), Positives = 29/49 (59%), Gaps = 3/49 (6%)
Query: 105 RKRQGEKKKKNEE---KEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+K+ E KKK EE K +E KK +KK E K++ EEK++ + ++
Sbjct: 1390 KKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKK 1438
Score = 27.8 bits (61), Expect = 2.7
Identities = 22/61 (36%), Positives = 37/61 (60%), Gaps = 3/61 (4%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEE---KEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
++E +K+ +K++ E+KKK EE EEE K K EE KK EE +++ E+ ++ +
Sbjct: 1627 KAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEED 1686
Query: 150 E 150
E
Sbjct: 1687 E 1687
Score = 27.8 bits (61), Expect = 2.7
Identities = 21/63 (33%), Positives = 37/63 (58%), Gaps = 10/63 (15%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEE----------KEEEKKKKEEEEKKKKEEKEREEEK 142
+ + + +K++ E+KKK EE K EE KK+ EE+KKK EE +++EE+
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE 1755
Query: 143 REE 145
+++
Sbjct: 1756 KKK 1758
Score = 27.8 bits (61), Expect = 2.8
Identities = 17/48 (35%), Positives = 29/48 (60%), Gaps = 2/48 (4%)
Query: 105 RKRQGEKKKKNEEKE--EEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+K+ E KK K+ +E KKK EE+KK E K++ EE ++ + ++
Sbjct: 1404 KKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKK 1451
Score = 27.8 bits (61), Expect = 2.8
Identities = 16/55 (29%), Positives = 25/55 (45%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
E KK + + +KK +K+ E+ KK E K + E +E + EE E
Sbjct: 1313 EAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367
Score = 27.8 bits (61), Expect = 2.9
Identities = 18/44 (40%), Positives = 26/44 (59%), Gaps = 3/44 (6%)
Query: 106 KRQGEKKKKNEE---KEEEKKKKEEEEKKKKEEKEREEEKREEE 146
K++ E+ KK +E K EE KKK + KKK EE ++ E + E
Sbjct: 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAE 1351
Score = 27.8 bits (61), Expect = 2.9
Identities = 19/56 (33%), Positives = 32/56 (57%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E KK + Q +KK+ E+K+ E+ KK EEE K K +E ++ + +++ EE
Sbjct: 1624 ELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEE 1679
Score = 27.8 bits (61), Expect = 3.1
Identities = 21/56 (37%), Positives = 34/56 (60%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E KK + + +K ++ ++K E+ KKKE EEKKK EE ++ EE+ + + EE
Sbjct: 1611 EAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEE 1666
Score = 27.8 bits (61), Expect = 3.1
Identities = 25/76 (32%), Positives = 44/76 (57%)
Query: 75 VKKRLTRECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKE 134
+KK+ E + L + E K + ++ + +KKK E K++E++KK+ KK+E
Sbjct: 1707 LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEE 1766
Query: 135 EKEREEEKREEENVEE 150
EK+ EE ++E+E V E
Sbjct: 1767 EKKAEEIRKEKEAVIE 1782
Score = 27.4 bits (60), Expect = 3.6
Identities = 20/57 (35%), Positives = 33/57 (57%), Gaps = 2/57 (3%)
Query: 91 EFRSETKKEGNLGFRKRQGEKKKKNE--EKEEEKKKKEEEEKKKKEEKEREEEKREE 145
E + E K+ +K+ EKKK +E +K EE KKK +E KK K++ +E +++
Sbjct: 1370 EKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKK 1426
Score = 27.4 bits (60), Expect = 4.2
Identities = 17/51 (33%), Positives = 30/51 (58%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
E KK+ + + +KK + +K +E KKK EE KKK +E ++ E +++
Sbjct: 1461 EAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
Score = 27.0 bits (59), Expect = 4.6
Identities = 19/54 (35%), Positives = 33/54 (61%)
Query: 97 KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
KKE + + +KK+ E+K+ E+ KK EEE K K E+ ++E + +++ EE
Sbjct: 1695 KKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEE 1748
Score = 27.0 bits (59), Expect = 5.0
Identities = 18/56 (32%), Positives = 34/56 (60%)
Query: 90 LEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
L+ + KK+ + +K + +KK +K+ E+ KK +E KKK EE ++ EE +++
Sbjct: 1410 LKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKK 1465
Score = 27.0 bits (59), Expect = 5.2
Identities = 19/48 (39%), Positives = 31/48 (64%), Gaps = 3/48 (6%)
Query: 106 KRQGEKKKKNEE---KEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K++ E+ KK EE K EE KK +E +KK +E K+ +E K++ E ++
Sbjct: 1450 KKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKK 1497
Score = 27.0 bits (59), Expect = 5.4
Identities = 18/53 (33%), Positives = 33/53 (62%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
++E KK+ + +K + KK +K+ E+ KK EE KKK EE ++ +E +++
Sbjct: 1426 KAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKK 1478
Score = 27.0 bits (59), Expect = 5.5
Identities = 14/45 (31%), Positives = 29/45 (64%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++ ++ KK EE ++ + K+ EEKKK +E ++ EE ++ E ++
Sbjct: 1521 AKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKK 1565
Score = 27.0 bits (59), Expect = 5.8
Identities = 18/51 (35%), Positives = 30/51 (58%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
E KK+ + + +KK + +K +E KKK EE KK E K++ EE +++
Sbjct: 1448 EAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKK 1498
Score = 26.6 bits (58), Expect = 6.7
Identities = 19/46 (41%), Positives = 32/46 (69%), Gaps = 2/46 (4%)
Query: 107 RQGEKKKKNEE--KEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++ E+KKK +E K EEKKK +E +KK +E K+ +E K++ E ++
Sbjct: 1284 KKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1329
Score = 26.3 bits (57), Expect = 8.0
Identities = 19/42 (45%), Positives = 26/42 (61%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
RK + KK + +K EE KK EE KK +EE+ EE ++ EE
Sbjct: 1218 RKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEE 1259
Score = 26.3 bits (57), Expect = 8.3
Identities = 13/39 (33%), Positives = 22/39 (56%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
E K EE ++ + K EE +K +E K++ E+ R+ E
Sbjct: 1095 EAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEA 1133
Score = 26.3 bits (57), Expect = 8.3
Identities = 17/48 (35%), Positives = 32/48 (66%), Gaps = 2/48 (4%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEE--KEREEEKREEENVEE 150
+ EKKK+ +K+ + KK+ EEKKK +E K+ EE+K++ + +++
Sbjct: 1365 KAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKK 1412
Score = 26.3 bits (57), Expect = 8.3
Identities = 18/57 (31%), Positives = 25/57 (43%), Gaps = 1/57 (1%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKE-EEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E KK+ +K KKK E K+ E K E E + E E+ + E+ EE
Sbjct: 1319 EAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE 1375
Score = 26.3 bits (57), Expect = 9.2
Identities = 16/45 (35%), Positives = 26/45 (57%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++ E +K + K+ E KK EE KK EE ++ EE+R E + +
Sbjct: 1212 RKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRK 1256
Score = 26.3 bits (57), Expect = 9.4
Identities = 19/57 (33%), Positives = 33/57 (57%), Gaps = 3/57 (5%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEE---KEEEKKKKEEEEKKKKEEKEREEEKREEE 146
+E +E K++ E KKK + K EEKKK +E +KK +E+K++ +E ++
Sbjct: 1358 EAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAA 1414
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 31.2 bits (71), Expect = 0.14
Identities = 13/27 (48%), Positives = 19/27 (70%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEK 142
+KEE+K +EE+K K EKE+ EE+
Sbjct: 96 AQKEEKKAMSKEEKKAIKAEKEKLEEE 122
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 31.4 bits (71), Expect = 0.14
Identities = 13/39 (33%), Positives = 29/39 (74%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
RK+ ++ +K K++EK++ E+ +K K++ ++++EEKR
Sbjct: 152 RKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKR 190
Score = 26.8 bits (59), Expect = 4.8
Identities = 18/48 (37%), Positives = 30/48 (62%), Gaps = 3/48 (6%)
Query: 94 SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
KK + RKR G++K+K + E+ +K +E+ +KK+EEK R +E
Sbjct: 150 LSRKKSDDEH-RKRSGKQKEK--RRVEDSQKHKEDRRKKQEEKRRNDE 194
>gnl|CDD|176990 CHL00049, ndhB, NADH dehydrogenase subunit 2.
Length = 494
Score = 31.5 bits (72), Expect = 0.14
Identities = 15/33 (45%), Positives = 19/33 (57%), Gaps = 8/33 (24%)
Query: 24 GIPAGESNGRRGY--MLTYIIAYI----RDFAC 50
GI G+SN GY M+TY++ YI FAC
Sbjct: 322 GIIVGDSND--GYASMITYMLFYIFMNLGTFAC 352
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 31.1 bits (71), Expect = 0.14
Identities = 10/34 (29%), Positives = 17/34 (50%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKERE 139
K E + E K++E E +KKE K+++
Sbjct: 163 KLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 28.1 bits (63), Expect = 1.4
Identities = 11/33 (33%), Positives = 16/33 (48%)
Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
K + EE ++ E KEE+ E EK+E
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKES 192
Score = 27.7 bits (62), Expect = 2.2
Identities = 9/29 (31%), Positives = 16/29 (55%)
Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKK 132
F + E +E+E + +K+E +KKK
Sbjct: 168 FAAEEAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 31.2 bits (70), Expect = 0.17
Identities = 13/25 (52%), Positives = 21/25 (84%)
Query: 119 EEEKKKKEEEEKKKKEEKEREEEKR 143
EEE++ ++ EK K+E+K+RE+EKR
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKR 109
Score = 28.9 bits (64), Expect = 0.98
Identities = 14/32 (43%), Positives = 23/32 (71%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
EKK K +K+E+K+K++E +K KK+E E +
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKS 227
Score = 28.9 bits (64), Expect = 1.1
Identities = 11/33 (33%), Positives = 26/33 (78%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
E+K ++ KKKE++EK+K+ +K++++E +++
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSL 228
Score = 28.5 bits (63), Expect = 1.4
Identities = 14/43 (32%), Positives = 27/43 (62%)
Query: 107 RQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
R E K E+ + +K+ ++ KKKE+KE+E+E+ +++ E
Sbjct: 179 RNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKE 221
Score = 27.7 bits (61), Expect = 2.5
Identities = 10/30 (33%), Positives = 23/30 (76%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKE 137
+ KK K +EK+E++K++++++KK+ E +
Sbjct: 197 KKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226
Score = 27.0 bits (59), Expect = 4.4
Identities = 11/37 (29%), Positives = 26/37 (70%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
EK +++ KK K++E+K+K++E++++++K E
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEG 224
Score = 27.0 bits (59), Expect = 5.6
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
+ + ++ E E+ KK + KKKK+ KE+EE+K+
Sbjct: 259 PDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKK 294
Score = 26.6 bits (58), Expect = 7.5
Identities = 16/47 (34%), Positives = 28/47 (59%)
Query: 94 SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
+ET K G +K KK ++KE+++K+KE ++ KKKE + +
Sbjct: 181 AETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKS 227
Score = 26.2 bits (57), Expect = 9.9
Identities = 12/33 (36%), Positives = 22/33 (66%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
+ + KK +K++++K+KEE++KKKK R
Sbjct: 270 EAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 31.4 bits (71), Expect = 0.17
Identities = 17/39 (43%), Positives = 27/39 (69%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
K++K+EEK EE K +E +E+ ++EEK E E+ N+E
Sbjct: 283 KEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIE 321
Score = 27.5 bits (61), Expect = 3.3
Identities = 14/32 (43%), Positives = 22/32 (68%)
Query: 119 EEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+E K+KKEE++++K EE + EE E E E+
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEK 308
Score = 27.1 bits (60), Expect = 4.5
Identities = 11/30 (36%), Positives = 20/30 (66%)
Query: 121 EKKKKEEEEKKKKEEKEREEEKREEENVEE 150
EKK++++EEK ++ + E +E+ EEE
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFEEEEKGF 310
Score = 26.7 bits (59), Expect = 7.1
Identities = 12/31 (38%), Positives = 22/31 (70%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
E KE++++K EE+ ++ K E+ EE + EE+
Sbjct: 278 ELKEKKEEKDEEKSEEVKTEEVDEEFEEEEK 308
Score = 26.3 bits (58), Expect = 8.4
Identities = 11/39 (28%), Positives = 23/39 (58%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
+KK+ +++E+ ++ K EE ++ EE+E+ E V
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVN 319
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 31.2 bits (71), Expect = 0.17
Identities = 13/39 (33%), Positives = 29/39 (74%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
+R E + +E E++++EE+E++KK++KER++ ++E
Sbjct: 167 RRNAELAETQRLEEAERRRREEKERRKKQDKERKQREKE 205
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 30.8 bits (70), Expect = 0.18
Identities = 18/54 (33%), Positives = 35/54 (64%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
++ E + + EK+ + EE+E+++KKK++E KK+K+EK+ ++EK E
Sbjct: 127 ELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEP 180
Score = 29.3 bits (66), Expect = 0.60
Identities = 14/37 (37%), Positives = 30/37 (81%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
EKK+K ++KE +K+KKE+++KK+K + + +K++++
Sbjct: 153 EKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKK 189
Score = 27.8 bits (62), Expect = 1.9
Identities = 14/52 (26%), Positives = 30/52 (57%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
SE + + E++KK ++K++E KK+++E+K KKE+ + ++
Sbjct: 134 TSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185
Score = 27.8 bits (62), Expect = 2.0
Identities = 13/38 (34%), Positives = 29/38 (76%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
E+KK+ ++K+E KK+K+E++ KK++ E + K++++
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKK 189
Score = 27.0 bits (60), Expect = 3.7
Identities = 12/39 (30%), Positives = 31/39 (79%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
K++ +KKK+ +++++EKK K+E+ + K K+++++K++
Sbjct: 154 KKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
Score = 26.6 bits (59), Expect = 5.1
Identities = 13/50 (26%), Positives = 31/50 (62%)
Query: 91 EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
+ +E +K++ E KK+ +EK+++K+K E + KK++K++++
Sbjct: 143 VEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
Score = 26.2 bits (58), Expect = 6.6
Identities = 13/39 (33%), Positives = 29/39 (74%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
+K + +KK+ +EK+E+K KKE+ + K +K+++++K+
Sbjct: 154 KKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
Score = 25.8 bits (57), Expect = 9.3
Identities = 16/42 (38%), Positives = 29/42 (69%), Gaps = 4/42 (9%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
+ + +K+ E +EEEKK+K KKKKE K+ ++EK++++
Sbjct: 138 ETTAKVEKEAEVEEEEKKEK----KKKKEVKKEKKEKKDKKE 175
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 31.3 bits (71), Expect = 0.18
Identities = 15/65 (23%), Positives = 26/65 (40%), Gaps = 11/65 (16%)
Query: 82 ECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKK----KEEEEKKKKEEKE 137
E TV E ++ K E + K NEEK +++ EE K+ +E +
Sbjct: 885 ENTVKGQYSELQNHHKTEAL-------SQNNKYNEEKSAQERMPGADSPEELMKRAKEYQ 937
Query: 138 REEEK 142
+ +
Sbjct: 938 DKHKG 942
>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e.
Length = 108
Score = 29.9 bits (68), Expect = 0.20
Identities = 14/36 (38%), Positives = 26/36 (72%), Gaps = 2/36 (5%)
Query: 104 FRKRQGEKKKKNEEK--EEEKKKKEEEEKKKKEEKE 137
F++ + +KKKK+E + E+K+KKE E++K ++K
Sbjct: 37 FKREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 30.9 bits (70), Expect = 0.20
Identities = 15/41 (36%), Positives = 28/41 (68%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+K +E EE +++ EEE+++ +E E+E E++ E NVE+
Sbjct: 80 DKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEK 120
Score = 29.7 bits (67), Expect = 0.50
Identities = 19/46 (41%), Positives = 28/46 (60%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ E + E KEEEK+ E+K+ K + E+E+E+ EEEN EE
Sbjct: 53 KSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEE 98
Score = 28.9 bits (65), Expect = 0.91
Identities = 17/57 (29%), Positives = 32/57 (56%)
Query: 94 SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+E ++ EK+ N E +E+K E+E+++ +EE E E+E+ +EN +E
Sbjct: 53 KSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKE 109
Score = 28.5 bits (64), Expect = 1.1
Identities = 13/42 (30%), Positives = 25/42 (59%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
K EK+ + E+E E++ +E ++ +KE +E+ E E+E
Sbjct: 80 DKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKE 121
Score = 28.2 bits (63), Expect = 1.5
Identities = 16/61 (26%), Positives = 33/61 (54%), Gaps = 4/61 (6%)
Query: 94 SETKKEGNLGFRKRQGEKKKKNEEKEE----EKKKKEEEEKKKKEEKEREEEKREEENVE 149
+ ++E + E ++ EE++E E K+ + + +K+ EE E E E+ +EE+ +
Sbjct: 45 AADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSD 104
Query: 150 E 150
E
Sbjct: 105 E 105
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 31.0 bits (70), Expect = 0.22
Identities = 15/47 (31%), Positives = 27/47 (57%)
Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
F K++ ++KK+ +E E K+E EE+ K + E+E E ++E
Sbjct: 221 FFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDE 267
Score = 28.6 bits (64), Expect = 1.3
Identities = 13/35 (37%), Positives = 23/35 (65%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
++ +EE+ EEK+K++ + KK E E E+E+ E
Sbjct: 279 ERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEI 313
Score = 26.3 bits (58), Expect = 6.5
Identities = 14/55 (25%), Positives = 27/55 (49%)
Query: 90 LEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
+ ++ + K GE+ EE EE++K+K + KK E+++ +EE
Sbjct: 259 SAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEI 313
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 30.8 bits (70), Expect = 0.23
Identities = 11/41 (26%), Positives = 23/41 (56%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E ++ E +E +K E+EK + E+E+ E +E +++
Sbjct: 3 ELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDK 43
Score = 29.6 bits (67), Expect = 0.57
Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 5/63 (7%)
Query: 89 LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKK-KEEKEREEEKREEEN 147
L E +E KE RK ++K E+E+ + +EE K E +R EE +E
Sbjct: 4 LREALAELAKE----LRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELE 59
Query: 148 VEE 150
+
Sbjct: 60 AKP 62
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 30.7 bits (70), Expect = 0.23
Identities = 14/36 (38%), Positives = 17/36 (47%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
GE KN K+ K K+ EEEK KEE +
Sbjct: 4 NGEPLSKNALKKRLKAKQAEEEKAAKEEAKAAAAAA 39
Score = 27.3 bits (61), Expect = 4.0
Identities = 11/52 (21%), Positives = 20/52 (38%)
Query: 99 EGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K +K+ K ++ EEEK KEE + + ++ +E
Sbjct: 3 SNGEPLSKNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDE 54
Score = 26.5 bits (59), Expect = 7.7
Identities = 12/61 (19%), Positives = 22/61 (36%), Gaps = 3/61 (4%)
Query: 86 SNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
SNG ++ KK K+ E+K EE + + + +K +E
Sbjct: 3 SNGEPLSKNALKKRL---KAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPT 59
Query: 146 E 146
+
Sbjct: 60 Q 60
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 31.0 bits (70), Expect = 0.25
Identities = 20/56 (35%), Positives = 33/56 (58%)
Query: 91 EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
S T E + E++ EE+EEE++++EEE++ ++EE E EEE+ E E
Sbjct: 422 SKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVE 477
Score = 29.9 bits (67), Expect = 0.46
Identities = 16/41 (39%), Positives = 30/41 (73%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E ++ EE+EEE++++E+E ++++ E E EEE+ E +N E
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSE 483
Score = 29.1 bits (65), Expect = 0.87
Identities = 16/38 (42%), Positives = 27/38 (71%)
Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++E EEE+ +EEEE++++EE+E +E + EE EE
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEE 471
Score = 29.1 bits (65), Expect = 1.00
Identities = 14/42 (33%), Positives = 29/42 (69%)
Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E+ + EE+EEE++++EE+E +++E ++ EEE+ E +
Sbjct: 441 EEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGS 482
Score = 28.7 bits (64), Expect = 1.2
Identities = 16/37 (43%), Positives = 27/37 (72%)
Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ EE EE++++EEEE+++++E E EE + EEE E
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEV 476
Score = 28.7 bits (64), Expect = 1.5
Identities = 17/46 (36%), Positives = 29/46 (63%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ E +EEE ++EEEE++++EE+E+E E+ E E+ EE
Sbjct: 427 TSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEE 472
Score = 28.3 bits (63), Expect = 1.6
Identities = 16/42 (38%), Positives = 29/42 (69%)
Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++ ++ E EEE++++EEEE++++E +E E E EEE E
Sbjct: 436 SQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVE 477
Score = 28.3 bits (63), Expect = 1.8
Identities = 21/74 (28%), Positives = 40/74 (54%)
Query: 76 KKRLTRECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEE 135
+KR RE ++ S+ E +++ +EEE++++EEEE++++ E
Sbjct: 404 RKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESE 463
Query: 136 KEREEEKREEENVE 149
+E E++ EEE VE
Sbjct: 464 EEEGEDEEEEEEVE 477
Score = 27.6 bits (61), Expect = 3.0
Identities = 16/40 (40%), Positives = 30/40 (75%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
E++++ EE+EEE++ +EEE + ++EE+E E + EE +E
Sbjct: 448 EEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
Score = 27.2 bits (60), Expect = 4.1
Identities = 14/40 (35%), Positives = 27/40 (67%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ E+EE +++EEEE++++EE++ EE+ E+ EE
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEE 473
Score = 27.2 bits (60), Expect = 4.5
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
GE ++ EE++ EEEE++++EE+E E+E EEE +E
Sbjct: 425 SSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDE 470
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 30.7 bits (69), Expect = 0.25
Identities = 18/48 (37%), Positives = 32/48 (66%), Gaps = 2/48 (4%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKE--EEEKKKKEEKEREEEKREEENVEE 150
+K + ++KKK E+ EE K K+ E+E+ K+ EKER + + +++ EE
Sbjct: 75 KKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEE 122
Score = 28.0 bits (62), Expect = 2.3
Identities = 12/45 (26%), Positives = 29/45 (64%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++ E+ K + E+E+ K+ E+E+ K +E++++ E+ E++ E
Sbjct: 86 EQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLE 130
Score = 26.8 bits (59), Expect = 5.7
Identities = 13/46 (28%), Positives = 19/46 (41%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K E K K E +KK + E + ++ K E K + E E
Sbjct: 177 AKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAE 222
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 30.4 bits (69), Expect = 0.25
Identities = 14/41 (34%), Positives = 26/41 (63%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E+K++ + K K K ++ K K EEKE+ + ++EE+ + E
Sbjct: 43 EEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRE 83
Score = 30.4 bits (69), Expect = 0.30
Identities = 16/41 (39%), Positives = 25/41 (60%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
K + K K EEKE+ K++KEE+ ++ EE E+E E+
Sbjct: 57 KAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKL 97
Score = 29.2 bits (66), Expect = 0.77
Identities = 15/43 (34%), Positives = 26/43 (60%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K KK + + E+K+K + EK++K +E EE+ E+E E+
Sbjct: 54 AKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEK 96
Score = 28.1 bits (63), Expect = 1.8
Identities = 14/41 (34%), Positives = 25/41 (60%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ K + K+ K K EE+EK K+E++E+ + EE+ E+
Sbjct: 51 KVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPED 91
Score = 27.3 bits (61), Expect = 2.9
Identities = 13/41 (31%), Positives = 24/41 (58%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E ++K EEK + K + ++ K + +E+E+ KRE+E
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGL 81
Score = 26.5 bits (59), Expect = 4.8
Identities = 10/46 (21%), Positives = 24/46 (52%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++ K +K + K +E+E+ K+++E++ E E+ +E
Sbjct: 47 EEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDE 92
>gnl|CDD|217283 pfam02913, FAD-oxidase_C, FAD linked oxidases, C-terminal domain.
This domain has a ferredoxin-like fold.
Length = 247
Score = 30.4 bits (69), Expect = 0.26
Identities = 13/69 (18%), Positives = 23/69 (33%), Gaps = 2/69 (2%)
Query: 17 SIAKRYGGIPAGESNGRRGYMLTYIIAYIRDFACDYYFIGDSFETSVPWDKTVLLCINVK 76
E R Y + +RD + + + SVPW + L ++K
Sbjct: 90 GAGDVVVAEDEAERE-RLWRARKYAMP-LRDALGGAGPLVFTEDVSVPWSRLPDLVADIK 147
Query: 77 KRLTRECTV 85
+ L + V
Sbjct: 148 ELLAKYGLV 156
>gnl|CDD|217451 pfam03249, TSA, Type specific antigen. There are several antigenic
variants in Rickettsia tsutsugamushi, and a
type-specific antigen (TSA) of 56-kilodaltons located on
the rickettsial surface is responsible for the
variation. TSA proteins are probably integral membrane
proteins.
Length = 502
Score = 30.6 bits (69), Expect = 0.27
Identities = 15/51 (29%), Positives = 24/51 (47%), Gaps = 6/51 (11%)
Query: 97 KKEGNLGFRKR------QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
K + + G +K Q E+ N+ + KKK+ EK K+ K +E E
Sbjct: 352 KLQRHAGIKKAMEKLAAQQEEDAGNQGGGDCKKKQGASEKSKEGGKGKETE 402
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 30.5 bits (70), Expect = 0.29
Identities = 20/66 (30%), Positives = 41/66 (62%), Gaps = 9/66 (13%)
Query: 90 LEFRSETKKEGNLGFRKRQGE---KKKKNEEKEEE-KKKKEEEEKKKKE-EKEREEEKRE 144
+ R+E +KE R+R+ E +K+ +KEE +K E EK+++E EK+ +E +++
Sbjct: 67 HKLRNEFEKE----LRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQK 122
Query: 145 EENVEE 150
++ +E+
Sbjct: 123 QQELEK 128
Score = 29.7 bits (68), Expect = 0.61
Identities = 14/47 (29%), Positives = 23/47 (48%), Gaps = 6/47 (12%)
Query: 106 KRQGEKKKKNEEKEEEKKKKE------EEEKKKKEEKEREEEKREEE 146
+ + ++ + +KE E KKE EE K + E E+E +R E
Sbjct: 37 EEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNE 83
Score = 28.2 bits (64), Expect = 1.7
Identities = 14/44 (31%), Positives = 30/44 (68%)
Query: 97 KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
+K L R+ + EKK+K E+++++ +K+EEE ++ E++ +E
Sbjct: 100 RKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143
Score = 27.8 bits (63), Expect = 2.4
Identities = 21/54 (38%), Positives = 37/54 (68%), Gaps = 3/54 (5%)
Query: 97 KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+KE NL R+ E +K EE+ E+K+K+ E+++++ E+KE E E+ EE ++E
Sbjct: 93 QKEENL---DRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143
Score = 27.0 bits (61), Expect = 3.7
Identities = 13/35 (37%), Positives = 20/35 (57%), Gaps = 1/35 (2%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
K+ E KEE K + E EK+ +E + E +K E+
Sbjct: 56 KEALLEAKEEIHKLRNEFEKELRE-RRNELQKLEK 89
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 30.3 bits (69), Expect = 0.31
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 4/46 (8%)
Query: 91 EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK 136
E R +TK K+ G+ KK K EKK+KE+E+ K K+
Sbjct: 375 ELRPKTKAPS----EKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRH 416
Score = 26.8 bits (60), Expect = 5.8
Identities = 14/52 (26%), Positives = 22/52 (42%), Gaps = 4/52 (7%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
+T K KR +K+K ++E+ K K+ K K R+ EE
Sbjct: 387 KTGKPSKKVLAKRAEKKEK----EKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 30.1 bits (68), Expect = 0.32
Identities = 17/52 (32%), Positives = 35/52 (67%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
+ +++ + + E+KK E++E E+K++EE E++++++KE EE K EE
Sbjct: 15 QARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREE 66
Score = 29.3 bits (66), Expect = 0.51
Identities = 20/37 (54%), Positives = 33/37 (89%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEK 142
KR+GE+K++ E +EE +KKKEEEE+K++EE+ R+E++
Sbjct: 37 KREGERKEEEELEEEREKKKEEEERKEREEQARKEQE 73
Score = 27.0 bits (60), Expect = 3.4
Identities = 19/56 (33%), Positives = 37/56 (66%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
KK L ++ + ++++ EE+ EE+KK EE+ + +++E+E EE+RE++ EE
Sbjct: 4 GAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEE 59
Score = 27.0 bits (60), Expect = 3.9
Identities = 17/45 (37%), Positives = 33/45 (73%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K+ G KK+ E+++ ++++ E E++++EE+++ EEKRE E EE
Sbjct: 1 KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEE 45
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 30.4 bits (69), Expect = 0.32
Identities = 20/68 (29%), Positives = 31/68 (45%), Gaps = 6/68 (8%)
Query: 89 LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEK------KKKEEEEKKKKEEKEREEEK 142
L E + T+ +R + K + E ++ ++E E KKK EK RE+EK
Sbjct: 272 LQELKETTENLLYENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEK 331
Query: 143 REEENVEE 150
R EE +
Sbjct: 332 RLEELEQN 339
Score = 28.4 bits (64), Expect = 1.6
Identities = 15/54 (27%), Positives = 30/54 (55%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
+E ++E F ++ EK+K+ EE E+ ++ +E K EE +++ E E+
Sbjct: 311 LNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKR 364
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 29.5 bits (67), Expect = 0.32
Identities = 15/32 (46%), Positives = 21/32 (65%), Gaps = 1/32 (3%)
Query: 111 KKKKNEEKEEEKKK-KEEEEKKKKEEKEREEE 141
KK + EE+E K+ KE E+ KKEE+ RE +
Sbjct: 79 KKNEKEERELRKRAEKEALEQAKKEEELREAK 110
Score = 28.8 bits (65), Expect = 0.69
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 122 KKKKEEEEKKKKEEKEREEEKREEENVEE 150
K +KEE E +K+ EKE E+ ++EE + E
Sbjct: 80 KNEKEERELRKRAEKEALEQAKKEEELRE 108
Score = 26.1 bits (58), Expect = 6.5
Identities = 11/33 (33%), Positives = 22/33 (66%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
K + E +K+ E++ E+ KKEEE ++ K ++ +
Sbjct: 83 KEERELRKRAEKEALEQAKKEEELREAKRQQRK 115
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 30.4 bits (68), Expect = 0.33
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
K Q E KK +EE + K K + K++ + E+E E +E E
Sbjct: 285 KAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELE 325
Score = 28.8 bits (64), Expect = 1.2
Identities = 11/45 (24%), Positives = 24/45 (53%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K+ E+ K ++ + K+E + +K+ E + E +++ E V E
Sbjct: 291 KKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAE 335
Score = 27.7 bits (61), Expect = 2.7
Identities = 17/53 (32%), Positives = 25/53 (47%), Gaps = 8/53 (15%)
Query: 106 KRQGEKKKKNEEK-EEEKKKKEEEEKKKKEEKE-------REEEKREEENVEE 150
K+ E +K+ EK + E KK +EE K K+ K + EK E+ E
Sbjct: 273 KQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELE 325
Score = 26.9 bits (59), Expect = 4.8
Identities = 13/52 (25%), Positives = 27/52 (51%), Gaps = 6/52 (11%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEE------KEREEEKREEENVEE 150
++ +K++ E+ + E KK +EE K K+ +E + ++E E+ E
Sbjct: 273 KQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKEL 324
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 30.6 bits (70), Expect = 0.35
Identities = 15/45 (33%), Positives = 25/45 (55%), Gaps = 2/45 (4%)
Query: 105 RKRQGEKKKK--NEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
+ +K ++ KE EK K+E EEKK+K ++E ++ E E
Sbjct: 529 ERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEK 573
Score = 28.3 bits (64), Expect = 1.5
Identities = 15/37 (40%), Positives = 25/37 (67%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
E +++ E+K EE + +E +K KEE E ++EK +EE
Sbjct: 527 ELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEE 563
Score = 26.7 bits (60), Expect = 5.6
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 115 NEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E+E E+K +E E K+ EK +EE + ++E ++E
Sbjct: 527 ELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQE 562
Score = 26.3 bits (59), Expect = 8.0
Identities = 23/57 (40%), Positives = 32/57 (56%), Gaps = 3/57 (5%)
Query: 90 LEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
LE E K E ++ EK K EE EE+K+K +EEE K EE E+E ++ +E
Sbjct: 528 LERELEQKAE-EAEALLKEAEKLK--EELEEKKEKLQEEEDKLLEEAEKEAQQAIKE 581
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 29.2 bits (66), Expect = 0.35
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKERE 139
E+KK+EEEE+++KEE E E
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEEE 96
Score = 29.2 bits (66), Expect = 0.41
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEE 129
++KK EE+EEE+K++ EEE
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEE 96
Score = 28.4 bits (64), Expect = 0.66
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 125 KEEEEKKKKEEKEREEEKREEENVE 149
EEKK++EE+E E+E+ EEE
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEEAAA 99
Score = 27.6 bits (62), Expect = 1.5
Identities = 10/25 (40%), Positives = 15/25 (60%)
Query: 126 EEEEKKKKEEKEREEEKREEENVEE 150
E+KK+EE+E EE++ EE
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEEAAA 99
Score = 27.2 bits (61), Expect = 1.5
Identities = 9/23 (39%), Positives = 18/23 (78%)
Query: 119 EEEKKKKEEEEKKKKEEKEREEE 141
++K+EEE++++E++E EEE
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEEE 96
Score = 27.2 bits (61), Expect = 1.7
Identities = 9/25 (36%), Positives = 17/25 (68%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREE 140
++KKEEEE+++++E+ EE
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEESEEE 96
Score = 27.2 bits (61), Expect = 1.7
Identities = 8/23 (34%), Positives = 16/23 (69%)
Query: 119 EEEKKKKEEEEKKKKEEKEREEE 141
++++EE++++EEKE EE
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEE 95
Score = 26.1 bits (58), Expect = 3.9
Identities = 10/22 (45%), Positives = 16/22 (72%)
Query: 124 KKEEEEKKKKEEKEREEEKREE 145
EE+K+++EE+E +EE EE
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEE 96
Score = 26.1 bits (58), Expect = 4.7
Identities = 6/23 (26%), Positives = 15/23 (65%)
Query: 119 EEEKKKKEEEEKKKKEEKEREEE 141
+E++E++++EE++ E E
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEESE 94
Score = 25.7 bits (57), Expect = 5.5
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 126 EEEEKKKKEEKEREEEKREEE 146
++KKEE+E EEEK E E
Sbjct: 74 AAAAEEKKEEEEEEEEKEESE 94
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 30.2 bits (68), Expect = 0.37
Identities = 14/44 (31%), Positives = 27/44 (61%), Gaps = 4/44 (9%)
Query: 110 EKKKKNEEK----EEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
E+K++ EK E E++++ EE+++++EEK E R + E
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAE 295
Score = 27.9 bits (62), Expect = 2.4
Identities = 17/48 (35%), Positives = 26/48 (54%), Gaps = 3/48 (6%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEE---EEKKKKEEKEREEEKREEENVEE 150
KR+ +K EE E E++ +E+ EE+K E +R + K E E E
Sbjct: 254 KRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRRE 301
Score = 27.9 bits (62), Expect = 2.8
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 122 KKKKEEEEKKKKEEKEREEEKREEENVEE 150
++K+ E EK KEE ERE + E+ EE
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREE 280
Score = 26.8 bits (59), Expect = 6.0
Identities = 11/35 (31%), Positives = 19/35 (54%), Gaps = 3/35 (8%)
Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
+RK Q E++ +E + K E+ + K E KE+
Sbjct: 436 YRKLQEERR---LREEAIRAKAEKTARMKAEMKEK 467
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 29.8 bits (67), Expect = 0.39
Identities = 14/31 (45%), Positives = 21/31 (67%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
KKK E ++E+KKK++E+KKKK+ E
Sbjct: 144 HKKKKHEDDKERKKKKKEKKKKKKRHSPEHP 174
Score = 27.1 bits (60), Expect = 3.1
Identities = 14/36 (38%), Positives = 25/36 (69%), Gaps = 4/36 (11%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
EKK K ++ E++K++K KKKKE+K++++ E
Sbjct: 141 EKKHKKKKHEDDKERK----KKKKEKKKKKKRHSPE 172
Score = 27.1 bits (60), Expect = 3.5
Identities = 9/34 (26%), Positives = 24/34 (70%)
Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
+ + E+K KK++ E K+ +K+++E+K++++
Sbjct: 135 EGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKR 168
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 30.3 bits (68), Expect = 0.39
Identities = 10/41 (24%), Positives = 20/41 (48%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ + ++K +EK ++E K + E E E+ + EE
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREE 832
Score = 30.3 bits (68), Expect = 0.40
Identities = 6/45 (13%), Positives = 20/45 (44%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ ++K K + +++ K E + + E+ +++ E+
Sbjct: 794 AARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAP 838
Score = 29.5 bits (66), Expect = 0.64
Identities = 10/42 (23%), Positives = 21/42 (50%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
R +Q K+K +KE + + E + ++ ++ EEK +
Sbjct: 796 RAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDA 837
Score = 28.8 bits (64), Expect = 1.3
Identities = 12/56 (21%), Positives = 24/56 (42%), Gaps = 10/56 (17%)
Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEE----EKKKKEEKEREEE------KREEENVE 149
+++ EK ++E + E E E K++EEK + K++ V+
Sbjct: 797 AKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKVD 852
Score = 28.4 bits (63), Expect = 1.8
Identities = 10/46 (21%), Positives = 22/46 (47%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ K+K E+ +K+ K E + E + ++REE+ ++
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDA 837
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 30.4 bits (69), Expect = 0.41
Identities = 17/42 (40%), Positives = 25/42 (59%), Gaps = 3/42 (7%)
Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
FR +KK+ E+K K+EE KKE+ +REE++R E
Sbjct: 643 FRNWIEKKKEFKILDYTEEKTKKEE---KKEKNKREEKERIE 681
Score = 30.0 bits (68), Expect = 0.47
Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 4/40 (10%)
Query: 111 KKKKN----EEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
KK K EE+EEE + E + K K+ +E EE+
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEED 263
Score = 29.2 bits (66), Expect = 0.89
Identities = 11/42 (26%), Positives = 20/42 (47%)
Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
F K+ E + E +EE + E + K ++E+E E+
Sbjct: 222 FTKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEED 263
Score = 28.8 bits (65), Expect = 1.1
Identities = 15/34 (44%), Positives = 23/34 (67%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
E+K+E K EEK KKEEK+ + ++ E+E +E
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKERIE 681
Score = 28.8 bits (65), Expect = 1.2
Identities = 11/38 (28%), Positives = 22/38 (57%)
Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
KK +E E ++++EE + + + E + K+E+E E
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTE 261
Score = 28.8 bits (65), Expect = 1.3
Identities = 9/39 (23%), Positives = 22/39 (56%)
Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
KK E E E++++E + + + + + ++ +E + EE
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEE 262
Score = 28.0 bits (63), Expect = 2.4
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 122 KKKKEEEEKKKKEEKEREEEKREEENVEE 150
+KKKE + EEK ++EEK+E+ EE
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREE 676
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 30.3 bits (68), Expect = 0.43
Identities = 13/44 (29%), Positives = 28/44 (63%), Gaps = 1/44 (2%)
Query: 102 LGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
+ +R R+ E N ++ +EK +++++K + E KE+ +K+EE
Sbjct: 1 MAYRNRR-EANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEE 43
Score = 28.0 bits (62), Expect = 2.4
Identities = 9/45 (20%), Positives = 24/45 (53%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++K++EK+++K + E +EK +++E + + E+
Sbjct: 12 NNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQ 56
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 28.9 bits (65), Expect = 0.45
Identities = 9/22 (40%), Positives = 19/22 (86%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKE 137
+EEE++++EEEE++++ E+E
Sbjct: 74 AAEEEEEEEEEEEEEEEESEEE 95
Score = 28.1 bits (63), Expect = 0.81
Identities = 9/20 (45%), Positives = 18/20 (90%)
Query: 110 EKKKKNEEKEEEKKKKEEEE 129
E++++ EE+EEE++++ EEE
Sbjct: 76 EEEEEEEEEEEEEEEESEEE 95
Score = 28.1 bits (63), Expect = 0.83
Identities = 9/25 (36%), Positives = 18/25 (72%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREE 140
E++++EEEE++++EE+ EE
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEE 95
Score = 28.1 bits (63), Expect = 0.95
Identities = 10/23 (43%), Positives = 19/23 (82%)
Query: 119 EEEKKKKEEEEKKKKEEKEREEE 141
++++EEEE++++EE+E EEE
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEE 95
Score = 27.7 bits (62), Expect = 1.0
Identities = 10/23 (43%), Positives = 16/23 (69%)
Query: 125 KEEEEKKKKEEKEREEEKREEEN 147
EE++++EE+E EEE+ EE
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEE 95
Score = 27.7 bits (62), Expect = 1.0
Identities = 9/23 (39%), Positives = 17/23 (73%)
Query: 119 EEEKKKKEEEEKKKKEEKEREEE 141
+++EEEE++++EE+E EE
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEE 94
Score = 27.7 bits (62), Expect = 1.3
Identities = 12/26 (46%), Positives = 19/26 (73%)
Query: 124 KKEEEEKKKKEEKEREEEKREEENVE 149
EEE++++EE+E EEE+ EEE +
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEAMA 98
Score = 27.7 bits (62), Expect = 1.3
Identities = 8/20 (40%), Positives = 17/20 (85%)
Query: 110 EKKKKNEEKEEEKKKKEEEE 129
++++ EE+EEE++++E EE
Sbjct: 75 AEEEEEEEEEEEEEEEESEE 94
Score = 26.2 bits (58), Expect = 4.4
Identities = 9/23 (39%), Positives = 16/23 (69%)
Query: 119 EEEKKKKEEEEKKKKEEKEREEE 141
++EEEE++++EE+E E E
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESE 93
Score = 25.8 bits (57), Expect = 5.0
Identities = 8/21 (38%), Positives = 14/21 (66%)
Query: 126 EEEEKKKKEEKEREEEKREEE 146
++++EE+E EEE+ EE
Sbjct: 72 AAAAEEEEEEEEEEEEEEEES 92
>gnl|CDD|235672 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated.
Length = 697
Score = 30.1 bits (69), Expect = 0.47
Identities = 9/29 (31%), Positives = 18/29 (62%)
Query: 122 KKKKEEEEKKKKEEKEREEEKREEENVEE 150
K++K+ E +E +E E + EEE+ ++
Sbjct: 328 KREKKAAELAAEEAEEEEAAEPEEESWDD 356
Score = 27.4 bits (62), Expect = 3.8
Identities = 10/34 (29%), Positives = 19/34 (55%)
Query: 102 LGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEE 135
LGF + K++K + ++ +EEE + +EE
Sbjct: 319 LGFLAYRLRKREKKAAELAAEEAEEEEAAEPEEE 352
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 28.1 bits (63), Expect = 0.47
Identities = 20/39 (51%), Positives = 29/39 (74%)
Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
KKK + EEEK+++EEE +K++E KERE+ K EE + E
Sbjct: 25 KKKELKAEEEKREEEEEARKREERKEREKNKSFEELLNE 63
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 29.8 bits (68), Expect = 0.48
Identities = 9/20 (45%), Positives = 16/20 (80%)
Query: 116 EEKEEEKKKKEEEEKKKKEE 135
+EEE++++EEEE++ EE
Sbjct: 300 AAEEEEEEEEEEEEEEPSEE 319
Score = 29.1 bits (66), Expect = 0.74
Identities = 14/40 (35%), Positives = 22/40 (55%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K +E+ +E + + +EE+E EEE+ EEE EE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319
Score = 28.7 bits (65), Expect = 1.1
Identities = 11/22 (50%), Positives = 16/22 (72%)
Query: 125 KEEEEKKKKEEKEREEEKREEE 146
EE++++EE+E EEE EEE
Sbjct: 299 AAAEEEEEEEEEEEEEEPSEEE 320
Score = 28.7 bits (65), Expect = 1.2
Identities = 8/20 (40%), Positives = 16/20 (80%)
Query: 110 EKKKKNEEKEEEKKKKEEEE 129
++++ EE+EEE+++ EEE
Sbjct: 301 AEEEEEEEEEEEEEEPSEEE 320
Score = 26.4 bits (59), Expect = 6.1
Identities = 7/22 (31%), Positives = 15/22 (68%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEE 129
+++ EE+EEE++++ EE
Sbjct: 298 AAAAEEEEEEEEEEEEEEPSEE 319
Score = 26.0 bits (58), Expect = 8.5
Identities = 14/42 (33%), Positives = 24/42 (57%)
Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+K +EE +E + + ++EE+E EEE+ EE + EE
Sbjct: 279 ADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
>gnl|CDD|216317 pfam01131, Topoisom_bac, DNA topoisomerase. This subfamily of
topoisomerase is divided on the basis that these enzymes
preferentially relax negatively supercoiled DNA, from a
5' phospho- tyrosine linkage in the enzyme-DNA covalent
intermediate and has high affinity for single stranded
DNA.
Length = 365
Score = 29.9 bits (68), Expect = 0.51
Identities = 16/63 (25%), Positives = 31/63 (49%)
Query: 77 KRLTRECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK 136
+ + G EF++ K G++K ++K ++EE E K+ EE + K E +
Sbjct: 208 VYEETKVELEVGGEEFKASGKVLLEPGWKKVYPDEKDEDEEPLPELKEGEELDVKDVELE 267
Query: 137 ERE 139
E++
Sbjct: 268 EKK 270
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 29.6 bits (66), Expect = 0.54
Identities = 15/40 (37%), Positives = 25/40 (62%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K K ++++ K +E ++ K EEK+ +EE+ EEE EE
Sbjct: 193 KAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEE 232
Score = 28.5 bits (63), Expect = 1.3
Identities = 14/42 (33%), Positives = 29/42 (69%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
+ +Q + K +E +E K +E++ ++++EE+E EEE ++EE
Sbjct: 196 KPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEE 237
Score = 26.5 bits (58), Expect = 6.3
Identities = 13/32 (40%), Positives = 21/32 (65%), Gaps = 4/32 (12%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
E+K+ EE+EEE E EE+ K+EE + ++
Sbjct: 216 EEKQPQEEQEEE----EVEEEAKQEEGQGTDD 243
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 29.6 bits (67), Expect = 0.55
Identities = 14/44 (31%), Positives = 24/44 (54%), Gaps = 1/44 (2%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK-EREEEKREEEN 147
++ EK+K+ E+ E +++ +E K+ EK E E EK E
Sbjct: 220 QELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQ 263
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 29.7 bits (67), Expect = 0.56
Identities = 9/34 (26%), Positives = 27/34 (79%)
Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
KK ++++++++++ +++KK++E+E ++K EE
Sbjct: 189 LKKLKQQQQKREEERRKQRKKQQEEEERKQKAEE 222
Score = 29.3 bits (66), Expect = 0.68
Identities = 12/32 (37%), Positives = 26/32 (81%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK 136
+ +Q ++K++ E +++ KK++EEEE+K+K E+
Sbjct: 191 KLKQQQQKREEERRKQRKKQQEEEERKQKAEE 222
Score = 28.9 bits (65), Expect = 0.88
Identities = 13/39 (33%), Positives = 30/39 (76%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
Q E KK+ +E E +K K+++++++++ K+R++++ EEE
Sbjct: 177 QEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEE 215
Score = 28.5 bits (64), Expect = 1.1
Identities = 13/42 (30%), Positives = 30/42 (71%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
R E+ KK ++ E KK K++++K+++E +++ ++++EEE
Sbjct: 173 RNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEE 214
Score = 28.2 bits (63), Expect = 1.7
Identities = 11/39 (28%), Positives = 25/39 (64%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
+ +++K +K E+K+K+E EK+++E + R+ +E
Sbjct: 94 SAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKE 132
Score = 28.2 bits (63), Expect = 1.8
Identities = 9/31 (29%), Positives = 25/31 (80%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
E KK +++++ ++++ ++ KK++EE+ER++
Sbjct: 188 ELKKLKQQQQKREEERRKQRKKQQEEEERKQ 218
Score = 26.6 bits (59), Expect = 5.4
Identities = 12/32 (37%), Positives = 23/32 (71%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK 136
K+Q +K+++ K+ +K+++EEE K+K EE
Sbjct: 192 LKQQQQKREEERRKQRKKQQEEEERKQKAEEA 223
Score = 26.6 bits (59), Expect = 5.5
Identities = 12/37 (32%), Positives = 27/37 (72%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
R+++ +K + ++K+E +K++EE E +++ KE+ EE
Sbjct: 100 RQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEE 136
Score = 26.6 bits (59), Expect = 5.9
Identities = 11/45 (24%), Positives = 30/45 (66%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
+R +++ + +E + KK +++++K+EE+ R++ K+++E E
Sbjct: 171 PERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEE 215
Score = 26.2 bits (58), Expect = 7.3
Identities = 12/42 (28%), Positives = 28/42 (66%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
K+ +KK + EE++K++ E+E+++ E ++R +++ EE
Sbjct: 95 AKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEE 136
Score = 25.8 bits (57), Expect = 8.6
Identities = 15/42 (35%), Positives = 29/42 (69%), Gaps = 2/42 (4%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKE--EKEREEEKREEENVEE 150
K ++N +EE KK+ +E E KK + +++REEE+R++ ++
Sbjct: 170 KPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQ 211
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 29.7 bits (67), Expect = 0.59
Identities = 15/63 (23%), Positives = 24/63 (38%)
Query: 88 GLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
GLLE ++E L + K+ N KEE+ K KE ++
Sbjct: 190 GLLERKNELPHLKKLIILDTLIKSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNE 249
Query: 148 VEE 150
+E+
Sbjct: 250 LED 252
Score = 26.6 bits (59), Expect = 6.2
Identities = 12/44 (27%), Positives = 19/44 (43%), Gaps = 1/44 (2%)
Query: 97 KKEGNLGFR-KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKERE 139
K+E N G G K K E+K + + E+ E ++E
Sbjct: 220 KEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKE 263
>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
motility and secretion / Intracellular trafficking and
secretion].
Length = 696
Score = 29.5 bits (67), Expect = 0.59
Identities = 13/30 (43%), Positives = 24/30 (80%)
Query: 121 EKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K+++ EE++K EE ++EEE+ EEE+V++
Sbjct: 326 RKEQQAEEKEKPAEEAKKEEEEEEEESVDD 355
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 29.7 bits (66), Expect = 0.61
Identities = 16/51 (31%), Positives = 27/51 (52%)
Query: 97 KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
K E + K G+K + EE E +++ +E EE KK+ ++ E EE+
Sbjct: 225 KLEEEIDILKNDGDKVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKF 275
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 29.7 bits (66), Expect = 0.61
Identities = 14/45 (31%), Positives = 27/45 (60%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K + + +++ E E ++ KEEE KK++E+E+ K +N +E
Sbjct: 1023 KERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKE 1067
Score = 27.0 bits (59), Expect = 5.0
Identities = 9/46 (19%), Positives = 29/46 (63%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ R +++ K++E++E + + ++++ +KE+E E+R + + +
Sbjct: 1018 KDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHD 1063
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation of
60S ribosomal subunits.
Length = 132
Score = 28.9 bits (65), Expect = 0.61
Identities = 9/34 (26%), Positives = 21/34 (61%)
Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
++++ + KKK + E+ +KK +++ EKR+
Sbjct: 26 RSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALE 59
Score = 25.8 bits (57), Expect = 7.4
Identities = 11/33 (33%), Positives = 21/33 (63%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
+ +KK K+E+ E++ K++ EK++ EK R
Sbjct: 30 LLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGR 62
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 29.6 bits (67), Expect = 0.68
Identities = 13/32 (40%), Positives = 25/32 (78%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEK 142
KKK EEK+ +K+KK+++ KKK ++++++ K
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 28.4 bits (64), Expect = 1.5
Identities = 11/39 (28%), Positives = 29/39 (74%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
K + K K + +E++ +K+++++K+KK+ K+R+++ R+
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 28.0 bits (63), Expect = 2.2
Identities = 13/46 (28%), Positives = 29/46 (63%), Gaps = 5/46 (10%)
Query: 105 RKRQGEKKKKNEE-----KEEEKKKKEEEEKKKKEEKEREEEKREE 145
KR E K+K + +EE+K +K +++KK+K++ ++ ++K +
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 29.0 bits (65), Expect = 0.69
Identities = 14/48 (29%), Positives = 21/48 (43%), Gaps = 3/48 (6%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKK---KKEEKEREEEKREEENVEE 150
K +K K ++ K+ KK E EKK + E E++ E V
Sbjct: 113 KVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAA 160
Score = 27.0 bits (60), Expect = 3.5
Identities = 12/55 (21%), Positives = 20/55 (36%)
Query: 96 TKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K+ L K+ +K EK+ + + E +KK E + EE
Sbjct: 115 NAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEE 169
Score = 26.3 bits (58), Expect = 5.9
Identities = 10/34 (29%), Positives = 15/34 (44%)
Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
KK NE + E +K+ E +E + EE
Sbjct: 137 KKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEE 170
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 29.2 bits (66), Expect = 0.70
Identities = 17/58 (29%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 91 EFRSETKKE-GNLGFRKRQGEKKKKNEEKEEEKKK-KEEEEKKKKEEKEREEEKREEE 146
E S+ +K+ L +R+ E+K+ EE+E + K E+ + ++K +E +K E+E
Sbjct: 107 EITSDVRKQLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDE 164
Score = 26.1 bits (58), Expect = 7.8
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 124 KKEEEEKKKKEEKEREEEKREEENVEE 150
+ E EE++K++E+ERE R ++ E
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSE 147
>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
Length = 202
Score = 28.9 bits (65), Expect = 0.78
Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 5/43 (11%)
Query: 113 KKNEEKEEEK-----KKKEEEEKKKKEEKEREEEKREEENVEE 150
K EEK +E+ KKK + K KE +++ K + V
Sbjct: 153 AKLEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAV 195
>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
Length = 155
Score = 28.9 bits (65), Expect = 0.79
Identities = 10/37 (27%), Positives = 10/37 (27%)
Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E KKKK E E E EE
Sbjct: 117 PTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAEE 153
>gnl|CDD|216381 pfam01237, Oxysterol_BP, Oxysterol-binding protein.
Length = 335
Score = 29.1 bits (66), Expect = 0.90
Identities = 16/32 (50%), Positives = 19/32 (59%)
Query: 119 EEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
EE + EEEK + EEK+RE K EE EE
Sbjct: 276 EEGDYDEAEEEKLRLEEKQRERRKEREEKGEE 307
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 28.8 bits (65), Expect = 0.91
Identities = 13/25 (52%), Positives = 16/25 (64%)
Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEK 136
+KK + KEE KKKK +E K K K
Sbjct: 249 EKKRKSKEEIKKKKPKESKGVKALK 273
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 29.2 bits (66), Expect = 0.93
Identities = 13/37 (35%), Positives = 25/37 (67%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
E KK+ ++ EE+ ++ E E +KK+EE E+ + K ++
Sbjct: 99 ELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDK 135
Score = 27.3 bits (61), Expect = 3.9
Identities = 16/39 (41%), Positives = 26/39 (66%)
Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
GE+ + E + EE KK+ ++ ++K E+ E E EK+EEE
Sbjct: 87 GEENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEEL 125
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 28.4 bits (64), Expect = 0.94
Identities = 15/41 (36%), Positives = 25/41 (60%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
EK+++ EE+ E ++ EEE+ + EKE + KRE+ E
Sbjct: 100 EKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENE 140
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 29.1 bits (65), Expect = 0.95
Identities = 12/41 (29%), Positives = 22/41 (53%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E KK+ E +E ++ E+E+ E +E++E+ EE
Sbjct: 74 ELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEE 114
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 28.3 bits (64), Expect = 0.98
Identities = 9/34 (26%), Positives = 17/34 (50%), Gaps = 1/34 (2%)
Query: 108 QGEKKKKNEEKEEEKKKKEE-EEKKKKEEKEREE 140
Q + + +EK + + +E E K K+ E E +
Sbjct: 73 QAKIARYKKEKARYRSEAKELEAKAKEAEAESDH 106
Score = 27.5 bits (62), Expect = 1.7
Identities = 6/36 (16%), Positives = 16/36 (44%)
Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
+ K ++EK + E K+ + + + E + +
Sbjct: 73 QAKIARYKKEKARYRSEAKELEAKAKEAEAESDHAL 108
>gnl|CDD|239571 cd03489, Topoisomer_IB_N_LdtopoI_like,
Topoisomer_IB_N_LdtopoI_like: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the heterodimeric topo I from
Leishmania donvanni. Topo I enzymes are divided into:
topo type IA (bacterial) and type IB (eukaryotic). Topo
I relaxes superhelical tension in duplex DNA by creating
a single-strand nick, the broken strand can then rotate
around the unbroken strand to remove DNA supercoils and,
the nick is religated, liberating topo I. These enzymes
regulate the topological changes that accompany DNA
replication, transcription and other nuclear processes.
Human topo I is the target of a diverse set of
anticancer drugs including camptothecins (CPTs). CPTs
bind to the topo I-DNA complex and inhibit re-ligation
of the single-strand nick, resulting in the accumulation
of topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topo I play putative roles in
organizing the kinetoplast DNA network unique to these
parasites. This family may represent more than one
structural domain.
Length = 212
Score = 28.7 bits (64), Expect = 1.0
Identities = 13/27 (48%), Positives = 20/27 (74%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEK 142
EKE++K + +EE+K KEEK++E E
Sbjct: 93 REKEKKKSRTKEEKKALKEEKDKEAEP 119
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 28.7 bits (65), Expect = 1.0
Identities = 10/45 (22%), Positives = 22/45 (48%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+R ++ KK E E++K E K K + ++ + R ++ +
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQ 157
Score = 26.0 bits (58), Expect = 6.7
Identities = 12/47 (25%), Positives = 23/47 (48%)
Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ ++ E++ K E EK+K E+K K + R++ K + E
Sbjct: 110 VQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEP 156
Score = 26.0 bits (58), Expect = 8.0
Identities = 8/46 (17%), Positives = 24/46 (52%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++ + + KK+ E+EK + E + K K +++ + + ++ +
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQH 158
>gnl|CDD|185768 cd09245, BRO1_UmRIM23-like, Protein-interacting, Bro1-like domain
of Ustilago maydis Rim23 (PalC), and related domains.
This family contains the Bro1-like domain of Ustilago
maydis Rim23 (also known as PalC), and related proteins.
It belongs to the BRO1_Alix_like superfamily which
includes the Bro1-like domains of mammalian Alix
(apoptosis-linked gene-2 interacting protein X),
His-Domain type N23 protein tyrosine phosphatase
(HD-PTP, also known as PTPN23), RhoA-binding proteins
Rhophilin-1 and -2, Brox, Bro1 and Rim20 (also known as
PalA) from Saccharomyces cerevisiae, and related
domains. Alix, HD-PTP, Brox, Bro1, Rim20, and Rim23
interact with the ESCRT (Endosomal Sorting Complexes
Required for Transport) system. Rim20 and Rim23
participate in the response to the external pH via the
Rim101 pathway. Through its Bro1-like domain, Rim23
allows the interaction between the endosomal and plasma
membrane complexes. Bro1-like domains are
boomerang-shape, and part of the domain is a
tetratricopeptide repeat (TPR)-like structure.
Intermediates in the Rim101 pathway may play roles in
the pathogenesis of fungal corneal infection during
Candida albicans keratitis. This family lacks the
V-shaped (V) domain found in many members of the
BRO1_Alix_like superfamily.
Length = 413
Score = 28.9 bits (65), Expect = 1.1
Identities = 15/47 (31%), Positives = 23/47 (48%), Gaps = 4/47 (8%)
Query: 97 KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
KKE L K K + K+ K+K+ E++K ++ EEE R
Sbjct: 323 KKE--LEDLKSPSGVASKAKLKKSWKEKR--EDRKVEKGAGVEEELR 365
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 28.5 bits (64), Expect = 1.1
Identities = 16/61 (26%), Positives = 35/61 (57%), Gaps = 2/61 (3%)
Query: 86 SNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
S+ E RS+ +++ ++Q E++++ E+ E E+ E+ + ++K +EE+K E
Sbjct: 101 SDPNYEIRSDVRRQLR-FLAQKQKEEEERRVERRRE-LGLEDPEQLRLKQKAKEEQKAES 158
Query: 146 E 146
E
Sbjct: 159 E 159
Score = 26.6 bits (59), Expect = 5.3
Identities = 11/37 (29%), Positives = 21/37 (56%)
Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K +E+EE + ++ E + E+ R ++K +EE E
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAE 157
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 28.6 bits (64), Expect = 1.1
Identities = 13/45 (28%), Positives = 26/45 (57%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
R+ KK+++K++EK +EE++ + ++K E E + N E
Sbjct: 59 PRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEES 103
Score = 28.6 bits (64), Expect = 1.2
Identities = 12/46 (26%), Positives = 26/46 (56%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+KR +K K+ +KK KE+ +++K+ + +++ E EN +
Sbjct: 53 KKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPD 98
Score = 26.6 bits (59), Expect = 4.1
Identities = 17/44 (38%), Positives = 23/44 (52%), Gaps = 2/44 (4%)
Query: 105 RKRQGEKKKKN--EEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
RK KK K +EK E++KK E + K EE E + + EE
Sbjct: 60 RKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEES 103
Score = 26.6 bits (59), Expect = 5.4
Identities = 10/45 (22%), Positives = 26/45 (57%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+Q +KK+ ++ KK +++ K+K +E ++ + +++ EE
Sbjct: 48 PKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEE 92
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 28.8 bits (65), Expect = 1.1
Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 5/37 (13%)
Query: 114 KNEEKEEEKKKKEEEEKK-----KKEEKEREEEKREE 145
K E + E++KK+ EE K ++ ERE+ RE
Sbjct: 435 KAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREA 471
>gnl|CDD|233459 TIGR01541, tape_meas_lam_C, phage tail tape measure protein, lambda
family. This model represents a relatively
well-conserved region near the C-terminus of the tape
measure protein of a lambda and related phage. This
protein, which controls phage tail length, is typically
about 1000 residues in length. Both low-complexity
sequence and insertion/deletion events appear common in
this family. Mutational studies suggest a ruler or
template role in the determination of phage tail length.
Similar behavior is attributed to proteins from
distantly related or unrelated families in other phage
[Mobile and extrachromosomal element functions, Prophage
functions].
Length = 332
Score = 28.6 bits (64), Expect = 1.1
Identities = 10/62 (16%), Positives = 24/62 (38%)
Query: 89 LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
LL + +K L + + + +E K +K EE ++K + ++ ++
Sbjct: 5 LLTQQIADRKLKKLNTADEKSLQSRSDEIIALIKLEKLLEEAERKALEALKKLAEATASI 64
Query: 149 EE 150
Sbjct: 65 RA 66
>gnl|CDD|216259 pfam01039, Carboxyl_trans, Carboxyl transferase domain. All of the
members in this family are biotin dependent
carboxylases. The carboxyl transferase domain carries
out the following reaction; transcarboxylation from
biotin to an acceptor molecule. There are two recognised
types of carboxyl transferase. One of them uses acyl-CoA
and the other uses 2-oxoacid as the acceptor molecule of
carbon dioxide. All of the members in this family
utilise acyl-CoA as the acceptor molecule.
Length = 487
Score = 28.7 bits (65), Expect = 1.1
Identities = 6/29 (20%), Positives = 13/29 (44%)
Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEK 142
+ E + K+ E + ++ E EE+
Sbjct: 416 RKELAAATMRYKDLEALRAQKIAEYEEQL 444
>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA. This model
describes flagellar biosynthesis protein FlhA, one of a
large number of genes associated with the biosynthesis
of functional bacterial flagella. Homologs of many such
proteins, including FlhA, function in type III protein
secretion systems. A separate model describes InvA
(Salmonella enterica), LcrD (Yersinia enterocolitica),
HrcV (Xanthomonas), etc., all of which score below the
noise cutoff for this model [Cellular processes,
Chemotaxis and motility].
Length = 678
Score = 28.8 bits (65), Expect = 1.2
Identities = 8/29 (27%), Positives = 17/29 (58%)
Query: 122 KKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ K+EEE + + + E + EEE++ +
Sbjct: 312 RSKQEEEAAAEAAKAQEEAAEEEEESIND 340
Score = 28.4 bits (64), Expect = 1.4
Identities = 10/26 (38%), Positives = 15/26 (57%)
Query: 122 KKKKEEEEKKKKEEKEREEEKREEEN 147
++ K+EEE + K +EE EEE
Sbjct: 311 RRSKQEEEAAAEAAKAQEEAAEEEEE 336
Score = 26.8 bits (60), Expect = 4.7
Identities = 9/26 (34%), Positives = 14/26 (53%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEK 130
R +Q E+ K +E+ +EEEE
Sbjct: 312 RSKQEEEAAAEAAKAQEEAAEEEEES 337
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 28.0 bits (63), Expect = 1.2
Identities = 10/35 (28%), Positives = 21/35 (60%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKERE 139
R ++ K ++E++ +K E+K K+ +KE+E
Sbjct: 95 ALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKE 129
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 28.5 bits (64), Expect = 1.2
Identities = 17/33 (51%), Positives = 24/33 (72%), Gaps = 2/33 (6%)
Query: 118 KEEEKKKKEEEEKKKKEEKER--EEEKREEENV 148
+E EK KKE E+K++EE+E+ EEEK EE +
Sbjct: 149 RELEKIKKERAEEKEREEEEKAAEEEKAREEEI 181
Score = 27.4 bits (61), Expect = 3.1
Identities = 18/32 (56%), Positives = 24/32 (75%), Gaps = 1/32 (3%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
E +K +E+ EEK ++EEEEK +EEK REEE
Sbjct: 150 ELEKIKKERAEEK-EREEEEKAAEEEKAREEE 180
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 28.7 bits (65), Expect = 1.3
Identities = 15/45 (33%), Positives = 26/45 (57%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K + EK + +E+E+EK KE E KK E E++++ + + E
Sbjct: 169 KEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEG 213
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 28.7 bits (64), Expect = 1.3
Identities = 17/40 (42%), Positives = 23/40 (57%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
KKKK ++ +E+KKKK E K+ E EE E N E+
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEK 724
>gnl|CDD|235245 PRK04183, PRK04183, glutamyl-tRNA(Gln) amidotransferase subunit D;
Validated.
Length = 419
Score = 28.7 bits (65), Expect = 1.3
Identities = 7/38 (18%), Positives = 17/38 (44%)
Query: 101 NLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
N+G + + + E+ E+ K++ +E +K
Sbjct: 40 NIGIDIDKIAEIELLEKGEKPKQEPPPKEIEKDPGLPN 77
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 28.5 bits (64), Expect = 1.4
Identities = 13/38 (34%), Positives = 26/38 (68%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
QG N E+++E++++E+EE++++EE+ E E E
Sbjct: 336 QGRCTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373
Score = 26.6 bits (59), Expect = 7.0
Identities = 14/28 (50%), Positives = 22/28 (78%)
Query: 123 KKKEEEEKKKKEEKEREEEKREEENVEE 150
++K+EEE+++ EE+E EEE+ EE EE
Sbjct: 345 EQKDEEEEQEDEEEEEEEEEPEEPEPEE 372
Score = 26.2 bits (58), Expect = 9.3
Identities = 13/31 (41%), Positives = 20/31 (64%), Gaps = 2/31 (6%)
Query: 120 EEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++K EEEE +++E+E EEE+ EE E
Sbjct: 343 NPEQKDEEEE--QEDEEEEEEEEEPEEPEPE 371
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 28.1 bits (63), Expect = 1.4
Identities = 12/38 (31%), Positives = 17/38 (44%)
Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K E K + K K + + KK EE+ + E K E
Sbjct: 93 PKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPAS 130
Score = 28.1 bits (63), Expect = 1.6
Identities = 12/31 (38%), Positives = 17/31 (54%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
E K K + K + K K+ EE+ K+E K E
Sbjct: 96 EPKPKPKPKPKPKPVKKVEEQPKREVKPVEP 126
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 28.3 bits (63), Expect = 1.5
Identities = 15/43 (34%), Positives = 24/43 (55%), Gaps = 2/43 (4%)
Query: 104 FRKRQGEKKKKNEEKE--EEKKKKEEEEKKKKEEKEREEEKRE 144
F R+ +K + E++E + K K E E+K KEE ++ E E
Sbjct: 3 FELRRAREKLEREQRERKQRAKLKLERERKAKEEAAKQREAIE 45
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex together
with SecB and SecD on the surface of Salmonella. All
these proteins are secreted by the type III secretion
system. Many mucosal pathogens use type III secretion
systems for the injection of effector proteins into
target cells. SecB, SseC and SecD are inserted into the
target cell membrane. where they form a small pore or
translocon. In addition to SseC, this family includes
the bacterial secreted proteins PopB, PepB, YopB and
EspD which are thought to be directly involved in pore
formation, and type III secretion system translocon.
Length = 303
Score = 28.2 bits (63), Expect = 1.5
Identities = 7/32 (21%), Positives = 20/32 (62%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEK 142
K ++ E ++++KK EE +++ ++ + E+
Sbjct: 20 KLQQLERARDKQEKKAEEYQEQIKKAIEKAEE 51
>gnl|CDD|151656 pfam11214, Med2, Mediator complex subunit 2. This family of
mediator complex subunit 2 proteins is conserved in
fungi. Cyclin-dependent kinase CDK8 or Srb10 interacts
with and phosphorylates Med2. Post-translational
modifications of Mediator subunits are important for
regulation of gene expression.
Length = 99
Score = 27.4 bits (61), Expect = 1.5
Identities = 13/20 (65%), Positives = 18/20 (90%)
Query: 112 KKKNEEKEEEKKKKEEEEKK 131
+ K +++EEE+KKKEEEEKK
Sbjct: 80 ENKKKQEEEERKKKEEEEKK 99
Score = 27.0 bits (60), Expect = 2.2
Identities = 14/19 (73%), Positives = 18/19 (94%)
Query: 120 EEKKKKEEEEKKKKEEKER 138
E KKK+EEEE+KKKEE+E+
Sbjct: 80 ENKKKQEEEERKKKEEEEK 98
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 28.5 bits (63), Expect = 1.5
Identities = 16/35 (45%), Positives = 23/35 (65%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E+EEE + +E+EE +KE + +EK EEE EE
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEE 174
>gnl|CDD|191675 pfam07047, OPA3, Optic atrophy 3 protein (OPA3). This family
consists of several optic atrophy 3 (OPA3) proteins.
OPA3 deficiency causes type III 3-methylglutaconic
aciduria (MGA) in humans. This disease manifests with
early bilateral optic atrophy, spasticity,
extrapyramidal dysfunction, ataxia, and cognitive
deficits, but normal longevity.
Length = 134
Score = 27.6 bits (62), Expect = 1.6
Identities = 11/34 (32%), Positives = 21/34 (61%)
Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
+K +KEEE +++ EE + + E E E E+++
Sbjct: 101 RKEAKKEEELQQELEELEARVGELELEIERQKAR 134
>gnl|CDD|233752 TIGR02153, gatD_arch, glutamyl-tRNA(Gln) amidotransferase, subunit
D. This peptide is found only in the Archaea. It is
part of a heterodimer, with GatE (TIGR00134), that acts
as an amidotransferase on misacylated Glu-tRNA(Gln) to
produce Gln-tRNA(Gln). The analogous amidotransferase
found in bacteria is the GatABC system, although GatABC
homologs in the Archaea appear to act instead on
Asp-tRNA(Asn) [Protein synthesis, tRNA aminoacylation].
Length = 404
Score = 28.5 bits (64), Expect = 1.6
Identities = 9/38 (23%), Positives = 17/38 (44%)
Query: 101 NLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKER 138
N+GF+ + + EE E ++ E +KK +
Sbjct: 27 NIGFKISEIRNIEVLEEGSEPREVPPPAEIEKKPGLPK 64
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 28.5 bits (64), Expect = 1.7
Identities = 8/25 (32%), Positives = 16/25 (64%)
Query: 119 EEEKKKKEEEEKKKKEEKEREEEKR 143
EE K +E+ K + + ER++++R
Sbjct: 583 GEETKPQEQPAPKAEAKPERQQDRR 607
Score = 27.3 bits (61), Expect = 3.5
Identities = 10/46 (21%), Positives = 28/46 (60%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++Q ++++ + +EK++ ++E K E++ +E +EE V++
Sbjct: 670 EQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSVQETEQEERVQQ 715
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 27.8 bits (62), Expect = 1.7
Identities = 12/66 (18%), Positives = 27/66 (40%), Gaps = 9/66 (13%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKK---------KEEEEKKKKEEKEREEEKR 143
R+ ++ K E K +++ +K E +K +EK+ E++K
Sbjct: 14 RALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKP 73
Query: 144 EEENVE 149
+++ V
Sbjct: 74 KKKKVP 79
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 28.2 bits (63), Expect = 1.7
Identities = 17/70 (24%), Positives = 35/70 (50%), Gaps = 1/70 (1%)
Query: 76 KKRLTRECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEE-EKKKKEEEEKKKKE 134
K N L+E S +KE + K++ + K+E E++ +E +E+++ +
Sbjct: 503 KTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNK 562
Query: 135 EKEREEEKRE 144
+ E E+E +E
Sbjct: 563 KLELEKEAQE 572
Score = 27.5 bits (61), Expect = 3.0
Identities = 10/39 (25%), Positives = 21/39 (53%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
E+ EK +K+ E++ + E+ +E+EK ++E
Sbjct: 509 FKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKE 547
Score = 27.5 bits (61), Expect = 3.6
Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 1/61 (1%)
Query: 91 EFRSETKKEGNLGFRKRQGEKKKKNEEKEE-EKKKKEEEEKKKKEEKEREEEKREEENVE 149
F E K+E N+ K +K+ ++ E EK KE+E+ KK+ E+E EE K E N +
Sbjct: 504 TFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKK 563
Query: 150 E 150
Sbjct: 564 L 564
Score = 26.7 bits (59), Expect = 5.7
Identities = 13/41 (31%), Positives = 23/41 (56%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
R+R + + + E +E K K+E E +E KE++ K +E
Sbjct: 558 RERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKE 598
Score = 26.7 bits (59), Expect = 6.6
Identities = 15/42 (35%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 105 RKRQGEKKKKNEEK-EEEKKKKEEEEKKKKEEKEREEEKREE 145
K +K + E+ +E++K K+E E++ +E KERE K+ E
Sbjct: 524 EKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLE 565
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 27.7 bits (62), Expect = 1.7
Identities = 19/46 (41%), Positives = 36/46 (78%), Gaps = 2/46 (4%)
Query: 103 GFRKRQGEKKKKNEE--KEEEKKKKEEEEKKKKEEKEREEEKREEE 146
GF KR+ +++KK +E KE+E++++ EE K+ +EE+++E EK+ +E
Sbjct: 26 GFHKRKQQRRKKAQEEAKEKEREERIEERKRIREERKQELEKQLKE 71
>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain. FH proteins control
rearrangements of the actin cytoskeleton, especially in
the context of cytokinesis and cell polarisation.
Members of this family have been found to interact with
Rho-GTPases, profilin and other actin-assoziated
proteins. These interactions are mediated by the
proline-rich FH1 domain, usually located in front of FH2
(but not listed in SMART). Despite this cytosolic
function, vertebrate formins have been assigned
functions within the nucleus. A set of Formin-Binding
Proteins (FBPs) has been shown to bind FH1 with their WW
domain.
Length = 392
Score = 28.1 bits (63), Expect = 1.8
Identities = 12/44 (27%), Positives = 27/44 (61%)
Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
F K + ++N +KEEE++++ ++ K+ E E+ +++E N
Sbjct: 326 FLKEFSKAAEENIKKEEEEEERRKKLVKETTEYEQSSSRQKERN 369
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 28.3 bits (64), Expect = 1.8
Identities = 13/41 (31%), Positives = 21/41 (51%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
K G K+ E+ EE + +E K+K +E E ++ E E
Sbjct: 57 KEIGRALKRGEDDAEELIAEVKELKEKLKELEAALDELEAE 97
>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
Length = 200
Score = 28.0 bits (62), Expect = 1.8
Identities = 16/49 (32%), Positives = 26/49 (53%)
Query: 99 EGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
E +L + G KKK+ ++KE EK + E+ + E + E E E E+
Sbjct: 81 EKDLALQPPPGGKKKEKKKKETEKPAQGGEKPDQGPEAKGEGEGHEPED 129
Score = 27.2 bits (60), Expect = 2.9
Identities = 10/44 (22%), Positives = 22/44 (50%)
Query: 103 GFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
G +K++ +KK+ + + +K + E K + E E+ E+
Sbjct: 91 GGKKKEKKKKETEKPAQGGEKPDQGPEAKGEGEGHEPEDPPPED 134
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 28.1 bits (63), Expect = 1.9
Identities = 17/57 (29%), Positives = 32/57 (56%)
Query: 94 SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ KKE F++++ +++ E EEE K ++ K K ++ ++E+ EEE EE
Sbjct: 437 KKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEE 493
>gnl|CDD|119356 cd02877, GH18_hevamine_XipI_class_III, This conserved domain family
includes xylanase inhibitor Xip-I, and the class III
plant chitinases such as hevamine, concanavalin B, and
PPL2, all of which have a glycosyl hydrolase family 18
(GH18) domain. Hevamine is a class III endochitinase
that hydrolyzes the linear polysaccharide chains of
chitin and peptidoglycan and is important for defense
against pathogenic bacteria and fungi. PPL2 (Parkia
platycephala lectin 2) is a class III chitinase from
Parkia platycephala seeds that hydrolyzes beta(1-4)
glycosidic bonds linking
2-acetoamido-2-deoxy-beta-D-glucopyranose units in
chitin.
Length = 280
Score = 28.0 bits (63), Expect = 1.9
Identities = 9/40 (22%), Positives = 19/40 (47%), Gaps = 4/40 (10%)
Query: 17 SIAKRYGGIPAGESNGRRGYM----LTYIIAYIRDFACDY 52
S AK + G+PA GY+ L ++ ++ + ++
Sbjct: 218 SNAKVFLGLPASPEAAGSGYVDPSELASLVLPVKQKSPNF 257
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 28.1 bits (63), Expect = 2.1
Identities = 15/36 (41%), Positives = 26/36 (72%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
K + + +K + E +E+KK+K++EEKKKK+ + EE
Sbjct: 315 SKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEE 350
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 28.0 bits (62), Expect = 2.1
Identities = 13/36 (36%), Positives = 20/36 (55%)
Query: 115 NEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
N+ +E E K E K + E++ + E+ EEE EE
Sbjct: 295 NDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEE 330
>gnl|CDD|215349 PLN02647, PLN02647, acyl-CoA thioesterase.
Length = 437
Score = 27.8 bits (62), Expect = 2.1
Identities = 18/34 (52%), Positives = 22/34 (64%), Gaps = 1/34 (2%)
Query: 117 EKEEEKKKKEE-EEKKKKEEKEREEEKREEENVE 149
E EEEK EE E + K +K+R E+KRE EN E
Sbjct: 225 ETEEEKLLFEEAEARNKLRKKKRGEQKREFENGE 258
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 28.0 bits (62), Expect = 2.1
Identities = 10/40 (25%), Positives = 20/40 (50%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
+ +K NE+ EE ++ EE + + + E E+ E +
Sbjct: 4058 DDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQ 4097
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 27.6 bits (62), Expect = 2.1
Identities = 14/43 (32%), Positives = 28/43 (65%), Gaps = 1/43 (2%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKERE-EEKREEE 146
RK++ + +K ++ ++E K+ EE K +K E++RE +R+ E
Sbjct: 25 RKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAE 67
>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family. This protein is
found in bacteria. Proteins in this family are typically
between 234 to 465 amino acids in length. There is a
conserved GEV sequence motif.Most members are annotated
as being SprA-related.
Length = 261
Score = 27.8 bits (62), Expect = 2.2
Identities = 5/37 (13%), Positives = 25/37 (67%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
+++ EE+ E+ + + ++ +K+++++++E+++
Sbjct: 76 SNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQ 112
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 28.0 bits (63), Expect = 2.2
Identities = 16/55 (29%), Positives = 33/55 (60%), Gaps = 3/55 (5%)
Query: 97 KKEGNLGFRKRQGEKKKKNEEKEEE---KKKKEEEEKKKKEEKEREEEKREEENV 148
++E L + +KK++N EK+E+ K+K +EK+++ E+ E++ E E +
Sbjct: 87 QREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERI 141
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 28.1 bits (62), Expect = 2.3
Identities = 16/45 (35%), Positives = 31/45 (68%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K++ E+++K E+EE+++K+EE ++K +EE+E+ K E E
Sbjct: 220 KKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRA 264
Score = 26.2 bits (57), Expect = 9.5
Identities = 16/51 (31%), Positives = 31/51 (60%)
Query: 100 GNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
G G + G++ +K ++K++E + EE KKK+EE+ + E+ E+ +E
Sbjct: 190 GAEGAQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQE 240
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 28.0 bits (62), Expect = 2.3
Identities = 13/43 (30%), Positives = 23/43 (53%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++K K + EK+ K E + + EK+ EEK+ ++ EE
Sbjct: 469 SAKQKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEEE 511
Score = 26.0 bits (57), Expect = 10.0
Identities = 13/43 (30%), Positives = 24/43 (55%)
Query: 97 KKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKERE 139
K++ + +K + + K K EE +K+ +E K ++ EKE E
Sbjct: 535 KQDLEMEIKKLEHDLKLKEEECRMLEKEAQELRKYQESEKETE 577
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 28.0 bits (62), Expect = 2.3
Identities = 10/45 (22%), Positives = 30/45 (66%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K++ +K +E EEE ++E+++++ ++ + E+E ++++ +E
Sbjct: 33 KKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDE 77
>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
Length = 194
Score = 27.5 bits (61), Expect = 2.4
Identities = 10/37 (27%), Positives = 24/37 (64%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
K++ EK+ +++ +E KK+ ++ +EE +E++N
Sbjct: 3 KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQN 39
Score = 26.0 bits (57), Expect = 8.4
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
EK EE +E KKE +E K+E++E++
Sbjct: 9 EKDLPQEETTDEAPKKEAKEAPKEEDQEKQNP 40
>gnl|CDD|202551 pfam03142, Chitin_synth_2, Chitin synthase. Members of this family
are fungal chitin synthase EC:2.4.1.16 enzymes. They
catalyze chitin synthesis as follows:
UDP-N-acetyl-D-glucosamine +
{(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N) <=> UDP +
{(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N+1).
Length = 527
Score = 27.8 bits (62), Expect = 2.4
Identities = 15/35 (42%), Positives = 20/35 (57%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKERE 139
R +GEKKKK ++E K + K+ EE ERE
Sbjct: 493 RVVEGEKKKKAHGEDEGKFDSSKIPMKRWEEYERE 527
>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal. The C-terminal domain of
FCP-1 is required for interaction with the carboxy
terminal domain of RAP74. Interaction relies extensively
on van der Waals contacts between hydrophobic residues
situated within alpha-helices in both domains.
Length = 263
Score = 27.7 bits (61), Expect = 2.5
Identities = 15/51 (29%), Positives = 26/51 (50%)
Query: 94 SETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
+ + N+G R +G K+K NEE EE+ + + E ++E+ E E
Sbjct: 200 PSSSERSNIGGRGPRGHKRKLNEEDEEDAESESSFESSNEDEEGSSSEADE 250
>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV.
Length = 336
Score = 27.9 bits (62), Expect = 2.5
Identities = 11/32 (34%), Positives = 21/32 (65%), Gaps = 3/32 (9%)
Query: 118 KEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
K+E K +K + KK+K+ +E++E ++ VE
Sbjct: 25 KDEAKPRKIKRVKKRKKREEKDE---LDDEVE 53
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 27.4 bits (61), Expect = 2.5
Identities = 7/37 (18%), Positives = 22/37 (59%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
+ K+K + +++ KK K + + K K + + + + + ++
Sbjct: 83 KPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKK 119
Score = 27.4 bits (61), Expect = 3.2
Identities = 10/40 (25%), Positives = 21/40 (52%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K+K EK+ +K K + + K K + K + + K ++ +
Sbjct: 85 KEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKT 124
Score = 26.7 bits (59), Expect = 5.6
Identities = 9/40 (22%), Positives = 19/40 (47%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
EK K ++ ++ K K + + K K + K + + K+
Sbjct: 86 EKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTA 125
Score = 26.3 bits (58), Expect = 6.7
Identities = 7/32 (21%), Positives = 15/32 (46%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEK 142
KK K + K + K K + + + K ++ +
Sbjct: 95 KKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAA 126
>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
Length = 472
Score = 27.7 bits (62), Expect = 2.6
Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 1/33 (3%)
Query: 113 KKNEEKEEEKK-KKEEEEKKKKEEKEREEEKRE 144
K+ EEKE+EKK KKE +E+++ + + E+ E
Sbjct: 391 KEREEKEKEKKAKKEGKEERRIHFQNKSIEEAE 423
Score = 27.7 bits (62), Expect = 2.6
Identities = 13/33 (39%), Positives = 21/33 (63%), Gaps = 1/33 (3%)
Query: 118 KEEEKKKKEEEEKKK-KEEKEREEEKREEENVE 149
KE E+K+KE++ KK+ KEE+ + + E E
Sbjct: 391 KEREEKEKEKKAKKEGKEERRIHFQNKSIEEAE 423
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 27.9 bits (62), Expect = 2.6
Identities = 12/45 (26%), Positives = 29/45 (64%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K + + KK +++ +E K++EE+ + E++ + EE E+++ +E
Sbjct: 150 KAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDE 194
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 27.5 bits (61), Expect = 2.7
Identities = 15/43 (34%), Positives = 26/43 (60%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
+K++ +KK K+ +KKKKE+EE + +E + + EEN
Sbjct: 52 KKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEEN 94
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 27.5 bits (61), Expect = 2.8
Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 6/42 (14%)
Query: 100 GNLGFRKR------QGEKKKKNEEKEEEKKKKEEEEKKKKEE 135
GN G R EKKKK + K+ + K K++E+KKK E
Sbjct: 74 GNTGISWRRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKARE 115
>gnl|CDD|148072 pfam06244, DUF1014, Protein of unknown function (DUF1014). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 122
Score = 26.6 bits (59), Expect = 2.9
Identities = 13/33 (39%), Positives = 18/33 (54%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
E E+E++K E E+K K+ RE E NV
Sbjct: 7 ERLEKEQRKLEAPEEKSKKNAAREVPLEENLNV 39
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 27.7 bits (62), Expect = 3.0
Identities = 15/62 (24%), Positives = 33/62 (53%), Gaps = 3/62 (4%)
Query: 89 LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
L E ++E ++E ++++ E K EE+ + E ++ E++ R E +RE+E +
Sbjct: 179 LAELKAEEEEELERALKEKREELLSKLEEELLARL---ESKEAALEKQLRLEFEREKEEL 235
Query: 149 EE 150
+
Sbjct: 236 RK 237
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 27.0 bits (60), Expect = 3.0
Identities = 13/42 (30%), Positives = 27/42 (64%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
R+ + + KK+ EKE+EK+++ + + E++ E EK++ E
Sbjct: 73 RRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAE 114
Score = 26.2 bits (58), Expect = 5.1
Identities = 17/47 (36%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKE-EKEREEEKREEENVEE 150
+ R+ + K K EE E+EK+K+E K E EKER E ++++ +
Sbjct: 71 KLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKL 117
Score = 25.4 bits (56), Expect = 9.3
Identities = 12/46 (26%), Positives = 31/46 (67%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
RK+ EK ++++ K ++++ ++E+EK+++ K E ++E +E+
Sbjct: 65 RKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEK 110
>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 145
Score = 27.0 bits (60), Expect = 3.1
Identities = 8/33 (24%), Positives = 13/33 (39%)
Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
+KN+ E+ E + +KREE
Sbjct: 104 RKNDAHTHASILLSSNEQNSTEALQLRAKKREE 136
>gnl|CDD|163616 cd00840, MPP_Mre11_N, Mre11 nuclease, N-terminal
metallophosphatase domain. Mre11 (also known as SbcD
in Escherichia coli) is a subunit of the MRX protein
complex. This complex includes: Mre11, Rad50, and
Xrs2/Nbs1, and plays a vital role in several nuclear
processes including DNA double-strand break repair,
telomere length maintenance, cell cycle checkpoint
control, and meiotic recombination, in eukaryotes.
During double-strand break repair, the MRX complex is
required to hold the two ends of a broken chromosome
together. In vitro studies show that Mre11 has 3'-5'
exonuclease activity on dsDNA templates and
endonuclease activity on dsDNA and ssDNA templates. In
addition to the N-terminal phosphatase domain, the
eukaryotic MRE11 members of this family have a
C-terminal DNA binding domain (not included in this
alignment model). MRE11-like proteins are found in
prokaryotes and archaea was well as in eukaryotes.
Mre11 belongs to the metallophosphatase (MPP)
superfamily. MPPs are functionally diverse, but all
share a conserved domain with an active site consisting
of two metal ions (usually manganese, iron, or zinc)
coordinated with octahedral geometry by a cage of
histidine, aspartate, and asparagine residues. The MPP
superfamily includes: Mre11/SbcD-like exonucleases,
Dbr1-like RNA lariat debranching enzymes, YfcE-like
phosphodiesterases, purple acid phosphatases (PAPs),
YbbF-like UDP-2,3-diacylglucosamine hydrolases, and
acid sphingomyelinases (ASMases). The conserved domain
is a double beta-sheet sandwich with a di-metal active
site made up of residues located at the C-terminal side
of the sheets. This domain is thought to allow for
productive metal coordination.
Length = 223
Score = 27.3 bits (61), Expect = 3.1
Identities = 11/43 (25%), Positives = 20/43 (46%), Gaps = 1/43 (2%)
Query: 38 LTYIIAYIRDFACDYYFI-GDSFETSVPWDKTVLLCINVKKRL 79
I+ + D+ I GD F+++ P + + L I +RL
Sbjct: 30 FEEIVELAIEEKVDFVLIAGDLFDSNNPSPEALELLIEALRRL 72
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 27.7 bits (61), Expect = 3.2
Identities = 17/68 (25%), Positives = 28/68 (41%), Gaps = 8/68 (11%)
Query: 91 EFRSETKKEGNLGFRKRQGEKKKK--------NEEKEEEKKKKEEEEKKKKEEKEREEEK 142
E + +K ++K EEK E+K +K + KK E+ E + +
Sbjct: 97 ELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSE 156
Query: 143 REEENVEE 150
E +VEE
Sbjct: 157 SEVSDVEE 164
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 27.5 bits (61), Expect = 3.3
Identities = 11/46 (23%), Positives = 23/46 (50%)
Query: 102 LGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
RKR+ + E ++ + + +E + EKER+ + E++N
Sbjct: 353 GPTRKRKRRRVPPLPEYSSDEDEDDSDEDEVDYEKERKRRREEDKN 398
>gnl|CDD|235185 PRK03980, PRK03980, flap endonuclease-1; Provisional.
Length = 292
Score = 27.1 bits (61), Expect = 3.3
Identities = 12/30 (40%), Positives = 19/30 (63%)
Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
K EE EE ++ +EE E+K +E KE + +
Sbjct: 40 KAEEIEERREVREEAEEKYEEAKEEGDLEE 69
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 27.4 bits (61), Expect = 3.3
Identities = 11/50 (22%), Positives = 24/50 (48%)
Query: 101 NLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
N K E++++ E++ EE+ + +K + + + E E +EE
Sbjct: 107 NYEADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEE 156
>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 406
Score = 27.2 bits (61), Expect = 3.3
Identities = 13/63 (20%), Positives = 25/63 (39%), Gaps = 5/63 (7%)
Query: 93 RSETKKEGNLGFRKRQG-----EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
+ LG + K + + K EK+K+ ++++ E E KRE +
Sbjct: 1 EELDALDEILGDLESYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDR 60
Query: 148 VEE 150
+ E
Sbjct: 61 LRE 63
>gnl|CDD|217206 pfam02731, SKIP_SNW, SKIP/SNW domain. This domain is found in
chromatin proteins.
Length = 158
Score = 26.9 bits (60), Expect = 3.4
Identities = 13/32 (40%), Positives = 22/32 (68%), Gaps = 2/32 (6%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKERE--EEKREE 145
++ E +++ E+EK++KEEK RE + REE
Sbjct: 126 RQRAELQRQLAEKEKQEKEEKLRELAQRAREE 157
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 27.1 bits (60), Expect = 3.6
Identities = 15/40 (37%), Positives = 27/40 (67%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K KK++++E++ + + E+K+ +E E EEE EEE E+
Sbjct: 243 KNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEED 282
>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
Endonuclease that cleaves the 5'-overhanging flap
structure that is generated by displacement synthesis
when DNA polymerase encounters the 5'-end of a
downstream Okazaki fragment. Has 5'-endo-/exonuclease
and 5'-pseudo-Y-endonuclease activities. Cleaves the
junction between single and double-stranded regions of
flap DNA.
Length = 338
Score = 27.2 bits (61), Expect = 3.6
Identities = 10/30 (33%), Positives = 18/30 (60%)
Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKR 143
K E EE ++ +EE E+K +E E+ + +
Sbjct: 87 KAETLEERREIREEAEEKWEEALEKGDLEE 116
>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
Length = 819
Score = 27.4 bits (61), Expect = 3.7
Identities = 17/65 (26%), Positives = 30/65 (46%)
Query: 77 KRLTRECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEK 136
K+ E + L EF+ KE L + EK + E+ E +++ ++E KK +K
Sbjct: 440 KKFLEEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKK 499
Query: 137 EREEE 141
E +
Sbjct: 500 IPEVD 504
Score = 26.2 bits (58), Expect = 8.8
Identities = 11/64 (17%), Positives = 24/64 (37%)
Query: 87 NGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
+ + L + + ++ + + +E+ KKE +K KK + EE
Sbjct: 453 EEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEVDTYLLLEEL 512
Query: 147 NVEE 150
+ E
Sbjct: 513 GINE 516
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 26.5 bits (59), Expect = 3.7
Identities = 19/60 (31%), Positives = 34/60 (56%)
Query: 91 EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E ++ T KE + K + + +KK EEK+ E +K + ++K E E E+ ++E E +E
Sbjct: 4 EEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEA 63
Score = 26.1 bits (58), Expect = 5.4
Identities = 12/36 (33%), Positives = 19/36 (52%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
E +K + EEK + E E+ +K+ E+ E R E
Sbjct: 34 EAEKLAKMSAEEKAEYELEKLEKELEELEAELARRE 69
>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
Length = 448
Score = 27.4 bits (61), Expect = 3.8
Identities = 8/40 (20%), Positives = 18/40 (45%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
K + K E+ E + +E +K+ E++ +E +
Sbjct: 61 KEVAKLKIAKEDATELIAETKELKKEITEKEAEVQEAKAA 100
>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
outer membrane].
Length = 170
Score = 27.0 bits (60), Expect = 3.8
Identities = 10/42 (23%), Positives = 22/42 (52%)
Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
F+KRQ E +K +E + ++ K +++ K + + E +
Sbjct: 53 FKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIK 94
>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
Length = 205
Score = 26.9 bits (60), Expect = 3.9
Identities = 12/40 (30%), Positives = 25/40 (62%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
KKKK +++EE KK K++ E + + R++ + ++ + E
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKALLE 162
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 26.0 bits (58), Expect = 3.9
Identities = 9/21 (42%), Positives = 14/21 (66%)
Query: 121 EKKKKEEEEKKKKEEKEREEE 141
EEEKK++EE+E E++
Sbjct: 61 AAAAAAEEEKKEEEEEEEEDD 81
Score = 25.3 bits (56), Expect = 5.9
Identities = 8/20 (40%), Positives = 12/20 (60%)
Query: 127 EEEKKKKEEKEREEEKREEE 146
+EEK+ EEE+ EE+
Sbjct: 61 AAAAAAEEEKKEEEEEEEED 80
Score = 25.3 bits (56), Expect = 7.0
Identities = 9/21 (42%), Positives = 12/21 (57%)
Query: 121 EKKKKEEEEKKKKEEKEREEE 141
E++KKEE+E EEE
Sbjct: 59 AAAAAAAAEEEKKEEEEEEEE 79
Score = 25.3 bits (56), Expect = 7.9
Identities = 7/21 (33%), Positives = 14/21 (66%)
Query: 109 GEKKKKNEEKEEEKKKKEEEE 129
EEK+EE++++EE++
Sbjct: 61 AAAAAAEEEKKEEEEEEEEDD 81
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 26.9 bits (60), Expect = 4.0
Identities = 11/34 (32%), Positives = 20/34 (58%)
Query: 117 EKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E+E+++ K+ +E +E E +EE+ EE E
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAE 168
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The function
is unclear.
Length = 151
Score = 26.5 bits (58), Expect = 4.1
Identities = 17/40 (42%), Positives = 23/40 (57%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
KK+K EK+ + KKKEE+ K E E +E E E + E
Sbjct: 29 KKEKVSEKKGKSKKKEEKPNGKIPEHEPNQEVTEVEVIIE 68
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 26.6 bits (59), Expect = 4.1
Identities = 10/35 (28%), Positives = 26/35 (74%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E ++E+K+ +E E+K+ + ++E+++++E+ V E
Sbjct: 94 ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAE 128
Score = 26.6 bits (59), Expect = 4.5
Identities = 13/30 (43%), Positives = 23/30 (76%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKERE 139
EK++K KE E+K+ + +K+KKE+KE++
Sbjct: 96 EKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125
Score = 25.9 bits (57), Expect = 6.6
Identities = 9/32 (28%), Positives = 24/32 (75%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
E +K+ +E +E ++KE + KK++++++E++
Sbjct: 94 ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125
>gnl|CDD|215505 PLN02935, PLN02935, Bifunctional NADH kinase/NAD(+) kinase.
Length = 508
Score = 27.1 bits (60), Expect = 4.2
Identities = 13/44 (29%), Positives = 22/44 (50%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
+R E K + + E K+K E E+ + ++ E +E E N E
Sbjct: 51 RRVAEGKALAQAEAAEWKRKYELERARNQQLEHKELSSGECNEE 94
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 26.6 bits (59), Expect = 4.3
Identities = 10/36 (27%), Positives = 19/36 (52%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
+ K+K E EE+K+ KK ++ E K+++
Sbjct: 161 DAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 26.6 bits (59), Expect = 4.6
Identities = 11/46 (23%), Positives = 23/46 (50%)
Query: 99 EGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
+ L + +R+ K+K+ EE K+ KK ++ E +K++
Sbjct: 151 QAGLWYAERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 26.2 bits (58), Expect = 6.3
Identities = 13/35 (37%), Positives = 16/35 (45%), Gaps = 5/35 (14%)
Query: 91 EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKK 125
EF +E +KE KK +K E KKKK
Sbjct: 167 EFAAEERKE-----ALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ. Members of
this family are the FliJ protein found, in nearly every
case, in the midst of other flagellar biosynthesis genes
in bacgterial genomes. Typically the fliJ gene is found
adjacent to the gene for the flagellum-specific ATPase
FliI. Sequence scoring in the gray zone between trusted
and noise cutoffs include both probable FliJ proteins
and components of bacterial type III secretion systems.
Length = 141
Score = 26.5 bits (59), Expect = 4.3
Identities = 14/47 (29%), Positives = 28/47 (59%)
Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++ K+++ E E K E+ ++KK++E EE KRE++ ++E
Sbjct: 87 LQQEVEAKRERLLEARRELKALEKLKEKKQKEYRAEEAKREQKEMDE 133
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 26.4 bits (59), Expect = 4.4
Identities = 13/36 (36%), Positives = 26/36 (72%), Gaps = 2/36 (5%)
Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEE--EKKKKEEKE 137
F++++ +KKKK E + E+KKK+ E++K+++K
Sbjct: 81 FKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
Score = 25.6 bits (57), Expect = 8.3
Identities = 11/31 (35%), Positives = 23/31 (74%)
Query: 114 KNEEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
K ++K+++KKK+ E ++KK++ EE++E
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKE 112
Score = 25.6 bits (57), Expect = 9.0
Identities = 13/38 (34%), Positives = 26/38 (68%)
Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K+ ++K+++KK+ E E+KKK+ EE K +++ V+
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDA 119
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 26.9 bits (59), Expect = 4.5
Identities = 15/41 (36%), Positives = 28/41 (68%), Gaps = 1/41 (2%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEE-KEREEEKREE 145
++ EK+ +E++E+ KK E+ E+ K EE KE +E K+++
Sbjct: 208 EKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKK 248
>gnl|CDD|197876 smart00792, Agouti, Agouti protein. The agouti protein regulates
pigmentation in the mouse hair follicle producing a
black hair with a subapical yellow band. A highly
homologous protein agouti signal protein (ASIP) is
present in humans and is expressed at highest levels in
adipose tissue where it may play a role in energy
homeostasis and possibly human pigmentation.
Length = 124
Score = 26.4 bits (58), Expect = 4.5
Identities = 10/25 (40%), Positives = 14/25 (56%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEE 135
K KK +E EKK +++EKK
Sbjct: 53 KPKKISAEEAEKKLLQKKEKKALTN 77
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 26.9 bits (60), Expect = 4.7
Identities = 18/57 (31%), Positives = 32/57 (56%), Gaps = 8/57 (14%)
Query: 102 LGFRKRQ--GEKKKKNEEK---EEEKKKKEEEEKKKK---EEKEREEEKREEENVEE 150
L F +R ++ K EE E E+ KK E E+ ++ EE++ EEE+ +E+ ++
Sbjct: 245 LTFDRRAQPTDRTKTEEELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSKESADD 301
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 26.7 bits (59), Expect = 4.8
Identities = 13/60 (21%), Positives = 30/60 (50%)
Query: 73 INVKKRLTRECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKK 132
I ++ ++ N R + + E L F K + E+++ +KEEE+++ + + K+
Sbjct: 122 IETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQ 181
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 26.6 bits (59), Expect = 4.9
Identities = 18/43 (41%), Positives = 24/43 (55%), Gaps = 2/43 (4%)
Query: 110 EKKKKNEEKEEEKKKKEE--EEKKKKEEKEREEEKREEENVEE 150
K K+E EE+ K+ E EE K KEE EE +EE +E+
Sbjct: 5 CKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIED 47
Score = 26.6 bits (59), Expect = 5.3
Identities = 15/41 (36%), Positives = 26/41 (63%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+ K+NE KEE+K K+E+ E ++ E++E E+ E V+
Sbjct: 16 DCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKI 56
>gnl|CDD|205200 pfam13019, Telomere_Sde2, Telomere stability and silencing. Sde2
has been identified in fission yeast as an important
factor in telomere formation and maintenance. This is a
more N-terminal domain on these nuclear proteins, and is
essential for telomeric silencing and genomic stability.
Length = 163
Score = 26.3 bits (58), Expect = 5.0
Identities = 13/33 (39%), Positives = 16/33 (48%), Gaps = 2/33 (6%)
Query: 115 NEEKE--EEKKKKEEEEKKKKEEKEREEEKREE 145
NE K E K E EKK+KE + + E E
Sbjct: 127 NEAKALAEYLAIKPEMEKKEKEARRKRWESVVE 159
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 27.0 bits (60), Expect = 5.1
Identities = 12/57 (21%), Positives = 25/57 (43%)
Query: 91 EFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
R E ++E +K + + + E + +E E+ E E EE +++ +N
Sbjct: 74 REREELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDN 130
Score = 26.2 bits (58), Expect = 9.5
Identities = 10/61 (16%), Positives = 26/61 (42%)
Query: 89 LLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
LL +++ R+ ++++ +KEE+ + E+ + + E E+ +
Sbjct: 60 LLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSAREL 119
Query: 149 E 149
E
Sbjct: 120 E 120
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 26.2 bits (58), Expect = 5.3
Identities = 11/27 (40%), Positives = 21/27 (77%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEK 136
K+KK ++K+++KKKK+ +K K++K
Sbjct: 109 AKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 26.2 bits (58), Expect = 6.0
Identities = 10/28 (35%), Positives = 21/28 (75%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKK 133
K + ++KK+ ++K+++KKKK ++ KK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKK 133
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 26.7 bits (60), Expect = 5.3
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKRE 144
EE K++ E K+ EEKE++ EK E
Sbjct: 514 EEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 25.9 bits (58), Expect = 9.2
Identities = 9/31 (29%), Positives = 17/31 (54%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
E+E K++ E KE +E+E++ + E
Sbjct: 512 VEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
>gnl|CDD|205219 pfam13038, DUF3899, Domain of unknown function (DUF3899). Putative
Tryptophanyl-tRNA synthetase.
Length = 92
Score = 25.7 bits (57), Expect = 5.3
Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 2/34 (5%)
Query: 103 GFRK--RQGEKKKKNEEKEEEKKKKEEEEKKKKE 134
GFR+ RQ +KK+ + EE++ +++ KKKK
Sbjct: 33 GFRRLFRQLVEKKEKDSIEEDEFANDKKVKKKKY 66
>gnl|CDD|219996 pfam08738, Gon7, Gon7 family. In S. cerevisiae Gon7 is a member of
the KEOPS protein complex. A protein complex proposed to
be involved in transcription and promoting telomere
uncapping and telomere elongation.
Length = 97
Score = 25.5 bits (56), Expect = 5.7
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEE 141
E EE+KKK + + +KE +++EEE
Sbjct: 69 ERMEEDKKKAAAKGRGEKEIEDKEEE 94
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 26.6 bits (59), Expect = 5.8
Identities = 15/46 (32%), Positives = 23/46 (50%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
R+R E + + EE EE ++ +E+ + KEE E E EE
Sbjct: 308 RERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLL 353
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 26.3 bits (59), Expect = 5.8
Identities = 14/34 (41%), Positives = 24/34 (70%)
Query: 107 RQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
RQG + EE+EEE +++EEEE++++ E+ E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 25.9 bits (58), Expect = 9.8
Identities = 11/34 (32%), Positives = 23/34 (67%)
Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
++ E ++++EE E++++EE+E E E+ E E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 26.8 bits (58), Expect = 5.9
Identities = 16/53 (30%), Positives = 29/53 (54%), Gaps = 6/53 (11%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
++E KKE L E+ K+E+ + K +++ +K EK+RE +RE+
Sbjct: 146 KNEKKKERQLA------EQLAAKRLKDEQHRHKARKQELRKREKDRERARRED 192
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 26.4 bits (58), Expect = 5.9
Identities = 9/57 (15%), Positives = 17/57 (29%), Gaps = 2/57 (3%)
Query: 93 RSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
E K+ + K +K KKK + K K + ++ +
Sbjct: 141 PPEAKEVKKPKKGQSPKVPKAPKPKKP--KKKGSVSNRSVKMPGIDPRSKPDWKSQD 195
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 25.8 bits (57), Expect = 6.1
Identities = 9/33 (27%), Positives = 17/33 (51%)
Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEE 141
E E + EEEEK+++ E+E +++
Sbjct: 70 AAAAAAGAEAAAEADEAEEEEKEEEAEEESDDD 102
Score = 25.1 bits (55), Expect = 9.7
Identities = 7/33 (21%), Positives = 16/33 (48%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREE 140
E E + +EEE++++ EE+ ++
Sbjct: 70 AAAAAAGAEAAAEADEAEEEEKEEEAEEESDDD 102
>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly related
to archaeal Holliday junction resolvase [Nucleotide
transport and metabolism].
Length = 175
Score = 26.3 bits (58), Expect = 6.2
Identities = 16/51 (31%), Positives = 30/51 (58%), Gaps = 4/51 (7%)
Query: 99 EGNLGFRKRQGEKKKKNEEKE----EEKKKKEEEEKKKKEEKEREEEKREE 145
+G + + R+ E+ + E+E E + +KEEE K K+ +++ EE RE+
Sbjct: 28 QGKVESKARELEETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEEARED 78
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 26.6 bits (58), Expect = 6.4
Identities = 18/53 (33%), Positives = 30/53 (56%)
Query: 96 TKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
T GN+G Q KK N K+ K+ + +E+++++EE+E EE + EE
Sbjct: 105 TPGTGNIGLAALQLPKKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQ 157
>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 26.3 bits (58), Expect = 6.5
Identities = 10/46 (21%), Positives = 22/46 (47%)
Query: 102 LGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEEN 147
L +++ +K K+ + E+ K E++K + ++ E E E
Sbjct: 72 LYALQKELLEKLKDLAEVNERLKALEKKKAELKDLEEEREGILRSK 117
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 26.2 bits (58), Expect = 6.6
Identities = 12/35 (34%), Positives = 20/35 (57%)
Query: 115 NEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
EE E K + EEE KKK++K++++ K+
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGS 200
>gnl|CDD|220112 pfam09110, HAND, HAND. The HAND domain adopts a secondary
structure consisting of four alpha helices, three of
which (H2, H3, H4) form an L-like configuration. Helix
H2 runs antiparallel to helices H3 and H4, packing
closely against helix H4, whilst helix H1 reposes in the
concave surface formed by these three helices and runs
perpendicular to them. The domain confers DNA and
nucleosome binding properties to the protein.
Length = 109
Score = 25.7 bits (57), Expect = 6.7
Identities = 15/55 (27%), Positives = 32/55 (58%), Gaps = 8/55 (14%)
Query: 104 FRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKER--------EEEKREEENVEE 150
++K+ G K ++ + +++ +EE E K+K E+E+ EEE+ E++ + E
Sbjct: 53 YKKKIGYKVPLDDVGDGDEEDEEEREAKRKLEQEKIDNAEPLTEEEEEEKQRLLE 107
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 26.4 bits (58), Expect = 6.8
Identities = 13/52 (25%), Positives = 24/52 (46%)
Query: 95 ETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
E E R+ ++K + E KE+ + E+ K+EE +E + +E
Sbjct: 83 EPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDE 134
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 26.6 bits (59), Expect = 6.9
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEE 135
RK + ++ K+ KE E++ KEE++ KKK
Sbjct: 277 RKTKAQRNKEKRRKELEREAKEEKQLKKKLA 307
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 26.3 bits (58), Expect = 7.1
Identities = 12/71 (16%), Positives = 25/71 (35%)
Query: 76 KKRLTRECTVSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEKEEEKKKKEEEEKKKKEE 135
KK++ ++ + + F +K + E+ KKK + K++
Sbjct: 80 KKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKK 139
Query: 136 KEREEEKREEE 146
K+ K E
Sbjct: 140 KKVLSSKDELI 150
>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b. This model
describes the F1/F0 ATP synthase b subunit in bacteria
only. Scoring just below the trusted cutoff are the
N-terminal domains of Mycobacterial b/delta fusion
proteins and a subunit from an archaeon, Methanosarcina
barkeri, in which the ATP synthase homolog differs in
architecture and is not experimentally confirmed. This
model helps resolve b from the related b' subunit.
Within the family is an example from a
sodium-translocating rather than proton-translocating
ATP synthase [Energy metabolism, ATP-proton motive force
interconversion].
Length = 147
Score = 25.8 bits (57), Expect = 7.2
Identities = 14/40 (35%), Positives = 21/40 (52%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K+ +E EE K + EE +K K + E E +E+ EE
Sbjct: 71 NKRGSEILEEAKAEAREEREKIKAQARAEIEAEKEQAREE 110
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 26.1 bits (58), Expect = 7.2
Identities = 19/39 (48%), Positives = 24/39 (61%)
Query: 112 KKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
E+K EE +E EE +K EE+E EEE+ EEEN E
Sbjct: 1 MSDKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELE 39
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 26.6 bits (59), Expect = 7.3
Identities = 15/40 (37%), Positives = 22/40 (55%)
Query: 110 EKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVE 149
E + K +E EE + EE+ ++ KEE E E + EE E
Sbjct: 327 ELESKLDELAEELAELEEKLEELKEELESLEAELEELEAE 366
>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated. This domain
appears in eukaryotes as well as bacteria and tends to
be found near the C-terminus of the metalloprotease M16C
(pfam05193).
Length = 248
Score = 26.0 bits (58), Expect = 7.4
Identities = 14/39 (35%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 110 EKKKKNEEKEEEKKKKE--EEEKKKKEEKEREEEKREEE 146
EK + E ++ EKKK EE+K+K E+ E E+ +
Sbjct: 9 EKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQST 47
>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
(DUF2098). This domain, found in various hypothetical
prokaryotic proteins, has no known function.
Length = 91
Score = 25.3 bits (56), Expect = 7.7
Identities = 12/34 (35%), Positives = 20/34 (58%)
Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
K E+KEE ++ KEE ++ K+E+E E+
Sbjct: 51 KVKEKKEEREEDKEELIERIKKEEETFEDVDLGS 84
>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
family. The proteins represented by this model contain
three RNA recognition motifs (rrm: pfam00076) and have
been characterized as poly-pyrimidine tract binding
proteins associated with RNA splicing factors. In the
case of PUF60 (GP|6176532), in complex with p54, and in
the presence of U2AF, facilitates association of U2
snRNP with pre-mRNA.
Length = 612
Score = 26.2 bits (57), Expect = 7.9
Identities = 12/64 (18%), Positives = 23/64 (35%), Gaps = 2/64 (3%)
Query: 88 GLLEFRSETKKEGNLGFRKRQGEKKKKN--EEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
E E + + GE E K+++K+K+ EE + K + +
Sbjct: 431 AWKEPSKEDQTSEDGKMLAIMGEAAAALALEPKKKKKEKEGEELQPKLVMNSEDASLASQ 490
Query: 146 ENVE 149
E +
Sbjct: 491 EGMS 494
>gnl|CDD|232958 TIGR00399, metG_C_term, methionyl-tRNA synthetase C-terminal
region/beta chain. The methionyl-tRNA synthetase (metG)
is a class I amino acyl-tRNA ligase. This model
describes a region of the methionyl-tRNA synthetase that
is present at the C-terminus of MetG in some species (E.
coli, B. subtilis, Thermotoga maritima, Methanobacterium
thermoautotrophicum), and as a separate beta chain in
Aquifex aeolicus. It is absent in a number of other
species (e.g. Mycoplasma genitalium, Mycobacterium
tuberculosis), while Pyrococcus horikoshii has both a
full length MetG and a second protein homologous to the
beta chain only. Proteins hit by This model should
called methionyl-tRNA synthetase beta chain if and only
if the model metG hits a separate protein not also hit
by This model [Protein synthesis, tRNA aminoacylation].
Length = 137
Score = 25.8 bits (57), Expect = 8.0
Identities = 12/31 (38%), Positives = 21/31 (67%)
Query: 116 EEKEEEKKKKEEEEKKKKEEKEREEEKREEE 146
++K EE K K ++K+KK+E E+ E ++E
Sbjct: 1 DKKIEELKLKGAKKKEKKDEGEKALEPQKET 31
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 25.9 bits (57), Expect = 8.1
Identities = 18/45 (40%), Positives = 29/45 (64%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
++ E+K + EE E+EK+ E E + E++E+EE +EEE EE
Sbjct: 52 SKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEE 96
>gnl|CDD|192445 pfam10018, Med4, Vitamin-D-receptor interacting Mediator subunit 4.
Members of this family function as part of the Mediator
(Med) complex, which links DNA-bound transcriptional
regulators and the general transcription machinery,
particularly the RNA polymerase II enzyme. They play a
role in basal transcription by mediating activation or
repression according to the specific complement of
transcriptional regulators bound to the promoter.
Length = 176
Score = 25.8 bits (57), Expect = 8.2
Identities = 14/37 (37%), Positives = 23/37 (62%)
Query: 109 GEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
G + + +EEE++K E E K +EE ERE ++R +
Sbjct: 140 GIEPPGFDPEEEEERKAREAEAKTEEEDEREAQERNQ 176
>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage [Cellular processes, DNA transformation].
Length = 410
Score = 26.2 bits (58), Expect = 8.4
Identities = 9/33 (27%), Positives = 16/33 (48%)
Query: 113 KKNEEKEEEKKKKEEEEKKKKEEKEREEEKREE 145
K +E++E ++ EE E + E E E+
Sbjct: 140 PKTDEEKELLREVEELESRLATEPSPAPELEEQ 172
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 25.3 bits (55), Expect = 8.7
Identities = 11/17 (64%), Positives = 15/17 (88%)
Query: 114 KNEEKEEEKKKKEEEEK 130
K E K+EEKK++EEEE+
Sbjct: 87 KAEAKKEEKKEEEEEEE 103
>gnl|CDD|240407 PTZ00412, PTZ00412, leucyl aminopeptidase; Provisional.
Length = 569
Score = 26.1 bits (57), Expect = 8.7
Identities = 9/29 (31%), Positives = 20/29 (68%)
Query: 106 KRQGEKKKKNEEKEEEKKKKEEEEKKKKE 134
KR+G KK +N +K+ + +K+ +++K+
Sbjct: 541 KRKGIKKAQNAKKDLKVVQKKRHGRRRKQ 569
>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
Provisional.
Length = 130
Score = 25.6 bits (56), Expect = 8.7
Identities = 9/20 (45%), Positives = 14/20 (70%)
Query: 122 KKKKEEEEKKKKEEKEREEE 141
KKEEE++++ E +EEE
Sbjct: 94 TSKKEEEKEEEIPEPTKEEE 113
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 26.0 bits (58), Expect = 8.8
Identities = 17/73 (23%), Positives = 31/73 (42%), Gaps = 14/73 (19%)
Query: 85 VSNGLLEFRSETKKEGNLGFRKRQGEKKKKNEEK------------EEEKKKKEEEEKKK 132
+S L + RS K R+ + + K+ E+ E+E K+ EEE +
Sbjct: 55 LSEALDKLRSYLPKLN--PLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISEL 112
Query: 133 KEEKEREEEKREE 145
+ E + E++ E
Sbjct: 113 ENEIKELEQEIER 125
>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain. This family consists
of several ATP synthase E chain sequences which are
components of the CF(0) subunit.
Length = 83
Score = 24.7 bits (54), Expect = 8.8
Identities = 8/35 (22%), Positives = 20/35 (57%)
Query: 105 RKRQGEKKKKNEEKEEEKKKKEEEEKKKKEEKERE 139
+K + ++ + +EK EK K +KK+ +++ +
Sbjct: 34 KKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAKS 68
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 26.2 bits (58), Expect = 8.9
Identities = 11/43 (25%), Positives = 22/43 (51%)
Query: 108 QGEKKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K++K + K++ KKK ++ KK ++ + + EN E
Sbjct: 8 NEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEA 50
>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
[DNA replication, recombination, and repair].
Length = 417
Score = 26.2 bits (58), Expect = 9.1
Identities = 17/56 (30%), Positives = 29/56 (51%), Gaps = 2/56 (3%)
Query: 97 KKEGNLGFRKRQGEKKKKNEEK--EEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
+KE L ++ K+K+N K E KK +E++ K E + EE + EN ++
Sbjct: 360 RKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLENGKQ 415
>gnl|CDD|204614 pfam11221, Med21, Subunit 21 of Mediator complex. Med21 has been
known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in
Drosophila. The heterodimer of the two subunits Med7 and
Med21 appears to act as a hinge between the middle and
the tail regions of Mediator.
Length = 132
Score = 25.3 bits (56), Expect = 9.4
Identities = 13/34 (38%), Positives = 24/34 (70%)
Query: 115 NEEKEEEKKKKEEEEKKKKEEKEREEEKREEENV 148
+ E+E+ ++ KE EE+ ++ E EREE +E+E +
Sbjct: 85 SSEEEQLRRIKELEEELREVEAEREEAVKEKEKL 118
>gnl|CDD|241412 cd13258, PH_PLEKHJ1, Pleckstrin homology domain containing, family
J member 1 Pleckstrin homology (PH) domain. PLEKHJ1
(also called GNRPX2/Guanine nucleotide-releasing protein
x ). It contains a single PH domain. Very little
information is known about PLEKHJ1. PLEKHJ1 has been
shown to interact with IKBKG (inhibitor of kappa light
polypeptide gene enhancer in B-cells, kinase gamma) and
KRT33B (keratin 33B). PH domains have diverse functions,
but in general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 144
Score = 25.6 bits (56), Expect = 9.7
Identities = 9/24 (37%), Positives = 12/24 (50%)
Query: 42 IAYIRDFACDYYFIGDSFETSVPW 65
I++I + YYF S E V W
Sbjct: 86 ISFIDEPERKYYFECRSEEQCVEW 109
>gnl|CDD|224403 COG1486, CelF, Alpha-galactosidases/6-phospho-beta-glucosidases,
family 4 of glycosyl hydrolases [Carbohydrate transport
and metabolism].
Length = 442
Score = 26.1 bits (58), Expect = 9.8
Identities = 13/40 (32%), Positives = 22/40 (55%)
Query: 111 KKKKNEEKEEEKKKKEEEEKKKKEEKEREEEKREEENVEE 150
K+KK +E E+ + E K++KE E ++ +E EE
Sbjct: 268 KEKKPDEIEKFGTRANEVMKREKELFELYKKPELKEKPEE 307
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.308 0.131 0.364
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,799,653
Number of extensions: 743794
Number of successful extensions: 15107
Number of sequences better than 10.0: 1
Number of HSP's gapped: 9292
Number of HSP's successfully gapped: 2625
Length of query: 150
Length of database: 10,937,602
Length adjustment: 88
Effective length of query: 62
Effective length of database: 7,034,450
Effective search space: 436135900
Effective search space used: 436135900
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 54 (24.4 bits)