RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5557
(213 letters)
>gnl|CDD|212475 cd01728, LSm1, Like-Sm protein 1. The eukaryotic LSm proteins
(LSm1-7) assemble into a hetero-heptameric ring around
the 3'-terminus of the gamma-methyl triphosphate
(gamma-m-P3) capped U6 snRNA. Accumulation of uridylated
RNAs in an lsm1 mutant suggests an involvement of the
LSm1-7 complex in recognition of the 3' uridylation tag
and recruitment of the decapping machinery. LSm1-7,
together with Pat1, are also called the decapping
activator. Members of this family share a highly
conserved Sm fold containing an N-terminal helix
followed by a strongly bent five-stranded antiparallel
beta-sheet.
Length = 74
Score = 116 bits (292), Expect = 4e-34
Identities = 42/52 (80%), Positives = 46/52 (88%)
Query: 127 RTLIGYLRSVDQFANLVLHKTIERIHVGNQYGDIPRGIFIIRGENVVLMGEV 178
R LIG LRS DQFANLVL T+ERI VGNQYGDIPRG+FIIRGENVVL+GE+
Sbjct: 23 RKLIGILRSFDQFANLVLEDTVERIIVGNQYGDIPRGLFIIRGENVVLLGEI 74
>gnl|CDD|212474 cd01727, LSm8, Like-Sm protein 8. The eukaryotic LSm proteins
(LSm2-8 or LSm1-7) assemble into a hetero-heptameric
ring around the 3'-terminus uridylation tag of the
gamma-methyl triphosphate (gamma-m-P3) capped U6 snRNA.
LSm2-8 form the core of the snRNP particle that, in
turn, assembles with other components onto the pre-mRNA
to form the spliceosome which is responsible for the
excision of introns and the ligation of exons. LSm1-7 is
involved in recognition of the 3' uridylation tag and
recruitment of the decapping machinery. LSm657 is
believed to be an assembly intermediate for both the
LSm1-7 and LSm2-8 rings. Members of this family share a
highly conserved Sm fold containing an N-terminal helix
followed by a strongly bent five-stranded antiparallel
beta-sheet.
Length = 91
Score = 54.5 bits (132), Expect = 3e-10
Identities = 19/53 (35%), Positives = 33/53 (62%), Gaps = 1/53 (1%)
Query: 127 RTLIGYLRSVDQFANLVLHKTIERI-HVGNQYGDIPRGIFIIRGENVVLMGEV 178
R ++G L+ DQ NL+L ER+ ++P G++++RG+NV ++GEV
Sbjct: 20 RVIVGTLKGFDQTTNLILSNCHERVYSSDEGVEEVPLGLYLLRGDNVAVIGEV 72
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 57.6 bits (140), Expect = 6e-10
Identities = 26/74 (35%), Positives = 60/74 (81%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
K+ K+ KK +K EKK+++++K+++K+ KKKEE++++++E+K+++KEE++++ +
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Query: 88 EEKKQKKKEEKKQK 101
EEK+++++++KKQ
Sbjct: 464 EEKEEEEEKKKKQA 477
Score = 57.6 bits (140), Expect = 7e-10
Identities = 32/73 (43%), Positives = 59/73 (80%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
G +K KK ++ +K +KK E+++K+K++K K+K+EE+EE+KEKK+EEK++++EE
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Query: 75 KDKKKEEKKQKKK 87
+++K+EE+++KKK
Sbjct: 463 EEEKEEEEEKKKK 475
Score = 57.2 bits (139), Expect = 7e-10
Identities = 32/80 (40%), Positives = 62/80 (77%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
+EE KK KK KK EK +KK+E+++++++KK ++K++++EEEK+KK+EEK
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Query: 83 KQKKKEEKKQKKKEEKKQKK 102
+++++E +++K++EE+K+KK
Sbjct: 456 EEEEEEAEEEKEEEEEKKKK 475
Score = 56.5 bits (137), Expect = 1e-09
Identities = 26/75 (34%), Positives = 59/75 (78%)
Query: 11 ISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
+ +K KK E+ +KK+++++++K+KK K+K++E++EEKE+K+E+K+EE+++
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Query: 71 KEEEKDKKKEEKKQK 85
+EE++++++++KKQ
Sbjct: 463 EEEKEEEEEKKKKQA 477
Score = 56.1 bits (136), Expect = 2e-09
Identities = 30/76 (39%), Positives = 60/76 (78%), Gaps = 1/76 (1%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
KK ++ KK +K E+K++++K+EKK KK ++KEE++E++KE+K+++KEEE+++
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKK-KKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461
Query: 78 KKEEKKQKKKEEKKQK 93
+EEK+++++++KKQ
Sbjct: 462 AEEEKEEEEEKKKKQA 477
Score = 55.7 bits (135), Expect = 3e-09
Identities = 29/68 (42%), Positives = 56/68 (82%)
Query: 37 KKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKE 96
KK KK +K +K EKK E+E+K++KKK +KKEEE++++KE+K+++K+EE+++ ++E
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465
Query: 97 EKKQKKKK 104
++++++KK
Sbjct: 466 KEEEEEKK 473
Score = 54.9 bits (133), Expect = 6e-09
Identities = 31/73 (42%), Positives = 59/73 (80%)
Query: 40 KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKK 99
KK KK KK +K EK+ ++EKK+++KK ++K++++EE+K+KK+EEK+++++E ++
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464
Query: 100 QKKKKEEKKNKKA 112
+K+++EEKK K+A
Sbjct: 465 EKEEEEEKKKKQA 477
Score = 53.8 bits (130), Expect = 1e-08
Identities = 25/71 (35%), Positives = 58/71 (81%)
Query: 45 KKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
KK KK KK ++ +K++++E+K++KK+ KKKEE+++++KE+K+++K+EE+++ +++
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465
Query: 105 EEKKNKKAEEQ 115
+E++ +K ++Q
Sbjct: 466 KEEEEEKKKKQ 476
Score = 53.0 bits (128), Expect = 2e-08
Identities = 26/79 (32%), Positives = 60/79 (75%)
Query: 33 KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQ 92
++E + ++ +K K+ E+ E+K+E++K+EKK+K K K++EE+++K+K+E+++
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Query: 93 KKKEEKKQKKKKEEKKNKK 111
+++EE+ +++K+EE++ KK
Sbjct: 456 EEEEEEAEEEKEEEEEKKK 474
Score = 49.5 bits (119), Expect = 4e-07
Identities = 20/74 (27%), Positives = 55/74 (74%)
Query: 5 IAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKK 64
+ + +I K +K +++ ++K++KK++ KK+E++++++++K+E+E+++E+++
Sbjct: 402 LTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461
Query: 65 EEKKQKKEEEKDKK 78
E+++++EEEK KK
Sbjct: 462 AEEEKEEEEEKKKK 475
Score = 49.1 bits (118), Expect = 4e-07
Identities = 18/70 (25%), Positives = 55/70 (78%)
Query: 45 KKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
+K +K ++ EK EKK+EE+K++K+++ K++++++++E++K+++++E+++++ +
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Query: 105 EEKKNKKAEE 114
EEK+ ++ ++
Sbjct: 464 EEKEEEEEKK 473
Score = 45.3 bits (108), Expect = 8e-06
Identities = 29/81 (35%), Positives = 57/81 (70%), Gaps = 3/81 (3%)
Query: 42 KEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK---KEEKKQKKKEEK 98
+EE + KK K+ KK +K EKK+++E+++ KKK +KK +EE+K+KK+EEK
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Query: 99 KQKKKKEEKKNKKAEEQNRQQ 119
++++++ E++ ++ EE+ ++Q
Sbjct: 456 EEEEEEAEEEKEEEEEKKKKQ 476
Score = 43.8 bits (104), Expect = 3e-05
Identities = 26/77 (33%), Positives = 54/77 (70%), Gaps = 1/77 (1%)
Query: 53 KEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
+K KK KK EK +KK EE+ KK+++KK ++K+++++EEK++K++++E++ ++A
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEE-KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Query: 113 EEQNRQQGLYPNQKRTL 129
EE+ ++ ++ TL
Sbjct: 463 EEEKEEEEEKKKKQATL 479
Score = 43.8 bits (104), Expect = 3e-05
Identities = 20/70 (28%), Positives = 53/70 (75%)
Query: 2 KISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE 61
K + I + K+++++K+E+ KK K++++++++E++K+++++++EE+E ++E
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465
Query: 62 KKKEEKKQKK 71
K++EE+K+KK
Sbjct: 466 KEEEEEKKKK 475
Score = 42.2 bits (100), Expect = 8e-05
Identities = 22/64 (34%), Positives = 45/64 (70%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
++E + KK +K +K E+ +KK+EE+K++KK++ KK+E++++++KE+K+ +
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 111 KAEE 114
K EE
Sbjct: 455 KEEE 458
Score = 29.9 bits (68), Expect = 1.1
Identities = 16/49 (32%), Positives = 39/49 (79%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEK 56
+ GK+K++++EEE +KK+++KEE++++ +EEK++++++KK++
Sbjct: 429 EKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477
>gnl|CDD|197820 smart00651, Sm, snRNP Sm proteins. small nuclear ribonucleoprotein
particles (snRNPs) involved in pre-mRNA splicing.
Length = 67
Score = 51.0 bits (123), Expect = 4e-09
Identities = 18/51 (35%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 127 RTLIGYLRSVDQFANLVLHKTIERIHVGNQYGDIPRGIFIIRGENVVLMGE 177
R G L+ DQF NLVL E + G + G+ IRG N+V +
Sbjct: 19 REYRGTLKGFDQFMNLVLEDVEETVKDGEK--KRKLGLVFIRGNNIVYIIL 67
>gnl|CDD|201787 pfam01423, LSM, LSM domain. The LSM domain contains Sm proteins as
well as other related LSM (Like Sm) proteins. The U1,
U2, U4/U6, and U5 small nuclear ribonucleoprotein
particles (snRNPs) involved in pre-mRNA splicing contain
seven Sm proteins (B/B', D1, D2, D3, E, F and G) in
common, which assemble around the Sm site present in
four of the major spliceosomal small nuclear RNAs. The
U6 snRNP binds to the LSM (Like Sm) proteins. Sm
proteins are also found in archaebacteria, which do not
have any splicing apparatus suggesting a more general
role for Sm proteins. All Sm proteins contain a common
sequence motif in two segments, Sm1 and Sm2, separated
by a short variable linker. This family also includes
the bacterial Hfq (host factor Q) proteins. Hfq are also
RNA-binding proteins, that form hexameric rings.
Length = 66
Score = 49.9 bits (120), Expect = 1e-08
Identities = 21/51 (41%), Positives = 26/51 (50%), Gaps = 3/51 (5%)
Query: 127 RTLIGYLRSVDQFANLVLHKTIERIHVGNQYGDIPRGIFIIRGENVVLMGE 177
R L G L+ DQF NLVL E I G G+ +IRG N+VL+
Sbjct: 19 RELRGTLKGFDQFMNLVLDDVEETIKDGKVN---KLGLVLIRGNNIVLISP 66
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 53.6 bits (130), Expect = 1e-08
Identities = 33/105 (31%), Positives = 67/105 (63%), Gaps = 3/105 (2%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE-KKKEEKKQKKEEEK 75
+ K+ +E + + + KKE + K+E K + + +KE +E + E +K E++ +KEE
Sbjct: 39 EAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENL 98
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQK--KKKEEKKNKKAEEQNRQ 118
D+K E +++++E +K++K+ E+KQ+ +KKEE+ + EEQ ++
Sbjct: 99 DRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143
Score = 49.0 bits (118), Expect = 5e-07
Identities = 31/102 (30%), Positives = 66/102 (64%), Gaps = 6/102 (5%)
Query: 19 KKKKKEEENKKKQK----KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
++ KKE E KK+ K+E K + + EK+ +++ + +K EK+ +KEE +K E
Sbjct: 45 EEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLEL 104
Query: 75 KDKKKEE-KKQKKKEEKKQKKKEEKKQK-KKKEEKKNKKAEE 114
+K++EE +K++K+ E+KQ++ E+K+++ ++ E++ ++ E
Sbjct: 105 LEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELER 146
Score = 47.9 bits (115), Expect = 1e-06
Identities = 29/101 (28%), Positives = 60/101 (59%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
RKK + + + +++ K ++ KKE + KK+ E KEE + + E +K+ +E +
Sbjct: 25 RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNEL 84
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+K EK+ +KEE +K E ++++++ EKK K+ E++ ++
Sbjct: 85 QKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQE 125
Score = 47.9 bits (115), Expect = 1e-06
Identities = 32/118 (27%), Positives = 69/118 (58%), Gaps = 2/118 (1%)
Query: 14 RIGKRKKKKKEEENKK--KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
+I + K K+ EEE K+ ++ KKE + KK+ + K++ K E +KE ++ + +K
Sbjct: 27 KIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQK 86
Query: 72 EEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTL 129
E++ +KEE +K E +++++E +K++K+ E+K+ + +++ + L Q + L
Sbjct: 87 LEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQEL 144
Score = 44.8 bits (107), Expect = 2e-05
Identities = 32/98 (32%), Positives = 55/98 (56%), Gaps = 10/98 (10%)
Query: 32 KKKEEKKQKKKEE--KKQKKKEKKEEKEEKKEK----KKEEKKQKKEEEKDKKKEEKKQK 85
KK E K K+ EE K+ ++ KKE + KKE K+E K + E EK+ ++ + +
Sbjct: 26 KKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQ 85
Query: 86 KKEEKKQKKKEEKKQK----KKKEEKKNKKAEEQNRQQ 119
K E++ +K+E +K +K+EE+ KK +E ++Q
Sbjct: 86 KLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQ 123
Score = 37.5 bits (88), Expect = 0.003
Identities = 29/113 (25%), Positives = 67/113 (59%), Gaps = 16/113 (14%)
Query: 18 RKKKKKEEENKKKQK----KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
R+ + ++ E + QK ++ + +K+EE+ +KK+++ E+K+++ EKK+EE ++ EE
Sbjct: 80 RRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEE 139
Query: 74 EKDK-------KKEEKKQKKKEEKKQKKKEE-----KKQKKKKEEKKNKKAEE 114
+ + EE K+ E+ +++ + E K+ +++ +E+ +KKA+E
Sbjct: 140 QLQELERISGLTAEEAKEILLEKVEEEARHEAAVLIKEIEEEAKEEADKKAKE 192
Score = 34.4 bits (80), Expect = 0.031
Identities = 30/104 (28%), Positives = 66/104 (63%), Gaps = 9/104 (8%)
Query: 17 KRKKKKKEEENKKKQ--KKKEEKKQKKKEEKKQKKK--EKKEEKEEKKEKKKEEKKQK-- 70
KR +K+E ++K + +K+EE+ +KK++E +QK++ EKKEE+ E+ +++ ++ ++
Sbjct: 89 KRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERIS 148
Query: 71 ---KEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
EE K+ E+ +++ + E KE +++ K++ +KK K+
Sbjct: 149 GLTAEEAKEILLEKVEEEARHEAAVLIKEIEEEAKEEADKKAKE 192
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 49.6 bits (119), Expect = 3e-07
Identities = 25/70 (35%), Positives = 47/70 (67%), Gaps = 1/70 (1%)
Query: 29 KKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKE 88
+ ++ K +++EE+K K ++E +EE +EKK+E+KK+++E + K E+ Q+K E
Sbjct: 254 SPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEE-QRKLE 312
Query: 89 EKKQKKKEEK 98
EK++KK+ K
Sbjct: 313 EKERKKQARK 322
Score = 47.6 bits (114), Expect = 1e-06
Identities = 30/76 (39%), Positives = 50/76 (65%), Gaps = 3/76 (3%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEK 67
+SP + ++ K +EEE +K K EE++Q EE ++KK+EKK+E+ E K K +
Sbjct: 250 NLKLSPEVLRKVDKTREEEEEKILKAAEEERQ---EEAQEKKEEKKKEEREAKLAKLSPE 306
Query: 68 KQKKEEEKDKKKEEKK 83
+Q+K EEK++KK+ +K
Sbjct: 307 EQRKLEEKERKKQARK 322
Score = 47.2 bits (113), Expect = 2e-06
Identities = 19/65 (29%), Positives = 45/65 (69%), Gaps = 6/65 (9%)
Query: 45 KKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE------KKQKKKEEK 98
+ +K K +EE+++ K +++++EE ++KK+E+KK++++ + ++Q+K EEK
Sbjct: 255 PEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEK 314
Query: 99 KQKKK 103
++KK+
Sbjct: 315 ERKKQ 319
Score = 35.7 bits (83), Expect = 0.012
Identities = 18/57 (31%), Positives = 37/57 (64%), Gaps = 3/57 (5%)
Query: 59 KKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+ +K +K +++EEEK K +++++E+ Q+KKEEKK+++++ + EEQ
Sbjct: 255 PEVLRKVDKTREEEEEKILKA---AEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQ 308
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 49.5 bits (119), Expect = 3e-07
Identities = 20/68 (29%), Positives = 35/68 (51%), Gaps = 2/68 (2%)
Query: 41 KKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
K + +++E + E+KK KKQ+K E+K +K+E +K K++ + K K +
Sbjct: 390 KPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAK--KAK 447
Query: 101 KKKKEEKK 108
E KK
Sbjct: 448 GPDGETKK 455
Score = 48.0 bits (115), Expect = 1e-06
Identities = 23/71 (32%), Positives = 35/71 (49%), Gaps = 2/71 (2%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K E E ++ + ++KK KKQ+K EKK EKEE +K KK+ + K K
Sbjct: 390 KPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEE--AEKAAAKKKAEAAAKKAK 447
Query: 79 KEEKKQKKKEE 89
+ + KK +
Sbjct: 448 GPDGETKKVDP 458
Score = 44.9 bits (107), Expect = 1e-05
Identities = 18/70 (25%), Positives = 36/70 (51%)
Query: 37 KKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKE 96
K + E+++ + E KK +KK+ K +KK E+++ +K K+K + K+ K
Sbjct: 390 KPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGP 449
Query: 97 EKKQKKKKEE 106
+ + KK +
Sbjct: 450 DGETKKVDPD 459
Score = 41.5 bits (98), Expect = 2e-04
Identities = 21/59 (35%), Positives = 33/59 (55%)
Query: 58 EKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
E ++EE + ++KK KKQ+K E+K +K++ EK KKK E KKA+ +
Sbjct: 392 LLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPD 450
Score = 41.1 bits (97), Expect = 2e-04
Identities = 17/67 (25%), Positives = 33/67 (49%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE 89
K E ++++ + + KK K+++K +KK EK++ ++ KK E +K K
Sbjct: 390 KPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGP 449
Query: 90 KKQKKKE 96
+ KK
Sbjct: 450 DGETKKV 456
Score = 39.9 bits (94), Expect = 6e-04
Identities = 15/52 (28%), Positives = 28/52 (53%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
E + +K+++K EKK +K+E +K K+K E +K + E K+ +
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459
Score = 39.5 bits (93), Expect = 8e-04
Identities = 16/53 (30%), Positives = 29/53 (54%), Gaps = 1/53 (1%)
Query: 29 KKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
++KK KKQ+K EKK +K+E ++ +KK + +K + + E K +
Sbjct: 408 PAERKKLRKKQRK-AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459
Score = 38.0 bits (89), Expect = 0.002
Identities = 15/57 (26%), Positives = 32/57 (56%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
E K ++K+++ +K+ +KEE ++ +KK E KK K + + +K + + +K
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDPLGEK 464
Score = 35.3 bits (82), Expect = 0.019
Identities = 13/65 (20%), Positives = 31/65 (47%)
Query: 2 KISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE 61
+G +KK ++++ K ++K ++E+ +K +KK + KK + + +
Sbjct: 394 AEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGET 453
Query: 62 KKKEE 66
KK +
Sbjct: 454 KKVDP 458
Score = 34.9 bits (81), Expect = 0.024
Identities = 15/54 (27%), Positives = 27/54 (50%), Gaps = 3/54 (5%)
Query: 66 EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+ +EEE + ++KK KKQ+K E +K +KEE + A+++
Sbjct: 392 LLAEGEEEEGENGNLSPAERKKLRKKQRKAE---KKAEKEEAEKAAAKKKAEAA 442
Score = 34.1 bits (79), Expect = 0.038
Identities = 12/46 (26%), Positives = 24/46 (52%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEK 62
++ +KK E+E +K K++ + K+ K + KK + + EK
Sbjct: 419 RKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDPLGEK 464
Score = 29.9 bits (68), Expect = 1.1
Identities = 13/49 (26%), Positives = 24/49 (48%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
++K +K+E E +KK E +K K + KK + EK + ++
Sbjct: 422 EKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDPLGEKLARTED 470
Score = 27.6 bits (62), Expect = 5.6
Identities = 16/60 (26%), Positives = 29/60 (48%), Gaps = 5/60 (8%)
Query: 63 KKEEKKQKKEEEKDKKKEEKKQKKKEEKKQK--KKEEKKQKKKKEEKKNKKAEEQNRQQG 120
E ++++ E E KK +KK ++K KK EK++ +K KK +A + +
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKK---QRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGP 449
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 48.3 bits (115), Expect = 9e-07
Identities = 21/125 (16%), Positives = 51/125 (40%), Gaps = 17/125 (13%)
Query: 17 KRKKKKKEEENKKKQKKKEEKK----------------QKKKEEKKQKKKEKKEEKEEKK 60
+++ + +E++ +KK ++ + E KK + EK E K KK
Sbjct: 289 EQRAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARKK 348
Query: 61 EKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
E ++++ + E + ++E+ + ++ + K+K + N+ +N +
Sbjct: 349 E-IAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSENEESK 407
Query: 121 LYPNQ 125
P Q
Sbjct: 408 GSPPQ 412
Score = 42.9 bits (101), Expect = 5e-05
Identities = 17/93 (18%), Positives = 43/93 (46%), Gaps = 1/93 (1%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
KK + E+N+ K +KKE QK++ +++ +E ++E+ + + + K ++K
Sbjct: 333 KKTRTAEKNEAKARKKEIA-QKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLI 391
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
+ E ++ K + + + N++
Sbjct: 392 DASPNEDTPSENEESKGSPPQVEATTTAEPNRE 424
Score = 39.4 bits (92), Expect = 7e-04
Identities = 17/96 (17%), Positives = 40/96 (41%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
E K + +K E K +KKE ++++ ++E E ++++ + + K ++K
Sbjct: 331 EIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGL 390
Query: 85 KKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
+ E ++ K + + E NR+
Sbjct: 391 IDASPNEDTPSENEESKGSPPQVEATTTAEPNREPS 426
Score = 37.9 bits (88), Expect = 0.003
Identities = 14/94 (14%), Positives = 37/94 (39%), Gaps = 1/94 (1%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQK 93
K+ + +K E K +KKE +++ + + E +Q++ + + + K ++K
Sbjct: 333 KKTRTAEKNEAK-ARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLI 391
Query: 94 KKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKR 127
+ + E+ + PN++
Sbjct: 392 DASPNEDTPSENEESKGSPPQVEATTTAEPNREP 425
Score = 37.5 bits (87), Expect = 0.003
Identities = 17/94 (18%), Positives = 37/94 (39%), Gaps = 1/94 (1%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEK 67
K + + K +KKE K++ ++E ++ ++E+ + + K KKK
Sbjct: 333 KKTRTAEKNEAKARKKEIAQKRRAAEREINRE-ARQERAAAMARARARRAAVKAKKKGLI 391
Query: 68 KQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQK 101
E+ + EE K + + E ++
Sbjct: 392 DASPNEDTPSENEESKGSPPQVEATTTAEPNREP 425
Score = 37.5 bits (87), Expect = 0.003
Identities = 24/140 (17%), Positives = 56/140 (40%), Gaps = 15/140 (10%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEK----------EEKKEKKKEEK 67
+K +E K+ ++ + ++K++EKK + + E + E K
Sbjct: 274 ERKYPLTDEFKEHHAEQRAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIK 333
Query: 68 KQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKR 127
K + E K E K +KKE ++++ E++ ++ +++ ++ +K+
Sbjct: 334 KTRTAE-----KNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKK 388
Query: 128 TLIGYLRSVDQFANLVLHKT 147
LI + D + K
Sbjct: 389 GLIDASPNEDTPSENEESKG 408
Score = 33.3 bits (76), Expect = 0.074
Identities = 11/79 (13%), Positives = 33/79 (41%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+K++ E + ++ ++E+ + ++ K ++K E+ + EE K
Sbjct: 351 AQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSENEESKGSP 410
Query: 79 KEEKKQKKKEEKKQKKKEE 97
+ + E ++ +E+
Sbjct: 411 PQVEATTTAEPNREPSQED 429
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 47.9 bits (115), Expect = 1e-06
Identities = 15/104 (14%), Positives = 43/104 (41%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
EEE +K++ K E +++ K E ++ + EE+ + + ++ E + +
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARP 98
Query: 84 QKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKR 127
+ + + + + + KK +KK ++ + + +
Sbjct: 99 AEDEAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGK 142
Score = 39.1 bits (92), Expect = 0.001
Identities = 23/113 (20%), Positives = 45/113 (39%), Gaps = 4/113 (3%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEK----KKEEKKQKKEEE 74
K + E K+ +E +K KE + K +EE+ K ++ E++ K E E
Sbjct: 2 SKVRVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAE 61
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKR 127
+ E +++ K E EE + E + AE++ + ++
Sbjct: 62 EAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRP 114
Score = 38.3 bits (90), Expect = 0.002
Identities = 22/119 (18%), Positives = 48/119 (40%), Gaps = 4/119 (3%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEK----EEKKEKKKEEKKQ 69
R+ + K+ + +K KE + K +++E ++E+ E++ K + E+
Sbjct: 5 RVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAA 64
Query: 70 KKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRT 128
E E++ K E EE + + + E++ + AE R+ K+
Sbjct: 65 AAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAAKKK 123
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 47.4 bits (113), Expect = 2e-06
Identities = 21/92 (22%), Positives = 47/92 (51%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
N+ + + +++KQ+ K++ +KK +K ++ + + E + +E ++K+ +KK+
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60
Query: 85 KKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
KKK++KK+K E A Q+
Sbjct: 61 KKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQD 92
Score = 42.8 bits (101), Expect = 6e-05
Identities = 21/94 (22%), Positives = 41/94 (43%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
+ + E + +K+Q K++ K+K K+ KK + E E ++ E KQ +++K
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
KKK++KK+ E + +
Sbjct: 61 KKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNS 94
Score = 37.4 bits (87), Expect = 0.004
Identities = 19/72 (26%), Positives = 41/72 (56%)
Query: 46 KQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKE 105
+ + E K++K++ K+K ++K +K +++ D + E ++ + K+ K+KKKK+
Sbjct: 4 ENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKK 63
Query: 106 EKKNKKAEEQNR 117
+KK KK +
Sbjct: 64 KKKKKKNLGEAY 75
Score = 32.4 bits (74), Expect = 0.14
Identities = 13/76 (17%), Positives = 35/76 (46%)
Query: 52 KKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
E E + K+++++ ++K +K KK +K + + E ++ + K+ K KK
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60
Query: 112 AEEQNRQQGLYPNQKR 127
+++ +++
Sbjct: 61 KKKKKKKKKNLGEAYD 76
Score = 28.9 bits (65), Expect = 2.4
Identities = 19/80 (23%), Positives = 39/80 (48%), Gaps = 13/80 (16%)
Query: 63 KKEEKKQKKEEEKDKKKEEKKQKKK-------------EEKKQKKKEEKKQKKKKEEKKN 109
+ E + K++++++K+K KK+ KK E + ++ E KQ KK++KK
Sbjct: 4 ENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKK 63
Query: 110 KKAEEQNRQQGLYPNQKRTL 129
KK +++N + +
Sbjct: 64 KKKKKKNLGEAYDLAYDLPV 83
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 47.1 bits (112), Expect = 2e-06
Identities = 31/97 (31%), Positives = 58/97 (59%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K++ ++ + + +Q+ EK K+ E+ ++ +EK+++ EE K K+ E K K E E +
Sbjct: 82 KQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAE 141
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
KK +E+ +K+ EE+ + K + +KK E KK +AE
Sbjct: 142 KKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAE 178
Score = 45.2 bits (107), Expect = 8e-06
Identities = 30/97 (30%), Positives = 59/97 (60%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+++ ++ E++ +Q +++E +Q+ EK K+ E+ ++ E+K+K+ EE K K+ E
Sbjct: 74 EQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAK 133
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
K E + +KK +E+ +K+ EE+ + K E K K AE
Sbjct: 134 AKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAE 170
Score = 43.3 bits (102), Expect = 4e-05
Identities = 33/102 (32%), Positives = 58/102 (56%)
Query: 13 PRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
+ +R +K + ++ K+ E+KQK+ EE K K+ + + K E + +KK +++ KK+
Sbjct: 92 KELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQ 151
Query: 73 EEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
E++ K + + KK+ + KKK E + K K E K KAEE
Sbjct: 152 AEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEE 193
Score = 41.7 bits (98), Expect = 1e-04
Identities = 33/103 (32%), Positives = 55/103 (53%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+K+K+ EE K KQ + + K + + EKK K++ KK+ +EE K K E K+K E K
Sbjct: 114 AEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKK 173
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K + E K K + + K K +E K + + + K +A + +
Sbjct: 174 KAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAE 216
Score = 41.7 bits (98), Expect = 1e-04
Identities = 34/106 (32%), Positives = 64/106 (60%), Gaps = 2/106 (1%)
Query: 12 SPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
+ K+ KKE+E +KK +++ E+ +K++ ++ ++KE ++ +K K+ E+ K+
Sbjct: 54 RIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQ 113
Query: 72 EEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
EEK K+ EE K K+ E K K E + +KK +E+ K+AEE+ +
Sbjct: 114 AEEKQKQAEEAKAKQAAE--AKAKAEAEAEKKAKEEAKKQAEEEAK 157
Score = 40.2 bits (94), Expect = 4e-04
Identities = 30/102 (29%), Positives = 50/102 (49%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+ K + E + ++K KEE K++ +EE K K + ++K + +KK E + + K E K K
Sbjct: 129 AAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAK 188
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K E+ + K E K K E K + E AE + +
Sbjct: 189 AKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAERKAD 230
Score = 37.1 bits (86), Expect = 0.004
Identities = 31/116 (26%), Positives = 49/116 (42%)
Query: 6 APKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
K A + + +K+ + + K++ +EE K K E K+K E K++ E + + K E
Sbjct: 125 KAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAE 184
Query: 66 EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
K + K EE K E K K E K + E E ++ E GL
Sbjct: 185 AKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAERKADEAELGDIFGL 240
Score = 37.1 bits (86), Expect = 0.004
Identities = 29/98 (29%), Positives = 54/98 (55%), Gaps = 2/98 (2%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEK--KQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
K+ +++++ ++ K K+ + K K E ++K KE+ +++ E++ K K + KK+ +
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAE 170
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
KKK E + K K E K K K E+ + K + K AE
Sbjct: 171 AKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAE 208
Score = 34.8 bits (80), Expect = 0.026
Identities = 26/98 (26%), Positives = 54/98 (55%), Gaps = 4/98 (4%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
++ E+ K+ ++ ++ ++K+K+ ++ K K+ E K K E + E +K +EE K
Sbjct: 95 EQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKA-KAEAEAE---KKAKEEAKK 150
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+ EE+ + K + +KK E K+K + E K +A+ +
Sbjct: 151 QAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAK 188
Score = 34.4 bits (79), Expect = 0.028
Identities = 24/101 (23%), Positives = 60/101 (59%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
++ N+ +Q+KK K++++ +KK +++ ++ EK+ E+ ++++ +++ +
Sbjct: 45 PGAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAA 104
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K+ ++ K+ E+KQK+ EE K K+ E K +AE + + +
Sbjct: 105 KQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAK 145
Score = 32.5 bits (74), Expect = 0.15
Identities = 22/102 (21%), Positives = 44/102 (43%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K K++ K++ ++ + K + +KK E K+K + + + K E K K K E+ + K E
Sbjct: 143 KAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAK 202
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
K + K E + + ++K E + +
Sbjct: 203 AKAAAEAAAKAEAEAAAAAAAEAERKADEAELGDIFGLASGS 244
Score = 29.4 bits (66), Expect = 1.2
Identities = 25/104 (24%), Positives = 46/104 (44%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ +++ K + + +KK E K+K + E K K + K + K E+ + K E K K E
Sbjct: 151 QAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAA 210
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
K E + + ++K +E + AE+Q +G
Sbjct: 211 AKAEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKQGGARG 254
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 47.4 bits (113), Expect = 2e-06
Identities = 18/68 (26%), Positives = 31/68 (45%), Gaps = 3/68 (4%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK---EKKKEEKKQKK 71
IG + + EEN K + E + K E Q K +E+ +++ EE ++
Sbjct: 874 IGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRA 933
Query: 72 EEEKDKKK 79
+E +DK K
Sbjct: 934 KEYQDKHK 941
Score = 42.4 bits (100), Expect = 8e-05
Identities = 16/98 (16%), Positives = 36/98 (36%), Gaps = 7/98 (7%)
Query: 14 RIGKRKKKKKEEENKKKQ--KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
I +R + + K Q E + + + + E+ K + E + K +
Sbjct: 845 IIEQRTQDSGIRNDVKHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEAL 904
Query: 72 EEEKDKKKEEKKQKK-----KEEKKQKKKEEKKQKKKK 104
+ +E+ Q++ E+ K+ +E + K K
Sbjct: 905 SQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHKG 942
Score = 41.7 bits (98), Expect = 2e-04
Identities = 12/88 (13%), Positives = 34/88 (38%)
Query: 7 PKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEE 66
+ + ++ + + + + E+ K + + + K E + + +E+
Sbjct: 855 IRNDVKHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEK 914
Query: 67 KKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
Q++ D +E K+ K+ + K K
Sbjct: 915 SAQERMPGADSPEELMKRAKEYQDKHKG 942
Score = 39.3 bits (92), Expect = 0.001
Identities = 17/87 (19%), Positives = 32/87 (36%), Gaps = 8/87 (9%)
Query: 42 KEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQ--KKKEEKK 99
+ + K + E E + + ++ K + E + K E Q K EEK
Sbjct: 856 RNDVKHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKS 915
Query: 100 QKK------KKEEKKNKKAEEQNRQQG 120
++ EE + E Q++ +G
Sbjct: 916 AQERMPGADSPEELMKRAKEYQDKHKG 942
Score = 34.7 bits (80), Expect = 0.035
Identities = 14/87 (16%), Positives = 33/87 (37%), Gaps = 9/87 (10%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ---KKKEEK 90
+ + K + + + + + + ++ K Q E + K E Q +E+
Sbjct: 856 RNDVKHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKS 915
Query: 91 KQK------KKEEKKQKKKKEEKKNKK 111
Q+ EE ++ K+ + K+K
Sbjct: 916 AQERMPGADSPEELMKRAKEYQDKHKG 942
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 47.2 bits (113), Expect = 2e-06
Identities = 31/105 (29%), Positives = 57/105 (54%), Gaps = 2/105 (1%)
Query: 17 KRKKKKKEEENKK--KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+ K+ EE KK + KKE + K+E K + + ++E KE + E ++ E++ + EE
Sbjct: 31 EELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREE 90
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+K E KK+E ++K+KE ++K +EK+ + E Q+
Sbjct: 91 TLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQR 135
Score = 44.9 bits (107), Expect = 1e-05
Identities = 25/101 (24%), Positives = 54/101 (53%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
RK+ +++ ++ K ++ KKE + KK+ E KEE + + E +++ KE +
Sbjct: 19 RKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNEL 78
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
++ E++ ++EE +K E +K++ EKK K+ + +
Sbjct: 79 QRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKN 119
Score = 43.0 bits (102), Expect = 5e-05
Identities = 29/118 (24%), Positives = 62/118 (52%), Gaps = 2/118 (1%)
Query: 14 RIGKRKKKKKEEENKK-KQKKKEEKKQKKKEEKKQKKKEKKEEKEE-KKEKKKEEKKQKK 71
RI ++K EE K+ ++ K+E + KKE + K+E + + E ++E K+ + ++
Sbjct: 21 RIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQR 80
Query: 72 EEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTL 129
E + ++EE +K E +K++ +K++K+ K+ E++ + L Q+ L
Sbjct: 81 LERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREEL 138
Score = 35.3 bits (82), Expect = 0.017
Identities = 29/104 (27%), Positives = 62/104 (59%), Gaps = 7/104 (6%)
Query: 18 RKKKKKEE--ENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
R+ ++EE + K + K+E+ +KKE++ K++ +EKEE+ E+ E++++ E
Sbjct: 83 RRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERIS 142
Query: 76 DKKKEEKKQKKKEEKKQKKKEE-----KKQKKKKEEKKNKKAEE 114
+EE K+ EE +++ + E K+ +++ +E+ +KKA+E
Sbjct: 143 GLTQEEAKEILLEEVEEEARHEAAKLIKEIEEEAKEEADKKAKE 186
Score = 34.5 bits (80), Expect = 0.031
Identities = 22/103 (21%), Positives = 56/103 (54%), Gaps = 5/103 (4%)
Query: 41 KKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK--KQKKKEEK 98
K+ +K+ ++ K +E KKE + KKE + K+E K + + E+ K+++ E +
Sbjct: 20 KRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQ 79
Query: 99 KQKKK---KEEKKNKKAEEQNRQQGLYPNQKRTLIGYLRSVDQ 138
+ +++ +EE ++K E ++++ +++ L +++D+
Sbjct: 80 RLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDE 122
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 46.6 bits (111), Expect = 3e-06
Identities = 29/122 (23%), Positives = 56/122 (45%), Gaps = 2/122 (1%)
Query: 12 SPRIGKRKKKK--KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ 69
S K K + N K+ ++E K KK++KE+K + K++ ++
Sbjct: 13 SGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEK 72
Query: 70 KKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTL 129
KK+++K+KK+ + + + K K KK +K +KK + K N+ + + +
Sbjct: 73 KKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNVY 132
Query: 130 IG 131
IG
Sbjct: 133 IG 134
Score = 40.9 bits (96), Expect = 2e-04
Identities = 27/101 (26%), Positives = 46/101 (45%)
Query: 4 SIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK 63
+ K +S + K K K++ K + +KK E+KK+KKKEKKE K E + K
Sbjct: 32 LVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKL 91
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
+ +K ++ K K + K + + K E ++
Sbjct: 92 GFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNVY 132
Score = 38.9 bits (91), Expect = 0.001
Identities = 27/92 (29%), Positives = 42/92 (45%), Gaps = 2/92 (2%)
Query: 3 ISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEK 62
K A + +K+ K E KK +KKK++K K+K+E K + + K K KK K
Sbjct: 43 SEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKK--KEKKEPKSEGETKLGFKTPKKSK 100
Query: 63 KKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
K ++K K + +D K + EK
Sbjct: 101 KTKKKPPKPKPNEDVDNAFNKIAELAEKSNVY 132
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 46.7 bits (110), Expect = 4e-06
Identities = 34/98 (34%), Positives = 58/98 (59%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
KKK EE K + K E + E + ++K + EK++++ KKK + +KK EEK K
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKAD 1394
Query: 80 EEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
E KK+ ++++KK + ++ KKK ++ KKAEE+ +
Sbjct: 1395 EAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432
Score = 44.7 bits (105), Expect = 2e-05
Identities = 35/117 (29%), Positives = 61/117 (52%)
Query: 2 KISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE 61
K A K A + KKK EE KK K++ ++ KK + K + + E +
Sbjct: 1303 KADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAA 1362
Query: 62 KKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
++K E +KK+EE KK + K+K +E+KK + ++K ++ KK+ + KKA ++
Sbjct: 1363 EEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419
Score = 44.0 bits (103), Expect = 3e-05
Identities = 31/102 (30%), Positives = 61/102 (59%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ K E E + + E+K + E+KK++ K+K + ++K E+KK+ + KK+ E+D
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEED 1403
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
KKK ++ +K KK+ + +KK ++KK+ + KK E+ ++
Sbjct: 1404 KKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKK 1445
Score = 42.4 bits (99), Expect = 1e-04
Identities = 31/97 (31%), Positives = 59/97 (60%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
KK EE+ K + KK+ ++ KK +E K+K +E K++ + K+K +E KK + + + +
Sbjct: 1296 KKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAA 1355
Query: 81 EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
+ + EEK + +++K++ KKK + KKAEE+ +
Sbjct: 1356 ADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392
Score = 42.1 bits (98), Expect = 1e-04
Identities = 34/105 (32%), Positives = 67/105 (63%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
+ + K K EE K +++KK+ ++ KKKE +++KK E+ ++ EE+ + K E+ +K EE
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEE 1672
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+K K +E KK ++ E+K + +++ ++ KK E+ KK E+ ++
Sbjct: 1673 DKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKK 1717
Score = 41.7 bits (97), Expect = 2e-04
Identities = 48/144 (33%), Positives = 87/144 (60%), Gaps = 6/144 (4%)
Query: 14 RIGKRKKKKKEEENKKK--QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ-- 69
+I ++ KK EE+KKK + KK E+ +KK E +K+ E+ ++ EE K+K+ EEKK+
Sbjct: 1660 KIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAE 1719
Query: 70 --KKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKR 127
KK EE++K K E+ +K+ EE K+K +E KK +++K++ + K EE+ + + + ++
Sbjct: 1720 ELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEA 1779
Query: 128 TLIGYLRSVDQFANLVLHKTIERI 151
+ L D+ + + K I+ I
Sbjct: 1780 VIEEELDEEDEKRRMEVDKKIKDI 1803
Score = 41.3 bits (96), Expect = 2e-04
Identities = 30/102 (29%), Positives = 66/102 (64%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
KK ++E + K ++ K+ ++ KKK E+ +K +E +++ E +K+ EE K+ +E +K +
Sbjct: 1653 KKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEA 1712
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
+E+KK ++ ++ +++ K + ++ KK+ E+ KKAEE + +
Sbjct: 1713 EEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEE 1754
Score = 40.9 bits (95), Expect = 4e-04
Identities = 39/109 (35%), Positives = 73/109 (66%), Gaps = 1/109 (0%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
++ + +KK K EE KK ++ K + ++ KK E+++KK E+ ++KE +++KK EE K+ +EE
Sbjct: 1599 KLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEE 1658
Query: 74 EKDKKKEEKKQKKKEEKK-QKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
K K EE K+ ++++KK ++ K+ ++ +KK E K+AEE + + L
Sbjct: 1659 NKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEEL 1707
Score = 40.5 bits (94), Expect = 4e-04
Identities = 29/105 (27%), Positives = 57/105 (54%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K + KK EE KK + K+ +++KK +E K+K +E K+ E KK+ ++ +KK ++K
Sbjct: 1279 KADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKA 1338
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
++ ++ + K E + E + ++K E + KK E + +
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAA 1383
Score = 40.1 bits (93), Expect = 6e-04
Identities = 38/116 (32%), Positives = 67/116 (57%)
Query: 6 APKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
A K A + KKK EE K ++ KK+ ++ KK +E K+K +E K+ E KK+ ++
Sbjct: 1436 AKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEA 1495
Query: 66 EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
+KK + ++ + K++ + KK E+ +K E KK ++ K+ + KKAEE+ + L
Sbjct: 1496 KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADEL 1551
Score = 40.1 bits (93), Expect = 6e-04
Identities = 34/100 (34%), Positives = 65/100 (65%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+KK +++++ + +KK EE K+ + +KK ++ +K EE ++K E+ K+ + KK+ E+ K
Sbjct: 1424 KKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAK 1483
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
K +E K+K +E KK+ + +K + KK+ + KKAEE +
Sbjct: 1484 KADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKK 1523
Score = 39.4 bits (91), Expect = 0.001
Identities = 41/102 (40%), Positives = 65/102 (63%), Gaps = 4/102 (3%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK---EKKKEEKKQKKEEEK 75
KKK +E++ + KKK E+ +KK +E K+ KK+ E KK EKKK ++ +KK EE
Sbjct: 1384 KKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEA 1443
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
KK +E K+K +E KK ++ ++K ++ KK ++ KKAEE +
Sbjct: 1444 -KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK 1484
Score = 39.0 bits (90), Expect = 0.001
Identities = 37/107 (34%), Positives = 62/107 (57%), Gaps = 4/107 (3%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEK----KKEEKKQKKE 72
++K + E KK++ KK+ KKK E+K+K E K++ EE K+K KK +KK
Sbjct: 1361 AAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKA 1420
Query: 73 EEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+E KK EEKK+ + +KK ++ ++ + KKK E+ K E + + +
Sbjct: 1421 DEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467
Score = 39.0 bits (90), Expect = 0.001
Identities = 38/103 (36%), Positives = 63/103 (61%), Gaps = 3/103 (2%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK---EKKKEEKKQKKEEEKD 76
KKK +E KK ++KK+ + KKK E+ +K E K++ EE K E KK+ ++ KK +E
Sbjct: 1417 KKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAK 1476
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
KK EE K+ + +KK ++ ++K + KK + KKA+E + +
Sbjct: 1477 KKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAE 1519
Score = 39.0 bits (90), Expect = 0.001
Identities = 35/105 (33%), Positives = 68/105 (64%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEK 67
K A + + +KK E KK+ ++ ++ ++ KK+E ++KKK ++ +K E++ K K E+
Sbjct: 1675 KKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE 1734
Query: 68 KQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
+K+ EE KK EE K+ ++E+KK +++++KK +E +K K+A
Sbjct: 1735 AKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEA 1779
Score = 38.6 bits (89), Expect = 0.002
Identities = 40/124 (32%), Positives = 67/124 (54%), Gaps = 2/124 (1%)
Query: 6 APKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK--EKK 63
A K A + KKK EE K + KK+ ++ KK EE K+K +E K+ E KK E+
Sbjct: 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEA 1482
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYP 123
K+ + KK+ E+ KKK ++ +K E KK+ + +K ++ KK ++ K E + +
Sbjct: 1483 KKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKA 1542
Query: 124 NQKR 127
+K+
Sbjct: 1543 EEKK 1546
Score = 38.6 bits (89), Expect = 0.002
Identities = 41/103 (39%), Positives = 67/103 (65%), Gaps = 5/103 (4%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK----EKKKEEKKQKKEEE 74
+KKK+E + K KK+ +++KK +E K+K +E K++ +E K KKK ++ +KK EE
Sbjct: 1370 EKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEE 1429
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
K KK +E K+K +E KK + ++K ++ KK E+ KKAEE +
Sbjct: 1430 K-KKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471
Score = 38.6 bits (89), Expect = 0.002
Identities = 40/120 (33%), Positives = 71/120 (59%), Gaps = 2/120 (1%)
Query: 2 KISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE 61
K A K A + KKK EE+ KK + K+ KKK ++ +KK E+K++ +E K+
Sbjct: 1379 KADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKK 1438
Query: 62 KKKEEKK--QKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K +E KK + K++ ++ KK E+ +KK EE K+ + +KK ++ K+ + KK E+ +++
Sbjct: 1439 KAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKK 1498
Score = 37.8 bits (87), Expect = 0.003
Identities = 37/110 (33%), Positives = 70/110 (63%), Gaps = 2/110 (1%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
+ + +KKK E+ KK+ ++K++ ++ KK E++ K K +E K+ +++KKK E+ +K EE
Sbjct: 1626 KKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEE 1685
Query: 74 EKDKKKEEKKQKKKEEKK--QKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
++ K E K++ +E KK + KK+E ++KKK EE K + E + + +
Sbjct: 1686 DEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEA 1735
Score = 37.0 bits (85), Expect = 0.006
Identities = 31/101 (30%), Positives = 66/101 (65%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K ++ KK +E KKK ++ ++ + KK+ ++ KKK + +K + +KK +E K+ +E +K
Sbjct: 1465 KAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKA 1524
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
+ ++ ++ KK ++ +K +E+KK + K+ ++ KKAEE+ +
Sbjct: 1525 DEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKK 1565
Score = 36.7 bits (84), Expect = 0.007
Identities = 35/101 (34%), Positives = 62/101 (61%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
KKK E + + KKK ++ +K +E KK + +K EE ++ E KK E+K+K +E K +
Sbjct: 1496 KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAE 1555
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+ +K ++KK+ ++ KK EE K ++ ++ KKAEE ++
Sbjct: 1556 ELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEE 1596
Score = 36.7 bits (84), Expect = 0.009
Identities = 29/107 (27%), Positives = 60/107 (56%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
+++ K ++ +K ++ +K +E+KK + +K EEK++ E KK+ ++ KK +E KK EE
Sbjct: 1267 RRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1326
Query: 82 KKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRT 128
K+K KK+ ++ +K + K E + E + ++ +K+
Sbjct: 1327 AKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373
Score = 36.3 bits (83), Expect = 0.010
Identities = 30/111 (27%), Positives = 66/111 (59%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ KK +E KK ++ K+ + KKK E+ +KK ++ ++ E K+K E KK ++ ++ D+
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEA 1527
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTL 129
K+ ++ KK +E K+ ++++K + KK E+ K E++ ++ + + +
Sbjct: 1528 KKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNM 1578
Score = 35.9 bits (82), Expect = 0.013
Identities = 31/101 (30%), Positives = 63/101 (62%), Gaps = 2/101 (1%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
KK ++ + KKK E+K+K E KK+ ++ KK ++ +KK ++ ++ ++ K++ ++ K
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAK 1470
Query: 79 KEEKKQKKKEEKKQKKKEEKK--QKKKKEEKKNKKAEEQNR 117
K ++ +KK EE K+ + +KK + KKK ++ K AE + +
Sbjct: 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
Score = 35.5 bits (81), Expect = 0.016
Identities = 34/99 (34%), Positives = 62/99 (62%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
KK EE+ K ++ KK E+ + K ++ K+ +E + E+ K EE+K+ K EE K +E
Sbjct: 1558 KKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEE 1617
Query: 81 EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K + ++ +K +++K++ +Q KKKE ++ KKAEE + +
Sbjct: 1618 AKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAE 1656
Score = 35.5 bits (81), Expect = 0.019
Identities = 31/114 (27%), Positives = 61/114 (53%)
Query: 6 APKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
A + + I K ++ + +++ K E+ +K E KK ++K+K +E ++ +EKKK
Sbjct: 1245 AEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKA 1304
Query: 66 EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
++ +KK EE K E KK+ ++ +KK ++K ++ KK + K E +
Sbjct: 1305 DEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358
Score = 34.0 bits (77), Expect = 0.061
Identities = 38/110 (34%), Positives = 68/110 (61%), Gaps = 7/110 (6%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K ++ + EE K +++K+ K ++ K+ ++ K K ++ +K E+++KK E+ K+K+ EEK
Sbjct: 1588 KAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKK 1647
Query: 77 K----KKEEKKQKKKEEKKQKKKEEKKQKK---KKEEKKNKKAEEQNRQQ 119
K KK E++ K K ++ KK EE K+K KK E+ KKA E +++
Sbjct: 1648 KAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKE 1697
Score = 32.8 bits (74), Expect = 0.15
Identities = 32/103 (31%), Positives = 64/103 (62%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K+ ++KK+ + KK ++ ++ ++KKK E+ +K +E K K E+ K+ ++ + EE
Sbjct: 1540 KKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMK 1599
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+EEKK K +E KK ++ + K ++ KK E++ KK E+ +++
Sbjct: 1600 LYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKE 1642
Score = 32.4 bits (73), Expect = 0.16
Identities = 37/103 (35%), Positives = 61/103 (59%), Gaps = 3/103 (2%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K ++ KK +E KK ++ K+ + KK EEKK+ + KK E+ +K E+KK+ ++ KK EE
Sbjct: 1517 KAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEE-- 1574
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K +K +E KK ++ ++ K EE+K KAEE + +
Sbjct: 1575 -DKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAE 1616
Score = 32.0 bits (72), Expect = 0.25
Identities = 35/103 (33%), Positives = 63/103 (61%), Gaps = 2/103 (1%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE--KKKEEKKQKKEEEKD 76
+ KK EE+ +K EE K+ ++ ++ K +EEK+ K E KK EE K K EE K
Sbjct: 1568 EAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKK 1627
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
++E+KK ++ ++K+ ++K++ ++ KK EE+ KA E+ ++
Sbjct: 1628 AEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKA 1670
Score = 32.0 bits (72), Expect = 0.26
Identities = 30/102 (29%), Positives = 60/102 (58%), Gaps = 4/102 (3%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
KK EE KK ++++ ++ +K EE + +++ + +E +K ++ +K EE+K +
Sbjct: 1236 KKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEA 1295
Query: 81 EKKQKKKEEKKQKKKEEKKQK----KKKEEKKNKKAEEQNRQ 118
+K ++KK+ + KKK E+ +K KKK E+ KKA+ ++
Sbjct: 1296 KKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKK 1337
Score = 31.6 bits (71), Expect = 0.35
Identities = 32/113 (28%), Positives = 70/113 (61%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+KK+ +E++ ++ KK EE+ + K EE K++ +E K++ EE K+ ++E+KK +++++
Sbjct: 1708 KKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEE 1767
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLI 130
KK E+ +K+KE +++ +E+ +K++ E K K N + ++ L+
Sbjct: 1768 KKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLV 1820
Score = 30.5 bits (68), Expect = 0.78
Identities = 32/91 (35%), Positives = 59/91 (64%), Gaps = 2/91 (2%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+K ++ E K ++E+K + ++ +K ++ K K EE ++ +E+KK+ ++ KK+E ++K
Sbjct: 1587 KKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEK 1646
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
KK E+ +K +EE K K EE K K +E+KK
Sbjct: 1647 KKAEELKKAEEENKIKAAEEAK--KAEEDKK 1675
Score = 29.7 bits (66), Expect = 1.4
Identities = 35/100 (35%), Positives = 64/100 (64%), Gaps = 2/100 (2%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKE--EKKEKKKEEKKQKKEEE 74
K ++KKK EE KK ++ K +K +E KK ++ +E + E+++K K E+ +K EE
Sbjct: 1559 KAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
K K +E KK +++++K ++ K+++ ++KKK E+ K EE
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEE 1658
Score = 28.6 bits (63), Expect = 2.8
Identities = 29/99 (29%), Positives = 58/99 (58%), Gaps = 2/99 (2%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE--EKDKK 78
+K E+ K + KK E+ +K EE K+ ++E+ E+ K E+ + +++ + ++
Sbjct: 1218 RKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEA 1277
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
++ + KK EEKK+ + +K ++KKK ++ KKAEE +
Sbjct: 1278 RKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKK 1316
Score = 27.8 bits (61), Expect = 5.1
Identities = 31/98 (31%), Positives = 58/98 (59%), Gaps = 1/98 (1%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
+K EEE K ++ +K E +K + KK ++ +K E+ +K E+++ ++ +K EE
Sbjct: 1206 RKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHF 1265
Query: 81 EKKQKK-KEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
++Q K E+ +K E KK ++KK+ + KKAEE+ +
Sbjct: 1266 ARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKK 1303
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 46.3 bits (110), Expect = 5e-06
Identities = 22/83 (26%), Positives = 44/83 (53%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
+K ++KK +K+ + + +EEK+ + E + KE++ + + Q+K E+
Sbjct: 1500 KKINQKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEE 1559
Query: 91 KQKKKEEKKQKKKKEEKKNKKAE 113
+ + KK+K KK+ K N +AE
Sbjct: 1560 DYAESDIKKRKNKKQYKSNTEAE 1582
Score = 45.5 bits (108), Expect = 9e-06
Identities = 22/80 (27%), Positives = 41/80 (51%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+KKK N+KK + E + Q++KE Q + E +EK+ E +++ EE+
Sbjct: 1503 NQKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYA 1562
Query: 77 KKKEEKKQKKKEEKKQKKKE 96
+ +K++ KK+ K + E
Sbjct: 1563 ESDIKKRKNKKQYKSNTEAE 1582
Score = 31.2 bits (71), Expect = 0.43
Identities = 19/70 (27%), Positives = 33/70 (47%)
Query: 2 KISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE 61
K I S I R +++KE + + + +EKK + ++K +E+ E
Sbjct: 1507 KGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDI 1566
Query: 62 KKKEEKKQKK 71
KK++ KKQ K
Sbjct: 1567 KKRKNKKQYK 1576
Score = 30.5 bits (69), Expect = 0.70
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 33 KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKK 68
K E K E+K KKK+KKE+K+E++ K++E+ +
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 29.7 bits (67), Expect = 1.2
Identities = 13/37 (35%), Positives = 25/37 (67%)
Query: 47 QKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
K E K +++ KK++KK+KK+EE+ K++E+ +
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 28.9 bits (65), Expect = 2.2
Identities = 12/36 (33%), Positives = 26/36 (72%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
K + + + +EK K++KK++K++E++ K+EEK +
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 28.9 bits (65), Expect = 2.4
Identities = 15/37 (40%), Positives = 25/37 (67%)
Query: 39 QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
K E K E+K +K++KKEKKKEE+ +++E+ +
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 28.1 bits (63), Expect = 3.7
Identities = 18/52 (34%), Positives = 30/52 (57%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEE 66
I KK N K + K ++K +KK+KK++KKEE+ +++EK + E
Sbjct: 717 ISGLMKKIFRNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768
Score = 27.0 bits (60), Expect = 9.9
Identities = 17/38 (44%), Positives = 24/38 (63%), Gaps = 2/38 (5%)
Query: 63 KKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
K E K E+ KK++KK+KKKEE + K+EEK +
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEE--EYKREEKAR 766
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 46.0 bits (109), Expect = 5e-06
Identities = 23/69 (33%), Positives = 42/69 (60%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
E KK ++E +++KKKEEK ++K+ KK + +K+ K K + +Q + KK EKK
Sbjct: 6 SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKK 65
Query: 84 QKKKEEKKQ 92
+K++ + +
Sbjct: 66 SRKRDVEDE 74
Score = 46.0 bits (109), Expect = 6e-06
Identities = 26/80 (32%), Positives = 44/80 (55%), Gaps = 10/80 (12%)
Query: 31 QKKKEEKKQKKKE-EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE 89
+ + E+K ++E E+K+KK+EK +EKE KK K +K+ K + + Q+ +
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAA---------QKEAKAKLQAQQASDG 55
Query: 90 KKQKKKEEKKQKKKKEEKKN 109
KK EKK +K+ E +N
Sbjct: 56 TNVPKKSEKKSRKRDVEDEN 75
Score = 44.5 bits (105), Expect = 2e-05
Identities = 22/73 (30%), Positives = 41/73 (56%)
Query: 47 QKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
+ + EKK EE+ E+KK+++++ KE+E K K +K+ K + + Q+ + KK E+
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEK 64
Query: 107 KKNKKAEEQNRQQ 119
K K+ E +
Sbjct: 65 KSRKRDVEDENPE 77
Score = 44.1 bits (104), Expect = 3e-05
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 4/63 (6%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+KKKKEE K K+KE KK K +++ + K + ++ + KK EKK +K + +D+
Sbjct: 20 RKKKKEE----KAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVEDEN 75
Query: 79 KEE 81
E+
Sbjct: 76 PED 78
Score = 43.3 bits (102), Expect = 5e-05
Identities = 21/77 (27%), Positives = 43/77 (55%)
Query: 47 QKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
+ ++K +EE + KKK+E+K K++E K K +K+ K K + +Q KK +++
Sbjct: 6 SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKK 65
Query: 107 KKNKKAEEQNRQQGLYP 123
+ + E++N + + P
Sbjct: 66 SRKRDVEDENPEDFIDP 82
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 46.0 bits (110), Expect = 6e-06
Identities = 33/96 (34%), Positives = 57/96 (59%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K +K KEE +KK+K +EE+ + +E +K+ ++ KE K+E E KE ++ +K
Sbjct: 544 KEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYAS 603
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
K E + +K K +K+EKK+KK+KE+++ K
Sbjct: 604 VKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKV 639
Score = 45.6 bits (109), Expect = 8e-06
Identities = 28/100 (28%), Positives = 57/100 (57%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
+E K K++ +E+K++ ++EE K ++ +KE ++ KE KKE + KE + +K
Sbjct: 541 ALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGG 600
Query: 81 EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
K E + +K+ K +KK+++KK +K +++ + G
Sbjct: 601 YASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVG 640
Score = 43.7 bits (104), Expect = 4e-05
Identities = 25/88 (28%), Positives = 49/88 (55%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
+++ E+K ++ E ++ ++ KEE EEKKEK +EE+ + EE + + ++ K+ KKE
Sbjct: 528 LERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEAD 587
Query: 91 KQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+ K+ + QK K + E ++
Sbjct: 588 EIIKELRQLQKGGYASVKAHELIEARKR 615
Score = 41.4 bits (98), Expect = 2e-04
Identities = 21/81 (25%), Positives = 46/81 (56%)
Query: 39 QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEK 98
+++ E+K ++ E ++ EK +++ EEKK+K +EE+DK EE +++ ++ K+ KKE
Sbjct: 528 LERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEAD 587
Query: 99 KQKKKKEEKKNKKAEEQNRQQ 119
+ K+ + + +
Sbjct: 588 EIIKELRQLQKGGYASVKAHE 608
Score = 41.0 bits (97), Expect = 3e-04
Identities = 24/96 (25%), Positives = 57/96 (59%), Gaps = 1/96 (1%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
K+ E+ ++ ++KKE+ ++++ + ++ +KE ++ +E K++ E K+ ++ +K
Sbjct: 541 ALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGG 600
Query: 78 KKEEKKQKKKEEKKQ-KKKEEKKQKKKKEEKKNKKA 112
K + E +K+ K EKK+KKKK++K+ ++
Sbjct: 601 YASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEE 636
Score = 39.0 bits (92), Expect = 0.001
Identities = 20/84 (23%), Positives = 48/84 (57%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
++ + + E+K E+ E +E ++ KEE ++KK++ ++++ K ++ +K+ ++ K+
Sbjct: 523 ASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEA 582
Query: 104 KEEKKNKKAEEQNRQQGLYPNQKR 127
K+E E + Q+G Y + K
Sbjct: 583 KKEADEIIKELRQLQKGGYASVKA 606
Score = 38.7 bits (91), Expect = 0.002
Identities = 31/99 (31%), Positives = 58/99 (58%), Gaps = 2/99 (2%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
++K+K +EEE+K ++ ++E +Q KE KK+ + KE ++ +K K + E +
Sbjct: 555 EKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARK 614
Query: 77 K--KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
+ K EKK+KKK+++K+K++E K + K +K E
Sbjct: 615 RLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQKGE 653
Score = 28.3 bits (64), Expect = 3.2
Identities = 24/107 (22%), Positives = 47/107 (43%), Gaps = 24/107 (22%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK-----KKEEKKQK 85
+ K + E +++ + E+KE+KK+K+KE++++ K ++ K +K E
Sbjct: 598 KGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQKGEVLSI 657
Query: 86 KKE-------------------EKKQKKKEEKKQKKKKEEKKNKKAE 113
+ EK QK K++KK+K K + K +
Sbjct: 658 PDDKEAIVQAGIMKMKVPLSDLEKIQKPKKKKKKKPKTVKPKPRTVS 704
Score = 27.1 bits (61), Expect = 9.0
Identities = 27/119 (22%), Positives = 53/119 (44%), Gaps = 15/119 (12%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK---- 63
K I + ++ +K K + E +++ + ++K+K+KK++KE+++E K
Sbjct: 583 KKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDE 642
Query: 64 -KEEKKQKKEE--EKDKKKEEKKQ--------KKKEEKKQKKKEEKKQKKKKEEKKNKK 111
K +K E KE Q + +K +K ++KK+KK K K +
Sbjct: 643 VKYLSLGQKGEVLSIPDDKEAIVQAGIMKMKVPLSDLEKIQKPKKKKKKKPKTVKPKPR 701
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 46.1 bits (109), Expect = 6e-06
Identities = 24/92 (26%), Positives = 48/92 (52%), Gaps = 5/92 (5%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
N+++ + ++K+++KQ +K + E KE+ +KK+E + ++ ++ K
Sbjct: 5 NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKI 64
Query: 87 KEEKKQKKKEEKKQ-----KKKKEEKKNKKAE 113
+E K+ KEE KQ K K+E +K + E
Sbjct: 65 ADEVKKSTKEESKQLLEVLKTKEEHQKEIQYE 96
Score = 44.5 bits (105), Expect = 2e-05
Identities = 35/124 (28%), Positives = 60/124 (48%), Gaps = 23/124 (18%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEE------KEEKKEKKKEEKKQK 70
R++ ++ ++K E++ QK + E K+K +KKEE KE+ +E +
Sbjct: 5 NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKI 64
Query: 71 KEEEKDKKKEEKKQ-----KKKEEKKQKKKEEKKQK------------KKKEEKKNKKAE 113
+E K KEE KQ K KEE +++ + E QK KK E+ K ++A+
Sbjct: 65 ADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEILQKTIPSFEPKESILKKLEDIKPEQAK 124
Query: 114 EQNR 117
+Q +
Sbjct: 125 KQTK 128
Score = 37.2 bits (86), Expect = 0.005
Identities = 20/80 (25%), Positives = 40/80 (50%), Gaps = 5/80 (6%)
Query: 39 QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEK 98
+ ++E +E+ +EK+++K + + KE+ DKK+E K++ +E
Sbjct: 4 RNRREANINNNDRMQEKDDEKQDQK--NRMELKEKVLDKKEEVVTDNVDSPVKEQSSQE- 60
Query: 99 KQKKKKEEKKNKKAEEQNRQ 118
K E KK+ K E+++Q
Sbjct: 61 NLKIADEVKKSTK--EESKQ 78
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 45.0 bits (107), Expect = 1e-05
Identities = 20/96 (20%), Positives = 41/96 (42%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
KK + ++K K+E K+ + ++K ++ ++ K ++ + +K K
Sbjct: 21 KKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTK 80
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
K K K+K K+E KK E+K ++
Sbjct: 81 TAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDD 116
Score = 45.0 bits (107), Expect = 1e-05
Identities = 24/100 (24%), Positives = 45/100 (45%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+++ K++ K K K + K+E K+ + +KK ++ + K ++
Sbjct: 11 AAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDAT 70
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
+ KK+ K K K K+K K E +KKAE++N
Sbjct: 71 ESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKN 110
Score = 43.8 bits (104), Expect = 3e-05
Identities = 21/103 (20%), Positives = 42/103 (40%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ K +E K+ + ++K ++ ++ ++ ++ E +KK K +
Sbjct: 27 SKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAA 86
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K K+K K+E KK EKK K++ N + Q
Sbjct: 87 AAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQ 129
Score = 43.1 bits (102), Expect = 6e-05
Identities = 25/96 (26%), Positives = 38/96 (39%), Gaps = 1/96 (1%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQK-KKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
K E +++ KKK +K K + K K+E KE E KK+ ++ +
Sbjct: 4 ASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMV 63
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
K + + KK+ K K K KK K E
Sbjct: 64 KDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDE 99
Score = 40.0 bits (94), Expect = 5e-04
Identities = 22/98 (22%), Positives = 39/98 (39%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ KKK K+ K K K K++ K+ + +KK ++ ++ + E
Sbjct: 15 EAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDI 74
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
KK+ K K K K++ K + +K KK
Sbjct: 75 PKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNAL 112
Score = 39.6 bits (93), Expect = 6e-04
Identities = 20/95 (21%), Positives = 33/95 (34%), Gaps = 6/95 (6%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKK------QKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
K + E++ KKK +K + K+E KE + KKK ++ +
Sbjct: 1 MTTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLS 60
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
+ E KKK + K + KK
Sbjct: 61 GMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKK 95
Score = 37.7 bits (88), Expect = 0.003
Identities = 18/105 (17%), Positives = 37/105 (35%), Gaps = 2/105 (1%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K+K ++ ++ + E KKK K K K KK+ K+E
Sbjct: 45 KKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAK--AAAAKAPAKKKLKDELDS 102
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
KK EKK ++ ++ + ++ + ++ +
Sbjct: 103 SKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDI 147
Score = 37.7 bits (88), Expect = 0.003
Identities = 20/93 (21%), Positives = 33/93 (35%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
+ +++ ++K +K + K K KEE +E E KK+ +Q +
Sbjct: 2 TTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSG 61
Query: 80 EEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
K E KK+ K K K K
Sbjct: 62 MVKDTDDATESDIPKKKTKTAAKAAAAKAPAKK 94
Score = 36.9 bits (86), Expect = 0.006
Identities = 14/111 (12%), Positives = 37/111 (33%)
Query: 4 SIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK 63
+ K + KK+ K E KK E+K K+ + +
Sbjct: 68 DATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVL 127
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+ +++ D ++ +++ + ++ ++E+K + E
Sbjct: 128 NQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKEL 178
Score = 36.5 bits (85), Expect = 0.007
Identities = 19/116 (16%), Positives = 39/116 (33%), Gaps = 2/116 (1%)
Query: 2 KISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE 61
+S K KKK + K K KK+ K E KK EKK ++ +
Sbjct: 58 FLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDD 117
Query: 62 KKKEEKKQKKEE--EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+ + + D ++ + E+ + ++ ++ E++
Sbjct: 118 LNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKK 173
Score = 35.7 bits (83), Expect = 0.013
Identities = 18/113 (15%), Positives = 43/113 (38%)
Query: 2 KISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE 61
K + KKK K+E + K+ +K+ K + K + + ++ +
Sbjct: 76 KKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDD 135
Query: 62 KKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
++ + + D E+ + ++ + EEKK+ K+ E+ +
Sbjct: 136 DDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFV 188
Score = 35.0 bits (81), Expect = 0.022
Identities = 15/100 (15%), Positives = 38/100 (38%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K + KKK K + + +K +++ K + ++ + + +++ D
Sbjct: 82 AAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDD 141
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
++ +E + ++ + EEKK K E+
Sbjct: 142 LDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKL 181
Score = 34.2 bits (79), Expect = 0.046
Identities = 15/111 (13%), Positives = 35/111 (31%), Gaps = 1/111 (0%)
Query: 5 IAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKK 64
+P + K E KKK + K K +KK K +
Sbjct: 41 ALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAK-APAKKKLKDE 99
Query: 65 EEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+ +K E++ K++ K+ + ++ ++ + ++
Sbjct: 100 LDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDD 150
Score = 33.0 bits (76), Expect = 0.091
Identities = 12/98 (12%), Positives = 40/98 (40%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+KK K E ++ KK +KK + + + ++ + ++ ++ D
Sbjct: 92 AKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDD 151
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+++ + ++ + +E+K+ K+ ++ +
Sbjct: 152 DDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVW 189
Score = 29.2 bits (66), Expect = 1.9
Identities = 14/54 (25%), Positives = 26/54 (48%)
Query: 61 EKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
K + EE+ KKK +K K + K KEE K+ + ++K ++ ++
Sbjct: 1 MTTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQ 54
Score = 26.9 bits (60), Expect = 9.6
Identities = 11/96 (11%), Positives = 37/96 (38%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K K + ++ + ++ ++ ++ +++ ++E ++K
Sbjct: 113 DKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEK 172
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
KE K+ +K + +E + ++ +K+ K
Sbjct: 173 KEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLTA 208
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 45.0 bits (107), Expect = 1e-05
Identities = 20/86 (23%), Positives = 42/86 (48%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
++ +E + KE +K + EK E EE+K + K++ + + + + E+ + E K
Sbjct: 2 KELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAK 61
Query: 91 KQKKKEEKKQKKKKEEKKNKKAEEQN 116
E ++++EE K + AE +
Sbjct: 62 PAASGEGGGGEEEEEEAKAEAAEFRA 87
Score = 41.2 bits (97), Expect = 2e-04
Identities = 15/90 (16%), Positives = 39/90 (43%)
Query: 39 QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEK 98
++ +E + KE ++ E++K + EE+K + + K++ + + + E+ + E K
Sbjct: 2 KELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAK 61
Query: 99 KQKKKKEEKKNKKAEEQNRQQGLYPNQKRT 128
+ ++ EE + + R
Sbjct: 62 PAASGEGGGGEEEEEEAKAEAAEFRAYLRG 91
Score = 40.8 bits (96), Expect = 3e-04
Identities = 17/86 (19%), Positives = 40/86 (46%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
+E + + +E ++ ++EK + +E+K E + KE+ + + E+ + E K
Sbjct: 2 KELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAK 61
Query: 84 QKKKEEKKQKKKEEKKQKKKKEEKKN 109
E ++EE++ K + E +
Sbjct: 62 PAASGEGGGGEEEEEEAKAEAAEFRA 87
Score = 40.0 bits (94), Expect = 5e-04
Identities = 17/84 (20%), Positives = 38/84 (45%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
++ E K+ +K E++K + EE+K + KEE ++ + ++ E E
Sbjct: 4 LREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPA 63
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKK 102
+ +EE+++ K E + +
Sbjct: 64 ASGEGGGGEEEEEEAKAEAAEFRA 87
Score = 37.3 bits (87), Expect = 0.004
Identities = 14/96 (14%), Positives = 43/96 (44%), Gaps = 2/96 (2%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
K+ +E + ++ ++ + +K E +++K E KEE + E + ++ ++ + K
Sbjct: 2 KELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAK 61
Query: 80 EEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+ ++++EE K + + + ++
Sbjct: 62 PAA--SGEGGGGEEEEEEAKAEAAEFRAYLRGGDDA 95
Score = 30.8 bits (70), Expect = 0.53
Identities = 14/80 (17%), Positives = 32/80 (40%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+K + EE K + +E+ K E + ++ E + + + +++EEE +
Sbjct: 22 EKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEEEAKAE 81
Query: 79 KEEKKQKKKEEKKQKKKEEK 98
E + + +E K
Sbjct: 82 AAEFRAYLRGGDDALAEERK 101
Score = 30.0 bits (68), Expect = 0.93
Identities = 13/67 (19%), Positives = 28/67 (41%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K + EEE + KEE + E + ++ + E + + ++++EE K
Sbjct: 21 DEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEEEAKA 80
Query: 77 KKKEEKK 83
+ E +
Sbjct: 81 EAAEFRA 87
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 44.2 bits (105), Expect = 2e-05
Identities = 34/106 (32%), Positives = 58/106 (54%), Gaps = 11/106 (10%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ--KKEEEKDK 77
K +EE ++++++KE K++K+ K +K++EKK+++ EK EK K + K++E+K
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYS 605
Query: 78 KKEEK----KQKKKEE--KKQKKKEEK---KQKKKKEEKKNKKAEE 114
+E EE KK++KK K KQ K EE K +
Sbjct: 606 AFDETGLPTHDADGEEISKKERKKLSKEYDKQAKLHEEYLAKGGKS 651
Score = 36.2 bits (84), Expect = 0.009
Identities = 21/65 (32%), Positives = 42/65 (64%), Gaps = 1/65 (1%)
Query: 52 KKEEKEE-KKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
K ++KEE ++EK+++E ++++ + KK+E+K+KK+ EK +K K + K++E K
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYS 605
Query: 111 KAEEQ 115
+E
Sbjct: 606 AFDET 610
Score = 35.8 bits (83), Expect = 0.015
Identities = 12/47 (25%), Positives = 32/47 (68%)
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYP 123
K ++++ ++++E+K+ KE+K+ +K K++++ KK E + ++ P
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIP 592
Score = 31.9 bits (73), Expect = 0.27
Identities = 26/84 (30%), Positives = 42/84 (50%), Gaps = 10/84 (11%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQ-------KKKEEKKQKKKEK---KEEKEEKKEKKKEE 66
+ +K KK+EE KKK+ +K EK + K++E+K E + + ++ KKE
Sbjct: 568 RLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYSAFDETGLPTHDADGEEISKKER 627
Query: 67 KKQKKEEEKDKKKEEKKQKKKEEK 90
KK KE +K K E+ K +
Sbjct: 628 KKLSKEYDKQAKLHEEYLAKGGKS 651
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 44.3 bits (105), Expect = 3e-05
Identities = 31/149 (20%), Positives = 56/149 (37%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K+ KE+ K K K K K +K K +KK+KKK+K + KK K+ +EK K
Sbjct: 1150 KEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKL 1209
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLIGYLRSVDQ 138
++ KK ++++++QK K ++ K+ + + S +
Sbjct: 1210 DDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEG 1269
Query: 139 FANLVLHKTIERIHVGNQYGDIPRGIFII 167
+ + P G
Sbjct: 1270 KPKNAPKRVSAVQYSPPPPSKRPDGESNG 1298
Score = 43.1 bits (102), Expect = 6e-05
Identities = 23/77 (29%), Positives = 42/77 (54%)
Query: 38 KQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEE 97
+Q++ EEK+ K+++ + K + K K + K KK+E+K KK K KK K+ +
Sbjct: 1143 EQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVD 1202
Query: 98 KKQKKKKEEKKNKKAEE 114
+K+K ++K + K
Sbjct: 1203 SDEKRKLDDKPDNKKSN 1219
Score = 41.6 bits (98), Expect = 2e-04
Identities = 31/97 (31%), Positives = 49/97 (50%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
K EE ++Q++ EEK+ K++ K K K K + + K KKKE+KK+K +K KK
Sbjct: 1135 DKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASV 1194
Query: 82 KKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
K+ + +K+K + K KK E+ Q
Sbjct: 1195 VGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQ 1231
Score = 41.2 bits (97), Expect = 2e-04
Identities = 27/109 (24%), Positives = 58/109 (53%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K+K ++ + ++K+KE +K K K ++ + +E +E+++ E+K+ +E++ K
Sbjct: 1101 KEKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKS 1160
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKR 127
K + K K + K KKKE+KK+K ++ K +++ +K
Sbjct: 1161 KTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKL 1209
Score = 41.2 bits (97), Expect = 3e-04
Identities = 26/102 (25%), Positives = 52/102 (50%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
+R K K + + K +K K +KK+KKK++ K +K K +EK++ ++
Sbjct: 1155 EQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPD 1214
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
+KK ++++++QK K +K K+ + KKN ++
Sbjct: 1215 NKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSED 1256
Score = 38.5 bits (90), Expect = 0.002
Identities = 25/124 (20%), Positives = 50/124 (40%)
Query: 4 SIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK 63
+ + ++ K K KKKE++ KK K +K K+ EK++ ++ KK
Sbjct: 1158 LKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKK 1217
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYP 123
+E+++++K + KK K K +K K + E + ++E +
Sbjct: 1218 SNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKR 1277
Query: 124 NQKR 127
Sbjct: 1278 VSAV 1281
Score = 36.9 bits (86), Expect = 0.006
Identities = 22/116 (18%), Positives = 40/116 (34%), Gaps = 9/116 (7%)
Query: 12 SPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKE---------KKEEKEEKKEK 62
+ + K KK + + KE K + + K E
Sbjct: 1237 KSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGES 1296
Query: 63 KKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
K ++K KK+ E ++KK+ +K+ ++KK K K A + +R
Sbjct: 1297 NGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRL 1352
Score = 36.2 bits (84), Expect = 0.010
Identities = 26/101 (25%), Positives = 52/101 (51%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K K+ +K + K E+ + ++ ++Q++ E+KE +E++ K K + K K +
Sbjct: 1114 KEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPK 1173
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
KK+EKK+KK K KK K+ + + +K +++
Sbjct: 1174 LKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPD 1214
Score = 33.1 bits (76), Expect = 0.093
Identities = 21/97 (21%), Positives = 37/97 (38%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
K+ K + K+ + K + K KK+ KK + +K K
Sbjct: 1266 SKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKK 1325
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
K E+K +KK+ K + K+ Q + + KK +
Sbjct: 1326 KSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSD 1362
Score = 32.7 bits (75), Expect = 0.15
Identities = 15/76 (19%), Positives = 35/76 (46%)
Query: 7 PKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEE 66
K +KKKK E++ +K+K K KQ + + + +++K + + ++
Sbjct: 1311 KKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDD 1370
Query: 67 KKQKKEEEKDKKKEEK 82
+ E++D + +E
Sbjct: 1371 SEVDDSEDEDDEDDED 1386
Score = 31.2 bits (71), Expect = 0.47
Identities = 17/89 (19%), Positives = 42/89 (47%)
Query: 9 GAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKK 68
G+ K+K KK+ E + KKK++ ++K +KK K + K+ + + +K
Sbjct: 1299 GSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRK 1358
Query: 69 QKKEEEKDKKKEEKKQKKKEEKKQKKKEE 97
+K + + + + ++E + +++
Sbjct: 1359 KKSDSSSEDDDDSEVDDSEDEDDEDDEDD 1387
Score = 29.2 bits (66), Expect = 1.9
Identities = 19/84 (22%), Positives = 31/84 (36%), Gaps = 3/84 (3%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
+K+ + + KK+ KK + K+KKK E KK K K K
Sbjct: 1285 PPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSE---KKTARKKKSKTRV 1341
Query: 83 KQKKKEEKKQKKKEEKKQKKKKEE 106
KQ + + + +K+K
Sbjct: 1342 KQASASQSSRLLRRPRKKKSDSSS 1365
Score = 28.5 bits (64), Expect = 3.3
Identities = 21/103 (20%), Positives = 42/103 (40%), Gaps = 5/103 (4%)
Query: 12 SPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
P K KKK KK+ E ++KK K E+K +K+K K KQ
Sbjct: 1291 RPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKK-----KSEKKTARKKKSKTRVKQAS 1345
Query: 72 EEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+ + ++KK + + + + + E+ ++ + ++
Sbjct: 1346 ASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDDD 1388
Score = 28.1 bits (63), Expect = 4.6
Identities = 36/183 (19%), Positives = 72/183 (39%), Gaps = 54/183 (29%)
Query: 11 ISPRIGKRKKKKKEEEN----------KKKQKKKEEKKQKKKEEKKQKKKEKKEEKE--- 57
I+ + KKK+ K KKK EK ++EE ++ E +E +
Sbjct: 1022 INGELVITNAKKKDLVKELKKLGYVRFKDIIKKKSEKITAEEEEGAEEDDEADDEDDEEE 1081
Query: 58 --------------------EKKEKKKEEKKQKKEE---------------------EKD 76
EK EK E ++K++E E
Sbjct: 1082 LGAAVSYDYLLSMPIWSLTKEKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEAL 1141
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLIGYLRSV 136
+++EE ++K+ ++++ K + K + K + K KK E++ ++ ++K +++G + V
Sbjct: 1142 EEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRV 1201
Query: 137 DQF 139
D
Sbjct: 1202 DSD 1204
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 42.5 bits (101), Expect = 3e-05
Identities = 27/64 (42%), Positives = 35/64 (54%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
KKK KK E + EE K K+ + +KKE K EK +KK E+ K E+KK KK+
Sbjct: 34 KKKGKKGELEITDLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKE 93
Query: 88 EEKK 91
E K
Sbjct: 94 EPKP 97
Score = 39.4 bits (93), Expect = 3e-04
Identities = 26/68 (38%), Positives = 36/68 (52%), Gaps = 12/68 (17%)
Query: 15 IGKRKKKKKEE-------ENKKKQKKK-----EEKKQKKKEEKKQKKKEKKEEKEEKKEK 62
+ ++KK KK E E K K+ +KK+ K EK +KK EK + K EKK+
Sbjct: 31 LAQKKKGKKGELEITDLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKA 90
Query: 63 KKEEKKQK 70
KKEE K +
Sbjct: 91 KKEEPKPR 98
Score = 36.4 bits (85), Expect = 0.003
Identities = 26/68 (38%), Positives = 38/68 (55%), Gaps = 4/68 (5%)
Query: 46 KQKKKEKKEEKEEKK--EKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
QKKK KK E E E+ K+ K+ + DKK E K +K EKK +K + K +KKK
Sbjct: 32 AQKKKGKKGELEITDLNEEYKDLKESLEAALLDKK--ELKAWEKAEKKAEKAKAKAEKKK 89
Query: 104 KEEKKNKK 111
++++ K
Sbjct: 90 AKKEEPKP 97
Score = 35.2 bits (82), Expect = 0.007
Identities = 21/65 (32%), Positives = 36/65 (55%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
K+K KK E E ++ ++ K+ + KK+ K EK EKK +K + K +KK+ +
Sbjct: 32 AQKKKGKKGELEITDLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAK 91
Query: 75 KDKKK 79
K++ K
Sbjct: 92 KEEPK 96
Score = 27.9 bits (63), Expect = 2.9
Identities = 10/37 (27%), Positives = 23/37 (62%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKK 50
+ K++ K E+ KK +K K + ++KK ++++ K +
Sbjct: 62 LLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 27.1 bits (61), Expect = 4.7
Identities = 20/59 (33%), Positives = 30/59 (50%), Gaps = 2/59 (3%)
Query: 59 KKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
KK+ KK E + E+ K +E + +KK+ K EK +KK EK KAE++
Sbjct: 34 KKKGKKGELEITDLNEEYKDLKESLEAALLDKKELKAWEKAEKKA--EKAKAKAEKKKA 90
>gnl|CDD|212466 cd01719, Sm_G, Sm protein G. The eukaryotic Sm proteins (B/B', D1,
D2, D3, E, F and G) assemble into a hetero-heptameric
ring around the Sm site of the 2,2,7-trimethyl guanosine
(m3G) capped U1, U2, U4 and U5 snRNAs (Sm snRNAs)
forming the core of the snRNP particle. The snRNP
particle, in turn, assembles with other components onto
the pre-mRNA to form the spliceosome which is
responsible for the excision of introns and the ligation
of exons. Members of this family share a highly
conserved Sm fold containing an N-terminal helix
followed by a strongly bent five-stranded antiparallel
beta-sheet. Sm subunit G binds subunits E and F to form
a trimer which then assembles onto snRNA along with the
D1/D2 and D3/B heterodimers forming a seven-membered
ring structure.
Length = 70
Score = 40.2 bits (95), Expect = 4e-05
Identities = 18/52 (34%), Positives = 29/52 (55%), Gaps = 3/52 (5%)
Query: 124 NQKRTLIGYLRSVDQFANLVLHKTIERIHVGNQYGDIPRGIFIIRGENVVLM 175
N R + G LR D F NLVL +E + G + P G+ +IRG +++++
Sbjct: 18 NGNRKVSGVLRGFDPFMNLVLDDAVEEVGDGEK---TPIGMVVIRGNSIIMI 66
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 42.9 bits (101), Expect = 6e-05
Identities = 21/84 (25%), Positives = 48/84 (57%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
++ K K ++ K+ KK + KK + +++ EE + K KEEK +++
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQE 410
Query: 87 KEEKKQKKKEEKKQKKKKEEKKNK 110
+ EKKQK++ ++ ++K++++++ K
Sbjct: 411 ENEKKQKEQADEDKEKRQKDERKK 434
Score = 41.7 bits (98), Expect = 1e-04
Identities = 22/70 (31%), Positives = 44/70 (62%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
K++E K+ KK ++ +K E K + ++ E + K+EK K+E+ +KK++E+ + +
Sbjct: 365 DKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDK 424
Query: 81 EKKQKKKEEK 90
EK+QK + +K
Sbjct: 425 EKRQKDERKK 434
Score = 41.3 bits (97), Expect = 2e-04
Identities = 19/83 (22%), Positives = 50/83 (60%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
K EE K +K+Q+ +E +K ++ ++ E K++ ++ + + +++K K+E
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQE 410
Query: 82 KKQKKKEEKKQKKKEEKKQKKKK 104
+ +KK++E+ + KE++++ ++K
Sbjct: 411 ENEKKQKEQADEDKEKRQKDERK 433
Score = 39.8 bits (93), Expect = 6e-04
Identities = 16/69 (23%), Positives = 47/69 (68%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
KR++ KE K + K+ + K + + ++ + K +E++ K+E+ ++++K++ +E+K+
Sbjct: 366 KRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKE 425
Query: 77 KKKEEKKQK 85
K+++++++K
Sbjct: 426 KRQKDERKK 434
Score = 38.6 bits (90), Expect = 0.001
Identities = 16/70 (22%), Positives = 44/70 (62%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
++ ++ K+ +K Q ++ + K E +E + + K+E+ KQ++ E+K K++ ++ +
Sbjct: 365 DKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDK 424
Query: 85 KKKEEKKQKK 94
+K+++ ++KK
Sbjct: 425 EKRQKDERKK 434
Score = 37.9 bits (88), Expect = 0.002
Identities = 23/86 (26%), Positives = 48/86 (55%), Gaps = 3/86 (3%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
K EE K ++++ KE ++ ++ K+ + K E + ++ E K KEEK
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKA---KEEKL 407
Query: 92 QKKKEEKKQKKKKEEKKNKKAEEQNR 117
++++ EKKQK++ +E K K+ +++ +
Sbjct: 408 KQEENEKKQKEQADEDKEKRQKDERK 433
Score = 36.3 bits (84), Expect = 0.009
Identities = 18/71 (25%), Positives = 46/71 (64%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
G ++++ +E NKK Q ++ + K E ++ E K ++E+ K+++ E+K++++ +E
Sbjct: 364 GDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADED 423
Query: 76 DKKKEEKKQKK 86
+K+++ ++KK
Sbjct: 424 KEKRQKDERKK 434
Score = 35.9 bits (83), Expect = 0.010
Identities = 18/68 (26%), Positives = 41/68 (60%)
Query: 11 ISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
+ + KK ++ K + K E ++ E K +++K K+EE E+K++++ +E K+K
Sbjct: 367 RQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEK 426
Query: 71 KEEEKDKK 78
+++++ KK
Sbjct: 427 RQKDERKK 434
Score = 29.8 bits (67), Expect = 0.96
Identities = 16/75 (21%), Positives = 42/75 (56%)
Query: 45 KKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
++ K + +K+++ +E +K ++ K E K + E+ + K +E+K K+++
Sbjct: 352 LYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEE 411
Query: 105 EEKKNKKAEEQNRQQ 119
EKK K+ ++++++
Sbjct: 412 NEKKQKEQADEDKEK 426
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 42.6 bits (101), Expect = 6e-05
Identities = 18/56 (32%), Positives = 30/56 (53%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
+ K E++ K KK K+ +KKE++K+K K KK ++ K K+ + EE
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 42.6 bits (101), Expect = 6e-05
Identities = 18/67 (26%), Positives = 36/67 (53%)
Query: 11 ISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
+ R+ + K + ++KK K +K K+ EKK+K+KEK + K+ ++ K K++K
Sbjct: 368 LKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRK 427
Query: 71 KEEEKDK 77
++
Sbjct: 428 PSGTSEE 434
Score = 41.9 bits (99), Expect = 1e-04
Identities = 13/61 (21%), Positives = 31/61 (50%)
Query: 57 EEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
+E + K K ++K + K ++ +KK++EK++ K +++ + K K+ K +
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSE 433
Query: 117 R 117
Sbjct: 434 E 434
Score = 41.5 bits (98), Expect = 2e-04
Identities = 19/61 (31%), Positives = 32/61 (52%)
Query: 54 EEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
+E K + E+K K ++ K+ EKK+K+KE+ K KK+ + K K + +E
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSE 433
Query: 114 E 114
E
Sbjct: 434 E 434
Score = 40.3 bits (95), Expect = 4e-04
Identities = 23/77 (29%), Positives = 38/77 (49%)
Query: 9 GAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKK 68
G I I + K + +E + K K EKK K +K K+ +K+EKE++K K K+ +
Sbjct: 358 GKIERYIEEPLKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHR 417
Query: 69 QKKEEEKDKKKEEKKQK 85
K K +K ++
Sbjct: 418 DTKNIGKRRKPSGTSEE 434
Score = 39.2 bits (92), Expect = 0.001
Identities = 15/63 (23%), Positives = 35/63 (55%)
Query: 56 KEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+E K + +E + K + +KK + +K ++ +KK++EK++ K K+ ++ K +
Sbjct: 365 EEPLKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGK 424
Query: 116 NRQ 118
R+
Sbjct: 425 RRK 427
Score = 34.5 bits (80), Expect = 0.034
Identities = 12/53 (22%), Positives = 28/53 (52%)
Query: 6 APKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEE 58
AP + + K+ K+ E+ +K+++K + KK+ + + K+++ EE
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 33.0 bits (76), Expect = 0.084
Identities = 16/52 (30%), Positives = 27/52 (51%), Gaps = 1/52 (1%)
Query: 4 SIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEE 55
S G S ++ ++ +KKE+E K+K K K+ + K K++K EE
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKE-KEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
Provisional.
Length = 330
Score = 42.5 bits (101), Expect = 7e-05
Identities = 23/64 (35%), Positives = 33/64 (51%), Gaps = 5/64 (7%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
+KK +K E + E+ K+ K+E K +KKE K K QKK+E K+E K K
Sbjct: 32 RKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKE-----KQEAKAAKA 86
Query: 88 EEKK 91
+ K
Sbjct: 87 KSKP 90
Score = 39.0 bits (92), Expect = 9e-04
Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 15 IGKRKKKKKEE-ENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
+ +RKK +K E E ++ +E K++ K KK+ K K +KK++K+E K K +
Sbjct: 29 LAQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKS 88
Query: 74 EK 75
+
Sbjct: 89 KP 90
Score = 39.0 bits (92), Expect = 9e-04
Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 7/64 (10%)
Query: 46 KQKKKEKKEEKE-----EKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
Q+KK +K E E E+ ++ KEE K ++K+ K K QKKKE KQ+ K K +
Sbjct: 30 AQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKE--KQEAKAAKAK 87
Query: 101 KKKK 104
K +
Sbjct: 88 SKPR 91
Score = 34.4 bits (80), Expect = 0.034
Identities = 19/64 (29%), Positives = 34/64 (53%), Gaps = 2/64 (3%)
Query: 59 KKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+K+ +K E + E+ K+ +E+ + +KK+ K K QKKK EK+ KA + +
Sbjct: 32 RKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKK--EKQEAKAAKAKSK 89
Query: 119 QGLY 122
L+
Sbjct: 90 PRLF 93
Score = 31.3 bits (72), Expect = 0.27
Identities = 18/62 (29%), Positives = 33/62 (53%), Gaps = 8/62 (12%)
Query: 30 KQKKKEEKKQKK--------KEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
Q+KK +K + + KE K++ K ++KE K K ++KK+K+E + K K +
Sbjct: 30 AQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89
Query: 82 KK 83
+
Sbjct: 90 PR 91
Score = 29.8 bits (68), Expect = 0.88
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKK 50
KKE + K +KK+EK++ K + K K +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 27.5 bits (62), Expect = 5.1
Identities = 9/33 (27%), Positives = 17/33 (51%), Gaps = 2/33 (6%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKK 46
+ K++ K + KKK+K++ K K + K
Sbjct: 60 LLDKKELKAWHKAQKKKEKQE--AKAAKAKSKP 90
>gnl|CDD|224869 COG1958, LSM1, Small nuclear ribonucleoprotein (snRNP) homolog
[Transcription].
Length = 79
Score = 39.6 bits (93), Expect = 9e-05
Identities = 18/52 (34%), Positives = 24/52 (46%), Gaps = 1/52 (1%)
Query: 127 RTLIGYLRSVDQFANLVLHKTIERIHVGNQYGDIPRGI-FIIRGENVVLMGE 177
R G L DQ+ NLVL E I + G +IRG+N+VL+
Sbjct: 28 REYRGTLVGFDQYMNLVLDDVEEIISHDGEKNVRRLGGEVLIRGDNIVLISP 79
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 42.3 bits (99), Expect = 1e-04
Identities = 24/111 (21%), Positives = 62/111 (55%), Gaps = 3/111 (2%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
GK + + + E +K + + + + Q E K + E++ E + E K++EK
Sbjct: 798 EGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGG 857
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA---EEQNRQQGLY 122
D E++++++EE++++++EE+++++++EE + + E ++Q +Y
Sbjct: 858 SDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQAIY 908
Score = 38.1 bits (88), Expect = 0.003
Identities = 20/111 (18%), Positives = 51/111 (45%), Gaps = 9/111 (8%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE---------EKKQ 69
K E E+ ++ +E ++ + E + ++ + E+E + E K E E+K
Sbjct: 632 KGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKG 691
Query: 70 KKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
++E E + + +E K + E ++ + E + + + E++ + E+ +
Sbjct: 692 EQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVE 742
Score = 37.3 bits (86), Expect = 0.005
Identities = 19/101 (18%), Positives = 51/101 (50%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
R G+ ++ E E + ++ E +Q+ + E K + + + E E+K +++ E + + +E
Sbjct: 644 RTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKE 703
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
K + E ++ + E + + + E + + + E+ + +E
Sbjct: 704 ADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDE 744
Score = 35.4 bits (81), Expect = 0.018
Identities = 18/86 (20%), Positives = 49/86 (56%)
Query: 9 GAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKK 68
+ + + ++E N + Q + ++ ++ + +EE+EE++E+++EE++
Sbjct: 820 ETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEE 879
Query: 69 QKKEEEKDKKKEEKKQKKKEEKKQKK 94
+++EEE++++ EE + E +QK+
Sbjct: 880 EEEEEEEEEENEEPLSLEWPETRQKQ 905
Score = 33.8 bits (77), Expect = 0.054
Identities = 18/115 (15%), Positives = 54/115 (46%), Gaps = 2/115 (1%)
Query: 8 KGAISPRIGKRKKKKKEE--ENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
G + G+R + + E E + ++E + + K E + + + + + E++ E + E
Sbjct: 641 TGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIE 700
Query: 66 EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
K+ + E + ++ E + + + E + + E + ++ +E + + E + + +
Sbjct: 701 AKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEV 755
Score = 33.4 bits (76), Expect = 0.073
Identities = 20/109 (18%), Positives = 55/109 (50%), Gaps = 4/109 (3%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
G+ + E+E + + K + E + + E+K +++ + E + ++ + K E + ++ E E
Sbjct: 660 GEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEG 719
Query: 76 DKKKE----EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
+ + E E + + EE ++ + E + + + K E + + ++ +G
Sbjct: 720 ETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEG 768
Score = 33.4 bits (76), Expect = 0.092
Identities = 19/102 (18%), Positives = 53/102 (51%), Gaps = 10/102 (9%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
K E + K + E + + EK E + + + + + + + + E++ E + + K++EK
Sbjct: 792 KGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKG 851
Query: 92 ----------QKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYP 123
++EE+++++++EE++ ++ EE+ ++ P
Sbjct: 852 VDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEP 893
Score = 33.4 bits (76), Expect = 0.092
Identities = 25/105 (23%), Positives = 51/105 (48%), Gaps = 1/105 (0%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
K K E E ++ + + E + + ++E + + E+ EE E++ E + E K + E E
Sbjct: 701 AKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEA-EGKHEVETEG 759
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
D+K+ E + + + E K+ + E + Q + E K + E +
Sbjct: 760 DRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHE 804
Score = 31.5 bits (71), Expect = 0.39
Identities = 20/98 (20%), Positives = 53/98 (54%), Gaps = 2/98 (2%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
K +E + K + + E + +K+E + + + + ++ E K E E++ E + + K++E
Sbjct: 792 KGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDET--GEQELNAENQGEAKQDE 849
Query: 82 KKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K EE+++++++EE++ ++ EE+ ++
Sbjct: 850 KGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEE 887
Score = 31.5 bits (71), Expect = 0.40
Identities = 20/97 (20%), Positives = 49/97 (50%), Gaps = 1/97 (1%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
E E K+ K E + ++ + E + + + ++E E + ++ EE + + E E + K E +
Sbjct: 698 EIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEA-EGKHEVE 756
Query: 84 QKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
+ ++ + + E + + K+ E++ +A E +G
Sbjct: 757 TEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKG 793
Score = 30.3 bits (68), Expect = 0.89
Identities = 22/119 (18%), Positives = 56/119 (47%), Gaps = 7/119 (5%)
Query: 6 APKGAISPRIGKRKKKKKEEENKKKQKKK----EEKKQKKKEEKKQKKKEKKEEKEEKKE 61
G S G +++ E E K + + + E+K +++ E + + KE + E + E
Sbjct: 657 GENGEES---GGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAE 713
Query: 62 KKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
+ + E + + E +D+ + E ++ +E + + + E + + + + E K+ E + +
Sbjct: 714 EVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEA 772
Score = 30.0 bits (67), Expect = 0.96
Identities = 22/107 (20%), Positives = 50/107 (46%), Gaps = 2/107 (1%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKK--EKKEEKEEKKEKKKEEKKQKKEE 73
+ K + + E ++K E++ + + E K+ K + EE E + E + E + + E
Sbjct: 673 TETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEI 732
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
E ++ EE + + + E + K + E + +K+ E + + E +
Sbjct: 733 ETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDED 779
Score = 29.2 bits (65), Expect = 1.7
Identities = 19/118 (16%), Positives = 64/118 (54%), Gaps = 14/118 (11%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
G+ + + KE+E++ + + E+ + K E + K + + E + +K++ + + + + ++ +
Sbjct: 768 GETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTE 827
Query: 76 DKKKEEKKQKKKEEKKQKKKEEK--------------KQKKKKEEKKNKKAEEQNRQQ 119
K + +++ E + + K++EK ++++++EE++ ++ EE+ ++
Sbjct: 828 VKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEE 885
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 41.7 bits (98), Expect = 1e-04
Identities = 26/98 (26%), Positives = 44/98 (44%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+R +++++ ++ K+ KQK+ EE K + K E + K+ +K E
Sbjct: 109 ERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAK 168
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
KK E + KK + +KK E + K E K K E
Sbjct: 169 KKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAE 206
Score = 41.3 bits (97), Expect = 2e-04
Identities = 28/97 (28%), Positives = 48/97 (49%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
R K+ ++E +++KK+ ++ K+ KQK+ E+ K K K E + K+ K
Sbjct: 102 RLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAK 161
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
K + +KK E + KK + +KK + E K A E
Sbjct: 162 KAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAE 198
Score = 39.8 bits (93), Expect = 5e-04
Identities = 28/98 (28%), Positives = 46/98 (46%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K++ ++ ++ KQK+ EE K K K + + + +K E K+K E E
Sbjct: 117 KKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAA 176
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
KK + +KK E + K + +KK + E K K A E
Sbjct: 177 KKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAE 214
Score = 39.0 bits (91), Expect = 9e-04
Identities = 23/98 (23%), Positives = 50/98 (51%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
++K+ ++E K+ +K++ +++KK+ ++ K+ ++K+ ++ K K + E +
Sbjct: 93 QQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAE 152
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
K+ KK + +KK E + KK E K K E
Sbjct: 153 AKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAE 190
Score = 38.2 bits (89), Expect = 0.002
Identities = 23/101 (22%), Positives = 50/101 (49%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ K+ E+E Q++K++ ++ K+ ++K+ ++ + K + + + K K
Sbjct: 102 RLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAK 161
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K + KKK E + KK + KKK E + KA + +++
Sbjct: 162 KAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKK 202
Score = 38.2 bits (89), Expect = 0.002
Identities = 25/108 (23%), Positives = 59/108 (54%), Gaps = 2/108 (1%)
Query: 12 SPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
+ R +++KKK++++ ++ Q+K+ ++++ K+ +K++ ++++K+ ++ K+ KQK+
Sbjct: 74 AKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQ 133
Query: 72 EEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
EE K + K E + K+ KK E K K E ++
Sbjct: 134 AEEAAAKAAAAAKAKAEA--EAKRAAAAAKKAAAEAKKKAEAEAAKKA 179
Score = 38.2 bits (89), Expect = 0.002
Identities = 29/104 (27%), Positives = 50/104 (48%), Gaps = 1/104 (0%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKK-EEKKQKKEEEK 75
++KK+ +E + K+K+ ++ K K K + E K KK + +KK E +
Sbjct: 115 EQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAE 174
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
KK + KKK E + K + KKK E + KKA + +++
Sbjct: 175 AAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKK 218
Score = 38.2 bits (89), Expect = 0.002
Identities = 28/98 (28%), Positives = 44/98 (44%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K+ K+++ + K K K + E K+ K+ E K+K + E +K E
Sbjct: 125 KQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAK 184
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
KK E + K + +KK E + +KK E K K A E
Sbjct: 185 KKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAE 222
Score = 38.2 bits (89), Expect = 0.002
Identities = 20/99 (20%), Positives = 53/99 (53%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+++ ++ +++ +Q++ ++ ++++ ++QKK+ ++ K+ ++K+ E+ K
Sbjct: 86 QQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAA 145
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
K K E + K+ +K E K+K + E K AE +
Sbjct: 146 KAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAK 184
Score = 36.7 bits (85), Expect = 0.006
Identities = 23/92 (25%), Positives = 39/92 (42%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
KK KK+ + E K+ E KK+ + E + + +KK E + +KK + KKK
Sbjct: 160 AKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKA 219
Query: 81 EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
+ K K + + +K + K A
Sbjct: 220 AAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAA 251
Score = 35.6 bits (82), Expect = 0.013
Identities = 21/96 (21%), Positives = 40/96 (41%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
KK E KKK + + KK + +KK + + + E K+K + E K+K E KK
Sbjct: 160 AKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKA 219
Query: 80 EEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+ + + + K ++ + + A +
Sbjct: 220 AAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKA 255
Score = 35.6 bits (82), Expect = 0.014
Identities = 26/100 (26%), Positives = 40/100 (40%), Gaps = 1/100 (1%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK- 78
+ K+ KK + +KK + + KK + KK+ + E K E K+K E E KK
Sbjct: 152 EAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKA 211
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
E K+K E K + + K EK +
Sbjct: 212 AAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAA 251
Score = 35.2 bits (81), Expect = 0.021
Identities = 22/90 (24%), Positives = 59/90 (65%), Gaps = 2/90 (2%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
++++++K K+ EE+++KK++++ E+ ++K+ ++E+ ++ E+E+ +E+KKQ ++ K
Sbjct: 66 RQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAK 125
Query: 91 K--QKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+ K+K+ ++ K KAE + ++
Sbjct: 126 QAALKQKQAEEAAAKAAAAAKAKAEAEAKR 155
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 41.9 bits (99), Expect = 1e-04
Identities = 17/69 (24%), Positives = 38/69 (55%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
K K+ + E+++++ + + + E K K+E++ +E+ EE++D K E K +
Sbjct: 225 KLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLE 284
Query: 88 EEKKQKKKE 96
K++K +E
Sbjct: 285 ILKEKKDEE 293
Score = 41.5 bits (98), Expect = 2e-04
Identities = 20/69 (28%), Positives = 36/69 (52%), Gaps = 2/69 (2%)
Query: 37 KKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK--KEEKKQKKKEEKKQKK 94
KK K+ E +++++E E E E K +++Q+ E+D EEK+ K E K
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283
Query: 95 KEEKKQKKK 103
+ K++K +
Sbjct: 284 EILKEKKDE 292
Score = 41.5 bits (98), Expect = 2e-04
Identities = 16/71 (22%), Positives = 40/71 (56%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
KK K+ E ++++++ + + + E K K++++ +E+ +E++ K E+ DK
Sbjct: 223 TKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDK 282
Query: 78 KKEEKKQKKKE 88
+ K++K +E
Sbjct: 283 LEILKEKKDEE 293
Score = 40.0 bits (94), Expect = 6e-04
Identities = 21/71 (29%), Positives = 36/71 (50%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
KK KE ++++++ + + + E K K+E++ EE EEK+ + E K
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283
Query: 80 EEKKQKKKEEK 90
E K+KK EE
Sbjct: 284 EILKEKKDEEL 294
Score = 39.2 bits (92), Expect = 0.001
Identities = 17/71 (23%), Positives = 38/71 (53%)
Query: 41 KKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
KK ++ + +E++EE + + E E K K+E+E +++ +++E K ++ K
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283
Query: 101 KKKKEEKKNKK 111
+ KE+K +
Sbjct: 284 EILKEKKDEEL 294
Score = 37.7 bits (88), Expect = 0.003
Identities = 14/71 (19%), Positives = 38/71 (53%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQK 93
K+ K+ + EE++++ + E E K K+E++ +E+ +E++ K E+ +
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283
Query: 94 KKEEKKQKKKK 104
+ ++K+ ++
Sbjct: 284 EILKEKKDEEL 294
Score = 37.3 bits (87), Expect = 0.004
Identities = 19/65 (29%), Positives = 34/65 (52%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+ +++EEE + + E K K+E++ +++ EEK++ K E K E K+K
Sbjct: 230 SETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEK 289
Query: 78 KKEEK 82
K EE
Sbjct: 290 KDEEL 294
Score = 35.7 bits (83), Expect = 0.012
Identities = 14/57 (24%), Positives = 29/57 (50%)
Query: 60 KEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
K+ K+ + +++EEE D + E + K +++Q+ E+ EEK++ E
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDL 280
Score = 35.0 bits (81), Expect = 0.028
Identities = 14/64 (21%), Positives = 31/64 (48%)
Query: 52 KKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
KK ++ + E+++EE + E + K +++Q+ E+ E+K+ K E +K
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283
Query: 112 AEEQ 115
+
Sbjct: 284 EILK 287
Score = 33.1 bits (76), Expect = 0.092
Identities = 12/70 (17%), Positives = 38/70 (54%), Gaps = 2/70 (2%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK--KQKKKEEKKQKKKKEE 106
KK K+ + E++E++ + + + E K K+E++ +++ ++K++ K + +
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283
Query: 107 KKNKKAEEQN 116
+ K+ +++
Sbjct: 284 EILKEKKDEE 293
Score = 30.7 bits (70), Expect = 0.61
Identities = 17/32 (53%), Positives = 24/32 (75%)
Query: 61 EKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQ 92
EKKKE K EEK KK+E+K++ K+EEK++
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKER 679
Score = 30.0 bits (68), Expect = 0.92
Identities = 11/57 (19%), Positives = 30/57 (52%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
+ + E ++ K K+E++ +++ ++++ +K E +K + K++K EE
Sbjct: 237 EETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDEE 293
Score = 29.6 bits (67), Expect = 1.4
Identities = 14/71 (19%), Positives = 32/71 (45%), Gaps = 2/71 (2%)
Query: 7 PKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE- 65
K + +R+++ E + K +++Q+ E+ +++E +K E +
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283
Query: 66 -EKKQKKEEEK 75
K+KK+EE
Sbjct: 284 EILKEKKDEEL 294
Score = 28.8 bits (65), Expect = 2.2
Identities = 14/31 (45%), Positives = 23/31 (74%)
Query: 70 KKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
KK+E K E+K KK+E+K++ K+EEK++
Sbjct: 649 KKKEFKILDYTEEKTKKEEKKEKNKREEKER 679
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 40.4 bits (95), Expect = 2e-04
Identities = 30/119 (25%), Positives = 64/119 (53%), Gaps = 8/119 (6%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+KKKK+ E +K KK+ E+KQK K +KK+ KK+K ++K++K +KK ++ ++K E+E
Sbjct: 58 TEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKE 117
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQ--------KKKKEEKKNKKAEEQNRQQGLYPNQ 125
+ K E+ + E + + ++ + + + K+ + ++ ++ P
Sbjct: 118 AEDKLEDLTKSYSETLSTLSELKPRKYALHKDIYQSRLDRKRRAEVAKKEAEELQNPGF 176
Score = 34.3 bits (79), Expect = 0.022
Identities = 26/65 (40%), Positives = 42/65 (64%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
E E K++KKE +E +K KKE E+ +K + KK+K K++K + K ++ +K K EKK++
Sbjct: 56 EYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115
Query: 111 KAEEQ 115
K E
Sbjct: 116 KEAED 120
Score = 32.0 bits (73), Expect = 0.14
Identities = 27/59 (45%), Positives = 43/59 (72%)
Query: 57 EEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
E E KK++K+ +E EK KK+ E+KQK K +KK+ KK++ K K KK++KK+ K+E++
Sbjct: 55 AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK 113
Score = 31.6 bits (72), Expect = 0.17
Identities = 20/94 (21%), Positives = 50/94 (53%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
+++K K +++ KK+K K++ K+ K++ K +KK++KE +++ ++ K +
Sbjct: 77 EYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTL 136
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
+ K ++ K + + +K + KK+ E+
Sbjct: 137 SELKPRKYALHKDIYQSRLDRKRRAEVAKKEAEE 170
Score = 31.6 bits (72), Expect = 0.18
Identities = 29/69 (42%), Positives = 43/69 (62%), Gaps = 5/69 (7%)
Query: 54 EEKEEKKEKKKE-----EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
E E K+KKKE EK +K+ EEK K K +KK+ KK++ K K K++ K+ K E+K
Sbjct: 55 AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114
Query: 109 NKKAEEQNR 117
K+AE++
Sbjct: 115 EKEAEDKLE 123
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 40.9 bits (96), Expect = 2e-04
Identities = 21/90 (23%), Positives = 40/90 (44%), Gaps = 1/90 (1%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
E ++ + E + +++ K E + + K K++ + +KK ++ K K K + K K
Sbjct: 50 EAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKP 109
Query: 95 K-EEKKQKKKKEEKKNKKAEEQNRQQGLYP 123
K + + + KK K KA Q P
Sbjct: 110 KVKPQPKPKKPPSKTAAKAPAAPNQPARPP 139
Score = 37.8 bits (88), Expect = 0.002
Identities = 17/84 (20%), Positives = 35/84 (41%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
EE + + +E+ K + E + K +++ K EKK ++ K K + + K +
Sbjct: 51 APTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPK 110
Query: 81 EKKQKKKEEKKQKKKEEKKQKKKK 104
K Q K ++ K + +
Sbjct: 111 VKPQPKPKKPPSKTAAKAPAAPNQ 134
Score = 37.1 bits (86), Expect = 0.004
Identities = 19/78 (24%), Positives = 35/78 (44%)
Query: 38 KQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEE 97
K E ++ + + E E++ K E + E K KE+ K +KK +K + K +
Sbjct: 44 LAAKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKP 103
Query: 98 KKQKKKKEEKKNKKAEEQ 115
K + K K + + K +
Sbjct: 104 KPKPKPKVKPQPKPKKPP 121
Score = 36.3 bits (84), Expect = 0.007
Identities = 17/74 (22%), Positives = 33/74 (44%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ + E ++ + E + + K K+K K E+K +K + K + K + K + K +
Sbjct: 56 PQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQP 115
Query: 79 KEEKKQKKKEEKKQ 92
K +K K K
Sbjct: 116 KPKKPPSKTAAKAP 129
Score = 29.4 bits (66), Expect = 1.1
Identities = 17/61 (27%), Positives = 26/61 (42%)
Query: 13 PRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
P K K+K K E+ KK K K + K K K + K + K KK + + + +
Sbjct: 79 PTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKAPAAPNQPARP 138
Query: 73 E 73
Sbjct: 139 P 139
Score = 28.2 bits (63), Expect = 2.5
Identities = 15/64 (23%), Positives = 27/64 (42%), Gaps = 2/64 (3%)
Query: 57 EEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK--KEEKKQKKKKEEKKNKKAEE 114
E ++ + + + EE+ E + + K K+ K EKK KK K + K K +
Sbjct: 49 LEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPK 108
Query: 115 QNRQ 118
+
Sbjct: 109 PKVK 112
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 41.4 bits (97), Expect = 2e-04
Identities = 30/114 (26%), Positives = 50/114 (43%), Gaps = 3/114 (2%)
Query: 13 PRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
I K+ +K +++NKK K + K KK + K++KK +KK ++ + + K K+
Sbjct: 46 LDINKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQ 105
Query: 73 EEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQK 126
++ K + +K KK K K KK KK + Y N K
Sbjct: 106 FAGPLAISLMRKPKPKTEKLKK---KITVNKSTNKKKKKVLSSKDELIKYDNNK 156
Score = 34.0 bits (78), Expect = 0.049
Identities = 23/99 (23%), Positives = 40/99 (40%)
Query: 12 SPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
+ + K K ++ K KQKKK +KK ++ K K+ ++ K
Sbjct: 60 NKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPK 119
Query: 72 EEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
+ + KK+ K +KK+K K + K + K K
Sbjct: 120 PKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKPK 158
Score = 29.4 bits (66), Expect = 1.7
Identities = 33/162 (20%), Positives = 62/162 (38%), Gaps = 1/162 (0%)
Query: 29 KKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKE 88
K K K + + + K +K +K KK +K K ++ KD KK + KQKKK
Sbjct: 24 KNLKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKI 83
Query: 89 EKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLIGYLRSVDQFANLVLHKTI 148
+KK ++ + + A P K + +V++ N K +
Sbjct: 84 KKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVL 143
Query: 149 ERIHVGNQYG-DIPRGIFIIRGENVVLMGEVVTVVNVKVIKG 189
+Y + P+ I I + + ++ + ++IK
Sbjct: 144 SSKDELIKYDNNKPKSISIHSPLTIQELSTLLCIPETEIIKS 185
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 41.1 bits (96), Expect = 2e-04
Identities = 24/116 (20%), Positives = 55/116 (47%), Gaps = 7/116 (6%)
Query: 11 ISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
+ ++ ++ +EE KKQ ++ +QK + K++ E +++++E K K
Sbjct: 206 RESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPAD 265
Query: 71 KEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK-------EEKKNKKAEEQNRQQ 119
K+ K+ + QK++ EK Q + ++ ++ K + K+ KA E+ +
Sbjct: 266 TSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAED 321
Score = 39.2 bits (91), Expect = 8e-04
Identities = 28/100 (28%), Positives = 53/100 (53%), Gaps = 3/100 (3%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQK--KKEEKKQKKKEKKEEKEEKKEKKKE-EKKQKKEEE 74
++K ++N KQ+ + +KQ+ K K KE+K+ + +K+E EK Q + ++
Sbjct: 233 QQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKK 292
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
D++ + K K + KQ+ K +K+ + KE + KK E
Sbjct: 293 NDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREP 332
Score = 39.2 bits (91), Expect = 0.001
Identities = 20/103 (19%), Positives = 53/103 (51%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K +++ + + +++ + ++++++ K K + KE K+ + QK+E EK
Sbjct: 227 IDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKA 286
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+ + +K ++ + K K + KQ+ K EK+ + E + +++
Sbjct: 287 QIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKK 329
Score = 34.6 bits (79), Expect = 0.031
Identities = 26/96 (27%), Positives = 52/96 (54%), Gaps = 1/96 (1%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
++K++E +N K K+ K+ + QK++ +K + E KK +E K K + D K
Sbjct: 251 RQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKK-NDEEALKAKDHKAFDLK 309
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+E K +K+ E K+ + ++K++ ++ +K K E
Sbjct: 310 QESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVE 345
Score = 33.4 bits (76), Expect = 0.065
Identities = 15/86 (17%), Positives = 45/86 (52%)
Query: 36 EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKK 95
++ Q+ KEE +K+ + + +++ + KQ+ E + +++ + K + K+
Sbjct: 213 KRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKED 272
Query: 96 EEKKQKKKKEEKKNKKAEEQNRQQGL 121
++ + +K+E +K + ++N ++ L
Sbjct: 273 KQVAENQKREIEKAQIEIKKNDEEAL 298
Score = 33.4 bits (76), Expect = 0.067
Identities = 23/119 (19%), Positives = 59/119 (49%), Gaps = 2/119 (1%)
Query: 3 ISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEK 62
+S +IS + ++ E+ ++ + K+++ +E ++ ++ KEE ++K+
Sbjct: 169 VSDVDTDSISDKKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQID 228
Query: 63 --KKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K ++K ++ DK+++E +QK++E K K + K+ ++ + E + Q
Sbjct: 229 ADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQ 287
Score = 30.7 bits (69), Expect = 0.56
Identities = 15/89 (16%), Positives = 44/89 (49%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
KK E ++ E+ +++ + KE + ++ + +Q KEE K+ + K ++K +
Sbjct: 180 KKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFA 239
Query: 92 QKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
Q ++++ + ++++++ K +
Sbjct: 240 QDNADKQRDEVRQKQQEAKNLPKPADTSS 268
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 41.5 bits (97), Expect = 2e-04
Identities = 13/54 (24%), Positives = 25/54 (46%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
+ K+K ++K +E K + + E E K++EEK + KK++
Sbjct: 794 AARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKK 847
Score = 41.1 bits (96), Expect = 3e-04
Identities = 18/59 (30%), Positives = 27/59 (45%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
K KQK KE+ K+ K E + E ++ KQ++E+ D +KKK K
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYK 850
Score = 39.5 bits (92), Expect = 9e-04
Identities = 19/68 (27%), Positives = 25/68 (36%), Gaps = 9/68 (13%)
Query: 40 KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKK 99
K KQK KEK +E K + + + E K++EEK
Sbjct: 792 KAAARAKQKPKEKGPNDKEIK---------IESPSVETEGERCTIKQREEKGIDAPAILN 842
Query: 100 QKKKKEEK 107
KKKK K
Sbjct: 843 VKKKKPYK 850
Score = 35.3 bits (81), Expect = 0.018
Identities = 14/54 (25%), Positives = 24/54 (44%), Gaps = 2/54 (3%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
K+K KE+ K+ K E + E ++ +++EEK KK+K
Sbjct: 798 KQKPKEKGPNDKEIKIE--SPSVETEGERCTIKQREEKGIDAPAILNVKKKKPY 849
Score = 34.5 bits (79), Expect = 0.033
Identities = 20/103 (19%), Positives = 41/103 (39%), Gaps = 7/103 (6%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
+++E + + EE K K K + ++K +EK +E K + +
Sbjct: 761 EEEEFEYPRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVET 820
Query: 92 QKKKEEKKQKKKKEE-------KKNKKAEEQNRQQGLYPNQKR 127
+ ++ KQ+++K K KK + + + P Q R
Sbjct: 821 EGERCTIKQREEKGIDAPAILNVKKKKPYKVDNMTRILPQQSR 863
Score = 33.8 bits (77), Expect = 0.060
Identities = 12/63 (19%), Positives = 28/63 (44%), Gaps = 2/63 (3%)
Query: 11 ISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
+S I + K++ K+K +E K + + + ++ +++EEK K
Sbjct: 787 LSTTIKAAARAKQKP--KEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVK 844
Query: 71 KEE 73
K++
Sbjct: 845 KKK 847
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 41.1 bits (97), Expect = 2e-04
Identities = 17/102 (16%), Positives = 42/102 (41%), Gaps = 1/102 (0%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
R+ K + ++++KK EE K + E ++ + + +K +E + +K E + K ++
Sbjct: 432 RQAKAEIRAIEQEKKKAEEAKARF-EARQARLEREKAAREARHKKAAEARAAKDKDAVAA 490
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K KK + + + ++A + +
Sbjct: 491 ALARVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARA 532
Score = 39.2 bits (92), Expect = 0.001
Identities = 21/117 (17%), Positives = 42/117 (35%), Gaps = 15/117 (12%)
Query: 8 KGAISPRIGKRKKKKKEEENKK----KQKKKEEKKQKKKEEKKQKKKEKKEEK------- 56
K I ++KKK EE K +Q + E +K ++ K+ + + +
Sbjct: 435 KAEIRAI---EQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAA 491
Query: 57 -EEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
K KK + + + +E +K + + + +K+ KKA
Sbjct: 492 LARVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEKQAAAAADPKKA 548
Score = 34.9 bits (81), Expect = 0.027
Identities = 16/84 (19%), Positives = 32/84 (38%), Gaps = 8/84 (9%)
Query: 51 EKKEEKEEKKEKKK--------EEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
K E + ++EKKK E ++ + E EK ++ K+ + + K
Sbjct: 434 AKAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALA 493
Query: 103 KKEEKKNKKAEEQNRQQGLYPNQK 126
+ + KK + + G P+
Sbjct: 494 RVKAKKAAATQPIVIKAGARPDNS 517
Score = 34.9 bits (81), Expect = 0.027
Identities = 19/103 (18%), Positives = 36/103 (34%), Gaps = 6/103 (5%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
R+K +E +KK + + K + + K KK + K +
Sbjct: 464 REKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIA-A 522
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKK-----EEKKNKKAEEQ 115
++ K Q + + +++ KK K KKA +Q
Sbjct: 523 REARKAQARARQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQ 565
Score = 29.1 bits (66), Expect = 2.0
Identities = 22/118 (18%), Positives = 49/118 (41%), Gaps = 9/118 (7%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEK-----KQKKKEKKEEKEEKKEK 62
K A++ I + K KK ++ + ++E +K K KK ++ E +E+
Sbjct: 547 KAAVAAAIARAKAKKAAQQAANAEAEEEVDPKKAAVAAAIARAKAKKAAQQAASAEPEEQ 606
Query: 63 --KKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK--KNKKAEEQN 116
+ + KK + K +K +++ + ++ + +K K +KA +Q
Sbjct: 607 VAEVDPKKAAVAAAIARAKAKKAEQQANAEPEEPVDPRKAAVAAAIARAKARKAAQQQ 664
Score = 28.0 bits (63), Expect = 4.4
Identities = 20/124 (16%), Positives = 42/124 (33%), Gaps = 25/124 (20%)
Query: 17 KRKKKKKEEENK----------KKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEE 66
K+ + + ++K K +K + K + +E +K + +
Sbjct: 474 KKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARAR 533
Query: 67 KKQKKEEEKDKKKEEK--------KQKKKEEKKQKKKEEKKQKKKK-------EEKKNKK 111
+ +K+ K+ K KK ++ + E++ KK K KK
Sbjct: 534 QAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEAEEEVDPKKAAVAAAIARAKAKK 593
Query: 112 AEEQ 115
A +Q
Sbjct: 594 AAQQ 597
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 41.2 bits (96), Expect = 2e-04
Identities = 19/48 (39%), Positives = 34/48 (70%), Gaps = 1/48 (2%)
Query: 29 KKQKKKEEKKQKKKEEKKQKKKEKKE-EKEEKKEKKKEEKKQKKEEEK 75
K KK+EE +K K E +QK +E++E EKE++KE+++E +++ + K
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAK 623
Score = 40.8 bits (95), Expect = 3e-04
Identities = 16/50 (32%), Positives = 35/50 (70%)
Query: 13 PRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEK 62
P + KK+EE +K +++ E+K ++++E +K+K+KE++ E+E + E+
Sbjct: 571 PLASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAER 620
Score = 40.4 bits (94), Expect = 4e-04
Identities = 15/45 (33%), Positives = 34/45 (75%)
Query: 54 EEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEK 98
+++EE EK K E +QK EE++++KE++K++++E +++ ++ K
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAK 623
Score = 38.9 bits (90), Expect = 0.001
Identities = 15/45 (33%), Positives = 35/45 (77%)
Query: 69 QKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
+K+EE +K K E +QK +EE++++K++EK++++++E + + A+
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAK 623
Score = 38.9 bits (90), Expect = 0.001
Identities = 15/39 (38%), Positives = 30/39 (76%)
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
KK+EE +K K E +QK +EE++++K+KE+++ ++ E +
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKERERERERE 617
Score = 38.1 bits (88), Expect = 0.002
Identities = 15/62 (24%), Positives = 42/62 (67%)
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYP 123
K KK+++ EK K++ E+K +++ E++++K++E+++++++E ++ KA + + +
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAKASSSSHESRMSE 635
Query: 124 NQ 125
Q
Sbjct: 636 PQ 637
Score = 38.1 bits (88), Expect = 0.002
Identities = 19/51 (37%), Positives = 35/51 (68%), Gaps = 3/51 (5%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
+ K KK EE +K K+ E +QK +EE ++++KEK++E+E ++E++ E
Sbjct: 572 LASSKLAKKREEAVEKAKR--EAEQKAREE-REREKEKEKEREREREREAE 619
Score = 37.4 bits (86), Expect = 0.004
Identities = 17/50 (34%), Positives = 35/50 (70%), Gaps = 2/50 (4%)
Query: 37 KKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
K KK+EE +K K + E+K +E+++ EK+++KE E+++++E ++ K
Sbjct: 576 KLAKKREEAVEKAKREAEQK--AREEREREKEKEKEREREREREAERAAK 623
Score = 37.0 bits (85), Expect = 0.007
Identities = 14/45 (31%), Positives = 34/45 (75%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQK 93
K KK E+ +K K++ E+K ++E E++K+KE+++++++E + ++
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAER 620
Score = 31.2 bits (70), Expect = 0.40
Identities = 21/91 (23%), Positives = 39/91 (42%), Gaps = 2/91 (2%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
K E K +K KEE K K+Q++K + +E ++ K+ K Q+ + E
Sbjct: 58 KAESTKKPNKKIKEEATSPLKSTKRQREKPASDTEEPERVTAKKSKTQELSRPNSPSEGE 117
Query: 82 KKQKKKEEKKQKK--KEEKKQKKKKEEKKNK 110
+ + + E + EE K ++ N+
Sbjct: 118 GEGEGEGESSDSRSVNEEGSSDPKDIDQDNR 148
Score = 30.0 bits (67), Expect = 1.2
Identities = 15/51 (29%), Positives = 33/51 (64%), Gaps = 5/51 (9%)
Query: 19 KKKKKEEENKKKQKKKEEK-KQKKKEEKKQKKKEKKEEKEEKKEKKKEEKK 68
+K K+E E QK +EE+ ++K+KE+++++++E++ E+ K E +
Sbjct: 586 EKAKREAE----QKAREEREREKEKEKEREREREREAERAAKASSSSHESR 632
Score = 27.7 bits (61), Expect = 6.0
Identities = 15/50 (30%), Positives = 24/50 (48%)
Query: 57 EEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
+ K E K+ K+ KEE K K+Q++K ++ E KK K +
Sbjct: 56 DSKAESTKKPNKKIKEEATSPLKSTKRQREKPASDTEEPERVTAKKSKTQ 105
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 40.8 bits (95), Expect = 3e-04
Identities = 21/99 (21%), Positives = 43/99 (43%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
G KKK K+ KK + + EE + + +++ +K ++K
Sbjct: 66 GAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAAS 125
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+EEK +KK ++++ KK ++ + + E + EE
Sbjct: 126 SDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEE 164
Score = 34.6 bits (79), Expect = 0.027
Identities = 21/93 (22%), Positives = 39/93 (41%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+KK E ++ E KK+ + K+ K+ E E +E + E+
Sbjct: 48 SRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALD 107
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
KE KK ++ +K +++K E+K K+
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKR 140
Score = 28.9 bits (64), Expect = 2.4
Identities = 25/106 (23%), Positives = 48/106 (45%), Gaps = 15/106 (14%)
Query: 17 KRKKKKKEEENKKKQKKKEE----------KKQKKKEEKKQKK-----KEKKEEKEEKKE 61
KR KK E + ++ E K+ KK + ++K + +EEK EKK
Sbjct: 78 KRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKV 137
Query: 62 KKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
+K+ + K+ E+ +D+ E + +E + E + +++ EK
Sbjct: 138 RKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEK 183
Score = 27.7 bits (61), Expect = 5.1
Identities = 14/66 (21%), Positives = 31/66 (46%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
K KK +K + +++K E+K +K+++ K+ E+ +++ E + EE +
Sbjct: 107 DKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESE 166
Query: 76 DKKKEE 81
E
Sbjct: 167 FVTSLE 172
Score = 27.3 bits (60), Expect = 7.6
Identities = 14/63 (22%), Positives = 28/63 (44%)
Query: 12 SPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
++ KK ++K +++K EKK +K+ K ++ +E E + E +
Sbjct: 102 EDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSD 161
Query: 72 EEE 74
EE
Sbjct: 162 VEE 164
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 40.3 bits (94), Expect = 5e-04
Identities = 34/112 (30%), Positives = 73/112 (65%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEK 67
+ S + K KE +N+ +++++EK+Q K+E+KK+K+K K+E K+ K +++ +EK
Sbjct: 81 EKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140
Query: 68 KQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+ KE+EK+K+K+ ++ + +EE+K++++ K + KK KK +++ +
Sbjct: 141 RPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPE 192
Score = 39.9 bits (93), Expect = 6e-04
Identities = 36/112 (32%), Positives = 73/112 (65%)
Query: 4 SIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK 63
S P P + + KEEE +K+Q K+E+KK+K+K +++ K ++ KEE +EK+ K
Sbjct: 85 SKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPK 144
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
++EK+++K+ E+ + +EE+K++++ K + K+ K+K ++K+ + E+Q
Sbjct: 145 EKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQ 196
Score = 39.5 bits (92), Expect = 8e-04
Identities = 43/162 (26%), Positives = 80/162 (49%)
Query: 3 ISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEK 62
A S KR +K + K K +E K + +E++++K++ KEEK++KKEK
Sbjct: 64 AKCAESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEK 123
Query: 63 KKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLY 122
KEE K +K +E+ K+K K+K+KE++K+ ++ ++++KK E+ K+ + +
Sbjct: 124 PKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKP 183
Query: 123 PNQKRTLIGYLRSVDQFANLVLHKTIERIHVGNQYGDIPRGI 164
PN+K+ + V K E + + G
Sbjct: 184 PNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGK 225
Score = 34.9 bits (80), Expect = 0.022
Identities = 30/107 (28%), Positives = 68/107 (63%)
Query: 1 MKISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK 60
A K + K++E+E ++ +++K++KK+K KEE K +K +++ +++
Sbjct: 84 GSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP 143
Query: 61 EKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
++K++EK++K EE +D+++E+K+++ + + + KK +KK KK+E
Sbjct: 144 KEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEP 190
Score = 33.7 bits (77), Expect = 0.057
Identities = 27/105 (25%), Positives = 67/105 (63%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+ K + K + K+ +K K + K K+ + + KEE+KEK++ ++++KK++E
Sbjct: 63 LAKCAESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKE 122
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K K++ + ++ K+E K+++ +EK+++K+K+ ++ + EE+ +++
Sbjct: 123 KPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRE 167
Score = 31.4 bits (71), Expect = 0.32
Identities = 21/113 (18%), Positives = 66/113 (58%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
++ K++ +E+ K+K+KE++K+ ++ ++++K+++ + + + KK +KK ++++
Sbjct: 131 RKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEP 190
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTL 129
++E+++Q +E K K +E +++++E+ + K E +++
Sbjct: 191 PEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSS 243
Score = 31.0 bits (70), Expect = 0.40
Identities = 20/111 (18%), Positives = 62/111 (55%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
++ K++E+ K+K+ ++ ++++K+ ++ + K + ++ +KK K+++ ++E+++
Sbjct: 140 KRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQA 199
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRT 128
+E K K +E +++E+++ K E EE +Q +++ +
Sbjct: 200 AREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSS 250
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 39.8 bits (94), Expect = 7e-04
Identities = 24/91 (26%), Positives = 44/91 (48%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
K E + KKK K K+ EK K + + + +++ E+ + +KEEE +K +
Sbjct: 129 KVNNEIERDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAK 188
Query: 81 EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
E + K+ + +KKK+ K K+ + K
Sbjct: 189 EALEAMKKLEAEKKKQSKNFDPKEGPVQIGK 219
Score = 39.4 bits (93), Expect = 0.001
Identities = 22/92 (23%), Positives = 44/92 (47%), Gaps = 2/92 (2%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
++ KK K ++ +K + E + KEE +K E +K++++E+ K+ E
Sbjct: 136 RDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEE--LEKFEAQKEEEDEKLAKEALEA 193
Query: 83 KQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+K + EKK++ K ++ + K EE
Sbjct: 194 MKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEE 225
Score = 38.7 bits (91), Expect = 0.001
Identities = 28/99 (28%), Positives = 49/99 (49%), Gaps = 1/99 (1%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE-KDKKK 79
KKK KQ +K K + + KE+ E+ E +KE++ E+ ++ E K +
Sbjct: 140 KKKHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEA 199
Query: 80 EEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
E+KKQ K + K+ + K+ K+E K+ E+ R+
Sbjct: 200 EKKKQSKNFDPKEGPVQIGKKIDKEEITPMKEINEEERR 238
Score = 36.0 bits (84), Expect = 0.011
Identities = 24/107 (22%), Positives = 51/107 (47%), Gaps = 3/107 (2%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEK---EEKKEKKKEEKKQKKEEEK 75
K+K + E NK K E ++ ++K K K+ EK K + + +++ E+
Sbjct: 116 KQKVEVEGNKLIIKVNNEIERDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKF 175
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLY 122
+ +KEE+ +K +E + K+ + +KKK+ + + K + +
Sbjct: 176 EAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKID 222
Score = 34.8 bits (81), Expect = 0.028
Identities = 23/93 (24%), Positives = 42/93 (45%), Gaps = 2/93 (2%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
E E +KK K K+ E+ + E ++ KE+ ++ + QK+EE++ KE +
Sbjct: 133 EIERDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALE 192
Query: 84 QKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
KK + +KK++ K KE +
Sbjct: 193 AMKK--LEAEKKKQSKNFDPKEGPVQIGKKIDK 223
Score = 33.3 bits (77), Expect = 0.082
Identities = 22/109 (20%), Positives = 52/109 (47%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
+ KK++ + E K K + ++ K++ K K+ E+ + + + K+
Sbjct: 111 KSLLKKQKVEVEGNKLIIKVNNEIERDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKE 170
Query: 84 QKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLIGY 132
+ +K E ++++++EK K+ E K +AE++ + + P + IG
Sbjct: 171 ELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGK 219
Score = 32.5 bits (75), Expect = 0.15
Identities = 26/107 (24%), Positives = 49/107 (45%), Gaps = 1/107 (0%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEK 67
K KK+K E E K K + ++ +KK K K+ ++ K +
Sbjct: 104 KKNSPLFKSLLKKQKVEVEGNKLIIKVNNEIERDHLKKKHLPKLIKQYEKFGFGILKIDF 163
Query: 68 KQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+ +E+ +K E +K+++ +EK K+ E +K + E+KK K +
Sbjct: 164 EIDDSKEELEKFEAQKEEE-DEKLAKEALEAMKKLEAEKKKQSKNFD 209
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 39.3 bits (92), Expect = 8e-04
Identities = 25/87 (28%), Positives = 48/87 (55%), Gaps = 1/87 (1%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
K E+ +K +E+ K K + K K+ + ++K + K ++ EK + +EKK
Sbjct: 273 KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKK 332
Query: 92 QKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
++KK+E+K KKK+ E+ ++ E+ Q
Sbjct: 333 KEKKKEEK-KKKQIERLEERIEKLEVQ 358
Score = 39.3 bits (92), Expect = 9e-04
Identities = 27/84 (32%), Positives = 48/84 (57%), Gaps = 1/84 (1%)
Query: 26 ENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEE-KKQKKEEEKDKKKEEKKQ 84
+ K K K + K+ KK + + K + + K E+ E+ + KE++K+KKKEEKK+
Sbjct: 283 QEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKK 342
Query: 85 KKKEEKKQKKKEEKKQKKKKEEKK 108
K+ E +++ ++ + Q KEE K
Sbjct: 343 KQIERLEERIEKLEVQATDKEENK 366
Score = 39.3 bits (92), Expect = 0.001
Identities = 25/91 (27%), Positives = 54/91 (59%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
+K +E+ K K + K+ KK E ++K + K ++ E+ D + +EKK++KK+E+
Sbjct: 280 EKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEE 339
Query: 91 KQKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
K+KK+ E+ +++ ++ + +E+N+ L
Sbjct: 340 KKKKQIERLEERIEKLEVQATDKEENKTVAL 370
Score = 37.7 bits (88), Expect = 0.003
Identities = 26/84 (30%), Positives = 47/84 (55%), Gaps = 2/84 (2%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+K K + K+ KK + + K++ K + + + E+ + KE+KK+KK+EEK KK
Sbjct: 284 EKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKK 343
Query: 79 KEEK--KQKKKEEKKQKKKEEKKQ 100
+ E+ ++ +K E + KEE K
Sbjct: 344 QIERLEERIEKLEVQATDKEENKT 367
Score = 30.8 bits (70), Expect = 0.48
Identities = 24/65 (36%), Positives = 39/65 (60%)
Query: 12 SPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
I K+K K + + +K E K+KKKE+KK++KK+K+ E+ E++ +K E + K
Sbjct: 303 FEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDK 362
Query: 72 EEEKD 76
EE K
Sbjct: 363 EENKT 367
Score = 27.3 bits (61), Expect = 6.5
Identities = 16/59 (27%), Positives = 39/59 (66%)
Query: 11 ISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ 69
+ ++ + ++ E+ + + ++KK+EKK+++K++K+ ++ E++ EK E + KEE K
Sbjct: 309 LKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENKT 367
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 38.9 bits (91), Expect = 9e-04
Identities = 24/67 (35%), Positives = 34/67 (50%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
K E K K ++ +KKK + EE E E + E+K++ KEE K KK +E K K
Sbjct: 212 LKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKA 271
Query: 88 EEKKQKK 94
+K K
Sbjct: 272 LKKVVAK 278
Score = 38.5 bits (90), Expect = 0.001
Identities = 22/76 (28%), Positives = 37/76 (48%)
Query: 29 KKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKE 88
K+ K E K K K+ +K++++ ++ E + + E+K K KEE K+KK +
Sbjct: 205 PDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPK 264
Query: 89 EKKQKKKEEKKQKKKK 104
E K K +K K
Sbjct: 265 ESKGVKALKKVVAKGM 280
Score = 35.8 bits (83), Expect = 0.010
Identities = 27/79 (34%), Positives = 38/79 (48%)
Query: 13 PRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
P ++ K + K K K+ KKK+ + ++ E E + EKK K KEE K+KK
Sbjct: 204 PPDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKP 263
Query: 73 EEKDKKKEEKKQKKKEEKK 91
+E K KK K KK
Sbjct: 264 KESKGVKALKKVVAKGMKK 282
Score = 33.5 bits (77), Expect = 0.057
Identities = 24/76 (31%), Positives = 36/76 (47%), Gaps = 7/76 (9%)
Query: 41 KKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE-----EEKDKKKEEKKQKKKEEKKQKKK 95
K+ K E + + KE KK+K+E E + + E+K++ K+E KK+K K
Sbjct: 205 PDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPK 264
Query: 96 EEKKQKKKKEEKKNKK 111
E K K K K K
Sbjct: 265 ESKGVKALK--KVVAK 278
Score = 29.6 bits (67), Expect = 0.96
Identities = 22/67 (32%), Positives = 34/67 (50%), Gaps = 1/67 (1%)
Query: 5 IAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKK 64
I + + + KKKK+E E + + +K++K +E+ +KKK KE K K KK
Sbjct: 217 IPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKK-PKESKGVKALKKV 275
Query: 65 EEKKQKK 71
K KK
Sbjct: 276 VAKGMKK 282
Score = 29.2 bits (66), Expect = 1.5
Identities = 23/81 (28%), Positives = 41/81 (50%), Gaps = 2/81 (2%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK-KEEKKQKKEE 73
+ K+ + + K +K K+ ++KK++ +E E E + EKK K +++ KK++
Sbjct: 203 LPPDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKK 262
Query: 74 EKDKKKEEKKQKKKEEKKQKK 94
K+ K K KK K KK
Sbjct: 263 PKESKG-VKALKKVVAKGMKK 282
Score = 29.2 bits (66), Expect = 1.5
Identities = 20/72 (27%), Positives = 38/72 (52%), Gaps = 6/72 (8%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
++ K E K K K+ +KK+++ EE + E + +K++K +E+ +KKK
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRA------EKKRKSKEEIKKKK 262
Query: 104 KEEKKNKKAEEQ 115
+E K KA ++
Sbjct: 263 PKESKGVKALKK 274
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 39.3 bits (92), Expect = 0.001
Identities = 28/90 (31%), Positives = 42/90 (46%), Gaps = 4/90 (4%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ +E++N K++ K E + ++ EE+ Q E K E E K K K K KD
Sbjct: 837 ELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPK--KLNKLIIAKD-- 892
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
K +E KQ +K KK+KK E+
Sbjct: 893 VLIKLVISSDEIKQDEKTTKKKKKDLEKTD 922
Score = 34.3 bits (79), Expect = 0.042
Identities = 22/130 (16%), Positives = 53/130 (40%), Gaps = 9/130 (6%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEE---KKEKKKEEKKQKKEEEKDKKKE 80
E+ + + ++ Q+ ++ E EE K + + E+ + EE + E
Sbjct: 782 EKILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINE 841
Query: 81 EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLIGYLRSVDQFA 140
E+K K+E K + + ++ +++ + +NK + + N + L + + D
Sbjct: 842 EQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVE------FKNDPKKLNKLIIAKDVLI 895
Query: 141 NLVLHKTIER 150
LV+ +
Sbjct: 896 KLVISSDEIK 905
Score = 33.1 bits (76), Expect = 0.11
Identities = 23/99 (23%), Positives = 43/99 (43%), Gaps = 4/99 (4%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE-KKKEEKKQKKEEEKDKKKE 80
+ EEE K + + E+ + EE + +E+K K+E K + + Q+ EEE E
Sbjct: 813 ESEEERLKALRIQREEMLMRPEELE---LINEEQKNLKQEIKLELSEIQEAEEEIQNINE 869
Query: 81 EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K + + + KK + K K ++E + +
Sbjct: 870 NKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDE 908
Score = 28.9 bits (65), Expect = 2.6
Identities = 19/72 (26%), Positives = 33/72 (45%), Gaps = 3/72 (4%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
++ EEE + + K E + K + KK K ++ K +E KQ ++ K KKK
Sbjct: 857 IQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKK 916
Query: 80 EEKKQKKKEEKK 91
+K +E+
Sbjct: 917 ---DLEKTDEEA 925
Score = 27.3 bits (61), Expect = 7.6
Identities = 14/59 (23%), Positives = 24/59 (40%)
Query: 11 ISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ 69
I + + + + + KK K K + + K++EK KK+KK EK
Sbjct: 864 IQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKKDLEKTD 922
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 39.3 bits (92), Expect = 0.001
Identities = 20/83 (24%), Positives = 36/83 (43%), Gaps = 5/83 (6%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKE-----KKEEKEEKKEKKKEEKKQKK 71
+ K K K+ +K K+++ K+ E + E+K + K KK+ +K KK
Sbjct: 730 APRNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKK 789
Query: 72 EEEKDKKKEEKKQKKKEEKKQKK 94
K +K + K+ KK+
Sbjct: 790 PSAKTQKIAAATKAKRAAKKKVA 812
Score = 39.0 bits (91), Expect = 0.001
Identities = 24/85 (28%), Positives = 39/85 (45%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+R + + ++K KK + K+ K + KK + + + EKK + K KK+ K
Sbjct: 728 ERAPRNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKA 787
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQK 101
KK K QK K K+ +KK
Sbjct: 788 KKPSAKTQKIAAATKAKRAAKKKVA 812
Score = 39.0 bits (91), Expect = 0.001
Identities = 19/80 (23%), Positives = 36/80 (45%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
N K +++ KK ++ ++++ K+ E + EKK K + K ++ KK
Sbjct: 733 NVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSA 792
Query: 87 KEEKKQKKKEEKKQKKKKEE 106
K +K + K+ KKK
Sbjct: 793 KTQKIAAATKAKRAAKKKVA 812
Score = 38.6 bits (90), Expect = 0.002
Identities = 22/76 (28%), Positives = 36/76 (47%)
Query: 33 KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQ 92
K +K KK + K+ K ++ K+ E + +KK + K KK+ +K KK K Q
Sbjct: 736 KTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQ 795
Query: 93 KKKEEKKQKKKKEEKK 108
K K K+ ++K
Sbjct: 796 KIAAATKAKRAAKKKV 811
Score = 36.6 bits (85), Expect = 0.007
Identities = 19/68 (27%), Positives = 28/68 (41%)
Query: 6 APKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
R KK E ++ + +KK + K KK+ +K KK K +K K K
Sbjct: 746 DAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKR 805
Query: 66 EKKQKKEE 73
K+K E
Sbjct: 806 AAKKKVAE 813
Score = 36.6 bits (85), Expect = 0.007
Identities = 25/87 (28%), Positives = 39/87 (44%), Gaps = 11/87 (12%)
Query: 37 KKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE-----EKDKKKEEKKQKKKEEKK 91
++ + K ++K KK + +K K+++ K+ E +KK + K KK +K
Sbjct: 728 ERAPRNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDA-RK 786
Query: 92 QKKKEEKKQK-----KKKEEKKNKKAE 113
KK K QK K K K K AE
Sbjct: 787 AKKPSAKTQKIAAATKAKRAAKKKVAE 813
Score = 36.3 bits (84), Expect = 0.011
Identities = 18/88 (20%), Positives = 39/88 (44%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
I + + + +++ KK + +K + ++ KK E + +KK + K K++
Sbjct: 725 ISSERAPRNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDA 784
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
+ KK K +K + K+ KK+
Sbjct: 785 RKAKKPSAKTQKIAAATKAKRAAKKKVA 812
Score = 31.3 bits (71), Expect = 0.46
Identities = 15/73 (20%), Positives = 33/73 (45%), Gaps = 2/73 (2%)
Query: 58 EKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK--QKKKEEKKQKKKKEEKKNKKAEEQ 115
K +++ KK ++ K+++ K+ E + +K+ K + KK+ +K KK +
Sbjct: 734 VGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAK 793
Query: 116 NRQQGLYPNQKRT 128
++ KR
Sbjct: 794 TQKIAAATKAKRA 806
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 39.3 bits (92), Expect = 0.001
Identities = 18/95 (18%), Positives = 56/95 (58%), Gaps = 3/95 (3%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE--- 72
+R+ + ++ ++ +EE ++ +++ ++Q + ++ ++ E EK + + +Q++
Sbjct: 618 NERRDTRDNRTRREGRENREENRRNRRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRR 677
Query: 73 EEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
E + ++ +EK+Q ++E K +E+ Q+ ++EE+
Sbjct: 678 ERQRRRNDEKRQAQQEAKALNVEEQSVQETEQEER 712
Score = 38.9 bits (91), Expect = 0.001
Identities = 17/100 (17%), Positives = 58/100 (58%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
+ K +E+ K + K E++Q +++ ++ ++++ E ++ + + + E ++ +EE + ++
Sbjct: 585 ETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRNRR 644
Query: 80 EEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+ ++Q + + Q+ + +K + + E+++ + E Q R+
Sbjct: 645 QAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRN 684
Score = 35.4 bits (82), Expect = 0.017
Identities = 23/111 (20%), Positives = 63/111 (56%), Gaps = 11/111 (9%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE--- 73
+ K +++++ K +Q + ++ +++ + ++E +E +EE + +++ ++Q E
Sbjct: 597 EAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRNRRQAQQQTAETRES 656
Query: 74 ------EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
EK + ++E++Q + E+++++ +EK+Q + +E K EEQ+ Q
Sbjct: 657 QQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQ--QEAKALNVEEQSVQ 705
Score = 31.2 bits (71), Expect = 0.46
Identities = 14/70 (20%), Positives = 44/70 (62%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
E +K + + E+++ ++E ++++ EK++ ++E K EE+ ++ E++++ ++ +
Sbjct: 659 AEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSVQETEQEERVQQVQP 718
Query: 84 QKKKEEKKQK 93
++K+ + QK
Sbjct: 719 RRKQRQLNQK 728
Score = 28.9 bits (65), Expect = 2.7
Identities = 11/78 (14%), Positives = 41/78 (52%), Gaps = 1/78 (1%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
E+ + +++ + E K E++++ +K ++ +D+ + + + ++ ++ E + ++
Sbjct: 583 GEETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENRE-ENRR 641
Query: 103 KKEEKKNKKAEEQNRQQG 120
+ + + + AE + QQ
Sbjct: 642 NRRQAQQQTAETRESQQA 659
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 38.4 bits (90), Expect = 0.002
Identities = 21/51 (41%), Positives = 38/51 (74%)
Query: 61 EKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
++ KEE ++ EE K+K + K+K++E+K QK+K++KK+KKK +++K K
Sbjct: 362 DELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKG 412
Score = 36.1 bits (84), Expect = 0.008
Identities = 23/50 (46%), Positives = 37/50 (74%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEK 98
K+E + EE KEK + K+K+EE+K +K+++KK++KK+ KK+KKK K
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 35.7 bits (83), Expect = 0.011
Identities = 23/53 (43%), Positives = 39/53 (73%)
Query: 54 EEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
+E +E+ K+ EE K+K + KK+EEKK +K+++KK++KK+ KK+KKK +
Sbjct: 362 DELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 34.9 bits (81), Expect = 0.025
Identities = 18/52 (34%), Positives = 38/52 (73%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
+E ++ K+ EE K+K + K+K++EKK +K +KK+K+K++ K++K++ +
Sbjct: 362 DELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 34.2 bits (79), Expect = 0.043
Identities = 20/50 (40%), Positives = 37/50 (74%)
Query: 42 KEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
KEE ++ +E KE+ + +KK+EEKK +K ++K K+K++ K++KK+ +K
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 32.6 bits (75), Expect = 0.12
Identities = 19/50 (38%), Positives = 38/50 (76%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
KEE ++ +E K++ K K+++EEKK +K+++KK++K++ K +KK+ +K
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 28.0 bits (63), Expect = 4.3
Identities = 21/48 (43%), Positives = 35/48 (72%), Gaps = 1/48 (2%)
Query: 73 EEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
EE +K+ EE K+K + K KK+EEKK +K+K++KK KK ++ +++G
Sbjct: 366 EELNKRIEEIKEKYPKPPK-KKREEKKPQKRKKKKKRKKKGKKRKKKG 412
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 38.2 bits (89), Expect = 0.002
Identities = 24/64 (37%), Positives = 45/64 (70%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
K+ +KE +E++E++KEEKK+++E+ DK++E ++++KEEKK+K K+ K+ + E
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLN 88
Query: 109 NKKA 112
K
Sbjct: 89 KTKP 92
Score = 37.1 bits (86), Expect = 0.004
Identities = 25/56 (44%), Positives = 41/56 (73%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
EK+ EKE E+++EEK++KKEEE+ +E++ ++EEK++KKK+ KK K+ E
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTE 83
Score = 37.1 bits (86), Expect = 0.005
Identities = 24/63 (38%), Positives = 45/63 (71%)
Query: 41 KKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
K+ EK+ +E++EEKEEKKE++++ +++E +++++KEEKK+K K+ K+ + E
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLN 88
Query: 101 KKK 103
K K
Sbjct: 89 KTK 91
Score = 36.7 bits (85), Expect = 0.006
Identities = 20/63 (31%), Positives = 44/63 (69%)
Query: 42 KEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQK 101
KE +K+ E++EE++E+K++++E+ K+EE +++++E+K+KK ++ K+ E +
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLN 88
Query: 102 KKK 104
K K
Sbjct: 89 KTK 91
Score = 36.3 bits (84), Expect = 0.008
Identities = 22/61 (36%), Positives = 43/61 (70%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQK 93
KE +K+ EE++++K+EKKEE+E+ +K++E +++++EEK KK ++ K+ E +
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLN 88
Query: 94 K 94
K
Sbjct: 89 K 89
Score = 34.0 bits (78), Expect = 0.054
Identities = 20/62 (32%), Positives = 44/62 (70%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
E K+ ++EE+++++K+E+++K +K+EE +E++EK++++KK KK +E + E +
Sbjct: 30 EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNK 89
Query: 85 KK 86
K
Sbjct: 90 TK 91
Score = 33.2 bits (76), Expect = 0.085
Identities = 19/64 (29%), Positives = 46/64 (71%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
E+E +K+ +EE+++K++++++++K KEE+ +++E+K+E+KK+ K+ ++ + E
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELL 87
Query: 84 QKKK 87
K K
Sbjct: 88 NKTK 91
Score = 30.5 bits (69), Expect = 0.61
Identities = 20/48 (41%), Positives = 32/48 (66%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKK 64
+ K++KKEEE K K++E ++++KEEKK+K K+ KE E + K
Sbjct: 42 EEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNK 89
Score = 27.4 bits (61), Expect = 7.1
Identities = 21/63 (33%), Positives = 38/63 (60%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+K++K+EE +K K+EE +++++E+K+KK +K +E + E + K KD
Sbjct: 43 EKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTKPIWTRNPKDVT 102
Query: 79 KEE 81
KEE
Sbjct: 103 KEE 105
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 38.5 bits (90), Expect = 0.002
Identities = 20/102 (19%), Positives = 57/102 (55%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ ++ +KE E K + ++ ++ ++ +E+ + KE+ EE E + +E ++ + E ++
Sbjct: 261 ELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEE 320
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
++ ++ K+K E +++ EE++ ++ E+ + EE +
Sbjct: 321 LEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEE 362
Score = 38.5 bits (90), Expect = 0.002
Identities = 18/104 (17%), Positives = 51/104 (49%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+++ E ++ ++ + E ++ ++ + + ++ E+ ++ K+K E +++ EE++
Sbjct: 285 ELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERET 344
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
EE +Q E ++ K++ E+K EE + + L
Sbjct: 345 LLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAEL 388
Score = 38.2 bits (89), Expect = 0.002
Identities = 21/97 (21%), Positives = 56/97 (57%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ +++ EE +++ + +E+ ++ + E ++ +E + E +E ++ ++ K++ E K+
Sbjct: 278 ELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKE 337
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+ E+++ EE +Q E ++ K++ EEK + EE
Sbjct: 338 ELEERETLLEELEQLLAELEEAKEELEEKLSALLEEL 374
Score = 36.6 bits (85), Expect = 0.009
Identities = 18/103 (17%), Positives = 53/103 (51%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ K++ +E E + ++ ++ + + E+ +++ E+ +EK E +++ EE++ EE +
Sbjct: 292 ELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQ 351
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
E ++ K++ E+K E+ ++ + ++ E +
Sbjct: 352 LLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAE 394
Score = 35.8 bits (83), Expect = 0.012
Identities = 24/109 (22%), Positives = 58/109 (53%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
++ + EE ++ +++ E+ +K+ EE K + +E +EE EE +E+ E K++ +E E +
Sbjct: 245 EEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEI 304
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQK 126
++ ++ E + ++ +E ++ K+K E ++ EE+
Sbjct: 305 SLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLL 353
Score = 35.1 bits (81), Expect = 0.024
Identities = 22/112 (19%), Positives = 61/112 (54%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
++ ++ E+ ++ K + E+ +++ EE +++ E KEE EE + + +++ +E E + +
Sbjct: 260 EELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELE 319
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLI 130
+ E++ ++ +EK + KEE ++++ E+ + E + + L+
Sbjct: 320 ELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALL 371
Score = 34.7 bits (80), Expect = 0.032
Identities = 19/104 (18%), Positives = 54/104 (51%), Gaps = 1/104 (0%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
G+ ++ E + + ++ E++ ++ +EK + KE+ EE+E E+ ++ + +E ++
Sbjct: 302 GEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKE 361
Query: 76 DKKKEEKKQKK-KEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+ +++ + EE + +EE + + + + + EE R+
Sbjct: 362 ELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKRE 405
Score = 33.9 bits (78), Expect = 0.059
Identities = 19/101 (18%), Positives = 60/101 (59%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+++ ++ EEE + +++ EE +++ +E +K+ ++ K E +E ++E ++ +++ + +E+
Sbjct: 238 RKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEI 297
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
++ E + +E ++ + E ++ +++ EE K K +
Sbjct: 298 EELEGEISLLRERLEELENELEELEERLEELKEKIEALKEE 338
Score = 33.9 bits (78), Expect = 0.065
Identities = 15/101 (14%), Positives = 56/101 (55%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K +K++ E + ++ ++ E + +E +KQ +K +++ ++ ++ ++ + + ++ E
Sbjct: 171 KERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALL 230
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
K ++ +K+ EE +++ +++ ++ +E+ + +E
Sbjct: 231 LAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEE 271
Score = 33.5 bits (77), Expect = 0.079
Identities = 23/107 (21%), Positives = 57/107 (53%), Gaps = 4/107 (3%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ K + EE +++ ++ +E+ + KEE ++ + E +E +E + E ++ ++ E+ K+
Sbjct: 271 ELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKE 330
Query: 79 KEEKKQKKKEEKKQKKKE----EKKQKKKKEEKKNKKAEEQNRQQGL 121
K E +++ EE++ +E + ++ KEE + K + + L
Sbjct: 331 KIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEEL 377
Score = 33.5 bits (77), Expect = 0.085
Identities = 15/103 (14%), Positives = 55/103 (53%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ ++ ++ E K++ + +++ +++E ++ ++ E EE KE+ +E+ EE ++
Sbjct: 317 ELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEE 376
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+ +++ + E + + + ++ K+E + ++ E+ ++
Sbjct: 377 LFEALREELAELEAELAEIRNELEELKREIESLEERLERLSER 419
Score = 32.8 bits (75), Expect = 0.13
Identities = 22/111 (19%), Positives = 57/111 (51%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+R + EE K+ +K + + ++ ++ ++ + + + E + K+ K+ ++ EE+
Sbjct: 189 ERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEEL 248
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKR 127
+ EE+ ++ +EE ++ +KE ++ K + EE + + E Q L +
Sbjct: 249 SRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEE 299
Score = 32.4 bits (74), Expect = 0.16
Identities = 16/96 (16%), Positives = 57/96 (59%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K K+ + ++ +++ + +++ EE +++ +E ++E EE K + +E +++ +E +++
Sbjct: 232 AKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELL 291
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+ +++ ++ E + +E ++ + + E+ ++ EE
Sbjct: 292 ELKEEIEELEGEISLLRERLEELENELEELEERLEE 327
Score = 32.4 bits (74), Expect = 0.18
Identities = 20/111 (18%), Positives = 66/111 (59%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEK 67
+ + + + ++ E +++ ++ E++ ++ EEK + +E+ EE E++ E+ KEE
Sbjct: 806 ERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEEL 865
Query: 68 KQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
++ + E+++ + E K+ ++++E+ +++ E + + + +++ +K E+ +
Sbjct: 866 EELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEE 916
Score = 32.4 bits (74), Expect = 0.19
Identities = 25/113 (22%), Positives = 65/113 (57%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
+++ EE +++ ++ EEK + +EE ++ +KE +E KEE +E + E+++ + E ++ ++++
Sbjct: 827 EQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEK 886
Query: 81 EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLIGYL 133
E+ +++ E + + E K++ +K E+ + + R + P + L
Sbjct: 887 EELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEY 939
Score = 32.0 bits (73), Expect = 0.27
Identities = 20/104 (19%), Positives = 55/104 (52%), Gaps = 1/104 (0%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
+ ++ K++ E K++ ++ E ++ E+ + +E KEE EEK EE ++ E +
Sbjct: 323 ERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALR 382
Query: 76 DK-KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
++ + E + + + ++ K E + +++ E+ +++ E+ +
Sbjct: 383 EELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEE 426
Score = 31.6 bits (72), Expect = 0.33
Identities = 24/122 (19%), Positives = 61/122 (50%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ ++ E ++ +++ EEK EE ++ + +EE E + + E + + +E +++
Sbjct: 346 LEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKRE 405
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLIGYLRSV 136
+ E++ ++ E+ + KEE K+ + + E+ + EE N + Q L L+ +
Sbjct: 406 IESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKEL 465
Query: 137 DQ 138
++
Sbjct: 466 ER 467
Score = 31.2 bits (71), Expect = 0.44
Identities = 17/100 (17%), Positives = 58/100 (58%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
G K K+++EE ++K ++ EE ++ ++ ++ +K+ ++ + + ++ ++ ++ + + E
Sbjct: 165 AGVSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRE 224
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+ K K+ ++ ++ +EE + +++ E+ ++ EE
Sbjct: 225 LELALLLAKLKELRKELEELEEELSRLEEELEELQEELEE 264
Score = 30.5 bits (69), Expect = 0.73
Identities = 21/113 (18%), Positives = 56/113 (49%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ K++ EE+ ++ EE + +EE + + E E + E +E K+E + ++ E+
Sbjct: 357 EEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERL 416
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTL 129
++ E +++ +E + + +E + + ++ E+ + E+ + +R L
Sbjct: 417 SERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELEREL 469
Score = 30.5 bits (69), Expect = 0.75
Identities = 24/103 (23%), Positives = 68/103 (66%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ ++ ++E E +++ E++ + E+++++ +++ EE EE+ E+ +E+ + +EE ++
Sbjct: 794 ELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEE 853
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+KE ++ K++ E+ + +KEE + + K+ E++ ++ EE+ R+
Sbjct: 854 LEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELREL 896
Score = 30.5 bits (69), Expect = 0.86
Identities = 16/103 (15%), Positives = 67/103 (65%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
+ ++ ++E E +++ ++ +++ ++ EE+ + +E + +E+ E+ +E+++ +EE +
Sbjct: 737 SRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELE 796
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+ ++E ++ +++ + +++ E +Q++++ E++ ++ EE+ +
Sbjct: 797 ELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEE 839
Score = 30.1 bits (68), Expect = 0.89
Identities = 18/99 (18%), Positives = 57/99 (57%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
R+ + K K+ +KE ++ +++ + +++ E+ +E+ E+ EK+ EE K + EE ++
Sbjct: 222 LRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELRE 281
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+ +E +++ + +++ ++ E + ++ ++ + E+
Sbjct: 282 ELEELQEELLELKEEIEELEGEISLLRERLEELENELEE 320
Score = 29.7 bits (67), Expect = 1.2
Identities = 19/100 (19%), Positives = 57/100 (57%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+++ E+ + + ++ EE+ ++ +EE ++ ++ +E +EE + ++ K K+E E+ +
Sbjct: 726 AALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELE 785
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
+K + Q++ EE +++ +E +++ E + + + R
Sbjct: 786 EKRQALQEELEELEEELEEAERRLDALERELESLEQRRER 825
Score = 29.7 bits (67), Expect = 1.3
Identities = 21/104 (20%), Positives = 69/104 (66%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
+ ++E E+ ++++++ E++ ++ EE+ ++ +EK +E EE+ E+ ++E ++ KEE +
Sbjct: 807 RRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELE 866
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+ + E+++ + + ++ +++KEE +++ ++ E + + +E+ +
Sbjct: 867 ELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKL 910
Score = 29.7 bits (67), Expect = 1.4
Identities = 24/140 (17%), Positives = 63/140 (45%), Gaps = 1/140 (0%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKK-KEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
+I K++ +E E ++ ++ + ++ KEE ++K EE EE E +EE + +
Sbjct: 331 KIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEA 390
Query: 73 EEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLIGY 132
E + + E ++ K++ E +++ E ++ + +++ K+ E + + + +
Sbjct: 391 ELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEE 450
Query: 133 LRSVDQFANLVLHKTIERIH 152
L + L + +
Sbjct: 451 LEEQLEELRDRLKELERELA 470
Score = 28.5 bits (64), Expect = 2.7
Identities = 20/101 (19%), Positives = 65/101 (64%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+++ ++ EE + ++ K +++ EE ++K++ +EE EE +E+ +E +++ E++
Sbjct: 757 QERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALEREL 816
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+ E+++++ E++ ++ +EE ++ ++K ++ ++ EE ++
Sbjct: 817 ESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKE 857
Score = 28.1 bits (63), Expect = 4.1
Identities = 23/114 (20%), Positives = 60/114 (52%), Gaps = 8/114 (7%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKK-----KEKKEEKEEKKEKKKEEKKQKK 71
+RK ++ EE ++ + EE +++ ++ ++Q + +E K E E + K ++
Sbjct: 178 ERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKEL 237
Query: 72 EEEKDKKKEEKKQKKKE---EKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLY 122
+E ++ +EE + ++E +++ ++ EK+ ++ K E + + E + Q+ L
Sbjct: 238 RKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELL 291
Score = 27.8 bits (62), Expect = 4.9
Identities = 20/94 (21%), Positives = 57/94 (60%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
K+E +++ ++ + + ++ EE+ ++ +E+ EE +E+ E+ +EE + +E K+E
Sbjct: 722 KRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEI 781
Query: 82 KKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
++ ++K + Q++ EE +++ ++ E++ E +
Sbjct: 782 EELEEKRQALQEELEELEEELEEAERRLDALERE 815
Score = 27.8 bits (62), Expect = 5.1
Identities = 21/88 (23%), Positives = 56/88 (63%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
I + ++K E E + ++ +KE ++ K++ E+ + +KE+ E++ ++ E++KEE +++ E
Sbjct: 836 EIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRE 895
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQK 101
+ + E K++ +K ++ ++ E K ++
Sbjct: 896 LESELAELKEEIEKLRERLEELEAKLER 923
Score = 27.8 bits (62), Expect = 5.6
Identities = 18/93 (19%), Positives = 55/93 (59%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
+K++ E+ K+ ++++E+ +++ E + + E KEE E+ +E+ +E + + + E
Sbjct: 867 ELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEV 926
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
+ + E++ +++ E + + E++ ++ +EE +
Sbjct: 927 ELPELEEELEEEYEDTLETELEREIERLEEEIE 959
Score = 27.8 bits (62), Expect = 6.2
Identities = 23/113 (20%), Positives = 65/113 (57%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ +++ EE +++ + EE K KEE ++ +++++ +EE +E ++E ++ ++ +
Sbjct: 753 LEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDAL 812
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTL 129
+++ E ++++E +Q+ +E +++ ++ EEK ++ EE + K L
Sbjct: 813 ERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEEL 865
Score = 27.4 bits (61), Expect = 7.9
Identities = 15/93 (16%), Positives = 54/93 (58%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
E+ +++ K + + + ++ ++ +++ +E + + +E K+E ++E E+ + + E+ +
Sbjct: 684 EKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELE 743
Query: 85 KKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
++ EE +++ +E +++ ++ EE+ E +
Sbjct: 744 EELEELEEELEELQERLEELEEELESLEEALAK 776
Score = 27.0 bits (60), Expect = 9.4
Identities = 21/99 (21%), Positives = 56/99 (56%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+R ++ K E + + K ++ ++E ++ ++E +EE +E ++E ++ +KE E+
Sbjct: 213 ERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEEL 272
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
K + E+ +++ EE +++ E K++ ++ E + + E
Sbjct: 273 KSELEELREELEELQEELLELKEEIEELEGEISLLRERL 311
Score = 27.0 bits (60), Expect = 9.9
Identities = 13/98 (13%), Positives = 59/98 (60%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
++ + + +++ +E +++ ++ +++ ++ E++ E E+ K +E+ ++ EE++
Sbjct: 731 ELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQA 790
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+EE ++ ++E ++ +++ + +++ + ++ ++ EQ
Sbjct: 791 LQEELEELEEELEEAERRLDALERELESLEQRRERLEQ 828
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 37.4 bits (87), Expect = 0.002
Identities = 11/39 (28%), Positives = 23/39 (58%)
Query: 55 EKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQK 93
E+++ K+K++ +++KE K+ +K E KK+K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 35.8 bits (83), Expect = 0.007
Identities = 12/38 (31%), Positives = 22/38 (57%)
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQK 101
+++ KQK+E +++KE K+ +K E KK+K
Sbjct: 159 RKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 34.3 bits (79), Expect = 0.025
Identities = 12/38 (31%), Positives = 20/38 (52%), Gaps = 1/38 (2%)
Query: 41 KKEEKKQKKKEKKEEKEEKKE-KKKEEKKQKKEEEKDK 77
+K+ K++++ +E KE KK QK E +K K
Sbjct: 159 RKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 33.5 bits (77), Expect = 0.045
Identities = 12/40 (30%), Positives = 22/40 (55%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEE 55
+RK K++EE +++K+ KK QK + KK++
Sbjct: 157 AERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 32.7 bits (75), Expect = 0.082
Identities = 13/42 (30%), Positives = 22/42 (52%), Gaps = 3/42 (7%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQK 85
E+K K++++ EE+KE K+ +K E KK+K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAA---AAKKSATPQKVETKKKK 196
Score = 32.7 bits (75), Expect = 0.084
Identities = 11/40 (27%), Positives = 20/40 (50%), Gaps = 2/40 (5%)
Query: 71 KEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
+++ K K++ +++K+ KK Q K E KK K
Sbjct: 159 RKDAKQKEEFAAEERKEALAAAAKKSATPQ--KVETKKKK 196
Score = 32.0 bits (73), Expect = 0.13
Identities = 12/39 (30%), Positives = 21/39 (53%)
Query: 66 EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
E+K K++E+ +E K+ KK ++ + KKKK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 32.0 bits (73), Expect = 0.14
Identities = 12/39 (30%), Positives = 21/39 (53%), Gaps = 2/39 (5%)
Query: 26 ENKKKQKKKEEKKQKKKEEK--KQKKKEKKEEKEEKKEK 62
E K ++K+E +++KE KK ++ E KK+K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 30.0 bits (68), Expect = 0.58
Identities = 10/39 (25%), Positives = 21/39 (53%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
+K K+KEE ++++E +K ++ + KKK+
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 29.7 bits (67), Expect = 0.83
Identities = 14/39 (35%), Positives = 20/39 (51%), Gaps = 3/39 (7%)
Query: 35 EEKKQKKKEE-KKQKKKEKKEEKEEKKEK--KKEEKKQK 70
E K K+KEE +++KE +K K E KK+K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 29.3 bits (66), Expect = 1.2
Identities = 11/39 (28%), Positives = 20/39 (51%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
E K K+KE+ +E K+ KK ++ + KK++
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 28.1 bits (63), Expect = 2.9
Identities = 11/41 (26%), Positives = 21/41 (51%), Gaps = 2/41 (4%)
Query: 73 EEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
E KD K +K++ EE+K+ K+ ++ + KK +
Sbjct: 158 ERKDAK--QKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 37.6 bits (88), Expect = 0.002
Identities = 12/58 (20%), Positives = 34/58 (58%)
Query: 53 KEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
+E K + + ++ E++ +K+E +K+K ++++K + K +KK + + +K + +
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPV 161
Score = 35.3 bits (82), Expect = 0.011
Identities = 15/67 (22%), Positives = 36/67 (53%)
Query: 54 EEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
EE + K + ++ E++ KK E +K++ ++++K + K +K+ K + +K E + ++
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSD 163
Query: 114 EQNRQQG 120
G
Sbjct: 164 ISELTVG 170
Score = 34.9 bits (81), Expect = 0.017
Identities = 15/73 (20%), Positives = 37/73 (50%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
+K+ EE K K + ++ +++ +K+E EK++ + E+K K + + K+K ++ + +
Sbjct: 100 RKQLEEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHT 159
Query: 91 KQKKKEEKKQKKK 103
E +
Sbjct: 160 PVSDISELTVGQA 172
Score = 31.8 bits (73), Expect = 0.17
Identities = 14/73 (19%), Positives = 37/73 (50%), Gaps = 2/73 (2%)
Query: 39 QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEK 98
+K+ EE K K + ++ E+ + +K++ +++K +++K + K +KK + + +K E +
Sbjct: 100 RKQLEEAKAKVQAQRAEQ--QAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQ 157
Query: 99 KQKKKKEEKKNKK 111
+
Sbjct: 158 HTPVSDISELTVG 170
Score = 31.4 bits (72), Expect = 0.22
Identities = 12/68 (17%), Positives = 35/68 (51%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+ + ++ E+ KK++ EK++ + E+K K K +++++ + +K + + + +
Sbjct: 109 KVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELT 168
Query: 78 KKEEKKQK 85
+ K K
Sbjct: 169 VGQAVKVK 176
Score = 31.0 bits (71), Expect = 0.30
Identities = 14/59 (23%), Positives = 27/59 (45%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
R ++ KK++ K+K ++E K + K KK+K + +K E + E +
Sbjct: 114 RAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQA 172
Score = 30.7 bits (70), Expect = 0.39
Identities = 14/77 (18%), Positives = 35/77 (45%)
Query: 36 EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKK 95
K Q ++ E++ KK+E EKE+ ++++ K + +++ + ++ + + +
Sbjct: 108 AKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISEL 167
Query: 96 EEKKQKKKKEEKKNKKA 112
+ K K K A
Sbjct: 168 TVGQAVKVKAGKSAMDA 184
Score = 30.3 bits (69), Expect = 0.47
Identities = 12/69 (17%), Positives = 36/69 (52%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+K+ +E + K + ++ E++ +K++ +++K ++E K + K +K+ K + ++ E
Sbjct: 100 RKQLEEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHT 159
Query: 79 KEEKKQKKK 87
+
Sbjct: 160 PVSDISELT 168
Score = 29.5 bits (67), Expect = 0.87
Identities = 10/63 (15%), Positives = 29/63 (46%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K K + + E + K+++ +K+K +++ K + +K + + +K E + + +
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166
Query: 77 KKK 79
Sbjct: 167 LTV 169
Score = 29.5 bits (67), Expect = 0.94
Identities = 12/61 (19%), Positives = 30/61 (49%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+++ KK+E +K++ + E+K K K +K++K ++ + + + + K
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQAVKV 175
Query: 77 K 77
K
Sbjct: 176 K 176
Score = 29.1 bits (66), Expect = 1.2
Identities = 13/68 (19%), Positives = 34/68 (50%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
K K + + ++Q KK E +K++ ++++K K + +K++ + ++ + + D +
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166
Query: 80 EEKKQKKK 87
Q K
Sbjct: 167 LTVGQAVK 174
Score = 28.7 bits (65), Expect = 1.8
Identities = 14/53 (26%), Positives = 32/53 (60%), Gaps = 3/53 (5%)
Query: 68 KQKKEEEKDK---KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
+++ EE K K ++ E++ KK+E +K+K ++++K K + KK + + +
Sbjct: 100 RKQLEEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152
Score = 28.7 bits (65), Expect = 2.0
Identities = 17/75 (22%), Positives = 35/75 (46%), Gaps = 2/75 (2%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
+K+ EE K K + + ++Q KK E +K++ + E+K K K +K++K + + +
Sbjct: 100 RKQLEEAKAKVQAQRAEQQAKKREAAGEKEKAP--RRERKPKPKAPRKKRKPRAQKPEPQ 157
Query: 81 EKKQKKKEEKKQKKK 95
E +
Sbjct: 158 HTPVSDISELTVGQA 172
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 37.0 bits (86), Expect = 0.003
Identities = 20/90 (22%), Positives = 46/90 (51%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
+ +Q+ K+ Q+ E +E KEE+KE E K+ K + + + +E +++ +
Sbjct: 37 PSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENE 96
Query: 87 KEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
+E+++ + EK+ ++K E K+ +
Sbjct: 97 EEDEESSDENEKETEEKTESNVEKEITNPS 126
Score = 36.6 bits (85), Expect = 0.004
Identities = 19/82 (23%), Positives = 46/82 (56%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
+ E+E KK ++ + ++ KEE+K+ + +E + EK+ EE +++ EEE
Sbjct: 40 PSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEED 99
Query: 76 DKKKEEKKQKKKEEKKQKKKEE 97
++ +E +++ +E+ + ++E
Sbjct: 100 EESSDENEKETEEKTESNVEKE 121
Score = 34.3 bits (79), Expect = 0.026
Identities = 14/80 (17%), Positives = 49/80 (61%)
Query: 12 SPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
S + +++ K+ ++++ + +E K+++K+ + K++K + ++E +E ++E +++ +
Sbjct: 41 SDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDE 100
Query: 72 EEEKDKKKEEKKQKKKEEKK 91
E + +KE +++ + +K
Sbjct: 101 ESSDENEKETEEKTESNVEK 120
Score = 33.2 bits (76), Expect = 0.053
Identities = 19/82 (23%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 26 ENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQK 85
++ ++E KK +++ + +E KEE++E + +E K E+E ++ +EE +++
Sbjct: 41 SDQAAADEQEAKK--SDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEE 98
Query: 86 KKEEKKQKKKEEKKQKKKKEEK 107
+E + +KE +++ + EK
Sbjct: 99 DEESSDENEKETEEKTESNVEK 120
Score = 29.3 bits (66), Expect = 1.3
Identities = 14/62 (22%), Positives = 37/62 (59%)
Query: 58 EKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
+ E++ +K ++++ + EE K+++KE + KE+K +K++E+ ++ EE++
Sbjct: 41 SDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDE 100
Query: 118 QQ 119
+
Sbjct: 101 ES 102
Score = 28.5 bits (64), Expect = 1.9
Identities = 18/89 (20%), Positives = 43/89 (48%), Gaps = 2/89 (2%)
Query: 2 KISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE 61
+ A + + + +E + ++K+ E K+ K + +K+ ++ ++E +EE +E
Sbjct: 42 DQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEE 101
Query: 62 KKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
++ +K+ EEK + EK+ K
Sbjct: 102 --SSDENEKETEEKTESNVEKEITNPSWK 128
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 37.8 bits (88), Expect = 0.003
Identities = 26/144 (18%), Positives = 73/144 (50%), Gaps = 10/144 (6%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K++++ +E + Q+ ++E ++K + + K+ E+ +++++ + E+++ + EE+
Sbjct: 26 KKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEER 85
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ--------QGLYPNQKRT 128
++E++ + EK + + ++++K + + EE +Q GL P Q R
Sbjct: 86 LVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARK 145
Query: 129 LIGYLRSVDQFANLVLHKTIERIH 152
L+ L+ +D + +++I
Sbjct: 146 LL--LKLLDAELEEEKAQRVKKIE 167
Score = 32.4 bits (74), Expect = 0.18
Identities = 21/104 (20%), Positives = 55/104 (52%), Gaps = 4/104 (3%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
R++ ++EEE +++++ + + +K + + + +E+++ ++ + +E +KQ E
Sbjct: 76 REELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRV 135
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQK----KKKEEKKNKKAEEQNR 117
+Q +K K E +++K KK EE+ + +AE + +
Sbjct: 136 AGLTPEQARKLLLKLLDAELEEEKAQRVKKIEEEADLEAERKAQ 179
Score = 27.4 bits (61), Expect = 6.2
Identities = 18/100 (18%), Positives = 47/100 (47%), Gaps = 2/100 (2%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKK--EEKEEKKEKKKEEKKQKK 71
R +R +K+E+ + + +K + Q ++ EK +E + E +++ +
Sbjct: 81 REEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTP 140
Query: 72 EEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
E+ + + + +EEK Q+ K+ +++ + E+K +
Sbjct: 141 EQARKLLLKLLDAELEEEKAQRVKKIEEEADLEAERKAQN 180
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 37.9 bits (88), Expect = 0.003
Identities = 25/101 (24%), Positives = 59/101 (58%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+K+ E++N+ +K +E+++ KKE +++ ++ K+ E+ +K E +KE ++ K +K+ +
Sbjct: 523 LEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVE 582
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
++ K+K+ K K+ + + K +E K K ++ Q
Sbjct: 583 SIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQ 623
Score = 37.5 bits (87), Expect = 0.003
Identities = 26/94 (27%), Positives = 55/94 (58%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
K +K+ ++K E +K +E+++ KKE ++E EE KE+++ +K + ++E ++ K K
Sbjct: 519 KLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALK 578
Query: 83 KQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
K+ + ++ K+K+ K K+ K + K +E
Sbjct: 579 KEVESIIRELKEKKIHKAKEIKSIEDLVKLKETK 612
Score = 34.8 bits (80), Expect = 0.025
Identities = 27/101 (26%), Positives = 56/101 (55%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
I + K E + + ++ +K+ E+K + EK +++EK +K+ E++ ++ +E
Sbjct: 499 IEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKER 558
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+ KK E +++ +E K KKE + ++ +EKK KA+E
Sbjct: 559 ERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEI 599
Score = 32.9 bits (75), Expect = 0.10
Identities = 25/94 (26%), Positives = 51/94 (54%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K ++K K+E ++ ++ KE ++ KK E +K+ ++ K K+E + +E K++K + K+
Sbjct: 539 KEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKE 598
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
K E K KE K++ ++ + K K +
Sbjct: 599 IKSIEDLVKLKETKQKIPQKPTNFQADKIGDKVR 632
Score = 29.0 bits (65), Expect = 2.2
Identities = 19/54 (35%), Positives = 34/54 (62%)
Query: 66 EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
EK E+E ++K E ++ KE++K KK+ E++ ++ KE ++NKK E + Q
Sbjct: 518 EKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQ 571
>gnl|CDD|236267 PRK08451, PRK08451, DNA polymerase III subunits gamma and tau;
Validated.
Length = 535
Score = 37.7 bits (88), Expect = 0.003
Identities = 16/81 (19%), Positives = 38/81 (46%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
+I K K + K+ K +K+K E + E++ +K++++ ++ + K+ + +
Sbjct: 451 KIQKALKSAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKDDEEVQETQLKEATELQ 510
Query: 74 EKDKKKEEKKQKKKEEKKQKK 94
E E+ ++ KE K
Sbjct: 511 EFMINNEDLIEEAKELFGIKS 531
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 37.0 bits (85), Expect = 0.004
Identities = 23/87 (26%), Positives = 56/87 (64%), Gaps = 1/87 (1%)
Query: 29 KKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE-KKKEEKKQKKEEEKDKKKEEKKQKKK 87
KK K++K ++E+ +K ++++EE EE+ E +++EEK ++E+++++++E ++Q +
Sbjct: 111 KKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIE 170
Query: 88 EEKKQKKKEEKKQKKKKEEKKNKKAEE 114
E+ + E +Q + + E + K E+
Sbjct: 171 EQSDDSEHEIIEQDESETESDDDKTEK 197
Score = 32.7 bits (74), Expect = 0.079
Identities = 18/94 (19%), Positives = 62/94 (65%), Gaps = 1/94 (1%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKE-EKKEKKKEEKKQKKEEEKD 76
R + + +++ K+KK +Q++ E+ +Q+++E +E E E++E+K +E++ ++E+E++
Sbjct: 104 RDQLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQERE 163
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
++++ +++ + + + ++++ + + ++K K
Sbjct: 164 REEQTIEEQSDDSEHEIIEQDESETESDDDKTEK 197
Score = 30.0 bits (67), Expect = 0.59
Identities = 16/79 (20%), Positives = 51/79 (64%)
Query: 40 KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKK 99
KK + KK+K ++E+ EK +++EE +++ E E+ ++K ++++ ++E++++++++ +
Sbjct: 111 KKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIE 170
Query: 100 QKKKKEEKKNKKAEEQNRQ 118
++ E + + +E +
Sbjct: 171 EQSDDSEHEIIEQDESETE 189
Score = 29.3 bits (65), Expect = 1.2
Identities = 15/72 (20%), Positives = 48/72 (66%)
Query: 48 KKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
KKK+ ++E+ EK ++E+++ +E + +++EEK +++++++Q+++ E++ +++ +
Sbjct: 116 KKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDD 175
Query: 108 KNKKAEEQNRQQ 119
+ EQ+ +
Sbjct: 176 SEHEIIEQDESE 187
Score = 28.5 bits (63), Expect = 2.3
Identities = 14/78 (17%), Positives = 51/78 (65%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
+K + K++K ++E+ +K +E+++ +E + +++EEK +++++++Q++++EE+ +
Sbjct: 111 KKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIE 170
Query: 111 KAEEQNRQQGLYPNQKRT 128
+ + + + + ++ T
Sbjct: 171 EQSDDSEHEIIEQDESET 188
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 37.4 bits (87), Expect = 0.004
Identities = 13/96 (13%), Positives = 36/96 (37%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ E ++ + +Q+ ++ K + E++++ + + ++K EE + +
Sbjct: 174 AAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANE 233
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
K + E K E + + + AE
Sbjct: 234 SRLKNEIASAEAAAAKAREAAAAAEAAAARARAAEA 269
Score = 35.1 bits (81), Expect = 0.018
Identities = 13/102 (12%), Positives = 37/102 (36%), Gaps = 1/102 (0%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ + E + +Q K + +++K+ Q E ++++ +E + E + K E
Sbjct: 184 QAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIAS- 242
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+ K ++ + + + K + K +
Sbjct: 243 AEAAAAKAREAAAAAEAAAARARAAEAKRTGETYKPTAPEKM 284
Score = 34.7 bits (80), Expect = 0.026
Identities = 15/100 (15%), Positives = 36/100 (36%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
K ++ + + E+ + +Q+ ++ K + ++ KK + + KK
Sbjct: 167 KATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKL 226
Query: 80 EEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
EE + + K + E K +E +A +
Sbjct: 227 EELRANESRLKNEIASAEAAAAKAREAAAAAEAAAARARA 266
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 37.2 bits (86), Expect = 0.004
Identities = 30/111 (27%), Positives = 42/111 (37%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
KK K++ K K+ K K KK KK KK K+ K KK KK K K
Sbjct: 222 AAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKA 281
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKR 127
K K K K + K+K ++ K + ++ +KR
Sbjct: 282 AKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKKR 332
Score = 37.2 bits (86), Expect = 0.004
Identities = 31/111 (27%), Positives = 42/111 (37%)
Query: 2 KISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE 61
K A K + K K+ + K+ KK KK KK K KK K+ K
Sbjct: 218 KAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKA 277
Query: 62 KKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
K K K + K ++K KK + K K K+ + KK KK
Sbjct: 278 AAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKV 328
Score = 35.6 bits (82), Expect = 0.012
Identities = 31/106 (29%), Positives = 42/106 (39%), Gaps = 3/106 (2%)
Query: 6 APKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
A K A +P KK + +K ++K K KK K + K+ KK
Sbjct: 199 AKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTA---VSAKKAAKTAAKAAKKAKKT 255
Query: 66 EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
KK K+ K KK KK K K K + + K K +KK K
Sbjct: 256 AKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGK 301
Score = 34.1 bits (78), Expect = 0.044
Identities = 33/105 (31%), Positives = 45/105 (42%), Gaps = 2/105 (1%)
Query: 6 APKGAISPRIGKRKKKKKEEENKKKQKKKEEK--KQKKKEEKKQKKKEKKEEKEEKKEKK 63
+ K A + +K K + KK KK +K K+ K KK KK K + K
Sbjct: 227 SKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAA 286
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
K K + K ++K KK K K K K+ K +K KK KK
Sbjct: 287 KATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKK 331
Score = 32.9 bits (75), Expect = 0.099
Identities = 28/109 (25%), Positives = 41/109 (37%)
Query: 2 KISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE 61
K A GA + KK + K + KK K K KK +K +K KK
Sbjct: 204 KTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKA 263
Query: 62 KKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
K +K KK + K + K + K + KK+ K+ + K
Sbjct: 264 AKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKAT 312
Score = 32.9 bits (75), Expect = 0.10
Identities = 25/101 (24%), Positives = 38/101 (37%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
K K KK K + KK KK+ + KK K K +K KK +K KK
Sbjct: 204 KTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKA 263
Query: 88 EEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRT 128
+ +K ++ + K K KA + + +K
Sbjct: 264 AKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAA 304
Score = 31.0 bits (70), Expect = 0.36
Identities = 27/106 (25%), Positives = 43/106 (40%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+G KK + K K+ + K KK KK+ + K+ K K K+ +
Sbjct: 194 VGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAK 253
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
K KK KK K +K KK + K K K K + + +++
Sbjct: 254 KTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKA 299
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 37.4 bits (86), Expect = 0.004
Identities = 33/122 (27%), Positives = 59/122 (48%), Gaps = 5/122 (4%)
Query: 4 SIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK 63
S +P+ P + K+ KK K++E K+K+K++++ K+K+ E K +
Sbjct: 184 SKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVA 243
Query: 64 KEEKKQ-----KKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+ ++ + E K + +E KK K ++KKQ+K+KEEKK KK +R
Sbjct: 244 EADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRC 303
Query: 119 QG 120
Sbjct: 304 HH 305
Score = 34.3 bits (78), Expect = 0.044
Identities = 27/123 (21%), Positives = 56/123 (45%), Gaps = 25/123 (20%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEK-------------------------KQKKKEK 52
+K KKKE++ K+K++ K++KK+ + +
Sbjct: 200 KKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAP 259
Query: 53 KEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
E +E K+ + EE K+ + +K K+++EK++KKK++K + ++ +N
Sbjct: 260 DSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQNGAV 319
Query: 113 EEQ 115
EE+
Sbjct: 320 EEE 322
Score = 31.2 bits (70), Expect = 0.42
Identities = 18/51 (35%), Positives = 33/51 (64%)
Query: 39 QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE 89
QK + + K EK + +K+ KK +KK+KKE+EK++ K++KK+ + +
Sbjct: 176 QKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226
Score = 29.6 bits (66), Expect = 1.4
Identities = 19/51 (37%), Positives = 29/51 (56%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQK 101
+K E K +K + +++ K KK+EKK+K+KE K KKKE + K
Sbjct: 176 QKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226
Score = 29.3 bits (65), Expect = 1.5
Identities = 17/49 (34%), Positives = 30/49 (61%)
Query: 59 KKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
+K + E K ++ + +++ K+ KK+EKK+K+KE K KKK+ E
Sbjct: 176 QKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEG 224
Score = 27.7 bits (61), Expect = 4.8
Identities = 18/54 (33%), Positives = 33/54 (61%)
Query: 61 EKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
EK +K + E K +K + +K+ KK KKKE+K+++K++++ K K+ E
Sbjct: 171 EKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEG 224
Score = 27.7 bits (61), Expect = 4.9
Identities = 16/52 (30%), Positives = 32/52 (61%)
Query: 37 KKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKE 88
+K + E K +K E+K +K K+++K++KE+E+DK K+++ + K
Sbjct: 176 QKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKS 227
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 36.5 bits (85), Expect = 0.004
Identities = 12/39 (30%), Positives = 21/39 (53%)
Query: 66 EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
E K K E+ +E + +E++ + E+K+ KKKK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 36.2 bits (84), Expect = 0.005
Identities = 10/37 (27%), Positives = 21/37 (56%)
Query: 65 EEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQK 101
++ K +E ++ + K++E + +KKE KK+K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 35.8 bits (83), Expect = 0.008
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 58 EKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKE 96
E K+ K E+ +E E +E++ + E+K+ KKK+
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 35.4 bits (82), Expect = 0.010
Identities = 10/39 (25%), Positives = 22/39 (56%)
Query: 55 EKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQK 93
E ++ K ++ ++ + K++E + +KKE KK+K
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 35.0 bits (81), Expect = 0.015
Identities = 14/39 (35%), Positives = 21/39 (53%)
Query: 73 EEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
E KD K E+ ++ E KEE+ + +K+E K KK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 34.6 bits (80), Expect = 0.018
Identities = 11/39 (28%), Positives = 21/39 (53%)
Query: 47 QKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQK 85
+ K K E+ +E + +++E + +KKE KK+K
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 34.6 bits (80), Expect = 0.021
Identities = 10/37 (27%), Positives = 20/37 (54%)
Query: 42 KEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K+ K ++ +E + K++E + +K+E K KK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 34.2 bits (79), Expect = 0.027
Identities = 12/37 (32%), Positives = 21/37 (56%)
Query: 50 KEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
K+ K +E E+ + K+EE + +K+E K+KK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 33.5 bits (77), Expect = 0.040
Identities = 9/37 (24%), Positives = 21/37 (56%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
++ K ++ ++ + ++EE E +K+E K+KK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 33.5 bits (77), Expect = 0.041
Identities = 10/37 (27%), Positives = 21/37 (56%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
K+ K ++ ++ + +E+E + +KKE KK+K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 33.5 bits (77), Expect = 0.042
Identities = 12/37 (32%), Positives = 20/37 (54%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEE 66
K K E+ ++ E KE++ + EKKE KK++
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 33.5 bits (77), Expect = 0.052
Identities = 11/37 (29%), Positives = 20/37 (54%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
++ K ++ EE E K+EE + +++E KKK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 33.1 bits (76), Expect = 0.057
Identities = 11/34 (32%), Positives = 19/34 (55%), Gaps = 2/34 (5%)
Query: 17 KRKKK--KKEEENKKKQKKKEEKKQKKKEEKKQK 48
K ++ +E E K++E + +KKE KK+K
Sbjct: 163 KLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 33.1 bits (76), Expect = 0.064
Identities = 15/41 (36%), Positives = 21/41 (51%), Gaps = 2/41 (4%)
Query: 65 EEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKE 105
E K K EE ++ E KEE+ + EKK+ KKK+
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEG--EKKESKKKK 196
Score = 33.1 bits (76), Expect = 0.065
Identities = 13/39 (33%), Positives = 20/39 (51%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
K K EE ++ E K+E+ E E+K+ KKK+
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 32.7 bits (75), Expect = 0.081
Identities = 7/39 (17%), Positives = 20/39 (51%), Gaps = 2/39 (5%)
Query: 72 EEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
++ K ++ ++ + ++++ + + KKE KK K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGE--KKESKKKK 196
Score = 32.3 bits (74), Expect = 0.10
Identities = 8/40 (20%), Positives = 21/40 (52%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEE 55
+ K K EE ++ + ++++ + +KK+ KK++
Sbjct: 157 AEWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 32.3 bits (74), Expect = 0.11
Identities = 10/39 (25%), Positives = 22/39 (56%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE 89
E K+ K ++ +E + +E++ + EKK+ KK++
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 31.9 bits (73), Expect = 0.13
Identities = 6/37 (16%), Positives = 21/37 (56%)
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
K+ K ++ ++ + ++++ + +KK+ +KK+
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 31.9 bits (73), Expect = 0.15
Identities = 9/37 (24%), Positives = 21/37 (56%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEK 59
K+ + ++ +E + +E++ + EKKE K++K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 31.5 bits (72), Expect = 0.17
Identities = 7/39 (17%), Positives = 21/39 (53%)
Query: 54 EEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQ 92
E K+ K ++ ++ + +++ + +KK+ +KK+
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 28.5 bits (64), Expect = 2.1
Identities = 9/38 (23%), Positives = 21/38 (55%), Gaps = 1/38 (2%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEK 62
++ K ++ E+ + K++ + + E+KE KK+K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEE-ESAEGEKKESKKKK 196
Score = 28.5 bits (64), Expect = 2.3
Identities = 12/39 (30%), Positives = 21/39 (53%), Gaps = 1/39 (2%)
Query: 26 ENKKKQKKKEEKKQKKKEEKKQKKKEKKE-EKEEKKEKK 63
E K + +E ++ + K++E E EK+E K+KK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 28.1 bits (63), Expect = 2.5
Identities = 11/39 (28%), Positives = 21/39 (53%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
E K K EE ++ E +E++ + E+K+ KK++
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 28.1 bits (63), Expect = 2.6
Identities = 10/35 (28%), Positives = 16/35 (45%)
Query: 80 EEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
E K K EE ++ E K++E + +K E
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKES 192
Score = 27.7 bits (62), Expect = 3.4
Identities = 10/37 (27%), Positives = 19/37 (51%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEK 56
K K E ++ + K++E + +KKE K++K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|227657 COG5354, COG5354, Uncharacterized protein, contains Trp-Asp (WD)
repeat [General function prediction only].
Length = 561
Score = 37.2 bits (86), Expect = 0.004
Identities = 21/96 (21%), Positives = 37/96 (38%), Gaps = 2/96 (2%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQK--KKEKKEEKEEKKEKKKEEKKQKKE 72
I KK K ++ +K + + KKE E+ E ++ K KK
Sbjct: 462 IESAKKYVKPSRHRFVPFEKAVIMEADSANRSSAPRKKELVEQWPEYSDEDKIRSLLKKL 521
Query: 73 EEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
+ KE + ++ E Q K E +++ E K+
Sbjct: 522 RAIEALKERMRSGEELEVIQVNKIETEEEVLSELKE 557
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 37.3 bits (87), Expect = 0.004
Identities = 27/113 (23%), Positives = 63/113 (55%), Gaps = 6/113 (5%)
Query: 10 AISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ 69
+S ++ + K +++EE + ++K+EE K +EE + + K+ E++ + E +K+
Sbjct: 174 QLSKKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKE 233
Query: 70 KKEEEKDKKKEEKKQKKKEEKKQKKKEE------KKQKKKKEEKKNKKAEEQN 116
+ ++ ++K ++ +++ E +QK K E + Q++ +E K K EE+N
Sbjct: 234 ELRKKYEEKLRQELERQAEAHEQKLKNELALQAIELQREFNKEIKEKVEEERN 286
Score = 36.2 bits (84), Expect = 0.011
Identities = 24/97 (24%), Positives = 52/97 (53%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
K++ KEE Q K+ + K +E++E + KEK++E + +EE + +
Sbjct: 155 DNLKDDLESLIASAKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLARLE 214
Query: 80 EEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
++ +K+ + + ++E+++ +KK EEK ++ E Q
Sbjct: 215 SKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQA 251
Score = 34.6 bits (80), Expect = 0.031
Identities = 19/93 (20%), Positives = 52/93 (55%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
+KK + K E++++ + K+K++E + EE+ + E K+ E++ + E +K++
Sbjct: 176 SKKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEEL 235
Query: 87 KEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+++ ++K ++E +++ + E+K K +
Sbjct: 236 RKKYEEKLRQELERQAEAHEQKLKNELALQAIE 268
Score = 33.1 bits (76), Expect = 0.084
Identities = 24/83 (28%), Positives = 51/83 (61%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
EE + KK E K +++EE ++ KEK+EE K E++ + + KE +K+ + +
Sbjct: 170 EELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFE 229
Query: 85 KKKEEKKQKKKEEKKQKKKKEEK 107
++KEE ++K +E+ +Q+ +++ +
Sbjct: 230 REKEELRKKYEEKLRQELERQAE 252
Score = 33.1 bits (76), Expect = 0.10
Identities = 24/98 (24%), Positives = 53/98 (54%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
KK E K +++++ E+ K+K E+ K E++ + ++ EK+ + E E++K++
Sbjct: 177 KKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELR 236
Query: 82 KKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
KK ++K ++ +++ E ++K K E + E Q
Sbjct: 237 KKYEEKLRQELERQAEAHEQKLKNELALQAIELQREFN 274
Score = 32.7 bits (75), Expect = 0.11
Identities = 26/106 (24%), Positives = 57/106 (53%), Gaps = 6/106 (5%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKK------KEKKEEKEEKKEKKKEEKKQK 70
+R K+K EE K +++ + + KE +K+ +EK+E +++ +EK ++E +++
Sbjct: 191 ERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQ 250
Query: 71 KEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
E + K K E + E +++ KE K++ +++ + K E N
Sbjct: 251 AEAHEQKLKNELALQAIELQREFNKEIKEKVEEERNGRLAKLAELN 296
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 37.1 bits (86), Expect = 0.004
Identities = 18/100 (18%), Positives = 40/100 (40%), Gaps = 3/100 (3%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
KK K+ + K + +E + +E +Q +K K +K ++ + + KK K
Sbjct: 15 TKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKK---SSKP 71
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
E +E+ + +K + + + K E + +
Sbjct: 72 TESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEE 111
Score = 37.1 bits (86), Expect = 0.005
Identities = 17/77 (22%), Positives = 31/77 (40%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
E+ K K KK ++ + + KK K E ++K K +K E + K
Sbjct: 42 EQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSK 101
Query: 84 QKKKEEKKQKKKEEKKQ 100
E +++++ EE
Sbjct: 102 APSTESEEEEEPEETPD 118
Score = 34.8 bits (80), Expect = 0.030
Identities = 15/78 (19%), Positives = 32/78 (41%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+ + +K K ++ K+ + + KK + + EEK K K E + +
Sbjct: 41 LEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKS 100
Query: 75 KDKKKEEKKQKKKEEKKQ 92
K E +++++ EE
Sbjct: 101 KAPSTESEEEEEPEETPD 118
Score = 34.4 bits (79), Expect = 0.034
Identities = 24/91 (26%), Positives = 37/91 (40%), Gaps = 9/91 (9%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKE---------EKKEKKKEEKKQKKEEEKDKKKEEKKQK 85
+EK KK +K K+ E+ E E+ EK K KK K+ + KK
Sbjct: 10 DEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSS 69
Query: 86 KKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
K E EEK K +K + + + ++
Sbjct: 70 KPTESSAASSEEKPAKPRKSAESTRSSHPKS 100
Score = 28.2 bits (63), Expect = 3.5
Identities = 16/66 (24%), Positives = 28/66 (42%), Gaps = 2/66 (3%)
Query: 7 PKGAISPRIGKRKKK--KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKK 64
PK A P + KK K E + ++K K +K E + + K E +E+++
Sbjct: 53 PKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEEE 112
Query: 65 EEKKQK 70
E+
Sbjct: 113 PEETPD 118
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 37.2 bits (86), Expect = 0.005
Identities = 20/105 (19%), Positives = 57/105 (54%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
+EEE +K+Q+++ + + + + E ++ +EE+ ++EEE+++++E
Sbjct: 397 DTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEE 456
Query: 81 EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQ 125
E++Q+ +EE+ + ++EE++ + ++ + + G P +
Sbjct: 457 EEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEE 501
Score = 35.3 bits (81), Expect = 0.023
Identities = 15/87 (17%), Positives = 54/87 (62%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE 89
KQ EE++++K++E++++ + K E +E ++++ ++++++EE
Sbjct: 394 KQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEE 453
Query: 90 KKQKKKEEKKQKKKKEEKKNKKAEEQN 116
+++++++E ++++ ++E++ ++ E N
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADN 480
Score = 31.8 bits (72), Expect = 0.28
Identities = 14/88 (15%), Positives = 49/88 (55%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
K K+ +EE+++K++E++ + + + E +E ++++ E++
Sbjct: 389 SKYAMKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEE 448
Query: 92 QKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
++++EE+++++++ E++ + EE+ +
Sbjct: 449 EEEEEEEEEEEQESEEEEGEDEEEEEEV 476
Score = 30.7 bits (69), Expect = 0.54
Identities = 13/100 (13%), Positives = 60/100 (60%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
K+++ +++++K++E++++ + + E +E ++++ E+++++EE
Sbjct: 394 KQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEE 453
Query: 82 KKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
+++++++E ++++ E+++++++ E + E + +G
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGD 493
Score = 28.7 bits (64), Expect = 2.8
Identities = 15/96 (15%), Positives = 55/96 (57%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
Q+ +EE+ +++EE++++++E+++E EE++ + +EE+++ + + +++ E +
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGD 493
Query: 88 EEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYP 123
+ ++ +++ +++ + +Q R + P
Sbjct: 494 GDGEEPEEDAERRNSEMAGISRMSEGQQPRGSSVQP 529
Score = 28.0 bits (62), Expect = 4.8
Identities = 17/91 (18%), Positives = 47/91 (51%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
EEE +++Q+ +EE+ + ++EE++ + EE+ E + + ++ +E+ + + E
Sbjct: 453 EEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAG 512
Query: 84 QKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+ E +Q + + + +EE ++ +
Sbjct: 513 ISRMSEGQQPRGSSVQPESPQEEPLQPESMD 543
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 37.0 bits (85), Expect = 0.005
Identities = 22/90 (24%), Positives = 47/90 (52%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
++ + K K+ ++ + E EK E K K++ +++++K++ E Q+ K
Sbjct: 983 REIESKLPLDKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAK 1042
Query: 88 EEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
EE+ KK++E++Q+ +K N K + R
Sbjct: 1043 EEEIGKKEKEREQRIRKTIHDNYKEMAKKR 1072
Score = 35.5 bits (81), Expect = 0.020
Identities = 23/86 (26%), Positives = 49/86 (56%), Gaps = 2/86 (2%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K+ +++ + + ++K E K + KE+ +++EEK ++ E ++ KEEE KK
Sbjct: 992 DKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEEK--ERMESLQRAKEEEIGKK 1049
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKK 104
++E++Q+ ++ KE K++ KK
Sbjct: 1050 EKEREQRIRKTIHDNYKEMAKKRLKK 1075
Score = 33.6 bits (76), Expect = 0.084
Identities = 20/67 (29%), Positives = 41/67 (61%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
+ +E+ K + KE K ++++E+ + + KEE+ KKEK++E++ +K + K+
Sbjct: 1010 ECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMA 1069
Query: 81 EKKQKKK 87
+K+ KKK
Sbjct: 1070 KKRLKKK 1076
Score = 32.4 bits (73), Expect = 0.17
Identities = 14/55 (25%), Positives = 35/55 (63%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ 69
+ K ++++E + Q+ KEE+ KK++E++Q+ ++ + ++ KK+ +KK+
Sbjct: 1023 KERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077
Score = 30.1 bits (67), Expect = 1.1
Identities = 15/71 (21%), Positives = 43/71 (60%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+ ++K E +++ +++ +++++K++ E Q+ KE++ K+EK+ +++ K +
Sbjct: 1007 VPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYK 1066
Query: 75 KDKKKEEKKQK 85
+ KK KK++
Sbjct: 1067 EMAKKRLKKKR 1077
Score = 28.9 bits (64), Expect = 2.4
Identities = 21/93 (22%), Positives = 48/93 (51%), Gaps = 3/93 (3%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEE--KKEKKKEEKKQKKEEEKDKK 78
++ E + ++ ++ + + +EK E K+ K+ K +E+K++ E + +
Sbjct: 983 REIESKLPLDKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQ-RA 1041
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
KEE+ KK++E++Q+ ++ K+ KK K
Sbjct: 1042 KEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLK 1074
Score = 27.0 bits (59), Expect = 9.9
Identities = 17/68 (25%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKE--EKEEKKEKKKEEKKQKKEEEKDKKKEEKKQK 85
+ ++K + + + K+ K Q++KE+ E ++ +++E K+EK++++ K K+
Sbjct: 1010 ECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMA 1069
Query: 86 KKEEKKQK 93
KK KK++
Sbjct: 1070 KKRLKKKR 1077
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 36.3 bits (84), Expect = 0.005
Identities = 19/68 (27%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK--KEEKKQK 85
K+ + E +K K ++K + K ++K +K + + + E+ KK +EKK++
Sbjct: 10 KRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEE 69
Query: 86 KKEEKKQK 93
KK+ KK+K
Sbjct: 70 KKKPKKKK 77
Score = 35.5 bits (82), Expect = 0.009
Identities = 18/62 (29%), Positives = 33/62 (53%), Gaps = 1/62 (1%)
Query: 41 KKEEKKQKKKEKKEEKEEKKEKKKEEKKQK-KEEEKDKKKEEKKQKKKEEKKQKKKEEKK 99
+ EK + K+K + K +++ +K K K + E K+ +EKK++KK+ KK
Sbjct: 16 LESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKK 75
Query: 100 QK 101
+K
Sbjct: 76 KK 77
Score = 33.6 bits (77), Expect = 0.041
Identities = 23/70 (32%), Positives = 33/70 (47%), Gaps = 3/70 (4%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEE-KKQKKEEEK 75
KR + E E K K K + K + K EK K K + E KK +E+K
Sbjct: 10 KRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEK--FKLLIKAQMAERVKKLHSQEKK 67
Query: 76 DKKKEEKKQK 85
++KK+ KK+K
Sbjct: 68 EEKKKPKKKK 77
Score = 33.2 bits (76), Expect = 0.063
Identities = 18/62 (29%), Positives = 32/62 (51%)
Query: 50 KEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
+ + EK + + K + K +DK E+ K K + ++ K+ Q+KK+E+KK
Sbjct: 14 RALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKP 73
Query: 110 KK 111
KK
Sbjct: 74 KK 75
Score = 30.5 bits (69), Expect = 0.45
Identities = 14/66 (21%), Positives = 30/66 (45%)
Query: 6 APKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
+ S + K K E K + + ++ K K Q + K+ ++K+++K+
Sbjct: 12 VVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKK 71
Query: 66 EKKQKK 71
+ K+KK
Sbjct: 72 KPKKKK 77
Score = 29.0 bits (65), Expect = 1.4
Identities = 18/77 (23%), Positives = 30/77 (38%)
Query: 1 MKISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK 60
S K + K ++ ++ K K Q + KK +EKKEEK++ K
Sbjct: 15 ALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPK 74
Query: 61 EKKKEEKKQKKEEEKDK 77
+KK + + D
Sbjct: 75 KKKVPLQVNPAQLFVDD 91
Score = 27.5 bits (61), Expect = 4.4
Identities = 15/65 (23%), Positives = 28/65 (43%), Gaps = 1/65 (1%)
Query: 59 KKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+ +K + K K E K + K E+ K K + ++ KK + KK E++ +
Sbjct: 16 LESEKYKANKDKGNPEI-YNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPK 74
Query: 119 QGLYP 123
+ P
Sbjct: 75 KKKVP 79
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 37.0 bits (86), Expect = 0.006
Identities = 17/65 (26%), Positives = 26/65 (40%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
K+ +E+K K KEE++ + E E DK+K EK +
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDL 271
Query: 87 KEEKK 91
KE+ K
Sbjct: 272 KEKAK 276
Score = 32.8 bits (75), Expect = 0.14
Identities = 15/66 (22%), Positives = 29/66 (43%), Gaps = 4/66 (6%)
Query: 45 KKQKKKEKKEEKEEKKEKK----KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
K ++ KEEK K K+++K + + E+ E K+K ++ K
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDL 271
Query: 101 KKKKEE 106
K+K ++
Sbjct: 272 KEKAKK 277
Score = 32.4 bits (74), Expect = 0.19
Identities = 16/68 (23%), Positives = 29/68 (42%), Gaps = 2/68 (2%)
Query: 41 KKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
K +E K+EK K K+++K + + E+ + K+K EK +
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGP--LEYDKEKLEKIK 269
Query: 101 KKKKEEKK 108
K++ KK
Sbjct: 270 DLKEKAKK 277
Score = 31.6 bits (72), Expect = 0.34
Identities = 12/66 (18%), Positives = 28/66 (42%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
K ++ +E+K + K+++K + E+ E K+K +K +
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDL 271
Query: 109 NKKAEE 114
+KA++
Sbjct: 272 KEKAKK 277
Score = 28.5 bits (64), Expect = 2.6
Identities = 14/77 (18%), Positives = 30/77 (38%), Gaps = 2/77 (2%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
G + NK++QK + + + + +K++ ++ K K+K +K
Sbjct: 226 GSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKLGISIILF 285
Query: 76 D--KKKEEKKQKKKEEK 90
D K + K + E
Sbjct: 286 DDMTKNKTTNYKIQNED 302
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 36.6 bits (85), Expect = 0.008
Identities = 21/90 (23%), Positives = 42/90 (46%)
Query: 11 ISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
I + K K + + K+ KK + + + + + KE E K ++ + +K
Sbjct: 351 IKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKK 410
Query: 71 KEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
EE ++ E+ K ++KK+KKKE ++
Sbjct: 411 AIEEIREELIEEGLLKSKKKKRKKKEWFEK 440
Score = 32.7 bits (75), Expect = 0.12
Identities = 23/95 (24%), Positives = 46/95 (48%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
EE + K K + ++ KK KK + + +++ + +E E K ++ +
Sbjct: 349 EEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEG 408
Query: 84 QKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+K EE +++ EE K KK+++K K+ E+ R
Sbjct: 409 KKAIEEIREELIEEGLLKSKKKKRKKKEWFEKFRW 443
Score = 32.0 bits (73), Expect = 0.25
Identities = 22/94 (23%), Positives = 44/94 (46%), Gaps = 1/94 (1%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+ K + + + + ++ KK KK + K + E +E E K E+
Sbjct: 348 NEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYY-ESAKTALEKA 406
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
+ KK E+ +++ E+ K ++KK+KKK+ +K
Sbjct: 407 EGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440
Score = 30.8 bits (70), Expect = 0.58
Identities = 25/121 (20%), Positives = 49/121 (40%), Gaps = 8/121 (6%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEE--------KK 68
+R K K+ +K+ +KE KK + K EK++ + E+ E+ E+ +K E ++
Sbjct: 276 ERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQLIEE 335
Query: 69 QKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRT 128
K ++ K + +K + E ++ KK +K +RQ
Sbjct: 336 GLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAY 395
Query: 129 L 129
Sbjct: 396 Y 396
Score = 27.7 bits (62), Expect = 5.4
Identities = 16/74 (21%), Positives = 34/74 (45%), Gaps = 3/74 (4%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKK---QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
+ K+ K K +++ + KE K +K + K+ EE E+ ++ K +K
Sbjct: 370 KKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKK 429
Query: 71 KEEEKDKKKEEKKQ 84
K+ +K + E+ +
Sbjct: 430 KKRKKKEWFEKFRW 443
>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106. This
archaebacterial protein family has no known function.
Members are predicted to be integral membrane proteins.
Length = 168
Score = 35.3 bits (82), Expect = 0.008
Identities = 13/59 (22%), Positives = 31/59 (52%), Gaps = 1/59 (1%)
Query: 47 QKKKEKKEEKEEKKEKKKEEKKQKK-EEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
Q ++ ++ +K +K+ +K+ ++ + DK +K +K++EE + +KE K
Sbjct: 34 QWLLIDRKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELMEDQKEMMMDMMKP 92
Score = 29.5 bits (67), Expect = 0.70
Identities = 10/44 (22%), Positives = 28/44 (63%), Gaps = 1/44 (2%)
Query: 69 QKKEEEKDKKKE-EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
+K E+ + ++KE +K+ ++ + K +K +K+++E +++K
Sbjct: 40 RKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELMEDQK 83
Score = 28.8 bits (65), Expect = 1.6
Identities = 12/45 (26%), Positives = 29/45 (64%), Gaps = 3/45 (6%)
Query: 32 KKKEEKKQKKKE-EKKQK--KKEKKEEKEEKKEKKKEEKKQKKEE 73
+K E+ ++++KE +K+ + +K + +K EK++EE + ++E
Sbjct: 40 RKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELMEDQKE 84
Score = 28.4 bits (64), Expect = 1.8
Identities = 13/61 (21%), Positives = 30/61 (49%), Gaps = 3/61 (4%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
+K EK QK+ +K+ +K +E ++ + ++ ++++EE + +KE
Sbjct: 34 QWLLIDRKMEKYQKR---EKEIQKRARELRKNGDKLSPKKFEKRQEELMEDQKEMMMDMM 90
Query: 87 K 87
K
Sbjct: 91 K 91
Score = 28.4 bits (64), Expect = 2.3
Identities = 12/39 (30%), Positives = 24/39 (61%)
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
D+K E+ ++++KE +K+ ++ K K +K K+ EE
Sbjct: 39 DRKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEE 77
Score = 26.5 bits (59), Expect = 9.2
Identities = 14/54 (25%), Positives = 32/54 (59%)
Query: 56 KEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
++ +K +K+E++ QK+ E K ++ KK E+++++ E++K+ K N
Sbjct: 40 RKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELMEDQKEMMMDMMKPN 93
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 36.1 bits (83), Expect = 0.008
Identities = 26/96 (27%), Positives = 60/96 (62%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+KKK+E+ ++ + K+ ++++ K+ +K++ K ++++K+ ++ +K+ + +QK++EE+ +K
Sbjct: 81 RKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARK 140
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+++KK E K K E + K E K K E
Sbjct: 141 AAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEA 176
Score = 36.1 bits (83), Expect = 0.009
Identities = 34/115 (29%), Positives = 62/115 (53%), Gaps = 4/115 (3%)
Query: 7 PKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEE 66
PK A K+ +K++ + +++++ +E +KQ + E+K+Q+++ +K E+KK+ + +
Sbjct: 94 PKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAK 153
Query: 67 KKQKKEEEKDKKKEEKKQKKKEEKK----QKKKEEKKQKKKKEEKKNKKAEEQNR 117
K E K K E K+K +E K K K E KKK E + K A E+ +
Sbjct: 154 AKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAK 208
Score = 33.4 bits (76), Expect = 0.072
Identities = 26/97 (26%), Positives = 52/97 (53%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+++ ++ E++ + +QK++EE+ +K E+K+K + K + + K K + KK+ E+
Sbjct: 117 QKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEA 176
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
K E+ + K E KKK E + K E+ K +
Sbjct: 177 AKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEA 213
Score = 31.8 bits (72), Expect = 0.25
Identities = 24/90 (26%), Positives = 54/90 (60%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
K ++ ++++ K+ +K++ K +++QK+ E+ E++ + ++K++EE+ +K E+ KK E
Sbjct: 93 KPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAA 152
Query: 83 KQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
K K E + K + +KK +E K +
Sbjct: 153 KAKAAAEAAKLKAAAEAKKKAEEAAKAAEE 182
Score = 31.1 bits (70), Expect = 0.44
Identities = 24/112 (21%), Positives = 49/112 (43%)
Query: 6 APKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
A + + + + + + KK+ ++ K E+ + K E K++ + + K
Sbjct: 143 AEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKA 202
Query: 66 EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
++ K E + K K EKK + E+K +++K K K +K A+ R
Sbjct: 203 AAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAER 254
Score = 30.7 bits (69), Expect = 0.48
Identities = 28/111 (25%), Positives = 59/111 (53%), Gaps = 4/111 (3%)
Query: 13 PRIGKRKKKKKEEENKKKQKKKEE----KKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKK 68
+ K K+ E+ + KQ +KE ++QK+ EE +++ + +++++EE+ K E+K
Sbjct: 87 QVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQK 146
Query: 69 QKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+K E K K E + K + +KK EE + ++ + K + A + + +
Sbjct: 147 KKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAE 197
Score = 30.7 bits (69), Expect = 0.51
Identities = 22/97 (22%), Positives = 49/97 (50%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K+++++ + +++KK E K K E + K + +K+ ++ K E+ + K E
Sbjct: 132 KQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAA 191
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
KK+ + + K +K K + E K K +K+ + + +
Sbjct: 192 AKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEK 228
Score = 30.7 bits (69), Expect = 0.54
Identities = 27/104 (25%), Positives = 62/104 (59%), Gaps = 5/104 (4%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
R + ++ K +Q++K++++Q +E K ++ E++ K+ +KE+ K +++QK+ EE +
Sbjct: 65 GRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAE 124
Query: 77 KKKEEKKQKKKEEKK-----QKKKEEKKQKKKKEEKKNKKAEEQ 115
K+ + ++++++E+ + QKKK E + K E KA +
Sbjct: 125 KQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAE 168
Score = 30.3 bits (68), Expect = 0.68
Identities = 22/90 (24%), Positives = 41/90 (45%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+KK ++ + ++ K K E KK+ + + K ++ K E + K K EKK + E+
Sbjct: 170 KKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKA 229
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
E+KK K + + K ++K
Sbjct: 230 AAEKKKAAAKAKADKAAAAAKAAERKAAAA 259
Score = 30.3 bits (68), Expect = 0.80
Identities = 23/98 (23%), Positives = 44/98 (44%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K K + ++ ++ K E+ + K E KKK + E K ++ K E + + K E+K
Sbjct: 162 KLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKA 221
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+ E+K +++K K + K + + K A
Sbjct: 222 EAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAA 259
Score = 29.1 bits (65), Expect = 1.8
Identities = 31/98 (31%), Positives = 50/98 (51%), Gaps = 2/98 (2%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKK--EKKEEKEEKKEKKKEEKKQKKEEEKD 76
K+ + E++ +++Q +K +QKKK E + K E + K + KKK E+ K EE
Sbjct: 125 KQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAK 184
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
K E KKK E + K EK + + + + K +K E
Sbjct: 185 AKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAE 222
>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
Length = 819
Score = 36.2 bits (84), Expect = 0.009
Identities = 19/70 (27%), Positives = 33/70 (47%)
Query: 47 QKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
+++ K +E K+ KE EE ++K E+ +Q + +++ KK+ KK KK E
Sbjct: 444 EEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEV 503
Query: 107 KKNKKAEEQN 116
EE
Sbjct: 504 DTYLLLEELG 513
Score = 35.1 bits (81), Expect = 0.027
Identities = 17/64 (26%), Positives = 31/64 (48%), Gaps = 1/64 (1%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE-EKKQKKE 72
+ + + K E K+ KE +E +++ ++ ++ E ++E KKE KK KK
Sbjct: 441 KFLEEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKI 500
Query: 73 EEKD 76
E D
Sbjct: 501 PEVD 504
Score = 29.7 bits (67), Expect = 1.4
Identities = 17/60 (28%), Positives = 31/60 (51%), Gaps = 3/60 (5%)
Query: 34 KEEKKQKKKEEKKQ---KKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
+EE K EE KQ K+ +E +++ ++ ++ + +E+ KK+ KK KK E
Sbjct: 444 EEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEV 503
Score = 28.9 bits (65), Expect = 2.5
Identities = 16/73 (21%), Positives = 32/73 (43%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
I K EE ++K ++ E+ + +EE K++ +K ++ E E+ EE
Sbjct: 459 IVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEVDTYLLLEELGINEET 518
Query: 75 KDKKKEEKKQKKK 87
+K + +K
Sbjct: 519 YEKLEALLAKKNN 531
Score = 27.7 bits (62), Expect = 4.8
Identities = 11/64 (17%), Positives = 33/64 (51%)
Query: 39 QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEK 98
+K EE+ K ++ ++ KE ++ +E+ ++ ++ + +++ +K+ KK +K
Sbjct: 440 KKFLEEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKK 499
Query: 99 KQKK 102
+
Sbjct: 500 IPEV 503
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 36.4 bits (85), Expect = 0.009
Identities = 27/102 (26%), Positives = 46/102 (45%), Gaps = 10/102 (9%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK----------EKKKE 65
G K++K E ++ K E+ + +K++E+ E K EK +
Sbjct: 486 GTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLK 545
Query: 66 EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
E+ K E KK EE + KEE + + KEE + K ++ +K
Sbjct: 546 EEGDKLPEADKKKVEEAIEWLKEELEGEDKEEIEAKTEELQK 587
Score = 31.4 bits (72), Expect = 0.30
Identities = 19/86 (22%), Positives = 44/86 (51%), Gaps = 10/86 (11%)
Query: 40 KKKEEKKQKKKEKKEEKEEKKEKKKEEK----------KQKKEEEKDKKKEEKKQKKKEE 89
+ E+ K E+ +++K++++ E K ++ +EE DK E K+K +E
Sbjct: 503 DDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKKVEEA 562
Query: 90 KKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+ K+E + + K++ E K ++ ++
Sbjct: 563 IEWLKEELEGEDKEEIEAKTEELQKV 588
Score = 31.4 bits (72), Expect = 0.31
Identities = 24/78 (30%), Positives = 40/78 (51%), Gaps = 7/78 (8%)
Query: 22 KKEEENKKKQKKKEEKKQKKKE-------EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
K EE + KK++E+ + K E +K K+E + E K+K +E + KEE
Sbjct: 511 KDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKKVEEAIEWLKEEL 570
Query: 75 KDKKKEEKKQKKKEEKKQ 92
+ + KEE + K +E +K
Sbjct: 571 EGEDKEEIEAKTEELQKV 588
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 36.1 bits (84), Expect = 0.011
Identities = 19/87 (21%), Positives = 43/87 (49%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE 89
++ + E + ++ +++ +K +EK E+ E + EKK+EE ++ K + DK ++ +K
Sbjct: 88 EENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSN 147
Query: 90 KKQKKKEEKKQKKKKEEKKNKKAEEQN 116
+ K +K KE+ +
Sbjct: 148 LSEALKGLNYKKNFKEKLLKELKSVIL 174
Score = 35.4 bits (82), Expect = 0.020
Identities = 22/104 (21%), Positives = 45/104 (43%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
+I + KK+ K+ E K +Q + E +K++++ EK + K K K+ K+ + K
Sbjct: 96 QIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGL 155
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
K +EK K+ + ++ K + K + +
Sbjct: 156 NYKKNFKEKLLKELKSVILNASSLLSLEELKAKIKTLFSSNKPE 199
Score = 34.2 bits (79), Expect = 0.045
Identities = 16/78 (20%), Positives = 40/78 (51%), Gaps = 1/78 (1%)
Query: 57 EEKKEKKKE-EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
EE E + + E+ +K+ ++ ++K E+ + + ++++++ +K + K K +K KK +
Sbjct: 88 EENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSN 147
Query: 116 NRQQGLYPNQKRTLIGYL 133
+ N K+ L
Sbjct: 148 LSEALKGLNYKKNFKEKL 165
Score = 33.8 bits (78), Expect = 0.060
Identities = 20/87 (22%), Positives = 46/87 (52%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+G+ + + + + K++ K+ +++ ++ E + +KKE++ EK + K K KK K+ +
Sbjct: 86 LGEENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYD 145
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQK 101
+ + K K+ K+K +E K
Sbjct: 146 SNLSEALKGLNYKKNFKEKLLKELKSV 172
Score = 32.7 bits (75), Expect = 0.12
Identities = 20/74 (27%), Positives = 40/74 (54%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
E N+K K+EK + KK+ E KE+ + +++KK +K EK+ K+ E +
Sbjct: 378 REHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAE 437
Query: 84 QKKKEEKKQKKKEE 97
K E++ ++ +++
Sbjct: 438 IKALEKEIKELEKQ 451
Score = 28.4 bits (64), Expect = 3.6
Identities = 18/74 (24%), Positives = 37/74 (50%)
Query: 45 KKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
++ +K +KE+ K KKK E ++D +K++K E+ ++E KQ + +
Sbjct: 378 REHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAE 437
Query: 105 EEKKNKKAEEQNRQ 118
+ K+ +E +Q
Sbjct: 438 IKALEKEIKELEKQ 451
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 35.3 bits (82), Expect = 0.011
Identities = 27/87 (31%), Positives = 51/87 (58%), Gaps = 1/87 (1%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEE-KEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
K K + EK +K+ EE KQ+ E + + ++ KK +++ E++ + EE + ++E K+ K
Sbjct: 64 LNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLK 123
Query: 87 KEEKKQKKKEEKKQKKKKEEKKNKKAE 113
E +K +K + ++ +K KEE K K
Sbjct: 124 AELEKYEKNDPERIEKLKEETKVAKEA 150
Score = 33.8 bits (78), Expect = 0.039
Identities = 24/88 (27%), Positives = 49/88 (55%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K + +K +K+ EE KQ+ E + Q +K KK +E ++ + E+ ++ E+E K
Sbjct: 63 ALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKL 122
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
K E ++ +K + ++ +K +++ K KE
Sbjct: 123 KAELEKYEKNDPERIEKLKEETKVAKEA 150
Score = 31.1 bits (71), Expect = 0.31
Identities = 19/69 (27%), Positives = 40/69 (57%)
Query: 46 KQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKE 105
+ K + + EK +K+ EE KQ+ E + + ++ KK +++ E++ + EE KQ +K+
Sbjct: 60 PSQALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKEL 119
Query: 106 EKKNKKAEE 114
+K + E+
Sbjct: 120 KKLKAELEK 128
Score = 30.3 bits (69), Expect = 0.50
Identities = 20/71 (28%), Positives = 42/71 (59%)
Query: 38 KQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEE 97
+ + K + ++ K+E EE K++ E + Q ++ +K +++ E++ + EE KQ +KE
Sbjct: 60 PSQALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKEL 119
Query: 98 KKQKKKKEEKK 108
KK K + E+ +
Sbjct: 120 KKLKAELEKYE 130
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 36.2 bits (84), Expect = 0.011
Identities = 16/110 (14%), Positives = 57/110 (51%)
Query: 10 AISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ 69
A++ I + +++K+ + +++ ++ + + E+ + K ++ E+ + E+K EE K+
Sbjct: 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKE 351
Query: 70 KKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+ E + + +E + + ++ E + ++ EE+ + + + + + +
Sbjct: 352 ELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNE 401
Score = 35.4 bits (82), Expect = 0.018
Identities = 19/105 (18%), Positives = 54/105 (51%), Gaps = 3/105 (2%)
Query: 17 KRKKKKKEEENKKKQKKKE---EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
K+ +++ +++ ++ E E+ + K +E ++ E +E+ EE KE+ + + + +E
Sbjct: 304 KQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEEL 363
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
E + ++ E + ++ EE+ + + + Q + + N + E +
Sbjct: 364 EAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEAR 408
Score = 35.0 bits (81), Expect = 0.027
Identities = 16/88 (18%), Positives = 44/88 (50%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
+++ E+ QK+ + +++K+ +E+ ++Q +E E ++ E K + E+
Sbjct: 280 EEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEEL 339
Query: 92 QKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+ +E+ ++ K++ E + EE +
Sbjct: 340 AELEEKLEELKEELESLEAELEELEAEL 367
Score = 31.2 bits (71), Expect = 0.46
Identities = 17/111 (15%), Positives = 57/111 (51%), Gaps = 5/111 (4%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
+ ++ ++E + +++ ++ + ++ EEK ++ + + E EE+ E+ ++E E
Sbjct: 239 EELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEIS 298
Query: 76 DKKKEEKKQKKKEEKKQKKKEEK-----KQKKKKEEKKNKKAEEQNRQQGL 121
+++++ +++ +++ EE + + K +E + AE + + + L
Sbjct: 299 RLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEEL 349
Score = 30.0 bits (68), Expect = 1.1
Identities = 20/122 (16%), Positives = 59/122 (48%), Gaps = 3/122 (2%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ K + EE + ++K EE K++ + + + ++ + E +E E + EE +++ E +
Sbjct: 329 ESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEE--LESRLEELEEQLETLRS 386
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLIGYLRSV 136
K + + Q + ++ E + + + ++ ++ + E + + L + + L L +
Sbjct: 387 KVAQLELQIASLNNEIERLEARLE-RLEDRRERLQQEIEELLKKLEEAELKELQAELEEL 445
Query: 137 DQ 138
++
Sbjct: 446 EE 447
Score = 29.6 bits (67), Expect = 1.4
Identities = 18/108 (16%), Positives = 55/108 (50%), Gaps = 5/108 (4%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKK--EEKEEKKEKKKEEKKQKK 71
R+ + +++ EE +K+ + + +++K+ ++ E + E+ E + EE + K
Sbjct: 273 RLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKL 332
Query: 72 EEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+E +E + ++K E+ +++ E + + ++ E + ++ E + +
Sbjct: 333 DE---LAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEEL 377
Score = 28.9 bits (65), Expect = 2.6
Identities = 22/115 (19%), Positives = 53/115 (46%), Gaps = 16/115 (13%)
Query: 29 KKQKKKEEKKQKKKEEKKQKK--------KEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
++Q +K E+ ++ K E ++ + +E +EE EE +E+ KE +++ +E + ++
Sbjct: 206 ERQAEKAERYKELKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQEL 265
Query: 81 E--------KKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKR 127
E + + +EE ++ +KE + + +K + R L +
Sbjct: 266 EEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEE 320
Score = 27.7 bits (62), Expect = 5.3
Identities = 19/101 (18%), Positives = 50/101 (49%), Gaps = 2/101 (1%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
++ E+ ++Q ++ + + + ++ +E EE E + E E+ +E + E
Sbjct: 836 TERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSE 895
Query: 81 EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
++ ++ + + K+ E ++ + EE + K A+ + R +GL
Sbjct: 896 LEELSEELRELESKRSELRR--ELEELREKLAQLELRLEGL 934
Score = 27.3 bits (61), Expect = 8.5
Identities = 19/116 (16%), Positives = 51/116 (43%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
+ EE +++ + E + ++ + + +Q K+E K +E E + E +E +++ E
Sbjct: 769 RLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLES 828
Query: 83 KQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLIGYLRSVDQ 138
+++ +++ ++ ++Q ++ E A E + L + L L
Sbjct: 829 LERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERAS 884
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 35.1 bits (81), Expect = 0.012
Identities = 31/80 (38%), Positives = 65/80 (81%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
KK KK+ K EEK+ ++++++ E+EE++E+KK E+K++ E +++++ EE+++KKKEE++
Sbjct: 1 KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60
Query: 92 QKKKEEKKQKKKKEEKKNKK 111
+K++EE+ +K+++E +K K
Sbjct: 61 RKEREEQARKEQEEYEKLKS 80
Score = 31.6 bits (72), Expect = 0.17
Identities = 27/96 (28%), Positives = 66/96 (68%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
KK + ++KQ ++++++ +++E +++KK E+K E E K+E++ EE+++KK+EE+++K+
Sbjct: 5 AKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKER 64
Query: 81 EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
E++ +K++E+ +K K +++ +K + E
Sbjct: 65 EEQARKEQEEYEKLKSSFVVEEEGTDKLSADEESNE 100
Score = 30.4 bits (69), Expect = 0.42
Identities = 23/105 (21%), Positives = 67/105 (63%)
Query: 29 KKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKE 88
K++ K E+KQ ++++++ +++E++E K+ +++++ E K++++ EE+ +KK+E++++K+
Sbjct: 5 AKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKER 64
Query: 89 EKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLIGYL 133
E++ +K++E+ +K K ++ ++ I Y+
Sbjct: 65 EEQARKEQEEYEKLKSSFVVEEEGTDKLSADEESNELLEDFINYI 109
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 35.4 bits (82), Expect = 0.013
Identities = 19/76 (25%), Positives = 34/76 (44%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
EEE + + + EE++ KKK++KK+KK +K E + + E K
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225
Query: 84 QKKKEEKKQKKKEEKK 99
+ KK+ E+ +
Sbjct: 226 KLKKKRSIAPDNEKSE 241
Score = 34.2 bits (79), Expect = 0.030
Identities = 20/76 (26%), Positives = 36/76 (47%), Gaps = 4/76 (5%)
Query: 42 KEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE----KKQKKKEE 97
+EE + K +EE+ +KK+KKK++K +K + E Q + +
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225
Query: 98 KKQKKKKEEKKNKKAE 113
K +KK+ N+K+E
Sbjct: 226 KLKKKRSIAPDNEKSE 241
Score = 31.5 bits (72), Expect = 0.25
Identities = 15/86 (17%), Positives = 36/86 (41%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
E + ++++ +KK+KKK++K +K E + + + K+ K
Sbjct: 169 VELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLK 228
Query: 83 KQKKKEEKKQKKKEEKKQKKKKEEKK 108
K++ +K + K +++K
Sbjct: 229 KKRSIAPDNEKSEVYKSLFTSHKKEK 254
Score = 31.5 bits (72), Expect = 0.27
Identities = 21/83 (25%), Positives = 34/83 (40%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K + EEE KK+KKK++KK KK + + + + ++ K K+
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKR 231
Query: 79 KEEKKQKKKEEKKQKKKEEKKQK 101
+K E K KK+K
Sbjct: 232 SIAPDNEKSEVYKSLFTSHKKEK 254
Score = 31.2 bits (71), Expect = 0.36
Identities = 12/70 (17%), Positives = 31/70 (44%)
Query: 50 KEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
+E+ E + + E+++ +KK+KK+++K KK + E + +
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225
Query: 110 KKAEEQNRQQ 119
K ++++
Sbjct: 226 KLKKKRSIAP 235
Score = 29.2 bits (66), Expect = 1.5
Identities = 18/80 (22%), Positives = 34/80 (42%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
R+ + + KKK+++ KKK KK + + + + K+ KK++
Sbjct: 175 RLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIA 234
Query: 74 EKDKKKEEKKQKKKEEKKQK 93
++K E K KK+K
Sbjct: 235 PDNEKSEVYKSLFTSHKKEK 254
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 36.1 bits (83), Expect = 0.013
Identities = 19/94 (20%), Positives = 37/94 (39%), Gaps = 1/94 (1%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEE-KKQKKEEEKDKKKEEKKQKKKEE 89
++ K EK KK+ + EK + K++ K +E+ K E KKQ+ +E+
Sbjct: 712 GHPYKKGVPPKPAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEK 771
Query: 90 KKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYP 123
K + +N+++ + P
Sbjct: 772 SSSCSPSSSSSHHHSSSNKESRKSSRNKEEEMLP 805
Score = 31.0 bits (70), Expect = 0.46
Identities = 19/87 (21%), Positives = 36/87 (41%), Gaps = 3/87 (3%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKK---KEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
KK K +K +K+ + +K K K++ K +++ K E KKQ+ EE+
Sbjct: 716 KKGVPPKPAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSC 775
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKE 105
K+ + + K++E
Sbjct: 776 SPSSSSSHHHSSSNKESRKSSRNKEEE 802
Score = 30.7 bits (69), Expect = 0.60
Identities = 21/81 (25%), Positives = 37/81 (45%), Gaps = 2/81 (2%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE--EKKQKKEEEKDKK 78
K E+++ KK+ K +K K K+K K +E+ +K E KK+ E+K
Sbjct: 722 KPAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSS 781
Query: 79 KEEKKQKKKEEKKQKKKEEKK 99
KE +K + +E++
Sbjct: 782 SHHHSSSNKESRKSSRNKEEE 802
Score = 30.7 bits (69), Expect = 0.73
Identities = 18/81 (22%), Positives = 34/81 (41%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K +K KK+ K +K K ++K K E+ ++ E KK++ +E+
Sbjct: 722 KPAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSS 781
Query: 77 KKKEEKKQKKKEEKKQKKKEE 97
K+ + + K+EE
Sbjct: 782 SHHHSSSNKESRKSSRNKEEE 802
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 35.8 bits (83), Expect = 0.013
Identities = 30/102 (29%), Positives = 53/102 (51%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
KK + + +KE + +K K+E K + KEKKE EE++ + +EE K +K K K
Sbjct: 417 KKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLK 476
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
+ EK QK++EE++ ++ + K KK + + +
Sbjct: 477 RSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSS 518
Score = 33.9 bits (78), Expect = 0.061
Identities = 27/101 (26%), Positives = 54/101 (53%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
+ K+ K KE E+ ++ K ++ E ++K++ E +E ++E + EE+ ++E
Sbjct: 353 MLRKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEEN 412
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
E+ KK ++K E +K+ E KK KK+ + + +K E
Sbjct: 413 EEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKES 453
Score = 33.1 bits (76), Expect = 0.10
Identities = 22/97 (22%), Positives = 57/97 (58%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
++ E KK++ E ++ +++ E +++ E++ E+ KK + + + E++ + K+
Sbjct: 380 QRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKL 439
Query: 82 KKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
KK+ K E K++K+ +E+++ + +EE K +K + +
Sbjct: 440 KKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLK 476
Score = 32.7 bits (75), Expect = 0.14
Identities = 30/97 (30%), Positives = 53/97 (54%), Gaps = 2/97 (2%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K+ + ++ K+ + + +EKK+ +EE+ + ++E K EK K K+ EK QK+EEE++
Sbjct: 432 KEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELD 491
Query: 79 KEE--KKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
+E K K KK++ KK+ K +K K
Sbjct: 492 EENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKIS 528
Score = 32.3 bits (74), Expect = 0.17
Identities = 23/90 (25%), Positives = 57/90 (63%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
R++ + EEE+ +++ ++ KK + + + EK+ E ++ K++ K E K+KKE ++++
Sbjct: 398 RRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEE 457
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
+ E++++ K E+ K + ++ +K+EE+
Sbjct: 458 ELEDEEEAKVEKVANKLLKRSEKAQKEEEE 487
Score = 32.0 bits (73), Expect = 0.24
Identities = 22/98 (22%), Positives = 52/98 (53%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
++KK+ +EE + + +++ + ++ + K+ +K +KEE+EE+ +++ K K
Sbjct: 447 FKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKS 506
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
KK++ K+K + + + K K ++KK K+
Sbjct: 507 AKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSI 544
Score = 30.8 bits (70), Expect = 0.57
Identities = 22/101 (21%), Positives = 57/101 (56%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
++K EN +K+ + ++ K++ K E K+KK+ +EE+ E +E+ K EK K ++
Sbjct: 418 KNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKR 477
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
+K ++++++++ +++ + K +K++ K + +
Sbjct: 478 SEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSS 518
Score = 28.9 bits (65), Expect = 2.5
Identities = 23/103 (22%), Positives = 52/103 (50%)
Query: 12 SPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
+ K +++E E++++ K ++ + K +K +K+E++EE +E+ K K
Sbjct: 446 EFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGK 505
Query: 72 EEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+K K++ K + + K K KKKK+++K+ ++
Sbjct: 506 SAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDD 548
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 35.3 bits (81), Expect = 0.015
Identities = 21/98 (21%), Positives = 50/98 (51%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
RK K+K EE K+Q + +K ++++ ++ + E+K ++ +++ E
Sbjct: 94 VRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAG 153
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
K EE + E+ ++ ++ K+K E++N+ ++E
Sbjct: 154 KVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKE 191
Score = 29.5 bits (66), Expect = 1.0
Identities = 19/111 (17%), Positives = 52/111 (46%)
Query: 13 PRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
P++ + K E+ + +++ + K ++ E+ ++ E+ ++ ++K E++ +
Sbjct: 130 PQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDS 189
Query: 73 EEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYP 123
+E +K E K + + ++ EE + + K + A+E +Q P
Sbjct: 190 KEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIKEELAKEPVEEQQEVP 240
Score = 28.7 bits (64), Expect = 2.2
Identities = 16/95 (16%), Positives = 46/95 (48%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
K+ + ++ K E+ +++ + K E+ E+ +E E+ KQ +E+ +++ E
Sbjct: 128 KEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENE 187
Query: 82 KKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
++ E+ ++ K E ++ ++++ E+
Sbjct: 188 DSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSK 222
Score = 28.4 bits (63), Expect = 2.8
Identities = 17/98 (17%), Positives = 50/98 (51%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K++++ K+ Q +++ + + +Q+ E + EE ++ E+ ++ ++D K
Sbjct: 119 SKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMK 178
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
++ +Q+ ++ K+ +K E+ + + + + EE N
Sbjct: 179 EKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDN 216
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 34.3 bits (79), Expect = 0.015
Identities = 18/62 (29%), Positives = 36/62 (58%)
Query: 50 KEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
+E +E++E ++E + ++E+ + K E K+KKK E + K E++K K + ++ K
Sbjct: 89 RELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148
Query: 110 KK 111
K
Sbjct: 149 SK 150
Score = 34.0 bits (78), Expect = 0.026
Identities = 17/58 (29%), Positives = 35/58 (60%)
Query: 53 KEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
+E+EE +E+ +E +Q+ E + +EKK+++ + K +K++ K + KK + K K
Sbjct: 95 DDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152
Score = 33.6 bits (77), Expect = 0.027
Identities = 19/62 (30%), Positives = 35/62 (56%)
Query: 42 KEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQK 101
+E +E++ E+E E ++E+ + K E K+KKK E + K E++K K + +K +
Sbjct: 89 RELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148
Query: 102 KK 103
K
Sbjct: 149 SK 150
Score = 33.6 bits (77), Expect = 0.028
Identities = 18/58 (31%), Positives = 32/58 (55%), Gaps = 1/58 (1%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
EE++ +++ + ++++ E K E KEKKK E + K E++ K K E K+ K
Sbjct: 92 SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKE-KPKTEPKKPKP 148
Score = 33.2 bits (76), Expect = 0.038
Identities = 16/48 (33%), Positives = 29/48 (60%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
+E ++ +Q+ E K + KE+KK++ + K EKE+ K + K+ K K
Sbjct: 103 EESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150
Score = 32.8 bits (75), Expect = 0.050
Identities = 20/59 (33%), Positives = 32/59 (54%)
Query: 36 EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
E EE++ +++ E ++E + K E K+KK+ E K K EK++ K E KK K
Sbjct: 90 ELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148
Score = 32.4 bits (74), Expect = 0.070
Identities = 16/61 (26%), Positives = 32/61 (52%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
EEE +++ E +++ E K + K++KK E + K +K++ K + K+ + K
Sbjct: 92 SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKP 151
Query: 79 K 79
K
Sbjct: 152 K 152
Score = 31.6 bits (72), Expect = 0.13
Identities = 15/54 (27%), Positives = 32/54 (59%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
E+ EE+ + ++++ + K E ++ KK+E K K ++EK + + ++ K K K
Sbjct: 99 EETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152
Score = 31.3 bits (71), Expect = 0.17
Identities = 16/53 (30%), Positives = 31/53 (58%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
E +++ E +++ E K E +EKK+++ + K +KE+ K + K+ K K K
Sbjct: 100 ETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152
Score = 30.5 bits (69), Expect = 0.36
Identities = 8/44 (18%), Positives = 26/44 (59%)
Query: 72 EEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
EEE +++ ++ +++ + + + +EKK+++ + K K+ +
Sbjct: 98 EEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKT 141
Score = 30.1 bits (68), Expect = 0.55
Identities = 19/60 (31%), Positives = 35/60 (58%), Gaps = 1/60 (1%)
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEE-KKQKKKKEEKKNKKAEEQNRQQGLY 122
+E+++ +EE D+ ++E + K E K+KKK E K K +KE+ K + + + + LY
Sbjct: 95 DDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPKLY 154
Score = 28.2 bits (63), Expect = 1.9
Identities = 16/54 (29%), Positives = 34/54 (62%), Gaps = 1/54 (1%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK-KQKKKEEKKQKKK 95
EE +++ ++ E+++ + K + ++K+K+E K K ++EK K + K+ K K K
Sbjct: 99 EETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152
Score = 27.8 bits (62), Expect = 2.9
Identities = 11/46 (23%), Positives = 20/46 (43%)
Query: 7 PKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEK 52
K + +E+ K++ K + +K+K K E K+ K K
Sbjct: 105 STDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150
Score = 26.6 bits (59), Expect = 7.6
Identities = 14/55 (25%), Positives = 28/55 (50%), Gaps = 2/55 (3%)
Query: 66 EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
E +EE++ ++E + ++E+ + K E K +KKK E K E++ +
Sbjct: 90 ELSSSDDEEEETEEESTDETEQEDPPETKTESK--EKKKREVPKPKTEKEKPKTE 142
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 35.6 bits (82), Expect = 0.016
Identities = 19/59 (32%), Positives = 41/59 (69%), Gaps = 1/59 (1%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
E+K+++ EK ++E ++E++ EE++ ++EEEK + ++ Q K E +K+++K + KK
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQR-RREEEKAAMEADRAQAKAEVEKRREKLQNLLKK 309
Score = 35.2 bits (81), Expect = 0.021
Identities = 19/56 (33%), Positives = 38/56 (67%), Gaps = 3/56 (5%)
Query: 60 KEKKKEEKKQKKEE-EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+EK++E +K KEE E++++ EE Q+++EE+K + ++ Q K + EK+ +K +
Sbjct: 252 EEKRRELEKLAKEEAERERQAEE--QRRREEEKAAMEADRAQAKAEVEKRREKLQN 305
Score = 32.9 bits (75), Expect = 0.11
Identities = 18/60 (30%), Positives = 38/60 (63%), Gaps = 1/60 (1%)
Query: 54 EEKEEKKEK-KKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
EEK + EK KEE +++++ E+ +++EE+K + ++ Q K E +K+++K + K +
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQNLLKKAS 311
Score = 31.8 bits (72), Expect = 0.30
Identities = 12/40 (30%), Positives = 23/40 (57%)
Query: 57 EEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKE 96
EE + +K +++++ EE + K EK + K E K+K +
Sbjct: 431 EEHQIYRKLQEERRLREEAIRAKAEKTARMKAEMKEKTLK 470
Score = 31.4 bits (71), Expect = 0.37
Identities = 12/52 (23%), Positives = 36/52 (69%), Gaps = 2/52 (3%)
Query: 36 EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
E+K+++ E+ +++ E++ + EE ++++EE+K E ++ + K E +++++
Sbjct: 252 EEKRRELEKLAKEEAERERQAEE--QRRREEEKAAMEADRAQAKAEVEKRRE 301
Score = 30.6 bits (69), Expect = 0.74
Identities = 17/58 (29%), Positives = 38/58 (65%), Gaps = 1/58 (1%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE-EKDKKKEEKKQKK 86
++K++E +K K+E +++++ E++ +EE+K + ++ Q K E EK ++K + KK
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQNLLKK 309
Score = 29.5 bits (66), Expect = 1.4
Identities = 11/40 (27%), Positives = 23/40 (57%)
Query: 65 EEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
EE + ++ +++++ E+ + K EK + K E K+K K
Sbjct: 431 EEHQIYRKLQEERRLREEAIRAKAEKTARMKAEMKEKTLK 470
Score = 29.5 bits (66), Expect = 1.6
Identities = 12/41 (29%), Positives = 24/41 (58%)
Query: 55 EKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKK 95
E+E + +K +E+++ +EE K E+ + K E K++ K
Sbjct: 430 EEEHQIYRKLQEERRLREEAIRAKAEKTARMKAEMKEKTLK 470
Score = 28.7 bits (64), Expect = 2.5
Identities = 13/53 (24%), Positives = 37/53 (69%), Gaps = 3/53 (5%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
++K++E E K K+E +++++ EE++++++EK + ++ + K E +K+++
Sbjct: 252 EEKRRELE---KLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRRE 301
Score = 28.3 bits (63), Expect = 3.7
Identities = 11/44 (25%), Positives = 29/44 (65%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEK 62
+K KEE +++Q +++ +++++K + + + K E E+++EK
Sbjct: 259 EKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREK 302
Score = 27.5 bits (61), Expect = 6.1
Identities = 11/36 (30%), Positives = 21/36 (58%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
E + +K +E+++ ++E K EK + K E K+K
Sbjct: 432 EHQIYRKLQEERRLREEAIRAKAEKTARMKAEMKEK 467
Score = 27.5 bits (61), Expect = 6.5
Identities = 11/40 (27%), Positives = 22/40 (55%)
Query: 65 EEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
E + +K +E+ + +EE + K E+ + K E K++ K
Sbjct: 432 EHQIYRKLQEERRLREEAIRAKAEKTARMKAEMKEKTLKM 471
Score = 27.5 bits (61), Expect = 6.7
Identities = 10/33 (30%), Positives = 18/33 (54%)
Query: 48 KKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
K +E++ +EE K E+ + K E K+K +
Sbjct: 438 KLQEERRLREEAIRAKAEKTARMKAEMKEKTLK 470
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 35.4 bits (82), Expect = 0.016
Identities = 19/87 (21%), Positives = 47/87 (54%), Gaps = 2/87 (2%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
+K + E ++ E + +EE++ KKK +K +++E++ ++ E+ ++++E
Sbjct: 289 TEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELN 348
Query: 92 QKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
K E+ QKK ++ +K + + N+
Sbjct: 349 SKL--EEIQKKLEDLEKRLEKLKSNKS 373
Score = 33.5 bits (77), Expect = 0.066
Identities = 20/86 (23%), Positives = 45/86 (52%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ +K + +E + + E++++ K+K EK +KEK+ EE +Q EE+ +
Sbjct: 288 RTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKEL 347
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKK 104
+ ++ +K+ + +K+ EK + K
Sbjct: 348 NSKLEEIQKKLEDLEKRLEKLKSNKS 373
Score = 33.1 bits (76), Expect = 0.084
Identities = 16/83 (19%), Positives = 45/83 (54%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
++ K + + + + E++++ K++ +K +EK++ EE ++ EE+K+ +
Sbjct: 291 KLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSK 350
Query: 74 EKDKKKEEKKQKKKEEKKQKKKE 96
++ +K+ + +K+ EK + K
Sbjct: 351 LEEIQKKLEDLEKRLEKLKSNKS 373
Score = 31.5 bits (72), Expect = 0.28
Identities = 20/86 (23%), Positives = 48/86 (55%), Gaps = 2/86 (2%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKK--QKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
R +K +N + KE + + EE++ +KK +K ++EK+ ++ E+ ++ +E
Sbjct: 288 RTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKEL 347
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQK 101
+ K EE ++K ++ +K+ +K + +
Sbjct: 348 NSKLEEIQKKLEDLEKRLEKLKSNKS 373
Score = 29.6 bits (67), Expect = 1.2
Identities = 18/76 (23%), Positives = 40/76 (52%), Gaps = 3/76 (3%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
E + +K + E K+ E + +EE + KKK +K ++KE++ ++ ++ +++
Sbjct: 285 ENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEER 344
Query: 103 KKEEKKNKKAEEQNRQ 118
K+ N K EE ++
Sbjct: 345 KE---LNSKLEEIQKK 357
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 35.0 bits (81), Expect = 0.017
Identities = 17/66 (25%), Positives = 45/66 (68%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
EEE+++K+++K + K K +K K K +++EK ++++++K ++ +++ +D+ E+ +
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLR 98
Query: 84 QKKKEE 89
+K +E
Sbjct: 99 LRKLQE 104
Score = 34.6 bits (80), Expect = 0.019
Identities = 21/65 (32%), Positives = 39/65 (60%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
E+E K+++K K K K K+ K K +EK++ K EK+EK E ++ E++ +K +
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLR 100
Query: 84 QKKKE 88
+ ++E
Sbjct: 101 KLQEE 105
Score = 32.3 bits (74), Expect = 0.14
Identities = 19/67 (28%), Positives = 42/67 (62%)
Query: 53 KEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
K+ +E+++++KEE+K K + KK K + +++EK +++KEEK ++ +E+ +
Sbjct: 34 KDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDEL 93
Query: 113 EEQNRQQ 119
E+ R +
Sbjct: 94 AEKLRLR 100
Score = 31.9 bits (73), Expect = 0.16
Identities = 19/65 (29%), Positives = 38/65 (58%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
+ +E++++K K K K K+ + K E+KEK K EK++K E ++ E + +K +
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLR 100
Query: 92 QKKKE 96
+ ++E
Sbjct: 101 KLQEE 105
Score = 31.6 bits (72), Expect = 0.24
Identities = 15/75 (20%), Positives = 47/75 (62%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
+++ K ++E++++++++ K K + K+ + K E+K++ K++K+E+ + +E+
Sbjct: 30 DDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTP 89
Query: 84 QKKKEEKKQKKKEEK 98
+ + EK + +K ++
Sbjct: 90 EDELAEKLRLRKLQE 104
Score = 31.2 bits (71), Expect = 0.32
Identities = 18/65 (27%), Positives = 40/65 (61%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
EE ++K++EK + + K KK + K +++E+ ++KEEK ++ EE + + +K +
Sbjct: 40 EEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRL 99
Query: 103 KKEEK 107
+K ++
Sbjct: 100 RKLQE 104
Score = 30.8 bits (70), Expect = 0.40
Identities = 20/63 (31%), Positives = 38/63 (60%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
++K+EE+ K K K +K K K E+K+K K +KEEK ++ ++ + + E+ + +K
Sbjct: 42 DEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRK 101
Query: 79 KEE 81
+E
Sbjct: 102 LQE 104
Score = 30.0 bits (68), Expect = 0.71
Identities = 18/68 (26%), Positives = 37/68 (54%)
Query: 61 EKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
E + ++ K +EE+D++KEE+K K + K KK + K ++K++ K+ K+ + +
Sbjct: 27 EDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86
Query: 121 LYPNQKRT 128
P +
Sbjct: 87 DTPEDELA 94
Score = 29.2 bits (66), Expect = 1.1
Identities = 21/60 (35%), Positives = 38/60 (63%)
Query: 55 EKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
E E+ K ++++ +E+E++K K K K K+ K K +E++K K++KEEK ++ EE
Sbjct: 27 EDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86
Score = 29.2 bits (66), Expect = 1.2
Identities = 21/66 (31%), Positives = 38/66 (57%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
E+ EEKEE+K K + K KK + +++EK +++KEEK ++ EE + + EK
Sbjct: 40 EEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRL 99
Query: 111 KAEEQN 116
+ ++
Sbjct: 100 RKLQEE 105
Score = 28.9 bits (65), Expect = 1.8
Identities = 18/74 (24%), Positives = 41/74 (55%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
+ + K + E++ + K+ ++++EEK+++K + K K +K K K E+K+K
Sbjct: 13 APPAKAVVKDKWDDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKA 72
Query: 103 KKEEKKNKKAEEQN 116
K+E+++ E +
Sbjct: 73 KREKEEKGLRELEE 86
>gnl|CDD|212473 cd01726, LSm6, Like-Sm protein 6. The eukaryotic LSm proteins
(LSm2-8 or LSm1-7) assemble into a hetero-heptameric
ring around the 3'-terminus uridylation tag of the
gamma-methyl triphosphate (gamma-m-P3) capped U6 snRNA.
LSm2-8 form the core of the snRNP particle that, in
turn, assembles with other components onto the pre-mRNA
to form the spliceosome which is responsible for the
excision of introns and the ligation of exons. LSm1-7 is
involved in recognition of the 3' uridylation tag and
recruitment of the decapping machinery. LSm657 is
believed to be an assembly intermediate for both the
LSm1-7 and LSm2-8 rings. Members of this family share a
highly conserved Sm fold containing an N-terminal helix
followed by a strongly bent five-stranded antiparallel
beta-sheet.
Length = 68
Score = 32.5 bits (75), Expect = 0.017
Identities = 17/45 (37%), Positives = 22/45 (48%), Gaps = 7/45 (15%)
Query: 130 IGYLRSVDQFANLVLHKTIERI--HVGNQYGDIPRGIFIIRGENV 172
G L +D + NLVL T E + + +YGD IRG NV
Sbjct: 25 RGVLACLDGYMNLVLEDTEEYVDGQLVAKYGDA-----FIRGNNV 64
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 34.5 bits (80), Expect = 0.022
Identities = 19/55 (34%), Positives = 28/55 (50%), Gaps = 10/55 (18%)
Query: 59 KKEKKKEEKKQKKEEEK----------DKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
+K KKE K+ EK + +KE+KK KEEKK K+E+ K ++
Sbjct: 69 RKILKKEGGAIIKDLEKCDFTPIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
Score = 34.1 bits (79), Expect = 0.033
Identities = 13/28 (46%), Positives = 19/28 (67%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEE 58
+ +KE+KK KEEKK K+EK + +E
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
Score = 32.6 bits (75), Expect = 0.11
Identities = 24/56 (42%), Positives = 29/56 (51%), Gaps = 4/56 (7%)
Query: 29 KKQKKKEEKKQKKKEEK---KQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
+K KKE K EK + + EKE+KK KEEKK K EEKDK +E
Sbjct: 69 RKILKKEGGAIIKDLEKCDFTPIYEYFEAEKEKKKAMSKEEKKAIK-EEKDKLEEP 123
Score = 29.5 bits (67), Expect = 1.1
Identities = 18/55 (32%), Positives = 29/55 (52%)
Query: 37 KKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
+K KKE K +K + E + EK++KK K++KK K++K K E+
Sbjct: 69 RKILKKEGGAIIKDLEKCDFTPIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
Score = 27.9 bits (63), Expect = 3.2
Identities = 12/28 (42%), Positives = 18/28 (64%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKK 50
+ E+ KKK KEEKK K+E+ K ++
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
Score = 26.8 bits (60), Expect = 8.9
Identities = 21/67 (31%), Positives = 33/67 (49%), Gaps = 3/67 (4%)
Query: 18 RKKKKKEEENKKKQKKK---EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
RK KKE K +K + + EK++KK KEEK+ KE+K + ++
Sbjct: 69 RKILKKEGGAIIKDLEKCDFTPIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEPYGYCL 128
Query: 75 KDKKKEE 81
D +KE+
Sbjct: 129 VDGRKEK 135
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 34.9 bits (80), Expect = 0.023
Identities = 35/118 (29%), Positives = 57/118 (48%), Gaps = 18/118 (15%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKK---EEKKQKKEEE-- 74
KK+ + + K KQK K+ + +K K E + + +K EEKK+KKEEE
Sbjct: 453 LKKENDMLQTKLNSMVSAKQKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEEET 512
Query: 75 -----------KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK--NKKAEEQNRQQ 119
+++ E KQ K++ + + KK E K K+EE + K+A+E + Q
Sbjct: 513 AARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEAQELRKYQ 570
Score = 31.4 bits (71), Expect = 0.30
Identities = 28/95 (29%), Positives = 51/95 (53%), Gaps = 4/95 (4%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE--EE 74
KR K + + +++ EEKK+KK+EE+ + + +E+ E KQ K+ E
Sbjct: 483 KRLKSEADSRVNAEKQLAEEKKRKKEEEETAARAAA--QAAASREECAESLKQAKQDLEM 540
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
+ KK E + K+EE + +KE ++ +K +E +K
Sbjct: 541 EIKKLEHDLKLKEEECRMLEKEAQELRKYQESEKE 575
Score = 26.8 bits (59), Expect = 9.7
Identities = 16/68 (23%), Positives = 37/68 (54%), Gaps = 3/68 (4%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQ---KKKEEKKQKKKKEEK 107
+ K+E + + K K+++++ + EK+ K + + + K+ E+K++KK+EE+
Sbjct: 452 QLKKENDMLQTKLNSMVSAKQKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEEE 511
Query: 108 KNKKAEEQ 115
+A Q
Sbjct: 512 TAARAAAQ 519
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 34.9 bits (81), Expect = 0.023
Identities = 15/66 (22%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 48 KKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
++K ++ +E +K++ K+ KE+E ++ + K K+ K + K+ EK + + EE+
Sbjct: 240 EQKAERLRQEAAAYEKQQ-KELAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEARLAEER 298
Query: 108 KNKKAE 113
++ +
Sbjct: 299 PVEEGK 304
Score = 30.3 bits (69), Expect = 0.68
Identities = 16/80 (20%), Positives = 37/80 (46%), Gaps = 5/80 (6%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK-KEEKKQKKEEEKDKKKEEKKQKKKE 88
+QK + +++ EK+QK+ K++E + + + KK K K+ EK + +
Sbjct: 240 EQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAK----SRIKRLEKLEARLA 295
Query: 89 EKKQKKKEEKKQKKKKEEKK 108
E++ ++ + + K
Sbjct: 296 EERPVEEGKPLAFRFPPPGK 315
Score = 27.2 bits (61), Expect = 6.7
Identities = 12/75 (16%), Positives = 36/75 (48%), Gaps = 3/75 (4%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEK---KQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
++K E ++ ++++K+ KE++ + K K +K + + K+ E+ + + EE+
Sbjct: 240 EQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEARLAEERP 299
Query: 77 KKKEEKKQKKKEEKK 91
++ + +
Sbjct: 300 VEEGKPLAFRFPPPG 314
Score = 27.2 bits (61), Expect = 7.8
Identities = 15/72 (20%), Positives = 37/72 (51%), Gaps = 4/72 (5%)
Query: 40 KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKK 99
++K E+ ++ E ++++KE KE++ ++ + K KK K + ++ +K +
Sbjct: 240 EQKAERLRQ--EAAAYEKQQKELAKEQEWIRRGKAAASKA--KKAKSRIKRLEKLEARLA 295
Query: 100 QKKKKEEKKNKK 111
+++ EE K
Sbjct: 296 EERPVEEGKPLA 307
Score = 27.2 bits (61), Expect = 7.8
Identities = 18/76 (23%), Positives = 32/76 (42%), Gaps = 8/76 (10%)
Query: 17 KRKKKKKEEENKKKQKKKEEK-----KQKKKEEKKQKKKEKKEEKEEKK---EKKKEEKK 68
+ +++ E ++K+ KE++ K + KK K + K+ EK E + E+ EE K
Sbjct: 245 RLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEARLAEERPVEEGK 304
Query: 69 QKKEEEKDKKKEEKKQ 84
K K
Sbjct: 305 PLAFRFPPPGKRLGKL 320
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 34.9 bits (81), Expect = 0.024
Identities = 15/57 (26%), Positives = 34/57 (59%), Gaps = 1/57 (1%)
Query: 55 EKEEKKEKKKEEKKQKKEEEK-DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
E+E ++E ++E K+ + E+E + +K+ ++K K+ K+ K EE ++ E + +
Sbjct: 67 EEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123
Score = 33.8 bits (78), Expect = 0.051
Identities = 15/56 (26%), Positives = 34/56 (60%)
Query: 38 KQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQK 93
++ ++E +++ K+ + E++ + EKK EKK KK +K K +E ++ E + ++
Sbjct: 68 EELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123
Score = 33.0 bits (76), Expect = 0.093
Identities = 16/56 (28%), Positives = 34/56 (60%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
EE +++ +EE K+ + EK+ + EKK +K+ KK K+ + ++ E+ + + E+
Sbjct: 68 EELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123
Score = 32.6 bits (75), Expect = 0.13
Identities = 17/59 (28%), Positives = 34/59 (57%), Gaps = 1/59 (1%)
Query: 21 KKKEEENKKKQKK-KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
++ EE +++ K+ + EK+ + E+K +KK KK K+ K E+ E+ + E E+ +
Sbjct: 68 EELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEEPEP 126
Score = 31.9 bits (73), Expect = 0.19
Identities = 18/60 (30%), Positives = 34/60 (56%), Gaps = 5/60 (8%)
Query: 44 EKKQKKKEKKEEKEEKKEKK--KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQK 101
E++ +++ ++E KE + EK+ + EKK EK KK KK K +E ++ E + ++
Sbjct: 67 EEELREEYEEELKEYEAEKEIWEAEKKGL---EKKAKKAIKKGKDEEALAEELLELEAEE 123
Score = 31.9 bits (73), Expect = 0.21
Identities = 12/48 (25%), Positives = 27/48 (56%)
Query: 71 KEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+E ++ ++E K+ + ++E + +K+ ++K KK KK K E +
Sbjct: 68 EELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEE 115
Score = 31.5 bits (72), Expect = 0.32
Identities = 16/60 (26%), Positives = 36/60 (60%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
+E E + +++ +EE K+ + E++ + ++K EK+ KK KK + ++ EE + + E+
Sbjct: 64 EEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123
Score = 30.7 bits (70), Expect = 0.52
Identities = 15/51 (29%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 66 EKKQKKEEEKDKKK-EEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
E++ ++E E++ K+ E +K+ + EKK +K+ KK KK ++++ E
Sbjct: 67 EEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELL 117
Score = 30.3 bits (69), Expect = 0.68
Identities = 17/63 (26%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
++ ++E +EE KE + EK+ + E+K +K+ KK KK + ++ EE + + +E +
Sbjct: 68 EELREEYEEELKEYEA-EKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEEPEP 126
Query: 109 NKK 111
+
Sbjct: 127 PLR 129
Score = 30.3 bits (69), Expect = 0.77
Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 2/57 (3%)
Query: 60 KEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
+E ++E +++ KE E +K+ E ++K E K+ KK KK K ++ + E
Sbjct: 68 EELREEYEEELKEYEAEKEIWEAEKKGLE--KKAKKAIKKGKDEEALAEELLELEAE 122
Score = 29.5 bits (67), Expect = 1.3
Identities = 14/51 (27%), Positives = 27/51 (52%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEK 67
+ K+ + E+ + +KK +K+ KK KK K +E E+ + E ++ E
Sbjct: 76 EELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEEPEP 126
Score = 29.2 bits (66), Expect = 1.8
Identities = 13/48 (27%), Positives = 29/48 (60%)
Query: 72 EEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
EE +++ +EE K+ + E++ + +++ +KK K+ K K EE ++
Sbjct: 68 EELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEE 115
>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
Length = 1622
Score = 35.1 bits (81), Expect = 0.025
Identities = 18/61 (29%), Positives = 26/61 (42%)
Query: 47 QKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
QK E + EE E+ EE+ + + KK K+K KE K K+EE
Sbjct: 834 QKLMENAGKMEEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEE 893
Query: 107 K 107
+
Sbjct: 894 R 894
Score = 32.4 bits (74), Expect = 0.19
Identities = 17/61 (27%), Positives = 29/61 (47%), Gaps = 2/61 (3%)
Query: 40 KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQK--KKEE 97
+K E K +E EE E+++ + K + KK+ +KK +E K K+EE
Sbjct: 834 QKLMENAGKMEEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEE 893
Query: 98 K 98
+
Sbjct: 894 R 894
Score = 30.5 bits (69), Expect = 0.78
Identities = 14/54 (25%), Positives = 25/54 (46%), Gaps = 6/54 (11%)
Query: 41 KKEEKKQKKKEKKEEKEEKKEKKKEE--KKQKKEEEKDKKKEEK----KQKKKE 88
K EE ++ E+++++ K +K K K KE K KQ+++E
Sbjct: 842 KMEEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEERE 895
Score = 28.2 bits (63), Expect = 4.4
Identities = 17/59 (28%), Positives = 26/59 (44%), Gaps = 2/59 (3%)
Query: 63 KKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
K EE ++ EE+D + K KK K+K K+ + K EE R+ G+
Sbjct: 842 KMEEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEE--RETGV 898
Score = 28.2 bits (63), Expect = 4.7
Identities = 12/58 (20%), Positives = 22/58 (37%), Gaps = 5/58 (8%)
Query: 33 KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
K EE ++ EE+ + K K+ +K K+ + K+EE+
Sbjct: 842 KMEEYVEENGEEEDDQTSSK-----PVANGNANNLKKDSSSKKKSKEGKSVLIKQEER 894
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 34.2 bits (78), Expect = 0.026
Identities = 14/77 (18%), Positives = 38/77 (49%), Gaps = 2/77 (2%)
Query: 33 KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKK--QKKEEEKDKKKEEKKQKKKEEK 90
+ ++ E+ + + K K ++++ K +E + K E+KQ ++E++
Sbjct: 166 QPWPERLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQE 225
Query: 91 KQKKKEEKKQKKKKEEK 107
+++ +EE KQ++ +
Sbjct: 226 EEEVEEEAKQEEGQGTD 242
Score = 33.1 bits (75), Expect = 0.077
Identities = 18/56 (32%), Positives = 34/56 (60%), Gaps = 2/56 (3%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ + K KQ++ K Q+ +E K ++K+ +EE+EE E+ +EE KQ++ + D
Sbjct: 189 SVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEE--EEVEEEAKQEEGQGTD 242
Score = 32.7 bits (74), Expect = 0.10
Identities = 18/84 (21%), Positives = 42/84 (50%), Gaps = 1/84 (1%)
Query: 13 PRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
P I + + + E + +E + + K K ++E+ K +E ++ K
Sbjct: 157 PTITENQSFQPWPE-RLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKT 215
Query: 73 EEKDKKKEEKKQKKKEEKKQKKKE 96
EEK ++E+++++ +EE KQ++ +
Sbjct: 216 EEKQPQEEQEEEEVEEEAKQEEGQ 239
Score = 31.9 bits (72), Expect = 0.19
Identities = 18/74 (24%), Positives = 38/74 (51%), Gaps = 3/74 (4%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKK-EKKEE--KEEKKEKKKEEKKQKKEEE 74
+ EE + Q K + KQ++ K +E +E K E+K+ +++Q++EE
Sbjct: 170 ERLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEV 229
Query: 75 KDKKKEEKKQKKKE 88
+++ K+E+ Q +
Sbjct: 230 EEEAKQEEGQGTDD 243
Score = 30.8 bits (69), Expect = 0.38
Identities = 15/55 (27%), Positives = 33/55 (60%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
G + K + + ++ K +E Q+ K E+KQ ++E++EE+ E++ K++E +
Sbjct: 188 GSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTD 242
Score = 30.4 bits (68), Expect = 0.52
Identities = 20/112 (17%), Positives = 46/112 (41%), Gaps = 4/112 (3%)
Query: 7 PKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKK-- 64
P +SP K + E + E + + Q E+ E+ +
Sbjct: 126 PVSILSPNTLKEAEPSAEVQPTTMTLPIAEHPTITENQSFQPWPERLHNNVEELLQSSLS 185
Query: 65 --EEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+ K + K ++ K Q+ +E K ++K+ +++++++E ++ K EE
Sbjct: 186 LGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEE 237
Score = 29.6 bits (66), Expect = 1.1
Identities = 11/65 (16%), Positives = 30/65 (46%), Gaps = 4/65 (6%)
Query: 55 EKEEKKEKKKEEKKQK---KEEEKDKKKEEK-KQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
+ ++ E+ + + K K KQ++ K Q+ +E K ++K+ +++ +
Sbjct: 166 QPWPERLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQE 225
Query: 111 KAEEQ 115
+ E +
Sbjct: 226 EEEVE 230
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 34.5 bits (79), Expect = 0.027
Identities = 19/78 (24%), Positives = 42/78 (53%), Gaps = 2/78 (2%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
K + ++ K+K+K++ E QK KE++++K++EK+ E+K+ +
Sbjct: 153 KKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLS 212
Query: 76 --DKKKEEKKQKKKEEKK 91
D+ +EK+QK + ++
Sbjct: 213 TKDEPPKEKRQKHHDPER 230
Score = 32.1 bits (73), Expect = 0.15
Identities = 18/86 (20%), Positives = 47/86 (54%), Gaps = 2/86 (2%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
KK +++ +K+ ++K+K++ + +K ++ +KK+E+K++ +E+K +
Sbjct: 153 KKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLS 212
Query: 92 QKKK--EEKKQKKKKEEKKNKKAEEQ 115
K + +EK+QK E++ + +
Sbjct: 213 TKDEPPKEKRQKHHDPERRLEPQSHE 238
Score = 31.4 bits (71), Expect = 0.23
Identities = 16/91 (17%), Positives = 50/91 (54%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+K +KE ++K +E +++ ++K++++ E ++ +E + KK+EEK++ E+++
Sbjct: 141 EKFEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGG 200
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
+ + + ++K++K + ++
Sbjct: 201 GGGSSGGQSGLSTKDEPPKEKRQKHHDPERR 231
Score = 29.1 bits (65), Expect = 1.6
Identities = 20/90 (22%), Positives = 46/90 (51%), Gaps = 3/90 (3%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE 89
K +K EK+ + KK + +K ++K EK++ E QK +E D++K+++++++ +E
Sbjct: 138 KNAEKFEKECRALSRKKSDDEHRKRSGKQK-EKRRVEDSQKHKE--DRRKKQEEKRRNDE 194
Query: 90 KKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K+ + + ++ RQ+
Sbjct: 195 DKRPGGGGGSSGGQSGLSTKDEPPKEKRQK 224
>gnl|CDD|221324 pfam11933, DUF3451, Domain of unknown function (DUF3451). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 199 to 238 amino acids in length. This domain
is found associated with pfam06512, pfam00520. This
domain has a conserved ADD sequence motif.
Length = 222
Score = 34.0 bits (78), Expect = 0.034
Identities = 15/54 (27%), Positives = 24/54 (44%)
Query: 39 QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQ 92
+ KE + + K KKEE E+ EK+ K E +K+ +K + Q
Sbjct: 9 ESAKERRNRNDKNKKEEHSIGSEEGDSEKEPKSESADGRKRCRFLLRKTRKVSQ 62
Score = 27.1 bits (60), Expect = 7.1
Identities = 12/44 (27%), Positives = 21/44 (47%), Gaps = 2/44 (4%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK 63
+ KE N+ + KKEE E+ +KE K E + +++
Sbjct: 9 ESAKERRNRNDKNKKEEHSIG--SEEGDSEKEPKSESADGRKRC 50
Score = 26.7 bits (59), Expect = 8.8
Identities = 13/59 (22%), Positives = 24/59 (40%), Gaps = 2/59 (3%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKE--EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
K + K+ K KK+E ++ + EK+ K E +K+ +K + Q
Sbjct: 4 SKLSSESAKERRNRNDKNKKEEHSIGSEEGDSEKEPKSESADGRKRCRFLLRKTRKVSQ 62
>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807). This is
a family of conserved fungal proteins of unknown
function.
Length = 169
Score = 33.6 bits (77), Expect = 0.034
Identities = 14/83 (16%), Positives = 37/83 (44%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
R+++ KEE +++ + + + K+ E++ +K+K + ++++ KD
Sbjct: 67 RERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKDP 126
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQ 100
+ + K + EE+ Q
Sbjct: 127 RGGTQDVVDKSQASLDYGEEETQ 149
>gnl|CDD|223095 COG0016, PheS, Phenylalanyl-tRNA synthetase alpha subunit
[Translation, ribosomal structure and biogenesis].
Length = 335
Score = 34.1 bits (79), Expect = 0.035
Identities = 17/84 (20%), Positives = 30/84 (35%), Gaps = 5/84 (5%)
Query: 17 KRKKKKKEEENKKK---QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE--EKKQKK 71
K + + K + K + +K KK K EE+KE E K++
Sbjct: 12 KGIAAIELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLEERKEVGALINELKKEV 71
Query: 72 EEEKDKKKEEKKQKKKEEKKQKKK 95
E+ + E + E+ +K
Sbjct: 72 EDAITELTPELEAAGLWERLAFEK 95
Score = 29.5 bits (67), Expect = 1.3
Identities = 10/57 (17%), Positives = 23/57 (40%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKK 64
KG ++ + K K EE K+ E K++ ++ + E + ++ +
Sbjct: 38 KGVLTDLLKKLGKLSPLEERKEVGALINELKKEVEDAITELTPELEAAGLWERLAFE 94
Score = 28.7 bits (65), Expect = 2.4
Identities = 17/91 (18%), Positives = 29/91 (31%), Gaps = 5/91 (5%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
KK E K ++ K K ++ KK K EE K+ + KK+
Sbjct: 11 KKGIAAIELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLEERKEVGALINELKKE 70
Query: 88 -----EEKKQKKKEEKKQKKKKEEKKNKKAE 113
E + + ++ EK +
Sbjct: 71 VEDAITELTPELEAAGLWERLAFEKIDVTLP 101
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 34.4 bits (79), Expect = 0.036
Identities = 23/111 (20%), Positives = 56/111 (50%)
Query: 4 SIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK 63
+ +G S + K+ ++ N K++ K+KK++++ + K+E +EE ++
Sbjct: 193 TEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRD 252
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+ + E + E++ + K ++ +EE ++K+K++ K+ KK E
Sbjct: 253 VILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMME 303
Score = 34.0 bits (78), Expect = 0.048
Identities = 22/114 (19%), Positives = 55/114 (48%)
Query: 2 KISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE 61
K S+ K + K K+KK++++ + +++ +++ K + + E
Sbjct: 199 KTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDE 258
Query: 62 KKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+ + E+E + K ++ +EE ++K+KE++K+ KK E +++ E +
Sbjct: 259 SAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEME 312
Score = 29.8 bits (67), Expect = 1.1
Identities = 23/104 (22%), Positives = 42/104 (40%)
Query: 13 PRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
K K + + + K +K Q +E +K E K + K+ KK
Sbjct: 159 NGKRPSSKPPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIM 218
Query: 73 EEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
KKK ++K++KKE + KEE +++ K + + +
Sbjct: 219 SSFFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEP 262
Score = 29.8 bits (67), Expect = 1.1
Identities = 24/115 (20%), Positives = 43/115 (37%), Gaps = 5/115 (4%)
Query: 11 ISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
+ R G + + K+ K K E K + +K + +E +
Sbjct: 135 VKRRTGVGLPPVAPAASPALKPTANGKRPSSKPPKSIMSPEVKVKSAKKTQDTSKETTTE 194
Query: 71 KEEEKDKKKEEKKQKKKEEKKQ-----KKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
K E K K ++ +K KK+ K++K+KKE ++ EE + G
Sbjct: 195 KTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESG 249
Score = 28.6 bits (64), Expect = 2.4
Identities = 19/104 (18%), Positives = 57/104 (54%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
K K+KK+++E + K+E +++ K + + + + ++ E + E K + +
Sbjct: 224 KKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDS 283
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+++ EEK+++K++ K+ ++E + ++ + ++ EE++ +
Sbjct: 284 EEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEP 327
Score = 28.3 bits (63), Expect = 3.0
Identities = 19/97 (19%), Positives = 40/97 (41%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
+ K + K + KE +K + + K K +K K+K +E+K+KK+
Sbjct: 175 EVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKE 234
Query: 80 EEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
+ K+E +++ K + + + E +E
Sbjct: 235 ASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDE 271
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 34.3 bits (78), Expect = 0.042
Identities = 33/90 (36%), Positives = 50/90 (55%), Gaps = 5/90 (5%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE----EKKQKKEEEKDK 77
+KE E + + K E + K+ K K K K++ EE++EKK + KQKK +K K
Sbjct: 503 QKELELEAQGIKYSETSEADKDVNKSKNK-KRKVDEEEEEKKLKMIMMSNKQKKLYKKMK 561
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
KK+++ E K+KKK+ KQKK +K
Sbjct: 562 YSNAKKEEQAENLKKKKKQIAKQKKLDSKK 591
Score = 29.7 bits (66), Expect = 1.1
Identities = 22/109 (20%), Positives = 52/109 (47%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ ++ EE+ + + + EE +E + +KE + E + + + E +KD
Sbjct: 467 ETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVN 526
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKR 127
K + K++K +E++++KK + K++K KK + N ++ +
Sbjct: 527 KSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLK 575
Score = 29.7 bits (66), Expect = 1.4
Identities = 23/108 (21%), Positives = 52/108 (48%), Gaps = 2/108 (1%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ + E ++ +E+ Q +KE + + + K E E + + K +K++ D++
Sbjct: 481 EDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKV--DEE 538
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQK 126
+EEKK K ++KK KK K +K+ + + +++ + +K
Sbjct: 539 EEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIAKQKK 586
Score = 27.7 bits (61), Expect = 5.3
Identities = 26/71 (36%), Positives = 39/71 (54%), Gaps = 2/71 (2%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
K + KK+K EE+++KK + K+KK K+ K K+E ++ E K KKK+
Sbjct: 523 KDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKE--EQAENLKKKKKQ 580
Query: 81 EKKQKKKEEKK 91
KQKK + KK
Sbjct: 581 IAKQKKLDSKK 591
Score = 27.7 bits (61), Expect = 5.6
Identities = 21/62 (33%), Positives = 36/62 (58%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+K+K EEE +KK K ++KK KK K K+E++ + KKK+++ K+++
Sbjct: 530 NKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIAKQKKLDS 589
Query: 77 KK 78
KK
Sbjct: 590 KK 591
>gnl|CDD|212462 cd00600, Sm_like, Sm and related proteins. The eukaryotic Sm and
Sm-like (LSm) proteins associate with RNA to form the
core domain of the ribonucleoprotein particles involved
in a variety of RNA processing events including pre-mRNA
splicing, telomere replication, and mRNA degradation.
Members of this family share a highly conserved Sm fold
containing an N-terminal helix followed by a strongly
bent five-stranded antiparallel beta-sheet. Sm-like
proteins exist in archaea as well as prokaryotes that
form heptameric and hexameric ring structures similar to
those found in eukaryotes.
Length = 63
Score = 31.4 bits (72), Expect = 0.046
Identities = 17/52 (32%), Positives = 25/52 (48%), Gaps = 3/52 (5%)
Query: 124 NQKRTLIGYLRSVDQFANLVLHKTIERIHVGNQYGDIPRGIFIIRGENVVLM 175
R L G L + D++ NLVL +E G G+ +IRG N+V +
Sbjct: 14 KDGRVLTGTLVAFDKYMNLVLDDVVETGRDGKV---RVLGLVLIRGSNIVSI 62
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 33.3 bits (77), Expect = 0.055
Identities = 18/76 (23%), Positives = 47/76 (61%), Gaps = 1/76 (1%)
Query: 33 KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK-DKKKEEKKQKKKEEKK 91
+K + ++ K E +Q+ K+ +EEKEE +++ E + + + EK ++++ + ++K+ ++
Sbjct: 113 RKALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEI 172
Query: 92 QKKKEEKKQKKKKEEK 107
K++ +Q K + E+
Sbjct: 173 AFLKKQNQQLKSQLEQ 188
Score = 31.0 bits (71), Expect = 0.25
Identities = 15/78 (19%), Positives = 48/78 (61%), Gaps = 2/78 (2%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKK--KEKKEEKEEKKEKKKEEKKQKKEE 73
G RK + E+ + +++ ++ +++K+E +K+ + K E E+++E++++ ++++ +
Sbjct: 111 GMRKALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHAD 170
Query: 74 EKDKKKEEKKQKKKEEKK 91
E K++ +Q K + ++
Sbjct: 171 EIAFLKKQNQQLKSQLEQ 188
Score = 30.6 bits (70), Expect = 0.37
Identities = 17/77 (22%), Positives = 46/77 (59%)
Query: 29 KKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKE 88
+K + E+ K + ++E K+ ++EK+E ++ E + + + +K EE++++ EEK+ +
Sbjct: 113 RKALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEI 172
Query: 89 EKKQKKKEEKKQKKKKE 105
+K+ ++ K + ++
Sbjct: 173 AFLKKQNQQLKSQLEQI 189
Score = 28.7 bits (65), Expect = 1.7
Identities = 15/71 (21%), Positives = 44/71 (61%), Gaps = 2/71 (2%)
Query: 48 KKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
+K + E+ + + E++ ++ +++KEE + + E + K E +++++EE++ ++K+
Sbjct: 113 RKALQAEQGKSELEQEIKKLEEEKEELEKRVAE--LEAKLEAIEKREEEERQIEEKRHAD 170
Query: 108 KNKKAEEQNRQ 118
+ ++QN+Q
Sbjct: 171 EIAFLKKQNQQ 181
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 33.2 bits (76), Expect = 0.060
Identities = 22/75 (29%), Positives = 46/75 (61%)
Query: 41 KKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
+KE K K + +E+ ++ E K+E+K ++++ E ++ ++E+ + EE + K EE K
Sbjct: 2 EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKD 61
Query: 101 KKKKEEKKNKKAEEQ 115
+ K +++NKK E +
Sbjct: 62 ENNKLKEENKKLENE 76
Score = 28.6 bits (64), Expect = 1.9
Identities = 18/82 (21%), Positives = 47/82 (57%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
+K+ ++ + ++ K+ + KEE K K+++ + E+ EK+E ++ ++ + + ++ K
Sbjct: 2 EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKD 61
Query: 80 EEKKQKKKEEKKQKKKEEKKQK 101
E K K++ +K + + E K +
Sbjct: 62 ENNKLKEENKKLENELEALKDR 83
>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
[DNA replication, recombination, and repair].
Length = 417
Score = 33.5 bits (77), Expect = 0.062
Identities = 20/75 (26%), Positives = 39/75 (52%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
RKK K E K+ +K++E KK+ K+KE + + KK ++++ + E
Sbjct: 343 ARKKVKTAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFI 402
Query: 77 KKKEEKKQKKKEEKK 91
K +E K+ + ++K+
Sbjct: 403 KLEELKRLENGKQKR 417
Score = 30.4 bits (69), Expect = 0.63
Identities = 16/74 (21%), Positives = 37/74 (50%), Gaps = 2/74 (2%)
Query: 28 KKKQKKKEEKKQ--KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQK 85
+KK K E + +K++E KK+ K ++ K + KK ++++ + + K
Sbjct: 344 RKKVKTAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIK 403
Query: 86 KKEEKKQKKKEEKK 99
+E K+ + ++K+
Sbjct: 404 LEELKRLENGKQKR 417
Score = 27.0 bits (60), Expect = 8.0
Identities = 15/65 (23%), Positives = 30/65 (46%), Gaps = 2/65 (3%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ--KKKKEE 106
++ +++E KK+ K++E K + +K +++Q K E K K+ E
Sbjct: 353 AIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLEN 412
Query: 107 KKNKK 111
K K+
Sbjct: 413 GKQKR 417
>gnl|CDD|233063 TIGR00631, uvrb, excinuclease ABC, B subunit. All proteins in this
family for wich functions are known are DNA helicases
that function in the nucleotide excision repair and are
endonucleases that make the 3' incision next to DNA
damage. They are part of a pathway requiring UvrA, UvrB,
UvrC, and UvrD homologs. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 655
Score = 33.8 bits (78), Expect = 0.062
Identities = 16/70 (22%), Positives = 38/70 (54%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+R++K + N++ + ++ ++ + KEK++ ++KK+ + KKE +K
Sbjct: 565 ERRRKIQMAYNEEHGITPQTIRKPIRDILDIELKEKEDAAKKKKKGEDLSDLSKKELKKL 624
Query: 77 KKKEEKKQKK 86
K+ EK+ K+
Sbjct: 625 IKQLEKEMKQ 634
Score = 29.2 bits (66), Expect = 1.9
Identities = 20/71 (28%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
Query: 44 EKKQKKKEKKEEKEEKKEK--KKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQK 101
E+++K + E+ + +K + E K+K+ KK+KK E+ K+E K+
Sbjct: 565 ERRRKIQMAYNEEHGITPQTIRKPIRDILDIELKEKEDAAKKKKKGEDLSDLSKKELKKL 624
Query: 102 KKKEEKKNKKA 112
K+ EK+ K+A
Sbjct: 625 IKQLEKEMKQA 635
Score = 28.0 bits (63), Expect = 4.4
Identities = 15/72 (20%), Positives = 30/72 (41%)
Query: 45 KKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
K Q ++ + +K E ++ + +KK+K ++ KKE KK K+
Sbjct: 569 KIQMAYNEEHGITPQTIRKPIRDILDIELKEKEDAAKKKKKGEDLSDLSKKELKKLIKQL 628
Query: 105 EEKKNKKAEEQN 116
E++ + A
Sbjct: 629 EKEMKQAARNLE 640
>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 486
Score = 33.6 bits (77), Expect = 0.065
Identities = 29/117 (24%), Positives = 58/117 (49%), Gaps = 4/117 (3%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
I + K+ ++E K++ +KKEE+K++ +K + K E K K E K+++ +
Sbjct: 359 PISELLKEGSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGK 418
Query: 74 EKDKKKEEKKQKKKEEKKQKKK----EEKKQKKKKEEKKNKKAEEQNRQQGLYPNQK 126
K ++ ++ E K KK E ++ KK++EK+ ++ E + L+ K
Sbjct: 419 ITIKVEKSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPEFVEKVLELFKGSK 475
Score = 30.9 bits (70), Expect = 0.44
Identities = 27/91 (29%), Positives = 42/91 (46%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
+ + E K + + K++E K K EK EE E K +K E ++ KK
Sbjct: 393 LQIIKNEGKIISAILKNAEIKEEEGKITIKVEKSEEDTLDLEIKSIKKYFPFIEFEEVKK 452
Query: 80 EEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
E++K+K++ E +K E K K KK K
Sbjct: 453 EKEKEKERPEFVEKVLELFKGSKIITYKKKK 483
Score = 29.8 bits (67), Expect = 0.99
Identities = 29/90 (32%), Positives = 43/90 (47%), Gaps = 2/90 (2%)
Query: 5 IAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKK 64
I +G I I K + K+EE + +K E+ E K KK E EE K++K
Sbjct: 396 IKNEGKIISAILKNAEIKEEEGKITIKVEKSEEDTLDLEIKSIKKYFPFIEFEEVKKEK- 454
Query: 65 EEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
EK++++ E +K E K K K+KK
Sbjct: 455 -EKEKERPEFVEKVLELFKGSKIITYKKKK 483
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 33.5 bits (77), Expect = 0.067
Identities = 21/123 (17%), Positives = 48/123 (39%), Gaps = 7/123 (5%)
Query: 10 AISPRIGKRKKKKKEEENKKKQKKKE--EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEK 67
A+ + K + + +KKE + + + + K++ ++K+E EE K + +
Sbjct: 46 ALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKEEIEQKRERIEELKRALAQRR 105
Query: 68 KQKKEEEKDKKKEEKKQKKKEEKKQKKKEEK--KQKKKKEEKKNKKAEEQNRQQGLYPNQ 125
+K Q +K + + K+ K EK++ E + L+P +
Sbjct: 106 SDLSSASYQLEKRRASQLEKLQDEIKRTRSKLNALHSLLAEKRSFLCRELAK---LFPLR 162
Query: 126 KRT 128
+
Sbjct: 163 RVI 165
Score = 32.3 bits (74), Expect = 0.14
Identities = 16/100 (16%), Positives = 44/100 (44%), Gaps = 1/100 (1%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
R + EE +K ++ E + + K + K+E+ + + Q KEE +
Sbjct: 30 ARLLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKEEI-E 88
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
+K+E ++ K+ +++ + ++ + ++ + Q+
Sbjct: 89 QKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQD 128
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 32.8 bits (75), Expect = 0.070
Identities = 18/58 (31%), Positives = 30/58 (51%)
Query: 57 EEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
E + E K E K++KE + + KK++ + KK KK + EKK + + E +K
Sbjct: 96 EAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAA 153
Score = 32.1 bits (73), Expect = 0.11
Identities = 24/90 (26%), Positives = 41/90 (45%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
+ K + KEEK+ K +K + K KK K+ E +K+ + + E +KK E
Sbjct: 97 AQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAA 156
Query: 84 QKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
EE ++EE ++ +E + AE
Sbjct: 157 AVAAEEAAAAEEEEAEEAPAEEAPAEESAE 186
Score = 31.7 bits (72), Expect = 0.15
Identities = 19/73 (26%), Positives = 32/73 (43%), Gaps = 1/73 (1%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE-EKKQKKKEEKKQKKKEEKKQK 101
+ + K + KEEKE K KK++ + K+ K E EKK + + +K+ +
Sbjct: 97 AQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAA 156
Query: 102 KKKEEKKNKKAEE 114
E+ EE
Sbjct: 157 AVAAEEAAAAEEE 169
Score = 30.5 bits (69), Expect = 0.38
Identities = 16/60 (26%), Positives = 27/60 (45%)
Query: 60 KEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
KE+K+ + KK++ KK KK + EKK + + +KK + A E+
Sbjct: 107 KEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAA 166
Score = 27.8 bits (62), Expect = 3.2
Identities = 19/84 (22%), Positives = 40/84 (47%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
+ + E K + KEEK+ K KK++ + K+ K+ + +++ + + E +KK E
Sbjct: 96 EAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEA 155
Query: 91 KQKKKEEKKQKKKKEEKKNKKAEE 114
EE +++E ++ E
Sbjct: 156 AAVAAEEAAAAEEEEAEEAPAEEA 179
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 33.1 bits (76), Expect = 0.075
Identities = 10/46 (21%), Positives = 30/46 (65%)
Query: 58 EKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
+K+K++EE++ ++ E + E+ + K++ K+++K E ++ + +
Sbjct: 119 AQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEETRHR 164
Score = 30.1 bits (68), Expect = 0.65
Identities = 14/46 (30%), Positives = 30/46 (65%), Gaps = 1/46 (2%)
Query: 73 EEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK-KEEKKNKKAEEQNR 117
+K K++EE++ +++ E + E+ + K+K KEE+K + E ++R
Sbjct: 119 AQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEETRHR 164
Score = 28.5 bits (64), Expect = 2.3
Identities = 11/58 (18%), Positives = 33/58 (56%)
Query: 61 EKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
E + + ++Q + + +K+EE+++ ++ + + E+ + K+K +++ K E+ R
Sbjct: 106 EIRSDVRRQLRFLAQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEETRH 163
Score = 28.5 bits (64), Expect = 2.4
Identities = 12/47 (25%), Positives = 29/47 (61%)
Query: 66 EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
+KQK+EEE+ ++ + + E+ + K++ K+++K + E+ +A
Sbjct: 119 AQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEETRHRA 165
Score = 28.1 bits (63), Expect = 3.5
Identities = 14/51 (27%), Positives = 31/51 (60%), Gaps = 6/51 (11%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
KQK++EE++ +++ E E+ E+ + K+K +++QK E E+ + +
Sbjct: 121 KQKEEEERRVERRREL------GLEDPEQLRLKQKAKEEQKAESEETRHRA 165
Score = 27.4 bits (61), Expect = 4.9
Identities = 11/45 (24%), Positives = 31/45 (68%), Gaps = 1/45 (2%)
Query: 51 EKKEEKEEKKEKKKEEK-KQKKEEEKDKKKEEKKQKKKEEKKQKK 94
+K++E+EE++ +++ E + E+ + K+K +++QK + E+ + +
Sbjct: 120 QKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEETRHR 164
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 33.6 bits (77), Expect = 0.076
Identities = 20/81 (24%), Positives = 32/81 (39%)
Query: 33 KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQ 92
K + K KK++ + K + +EE KK +K QKK +K
Sbjct: 402 KSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSS 461
Query: 93 KKKEEKKQKKKKEEKKNKKAE 113
K + K KKE K+++
Sbjct: 462 KVPSDSKAGGKKESVKSQEDN 482
Score = 28.6 bits (64), Expect = 3.1
Identities = 21/81 (25%), Positives = 32/81 (39%), Gaps = 6/81 (7%)
Query: 22 KKEEENKKK----QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
K N+ K KK+K K + + +E+ KK +K QKK +K
Sbjct: 402 KSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSS 461
Query: 78 KK--EEKKQKKKEEKKQKKKE 96
K + K KKE K ++
Sbjct: 462 KVPSDSKAGGKKESVKSQEDN 482
Score = 27.1 bits (60), Expect = 8.3
Identities = 23/86 (26%), Positives = 34/86 (39%), Gaps = 7/86 (8%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKK-----KEEKKQKKKEEKKQKKKEKKEEK--EEKK 60
+ K KKKK + K +E+ KK +K QKK K K + K
Sbjct: 409 SSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSK 468
Query: 61 EKKKEEKKQKKEEEKDKKKEEKKQKK 86
K+E + +E+ + EE KK
Sbjct: 469 AGGKKESVKSQEDNNNIPPEEWVMKK 494
Score = 27.1 bits (60), Expect = 9.8
Identities = 21/76 (27%), Positives = 33/76 (43%), Gaps = 2/76 (2%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
KKKK + + K + ++ KK KK +KK +K K + K ++E K
Sbjct: 421 KKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKK--GRDKSSKVPSDSKAGGKKESVKS 478
Query: 79 KEEKKQKKKEEKKQKK 94
+E+ EE KK
Sbjct: 479 QEDNNNIPPEEWVMKK 494
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 33.4 bits (77), Expect = 0.083
Identities = 13/62 (20%), Positives = 31/62 (50%)
Query: 11 ISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
++ I + + E+ +K +K+ ++ +KK + K +E E++KEK E + +
Sbjct: 806 LAGLIDLAAELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKL 865
Query: 71 KE 72
+
Sbjct: 866 AK 867
Score = 30.3 bits (69), Expect = 0.79
Identities = 18/74 (24%), Positives = 30/74 (40%), Gaps = 4/74 (5%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
+ + + + EK + +K EK+ + EKK E K EE E+
Sbjct: 798 GGAELFLPLAGLIDLAAELARLEK-ELEKLEKEIDRIEKKLSNEGFVAKAPEEVV---EK 853
Query: 74 EKDKKKEEKKQKKK 87
EK+K E + + K
Sbjct: 854 EKEKLAEYQVKLAK 867
Score = 28.4 bits (64), Expect = 3.2
Identities = 13/68 (19%), Positives = 33/68 (48%), Gaps = 2/68 (2%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
E + +K+ E K +K+ ++ +KK + + E+ E++K+K E + K + +
Sbjct: 812 LAAELARLEKELE--KLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLAKLE 869
Query: 83 KQKKKEEK 90
++ +
Sbjct: 870 ERLAVLKA 877
Score = 27.6 bits (62), Expect = 6.6
Identities = 17/56 (30%), Positives = 28/56 (50%), Gaps = 2/56 (3%)
Query: 46 KQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQK 101
+ + +KE ++ +KE + EKK E K EE +K+KE K + + K K
Sbjct: 814 AELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKE--KLAEYQVKLAK 867
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 33.3 bits (76), Expect = 0.089
Identities = 18/81 (22%), Positives = 32/81 (39%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ K ++ + KK ++ KK K+ K+ E + ++ + + E
Sbjct: 1179 REKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVV 1238
Query: 77 KKKEEKKQKKKEEKKQKKKEE 97
K K KKK K+KEE
Sbjct: 1239 KPKGRAGAKKKAPAAAKEKEE 1259
Score = 32.9 bits (75), Expect = 0.13
Identities = 19/83 (22%), Positives = 34/83 (40%), Gaps = 1/83 (1%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K ++ E+ +K + +K +K KK + E EE E + E K
Sbjct: 1181 KLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKP 1240
Query: 77 KKKEEKKQKKKEEKKQKKKEEKK 99
K + +KK ++K+EE +
Sbjct: 1241 KGR-AGAKKKAPAAAKEKEEEDE 1262
Score = 32.9 bits (75), Expect = 0.13
Identities = 23/95 (24%), Positives = 40/95 (42%), Gaps = 7/95 (7%)
Query: 19 KKKKKEEENKKKQKKKEEKKQ-----KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
K+ K EE ++K ++ + + K + +K KK K+ + + EE
Sbjct: 1170 KEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAM 1229
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
E + E K K + KKK K+K+EE +
Sbjct: 1230 ETENVAEVVKPKGRA--GAKKKAPAAAKEKEEEDE 1262
Score = 29.1 bits (65), Expect = 2.4
Identities = 26/97 (26%), Positives = 42/97 (43%), Gaps = 5/97 (5%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQ---KKKEKKEEKEEKKEKK--KEEKKQ 69
IG +K +E + K E + KK K K + E++ +K +K+ K E+ +
Sbjct: 1120 IGTLTLEKVQELCADRDKLNIEVEDLKKTTPKSLWLKDLDALEKELDKLDKEDAKAEEAR 1179
Query: 70 KKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
+K + + E KK + KK KK KK E
Sbjct: 1180 EKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASE 1216
Score = 29.1 bits (65), Expect = 2.5
Identities = 22/102 (21%), Positives = 44/102 (43%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
R K E E+ KK K + +K+ K KE+ + ++ ++K ++ + E
Sbjct: 1132 CADRDKLNIEVEDLKKTTPKSLWLKDLDALEKELDKLDKEDAKAEEAREKLQRAAARGES 1191
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
KK ++ KK K+ K+ + + +E + E +N
Sbjct: 1192 GAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETEN 1233
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 32.6 bits (75), Expect = 0.090
Identities = 13/28 (46%), Positives = 21/28 (75%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEE 58
+++KE+KK KEEKK K+EK++ +E
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
Score = 32.2 bits (74), Expect = 0.13
Identities = 20/58 (34%), Positives = 32/58 (55%), Gaps = 10/58 (17%)
Query: 56 KEEKKEKKKEEKKQKKEEEK----------DKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
K+ +K KEEK K+ K +++KE+KK KEEKK K+E++K ++
Sbjct: 65 KDFRKILTKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
Score = 30.7 bits (70), Expect = 0.35
Identities = 21/58 (36%), Positives = 34/58 (58%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
K+ +K KEEK KK K + + +EEK++KK K++KK K++K+K E+
Sbjct: 65 KDFRKILTKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
Score = 27.6 bits (62), Expect = 3.9
Identities = 13/28 (46%), Positives = 20/28 (71%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKK 50
+EE+ KKK KEEKK K+E++K ++
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
>gnl|CDD|225123 COG2213, MtlA, Phosphotransferase system, mannitol-specific IIBC
component [Carbohydrate transport and metabolism].
Length = 472
Score = 33.1 bits (76), Expect = 0.091
Identities = 11/52 (21%), Positives = 22/52 (42%)
Query: 10 AISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE 61
+S I K K + E++ + K +E K + K + + K + K+
Sbjct: 329 VVSSLILKTSKTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSNVKK 380
Score = 29.2 bits (66), Expect = 1.6
Identities = 7/42 (16%), Positives = 14/42 (33%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ 69
K + + E K +E K + K + + K+
Sbjct: 339 KTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSNVKK 380
Score = 28.4 bits (64), Expect = 2.7
Identities = 9/42 (21%), Positives = 16/42 (38%)
Query: 50 KEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
K + E+ E K +E K + + + + K KK
Sbjct: 339 KTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSNVKK 380
Score = 28.4 bits (64), Expect = 2.8
Identities = 6/39 (15%), Positives = 14/39 (35%)
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
K + + + E K +E K + K + + +
Sbjct: 336 KTSKTESEDDLEAATAKMKEMKGESKGQSSALAADVKND 374
Score = 27.7 bits (62), Expect = 5.4
Identities = 8/44 (18%), Positives = 16/44 (36%), Gaps = 2/44 (4%)
Query: 70 KKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
K + + + E K +E K + K + + KN +
Sbjct: 336 KTSKTESEDDLEAATAKMKEMKGESK--GQSSALAADVKNDLSN 377
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 32.8 bits (75), Expect = 0.091
Identities = 20/95 (21%), Positives = 59/95 (62%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
R+K ++ + + + KKE + K E+ +EE K++ +E + +K ++++ K
Sbjct: 139 RQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQK 198
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
++EE+++++K++++++++++K ++ ++ KN K
Sbjct: 199 REEERRKQRKKQQEEEERKQKAEEAWQKWMKNVKQ 233
Score = 32.8 bits (75), Expect = 0.10
Identities = 20/98 (20%), Positives = 61/98 (62%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
R++ KE+ + ++K ++ +++ + K++ E K + Q++ +++
Sbjct: 125 LRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRL 184
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
++ E KK K++++K+++++ ++++K+++EE++ +KAEE
Sbjct: 185 QEWELKKLKQQQQKREEERRKQRKKQQEEEERKQKAEE 222
Score = 32.0 bits (73), Expect = 0.16
Identities = 21/101 (20%), Positives = 55/101 (54%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
R K+ + E + ++ + K+ + +K+ +K +E++++++EK++EE + ++ K+K
Sbjct: 74 RSLKEVKLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEK 133
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+E +QK ++ KQ+ + KK+ + + + R
Sbjct: 134 YEEWCRQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPERN 174
Score = 31.2 bits (71), Expect = 0.34
Identities = 28/121 (23%), Positives = 71/121 (58%), Gaps = 18/121 (14%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKK-------- 68
+++KK ++ ++KQK++ EK++++ E +++ KEK EE +K ++ +++
Sbjct: 98 AQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEA 157
Query: 69 ----------QKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
K E ++E KK+ ++ E K+ K++++K+++++ +++ K+ EE+ R+
Sbjct: 158 AESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERK 217
Query: 119 Q 119
Q
Sbjct: 218 Q 218
Score = 29.7 bits (67), Expect = 0.89
Identities = 21/92 (22%), Positives = 51/92 (55%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
KQ ++++K QK EEK+++++EK+ E+ E +++ +EK ++ +K ++ +++ K
Sbjct: 94 SAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKH 153
Query: 88 EEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+++ + K E+ + E + R Q
Sbjct: 154 KKEAAESASSSLSGSAKPERNVSQEEAKKRLQ 185
Score = 28.2 bits (63), Expect = 3.1
Identities = 22/68 (32%), Positives = 47/68 (69%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
K ++ +++ KK+ ++ + KK K+++++ ++E++K+ KKQ++EEE+ +K EE QK
Sbjct: 169 AKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQKWM 228
Query: 88 EEKKQKKK 95
+ KQ+ K
Sbjct: 229 KNVKQRPK 236
Score = 27.4 bits (61), Expect = 5.4
Identities = 25/100 (25%), Positives = 55/100 (55%), Gaps = 3/100 (3%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
+ KE K +++ +E + K+ + +KK +K E+KQK+E EK++++ E +Q+ +EK
Sbjct: 74 RSLKEVKLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEK 133
Query: 91 KQ---KKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKR 127
+ ++K ++ K++ + K + AE + +R
Sbjct: 134 YEEWCRQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPER 173
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 32.8 bits (75), Expect = 0.096
Identities = 18/73 (24%), Positives = 40/73 (54%), Gaps = 8/73 (10%)
Query: 17 KRKKKKKEEENKKKQKKK--------EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKK 68
KK ++ K K+K E+ +K +K+ K+ + E+ +E+ EK +EE++
Sbjct: 121 GINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEE 180
Query: 69 QKKEEEKDKKKEE 81
+++EE++D ++
Sbjct: 181 EEEEEDEDFDDDD 193
Score = 29.7 bits (67), Expect = 0.96
Identities = 13/62 (20%), Positives = 35/62 (56%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
+K ++ K + K E++E+ +EK +++ K+ + E+ D++ E+ +++++
Sbjct: 121 GINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEE 180
Query: 88 EE 89
EE
Sbjct: 181 EE 182
Score = 29.0 bits (65), Expect = 1.6
Identities = 22/63 (34%), Positives = 39/63 (61%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
NKK KK K K+K +++E +EK EKK +E + + +E+D+K EE+++++
Sbjct: 123 NKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEE 182
Query: 87 KEE 89
+EE
Sbjct: 183 EEE 185
Score = 27.4 bits (61), Expect = 5.4
Identities = 17/75 (22%), Positives = 39/75 (52%), Gaps = 4/75 (5%)
Query: 6 APKGAISPRIGKRKKKK----KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE 61
K A + K+K EEE +K +K+ K+ E + +E ++++EE++E
Sbjct: 122 INKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEE 181
Query: 62 KKKEEKKQKKEEEKD 76
+++E++ +++ D
Sbjct: 182 EEEEDEDFDDDDDDD 196
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 32.6 bits (74), Expect = 0.099
Identities = 24/96 (25%), Positives = 42/96 (43%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+K KK+ +KK K+ K+ + K+ K + +K K K+ K K +K
Sbjct: 5 KKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVT 64
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
K EK + K+E KK +K+ + + + K
Sbjct: 65 VKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLF 100
Score = 31.5 bits (71), Expect = 0.25
Identities = 22/94 (23%), Positives = 38/94 (40%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
K+ KK ++K+ +K K+ K +K + K KK K +K + K K+
Sbjct: 5 KKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVT 64
Query: 83 KQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
+ +K E +K+ KK KK+
Sbjct: 65 VKFEKTESVKKESVAKKTVKKEAVSAEVFEASNK 98
Score = 29.5 bits (66), Expect = 1.1
Identities = 22/86 (25%), Positives = 38/86 (44%)
Query: 29 KKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKE 88
+ KK KK+ +E+K KK ++ K +K + K ++ K K+ K K
Sbjct: 1 MAETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTT 60
Query: 89 EKKQKKKEEKKQKKKKEEKKNKKAEE 114
+K K E+ + KK+ K +E
Sbjct: 61 KKVTVKFEKTESVKKESVAKKTVKKE 86
Score = 28.4 bits (63), Expect = 3.0
Identities = 24/94 (25%), Positives = 40/94 (42%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
E + K+K EEKK K+ K+ K ++ + K +K K ++ K K
Sbjct: 1 MAETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTT 60
Query: 82 KKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
KK K EK + K+E KK +++ +
Sbjct: 61 KKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFE 94
Score = 27.2 bits (60), Expect = 5.9
Identities = 27/97 (27%), Positives = 39/97 (40%), Gaps = 3/97 (3%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEE--KEEKKEKKKEEKKQKKEEE 74
+ KK ++ K+ K +K K K KK K ++ + KK K +K E
Sbjct: 14 EEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESV 73
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
K K+ KK KKE + E + K K KK
Sbjct: 74 K-KESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKK 109
>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2. This
family consists of several Eukaryotic rRNA processing
protein EBP2 sequences. Ebp2p is required for the
maturation of 25S rRNA and 60S subunit assembly. Ebp2p
may be one of the target proteins of Rrs1p for executing
the signal to regulate ribosome biogenesis. This family
also plays a role in chromosome segregation.
Length = 270
Score = 32.7 bits (75), Expect = 0.10
Identities = 25/98 (25%), Positives = 52/98 (53%), Gaps = 1/98 (1%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEE-KKEKKKEEKKQKKEE 73
+ + KK EE +K+++ K+ KQ + E+ +++ KEKK+ E+ KK KKK + + +
Sbjct: 125 LAEAAAKKASEEARKQRELKKFGKQVQVEKLQERAKEKKDMLEKIKKLKKKRKGGGDEGD 184
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
D ++ + ++K +K + + K + K+ K
Sbjct: 185 LFDFLVDDAAKGGSKKKGRKGGAARGKPNAKRKAKDAK 222
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 33.0 bits (75), Expect = 0.11
Identities = 17/85 (20%), Positives = 42/85 (49%), Gaps = 3/85 (3%)
Query: 36 EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK--EEKKQK 93
E + KEE +K E ++++ +E ++ ++++ D ++++K + EE Q+
Sbjct: 4014 EAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQE 4073
Query: 94 KKEEKKQKKKKEEKKNKKAEEQNRQ 118
+E + K +E+ ++ E Q
Sbjct: 4074 NEESTEDGVKSDEEL-EQGEVPEDQ 4097
Score = 30.7 bits (69), Expect = 0.74
Identities = 12/67 (17%), Positives = 31/67 (46%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
+E N++ + E+K ++ + KE+ + E K ++K+ +EE D + +
Sbjct: 3909 DEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDE 3968
Query: 84 QKKKEEK 90
+ ++
Sbjct: 3969 IQPDIQE 3975
Score = 28.8 bits (64), Expect = 2.9
Identities = 21/102 (20%), Positives = 44/102 (43%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+K ++ N + +E K E+K +++KE +EE + E + +E
Sbjct: 3921 QKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQP 3980
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
E + E+ K +KE K E + +A ++N+++
Sbjct: 3981 PPENEDLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEE 4022
Score = 28.4 bits (63), Expect = 3.7
Identities = 18/88 (20%), Positives = 36/88 (40%), Gaps = 6/88 (6%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEK-----KEKKKEEKKQKKEEEKDKKKEEKKQKKKEE 89
E + EE+ Q E+ + E EK +E ++ E ++K E+ E
Sbjct: 3874 SENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNES 3933
Query: 90 KKQKKKEEKKQ-KKKKEEKKNKKAEEQN 116
K+++ K + K ++K + E +
Sbjct: 3934 DLVSKEDDNKALEDKDRQEKEDEEEMSD 3961
Score = 27.7 bits (61), Expect = 6.3
Identities = 22/107 (20%), Positives = 47/107 (43%), Gaps = 3/107 (2%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE--E 74
K + +++E + + EE ++ ++ + E+ E+ E EE Q+ EE E
Sbjct: 4020 KEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTE 4079
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
K +E+ ++ + + Q K K +A+E+N +G+
Sbjct: 4080 DGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFAS-AEADEENTDKGI 4125
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 32.8 bits (75), Expect = 0.12
Identities = 21/76 (27%), Positives = 38/76 (50%), Gaps = 2/76 (2%)
Query: 5 IAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKE--KKEEKEEKKEK 62
+A K AI+ RI + +++ +K+ EK ++K + K K K+E+ + +
Sbjct: 320 LAAKLAIAARIDAFSGEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRR 379
Query: 63 KKEEKKQKKEEEKDKK 78
KK+EKK K E +
Sbjct: 380 KKKEKKAKSERRGLQN 395
Score = 31.6 bits (72), Expect = 0.26
Identities = 17/56 (30%), Positives = 33/56 (58%)
Query: 54 EEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
E+ EK+ E+ K+K + K K E+ +K++ + ++KK+EKK K ++ +N
Sbjct: 340 ISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 30.4 bits (69), Expect = 0.57
Identities = 14/53 (26%), Positives = 33/53 (62%)
Query: 50 KEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
+E+ E++ EK ++K + K + E+DKK+ + ++K+++K+ K E + +
Sbjct: 343 REELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 27.7 bits (62), Expect = 5.1
Identities = 14/61 (22%), Positives = 29/61 (47%)
Query: 57 EEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
E E+ +K+ E+ +K + K K E+ +K++ + ++KKKE+K +
Sbjct: 335 GEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394
Query: 117 R 117
Sbjct: 395 N 395
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 33.0 bits (76), Expect = 0.12
Identities = 15/63 (23%), Positives = 33/63 (52%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
+ +++++K K E+ K E++ EK EK+ K+ EE+ + E E + ++E
Sbjct: 63 RSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQE 122
Query: 82 KKQ 84
++
Sbjct: 123 IER 125
Score = 32.6 bits (75), Expect = 0.16
Identities = 16/58 (27%), Positives = 33/58 (56%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
K +E+KK K ++ K+ E++ +K EK+ K+ EE+ + + E K+ +++ E
Sbjct: 67 PKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIE 124
Score = 31.4 bits (72), Expect = 0.33
Identities = 18/87 (20%), Positives = 43/87 (49%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
K+ + E++ KK ++ + +E+ E +E K+E ++ +KE E ++ ++ KK
Sbjct: 185 VLKELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELAKK 244
Query: 87 KEEKKQKKKEEKKQKKKKEEKKNKKAE 113
E+ E + + ++ E +K +
Sbjct: 245 YLEELLALYEYLEIELERAEALSKFLK 271
Score = 29.1 bits (66), Expect = 1.7
Identities = 17/57 (29%), Positives = 31/57 (54%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
K EE KK K E+ K EE+ +K +++ +E EE+ + + E K+ ++E +
Sbjct: 68 KLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIE 124
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 32.8 bits (75), Expect = 0.12
Identities = 15/58 (25%), Positives = 31/58 (53%)
Query: 33 KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
K+E+ +++K+E KK K++K ++K +K K ++ +KK + K E +
Sbjct: 191 KEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAASQKKSSDLESKLEAQ 248
Score = 30.1 bits (68), Expect = 0.93
Identities = 18/77 (23%), Positives = 35/77 (45%), Gaps = 1/77 (1%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKE-EKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQK 93
+++ KK + K+E+ E K+E KK K++K DK K+ K +
Sbjct: 174 SDQEAVLPLVKKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAA 233
Query: 94 KKEEKKQKKKKEEKKNK 110
+++ + K E ++K
Sbjct: 234 SQKKSSDLESKLEAQSK 250
Score = 29.8 bits (67), Expect = 1.3
Identities = 21/69 (30%), Positives = 35/69 (50%), Gaps = 2/69 (2%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
KK E KEEK E++K+E K+ + +KD ++K +K K ++ +KK
Sbjct: 184 KKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDV--STSTAASQKKSSDL 241
Query: 109 NKKAEEQNR 117
K E Q++
Sbjct: 242 ESKLEAQSK 250
Score = 27.8 bits (62), Expect = 4.7
Identities = 15/65 (23%), Positives = 35/65 (53%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQK 93
K+ + K+E+ +++K+E K+ + KK+ ++K +K + ++D +KK +
Sbjct: 184 KKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAASQKKSSDLES 243
Query: 94 KKEEK 98
K E +
Sbjct: 244 KLEAQ 248
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 32.7 bits (75), Expect = 0.13
Identities = 16/89 (17%), Positives = 40/89 (44%), Gaps = 5/89 (5%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
K K E +++++ E + ++ + KQ ++++ E K+ +E + ++ + + +
Sbjct: 669 KDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERLAKWQVVEG 728
Query: 80 EEKKQ-----KKKEEKKQKKKEEKKQKKK 103
E Q E + + K K+ KK
Sbjct: 729 ELDNQLAQLSAAIEAARTQAKARLKELKK 757
Score = 30.4 bits (69), Expect = 0.77
Identities = 16/102 (15%), Positives = 51/102 (50%), Gaps = 1/102 (0%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
K+K+ EE + + E++++ + E + K+ + + + + +++ K + + ++
Sbjct: 620 VAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQQSLKDKLELAIAER 679
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K++ + Q ++ + Q K+ ++Q+ E K+ E + +
Sbjct: 680 KQQAETQLRQLDA-QLKQLLEQQQAFLEALKDDFRELRTERL 720
Score = 28.5 bits (64), Expect = 2.9
Identities = 13/89 (14%), Positives = 36/89 (40%), Gaps = 2/89 (2%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
R + +++ K + E+KQ+ + + +Q + K+ E+++ + K +E ++
Sbjct: 660 RLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTER 719
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
K Q + E + + +
Sbjct: 720 L--AKWQVVEGELDNQLAQLSAAIEAART 746
>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
secretion].
Length = 407
Score = 32.7 bits (75), Expect = 0.13
Identities = 29/124 (23%), Positives = 49/124 (39%), Gaps = 6/124 (4%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+G K E K E +K + +E K K E + K+ ++K++E K KK E
Sbjct: 40 LGLLFSKTAVEVTKLAAVDSELRKFQTREAAKVKDAEAQLLKDPAEKKREERKAAKKIER 99
Query: 75 KDKKKEEKK----QKKKEEKKQKKKEEKKQKKKKEEKKNKKAE--EQNRQQGLYPNQKRT 128
E+K + EE + ++EK ++ EE + +Q Q N
Sbjct: 100 STPSLIERKTQEVKDSGEEIAEMMRDEKVPIRELEEIPPEFVALYKQEIQSPTRLNLINE 159
Query: 129 LIGY 132
L+
Sbjct: 160 LLRA 163
Score = 27.7 bits (62), Expect = 5.1
Identities = 16/112 (14%), Positives = 44/112 (39%), Gaps = 2/112 (1%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+ R+ KK K E + + + +K + +E + K + + K +++
Sbjct: 29 LSNRRIKKGGFLGLLFSKTAVEVTKLAAVDSELRKFQTREAAKVKDAEAQLLKDPAEKKR 88
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQK 126
+++K K+ ++ +++ ++ K EE +E+ + L
Sbjct: 89 EERKAA--KKIERSTPSLIERKTQEVKDSGEEIAEMMRDEKVPIRELEEIPP 138
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 32.4 bits (74), Expect = 0.13
Identities = 12/54 (22%), Positives = 32/54 (59%)
Query: 55 EKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
+ E++KE + ++ ++K++ + +KK++ +E K +KK ++ K+K+
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSS 126
Score = 30.9 bits (70), Expect = 0.36
Identities = 20/63 (31%), Positives = 34/63 (53%), Gaps = 3/63 (4%)
Query: 47 QKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK---KQKKKEEKKQKKK 103
QK +++KE E ++ +KK++ E +K K+ EE K +K+ K+K+ Q K
Sbjct: 72 QKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQTKL 131
Query: 104 KEE 106
EE
Sbjct: 132 TEE 134
Score = 29.7 bits (67), Expect = 0.87
Identities = 16/62 (25%), Positives = 34/62 (54%)
Query: 29 KKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKE 88
K + +KE + E+ +KK+E + +K+++ E+ K +K+ + K+K+ Q K
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQTKLT 132
Query: 89 EK 90
E+
Sbjct: 133 EE 134
Score = 29.7 bits (67), Expect = 0.89
Identities = 15/64 (23%), Positives = 37/64 (57%)
Query: 58 EKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
+ +++K+ + E+ KK+E + +KKKE ++ K ++K + KE++ + + +
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQTKLT 132
Query: 118 QQGL 121
++GL
Sbjct: 133 EEGL 136
Score = 28.9 bits (65), Expect = 1.6
Identities = 16/53 (30%), Positives = 28/53 (52%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
E E KK+++ E KK+K+ EE K +K+ + +EK+ + K +E
Sbjct: 82 ASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQTKLTEE 134
Score = 28.9 bits (65), Expect = 1.8
Identities = 12/50 (24%), Positives = 32/50 (64%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKK 68
K + ++E + + ++ KK+++ E KK+K+ E+ + ++K ++ +EK+
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQ 122
Score = 28.2 bits (63), Expect = 2.7
Identities = 13/49 (26%), Positives = 27/49 (55%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEK 67
+ E+ KKK++ + +KK++ +E K +KK + KE++ + K
Sbjct: 82 ASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQTK 130
Score = 27.8 bits (62), Expect = 3.8
Identities = 16/58 (27%), Positives = 31/58 (53%)
Query: 50 KEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
K + E++ + E+ K+K+E E KKKE ++ K ++K + +EK+ + K
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQTK 130
Score = 27.8 bits (62), Expect = 3.9
Identities = 14/42 (33%), Positives = 27/42 (64%)
Query: 39 QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
QK ++EK+ + E+ +KKE+ + +KK++ EE K +K+
Sbjct: 72 QKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113
Score = 27.0 bits (60), Expect = 6.8
Identities = 14/43 (32%), Positives = 26/43 (60%), Gaps = 2/43 (4%)
Query: 69 QKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
QK E+EK+ + + + KK+++ E KK+K+ +E K +K
Sbjct: 72 QKPEDEKELSASSLEAE--QAKKKEEAEAKKKKEMEELKAVQK 112
Score = 26.6 bits (59), Expect = 9.5
Identities = 18/58 (31%), Positives = 32/58 (55%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
K E+E + E ++ KKKEE + KKK++ EE + ++K + K+K+ + K
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQTK 130
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 32.7 bits (74), Expect = 0.13
Identities = 24/77 (31%), Positives = 40/77 (51%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
KK++ + + KE++K+ KK +K E+E K + K E K+K + E +
Sbjct: 226 KKRDPRIKSFKEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAV 285
Query: 92 QKKKEEKKQKKKKEEKK 108
+KK +E +K K EKK
Sbjct: 286 KKKAKEVMKKALKMEKK 302
Score = 30.0 bits (67), Expect = 0.81
Identities = 20/78 (25%), Positives = 37/78 (47%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
K +E++++ K+ +K +++ K K K E K K E E + KKK ++
Sbjct: 233 KSFKEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEV 292
Query: 92 QKKKEEKKQKKKKEEKKN 109
KK + ++K K K+
Sbjct: 293 MKKALKMEKKAIKNAAKD 310
Score = 26.9 bits (59), Expect = 8.4
Identities = 22/97 (22%), Positives = 44/97 (45%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
RK+ + + +K+ K K + + K + KK K K+++K+ +K +K E E
Sbjct: 195 RKRYSEAKNREKRAKLKNQDNARLKRLVQIAKKRDPRIKSFKEQEKEMKKIRKWEREAGA 254
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+ + K + + + K E + + KKA+E
Sbjct: 255 RLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKE 291
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 32.7 bits (76), Expect = 0.13
Identities = 15/64 (23%), Positives = 33/64 (51%), Gaps = 2/64 (3%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
+EE ++ Q + EE + ++ K+ + K++ ++ E E K+ KEE K + E +
Sbjct: 34 DEERRELQTELEELQAERNALSKEIGQAKRKGEDA--EALIAEVKELKEEIKALEAELDE 91
Query: 84 QKKK 87
+ +
Sbjct: 92 LEAE 95
Score = 31.2 bits (72), Expect = 0.40
Identities = 15/60 (25%), Positives = 30/60 (50%), Gaps = 4/60 (6%)
Query: 17 KRKKKKKEEENKKKQKKKEEKK----QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
+R++ + E E + ++ K+ ++K E+ + E KE KEE K + E + + E
Sbjct: 36 ERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEAE 95
Score = 28.1 bits (64), Expect = 4.1
Identities = 16/64 (25%), Positives = 34/64 (53%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
EE+++ + E +E + E+ KE + K++ E + + ++ KEE K + E + +
Sbjct: 35 EERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEA 94
Query: 103 KKEE 106
+ EE
Sbjct: 95 ELEE 98
Score = 27.0 bits (61), Expect = 8.0
Identities = 13/44 (29%), Positives = 24/44 (54%), Gaps = 1/44 (2%)
Query: 10 AISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKK 53
A+S IG+ K+K ++ E + KE K++ K E + + E +
Sbjct: 53 ALSKEIGQAKRKGEDAE-ALIAEVKELKEEIKALEAELDELEAE 95
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 32.7 bits (75), Expect = 0.13
Identities = 21/85 (24%), Positives = 45/85 (52%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
+E K +K+ E KKEE +++ +E + + + + + KK++ E E ++ E+K
Sbjct: 846 KEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKI 905
Query: 85 KKKEEKKQKKKEEKKQKKKKEEKKN 109
++ E + +KK++ + K K E
Sbjct: 906 EELEAQIEKKRKRLSELKAKLEALE 930
Score = 30.8 bits (70), Expect = 0.66
Identities = 21/101 (20%), Positives = 57/101 (56%), Gaps = 6/101 (5%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQ------KKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
E+E ++ Q+++ + K++ K +K+ KK+E +EE EE + ++ + + + +K++
Sbjct: 832 EKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKER 891
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+ E + ++ E K ++ + + ++K+K+ + K E +
Sbjct: 892 DELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEE 932
Score = 30.4 bits (69), Expect = 0.70
Identities = 23/118 (19%), Positives = 57/118 (48%), Gaps = 4/118 (3%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
+ ++ K+++ + E++ K E + K + EE E + E++++ + + EE
Sbjct: 301 AEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYA 360
Query: 76 DKKKEEKKQKKKEEKKQKK----KEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTL 129
+ K+E + + + E+ K+ ++E K ++K EK ++ E R+ + + L
Sbjct: 361 ELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRL 418
Score = 30.0 bits (68), Expect = 1.1
Identities = 21/104 (20%), Positives = 56/104 (53%), Gaps = 2/104 (1%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKE--EKKEKKKEEKKQKKEEEK 75
+K + E + +K+ +E ++++ + K+Q K +KE + KKE+ +EE ++ + +
Sbjct: 819 QKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALR 878
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
D + KK+ ++ + + E ++K ++ E + +K ++ +
Sbjct: 879 DLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSEL 922
Score = 28.5 bits (64), Expect = 2.9
Identities = 14/84 (16%), Positives = 46/84 (54%)
Query: 36 EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKK 95
E+K + +K+ +++ +E +E++ KE+ K ++E ++ ++++ +++ E+ +
Sbjct: 818 EQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAAL 877
Query: 96 EEKKQKKKKEEKKNKKAEEQNRQQ 119
+ + + +K+ + E Q R+
Sbjct: 878 RDLESRLGDLKKERDELEAQLREL 901
Score = 28.1 bits (63), Expect = 3.7
Identities = 18/96 (18%), Positives = 49/96 (51%), Gaps = 1/96 (1%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
KR+ + +EE ++ ++ + + K E +EEKE+K + K+++ + ++ D
Sbjct: 405 KRELDRLQEELQRLSEELADLNAAIAG-IEAKINELEEEKEDKALEIKKQEWKLEQLAAD 463
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
K E++ +E+ + ++E + +++ + +A
Sbjct: 464 LSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQA 499
Score = 28.1 bits (63), Expect = 4.6
Identities = 21/98 (21%), Positives = 54/98 (55%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
++ E +K ++ ++ + E+++ ++ + KE+++ + E K+K+ E+ K+
Sbjct: 183 EENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAI 242
Query: 81 EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
E++ EE+ +K EE + +K+ E+ + EE N++
Sbjct: 243 ERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKK 280
Score = 28.1 bits (63), Expect = 4.8
Identities = 17/92 (18%), Positives = 46/92 (50%), Gaps = 2/92 (2%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ K +++ E K++ + +++ + +E+ Q+ E+ + + + + +EE++D
Sbjct: 386 ELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKED 445
Query: 77 KKKE--EKKQKKKEEKKQKKKEEKKQKKKKEE 106
K E +++ K ++ K E++ KEE
Sbjct: 446 KALEIKKQEWKLEQLAADLSKYEQELYDLKEE 477
Score = 27.7 bits (62), Expect = 6.4
Identities = 18/88 (20%), Positives = 46/88 (52%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ + + EN+ + +E +K + +K+ E+ E++EEK +++ EE ++ + +
Sbjct: 694 QSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEI 753
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
+ K + K+ E + ++ EE K ++
Sbjct: 754 ENVKSELKELEARIEELEEDLHKLEEAL 781
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 32.2 bits (73), Expect = 0.15
Identities = 16/64 (25%), Positives = 24/64 (37%)
Query: 1 MKISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK 60
M P + +P K KK K+ ++ K K + KK KKK + + K
Sbjct: 129 MHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSK 188
Query: 61 EKKK 64
K
Sbjct: 189 PDWK 192
Score = 30.2 bits (68), Expect = 0.69
Identities = 14/65 (21%), Positives = 26/65 (40%)
Query: 2 KISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE 61
+ + P P + KK KK + K + K +K +KK + K + K +
Sbjct: 131 HLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPD 190
Query: 62 KKKEE 66
K ++
Sbjct: 191 WKSQD 195
Score = 29.5 bits (66), Expect = 1.1
Identities = 13/51 (25%), Positives = 22/51 (43%)
Query: 26 ENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
E K+ +K K+ + K + K KK +KK + K + K + K
Sbjct: 143 EAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWKS 193
Score = 29.1 bits (65), Expect = 1.5
Identities = 14/75 (18%), Positives = 26/75 (34%), Gaps = 2/75 (2%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
+N ++ + + KE K+ K+ + K + K KK K K +
Sbjct: 120 GDNPYGTREMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKK--PKKKGSVSNRS 177
Query: 85 KKKEEKKQKKKEEKK 99
K + K + K
Sbjct: 178 VKMPGIDPRSKPDWK 192
Score = 28.7 bits (64), Expect = 2.1
Identities = 10/48 (20%), Positives = 17/48 (35%)
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
+E K+ K+ + K + K KK ++K K K
Sbjct: 144 AKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDW 191
Score = 28.7 bits (64), Expect = 2.1
Identities = 10/50 (20%), Positives = 20/50 (40%)
Query: 48 KKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEE 97
+ K+ K+ + K K + KK KK+ + + K K ++
Sbjct: 146 EVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWKSQD 195
Score = 28.7 bits (64), Expect = 2.3
Identities = 13/48 (27%), Positives = 18/48 (37%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
+E KK +K + K K + KK KKK + K K
Sbjct: 144 AKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDW 191
Score = 27.5 bits (61), Expect = 5.4
Identities = 16/79 (20%), Positives = 28/79 (35%), Gaps = 1/79 (1%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
+ + +E + E KE KK +K Q + K K +K +KK
Sbjct: 116 HPQLGDNPYGTREMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPK-APKPKKPKKKGSVS 174
Query: 91 KQKKKEEKKQKKKKEEKKN 109
+ K + K + K+
Sbjct: 175 NRSVKMPGIDPRSKPDWKS 193
Score = 27.2 bits (60), Expect = 6.1
Identities = 13/48 (27%), Positives = 19/48 (39%)
Query: 72 EEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+ KE KK KK + K K + K+ KKK N+ +
Sbjct: 138 STAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDP 185
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 31.7 bits (72), Expect = 0.15
Identities = 21/65 (32%), Positives = 40/65 (61%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
KKK + K KK + +E + E K +K+ KK+K E++K++KK++K++KKK+++
Sbjct: 110 KKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169
Query: 92 QKKKE 96
+
Sbjct: 170 SPEHP 174
Score = 30.6 bits (69), Expect = 0.39
Identities = 20/58 (34%), Positives = 37/58 (63%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
K+K K + K+ + ++ ++ E + EKK ++KK + ++E+ KKK+EKK+KKK
Sbjct: 110 KKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKK 167
Score = 30.2 bits (68), Expect = 0.49
Identities = 23/69 (33%), Positives = 39/69 (56%), Gaps = 1/69 (1%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
E+ + + ++K + K KK + EE + K +KK KK+K +++K++KK
Sbjct: 99 EQYRLMHIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKH-KKKKHEDDKERKK 157
Query: 103 KKEEKKNKK 111
KK+EKK KK
Sbjct: 158 KKKEKKKKK 166
Score = 30.2 bits (68), Expect = 0.53
Identities = 21/70 (30%), Positives = 41/70 (58%)
Query: 29 KKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKE 88
Q K++ K K K+ + Q ++ + + K E+K +KK+ E DK++++KK++KK+
Sbjct: 105 HIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKK 164
Query: 89 EKKQKKKEEK 98
+KK+ E
Sbjct: 165 KKKRHSPEHP 174
Score = 29.0 bits (65), Expect = 1.1
Identities = 20/59 (33%), Positives = 36/59 (61%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
KKK + KK + ++ ++ + + K +K+ K++K E KE KK+KKE++K KK+
Sbjct: 110 KKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKR 168
Score = 28.3 bits (63), Expect = 2.3
Identities = 20/61 (32%), Positives = 37/61 (60%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K K KK ++ ++ + + K EKK KKK+ +++KE KK+KK+++KK+K+ +
Sbjct: 114 KHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEH 173
Query: 77 K 77
Sbjct: 174 P 174
Score = 26.7 bits (59), Expect = 8.1
Identities = 20/73 (27%), Positives = 38/73 (52%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
R + + K K K K+ + Q E+ E + E+K +KKK E ++++++K +
Sbjct: 102 RLMHIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKE 161
Query: 78 KKEEKKQKKKEEK 90
KK++KK+ E
Sbjct: 162 KKKKKKRHSPEHP 174
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 32.6 bits (74), Expect = 0.16
Identities = 19/105 (18%), Positives = 41/105 (39%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+GK K K+ + + +KQ + +K K + E K E + + KE+
Sbjct: 1 MGKASKATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKL 60
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+ +EE+ + K K+ N+ + ++ R++
Sbjct: 61 SKLYSDVDSSSSEEEEDGSESISKLNVNSKKISLNQVSTQKWRKE 105
Score = 28.7 bits (64), Expect = 2.5
Identities = 15/89 (16%), Positives = 31/89 (34%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEK 67
K I R + KK + + + + + K K+K K + +EE+
Sbjct: 17 KNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSKLYSDVDSSSSEEEE 76
Query: 68 KQKKEEEKDKKKEEKKQKKKEEKKQKKKE 96
+ K +K + ++ +KE
Sbjct: 77 DGSESISKLNVNSKKISLNQVSTQKWRKE 105
>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
Length = 874
Score = 32.4 bits (75), Expect = 0.16
Identities = 20/62 (32%), Positives = 30/62 (48%), Gaps = 7/62 (11%)
Query: 18 RKKKKKEEENKKKQKKKEEKK-------QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
+ +KE +K+ ++ EKK K EE +K++EK E EEK K KE +
Sbjct: 813 LARLEKELAKLEKEIERVEKKLSNEGFVAKAPEEVVEKEREKLAEYEEKLAKLKERLARL 872
Query: 71 KE 72
K
Sbjct: 873 KA 874
Score = 27.4 bits (62), Expect = 7.0
Identities = 20/67 (29%), Positives = 31/67 (46%), Gaps = 8/67 (11%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKK-------QKKKEKKEEKEEKKEKKKEEKKQKKEE 73
+ E +K+ K EK + ++ EKK K E+ EKE +K + EEK K +E
Sbjct: 809 VEAELARLEKELAKLEK-EIERVEKKLSNEGFVAKAPEEVVEKEREKLAEYEEKLAKLKE 867
Query: 74 EKDKKKE 80
+ K
Sbjct: 868 RLARLKA 874
>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated. This domain
appears in eukaryotes as well as bacteria and tends to
be found near the C-terminus of the metalloprotease
M16C (pfam05193).
Length = 248
Score = 31.8 bits (73), Expect = 0.16
Identities = 17/47 (36%), Positives = 27/47 (57%), Gaps = 1/47 (2%)
Query: 50 KEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKE 96
E EEK ++E++K EKK+ E+DK+K ++ + EE Q E
Sbjct: 4 DETLEEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEE-LQSTPE 49
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 31.4 bits (72), Expect = 0.18
Identities = 18/61 (29%), Positives = 38/61 (62%), Gaps = 1/61 (1%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE-EKDKKKEEKKQKKKEEKKQ 92
K++ K+ KKE++KQ+K+ +K + E K+ ++ K E+ E+D++ ++Q + E+K
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQKAI 73
Query: 93 K 93
Sbjct: 74 V 74
Score = 29.5 bits (67), Expect = 0.82
Identities = 16/57 (28%), Positives = 33/57 (57%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
K+K KK ++E +K++K+ + +E KQ +E K EK E+ + +++ + E+
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQ 70
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 32.3 bits (74), Expect = 0.18
Identities = 25/82 (30%), Positives = 45/82 (54%), Gaps = 2/82 (2%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE--EKDKKKEEKKQKKKEEKK 91
K+E + K + K KK E+K EK +++ +E+ K K+E + K+E ++ K++ ++
Sbjct: 199 KKELELLKLQNKITKKVEEKMEKTQREYYLREQLKAIKKELGIEKDDKDELEKLKEKLEE 258
Query: 92 QKKKEEKKQKKKKEEKKNKKAE 113
K EE K+ +KE K E
Sbjct: 259 LKLPEEVKKVIEKELTKLSLLE 280
Score = 30.7 bits (70), Expect = 0.54
Identities = 27/75 (36%), Positives = 41/75 (54%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
K + + KK ++K EK Q++ ++Q K KKE EK +K + EK ++K EE +E
Sbjct: 206 KLQNKITKKVEEKMEKTQREYYLREQLKAIKKELGIEKDDKDELEKLKEKLEELKLPEEV 265
Query: 82 KKQKKKEEKKQKKKE 96
KK +KE K E
Sbjct: 266 KKVIEKELTKLSLLE 280
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 32.1 bits (73), Expect = 0.18
Identities = 29/103 (28%), Positives = 57/103 (55%), Gaps = 10/103 (9%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK-EKKKEEKKQKKEEEK 75
K+K + ++ENK +K +EK +++EE ++ + +KEE+E+++ +KEE++Q+ + K
Sbjct: 119 KKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRK 178
Query: 76 DKKK---------EEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
+K+ + + KK K E + +K K EK N
Sbjct: 179 NKQALLDELETSTLPAAELIAQHKKNSVKLEMQVEKPKPEKPN 221
Score = 30.9 bits (70), Expect = 0.44
Identities = 23/65 (35%), Positives = 41/65 (63%), Gaps = 4/65 (6%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEK----KEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQK 85
KKK E QK+ ++ QK KEK +EE EE E +KEE++Q++ + +++E++ K
Sbjct: 117 NTKKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNK 176
Query: 86 KKEEK 90
+K ++
Sbjct: 177 RKNKQ 181
>gnl|CDD|235570 PRK05703, flhF, flagellar biosynthesis regulator FlhF; Validated.
Length = 424
Score = 32.2 bits (74), Expect = 0.18
Identities = 25/118 (21%), Positives = 53/118 (44%), Gaps = 3/118 (2%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
+E++ +K +++K+K K + EK + ++ Q + + KKE +K
Sbjct: 52 DEDETPKKNPVLREEKRKPAKSILSLQALLEKRPSRTNSQDALLQAENALPEWKKELEKP 111
Query: 85 KKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLIGYLRSVDQFANL 142
+ +E++ K E K +K+ ++ + +E + L +Q L R +FA L
Sbjct: 112 SEPKEEEPKAAAESKVVQKELDELRDELKEL---KNLLEDQLSGLRQVERIPPEFAEL 166
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 31.5 bits (72), Expect = 0.18
Identities = 25/60 (41%), Positives = 29/60 (48%), Gaps = 6/60 (10%)
Query: 29 KKQKKKEEKKQKKKEEK---KQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQK 85
KK KEEK K K Q K +KEEKK KEEKK K E K+K E++
Sbjct: 68 KKVMTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAE---KEKLEEEYG 124
Score = 31.2 bits (71), Expect = 0.27
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 10/58 (17%)
Query: 56 KEEKKEKKKEEKKQKKEEEK----------DKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
K+ KK KEEK K+ K +KEEKK KEEKK K E++K +++
Sbjct: 65 KDFKKVMTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEE 122
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 32.0 bits (73), Expect = 0.20
Identities = 23/85 (27%), Positives = 40/85 (47%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQK 93
KEE K K++ + K + KK + K KKK++KK+K +K +
Sbjct: 213 KEEDPDKYKKQFSKYIKAGVGPDSLEDMYKKAHAAIRANPSKVKKKKKKKKKVVHKKYKT 272
Query: 94 KKEEKKQKKKKEEKKNKKAEEQNRQ 118
KK KQ+K + + K + E+ ++
Sbjct: 273 KKLTGKQRKARVKAKKAQRRERLQK 297
Score = 29.3 bits (66), Expect = 1.4
Identities = 22/72 (30%), Positives = 37/72 (51%), Gaps = 1/72 (1%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEK-KKEEKKQKKEEEKDKK 78
K ++ + KK + K +KKK+KK++ KK K KK KQ+K K KK
Sbjct: 229 KAGVGPDSLEDMYKKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKK 288
Query: 79 KEEKKQKKKEEK 90
+ +++ +K+ K
Sbjct: 289 AQRRERLQKKIK 300
Score = 28.9 bits (65), Expect = 1.7
Identities = 25/90 (27%), Positives = 43/90 (47%), Gaps = 2/90 (2%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
+EE+ K KK+ K K + K+ + + KK+KK+++K K+ K
Sbjct: 213 KEEDPDKYKKQFSKYIKAGVGPDSLEDMYKKAHAAIRANPSKVKKKKKKKKKVVHKKYKT 272
Query: 84 QKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
KK KQ+K K +K ++ E+ KK +
Sbjct: 273 --KKLTGKQRKARVKAKKAQRRERLQKKIK 300
Score = 28.1 bits (63), Expect = 3.8
Identities = 16/47 (34%), Positives = 30/47 (63%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
K KKK+++ KK KK + K+ +++K + K KK ++ E+ +KK +
Sbjct: 254 KVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKKIK 300
>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
Length = 801
Score = 32.1 bits (73), Expect = 0.20
Identities = 18/68 (26%), Positives = 31/68 (45%)
Query: 46 KQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKE 105
K+ K E+ E EK + ++ + E + KK ++ +KK KKK+ KK +
Sbjct: 570 KELKDEEVEAYREKFAGSQADRAARAEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGK 629
Query: 106 EKKNKKAE 113
K + E
Sbjct: 630 SKAAAERE 637
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 32.3 bits (73), Expect = 0.20
Identities = 21/59 (35%), Positives = 34/59 (57%)
Query: 45 KKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
+K +KE +E EE +K E+ ++ K EE + E KK+K K+ K+ K+ E + K K
Sbjct: 208 EKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEVQNKHK 266
Score = 32.0 bits (72), Expect = 0.21
Identities = 27/94 (28%), Positives = 51/94 (54%), Gaps = 6/94 (6%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEK------EEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKE 88
+E + + E ++ K+ KK + E EK E++ ++EE KK +E ++ K
Sbjct: 176 KEDQMEYLEPRRLKELIKKHSEFIGYDIELMVEKTTEKEVTDEDEEDTKKADEDGEEPKV 235
Query: 89 EKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLY 122
E+ ++ E KK+K KK ++ K+ E QN+ + L+
Sbjct: 236 EEVKEGDEGKKKKTKKVKEVTKEYEVQNKHKPLW 269
Score = 27.3 bits (60), Expect = 7.4
Identities = 21/69 (30%), Positives = 38/69 (55%), Gaps = 6/69 (8%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK------E 72
+K+ +E+ + +K E+ ++ K EE K+ + KK++ ++ KE KE + Q K
Sbjct: 212 EKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEVQNKHKPLWTR 271
Query: 73 EEKDKKKEE 81
+ KD KEE
Sbjct: 272 DPKDVTKEE 280
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 31.9 bits (73), Expect = 0.21
Identities = 12/66 (18%), Positives = 29/66 (43%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+ KR K K+ EE K +++ + + + +K +++ + + + +K E
Sbjct: 13 LKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRANRLKKVEAL 72
Query: 75 KDKKKE 80
+ K E
Sbjct: 73 RAKGVE 78
Score = 31.9 bits (73), Expect = 0.24
Identities = 20/72 (27%), Positives = 29/72 (40%), Gaps = 2/72 (2%)
Query: 52 KKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
K K+ K K+ EE+K KEE K K + K +E + + KK
Sbjct: 10 KNALKKRLKAKQAEEEKAAKEEAK-AAAAAAAAKGRSRKSAAADDETMDPTQYRANRLKK 68
Query: 112 AEEQNRQQGLYP 123
E R +G+ P
Sbjct: 69 VEAL-RAKGVEP 79
Score = 31.1 bits (71), Expect = 0.41
Identities = 17/78 (21%), Positives = 32/78 (41%), Gaps = 5/78 (6%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE--- 80
E N + K KK+ K ++ +++K K+E K + + +K D+ +
Sbjct: 1 AESNGEPLSKNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQ 60
Query: 81 --EKKQKKKEEKKQKKKE 96
+ KK E + K E
Sbjct: 61 YRANRLKKVEALRAKGVE 78
Score = 29.2 bits (66), Expect = 1.5
Identities = 11/61 (18%), Positives = 20/61 (32%)
Query: 13 PRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
++K +EE K K + +K + + + KK E + K
Sbjct: 18 KAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRANRLKKVEALRAKGV 77
Query: 73 E 73
E
Sbjct: 78 E 78
Score = 29.2 bits (66), Expect = 1.7
Identities = 14/55 (25%), Positives = 22/55 (40%)
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTL 129
K+ K+ K K+ EE+K K+E K K + + + P Q R
Sbjct: 10 KNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRAN 64
Score = 27.6 bits (62), Expect = 5.9
Identities = 15/63 (23%), Positives = 23/63 (36%), Gaps = 3/63 (4%)
Query: 55 EKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
E K KK K ++ EEEK +++ K K +K +E +
Sbjct: 6 EPLSKNALKKRLKAKQAEEEK---AAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYR 62
Query: 115 QNR 117
NR
Sbjct: 63 ANR 65
>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
Length = 194
Score = 31.3 bits (71), Expect = 0.22
Identities = 22/84 (26%), Positives = 45/84 (53%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
K+E EK+ +E+ +E +K+ +E K+++++KQ E+ +++ + K + K E+K
Sbjct: 3 KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVEDLEKEIADLKAKNKDLEDKY 62
Query: 109 NKKAEEQNRQQGLYPNQKRTLIGY 132
+ E Q Y ++ LI Y
Sbjct: 63 LRSQAEIQNMQNRYAKERAQLIKY 86
>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
Length = 278
Score = 31.8 bits (72), Expect = 0.22
Identities = 28/77 (36%), Positives = 44/77 (57%), Gaps = 2/77 (2%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKK--EEKKQKKEEEKDKKKEEK 82
EE +K + EE KQK +E +KQ K ++ K +K+ E KKQ+ E+ K + ++K
Sbjct: 11 EEELRKAQSLEELKQKYEEAQKQIADGKTLKRLYKVYEKREFELKKQQFEQLKAELSKKK 70
Query: 83 KQKKKEEKKQKKKEEKK 99
K+ KKE+ + K KK
Sbjct: 71 KKFKKEKVDVRVKVVKK 87
Score = 27.5 bits (61), Expect = 5.6
Identities = 22/64 (34%), Positives = 37/64 (57%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
K+K EE K+ K K+ K EK++ + +K++ ++ K E K++KK KKE+ + K
Sbjct: 24 KQKYEEAQKQIADGKTLKRLYKVYEKREFELKKQQFEQLKAELSKKKKKFKKEKVDVRVK 83
Query: 80 EEKK 83
KK
Sbjct: 84 VVKK 87
Score = 27.1 bits (60), Expect = 7.4
Identities = 25/74 (33%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
+K + + KQK +E +KQ + K K+ K EK E + KK++ ++ K E K KKK
Sbjct: 15 RKAQSLEELKQKYEEAQKQ-IADGKTLKRLYKVYEKREFELKKQQFEQLKAELSKKKKKF 73
Query: 81 EKKQKKKEEKKQKK 94
+K++ K KK
Sbjct: 74 KKEKVDVRVKVVKK 87
Score = 26.8 bits (59), Expect = 9.9
Identities = 21/67 (31%), Positives = 37/67 (55%), Gaps = 6/67 (8%)
Query: 55 EKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK------KEEKKQKKKEEKKQKKKKEEKK 108
E+E +K + EE KQK EE + + + K K+ K E + KK++ ++ K + +KK
Sbjct: 11 EEELRKAQSLEELKQKYEEAQKQIADGKTLKRLYKVYEKREFELKKQQFEQLKAELSKKK 70
Query: 109 NKKAEEQ 115
K +E+
Sbjct: 71 KKFKKEK 77
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 31.5 bits (72), Expect = 0.23
Identities = 27/94 (28%), Positives = 49/94 (52%)
Query: 11 ISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
I P++ RK +EE + KQ + E + E + K+K KK +E + KK E+ ++
Sbjct: 173 IKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEE 232
Query: 71 KEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
+ +E + K E+ KK E + + EKK ++ +
Sbjct: 233 ELQELESKIEDLTNKKSELNTEIAEAEKKLEQCR 266
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 31.9 bits (72), Expect = 0.23
Identities = 21/60 (35%), Positives = 30/60 (50%)
Query: 39 QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEK 98
++ ++K+K E KEE E K KK K+E+E K+ E K E KK + E K
Sbjct: 458 HPQEHGHERKRKRGGELKEELIETLKLVKKLKEEQESLAKELEATAHKSEIKKIAESEFK 517
Score = 30.0 bits (67), Expect = 0.97
Identities = 15/60 (25%), Positives = 27/60 (45%), Gaps = 1/60 (1%)
Query: 57 EEKKEKKKEEKKQKKEEEKDKKKEEKKQ-KKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+E E K+++ E K++ E K KK +E+++ +E + K E K E
Sbjct: 457 GHPQEHGHERKRKRGGELKEELIETLKLVKKLKEEQESLAKELEATAHKSEIKKIAESEF 516
Score = 28.4 bits (63), Expect = 3.7
Identities = 21/72 (29%), Positives = 34/72 (47%), Gaps = 4/72 (5%)
Query: 1 MKISIA---PKGAISPRIGKRKKKKKEEENKKKQK-KKEEKKQKKKEEKKQKKKEKKEEK 56
M IS+A G +RK+K+ E ++ + K KK K+++E K+ E K
Sbjct: 446 MPISMANMVYPGHPQEHGHERKRKRGGELKEELIETLKLVKKLKEEQESLAKELEATAHK 505
Query: 57 EEKKEKKKEEKK 68
E K+ + E K
Sbjct: 506 SEIKKIAESEFK 517
Score = 27.3 bits (60), Expect = 6.7
Identities = 18/56 (32%), Positives = 31/56 (55%), Gaps = 4/56 (7%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK----KEEKKQKKEEE 74
+E +++K+K+ E K++ E K KK K+E++ KE + K E K+ E E
Sbjct: 460 QEHGHERKRKRGGELKEELIETLKLVKKLKEEQESLAKELEATAHKSEIKKIAESE 515
>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
Length = 822
Score = 31.9 bits (73), Expect = 0.25
Identities = 14/82 (17%), Positives = 27/82 (32%)
Query: 12 SPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
P + + E K +KQKK++E K + + K + EK K+ +
Sbjct: 741 LPLLRVKGGFGDELGCCNNPKCNYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETV 800
Query: 72 EEEKDKKKEEKKQKKKEEKKQK 93
+ + K +
Sbjct: 801 TDLTAADPDAVAAKVDGVSADR 822
Score = 28.4 bits (64), Expect = 3.5
Identities = 9/60 (15%), Positives = 22/60 (36%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
+++KKEK+ + E + K K +K ++ + + K + +
Sbjct: 763 NYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLTAADPDAVAAKVDGVSADR 822
Score = 27.3 bits (61), Expect = 7.2
Identities = 12/79 (15%), Positives = 23/79 (29%), Gaps = 3/79 (3%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ K +E K +++KKE+ + K + E + K E+ K E
Sbjct: 745 RVKGGFGDELGCCNNPKCNYTEKQKKEK---ESKSELEALKGVGAKTAEKLKDAGVETVT 801
Query: 77 KKKEEKKQKKKEEKKQKKK 95
+
Sbjct: 802 DLTAADPDAVAAKVDGVSA 820
Score = 26.9 bits (60), Expect = 9.5
Identities = 9/60 (15%), Positives = 21/60 (35%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
E+++++KE K E + K K +K + + + K ++
Sbjct: 763 NYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLTAADPDAVAAKVDGVSADR 822
>gnl|CDD|181988 PRK09609, PRK09609, hypothetical protein; Provisional.
Length = 312
Score = 31.6 bits (72), Expect = 0.25
Identities = 19/64 (29%), Positives = 35/64 (54%), Gaps = 4/64 (6%)
Query: 45 KKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE----KKQKKKEEKKQ 100
K KK KE + ++ + K ++K++ + + EK QK K++ +K+KKK EK
Sbjct: 102 KFGKKFFGKESRIKRYDNKIFKQKEQYDFALENPNSEKIQKIKQKIILLEKKKKKLEKTN 161
Query: 101 KKKK 104
++K
Sbjct: 162 EEKS 165
Score = 28.5 bits (64), Expect = 2.4
Identities = 19/63 (30%), Positives = 34/63 (53%), Gaps = 2/63 (3%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK--KEEKKQKKEEEKD 76
K KK + + K+ + K K+KE+ + EK +K ++K EKK+KK E+ +
Sbjct: 102 KFGKKFFGKESRIKRYDNKIFKQKEQYDFALENPNSEKIQKIKQKIILLEKKKKKLEKTN 161
Query: 77 KKK 79
++K
Sbjct: 162 EEK 164
Score = 28.1 bits (63), Expect = 3.9
Identities = 16/56 (28%), Positives = 30/56 (53%), Gaps = 1/56 (1%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKK 99
E + K+ + K K++++ E ++ +K K+K +KKK +K +K EEK
Sbjct: 111 ESRIKRYDNKIFKQKEQYDFALENPNSEKIQKIKQKIILLEKKK-KKLEKTNEEKS 165
Score = 27.7 bits (62), Expect = 5.1
Identities = 16/56 (28%), Positives = 31/56 (55%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
GK + K+ + KQK++ + + +K +K ++K EKK+KK E+ ++K
Sbjct: 109 GKESRIKRYDNKIFKQKEQYDFALENPNSEKIQKIKQKIILLEKKKKKLEKTNEEK 164
Score = 27.3 bits (61), Expect = 5.7
Identities = 17/64 (26%), Positives = 31/64 (48%), Gaps = 6/64 (9%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQ------KKKEEKKQKKKEKKEEKEEKKEKKKEEK 67
+ GK+ K+ + K ++K+Q EK QK K+K E+KK+K ++
Sbjct: 102 KFGKKFFGKESRIKRYDNKIFKQKEQYDFALENPNSEKIQKIKQKIILLEKKKKKLEKTN 161
Query: 68 KQKK 71
++K
Sbjct: 162 EEKS 165
Score = 27.3 bits (61), Expect = 5.7
Identities = 17/63 (26%), Positives = 34/63 (53%), Gaps = 5/63 (7%)
Query: 38 KQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK-----KQKKKEEKKQ 92
K KK K+ + ++ + K K++++ + + EK +K ++K K+KKK EK
Sbjct: 102 KFGKKFFGKESRIKRYDNKIFKQKEQYDFALENPNSEKIQKIKQKIILLEKKKKKLEKTN 161
Query: 93 KKK 95
++K
Sbjct: 162 EEK 164
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 31.6 bits (71), Expect = 0.25
Identities = 19/79 (24%), Positives = 44/79 (55%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
+K + + + E K +EE+ +KEEE+D+ EE ++ ++ E +++ ++ + +K
Sbjct: 80 DKSDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEK 139
Query: 104 KEEKKNKKAEEQNRQQGLY 122
+EK++ ++N G Y
Sbjct: 140 DDEKESDAEGDENELAGEY 158
Score = 30.4 bits (68), Expect = 0.74
Identities = 22/66 (33%), Positives = 35/66 (53%), Gaps = 1/66 (1%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
E E K EE++ +KEE++ + E+ EE EE E +EE ++ EKD +KE
Sbjct: 89 ELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEE-VEVVEEEYDDDEDSEKDDEKESDA 147
Query: 84 QKKKEE 89
+ + E
Sbjct: 148 EGDENE 153
Score = 28.5 bits (63), Expect = 3.0
Identities = 13/51 (25%), Positives = 33/51 (64%)
Query: 40 KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
K +EE+ +K+E+++E E+ E+ +E + ++E + D+ E+ +K+ + +
Sbjct: 98 KNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAE 148
Score = 26.9 bits (59), Expect = 8.9
Identities = 14/66 (21%), Positives = 38/66 (57%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
G K EEE +++++E++ ++ EE ++ + ++E +++ +K +EK+ E ++
Sbjct: 92 GLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDE 151
Query: 76 DKKKEE 81
++ E
Sbjct: 152 NELAGE 157
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 31.2 bits (71), Expect = 0.26
Identities = 27/64 (42%), Positives = 47/64 (73%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
+ E ++K+ K EK+ + +EE+K++KK+KKE K+EKKEKK +++K + + KKK++
Sbjct: 130 SESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKK 189
Query: 82 KKQK 85
KK+K
Sbjct: 190 KKKK 193
Score = 31.2 bits (71), Expect = 0.28
Identities = 27/64 (42%), Positives = 46/64 (71%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
E + +KE + +KE + E+EEKKEKKK+++ +K+++EK KKE+ + K +KK+KK
Sbjct: 130 SESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKK 189
Query: 95 KEEK 98
K++K
Sbjct: 190 KKKK 193
Score = 29.7 bits (67), Expect = 0.75
Identities = 23/62 (37%), Positives = 44/62 (70%)
Query: 50 KEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
E + ++E K ++E + ++EE+K+KKK+++ +K+K+EKK KK++ + K K++KK
Sbjct: 130 SESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKK 189
Query: 110 KK 111
KK
Sbjct: 190 KK 191
Score = 29.3 bits (66), Expect = 1.00
Identities = 24/55 (43%), Positives = 42/55 (76%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
K +KE E ++++KK+++KK++ K+EKK+KK +K++ E K KKK++KK+KK
Sbjct: 139 TTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 28.1 bits (63), Expect = 2.4
Identities = 26/71 (36%), Positives = 46/71 (64%)
Query: 41 KKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
+ + E E++ K +K+ E ++++++EK KKKE KK+KK+++ K++K E K
Sbjct: 123 GPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKG 182
Query: 101 KKKKEEKKNKK 111
KKK++KK KK
Sbjct: 183 SKKKKKKKKKK 193
Score = 28.1 bits (63), Expect = 2.5
Identities = 22/68 (32%), Positives = 49/68 (72%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
E ++ + +KE + +KE + +++++K+++K+++ +K+K+EKK KKE+ + K +K
Sbjct: 126 SELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185
Query: 83 KQKKKEEK 90
K+KKK++K
Sbjct: 186 KKKKKKKK 193
Score = 27.4 bits (61), Expect = 4.7
Identities = 21/69 (30%), Positives = 45/69 (65%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
+ E + +K+ K +KE E +++++++K+KKKE KK+KK+++ K++K E K
Sbjct: 123 GPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKG 182
Query: 109 NKKAEEQNR 117
+KK +++ +
Sbjct: 183 SKKKKKKKK 191
Score = 27.4 bits (61), Expect = 4.7
Identities = 28/74 (37%), Positives = 51/74 (68%)
Query: 6 APKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
AP G S + + +KE K +++ + E+++KK+++KK++ K++K+EK++KKEK E
Sbjct: 120 APDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVE 179
Query: 66 EKKQKKEEEKDKKK 79
K KK+++K KKK
Sbjct: 180 PKGSKKKKKKKKKK 193
>gnl|CDD|232988 TIGR00468, pheS, phenylalanyl-tRNA synthetase, alpha subunit.
Most phenylalanyl-tRNA synthetases are heterodimeric,
with 2 alpha (pheS) and 2 beta (pheT) subunits. This
model describes the alpha subunit, which shows some
similarity to class II aminoacyl-tRNA ligases.
Mitochondrial phenylalanyl-tRNA synthetase is a single
polypeptide chain, active as a monomer, and similar to
this chain rather than to the beta chain, but excluded
from this model. An interesting feature of the
alignment of all sequences captured by this model is a
deep split between non-spirochete bacterial examples
and all other examples; supporting this split is a
relative deletion of about 50 residues in the former
set between two motifs well conserved throughout the
alignment [Protein synthesis, tRNA aminoacylation].
Length = 293
Score = 31.5 bits (72), Expect = 0.26
Identities = 15/56 (26%), Positives = 23/56 (41%)
Query: 40 KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKK 95
K K+ KQ K KEE + E K + ++E K K E + K + +
Sbjct: 1 KLKDLLKQLGKLTKEETKPALGALINEVKIELQDELTKLKPELESAGLWSKLKFET 56
>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
Length = 773
Score = 31.8 bits (73), Expect = 0.26
Identities = 20/73 (27%), Positives = 48/73 (65%), Gaps = 2/73 (2%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE--KKKEEKKQKKEEEK 75
R+K+KK +E K K KK ++ +E+++ ++E+K+E+ + K K ++++++EE++
Sbjct: 490 RRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKE 549
Query: 76 DKKKEEKKQKKKE 88
+K KE ++ + +
Sbjct: 550 EKDKETEEDEPEG 562
Score = 29.8 bits (68), Expect = 1.1
Identities = 19/73 (26%), Positives = 47/73 (64%)
Query: 40 KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKK 99
++KE+K +++ + + KK ++ +E+++ +EEEKD++ K + K+++++EEK+
Sbjct: 490 RRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKE 549
Query: 100 QKKKKEEKKNKKA 112
+K K+ E+ +
Sbjct: 550 EKDKETEEDEPEG 562
>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
unknown].
Length = 833
Score = 31.9 bits (72), Expect = 0.28
Identities = 20/104 (19%), Positives = 40/104 (38%), Gaps = 2/104 (1%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEE--KKEKKKEEKKQKK 71
K + K+ +++E K K + Q K K +E+ ++E + K
Sbjct: 421 ERRDILKNLLADAEKRTARQEELNKALAKAKILQADKAAKAYQEDILQREAQSRGKTAAA 480
Query: 72 EEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
E +++ K +++ K+K ++ KAEEQ
Sbjct: 481 ERSQEQMTAALKALLAFQQQIADLSGAKEKASDQKSLLWKAEEQ 524
Score = 30.8 bits (69), Expect = 0.64
Identities = 24/140 (17%), Positives = 53/140 (37%), Gaps = 18/140 (12%)
Query: 6 APKGAISPRIGKRKKKKKEEENKKKQKKKEEK--KQKKKEEKKQKKKEKKEEKEEKKEKK 63
+ A + K K K + K K +E +++ + K E+ +E+ K
Sbjct: 434 EKRTARQEELNKALAKAKILQADKAAKAYQEDILQREAQSRGKTAAAERSQEQMTAALKA 493
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEEK----------KQKKKEEKKQ------KKKKEEK 107
+Q+ + K++ QK K + K+++ ++Q KK+ E
Sbjct: 494 LLAFQQQIADLSGAKEKASDQKSLLWKAEEQYALLKEEAKQRQLQEQKALLEHKKETLEY 553
Query: 108 KNKKAEEQNRQQGLYPNQKR 127
++ AE ++Q + +
Sbjct: 554 TSQLAELLDQQADRFELSAQ 573
Score = 27.7 bits (61), Expect = 5.6
Identities = 15/82 (18%), Positives = 39/82 (47%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K+K ++ K +E+ K+E K+++ +E+K E KKE + + + ++
Sbjct: 507 AKEKASDQKSLLWKAEEQYALLKEEAKQRQLQEQKALLEHKKETLEYTSQLAELLDQQAD 566
Query: 79 KEEKKQKKKEEKKQKKKEEKKQ 100
+ E + +K++ + ++
Sbjct: 567 RFELSAQAAGSQKERGSDLYRE 588
Score = 27.7 bits (61), Expect = 6.3
Identities = 12/92 (13%), Positives = 40/92 (43%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
E++ + +E+++ + +++ +EE + + K + ++ +E+
Sbjct: 408 AREDEAEVLSTQEERRDILKNLLADAEKRTARQEELNKALAKAKILQADKAAKAYQEDIL 467
Query: 84 QKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
Q++ + + + E+ Q++ K A +Q
Sbjct: 468 QREAQSRGKTAAAERSQEQMTAALKALLAFQQ 499
>gnl|CDD|205286 pfam13105, DUF3959, Protein of unknown function (DUF3959). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 260 amino acids in length.
Length = 242
Score = 31.4 bits (71), Expect = 0.29
Identities = 23/89 (25%), Positives = 35/89 (39%), Gaps = 4/89 (4%)
Query: 17 KRKKKKKEEENKKKQKKKEEK---KQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
K K KE N KQKKK +K K + K K K K ++ K + + EE
Sbjct: 153 KEKDDNKENINLFKQKKKRKKLSFKIRLPRLPKLKMKLFKFGRKPSNSKDPRKYEHNYEE 212
Query: 74 E-KDKKKEEKKQKKKEEKKQKKKEEKKQK 101
E + KE Q + ++++
Sbjct: 213 NVPMYHMTEPIDQYKEPTTQGQTRMERRR 241
Score = 28.7 bits (64), Expect = 2.3
Identities = 19/86 (22%), Positives = 37/86 (43%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
+++K++ +E K++KK+KK K + K K K K +K K +K E
Sbjct: 151 HRKEKDDNKENINLFKQKKKRKKLSFKIRLPRLPKLKMKLFKFGRKPSNSKDPRKYEHNY 210
Query: 109 NKKAEEQNRQQGLYPNQKRTLIGYLR 134
+ + + + ++ T G R
Sbjct: 211 EENVPMYHMTEPIDQYKEPTTQGQTR 236
>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 594
Score = 31.6 bits (72), Expect = 0.29
Identities = 22/128 (17%), Positives = 44/128 (34%), Gaps = 6/128 (4%)
Query: 4 SIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK 63
K I + ++K K QK K+ K+ E K ++ EK + K +K
Sbjct: 454 KDKNKPFIKLILLSNNSQEKAVL-KLLQKIKKNNDFIKEFETDALKLQEILEKVDSKSEK 512
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEE-----KKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
E+ K + K+ E K +E + + K ++K++ + +
Sbjct: 513 LEKISAKIDNIKELFDESKMSILQEILQPALHHIELNIARIYKLNIKDKEDFLNKLYIKA 572
Query: 119 QGLYPNQK 126
+
Sbjct: 573 HLEWLEII 580
Score = 31.6 bits (72), Expect = 0.31
Identities = 24/103 (23%), Positives = 42/103 (40%), Gaps = 2/103 (1%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
RI + +E +K K + K K +EK K +K KK + ++ E
Sbjct: 435 RIKGTIELPFKEVCEKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFET 494
Query: 74 EKDKKKE--EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+ K +E EK K E+ ++ + K+ +E K +E
Sbjct: 495 DALKLQEILEKVDSKSEKLEKISAKIDNIKELFDESKMSILQE 537
Score = 28.5 bits (64), Expect = 2.6
Identities = 18/79 (22%), Positives = 34/79 (43%), Gaps = 1/79 (1%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEE-KKEKKKEE 66
KG I + +K +++ K K +E+ K +K ++ + KE + +
Sbjct: 437 KGTIELPFKEVCEKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDA 496
Query: 67 KKQKKEEEKDKKKEEKKQK 85
K ++ EK K EK +K
Sbjct: 497 LKLQEILEKVDSKSEKLEK 515
>gnl|CDD|227881 COG5594, COG5594, Uncharacterized integral membrane protein
[Function unknown].
Length = 827
Score = 31.6 bits (72), Expect = 0.31
Identities = 21/107 (19%), Positives = 35/107 (32%), Gaps = 9/107 (8%)
Query: 15 IGKRKK---------KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
+R K + + KK K KK K ++K+ E E K++ K
Sbjct: 254 YKERDKILKKLEKALNELLNKLLKKSHLKTNKKSGKLTPSRKKEFEILPEYVPDKKRPKH 313
Query: 66 EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
K+ + KK + + K + E +K E K
Sbjct: 314 RIKKLNKGIFGKKVDAIDYYSAKLTKLDAEIENARKSLYENTPTKSG 360
Score = 29.7 bits (67), Expect = 1.1
Identities = 21/70 (30%), Positives = 35/70 (50%), Gaps = 3/70 (4%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK---KQKKKEEKKQKKKEEKKQ 100
++ K+++K +K EK + K KK K KK K +KK+ E + +KK+
Sbjct: 251 QELYKERDKILKKLEKALNELLNKLLKKSHLKTNKKSGKLTPSRKKEFEILPEYVPDKKR 310
Query: 101 KKKKEEKKNK 110
K + +K NK
Sbjct: 311 PKHRIKKLNK 320
>gnl|CDD|225087 COG2176, PolC, DNA polymerase III, alpha subunit (gram-positive
type) [DNA replication, recombination, and repair].
Length = 1444
Score = 31.5 bits (72), Expect = 0.32
Identities = 21/90 (23%), Positives = 45/90 (50%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
+E+E +K ++ E+ +K +E + +K+ K E + ++ K QK + K ++ +
Sbjct: 171 SEEQEFEKFEEAINEEVEKAAQEALEAEKKLKAESPKVEKPKPLFDGQKGRKIKSTEEIK 230
Query: 82 KKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
K EE+ + K E K + +E K+ +
Sbjct: 231 PLIKINEEETRVKVEGYIFKIEIKELKSGR 260
Score = 28.1 bits (63), Expect = 4.4
Identities = 20/87 (22%), Positives = 42/87 (48%), Gaps = 1/87 (1%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
K K + ++K Q + + E + EE++ +K EE ++ E+ ++ E ++K K
Sbjct: 143 KNKSPELQKKLQSFGFPQLLIEFEVNDISEEQEFEKFEEAINEEVEKAAQEALEAEKKLK 202
Query: 88 EEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+ K ++ K ++ +K K EE
Sbjct: 203 -AESPKVEKPKPLFDGQKGRKIKSTEE 228
Score = 27.7 bits (62), Expect = 5.5
Identities = 17/78 (21%), Positives = 38/78 (48%)
Query: 33 KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQ 92
+ E +++E ++ ++ EE E+ ++ E +K+ K E +K + ++ +K
Sbjct: 164 EFEVNDISEEQEFEKFEEAINEEVEKAAQEALEAEKKLKAESPKVEKPKPLFDGQKGRKI 223
Query: 93 KKKEEKKQKKKKEEKKNK 110
K EE K K E++ +
Sbjct: 224 KSTEEIKPLIKINEEETR 241
>gnl|CDD|217010 pfam02387, IncFII_repA, IncFII RepA protein family. This protein
is plasmid encoded and found to be essential for plasmid
replication.
Length = 279
Score = 31.2 bits (71), Expect = 0.32
Identities = 20/69 (28%), Positives = 35/69 (50%), Gaps = 1/69 (1%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+K++ E ENKK KKK E +++ K+ E + ++K K+++ K K
Sbjct: 172 RKQQLEWENKKL-KKKGLIPLTLDEARRRAKEFHIERAFSYRTERKAFGKKRRRARKLAK 230
Query: 79 KEEKKQKKK 87
+EK +KK
Sbjct: 231 LDEKDIRKK 239
Score = 28.9 bits (65), Expect = 1.8
Identities = 15/75 (20%), Positives = 37/75 (49%), Gaps = 1/75 (1%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK-EKKKEEKKQKKEEEKDKKKEEKKQKKKE 88
++K +KQ+ + E K+ KK+ + ++ +E ++ +++ +K++
Sbjct: 165 EEKLNSARKQQLEWENKKLKKKGLIPLTLDEARRRAKEFHIERAFSYRTERKAFGKKRRR 224
Query: 89 EKKQKKKEEKKQKKK 103
+K K +EK +KK
Sbjct: 225 ARKLAKLDEKDIRKK 239
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 31.2 bits (71), Expect = 0.33
Identities = 22/68 (32%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
++E KK + EE + K+ E KQ K K +K++K E+ K+E K+ KEE + K
Sbjct: 36 DDERKKLLSEIEELQAKRNELSKQIGK-AKGQKKDKIEEIKKELKELKEELTELSAALKA 94
Query: 84 QKKKEEKK 91
+ + + K
Sbjct: 95 LEAELQDK 102
Score = 30.8 bits (70), Expect = 0.44
Identities = 18/62 (29%), Positives = 32/62 (51%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+K + EE + K+ + ++ K K +KK K +E K+E +E KE+ E K E +
Sbjct: 40 KKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKALEAEL 99
Query: 78 KK 79
+
Sbjct: 100 QD 101
Score = 28.9 bits (65), Expect = 2.1
Identities = 18/68 (26%), Positives = 34/68 (50%), Gaps = 3/68 (4%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+RKK E E + K + K+ + K +KK+K EE +++ ++ KEE + K
Sbjct: 38 ERKKLLSEIEE---LQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKA 94
Query: 77 KKKEEKKQ 84
+ E + +
Sbjct: 95 LEAELQDK 102
Score = 28.1 bits (63), Expect = 3.8
Identities = 24/93 (25%), Positives = 42/93 (45%), Gaps = 2/93 (2%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
KE K + +K + E KK + E+ Q K E K+ + K +KK++
Sbjct: 12 DLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKI 71
Query: 91 KQKKKEEKKQKKKKEEKKN--KKAEEQNRQQGL 121
++ KKE K+ K++ E K E + + + L
Sbjct: 72 EEIKKELKELKEELTELSAALKALEAELQDKLL 104
Score = 26.9 bits (60), Expect = 9.9
Identities = 21/92 (22%), Positives = 42/92 (45%), Gaps = 2/92 (2%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
+ K+ K + E+ E+K+ E +E + + + K + K K +KK
Sbjct: 12 DLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSK--QIGKAKGQKKD 69
Query: 85 KKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
K +E KK+ K+ +++ + K +AE Q+
Sbjct: 70 KIEEIKKELKELKEELTELSAALKALEAELQD 101
>gnl|CDD|212469 cd01722, Sm_F, Sm protein F. The eukaryotic Sm proteins (B/B', D1,
D2, D3, E, F and G) assemble into a hetero-heptameric
ring around the Sm site of the 2,2,7-trimethyl guanosine
(m3G) capped U1, U2, U4 and U5 snRNAs (Sm snRNAs)
forming the core of the snRNP particle. The snRNP
particle, in turn, assembles with other components onto
the pre-mRNA to form the spliceosome which is
responsible for the excision of introns and the ligation
of exons. Members of this family share a highly
conserved Sm fold containing an N-terminal helix
followed by a strongly bent five-stranded antiparallel
beta-sheet. Sm subunit F is capable of forming both
homo- and hetero-heptamer ring structures. To form the
hetero-heptamer, Sm subunit F initially binds subunits E
and G to form a trimer which then assembles onto snRNA
along with the D3/B and D1/D2 heterodimers.
Length = 69
Score = 29.1 bits (66), Expect = 0.36
Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 3/47 (6%)
Query: 131 GYLRSVDQFANLVLHKTIERIHVGNQYGDIPRGIFIIRGENVVLMGE 177
G L SVD + NL L T E I G G++ G +IR NV+ + E
Sbjct: 26 GTLVSVDSYMNLQLANTEEYID-GKFTGNL--GEVLIRCNNVLYIRE 69
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 31.4 bits (71), Expect = 0.38
Identities = 18/52 (34%), Positives = 29/52 (55%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKK 68
K+KKKKK +E KKK+ ++ K+ E ++ + EK + + K EKK
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKK 736
Score = 31.4 bits (71), Expect = 0.40
Identities = 18/53 (33%), Positives = 32/53 (60%)
Query: 39 QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
+KKK++KK K+K+KK+ +E K+ E ++ E + +K + + K EKK
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKK 736
Score = 30.6 bits (69), Expect = 0.70
Identities = 20/52 (38%), Positives = 29/52 (55%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
KKK+KKK ++K+KKK + K+ E EE E EK + + K +KK
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKK 736
Score = 30.2 bits (68), Expect = 0.78
Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 1/53 (1%)
Query: 47 QKKKEKKEEKEEKKEKKKEEKKQKKE-EEKDKKKEEKKQKKKEEKKQKKKEEK 98
+KKK+KK+ KE+KK+K +E KQ E ++ E +K E Q K E+K
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKK 736
Score = 29.5 bits (66), Expect = 1.7
Identities = 19/53 (35%), Positives = 31/53 (58%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
+KKK++KK K+K++KK ++ K+ E E+ E +K E + K EKK
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKK 736
Score = 28.3 bits (63), Expect = 3.2
Identities = 11/23 (47%), Positives = 21/23 (91%)
Query: 93 KKKEEKKQKKKKEEKKNKKAEEQ 115
+KK++KK+ K+K++KKN++A +Q
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 27.5 bits (61), Expect = 5.8
Identities = 12/23 (52%), Positives = 19/23 (82%)
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQ 100
+K++KK+K KE+KK+K +E KQ
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 27.5 bits (61), Expect = 5.9
Identities = 17/53 (32%), Positives = 31/53 (58%)
Query: 38 KQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
++KKK++K ++KK+KK + K+ E ++ E ++K E Q K E+K
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKK 736
Score = 27.1 bits (60), Expect = 9.6
Identities = 15/35 (42%), Positives = 25/35 (71%), Gaps = 1/35 (2%)
Query: 87 KEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
+++KK+KK +EKK+KK +E K A E N ++G+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHALE-NIEEGI 717
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 31.2 bits (71), Expect = 0.39
Identities = 22/100 (22%), Positives = 59/100 (59%), Gaps = 1/100 (1%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+++++ E+ +++++ E +K KK ++ ++ E EE E K+E ++ K+E EK +
Sbjct: 398 KEEERPREKEGTEEEERREITVYEKRIKKLEETVERLE-EENSELKRELEELKREIEKLE 456
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
+ E+ +++ +K +K +E + + ++ E + + E++ R
Sbjct: 457 SELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKR 496
Score = 29.3 bits (66), Expect = 1.9
Identities = 17/84 (20%), Positives = 53/84 (63%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
++ EEEN + +++ EE K++ ++ + + ++ ++E +++ ++ ++ + ++ E +K
Sbjct: 429 ETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEK 488
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQK 101
+ EEKK++ +E +++ + K +K
Sbjct: 489 ELEEKKKRVEELERKLAELRKMRK 512
Score = 27.4 bits (61), Expect = 7.4
Identities = 17/101 (16%), Positives = 58/101 (57%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
R +K+ ++ ++ ++ EE+ + K E ++ K+E ++ + E + ++E + + +++
Sbjct: 414 RREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKD 473
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+ + ++ + ++ ++E ++KKK ++ ++K E + + E
Sbjct: 474 REIRARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLE 514
>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 145
Score = 30.1 bits (68), Expect = 0.40
Identities = 7/42 (16%), Positives = 17/42 (40%), Gaps = 1/42 (2%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK 63
+K + + +Q E + + +K+EE +K +
Sbjct: 104 RKNDAHTHASILLSSNEQNSTEALQLRA-KKREEHRKKWYQN 144
Score = 28.6 bits (64), Expect = 1.4
Identities = 7/34 (20%), Positives = 12/34 (35%)
Query: 81 EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+Q E + + KK E+ KK +
Sbjct: 110 THASILLSSNEQNSTEALQLRAKKREEHRKKWYQ 143
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 31.0 bits (70), Expect = 0.42
Identities = 25/77 (32%), Positives = 44/77 (57%), Gaps = 6/77 (7%)
Query: 44 EKKQKKKEK---KEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK---KEE 97
EK ++ EK KE ++ ++E +KE +K KEEE K+ ++ KE ++ ++ KE
Sbjct: 367 EKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFLKEL 426
Query: 98 KKQKKKKEEKKNKKAEE 114
K++K+ EK +A E
Sbjct: 427 SKEEKELLEKLKMEASE 443
>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
protein 2. THO and TREX form a eukaryotic complex
which functions in messenger ribonucleoprotein
metabolism and plays a role in preventing the
transcription-associated genetic instability. Tho2,
along with four other subunits forms THO.
Length = 296
Score = 30.8 bits (70), Expect = 0.43
Identities = 16/55 (29%), Positives = 30/55 (54%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
++ ++ +K+ ++ +KKKEKK K K+ ++E KK + EK KK+
Sbjct: 30 DKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKR 84
Score = 30.4 bits (69), Expect = 0.65
Identities = 21/66 (31%), Positives = 35/66 (53%), Gaps = 1/66 (1%)
Query: 45 KKQKKKEKKE-EKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
K+ KE + EK+ K+ KK++EK + K K+ ++E KK + EK +K+
Sbjct: 26 KELYDKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRL 85
Query: 104 KEEKKN 109
EEK +
Sbjct: 86 SEEKSS 91
Score = 29.6 bits (67), Expect = 0.94
Identities = 20/66 (30%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 37 KKQKKKEEKKQKKK-EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKK 95
K+ KE ++ +K+ ++ + +KKK+EKK+ K K ++E KK + EK +K+
Sbjct: 26 KELYDKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRL 85
Query: 96 EEKKQK 101
E+K
Sbjct: 86 SEEKSS 91
Score = 28.8 bits (65), Expect = 2.1
Identities = 14/68 (20%), Positives = 37/68 (54%), Gaps = 3/68 (4%)
Query: 29 KKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKE 88
K+ KE ++ +K+ ++ ++K+++K++ K K+ +EE KK + +K +
Sbjct: 26 KELYDKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEE---LKKHIEHNEKTK 82
Query: 89 EKKQKKKE 96
++ ++K
Sbjct: 83 KRLSEEKS 90
Score = 28.4 bits (64), Expect = 2.3
Identities = 20/61 (32%), Positives = 32/61 (52%), Gaps = 2/61 (3%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK--EEEKD 76
K+ ++ E+ K+ KKK+EKK+ K K+ +EE K+ + +K KK EEK
Sbjct: 31 KEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEEKS 90
Query: 77 K 77
Sbjct: 91 S 91
>gnl|CDD|218010 pfam04293, SpoVR, SpoVR like protein. Bacillus subtilis stage V
sporulation protein R is involved in spore cortex
formation. Little is known about cortex biosynthesis,
except that it depends on several sigma E controlled
genes, including spoVR.
Length = 427
Score = 31.0 bits (71), Expect = 0.45
Identities = 10/60 (16%), Positives = 29/60 (48%)
Query: 11 ISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
+ K +EE+ +++++++ + Q + + +K E+E + K+ E+ Q+
Sbjct: 166 VDRYKRPIKLSLEEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARRKRFPEEPQE 225
Score = 27.2 bits (61), Expect = 7.8
Identities = 10/56 (17%), Positives = 32/56 (57%)
Query: 46 KQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQK 101
K+ K EE++ ++E+++E + + + ++++ ++E ++K+ E+ Q+
Sbjct: 170 KRPIKLSLEEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARRKRFPEEPQE 225
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 31.0 bits (70), Expect = 0.45
Identities = 17/65 (26%), Positives = 41/65 (63%), Gaps = 7/65 (10%)
Query: 57 EEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE-KKQKKKEEKKQKKKK------EEKKN 109
E ++ ++K E++Q++ +++ K K E+++K KEE KQ++ E Q+ ++ + K +
Sbjct: 4 ELRRAREKLEREQRERKQRAKLKLERERKAKEEAAKQREAIEAAQRSRRLDAIEAQIKAD 63
Query: 110 KKAEE 114
++ +E
Sbjct: 64 QQMQE 68
Score = 28.7 bits (64), Expect = 2.9
Identities = 13/65 (20%), Positives = 37/65 (56%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
E + ++K + E++++K+ K + ++E+K ++E K+++ E Q+ + + K
Sbjct: 4 ELRRAREKLEREQRERKQRAKLKLERERKAKEEAAKQREAIEAAQRSRRLDAIEAQIKAD 63
Query: 85 KKKEE 89
++ +E
Sbjct: 64 QQMQE 68
Score = 27.9 bits (62), Expect = 4.4
Identities = 16/65 (24%), Positives = 38/65 (58%), Gaps = 2/65 (3%)
Query: 35 EEKKQKKKEEKKQK-KKEKKEEKEEKKEKKKEE-KKQKKEEEKDKKKEEKKQKKKEEKKQ 92
E ++ ++K E++Q+ +K++ + K E++ K KEE KQ++ E ++ + + K
Sbjct: 4 ELRRAREKLEREQRERKQRAKLKLERERKAKEEAAKQREAIEAAQRSRRLDAIEAQIKAD 63
Query: 93 KKKEE 97
++ +E
Sbjct: 64 QQMQE 68
Score = 27.1 bits (60), Expect = 8.1
Identities = 16/68 (23%), Positives = 36/68 (52%), Gaps = 5/68 (7%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
R + K ++++ E K++ K K E+++K KEE K+ E E ++ + + +
Sbjct: 6 RRAREKLEREQRERKQRAKLKLERERKAKEEAA-----KQREAIEAAQRSRRLDAIEAQI 60
Query: 74 EKDKKKEE 81
+ D++ +E
Sbjct: 61 KADQQMQE 68
>gnl|CDD|138835 PRK12279, PRK12279, 50S ribosomal protein L22/unknown domain fusion
protein; Provisional.
Length = 311
Score = 30.8 bits (69), Expect = 0.45
Identities = 22/79 (27%), Positives = 42/79 (53%)
Query: 11 ISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
+S + +R+K+ + KK +K + + K E KK K K E K +K++ + + K
Sbjct: 108 LSDDVNEREKELAAIKAKKSKKPLPVEPKAKVETKKVAKPSKVEVKPVEKDENVDPELLK 167
Query: 71 KEEEKDKKKEEKKQKKKEE 89
+E++ K E+ +K+EE
Sbjct: 168 REQQVLKVVEKTASQKEEE 186
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 30.9 bits (70), Expect = 0.50
Identities = 17/94 (18%), Positives = 45/94 (47%)
Query: 10 AISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ 69
A + + KKK ++ ++ K + + + E + +E + ++ E
Sbjct: 275 AANDEPRRVFKKKGQKRTTRRVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGY 334
Query: 70 KKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
++D + E+ ++ K+E +K++K ++K +K+K
Sbjct: 335 MGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRK 368
Score = 26.7 bits (59), Expect = 9.9
Identities = 13/82 (15%), Positives = 38/82 (46%), Gaps = 1/82 (1%)
Query: 32 KKKEEKKQKKKEE-KKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
KKK +K+ ++ + + + K E + + +E K ++ + + +E
Sbjct: 285 KKKGQKRTTRRVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDED 344
Query: 91 KQKKKEEKKQKKKKEEKKNKKA 112
+ ++ +++ +KK++ KK +
Sbjct: 345 EDDEESKEEVEKKQKVKKKPRK 366
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 30.9 bits (71), Expect = 0.52
Identities = 10/45 (22%), Positives = 19/45 (42%)
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
EK +K D ++ KK K +K +++ +K+ K
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 30.1 bits (69), Expect = 0.80
Identities = 11/43 (25%), Positives = 16/43 (37%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK 60
K +K K KK K +K K++ EK+ K
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 29.8 bits (68), Expect = 1.3
Identities = 13/60 (21%), Positives = 26/60 (43%)
Query: 40 KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKK 99
K +E + +E E EK EK + ++ KK K +K +++ +++ K
Sbjct: 798 KGRETRAPTVEEALELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGK 857
Score = 29.4 bits (67), Expect = 1.7
Identities = 8/41 (19%), Positives = 16/41 (39%)
Query: 73 EEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
EK +K ++ KK K +K K++ + +
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKD 855
Score = 29.4 bits (67), Expect = 1.7
Identities = 9/44 (20%), Positives = 18/44 (40%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKK 68
E +K + + K KK K ++ K++ +K+ K
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 28.2 bits (64), Expect = 3.5
Identities = 9/44 (20%), Positives = 17/44 (38%)
Query: 36 EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
+K +K + K+ + ++ K+Q EKD K
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 31.1 bits (71), Expect = 0.53
Identities = 16/80 (20%), Positives = 43/80 (53%), Gaps = 4/80 (5%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK----QKKKEEKKQKKKEEK 98
E + ++K + + E ++++ + E E+ +++ E + Q+K E Q++K+++
Sbjct: 159 ELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAETSQERKQKR 218
Query: 99 KQKKKKEEKKNKKAEEQNRQ 118
K+ + K+ + +EE+ R
Sbjct: 219 KEITDQAAKRLELSEEETRI 238
Score = 30.3 bits (69), Expect = 0.83
Identities = 12/85 (14%), Positives = 43/85 (50%)
Query: 36 EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKK 95
E + +Q+ K++ E + +K + + + ++++ E + EEK+Q+ +
Sbjct: 138 EDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELE 197
Query: 96 EEKKQKKKKEEKKNKKAEEQNRQQG 120
+ +Q ++K + +++ +++ ++
Sbjct: 198 AQLEQLQEKAAETSQERKQKRKEIT 222
Score = 29.5 bits (67), Expect = 1.4
Identities = 14/87 (16%), Positives = 44/87 (50%)
Query: 29 KKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKE 88
+ + Q++ KQ+ + + EK + + + ++++ E + E+KQ++ E
Sbjct: 138 EDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELE 197
Query: 89 EKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+ ++ +E+ + ++ ++K K+ +Q
Sbjct: 198 AQLEQLQEKAAETSQERKQKRKEITDQ 224
Score = 28.0 bits (63), Expect = 5.1
Identities = 18/81 (22%), Positives = 46/81 (56%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
++E +++ E + ++K + + + +++E + + E+KQ++ E + ++ +EK
Sbjct: 148 QQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEKA 207
Query: 84 QKKKEEKKQKKKEEKKQKKKK 104
+ +E+KQK+KE Q K+
Sbjct: 208 AETSQERKQKRKEITDQAAKR 228
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 30.6 bits (70), Expect = 0.57
Identities = 18/67 (26%), Positives = 36/67 (53%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
K+ K + ++KE+KK+KKK +KK+ K + + + KE +++ + K + +
Sbjct: 386 KQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQDLELLKNVWGEILE 445
Query: 80 EEKKQKK 86
K Q+K
Sbjct: 446 SLKAQRK 452
Score = 30.2 bits (69), Expect = 0.80
Identities = 17/50 (34%), Positives = 27/50 (54%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKK 68
K + ++K+KKKE+KK KKK+ K + K K KE ++ + K
Sbjct: 389 KAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQDLELLKN 438
Score = 28.7 bits (65), Expect = 2.3
Identities = 17/60 (28%), Positives = 33/60 (55%)
Query: 55 EKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
++ E+ E++ ++ K + ++KE+KK+KKK +KK+ K K K +E + E
Sbjct: 376 QRVEQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQDLEL 435
Score = 27.9 bits (63), Expect = 4.3
Identities = 20/74 (27%), Positives = 38/74 (51%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
+++ ++ + E+K+KKKE+KK KKK+ K + + + KE +Q E K+
Sbjct: 382 EQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQDLELLKNVWG 441
Query: 80 EEKKQKKKEEKKQK 93
E + K + K +
Sbjct: 442 EILESLKAQRKSLR 455
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 30.3 bits (69), Expect = 0.59
Identities = 18/97 (18%), Positives = 58/97 (59%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
KE+ + ++ K E + +++ ++++K+E++ + +++ ++ K+ ++ E +++K
Sbjct: 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLL 260
Query: 82 KKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+Q++ E K +++EE ++ K E ++ + E Q+ +
Sbjct: 261 AEQERMLEHKLQEQEELLKEGFKTEAESLQKEIQDLK 297
Score = 29.9 bits (68), Expect = 0.79
Identities = 18/85 (21%), Positives = 49/85 (57%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ ++ + E + +++ +KQK++E+ + ++ +E ++ +K E +++K E+++
Sbjct: 207 EAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERM 266
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKK 103
E K Q+++E K+ K E + +K
Sbjct: 267 LEHKLQEQEELLKEGFKTEAESLQK 291
Score = 28.0 bits (63), Expect = 4.0
Identities = 16/75 (21%), Positives = 39/75 (52%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
K+K E + K E E ++E ++K++EE+ + +++ ++ K+ +K E +++K
Sbjct: 200 TAKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKL 259
Query: 104 KEEKKNKKAEEQNRQ 118
E++ + Q
Sbjct: 260 LAEQERMLEHKLQEQ 274
Score = 27.6 bits (62), Expect = 4.5
Identities = 15/77 (19%), Positives = 43/77 (55%), Gaps = 3/77 (3%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
++K + E+ + + + E++ +KQK+EE+ E +++ +E KQ ++ + +++
Sbjct: 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMM---EAQERSYQEHVKQLIEKMEAERE 257
Query: 103 KKEEKKNKKAEEQNRQQ 119
K ++ + E + ++Q
Sbjct: 258 KLLAEQERMLEHKLQEQ 274
>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 881
Score = 30.7 bits (69), Expect = 0.60
Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
+ +KKQ+ +E+ Q EK+ KK+KKKEE++Q EE+ K + +
Sbjct: 2 VDESDKKQQTIDEQSQISPEKQTPN--KKDKKKEEEEQLSEEDAMLKGDLE 50
Score = 29.9 bits (67), Expect = 1.2
Identities = 17/50 (34%), Positives = 31/50 (62%), Gaps = 2/50 (4%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+E+ KKQ+ +E+ Q E++ KK+KK+E+EE + +E+ K + E
Sbjct: 3 DESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEE--QLSEEDAMLKGDLE 50
Score = 28.4 bits (63), Expect = 3.3
Identities = 15/47 (31%), Positives = 28/47 (59%), Gaps = 3/47 (6%)
Query: 55 EKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQK 101
++ +KK++ +E+ Q E++ KK KKKEE++Q +E+ K
Sbjct: 3 DESDKKQQTIDEQSQISPEKQTPN---KKDKKKEEEEQLSEEDAMLK 46
Score = 27.2 bits (60), Expect = 8.8
Identities = 14/43 (32%), Positives = 27/43 (62%), Gaps = 2/43 (4%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQK 93
+KK++ +++ + EK+ + KDKKKEE++Q +E+ K
Sbjct: 6 DKKQQTIDEQSQISPEKQTPNK--KDKKKEEEEQLSEEDAMLK 46
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 30.6 bits (70), Expect = 0.60
Identities = 14/60 (23%), Positives = 30/60 (50%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+RK ++ EE + ++ + ++ + + + +E E +E KEK KE + E E +
Sbjct: 38 RRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEKLKELEAALDELEAE 97
Score = 30.3 bits (69), Expect = 0.66
Identities = 11/81 (13%), Positives = 39/81 (48%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
KR + + + ++ K ++ EE + ++ E +E ++ +++ ++ E K+
Sbjct: 20 KRGGDALDVDKLLELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKE 79
Query: 77 KKKEEKKQKKKEEKKQKKKEE 97
K++ K+ + ++ + + +
Sbjct: 80 LKEKLKELEAALDELEAELDT 100
Score = 28.7 bits (65), Expect = 2.4
Identities = 16/83 (19%), Positives = 41/83 (49%), Gaps = 2/83 (2%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD--KKK 79
+++ + + +K + EE+++ +E +E + E+ E KE + K E D +
Sbjct: 15 REKLKKRGGDALDVDKLLELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELI 74
Query: 80 EEKKQKKKEEKKQKKKEEKKQKK 102
E K+ K++ K+ + ++ + +
Sbjct: 75 AEVKELKEKLKELEAALDELEAE 97
Score = 28.7 bits (65), Expect = 2.5
Identities = 10/44 (22%), Positives = 23/44 (52%)
Query: 10 AISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKK 53
+S IG+ K+ +++ + + KE K++ K+ E + E +
Sbjct: 54 ELSKEIGRALKRGEDDAEELIAEVKELKEKLKELEAALDELEAE 97
>gnl|CDD|212464 cd01717, Sm_B, Sm protein B. The eukaryotic Sm proteins (B/B', D1,
D2, D3, E, F and G) assemble into a hetero-heptameric
ring around the Sm site of the 2,2,7-trimethyl guanosine
(m3G) capped U1, U2, U4 and U5 snRNAs (Sm snRNAs)
forming the core of the snRNP particle. The snRNP
particle, in turn, assembles with other components onto
the pre-mRNA to form the spliceosome which is
responsible for the excision of introns and the ligation
of exons. Members of this family share a highly
conserved Sm fold, containing an N-terminal helix
followed by a strongly bent five-stranded antiparallel
beta-sheet.
Length = 80
Score = 28.7 bits (65), Expect = 0.62
Identities = 16/57 (28%), Positives = 24/57 (42%), Gaps = 8/57 (14%)
Query: 127 RTLIGYLRSVDQFANLVLHKTIERIHVGNQYGDI--------PRGIFIIRGENVVLM 175
R +G + D+ NLVL E + + G+ ++RGENVV M
Sbjct: 21 RQFVGTFLAFDKHMNLVLSDCEEFRKIKPKKKKKGEEREEKRVLGLVLLRGENVVSM 77
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 30.5 bits (69), Expect = 0.62
Identities = 14/99 (14%), Positives = 58/99 (58%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ + + EE +K +++ +++Q++ + + + +++ +++ E++ ++ EE + +EE ++
Sbjct: 86 RARVEAEERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQQRRQNEELLKMQEESVLRQ 145
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
+ ++ ++E + +++ +++ + + E K E + R
Sbjct: 146 EAMRRATEEEILEMRRETIEEEAELERENIRAKIEAEAR 184
Score = 29.0 bits (65), Expect = 1.9
Identities = 19/118 (16%), Positives = 61/118 (51%)
Query: 1 MKISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK 60
+ A R+ +++K +E ++++++ + + + ++ QK+ E++ + E+
Sbjct: 75 YEAQQAQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQQRRQNEEL 134
Query: 61 EKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
K +EE ++E + +EE + ++E +++ + E++ + K E + + ++ R+
Sbjct: 135 LKMQEESVLRQEAMRRATEEEILEMRRETIEEEAELERENIRAKIEAEARGRAKEERE 192
Score = 27.4 bits (61), Expect = 4.8
Identities = 17/88 (19%), Positives = 56/88 (63%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
K +E+ +Q + E K ++ + ++ + + ++ + + E+++K +E+ ++++++ Q + E +
Sbjct: 58 KMQEKTRQAELEAKIKEYEAQQAQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDELAR 117
Query: 92 QKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
++ ++E +Q++++ E+ K EE +Q
Sbjct: 118 KRYQKELEQQRRQNEELLKMQEESVLRQ 145
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 29.7 bits (67), Expect = 0.62
Identities = 12/65 (18%), Positives = 32/65 (49%)
Query: 41 KKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
KK+ + ++E + +K+ EE+K+ ++ +K E+++ K + + + +
Sbjct: 33 KKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVK 92
Query: 101 KKKKE 105
KK+
Sbjct: 93 ATKKQ 97
Score = 28.9 bits (65), Expect = 1.1
Identities = 16/90 (17%), Positives = 43/90 (47%), Gaps = 2/90 (2%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+K + + +++ + +++ +E+KK ++ KE+ E+++ K + + E +
Sbjct: 33 KKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQ--LFSENATENNT 90
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
K KKQ E +Q + +++ +K
Sbjct: 91 VKATKKQLFSSEYEQTSSSSESTSEEETKK 120
Score = 27.3 bits (61), Expect = 3.9
Identities = 13/95 (13%), Positives = 40/95 (42%)
Query: 29 KKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKE 88
KK + +++ E ++ +E+K+ +K KE+ +Q++ + + + + +
Sbjct: 33 KKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVK 92
Query: 89 EKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYP 123
K++ + ++ + + E + L P
Sbjct: 93 ATKKQLFSSEYEQTSSSSESTSEEETKKTSSILLP 127
Score = 26.6 bits (59), Expect = 7.6
Identities = 18/61 (29%), Positives = 33/61 (54%), Gaps = 1/61 (1%)
Query: 57 EEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
EKK+ + ++E E DK+ +++KK QK KE+ +Q++ K + ++ A E N
Sbjct: 31 YEKKDIEINTDYLQEETELDKELFTPEEQKKI-TFQKHKEKPEQEELKNQLFSENATENN 89
Query: 117 R 117
Sbjct: 90 T 90
>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
This family consists of the N-terminal region of the
prokaryotic fibronectin-binding protein. Fibronectin
binding is considered to be an important virulence
factor in streptococcal infections. Fibronectin is a
dimeric glycoprotein that is present in a soluble form
in plasma and extracellular fluids; it is also present
in a fibrillar form on cell surfaces. Both the soluble
and cellular forms of fibronectin may be incorporated
into the extracellular tissue matrix. While fibronectin
has critical roles in eukaryotic cellular processes,
such as adhesion, migration and differentiation, it is
also a substrate for the attachment of bacteria. The
binding of pathogenic Streptococcus pyogenes and
Staphylococcus aureus to epithelial cells via
fibronectin facilitates their internalisation and
systemic spread within the host.
Length = 447
Score = 30.7 bits (70), Expect = 0.62
Identities = 17/51 (33%), Positives = 31/51 (60%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
KK E ++ KQK+ + +K+ + E +K +KK +K EKE ++ + E + E
Sbjct: 283 KKAERDRVKQKRSDLEKRVENELEKLEKKLEKLEKELEEAENAENYRLYGE 333
>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are typically between 137 to
196 amino acids in length.
Length = 149
Score = 29.6 bits (67), Expect = 0.65
Identities = 11/46 (23%), Positives = 21/46 (45%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE 61
+K+ E K+ +KK + +K K + K++K + K E
Sbjct: 1 MASSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASE 46
Score = 26.9 bits (60), Expect = 4.9
Identities = 11/45 (24%), Positives = 20/45 (44%)
Query: 62 KKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
K+ E ++ +KK K +K K K K++K + K+
Sbjct: 1 MASSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAAS 45
Score = 26.9 bits (60), Expect = 5.6
Identities = 11/36 (30%), Positives = 18/36 (50%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEK 52
KR +KK + +K K + KQKK + K+ +
Sbjct: 13 KRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASESG 48
Score = 26.9 bits (60), Expect = 6.0
Identities = 9/37 (24%), Positives = 17/37 (45%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEK 56
K+ ++ K +K K K K++K + K+ E
Sbjct: 12 PKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASESG 48
Score = 26.5 bits (59), Expect = 8.3
Identities = 11/49 (22%), Positives = 23/49 (46%), Gaps = 3/49 (6%)
Query: 26 ENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+++ E K+ + KK K +K + K+K+K+ K ++ E
Sbjct: 1 MASSRKRLPFEPKRSR---KKGVKALRKAAVAKSKDKQKKPKSKRAASE 46
Score = 26.1 bits (58), Expect = 9.8
Identities = 13/46 (28%), Positives = 23/46 (50%), Gaps = 2/46 (4%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
+K+ E K+ +KK K +K + K+ K+KK + K+ E
Sbjct: 4 SRKRLPFEPKRSRKKGVKALRKAAVAKSKD--KQKKPKSKRAASES 47
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 30.0 bits (68), Expect = 0.66
Identities = 15/52 (28%), Positives = 31/52 (59%)
Query: 36 EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
EK ++ ++ K+ +KE +E +E KK ++ ++K+ E D ++E K + K
Sbjct: 72 EKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQMEMMDDQRELMKMQFK 123
Score = 28.9 bits (65), Expect = 1.7
Identities = 12/38 (31%), Positives = 25/38 (65%)
Query: 33 KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK 70
+K ++ QK +E +++ +E +E + KK KK +EK+ +
Sbjct: 72 EKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQME 109
Score = 26.5 bits (59), Expect = 9.6
Identities = 18/65 (27%), Positives = 31/65 (47%), Gaps = 12/65 (18%)
Query: 39 QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEK 98
Q+K +E ++ KE ++E E + E D KK +K Q+K+ E ++E
Sbjct: 71 QEKMKELQKMMKEFQKEFREAQ------------ESGDMKKLKKLQEKQMEMMDDQRELM 118
Query: 99 KQKKK 103
K + K
Sbjct: 119 KMQFK 123
>gnl|CDD|185768 cd09245, BRO1_UmRIM23-like, Protein-interacting, Bro1-like domain
of Ustilago maydis Rim23 (PalC), and related domains.
This family contains the Bro1-like domain of Ustilago
maydis Rim23 (also known as PalC), and related proteins.
It belongs to the BRO1_Alix_like superfamily which
includes the Bro1-like domains of mammalian Alix
(apoptosis-linked gene-2 interacting protein X),
His-Domain type N23 protein tyrosine phosphatase
(HD-PTP, also known as PTPN23), RhoA-binding proteins
Rhophilin-1 and -2, Brox, Bro1 and Rim20 (also known as
PalA) from Saccharomyces cerevisiae, and related
domains. Alix, HD-PTP, Brox, Bro1, Rim20, and Rim23
interact with the ESCRT (Endosomal Sorting Complexes
Required for Transport) system. Rim20 and Rim23
participate in the response to the external pH via the
Rim101 pathway. Through its Bro1-like domain, Rim23
allows the interaction between the endosomal and plasma
membrane complexes. Bro1-like domains are
boomerang-shape, and part of the domain is a
tetratricopeptide repeat (TPR)-like structure.
Intermediates in the Rim101 pathway may play roles in
the pathogenesis of fungal corneal infection during
Candida albicans keratitis. This family lacks the
V-shaped (V) domain found in many members of the
BRO1_Alix_like superfamily.
Length = 413
Score = 30.5 bits (69), Expect = 0.68
Identities = 12/35 (34%), Positives = 17/35 (48%)
Query: 41 KKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
K K + K+ +EK+E +K EK EEE
Sbjct: 330 KSPSGVASKAKLKKSWKEKREDRKVEKGAGVEEEL 364
Score = 28.9 bits (65), Expect = 2.2
Identities = 16/54 (29%), Positives = 28/54 (51%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
KKE ++ K K + K+ +E+++ +K EK +EE + + KK KK
Sbjct: 323 KKELEDLKSPSGVASKAKLKKSWKEKREDRKVEKGAGVEEELRTLEMLLKKYKK 376
Score = 28.1 bits (63), Expect = 3.3
Identities = 18/54 (33%), Positives = 28/54 (51%)
Query: 33 KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
KKE + K K K KK KE+++++K E+ +EE + + KK KK
Sbjct: 323 KKELEDLKSPSGVASKAKLKKSWKEKREDRKVEKGAGVEEELRTLEMLLKKYKK 376
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 29.8 bits (67), Expect = 0.69
Identities = 21/69 (30%), Positives = 38/69 (55%)
Query: 29 KKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKE 88
KK KK++ +KK+ K+ KK+KKE++E + +E ++ EE DKK + + K +
Sbjct: 48 KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQN 107
Query: 89 EKKQKKKEE 97
+ E+
Sbjct: 108 TAAEVNHED 116
Score = 29.8 bits (67), Expect = 0.81
Identities = 20/73 (27%), Positives = 37/73 (50%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
E +KK KKK+ +KKK KK KKK+K++E+ + ++ ++ EE K E
Sbjct: 44 EIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELP 103
Query: 84 QKKKEEKKQKKKE 96
+ + + ++
Sbjct: 104 KVQNTAAEVNHED 116
Score = 27.1 bits (60), Expect = 5.3
Identities = 17/74 (22%), Positives = 39/74 (52%)
Query: 26 ENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQK 85
E+ + KK KK+K KK+ K+ K++K+EK+E + ++ + E++++ ++K
Sbjct: 41 EDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDY 100
Query: 86 KKEEKKQKKKEEKK 99
+ + + E
Sbjct: 101 ELPKVQNTAAEVNH 114
Score = 26.3 bits (58), Expect = 9.3
Identities = 17/73 (23%), Positives = 39/73 (53%)
Query: 33 KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQ 92
+ KK KK++ +KK+ ++ ++KK++K+E + EE D ++ E+ KK + +
Sbjct: 44 EIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELP 103
Query: 93 KKKEEKKQKKKKE 105
K + + ++
Sbjct: 104 KVQNTAAEVNHED 116
>gnl|CDD|232951 TIGR00389, glyS_dimeric, glycyl-tRNA synthetase, dimeric type.
This model describes a glycyl-tRNA synthetase distinct
from the two alpha and two beta chains of the tetrameric
E. coli glycyl-tRNA synthetase. This enzyme is a
homodimeric class II tRNA synthetase and is recognized
by pfam model tRNA-synt_2b, which recognizes His, Ser,
Pro, and this set of glycyl-tRNA synthetases [Protein
synthesis, tRNA aminoacylation].
Length = 551
Score = 30.6 bits (69), Expect = 0.71
Identities = 22/80 (27%), Positives = 35/80 (43%), Gaps = 2/80 (2%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEK--EEKKEKKKEEKKQKKEE 73
GK + + ++ K E + KKK K +K KK E E +++EE+ K E
Sbjct: 333 GKSLSVFDKLDEPREVTKWEIEPNKKKFGPKFRKDAKKIESNLSEDDLEEREEELDKNEV 392
Query: 74 EKDKKKEEKKQKKKEEKKQK 93
E DK E + + +K
Sbjct: 393 ELDKDLVEIEMVTEVVHGEK 412
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 29.5 bits (67), Expect = 0.71
Identities = 21/86 (24%), Positives = 56/86 (65%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE 89
K +K+ +K+ +K Q + +KKE++ +K+E+K +++ EE K K+++ Q+K++E
Sbjct: 31 PAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQE 90
Query: 90 KKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+QK++ +++ ++K+++ + ++
Sbjct: 91 LQQKQQAAQQELQQKQQELLQPIYDK 116
Score = 28.8 bits (65), Expect = 1.3
Identities = 17/76 (22%), Positives = 52/76 (68%), Gaps = 2/76 (2%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE--KKQK 85
+K+ +K+ +K Q + ++K+++ ++++++ +++ EE ++ K++E +K++E +KQ+
Sbjct: 37 QKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQ 96
Query: 86 KKEEKKQKKKEEKKQK 101
+++ Q+K++E Q
Sbjct: 97 AAQQELQQKQQELLQP 112
Score = 28.0 bits (63), Expect = 2.6
Identities = 20/85 (23%), Positives = 55/85 (64%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
E K +K+ +K+ KK + E +K++K+ +K+++K +++ E+ +K K+++ Q+K
Sbjct: 28 SESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQK 87
Query: 95 KEEKKQKKKKEEKKNKKAEEQNRQQ 119
++E +QK++ +++ ++ +++ Q
Sbjct: 88 QQELQQKQQAAQQELQQKQQELLQP 112
>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
Members of this family of small ribosomal proteins
possess one of three conserved blocks of sequence found
in proteins that stimulate the dissociation of guanine
nucleotides from G-proteins, leaving open the
possibility that MRP-S27 might be a functional partner
of GTP-binding ribosomal proteins.
Length = 417
Score = 30.1 bits (68), Expect = 0.74
Identities = 21/91 (23%), Positives = 45/91 (49%), Gaps = 2/91 (2%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
EE K KQK + Q + ++ + K + K E + E++ E++D + E++
Sbjct: 323 EEVEKSKQKFEPLLAQYGESFQEWRSKLQALAKVESRSLLSVEERLPTIEKEDLELYEQR 382
Query: 84 QKKKEEKKQKKKEEKKQKKKK--EEKKNKKA 112
Q+ + +K + KK+ +++ EE + + A
Sbjct: 383 QQLWFFENRKLWQRKKKLREQADEEYQQRHA 413
Score = 27.4 bits (61), Expect = 6.1
Identities = 14/70 (20%), Positives = 34/70 (48%), Gaps = 4/70 (5%)
Query: 37 KKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKE 96
+ K E + E++ EK++ + E++Q+ +++K ++K+K +E + E
Sbjct: 351 QALAKVESRSLLSVEERLPTIEKEDLELYEQRQQLWFFENRKLWQRKKKLRE----QADE 406
Query: 97 EKKQKKKKEE 106
E +Q+
Sbjct: 407 EYQQRHATRA 416
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 30.5 bits (68), Expect = 0.75
Identities = 19/103 (18%), Positives = 57/103 (55%)
Query: 10 AISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ 69
A++ G R K E + +++ K + K +K++ + + E+ E+ E+++ E+ +
Sbjct: 421 AVNGHYGGRVDKDHAERARIEKENAHRKALEMKILEKKRIERLEREERERLERERMERIE 480
Query: 70 KKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
++ E+++ + E+ ++ + E+ + + E+++ + E + +KA
Sbjct: 481 RERLERERLERERLERDRLERDRLDRLERERVDRLERDRLEKA 523
Score = 29.7 bits (66), Expect = 1.3
Identities = 16/88 (18%), Positives = 57/88 (64%), Gaps = 1/88 (1%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
RI K +K E K +KK+ E+ ++++ E+ ++++ ++ E+ E+ E+++ E+++ + +
Sbjct: 439 RIEKENAHRKALEMKILEKKRIERLEREERERLERERMERIER-ERLERERLERERLERD 497
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQK 101
++ + ++ ++++ ++ ++ + EK ++
Sbjct: 498 RLERDRLDRLERERVDRLERDRLEKARR 525
Score = 27.0 bits (59), Expect = 9.1
Identities = 19/118 (16%), Positives = 65/118 (55%), Gaps = 3/118 (2%)
Query: 6 APKGAISPRIGKRKKKKKEEENKKKQKKKEEKK--QKKKEEKKQKKKEKKEEKEEKKEKK 63
A G R+ K ++ E + +K E K +KK+ E+ ++++ ++ E+ E+ E+
Sbjct: 421 AVNGHYGGRVDKDHAERARIEKENAHRKALEMKILEKKRIERLEREERERLER-ERMERI 479
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
+ E+ +++ E+++ + ++ ++ + ++ ++++ ++ ++ + E+ + + + GL
Sbjct: 480 ERERLERERLERERLERDRLERDRLDRLERERVDRLERDRLEKARRNSYFLKGMENGL 537
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 30.4 bits (69), Expect = 0.77
Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 7/75 (9%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQ-------KKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
N+++ +K E+ +K K +K Q +K +K K K K++KK+ K +
Sbjct: 319 NEERVEKYIERLKKAKTKKTQTRLDSFFTATKKPIKKSNSKAKLKKKKKKAGASAVPKSE 378
Query: 80 EEKKQKKKEEKKQKK 94
++ K +KK KK
Sbjct: 379 TSQEAKSSGKKKVKK 393
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 30.4 bits (69), Expect = 0.79
Identities = 16/91 (17%), Positives = 32/91 (35%), Gaps = 2/91 (2%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
KK ++ K E K KKE + K+ ++ K+ K+ + + +
Sbjct: 978 KKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVEVAELQSASKIISS 1037
Query: 81 E--KKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
E + K +K K + + + K
Sbjct: 1038 ESTELSILKPLQKLKGLLLLENNQLQARYKA 1068
Score = 30.4 bits (69), Expect = 0.89
Identities = 22/96 (22%), Positives = 34/96 (35%), Gaps = 5/96 (5%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K ++ E+ KK E K E K KKE E ++ +E KQ KE +
Sbjct: 967 KETSEEYEDLLKKSTILVREGN-KANSELKNFKKELAELSKQY-GALQESTKQLKELPVE 1024
Query: 77 ---KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
+ K + + K +K K + N
Sbjct: 1025 VAELQSASKIISSESTELSILKPLQKLKGLLLLENN 1060
Score = 27.7 bits (62), Expect = 5.1
Identities = 22/113 (19%), Positives = 40/113 (35%), Gaps = 3/113 (2%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE-EEKD 76
K E K++ E KQ ++ K+ KE E E + K E E
Sbjct: 986 EGNKANSELKNFKKELAELSKQYGALQE--STKQLKELPVEVAELQSASKIISSESTELS 1043
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTL 129
K +K K + + + + + K + + ++Q Q N +T+
Sbjct: 1044 ILKPLQKLKGLLLLENNQLQARYKALKLRRENSLLDDKQLYQLESTENLLKTI 1096
Score = 27.7 bits (62), Expect = 5.3
Identities = 22/123 (17%), Positives = 44/123 (35%), Gaps = 14/123 (11%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQ--------KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ 69
+ K E NK + + + K+ KK ++ + E + K++ E KQ
Sbjct: 948 IEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQ 1007
Query: 70 KKEEEKDKKKEEKKQKKKEEKKQ------KKKEEKKQKKKKEEKKNKKAEEQNRQQGLYP 123
++ K+ ++ + E + + E K ++ K E N+ Q Y
Sbjct: 1008 YGALQESTKQLKELPVEVAELQSASKIISSESTELSILKPLQKLKGLLLLENNQLQARYK 1067
Query: 124 NQK 126
K
Sbjct: 1068 ALK 1070
>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
Length = 448
Score = 30.1 bits (68), Expect = 0.80
Identities = 13/50 (26%), Positives = 24/50 (48%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
+KE K K+ K + KE E + ++ K+E +K+ E ++ K
Sbjct: 51 SLRKEFNKLNKEVAKLKIAKEDATELIAETKELKKEITEKEAEVQEAKAA 100
Score = 29.3 bits (66), Expect = 1.4
Identities = 14/43 (32%), Positives = 22/43 (51%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
K K+ K + KE+ + E KE K+E EK+ E ++ K
Sbjct: 58 KLNKEVAKLKIAKEDATELIAETKELKKEITEKEAEVQEAKAA 100
Score = 27.7 bits (62), Expect = 4.7
Identities = 18/68 (26%), Positives = 33/68 (48%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
+E +Q++ E +K+ K KE K K +E + E + K+E +K+ E ++ K
Sbjct: 40 KEWRQRQFELDSLRKEFNKLNKEVAKLKIAKEDATELIAETKELKKEITEKEAEVQEAKA 99
Query: 95 KEEKKQKK 102
+ K K
Sbjct: 100 ALDAKLKT 107
Score = 27.4 bits (61), Expect = 5.7
Identities = 10/47 (21%), Positives = 23/47 (48%)
Query: 10 AISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEK 56
++ + K K K++ + K+ +K+ +KE + Q+ K + K
Sbjct: 58 KLNKEVAKLKIAKEDATELIAETKELKKEITEKEAEVQEAKAALDAK 104
>gnl|CDD|212478 cd01731, archaeal_Sm1, archaeal Sm protein 1. The archaeal Sm1
proteins: The Sm proteins are conserved in all three
domains of life and are always associated with U-rich
RNA sequences. They function to mediate RNA-RNA
interactions and RNA biogenesis. All Sm proteins contain
a common sequence motif in two segments, Sm1 and Sm2,
separated by a short variable linker. Eukaryotic Sm
proteins form part of specific small nuclear
ribonucleoproteins (snRNPs) that are involved in the
processing of pre-mRNAs to mature mRNAs, and are a major
component of the eukaryotic spliceosome. Most snRNPs
consist of seven Sm proteins (B/B', D1, D2, D3, E, F and
G) arranged in a ring on a uridine-rich sequence (Sm
site), plus a small nuclear RNA (snRNA) (either U1, U2,
U5 or U4/6). Since archaebacteria do not have any
splicing apparatus, their Sm proteins may play a more
general role. Archaeal LSm proteins are likely to
represent the ancestral Sm domain.
Length = 69
Score = 27.9 bits (63), Expect = 0.82
Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 3/48 (6%)
Query: 127 RTLIGYLRSVDQFANLVLHKTIERIHVGNQYGDIPRGIFIIRGENVVL 174
+ + G L+ DQ NLVL E + G ++RG+NVV
Sbjct: 22 KEVRGVLKGFDQHLNLVLENAEE---IIEGESVRKLGTVLVRGDNVVF 66
>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
Length = 250
Score = 29.8 bits (67), Expect = 0.83
Identities = 14/91 (15%), Positives = 45/91 (49%)
Query: 38 KQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEE 97
K++++ + + ++ ++E +E ++ +KQ+ E++ + Q+ +E++Q E
Sbjct: 35 KKRQQRIANRWQDAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEAADEQRQHLLNE 94
Query: 98 KKQKKKKEEKKNKKAEEQNRQQGLYPNQKRT 128
++ + + E+ +Q+ Q++T
Sbjct: 95 AREDVATARDEWLEQLEREKQEFFKALQQQT 125
>gnl|CDD|217011 pfam02388, FemAB, FemAB family. The femAB operon codes for two
nearly identical approximately 50-kDa proteins involved
in the formation of the Staphylococcal pentaglycine
interpeptide bridge in peptidoglycan. These proteins are
also considered as a factor influencing the level of
methicillin resistance.
Length = 408
Score = 29.9 bits (68), Expect = 0.84
Identities = 19/69 (27%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 37 KKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKE 96
K+ + ++K +K EK K E+K +K K+K K E ++Q EK+ + +
Sbjct: 241 KEYLDELQEKLEKLEKDLAKLEEKLEKNPNSKKKN-----KLAELEQQLASLEKRIDEAK 295
Query: 97 EKKQKKKKE 105
E K E
Sbjct: 296 ELIAKYGNE 304
Score = 29.9 bits (68), Expect = 0.98
Identities = 18/58 (31%), Positives = 31/58 (53%)
Query: 63 KKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
+ ++K +K E+ K EEK +K KK+ K E +Q+ EK+ +A+E + G
Sbjct: 245 DELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELIAKYG 302
Score = 28.4 bits (64), Expect = 3.2
Identities = 16/56 (28%), Positives = 30/56 (53%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
++K E+ +K K EEK +K KK+ K + E++ EK+ +E K+ + +
Sbjct: 248 QEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELIAKYGN 303
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 30.0 bits (68), Expect = 0.85
Identities = 13/59 (22%), Positives = 40/59 (67%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
EEE + ++++ + +Q++ E + ++ ++ E+ ++EK++ +K+ K+ ++++K+ EK
Sbjct: 151 EEELAELRQQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQREKETAEK 209
Score = 29.6 bits (67), Expect = 0.92
Identities = 16/60 (26%), Positives = 46/60 (76%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
E++E E ++++++ E+++ E + ++ EE +++++EEK+++KK++K++K++++E K
Sbjct: 150 EEEELAELRQQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQREKETAEK 209
Score = 29.3 bits (66), Expect = 1.3
Identities = 14/55 (25%), Positives = 40/55 (72%), Gaps = 2/55 (3%)
Query: 50 KEKKEEKEEKKEKKKEEKK--QKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
E ++++ + ++++ E Q+ EE + +++EEK+++KK++K++K++E++ +K
Sbjct: 155 AELRQQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQREKETAEK 209
Score = 29.3 bits (66), Expect = 1.4
Identities = 14/51 (27%), Positives = 37/51 (72%)
Query: 45 KKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKK 95
++Q++ E++ E + ++ EE ++++ EEK+++K++ K++K+ EK+ +K
Sbjct: 159 QQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQREKETAEK 209
Score = 28.5 bits (64), Expect = 2.5
Identities = 14/51 (27%), Positives = 34/51 (66%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKK 64
R +R+ +++ + ++ EE +++++EEK+++KK+ KE K+ +KE +
Sbjct: 158 RQQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQREKETAE 208
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 30.1 bits (68), Expect = 0.87
Identities = 22/118 (18%), Positives = 44/118 (37%), Gaps = 9/118 (7%)
Query: 11 ISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEK--KQKKKEKKEEKEEK-------KE 61
I I ++KKK E + + K+ EK + KKEE+ ++ E
Sbjct: 843 IKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELYILAKKEEEFKQFAEEEGLNE 902
Query: 62 KKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
++ + + +K K++ +E+ K K +E+ K + R+
Sbjct: 903 EELAFYDDLALNGGKLPENGTELVEKLAKEKSLREKNKDDWKAKEEVEAKLKAFFRRF 960
Score = 28.9 bits (65), Expect = 2.2
Identities = 15/90 (16%), Positives = 38/90 (42%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
+ + +E ++ +E + ++KK+ E E + + +K + + ++
Sbjct: 826 NDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELYILA 885
Query: 85 KKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
KK+EE KQ +EE +++ +
Sbjct: 886 KKEEEFKQFAEEEGLNEEELAFYDDLALNG 915
Score = 27.8 bits (62), Expect = 6.0
Identities = 19/82 (23%), Positives = 40/82 (48%), Gaps = 1/82 (1%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQK-KEEEKDKKKEEKKQKKKEEK 90
+ +K + K+ ++ KE E+ ++KKK ++ + KK + +K +E
Sbjct: 823 QTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELY 882
Query: 91 KQKKKEEKKQKKKKEEKKNKKA 112
KKEE+ ++ +EE N++
Sbjct: 883 ILAKKEEEFKQFAEEEGLNEEE 904
Score = 27.0 bits (60), Expect = 8.5
Identities = 22/101 (21%), Positives = 41/101 (40%), Gaps = 2/101 (1%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKK--EKKEEKEEKKEKKKEEKKQKKEEEK 75
+ K + ++ E ++ E+ +QKKK E+ E + KK + +K EE
Sbjct: 823 QTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELY 882
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
K+E++ K+ E++ +EE K E
Sbjct: 883 ILAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENGT 923
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 30.0 bits (68), Expect = 0.90
Identities = 20/94 (21%), Positives = 55/94 (58%), Gaps = 1/94 (1%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE-EEK 75
+ +++ + E+ + + + E+ ++++E+ ++ E++E EEK+E+ +E +++ E E +
Sbjct: 493 EVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAE 552
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
++K E + +EE ++ ++E + K E K
Sbjct: 553 AEEKREAAAEAEEEAEEAREEVAELNSKLAELKE 586
Score = 28.1 bits (63), Expect = 4.1
Identities = 20/96 (20%), Positives = 57/96 (59%), Gaps = 1/96 (1%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
+++ EE ++ ++ ++ + + + E+ ++++E EE ++ + EEK+++ EE +++
Sbjct: 488 EEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAA 547
Query: 80 E-EKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
E E + ++K E + +EE ++ +++ + N K E
Sbjct: 548 ELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAE 583
Score = 27.7 bits (62), Expect = 5.2
Identities = 18/70 (25%), Positives = 43/70 (61%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
EK++ E +E+ E+ +K+E K++ + E + + EE ++ K+ ++ ++ E+K +
Sbjct: 613 EKREALAELNDERRERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEEKLDEL 672
Query: 104 KEEKKNKKAE 113
+EE+ + +AE
Sbjct: 673 REERDDLQAE 682
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 29.6 bits (67), Expect = 0.93
Identities = 20/55 (36%), Positives = 32/55 (58%)
Query: 45 KKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKK 99
KK ++ +KKE ++ E +++KK EE+ KKK++K KKK+ KK K
Sbjct: 151 KKVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
Score = 27.7 bits (62), Expect = 4.0
Identities = 17/56 (30%), Positives = 32/56 (57%)
Query: 48 KKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
KK ++ +K+E K+ E + KK+ +++++KK+KK +KK+ KK K
Sbjct: 150 LKKVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
Score = 27.3 bits (61), Expect = 4.8
Identities = 15/54 (27%), Positives = 30/54 (55%)
Query: 56 KEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
K E+ +KK+ + K+ + KK +++KK++KK+ K++K +K K
Sbjct: 152 KVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
Score = 27.3 bits (61), Expect = 6.0
Identities = 18/47 (38%), Positives = 30/47 (63%)
Query: 62 KKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
KK E K+K+ + K+ + K+K +++KKK++KK KKK+ KK
Sbjct: 151 KKVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKK 197
Score = 26.9 bits (60), Expect = 7.2
Identities = 16/52 (30%), Positives = 30/52 (57%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKK 68
+R KKK+ + + + + ++K +EEKK+KKK+ ++K+ KK K
Sbjct: 154 ERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 472
Score = 30.1 bits (68), Expect = 0.93
Identities = 16/53 (30%), Positives = 30/53 (56%)
Query: 58 EKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
++ E+K + K+K++++ K KEEK++ + EK+ K+ EE K K
Sbjct: 337 PNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELMEELKEK 389
Score = 30.1 bits (68), Expect = 0.97
Identities = 13/50 (26%), Positives = 28/50 (56%)
Query: 45 KKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
+ +E E+K ++++K++E+ K KEEK++ + EK+ K+
Sbjct: 332 TRFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKE 381
Score = 29.3 bits (66), Expect = 1.5
Identities = 19/58 (32%), Positives = 33/58 (56%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
+ ++ EEK ++K+KKKE+ + KEEK+E + EK+ K+ E+ K+K
Sbjct: 332 TRFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELMEELKEK 389
Score = 28.2 bits (63), Expect = 3.4
Identities = 13/53 (24%), Positives = 29/53 (54%)
Query: 62 KKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+ Q+ + E+ +QK+K++++ K KEEK++ + E++ + EE
Sbjct: 333 RFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELMEE 385
Score = 28.2 bits (63), Expect = 3.7
Identities = 11/50 (22%), Positives = 28/50 (56%)
Query: 38 KQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
+ ++ E+K + ++K+K++++ K +EEK + E +K+ K+
Sbjct: 332 TRFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKE 381
Score = 27.4 bits (61), Expect = 6.2
Identities = 14/52 (26%), Positives = 32/52 (61%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
+E + +++ +QK+K++++ K KE+K+E E +++ KE ++ KE
Sbjct: 337 PNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELMEELKE 388
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 30.2 bits (69), Expect = 0.94
Identities = 10/35 (28%), Positives = 19/35 (54%)
Query: 13 PRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQ 47
PRI ++ +E + KE ++++KK EK +
Sbjct: 508 PRIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 28.6 bits (65), Expect = 2.4
Identities = 9/34 (26%), Positives = 20/34 (58%)
Query: 59 KKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQ 92
+ + ++E K++ E KE ++++KK EK +
Sbjct: 509 RIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 28.2 bits (64), Expect = 4.0
Identities = 11/31 (35%), Positives = 17/31 (54%)
Query: 54 EEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
E+E K++ E KE E+ +KK EK +
Sbjct: 512 VEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 28.2 bits (64), Expect = 4.0
Identities = 7/32 (21%), Positives = 19/32 (59%)
Query: 52 KKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
+ + +EE K++ + +E ++K+K+ +K
Sbjct: 509 RIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEK 540
Score = 27.5 bits (62), Expect = 6.1
Identities = 7/30 (23%), Positives = 18/30 (60%)
Query: 40 KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ 69
+++ +++ E KE ++++KK EK +
Sbjct: 513 EEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 27.5 bits (62), Expect = 6.3
Identities = 9/34 (26%), Positives = 19/34 (55%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
+ E++ +E+ + K+ E++EKK +K E
Sbjct: 509 RIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 27.1 bits (61), Expect = 8.1
Identities = 7/32 (21%), Positives = 18/32 (56%)
Query: 58 EKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE 89
+ +E+ K+Q + + +E++K+ +K E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 27.1 bits (61), Expect = 8.4
Identities = 7/32 (21%), Positives = 19/32 (59%)
Query: 57 EEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKE 88
+ ++E +++ + K+ +++EKK +K E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 27.1 bits (61), Expect = 9.4
Identities = 9/30 (30%), Positives = 18/30 (60%)
Query: 71 KEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
+EE K++ + KE ++++KK EK +
Sbjct: 513 EEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
pleckstrin homology (PH) domain. The BCR gene is one of
the two genes in the BCR-ABL complex, which is
associated with the Philadelphia chromosome, a product
of a reciprocal translocation between chromosomes 22 and
9. BCR is a GTPase-activating protein (GAP) for RAC1
(primarily) and CDC42. The Dbl region of BCR has the
most RhoGEF activity for Cdc42, and less activity
towards Rac and Rho. Since BCR possesses both GAP and
GEF activities, it may function to temporally regulate
the activity of these GTPases. It also displays
serine/threonine kinase activity. The BCR protein
contains multiple domains including an N-terminal kinase
domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
domain, and a C-terminal RhoGAP domain. ABR, a related
smaller protein, is structurally similar to BCR, but
lacks the N-terminal kinase domain and has GAP activity
for both Rac and Cdc42. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 179
Score = 29.2 bits (66), Expect = 0.96
Identities = 12/47 (25%), Positives = 21/47 (44%)
Query: 59 KKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKE 105
K ++ + K+ K K K + ++KK + EK +KK E
Sbjct: 86 KSSEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKKLAE 132
Score = 28.8 bits (65), Expect = 1.3
Identities = 14/49 (28%), Positives = 21/49 (42%), Gaps = 1/49 (2%)
Query: 67 KKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
K ++ + K KQK K + ++KK + EK KK EQ
Sbjct: 86 KSSEELVQMKKAISALKQKIKRDSAERKKAASSGSRA-IEKLRKKLAEQ 133
Score = 27.3 bits (61), Expect = 4.2
Identities = 12/44 (27%), Positives = 21/44 (47%)
Query: 54 EEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEE 97
EE + K+ K++ K + ++KK + EK +KK E
Sbjct: 89 EELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKKLAE 132
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 29.7 bits (67), Expect = 0.98
Identities = 18/86 (20%), Positives = 38/86 (44%)
Query: 26 ENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQK 85
E +++KKEE+++++++E K E EE + + +E + EE+ D K
Sbjct: 174 EALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGS 233
Query: 86 KKEEKKQKKKEEKKQKKKKEEKKNKK 111
K ++ K+ + +K
Sbjct: 234 SSPAKPTSILKKSAAKRSEAPSSSKA 259
Score = 27.0 bits (60), Expect = 7.7
Identities = 20/105 (19%), Positives = 40/105 (38%), Gaps = 6/105 (5%)
Query: 14 RIGKRKKKKKEEENKKKQKK------KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEK 67
R K++++++EEE++ K EE +++ +E + +E + K
Sbjct: 178 RREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGSSSPA 237
Query: 68 KQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
K +K K + + KK + K + KKA
Sbjct: 238 KPTSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKKA 282
>gnl|CDD|212486 cd06168, LSMD1, LSM domain containing 1. The eukaryotic Sm and
Sm-like (LSm) proteins associate with RNA to form the
core domain of the ribonucleoprotein particles involved
in a variety of RNA processing events including pre-mRNA
splicing, telomere replication, and mRNA degradation.
Members of this family share a highly conserved Sm fold
containing an N-terminal helix followed by a strongly
bent five-stranded antiparallel beta-sheet. LSMD1
proteins have a single Sm-like domain structure. Sm-like
proteins exist in archaea as well as prokaryotes,
forming heptameric and hexameric ring structures similar
to those found in eukaryotes.
Length = 73
Score = 27.9 bits (63), Expect = 0.99
Identities = 7/25 (28%), Positives = 11/25 (44%)
Query: 126 KRTLIGYLRSVDQFANLVLHKTIER 150
R L+G D+ N++L E
Sbjct: 19 GRVLVGTFVCTDKDGNIILSNAEEY 43
>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
[Posttranslational modification, protein turnover,
chaperones].
Length = 782
Score = 29.8 bits (68), Expect = 1.0
Identities = 23/74 (31%), Positives = 43/74 (58%), Gaps = 6/74 (8%)
Query: 24 EEENKKKQKKKEEKKQKK---KEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
E+ +KK K++ EK Q++ +E+ K +KE E+ ++K E ++ +K +K KE
Sbjct: 211 EKRIRKKVKEQMEKSQREYYLREQLKAIQKELGEDDDDKDEVEELREKIEK---LKLPKE 267
Query: 81 EKKQKKKEEKKQKK 94
K++ +KE KK +
Sbjct: 268 AKEKAEKELKKLET 281
Score = 27.5 bits (62), Expect = 5.9
Identities = 23/83 (27%), Positives = 48/83 (57%), Gaps = 4/83 (4%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE---EKDKKKEEKKQKKKEEK 90
++E + E++ +KK +++ EK +++ +E+ K ++E + D K E ++ ++K EK
Sbjct: 202 EKEIDLLQLEKRIRKKVKEQMEKSQREYYLREQLKAIQKELGEDDDDKDEVEELREKIEK 261
Query: 91 KQKKKEEKKQKKKKEEKKNKKAE 113
+ KE K+ K +KE KK +
Sbjct: 262 LKLPKEAKE-KAEKELKKLETMS 283
>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
Toc86/159, G and M domains. The long precursor of the
86K protein originally described is proposed to have
three domains. The N-terminal A-domain is acidic,
repetitive, weakly conserved, readily removed by
proteolysis during chloroplast isolation, and not
required for protein translocation. The other domains
are designated G (GTPase) and M (membrane anchor); this
family includes most of the G domain and all of M
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 763
Score = 29.9 bits (67), Expect = 1.0
Identities = 24/108 (22%), Positives = 54/108 (50%), Gaps = 1/108 (0%)
Query: 13 PRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
P++ +++ +E+ + + + + ++ ++E Q K K + + KE++K E
Sbjct: 375 PKLPEQQGGDEEDSDIELEDSSDSDEESGEDEYDQLPPFKPLTKAQMAKLSKEQRKAYLE 434
Query: 73 EEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
E + K +K++ +EE K + K KK K+ E + +EE + + G
Sbjct: 435 EYDYRVKLLQKKQWREELK-RMKMMKKFGKEIGELPDGYSEEVDEENG 481
>gnl|CDD|180892 PRK07220, PRK07220, DNA topoisomerase I; Validated.
Length = 740
Score = 30.1 bits (68), Expect = 1.0
Identities = 19/76 (25%), Positives = 35/76 (46%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
Q E ++K+KEEK Q+ K++K + + + ++ +K EE E + E
Sbjct: 662 QCNFIEWQKKQKEEKAQQPKKEKPKSIKDIKGIGKKTAEKLEEAGITTVEALAEADPVEL 721
Query: 91 KQKKKEEKKQKKKKEE 106
+ K K+ K +E
Sbjct: 722 SKTTKISVKKIKSWQE 737
Score = 28.6 bits (64), Expect = 2.8
Identities = 20/77 (25%), Positives = 36/77 (46%), Gaps = 1/77 (1%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
+ E +KKQK+++ ++ KK++ K K + +K +K ++ + E D E
Sbjct: 662 QCNFIEWQKKQKEEKAQQPKKEKPKSIKDIKGIGKKTAEKLEEAGITTVEALAEAD-PVE 720
Query: 81 EKKQKKKEEKKQKKKEE 97
K K KK K +E
Sbjct: 721 LSKTTKISVKKIKSWQE 737
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 29.7 bits (67), Expect = 1.0
Identities = 17/70 (24%), Positives = 28/70 (40%), Gaps = 1/70 (1%)
Query: 51 EKKEEKEEKKEKKKE-EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
E E E + E E K+ K + K + K K K + +K +E+ K++ K E +
Sbjct: 69 EPVVEPEPEPEPIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRP 128
Query: 110 KKAEEQNRQQ 119
E
Sbjct: 129 ASPFENTAPA 138
Score = 27.7 bits (62), Expect = 4.3
Identities = 9/42 (21%), Positives = 20/42 (47%)
Query: 36 EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
K + K + K K K K +K E++ K++ + + + +
Sbjct: 92 IPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFE 133
Score = 27.3 bits (61), Expect = 4.9
Identities = 13/42 (30%), Positives = 20/42 (47%)
Query: 33 KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
K E K K K + K K +K EE+ +++ K E + E
Sbjct: 93 PKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFEN 134
Score = 26.6 bits (59), Expect = 9.5
Identities = 14/48 (29%), Positives = 22/48 (45%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
KE K E K K K K + K KK +++ +++ K E + E
Sbjct: 86 KEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFE 133
>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 758
Score = 29.9 bits (67), Expect = 1.0
Identities = 25/92 (27%), Positives = 46/92 (50%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+GK+ E+ KK+ ++ ++K K E + K ++EEKEEK + K+ E
Sbjct: 109 LGKKTNDDLEKIESKKESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNE 168
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
+ K+ + K+K+ K++ KE + K E
Sbjct: 169 ERFKEYLENFKRKKFKRKILKEFENAKINASE 200
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 29.8 bits (67), Expect = 1.1
Identities = 18/88 (20%), Positives = 29/88 (32%)
Query: 26 ENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQK 85
+ Q + E K K E K+ + K +KK + K E+ K
Sbjct: 344 KASVNQVFEFEIYLYKIINANPDKLENKKIEISIKPEKKTINNSQIITTKKINIVEESNK 403
Query: 86 KKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
K + KKK + NK+ +
Sbjct: 404 NSVHFDTLYKTKIFYHKKKINQNNKEQD 431
Score = 29.8 bits (67), Expect = 1.3
Identities = 18/94 (19%), Positives = 31/94 (32%), Gaps = 3/94 (3%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K + E K K + +K + + +++ + +K EE
Sbjct: 344 KASVNQVFEFEIYLYKIINANPDKLENKKIEISIKPEKKTINNSQIITTKKINIVEESNK 403
Query: 77 KK---KEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
K K KK+ + K+Q KKEE
Sbjct: 404 NSVHFDTLYKTKIFYHKKKINQNNKEQDIKKEEL 437
Score = 29.4 bits (66), Expect = 1.6
Identities = 29/101 (28%), Positives = 45/101 (44%), Gaps = 6/101 (5%)
Query: 12 SPRIGKRKKKKKEEENKK----KQKKKEEKKQKKKEEKK--QKKKEKKEEKEEKKEKKKE 65
S I +K EE NK K + KK+ + +++ KKEE EK+ KK
Sbjct: 387 SQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNNKEQDIKKEELLEKEFVKKS 446
Query: 66 EKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
EK K +E D + K++ ++ + K +K K K E
Sbjct: 447 EKIPKNDELLDNLELAKQKFFNKDIELSKNMLQKFNKFKNE 487
Score = 27.9 bits (62), Expect = 4.8
Identities = 23/103 (22%), Positives = 39/103 (37%), Gaps = 4/103 (3%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
+I K E + + K E+K + KK EE K +
Sbjct: 359 KIINANPDKLENKKIEISIKPEKKTINNSQIITTKKINIVEE----SNKNSVHFDTLYKT 414
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
+ K++ Q KE+ +K++ +K+ KK EK K E +
Sbjct: 415 KIFYHKKKINQNNKEQDIKKEELLEKEFVKKSEKIPKNDELLD 457
Score = 26.7 bits (59), Expect = 9.8
Identities = 22/83 (26%), Positives = 40/83 (48%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K K +KKK + +++ KKEE +K+ KK EK K ++ + + K++ +
Sbjct: 409 DTLYKTKIFYHKKKINQNNKEQDIKKEELLEKEFVKKSEKIPKNDELLDNLELAKQKFFN 468
Query: 77 KKKEEKKQKKKEEKKQKKKEEKK 99
K E K ++ K K +E +
Sbjct: 469 KDIELSKNMLQKFNKFKNEESAE 491
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 29.7 bits (67), Expect = 1.1
Identities = 25/80 (31%), Positives = 47/80 (58%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE 89
K + E+ +K + +QK K++K++ K++ ++ +K+ + ++KD K + QK KEE
Sbjct: 161 KTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEE 220
Query: 90 KKQKKKEEKKQKKKKEEKKN 109
+ KKE K K+K + KN
Sbjct: 221 FDKLKKEGKADKQKIKSAKN 240
>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1. This
family consists of several acidic phosphoprotein
precursor PCEMA1 sequences which appear to be found
exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
that is associated with the membrane of the infected
erythrocyte throughout the entire intraerythrocytic
cycle. The exact function of this family is unclear.
Length = 286
Score = 29.5 bits (66), Expect = 1.1
Identities = 14/75 (18%), Positives = 41/75 (54%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
+ EN++ + E + ++ + + K++ + E +E K ++ + E +D+ ++E +
Sbjct: 212 DGENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDESEDEYE 271
Query: 84 QKKKEEKKQKKKEEK 98
++ EE++ + K E+
Sbjct: 272 EEAGEEQENEDKGEE 286
Score = 27.6 bits (61), Expect = 5.2
Identities = 16/74 (21%), Positives = 38/74 (51%)
Query: 33 KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQ 92
+ E+ + E + + E E KEK + E ++ K ++ + + E + + ++E ++
Sbjct: 213 GENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDESEDEYEE 272
Query: 93 KKKEEKKQKKKKEE 106
+ EE++ + K EE
Sbjct: 273 EAGEEQENEDKGEE 286
Score = 26.8 bits (59), Expect = 9.1
Identities = 15/70 (21%), Positives = 35/70 (50%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ + E K E + KE+ +++ +E K +K E + E++ + + EE+ +
Sbjct: 217 ENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDESEDEYEEEAGE 276
Query: 79 KEEKKQKKKE 88
++E + K +E
Sbjct: 277 EQENEDKGEE 286
>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
Validated.
Length = 725
Score = 29.8 bits (67), Expect = 1.1
Identities = 15/72 (20%), Positives = 32/72 (44%)
Query: 48 KKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
+ ++E++ E K KK EE + K+KK E + + + + +E + K K +
Sbjct: 355 LSELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKN 414
Query: 108 KNKKAEEQNRQQ 119
+ + +N
Sbjct: 415 SSFINKTENILT 426
Score = 29.0 bits (65), Expect = 1.9
Identities = 23/109 (21%), Positives = 40/109 (36%), Gaps = 2/109 (1%)
Query: 1 MKISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK 60
+KIS P + I K + EE + + K KK ++ KEKK E E
Sbjct: 335 LKISDNPNDTLEILIIKLLALSELEEEDENEIKF--KKIEENSIDNLDIKEKKIENENDI 392
Query: 61 EKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
E K + K ++ E K + + + K + ++ +
Sbjct: 393 EGKSDTKNLEEGFETKDNKNKNSSFINKTENILTNSPLKDELLEKTTEI 441
>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
Length = 205
Score = 29.2 bits (66), Expect = 1.1
Identities = 20/38 (52%), Positives = 25/38 (65%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEK 52
+GK+KK KKEEENKK +KK E K + KK K +K
Sbjct: 121 LGKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDK 158
>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
outer membrane].
Length = 170
Score = 29.3 bits (66), Expect = 1.1
Identities = 20/93 (21%), Positives = 47/93 (50%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
E + KK + + +K++++ +K +K+ + K+ K +++ + + K + E K++K
Sbjct: 38 ESPQAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEK 97
Query: 95 KEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKR 127
KK++E +K+ E +Q L +R
Sbjct: 98 LVNAFNKKQQEYEKDLNRREAEEEQKLLEKIQR 130
Score = 27.4 bits (61), Expect = 4.5
Identities = 17/82 (20%), Positives = 43/82 (52%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
KK E + KK++++ QK +++ + KE K + + + + + + +++K
Sbjct: 41 QAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLVN 100
Query: 87 KEEKKQKKKEEKKQKKKKEEKK 108
KKQ++ E+ +++ EE++
Sbjct: 101 AFNKKQQEYEKDLNRREAEEEQ 122
Score = 27.0 bits (60), Expect = 5.3
Identities = 24/102 (23%), Positives = 51/102 (50%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
+ K+ + + K+ +K+ +K +K+ K KE K + + K E + K + E +K+K
Sbjct: 40 PQAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLV 99
Query: 80 EEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
+K++E +K + E ++++K EK + E + G
Sbjct: 100 NAFNKKQQEYEKDLNRREAEEEQKLLEKIQRAIESVAEKGGY 141
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 29.9 bits (67), Expect = 1.1
Identities = 16/90 (17%), Positives = 43/90 (47%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
+ K E K + ++ E+ ++ ++E+ +++ K+ K+ KK K K+ K + +
Sbjct: 299 EREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDS 358
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKE 105
++ ++ +++K+ KK+E
Sbjct: 359 GDDSDDSDIDGEDSVSLVTAKKQKEPKKEE 388
Score = 29.2 bits (65), Expect = 1.7
Identities = 30/115 (26%), Positives = 53/115 (46%), Gaps = 17/115 (14%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEK-----------------KEEKEEK 59
+KKKKK +NKKK ++ K+ ++ + + + EE+
Sbjct: 241 SKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEER 300
Query: 60 KEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
++K E K E E+D+ EE +++K EE+ K+ KK KK K +K ++
Sbjct: 301 EDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDD 355
Score = 27.2 bits (60), Expect = 8.0
Identities = 16/84 (19%), Positives = 37/84 (44%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
EE++ K E K + +++E E+ E++K E++ ++ K K+ K +K +K
Sbjct: 298 EEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSD 357
Query: 95 KEEKKQKKKKEEKKNKKAEEQNRQ 118
+ + + + +Q
Sbjct: 358 SGDDSDDSDIDGEDSVSLVTAKKQ 381
Score = 27.2 bits (60), Expect = 8.3
Identities = 21/98 (21%), Positives = 51/98 (52%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
+EE KK+KKK K +KK ++ K+ K+ ++ +E +++ ++++ D
Sbjct: 237 DEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGND 296
Query: 84 QKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
+++E+K + K + ++ E+ + + E+ + GL
Sbjct: 297 PEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGL 334
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex
together with SecB and SecD on the surface of
Salmonella. All these proteins are secreted by the type
III secretion system. Many mucosal pathogens use type
III secretion systems for the injection of effector
proteins into target cells. SecB, SseC and SecD are
inserted into the target cell membrane. where they form
a small pore or translocon. In addition to SseC, this
family includes the bacterial secreted proteins PopB,
PepB, YopB and EspD which are thought to be directly
involved in pore formation, and type III secretion
system translocon.
Length = 303
Score = 29.7 bits (67), Expect = 1.1
Identities = 13/38 (34%), Positives = 25/38 (65%)
Query: 26 ENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK 63
++K +Q ++ KQ+KK E+ Q++ +K EK E+ +K
Sbjct: 18 KSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKKS 55
Score = 26.7 bits (59), Expect = 10.0
Identities = 13/43 (30%), Positives = 28/43 (65%)
Query: 71 KEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
K EK K + ++ ++ +K++KK EE +++ KK +K ++A+
Sbjct: 11 KLAEKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAK 53
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 28.8 bits (65), Expect = 1.1
Identities = 15/61 (24%), Positives = 40/61 (65%)
Query: 48 KKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
K ++K + +K+KEE+++++ E ++ +EE+ + E++ K K EK+++ ++++K
Sbjct: 85 KWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQK 144
Query: 108 K 108
+
Sbjct: 145 E 145
Score = 28.4 bits (64), Expect = 2.0
Identities = 20/64 (31%), Positives = 44/64 (68%)
Query: 40 KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKK 99
K ++ KKEK+EE+EE+ E ++ +++++ +E +K+ + K++K+ E ++K+KE K
Sbjct: 89 KVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILK 148
Query: 100 QKKK 103
++ K
Sbjct: 149 EQMK 152
Score = 27.6 bits (62), Expect = 3.3
Identities = 18/67 (26%), Positives = 44/67 (65%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
RKK +K KK+K++EE+++ + EE ++++ + ++E + K+E++++ + ++K+
Sbjct: 87 RKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEI 146
Query: 78 KKEEKKQ 84
KE+ K
Sbjct: 147 LKEQMKM 153
Score = 26.5 bits (59), Expect = 9.0
Identities = 20/74 (27%), Positives = 47/74 (63%)
Query: 37 KKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKE 96
KK KK K +KK K ++K+++++EE++ + EE ++++ ++ +K+ K +++K
Sbjct: 77 KKDFKKLLKWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKR 136
Query: 97 EKKQKKKKEEKKNK 110
+ ++K+KE K +
Sbjct: 137 RENERKQKEILKEQ 150
Score = 26.1 bits (58), Expect = 9.7
Identities = 17/68 (25%), Positives = 45/68 (66%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
K ++K ++ KKE+++++++E + E+ +++E+ E +++ + K +K+ E ++K+K
Sbjct: 85 KWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQK 144
Query: 88 EEKKQKKK 95
E K++ K
Sbjct: 145 EILKEQMK 152
>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
insulinase-like [General function prediction only].
Length = 978
Score = 30.0 bits (68), Expect = 1.2
Identities = 14/76 (18%), Positives = 37/76 (48%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
EEK +K++ E QK+ + +++ +K K +K ++++++ D +++ +
Sbjct: 462 EEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEEDLATLPTLKLGDVPD 521
Query: 95 KEEKKQKKKKEEKKNK 110
EK + + + K
Sbjct: 522 PIEKTSLETEVSNEAK 537
Score = 28.1 bits (63), Expect = 4.3
Identities = 8/49 (16%), Positives = 30/49 (61%)
Query: 63 KKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
E +++ ++EE++ ++ + E+ ++ K+ KK K+++++ +++
Sbjct: 458 SPELEEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEE 506
Score = 27.7 bits (62), Expect = 5.7
Identities = 9/44 (20%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKE-EKKQKKKEKKEEKEEKKE 61
+K +KEE +++ E + ++ K KK ++++++ + +E
Sbjct: 463 EKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEE 506
Score = 27.3 bits (61), Expect = 7.7
Identities = 9/44 (20%), Positives = 25/44 (56%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
+K +K++ E QK+ E + EK + +K ++++++ ++
Sbjct: 463 EKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEE 506
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 29.3 bits (66), Expect = 1.2
Identities = 24/102 (23%), Positives = 42/102 (41%), Gaps = 9/102 (8%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK-------KEEKKQKK 71
K + EE+KK +K + ++K +E K+K++K+ +E + +
Sbjct: 109 KAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSSTPNSSRPSTP 168
Query: 72 EEEKDKKKEEKKQKK--KEEKKQKKKEEKKQKKKKEEKKNKK 111
K K+ ++ KK K +EK K K KK K
Sbjct: 169 HLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGK 210
>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs, all of which are expressed
during embryonic and early development in the nervous
system but with different localization and timing.
srGAPs contain an N-terminal F-BAR domain, a Rho GAP
domain, and a C-terminal SH3 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 241
Score = 29.2 bits (66), Expect = 1.2
Identities = 18/51 (35%), Positives = 33/51 (64%)
Query: 54 EEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
E K ++ EK++E+++Q E++ ++ + KK +K+ EK+Q K E K K K
Sbjct: 151 ERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSEAKLKCTK 201
Score = 27.7 bits (62), Expect = 3.9
Identities = 14/44 (31%), Positives = 30/44 (68%)
Query: 64 KEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
K +++ KE EK ++K+E+ +KK E+ + K+ +K+ +K++ K
Sbjct: 148 KSAERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 29.0 bits (65), Expect = 1.3
Identities = 21/84 (25%), Positives = 55/84 (65%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+K +KEEE + + + K++++K E ++ +KEK+ + E+++ ++++K++ +E++
Sbjct: 33 IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEE 92
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQ 100
K++E + K++E K+K+ K+
Sbjct: 93 KEEEAEDVKQQEVFSFKRKKPFKE 116
Score = 28.6 bits (64), Expect = 1.8
Identities = 27/81 (33%), Positives = 54/81 (66%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
I K +K++E+EN + K +E++QK + E+ +K+KE E + E +++E+++ +EE
Sbjct: 32 IIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEE 91
Query: 74 EKDKKKEEKKQKKKEEKKQKK 94
EK+++ E+ KQ++ K+KK
Sbjct: 92 EKEEEAEDVKQQEVFSFKRKK 112
Score = 28.3 bits (63), Expect = 2.3
Identities = 18/82 (21%), Positives = 52/82 (63%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
++++++E + + K++++K + EE E++KE + E++ E+++ ++ ++++K+
Sbjct: 35 KADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKE 94
Query: 87 KEEKKQKKKEEKKQKKKKEEKK 108
+E + K++E K+KK K+
Sbjct: 95 EEAEDVKQQEVFSFKRKKPFKE 116
Score = 26.7 bits (59), Expect = 6.8
Identities = 22/80 (27%), Positives = 52/80 (65%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
KK +KEE+K+ E K K++E+K E EE +++K+ + ++++ + ++KEE Q+++
Sbjct: 33 IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEE 92
Query: 88 EEKKQKKKEEKKQKKKKEEK 107
+E++ + ++++ K +K
Sbjct: 93 KEEEAEDVKQQEVFSFKRKK 112
Score = 26.7 bits (59), Expect = 8.0
Identities = 20/84 (23%), Positives = 53/84 (63%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
KK +E+ ++K+ E K K++E+K + ++ +KE++ + E++ ++Q+KEE +++
Sbjct: 33 IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEE 92
Query: 80 EEKKQKKKEEKKQKKKEEKKQKKK 103
+E++ + ++++ + KK K+
Sbjct: 93 KEEEAEDVKQQEVFSFKRKKPFKE 116
>gnl|CDD|233573 TIGR01781, Trep_dent_lipo, Treponema denticola clustered
lipoprotein. This model represents a family of six
predicted lipoproteins from a region of about 20
tandemly arranged genes in the Treponema denticola
genome. Two other neighboring genes share the
lipoprotein signal peptide region but do not show more
extensive homology. The function of this locus is
unknown.
Length = 412
Score = 29.6 bits (66), Expect = 1.4
Identities = 16/62 (25%), Positives = 28/62 (45%)
Query: 41 KKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
KKE+K+ +E + EK K E+K+ E K E K E+ ++ + + +
Sbjct: 22 KKEDKESGLREMRVEKFSKMEEKENEFLSKPEYNTHVKSPEQIKELSNKISGVSGKGESD 81
Query: 101 KK 102
K
Sbjct: 82 KF 83
Score = 28.0 bits (62), Expect = 3.5
Identities = 16/71 (22%), Positives = 25/71 (35%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
KKE KE + +K K ++KE E K E K E+ ++ + K E
Sbjct: 22 KKEDKESGLREMRVEKFSKMEEKENEFLSKPEYNTHVKSPEQIKELSNKISGVSGKGESD 81
Query: 109 NKKAEEQNRQQ 119
+
Sbjct: 82 KFVGTPLVKDD 92
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 28.1 bits (63), Expect = 1.4
Identities = 9/24 (37%), Positives = 19/24 (79%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEE 74
E+EE++E+++EE++++ EEE
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEEE 95
Score = 26.9 bits (60), Expect = 4.2
Identities = 7/24 (29%), Positives = 19/24 (79%)
Query: 53 KEEKEEKKEKKKEEKKQKKEEEKD 76
+EE++E+++EE+++++ EE+
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEA 96
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 29.3 bits (66), Expect = 1.4
Identities = 12/83 (14%), Positives = 40/83 (48%), Gaps = 2/83 (2%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQK 93
KE K + K+ + ++ +++E +E+++ + ++EE +D ++E + +E+ +
Sbjct: 350 KESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGS--QSREDGSSE 407
Query: 94 KKEEKKQKKKKEEKKNKKAEEQN 116
+ + + K ++ +
Sbjct: 408 SSSDVGSDSESKADKESASDSDS 430
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 29.7 bits (67), Expect = 1.4
Identities = 22/107 (20%), Positives = 64/107 (59%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
R+ + +++ +E E + + + ++ ++ EK + +E+ E+ EEK EK + E ++ EE
Sbjct: 292 RLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEE 351
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
+ + K +++ K+ E++ ++ E++ +K + K+ ++A ++ +++
Sbjct: 352 KNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEEL 398
Score = 29.3 bits (66), Expect = 1.5
Identities = 21/104 (20%), Positives = 53/104 (50%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ K EE ++ ++ E+ ++K E ++ ++E +E +EE + + ++ ++ EK K
Sbjct: 267 EALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKS 326
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLY 122
EE+ +K +E+ ++ + E ++ ++K E E +
Sbjct: 327 LEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERL 370
Score = 29.0 bits (65), Expect = 2.0
Identities = 26/115 (22%), Positives = 62/115 (53%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+ ++ E K + K E++ K+ EE+ ++ +++ E+ E+ ++ +E ++ KEE +
Sbjct: 342 ESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAEL 401
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLIGY 132
++ +++ E+ +K+ EE +++ ++ E++ KK EEQ Q L G
Sbjct: 402 SAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGA 456
Score = 29.0 bits (65), Expect = 2.0
Identities = 22/103 (21%), Positives = 57/103 (55%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ K++ EE+ +K + EE ++ K++ + Q+ KE+ + E++ ++ KE ++ +
Sbjct: 525 EALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTR 584
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K++ E+ +++ +E K+K KE +++ + EE + +
Sbjct: 585 KEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENE 627
Score = 27.8 bits (62), Expect = 4.7
Identities = 17/105 (16%), Positives = 55/105 (52%)
Query: 10 AISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ 69
+S + + K+ K + + + + E + + + +++ K+ +K E+ ++++EE++
Sbjct: 171 KLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEEL 230
Query: 70 KKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
++E E +++ + +++KE ++ K + + + E + EE
Sbjct: 231 EQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEE 275
Score = 27.8 bits (62), Expect = 5.5
Identities = 23/102 (22%), Positives = 53/102 (51%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+++ ++ + + + E + K +EE+ ++ + EE EEK E+ +E +++ +E E++
Sbjct: 248 KERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEEL 307
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+ ++ EE +K K +++ +K EEK K E
Sbjct: 308 EGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELA 349
Score = 27.0 bits (60), Expect = 8.8
Identities = 17/111 (15%), Positives = 58/111 (52%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
+ ++ + + ++K ++ + ++ E+ ++K + ++ KEE ++ + ++ K+ E+ +
Sbjct: 522 ELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLL 581
Query: 80 EEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLI 130
+K++ +E +++ K+ +KK K+ +E + Q+ + N+
Sbjct: 582 RTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAE 632
Score = 27.0 bits (60), Expect = 9.5
Identities = 23/96 (23%), Positives = 56/96 (58%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ EE +K ++ + E +++ + + +++ E+K E+ E+ E++ E+ +++ EE K
Sbjct: 652 LQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKL 711
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
E ++ ++ E ++ + EE K++ +K EK + EE
Sbjct: 712 GEIEQLIEELESRKAELEELKKELEKLEKALELLEE 747
>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein. The PcfK-like protein family
includes the Enterococcus faecalis PcfK protein, which
is functionally uncharacterized. This family of proteins
is found in bacteria and viruses. Proteins in this
family are typically between 137 and 257 amino acids in
length. There are two completely conserved residues (D
and L) that may be functionally important.
Length = 136
Score = 28.5 bits (64), Expect = 1.4
Identities = 13/59 (22%), Positives = 27/59 (45%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
+ + K K + EEEK + ++E + ++E+ +K ++ K+ KK E
Sbjct: 69 EDDIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPV 127
Score = 28.1 bits (63), Expect = 1.7
Identities = 14/60 (23%), Positives = 28/60 (46%)
Query: 41 KKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
+ + K K + E +EEK + ++E ++E+ +K ++ K+ KK E Q
Sbjct: 69 EDDIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPVQ 128
Score = 27.7 bits (62), Expect = 2.3
Identities = 17/60 (28%), Positives = 28/60 (46%)
Query: 33 KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQ 92
+ + K K + + +EEK E +KE K ++EE K ++ K+ KK E Q
Sbjct: 69 EDDIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPVQ 128
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 29.0 bits (65), Expect = 1.5
Identities = 12/27 (44%), Positives = 19/27 (70%)
Query: 55 EKEEKKEKKKEEKKQKKEEEKDKKKEE 81
EK++K + KK + K KK+E+K K +E
Sbjct: 90 EKKKKGKSKKNKLKGKKDEDKKKAREG 116
Score = 26.7 bits (59), Expect = 8.6
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 48 KKKEKKEEKEEKKEKKKEEKKQKKEE 73
KKK+ K +K + K KK E+KK+ +E
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAREG 116
Score = 26.7 bits (59), Expect = 9.3
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 41 KKEEKKQKKKEKKEEKEEKKEKKKEE 66
KK++ K KK + K +K+E K+K +E
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAREG 116
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 28.4 bits (64), Expect = 1.5
Identities = 22/81 (27%), Positives = 52/81 (64%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
K + K K++KK K+ +++ +E ++ K ++ K E E++K++++K+ K+K
Sbjct: 50 KFTEAVKVSIKKQKKINPKRLQRQAAKEVKKPGISTKAQQALKLEHERNKQEKKKRSKEK 109
Query: 88 EEKKQKKKEEKKQKKKKEEKK 108
+E+++++K + KQ+KKK + +
Sbjct: 110 KEEEKERKRQLKQQKKKAKHR 130
Score = 27.2 bits (61), Expect = 4.0
Identities = 21/55 (38%), Positives = 37/55 (67%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
++ KK K +Q K E ++ K+E+KK+ K++K+EEKE K++ K+++KK K
Sbjct: 75 AKEVKKPGISTKAQQALKLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKH 129
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 29.0 bits (65), Expect = 1.5
Identities = 29/112 (25%), Positives = 57/112 (50%), Gaps = 21/112 (18%)
Query: 25 EENKKKQKKKEEKKQKKKEE--KKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
EEN+++QK+KE ++Q ++ E K K + +++K+E+K + +E K++ E
Sbjct: 106 EENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKARAAKERKERLVAEA 165
Query: 83 -------------------KQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+QK+KEEKK+ K+ ++++K++K A Q
Sbjct: 166 REHFGYWVDPRDPRFQEMLQQKEKEEKKKVKEAKRREKEEKRMAALVAAAAQ 217
>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain.
Length = 225
Score = 28.9 bits (65), Expect = 1.5
Identities = 19/64 (29%), Positives = 40/64 (62%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
+ ++++ + EE++E EKKKEE +K ++E D E +KK++ K Q +KE ++
Sbjct: 115 RKWRERRDLRIEERDEASEKKKEELIEKAQKEIDDFYENYNEKKEKTKAQNRKEAEQFLA 174
Query: 103 KKEE 106
++++
Sbjct: 175 ERDD 178
>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
family. Members of this protein family occur in
Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
related Mycoplasmas in small paralogous families that
may also include truncated forms and/or pseudogenes.
Members are predicted lipoproteins with a conserved
signal peptidase II processing and lipid attachment
site. Note that the name for certain characterized
members, p72, reflects an anomalous apparent molecular
weight, given a theoretical MW of about 61 kDa.
Length = 541
Score = 29.4 bits (66), Expect = 1.6
Identities = 18/92 (19%), Positives = 37/92 (40%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
K+ E+K + K E K +K + KE + + + K+EE +K+
Sbjct: 25 STTSSNSKQPEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKK 84
Query: 85 KKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
+ K++ ++ K + KK ++K
Sbjct: 85 PDPSKNKEEIEKPKDEPKKPDKKPQADQPNNV 116
Score = 27.1 bits (60), Expect = 8.3
Identities = 18/91 (19%), Positives = 35/91 (38%), Gaps = 1/91 (1%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEK-KQKKEEEKDKKKEEKKQK 85
KK + K E K ++ K+ + + E K EE E+K + K++
Sbjct: 35 EKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNKEEI 94
Query: 86 KKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
+K + + KK ++K Q + +
Sbjct: 95 EKPKDEPKKPDKKPQADQPNNVHADQPNNNK 125
>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
Length = 444
Score = 29.3 bits (66), Expect = 1.6
Identities = 22/89 (24%), Positives = 36/89 (40%), Gaps = 1/89 (1%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
K + K + E + E KEE E ++ EEE ++++E+ +K
Sbjct: 204 KYPPSRGYNLKSELVEYSSEIVNEIKEEVCETDDESAYVSLD-AEEEFEEEEEKPAEKST 262
Query: 88 EEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
E Q KE K+ KE ++ E N
Sbjct: 263 ESTFQLSKETSIAKESKELPDGEEIEFGN 291
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 29.2 bits (66), Expect = 1.6
Identities = 8/76 (10%), Positives = 36/76 (47%), Gaps = 1/76 (1%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
EE + + + ++ + +E+ ++ +E++E K + E+ +++ +E
Sbjct: 105 DIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERS 164
Query: 83 KQKKKEEKKQKKKEEK 98
+++ +++ +++
Sbjct: 165 -LERRRRRREWEEKRA 179
>gnl|CDD|225887 COG3351, FlaD, Putative archaeal flagellar protein D/E [Cell
motility and secretion].
Length = 214
Score = 29.0 bits (65), Expect = 1.6
Identities = 26/80 (32%), Positives = 42/80 (52%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
EE K+K +E Q KK E EE E+ ++ K+ E+ D +EE +K+K ++ K
Sbjct: 21 EELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEKVDSIEEEISEKEKVMSEKLK 80
Query: 95 KEEKKQKKKKEEKKNKKAEE 114
+ + +EE+K K EE
Sbjct: 81 EPAQMSSTSEEEEKKAKLEE 100
Score = 28.7 bits (64), Expect = 1.9
Identities = 23/99 (23%), Positives = 48/99 (48%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEK 67
K + +I K +K + K+K +E Q KK + + ++ + ++ K+ E+
Sbjct: 2 KPYLEEKIEKAEKVTAFALEELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEKV 61
Query: 68 KQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
+EE +K+K ++ K+ + EE+++K K EE
Sbjct: 62 DSIEEEISEKEKVMSEKLKEPAQMSSTSEEEEKKAKLEE 100
>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
Cobaltochelatase is responsible for the insertion of
cobalt into the corrin ring of coenzyme B12 during its
biosynthesis. Two versions have been well described.
CbiK/CbiX is a monomeric, anaerobic version which acts
early in the biosynthesis (pfam06180). CobNST is a
trimeric, ATP-dependent, aerobic version which acts late
in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
number of genomes (actinobacteria, cyanobacteria,
betaproteobacteria and pseudomonads) which apparently
biosynthesize B12, encode a cobN gene but are
demonstrably lacking cobS and cobT. These genomes do,
however contain a homolog (modelled here) of the
magnesium chelatase subunits BchI/BchD family. Aside
from the cyanobacteria (which have a separate magnesium
chelatase trimer), these species do not make chlorins,
so do not have any use for a magnesium chelatase.
Furthermore, in nearly all cases the members of this
family are proximal to either CobN itself or other genes
involved in cobalt transport or B12 biosynthesis.
Length = 633
Score = 29.3 bits (66), Expect = 1.6
Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 5/65 (7%)
Query: 54 EEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
E+ + K EK EEK E+ EK + +++ +Q +EK+ E
Sbjct: 315 EQPQGKDEKDLEEKP-----EEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPDPQE 369
Query: 114 EQNRQ 118
E + Q
Sbjct: 370 EADGQ 374
>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of
70kDa MW N terminal. This domain is found in
eukaryotes. This domain is about 90 amino acids in
length. This domain is found associated with pfam00076.
This domain is part of U1 snRNP, which is the pre-mRNA
binding protein of the penta-snRNP spliceosome complex.
It extends over a distance of 180 A from its RNA
binding domain, wraps around the core domain of U1
snRNP consisting of the seven Sm proteins and finally
contacts U1-C, which is crucial for 5'-splice-site
recognition.
Length = 94
Score = 27.6 bits (62), Expect = 1.6
Identities = 14/40 (35%), Positives = 26/40 (65%)
Query: 42 KEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
K+ K + E E EK+E++K EKK+K E++ +++ +E
Sbjct: 48 KDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKE 87
Score = 27.6 bits (62), Expect = 1.7
Identities = 15/40 (37%), Positives = 26/40 (65%)
Query: 53 KEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQ 92
K+ K+E + E +K+E EK +KKE+ ++K +EE K+
Sbjct: 48 KDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKE 87
Score = 26.5 bits (59), Expect = 4.7
Identities = 15/47 (31%), Positives = 28/47 (59%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+ K+E + E +K++ +K EK+EK EKK EE+ ++ + +D
Sbjct: 48 KDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKEWDPNEDP 94
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 29.5 bits (66), Expect = 1.6
Identities = 21/94 (22%), Positives = 35/94 (37%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K K E EN KKQ +EK ++ ++ + + K + E
Sbjct: 191 KLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDM 250
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
K + E + K E + E+ K+ EE+ K
Sbjct: 251 KNRYESEIKTAESDLSMELEKNNYYKELEERHMK 284
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 29.0 bits (65), Expect = 1.6
Identities = 14/61 (22%), Positives = 33/61 (54%)
Query: 26 ENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQK 85
+ K+K+ ++ +K +KK ++K +EEKK + ++K ++ E + D +E +
Sbjct: 48 PKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQ 107
Query: 86 K 86
+
Sbjct: 108 E 108
Score = 27.4 bits (61), Expect = 4.6
Identities = 17/71 (23%), Positives = 33/71 (46%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
KKK+ K KK +KK K+K +++KK E ++K E+ E + ++ + ++
Sbjct: 52 KKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESAS 111
Query: 79 KEEKKQKKKEE 89
E+
Sbjct: 112 ANSLSDIDNED 122
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 29.2 bits (65), Expect = 1.6
Identities = 21/81 (25%), Positives = 46/81 (56%), Gaps = 4/81 (4%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
EE++ + +E+++ +KE + +EK E+++EE+ + ++ D + + KK K+ K
Sbjct: 142 EEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPSVKDNGKSFW 201
Query: 95 KE----EKKQKKKKEEKKNKK 111
KK +K ++++K KK
Sbjct: 202 SAASVFSKKLQKWRQKQKLKK 222
Score = 28.5 bits (63), Expect = 3.3
Identities = 27/112 (24%), Positives = 46/112 (41%), Gaps = 7/112 (6%)
Query: 1 MKISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK 60
+ SIAP EE + + + EE ++ ++E + EK EE+EE++
Sbjct: 117 LPSSIAPPEIDPEPRKPIVPDLVLEEEE--EVEMEEDEEYYEKEPGKVVDEKSEEEEEEE 174
Query: 61 EKKKEE-----KKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
K ++ + KK KD K KK +K +K++ KK
Sbjct: 175 LKTMKDFIDLESQTKKPSVKDNGKSFWSAASVFSKKLQKWRQKQKLKKPRSG 226
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 28.3 bits (64), Expect = 1.7
Identities = 14/72 (19%), Positives = 29/72 (40%), Gaps = 6/72 (8%)
Query: 47 QKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEE 106
Q K K+ E + + E + E + K + +K+K + + KE + + K+ E
Sbjct: 45 QAKSIKQHLAELAADLLELEAAAPRAE--LQAKIARYKKEKARYRSEAKELEAKAKEAEA 102
Query: 107 KKNKKAEEQNRQ 118
+ + Q
Sbjct: 103 ES----DHALHQ 110
Score = 27.9 bits (63), Expect = 2.3
Identities = 12/45 (26%), Positives = 22/45 (48%), Gaps = 1/45 (2%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKK-EEKKQKKEEEKDK 77
+ E + E + + + KKE+ + E K+ E K ++ E E D
Sbjct: 62 ELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDH 106
Score = 27.2 bits (61), Expect = 5.2
Identities = 11/49 (22%), Positives = 21/49 (42%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
E + + E + K K+EK + + KE + + KE E + +
Sbjct: 62 ELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHALHQ 110
Score = 26.8 bits (60), Expect = 7.2
Identities = 12/42 (28%), Positives = 22/42 (52%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
E E + + + K + K+EK + + E KE + + KE + E
Sbjct: 62 ELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAE 103
>gnl|CDD|219038 pfam06459, RR_TM4-6, Ryanodine Receptor TM 4-6. This region covers
TM regions 4-6 of the ryanodine receptor 1 family.
Length = 217
Score = 28.9 bits (65), Expect = 1.8
Identities = 13/30 (43%), Positives = 19/30 (63%), Gaps = 1/30 (3%)
Query: 57 EEKKEKKKEEKKQKKEEEK-DKKKEEKKQK 85
EK+E K+E ++ K E EK D + EK+ K
Sbjct: 86 AEKRELKEEGEEPKSEPEKADLEDGEKRTK 115
Score = 27.0 bits (60), Expect = 7.2
Identities = 14/41 (34%), Positives = 21/41 (51%), Gaps = 1/41 (2%)
Query: 54 EEKEEKKEKKKEEKKQKKEEEKDKKKE-EKKQKKKEEKKQK 93
E+ E EK++ KEE ++ K E EK + EK+ K
Sbjct: 75 PEEPPTPEGAPAEKRELKEEGEEPKSEPEKADLEDGEKRTK 115
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 29.2 bits (66), Expect = 1.8
Identities = 10/36 (27%), Positives = 22/36 (61%)
Query: 36 EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKK 71
EKK+ KK+ +K+ + + +++KE + +KQ+
Sbjct: 85 EKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH 120
Score = 28.8 bits (65), Expect = 2.8
Identities = 11/37 (29%), Positives = 20/37 (54%)
Query: 56 KEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQ 92
+E+KK KK EK+ + KK ++ Q E+++
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH 120
Score = 28.0 bits (63), Expect = 4.9
Identities = 9/36 (25%), Positives = 20/36 (55%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK 63
+KK+ KK+ +K+ + KK K+ + +K++
Sbjct: 85 EKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH 120
Score = 27.6 bits (62), Expect = 5.6
Identities = 11/36 (30%), Positives = 20/36 (55%), Gaps = 1/36 (2%)
Query: 27 NKKKQKKKEEKKQKK-KEEKKQKKKEKKEEKEEKKE 61
KKK KK EK+ + + +K+KE + E+++
Sbjct: 85 EKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH 120
Score = 27.6 bits (62), Expect = 6.5
Identities = 7/37 (18%), Positives = 22/37 (59%)
Query: 50 KEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
+EKK+ K++ +++ + + ++ K+ + +KQ+
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH 120
Score = 27.6 bits (62), Expect = 6.8
Identities = 11/38 (28%), Positives = 19/38 (50%)
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
+E+KK KK EK+ + KK+KE + + +
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRHV 121
>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease. The potyviridae
family positive stand RNA viruses with genome encoding a
polyprotein. members include zucchini yellow mosaic
virus, and turnip mosaic viruses which cause
considerable losses of crops worldwide. This family
consists of a C terminus region from various plant
potyvirus P1 proteins (found at the N terminus of the
polyprotein). The C terminus of P1 is a serine-type
protease responsible for autocatalytic cleavage between
P1 and the helper component protease pfam00851. The
entire P1 protein may be involved in virus-host
interactions.
Length = 245
Score = 28.8 bits (65), Expect = 1.8
Identities = 23/95 (24%), Positives = 41/95 (43%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K K K+E+ + +K Q K+ +K K++E++E + + K
Sbjct: 14 KEMSKIKQEKKGRIILRKLSPAQVAKKREKLKREEREERQFLQGAYASIVSKITPIGTDK 73
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
K E + K+ K+ KK+KKKK+ + K
Sbjct: 74 VSKTESVSFRTPYYKRTTKKMKKKKKKKKVVMSDK 108
Score = 28.5 bits (64), Expect = 2.5
Identities = 20/89 (22%), Positives = 39/89 (43%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
+ K+ + + + + KK EK ++++ E++Q + K +K K E
Sbjct: 20 KQEKKGRIILRKLSPAQVAKKREKLKREEREERQFLQGAYASIVSKITPIGTDKVSKTES 79
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
+ K+ KK +KK+KKK+ K
Sbjct: 80 VSFRTPYYKRTTKKMKKKKKKKKVVMSDK 108
>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
Length = 222
Score = 28.8 bits (64), Expect = 1.9
Identities = 14/54 (25%), Positives = 35/54 (64%)
Query: 46 KQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKK 99
K K K +K+E +EK+EK +E + Q +++E+ ++ + + + +++ +Q ++K
Sbjct: 147 KTKSKNQKQENQEKQEKLEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNNNKRK 200
>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 753
Score = 29.2 bits (66), Expect = 1.9
Identities = 22/116 (18%), Positives = 46/116 (39%), Gaps = 9/116 (7%)
Query: 13 PRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKK--------KEKKEEKEEKKEKKK 64
G+ K+ +E K ++ +KE +++ ++ EK + K E K +
Sbjct: 497 TEFGEEKELLEELIEKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIV 556
Query: 65 EEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ-KKKKEEKKNKKAEEQNRQQ 119
++ K E + K KK+K + + ++KKNK E +++
Sbjct: 557 VLAHMGLPVPAEEAKVEAVDEVKFLKKKKGILDAGAFESTLKDKKNKVLPEAKKRK 612
Score = 28.8 bits (65), Expect = 2.2
Identities = 20/104 (19%), Positives = 34/104 (32%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ +K +E + K E K Q E+ K E + + K
Sbjct: 524 LEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMGLPVPAEEAKVEAVDEVKFLKK 583
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
KK + K KK + + KK++ K + E + G
Sbjct: 584 KKGILDAGAFESTLKDKKNKVLPEAKKRKLKLGDEVEVITGEPG 627
Score = 28.1 bits (63), Expect = 4.1
Identities = 21/106 (19%), Positives = 37/106 (34%), Gaps = 6/106 (5%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKKKEEK-----KQKKKEEKKQKKKEKKEEKEEKKEK 62
+ I RK+ ++E E +K + E K + K +
Sbjct: 507 EELIEKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMGLPVP 566
Query: 63 KKEEKKQKKEEEKDKKKEEKKQKKKE-EKKQKKKEEKKQKKKKEEK 107
+E K + +E K KK++ E K K+ K + K+ K
Sbjct: 567 AEEAKVEAVDEVKFLKKKKGILDAGAFESTLKDKKNKVLPEAKKRK 612
>gnl|CDD|237200 PRK12778, PRK12778, putative bifunctional 2-polyprenylphenol
hydroxylase/glutamate synthase subunit beta;
Provisional.
Length = 752
Score = 28.9 bits (65), Expect = 2.0
Identities = 9/54 (16%), Positives = 22/54 (40%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
K + EE K +++ + + ++ K K++ E + + + EE
Sbjct: 261 KTIEGEELLKLEERTAAWRAELRKSMKPKERTAIERVPMPELDPEYRAHNRFEE 314
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 29.2 bits (65), Expect = 2.0
Identities = 32/129 (24%), Positives = 66/129 (51%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
++ + +++ + +++ +K++E Q KE KEE++EKK +++E K KEEE+
Sbjct: 240 IDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEEL 299
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLIGYLRSV 136
K + K +++K + ++K KE +K+ KK E++ K+ EE + +
Sbjct: 300 KSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEE 359
Query: 137 DQFANLVLH 145
+Q L
Sbjct: 360 EQLEKLQEK 368
Score = 27.6 bits (61), Expect = 5.4
Identities = 29/123 (23%), Positives = 74/123 (60%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ + +ENK+++K+K+ ++++ K K++++ K E + ++ K +E+K K+ E++
Sbjct: 265 EEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEL 324
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLIGYLRSV 136
KK E++ +K+KEE ++ +KE K+ + K+E ++ ++ + + Q+ L ++ L
Sbjct: 325 KKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLES 384
Query: 137 DQF 139
++
Sbjct: 385 ERL 387
Score = 27.6 bits (61), Expect = 6.6
Identities = 28/106 (26%), Positives = 69/106 (65%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
+ K ++++ ++++K K+ E++ +K ++E K++K+E +E ++E KE + + + +++EE
Sbjct: 300 KSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEE 359
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
E+ +K +EK ++ +EE KKK E ++ + K ++ E +N ++
Sbjct: 360 EQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEE 405
Score = 27.2 bits (60), Expect = 7.2
Identities = 24/103 (23%), Positives = 55/103 (53%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
++ K KE+ K + + ++K + +EE K +E ++ + +++E E
Sbjct: 198 LQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESS 257
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K++ EK+++ + ++ KEE+K+KK +EE+ A+E+ +
Sbjct: 258 KQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELK 300
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 28.8 bits (65), Expect = 2.1
Identities = 21/94 (22%), Positives = 51/94 (54%), Gaps = 3/94 (3%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEK---KQKKKEKKEEKEEKKEKKKEEKKQKK 71
+G + KEE + +++ KE +++ +K +K K KK + + EK+E+ ++ + K+
Sbjct: 325 VGVDFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKE 384
Query: 72 EEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKE 105
+ ++ ++ E++ K+ +E+ + E K K
Sbjct: 385 KLSEELEELEEELKELKEELESLYSEGKISVNKT 418
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 28.5 bits (64), Expect = 2.1
Identities = 19/106 (17%), Positives = 48/106 (45%), Gaps = 3/106 (2%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEK---EEKKEKKKEEKKQKKEE 73
K K E+ +K + + +E+ + K+++ EE + + ++
Sbjct: 74 KENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSDSSSDSDSSDD 133
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+ D E + + +K K+E+ ++K++EE++ EE+ R++
Sbjct: 134 DSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREE 179
>gnl|CDD|227382 COG5049, XRN1, 5'-3' exonuclease [DNA replication, recombination,
and repair / Cell division and chromosome partitioning /
Translation].
Length = 953
Score = 29.1 bits (65), Expect = 2.1
Identities = 15/90 (16%), Positives = 37/90 (41%), Gaps = 3/90 (3%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKK---KEKKEEKEEKKEKKKEEKKQKKEEEK 75
KK +EE K + ++ ++++KE K E+K+ K ++ ++KK +
Sbjct: 378 KKDHIQEERKNESLERFSLRKERKEGLKGMPRVVYEQKKLIGSIKPTLMDQLQEKKSPDL 437
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKE 105
++ ++ K E ++ +
Sbjct: 438 PDEEFIDTLALPKDLDMKNHELFLKRFAND 467
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 29.1 bits (65), Expect = 2.1
Identities = 22/81 (27%), Positives = 44/81 (54%)
Query: 37 KKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKE 96
KK K+E K KK+ E KEE ++ ++E +++ K E ++ K+ E + ++ +K E
Sbjct: 59 KKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTERATSLDRKDE 118
Query: 97 EKKQKKKKEEKKNKKAEEQNR 117
K+K E K + ++++
Sbjct: 119 NLSSKEKTLESKEQSLTDKSK 139
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 28.7 bits (63), Expect = 2.2
Identities = 23/99 (23%), Positives = 55/99 (55%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
KK + + K+ + ++ + E +++++K ++E+++ +KE + Q K E++ +K
Sbjct: 126 KKIELAQAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQK 185
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
E++KQK ++EK++ K + E++K K E+
Sbjct: 186 TEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQD 224
Score = 28.7 bits (63), Expect = 2.2
Identities = 24/103 (23%), Positives = 61/103 (59%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
++++KK + + +K+ + ++ K + E++E+K E++K++ +++ E + + + ++
Sbjct: 122 DDQDKKIELAQAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQ 181
Query: 84 QKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQK 126
+K+K E++++K E++KQK KN EQ +Q+ Q
Sbjct: 182 EKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQD 224
Score = 27.6 bits (60), Expect = 6.2
Identities = 22/103 (21%), Positives = 59/103 (57%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
+ KK + + KK+ + ++ K E +Q++++ ++EK++ +++ E + + E+
Sbjct: 122 DDQDKKIELAQAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQ 181
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+K+K E++++K E++KQK + + E++ +K E + +
Sbjct: 182 EKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQD 224
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 28.7 bits (64), Expect = 2.2
Identities = 20/97 (20%), Positives = 45/97 (46%), Gaps = 2/97 (2%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
+KE + K+ K E + + + E E K+E++ E ++ +KE++ +
Sbjct: 42 EKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELV 101
Query: 82 KKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
+ Q++KE+ + +E + ++ +N E N Q
Sbjct: 102 ELQEEKEQLEN--EELQYLREYNLFDRNNLQLEDNLQ 136
Score = 26.8 bits (59), Expect = 9.4
Identities = 14/92 (15%), Positives = 42/92 (45%), Gaps = 1/92 (1%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE-EEKDKKKEEKKQKKKEE 89
Q + + E + KKE++ +E +E +KE+ E E ++KE+ + ++ +
Sbjct: 58 QNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQY 117
Query: 90 KKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
++ ++ + ++ ++ + + + L
Sbjct: 118 LREYNLFDRNNLQLEDNLQSLELQYEYSLNQL 149
Score = 26.8 bits (59), Expect = 9.5
Identities = 15/87 (17%), Positives = 38/87 (43%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
E + KK++++ + ++ E++ + E +E+KE+ + E+ Q E +
Sbjct: 70 SELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNL 129
Query: 83 KQKKKEEKKQKKKEEKKQKKKKEEKKN 109
+ + + + + E + K K N
Sbjct: 130 QLEDNLQSLELQYEYSLNQLDKLRKTN 156
>gnl|CDD|227463 COG5134, COG5134, Uncharacterized conserved protein [Function
unknown].
Length = 272
Score = 28.5 bits (63), Expect = 2.2
Identities = 21/99 (21%), Positives = 47/99 (47%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
G R+K + ++ N+ K + +K + + + +K+ ++K EK + +
Sbjct: 105 GGRRKIEPQDINEDPAKAENVEKVPESDAIEALEKQLTQQKSEKHNSSAINFIDELNKRL 164
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
Q+ +++ +++KK EKKQ+ K KN+ A +
Sbjct: 165 WSDPFVSSQRLRKQFRERKKIEKKQEAKDLSLKNRAALD 203
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 29.0 bits (65), Expect = 2.2
Identities = 21/100 (21%), Positives = 56/100 (56%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K + EN KK++++ E K+K++E +++ K +++ + K + + +++ D
Sbjct: 31 KESRDAAAYENWKKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDDD 90
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
K KK KK+++KK+ ++++ +KE+++ + ++
Sbjct: 91 TKAWLKKSKKRQKKKEAERKKALLLDEKEKERAAEYTSED 130
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 28.8 bits (65), Expect = 2.2
Identities = 20/95 (21%), Positives = 45/95 (47%), Gaps = 2/95 (2%)
Query: 11 ISPRIGKRKKK-KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ 69
I P++ K+ + K+E + ++ + + EK +++ K +K K+ EE +Q
Sbjct: 176 IKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQLEELQQ 235
Query: 70 KKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
+ +E E KK E ++ + EK +++ +
Sbjct: 236 ELQE-LTIAIEALTNKKSELLEEIAEAEKIREECR 269
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 28.4 bits (64), Expect = 2.3
Identities = 15/58 (25%), Positives = 39/58 (67%), Gaps = 1/58 (1%)
Query: 37 KKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
+KQ + E+ ++++E+K ++EE +E+ K + E+ + K+K ++ QK+++E+ + +
Sbjct: 113 RKQLRFLEQLEREEEEKRDEEE-RERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHR 169
Score = 28.1 bits (63), Expect = 2.8
Identities = 12/45 (26%), Positives = 30/45 (66%)
Query: 29 KKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
+ ++++EEK+ +++ E+ + + + E+ K+K KE +K++ EE
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165
Score = 27.7 bits (62), Expect = 3.6
Identities = 11/50 (22%), Positives = 35/50 (70%)
Query: 54 EEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
E+ E ++E+K++E+++++ K + E+ + K++ K+ +K+E+++ + +
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHR 169
Score = 26.9 bits (60), Expect = 6.5
Identities = 13/49 (26%), Positives = 31/49 (63%)
Query: 69 QKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNR 117
Q + EE++K+ EE++++ K + ++ + ++K KE +K + E ++R
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHR 169
>gnl|CDD|153289 cd07605, I-BAR_IMD, Inverse (I)-BAR, also known as the IRSp53/MIM
homology Domain (IMD), a dimerization module that binds
and bends membranes. Inverse (I)-BAR (or IMD) is a
member of the Bin/Amphiphysin/Rvs (BAR) domain family.
It is a dimerization and lipid-binding module that bends
membranes and induces membrane protrusions in the
opposite direction compared to classical BAR and F-BAR
domains, which produce membrane invaginations. IMD
domains are found in Insulin Receptor tyrosine kinase
Substrate p53 (IRSp53), Missing in Metastasis (MIM), and
Brain-specific Angiogenesis Inhibitor 1-Associated
Protein 2-like (BAIAP2L) proteins. These are
multi-domain proteins that act as scaffolding proteins
and transducers of a variety of signaling pathways that
link membrane dynamics and the underlying actin
cytoskeleton. Most members contain an N-terminal IMD, an
SH3 domain, and a WASP homology 2 (WH2) actin-binding
motif at the C-terminus, exccept for MIM which does not
carry an SH3 domain. Some members contain additional
domains and motifs. The IMD domain binds and bundles
actin filaments, binds membranes and produces membrane
protrusions, and interacts with the small GTPase Rac.
Length = 223
Score = 28.5 bits (64), Expect = 2.3
Identities = 26/73 (35%), Positives = 39/73 (53%), Gaps = 2/73 (2%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
EKK + +K K EK KK E KQK+E+ + E KK +KK +K K ++K +
Sbjct: 100 EKKLELDQKVINKFEKDYKK--EYKQKREDLDKARSELKKLQKKSQKSGTGKYQEKLDQA 157
Query: 104 KEEKKNKKAEEQN 116
EE +K+ E +
Sbjct: 158 LEELNDKQKELEA 170
Score = 27.7 bits (62), Expect = 4.5
Identities = 24/71 (33%), Positives = 39/71 (54%), Gaps = 3/71 (4%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
EKK E ++K K E+ +K E +K+E+ + + E KK +KK +K K +EK ++
Sbjct: 100 EKKLELDQKVINKFEKDYKK---EYKQKREDLDKARSELKKLQKKSQKSGTGKYQEKLDQ 156
Query: 111 KAEEQNRQQGL 121
EE N +Q
Sbjct: 157 ALEELNDKQKE 167
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 28.4 bits (64), Expect = 2.3
Identities = 27/106 (25%), Positives = 55/106 (51%)
Query: 10 AISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ 69
+ I + ++ E + +++ +E+ KK +E ++ + + E ++E+ +EE ++
Sbjct: 55 DEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQIEREQEEWEEELER 114
Query: 70 KKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
EE K + EE +K +EE ++ +E +Q K EK KK EE
Sbjct: 115 LIEEAKAEGYEEGYEKGREEGLEEVRELIEQLNKIIEKLIKKREEI 160
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 28.8 bits (64), Expect = 2.3
Identities = 19/79 (24%), Positives = 36/79 (45%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
+ K K+E K+KQ +KE + + ++++ K E +E E + EEE+
Sbjct: 64 ESKPKISKKERRKRKQAEKETEGKTEEKKSTAPKNAPVEPADELPEIDESSVANLSEEER 123
Query: 76 DKKKEEKKQKKKEEKKQKK 94
K + K+K + + K
Sbjct: 124 KKYAAKLKEKGNKAYRNKD 142
Score = 27.6 bits (61), Expect = 4.9
Identities = 23/89 (25%), Positives = 46/89 (51%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+K +++ E+K K KKE +K+K+ E++ + K E+K+ K + + + +E
Sbjct: 57 RKGQQQRESKPKISKKERRKRKQAEKETEGKTEEKKSTAPKNAPVEPADELPEIDESSVA 116
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
++++KK K K+K K + K K
Sbjct: 117 NLSEEERKKYAAKLKEKGNKAYRNKDFNK 145
>gnl|CDD|227376 COG5043, MRS6, Vacuolar protein sorting-associated protein
[Intracellular trafficking and secretion].
Length = 2552
Score = 29.1 bits (65), Expect = 2.3
Identities = 13/84 (15%), Positives = 31/84 (36%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
K+ + K++ ++ + K +K+ E+ ++ +K+E D K
Sbjct: 334 VVLDSIHEKRYHWTWKYFKERCEDRRAYMKIIRKKYLNEQLSADLSKELEKRELTYDLIK 393
Query: 80 EEKKQKKKEEKKQKKKEEKKQKKK 103
+ + K++ K Q K
Sbjct: 394 YRSRLFRSLAKRRNSIYLKPQTHK 417
>gnl|CDD|212489 cd11678, archaeal_LSm, archaeal Like-Sm protein. The archaeal
Sm-like (LSm): The Sm proteins are conserved in all
three domains of life and are always associated with
U-rich RNA sequences. They function to mediate RNA-RNA
interactions and RNA biogenesis. All Sm proteins contain
a common sequence motif in two segments, Sm1 and Sm2,
separated by a short variable linker. Eukaryotic Sm
proteins form part of specific small nuclear
ribonucleoproteins (snRNPs) that are involved in the
processing of pre-mRNAs to mature mRNAs, and are a major
component of the eukaryotic spliceosome. Most snRNPs
consist of seven Sm proteins (B/B', D1, D2, D3, E, F and
G) arranged in a ring on a uridine-rich sequence (Sm
site), plus a small nuclear RNA (snRNA) (either U1, U2,
U5 or U4/6). Since archaebacteria do not have any
splicing apparatus, their Sm proteins may play a more
general role. Archaeal LSm proteins are likely to
represent the ancestral Sm domain. Members of this
family share a highly conserved Sm fold containing an
N-terminal helix followed by a strongly bent
five-stranded antiparallel beta-sheet. Sm-like proteins
exist in archaea as well as prokaryotes that form
heptameric and hexameric ring structures similar to
those found in eukaryotes.
Length = 69
Score = 26.7 bits (59), Expect = 2.5
Identities = 17/49 (34%), Positives = 26/49 (53%), Gaps = 3/49 (6%)
Query: 127 RTLIGYLRSVDQFANLVLHKTIERIHVGNQYGDIPRGIFIIRGENVVLM 175
L G L +VD + NL L T+E VG + G ++RG N++L+
Sbjct: 22 NQLQGRLVAVDDYMNLHLTDTME--CVGEEKVR-SLGTVVLRGNNILLI 67
>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
Length = 1052
Score = 28.8 bits (64), Expect = 2.5
Identities = 23/101 (22%), Positives = 41/101 (40%), Gaps = 2/101 (1%)
Query: 3 ISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK-- 60
+S A + A + +K + ++ K+ + +KK KK ++ KK
Sbjct: 952 VSEAAQQAATAVASGIEKVAEAVRKTVRRSVKKAAATRAAMKKKVAKKAPAKKAAAKKAA 1011
Query: 61 EKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQK 101
KK KK+ ++ KK K KK KK +K ++
Sbjct: 1012 AKKAAAKKKVAKKAPAKKVARKPAAKKAAKKPARKAAGRKA 1052
Score = 28.4 bits (63), Expect = 3.6
Identities = 21/84 (25%), Positives = 35/84 (41%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
E+ + +K + KK + K+K +K K+ ++ KK K K ++
Sbjct: 968 EKVAEAVRKTVRRSVKKAAATRAAMKKKVAKKAPAKKAAAKKAAAKKAAAKKKVAKKAPA 1027
Query: 85 KKKEEKKQKKKEEKKQKKKKEEKK 108
KK K KK KK +K +K
Sbjct: 1028 KKVARKPAAKKAAKKPARKAAGRK 1051
Score = 28.4 bits (63), Expect = 3.7
Identities = 20/84 (23%), Positives = 37/84 (44%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
+K E +K ++ +K + K+K +K K+ +K +K K++ K
Sbjct: 968 EKVAEAVRKTVRRSVKKAAATRAAMKKKVAKKAPAKKAAAKKAAAKKAAAKKKVAKKAPA 1027
Query: 81 EKKQKKKEEKKQKKKEEKKQKKKK 104
+K +K KK KK +K +K
Sbjct: 1028 KKVARKPAAKKAAKKPARKAAGRK 1051
>gnl|CDD|218585 pfam05424, Duffy_binding, Duffy binding domain. This domain is
found in Plasmodium Duffy binding proteins. Plasmodium
vivax and Plasmodium knowlesi merozoites invade human
erythrocytes that express Duffy blood group surface
determinants. The Duffy receptor family is localised in
micronemes, an organelle found in all organisms of the
phylum Apicomplexa.
Length = 276
Score = 28.5 bits (64), Expect = 2.5
Identities = 32/78 (41%), Positives = 44/78 (56%), Gaps = 1/78 (1%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
R++KKK E+ KKK K+K+ K K+ K K KK +K KKKE KQKK+ +K K
Sbjct: 199 RERKKKLEKLKKKCKEKKCNKYCSKKCTKCKNACKKYKKWIDN-KKKEWDKQKKKYKKYK 257
Query: 78 KKEEKKQKKKEEKKQKKK 95
K+ KK + K K+
Sbjct: 258 KRNNKKNYSDIKDKYAKE 275
Score = 27.3 bits (61), Expect = 6.6
Identities = 26/77 (33%), Positives = 44/77 (57%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQK 93
+E KK+ +K +KK K+K+ + +K K K K+ K+ +KKKE KQKKK +K +K
Sbjct: 199 RERKKKLEKLKKKCKEKKCNKYCSKKCTKCKNACKKYKKWIDNKKKEWDKQKKKYKKYKK 258
Query: 94 KKEEKKQKKKKEEKKNK 110
+ +K K++ +
Sbjct: 259 RNNKKNYSDIKDKYAKE 275
>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
Length = 158
Score = 28.2 bits (63), Expect = 2.6
Identities = 20/95 (21%), Positives = 44/95 (46%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K KKE E+ + KK+ K ++ + K + E K ++ ++ KK ++ +++
Sbjct: 64 LKYKKEFESAVESLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDTIPNHLNEE 123
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
+ + +E + +E K K + KK+ K +
Sbjct: 124 VSNETKLTQESSSDESPKEVKLATKNKTKKHDKEK 158
>gnl|CDD|148511 pfam06936, Selenoprotein_S, Selenoprotein S (SelS). This family
consists of several mammalian selenoprotein S (SelS)
sequences. SelS is a plasma membrane protein and is
present in a variety of tissues and cell types. The
function of this family is unknown.
Length = 190
Score = 28.3 bits (63), Expect = 2.6
Identities = 29/106 (27%), Positives = 52/106 (49%), Gaps = 8/106 (7%)
Query: 11 ISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKE----KKEEKEEKKEKKKEE 66
+ PR +K+ E+E K ++Q+ E + + +E K + +EK+++ +EE
Sbjct: 54 LRPRWRSLQKRSSEKEITVFDPDKVVRRQEALEAARLRMQEDLDAKAAKFKEKQKQLEEE 113
Query: 67 KKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ----KKKKEEKK 108
K+++K E D+ +E K K + Q+ EE K K EKK
Sbjct: 114 KRRQKIEMWDRMQEGKSYKVSAKLAQEATEETSSSSVLPKPKTEKK 159
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 27.7 bits (62), Expect = 2.6
Identities = 25/74 (33%), Positives = 52/74 (70%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
++K EE K+K ++K KK+ K+++KKQKKK+KK+ K+ K+++KE K +E ++++
Sbjct: 69 QQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEG 128
Query: 81 EKKQKKKEEKKQKK 94
E+ ++++ + +K
Sbjct: 129 EEDKQEEPVEIMEK 142
Score = 27.3 bits (61), Expect = 4.1
Identities = 35/87 (40%), Positives = 63/87 (72%), Gaps = 1/87 (1%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+K KKE E+++ Q+K+EEKK+K EEK KK+ K+++K++KK+KKK+ KK K+EEK+
Sbjct: 57 EKWKKETEDEEFQQKREEKKRKD-EEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGS 115
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKE 105
K ++ +EE+ ++ K+E+ + ++
Sbjct: 116 KSSEESSDEEEEGEEDKQEEPVEIMEK 142
Score = 26.5 bits (59), Expect = 6.9
Identities = 27/65 (41%), Positives = 47/65 (72%)
Query: 50 KEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
++ K+E E+++ ++K E+K++K+EEK KK K+QKKK++KK+KKK +K KK+++E
Sbjct: 57 EKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSK 116
Query: 110 KKAEE 114
E
Sbjct: 117 SSEES 121
Score = 26.2 bits (58), Expect = 8.7
Identities = 27/74 (36%), Positives = 53/74 (71%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
++K+++K+ ++++K KK K+QKKK++KK+KKK KK K+E+KE K ++ EEE+
Sbjct: 69 QQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEG 128
Query: 77 KKKEEKKQKKKEEK 90
++ ++++ + EK
Sbjct: 129 EEDKQEEPVEIMEK 142
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 28.9 bits (65), Expect = 2.6
Identities = 24/92 (26%), Positives = 57/92 (61%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K+K+KE E ++ + + + E+ +K +++ +E EE KE+ +E +K+ + E K+
Sbjct: 196 KEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKR 255
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNK 110
K E+K ++ EE+ ++ K+E ++ ++K ++ +
Sbjct: 256 KLEEKIRELEERIEELKKEIEELEEKVKELKE 287
Score = 28.1 bits (63), Expect = 4.1
Identities = 25/101 (24%), Positives = 58/101 (57%)
Query: 9 GAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKK 68
+ + + ++ K+E E +K+ + E ++K EEK ++ +E+ EE +++ E+ +E+ K
Sbjct: 224 EKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVK 283
Query: 69 QKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
+ KE ++ ++ K + EE + +E +K+ + EE+ N
Sbjct: 284 ELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEIN 324
Score = 27.7 bits (62), Expect = 4.9
Identities = 29/94 (30%), Positives = 55/94 (58%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
+ E + ++ EK+ + EE+ +E+ +E EEK+E+ +E KK+ KE EK ++ E
Sbjct: 299 SEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELE 358
Query: 82 KKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
++ + EE K KK+E ++ KK+ +K E++
Sbjct: 359 ERHELYEEAKAKKEELERLKKRLTGLTPEKLEKE 392
Score = 27.7 bits (62), Expect = 5.5
Identities = 24/102 (23%), Positives = 59/102 (57%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
+++ +E +K+++ +E KK+ K+ EK+ ++ E++ E E+ + KKEE ++ K+
Sbjct: 327 EERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTP 386
Query: 80 EEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGL 121
E+ +++ +E +K K++ E++ K K E + ++ +
Sbjct: 387 EKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAI 428
Score = 27.3 bits (61), Expect = 8.4
Identities = 20/99 (20%), Positives = 60/99 (60%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ ++ +K +++ E++ K+ EEK+++ +E K++ +E +++ +E +++ + E+
Sbjct: 308 ELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEA 367
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
K K+E+ ++ K+ E+ +++ ++ EK ++ EE+
Sbjct: 368 KAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEE 406
Score = 27.0 bits (60), Expect = 8.6
Identities = 28/97 (28%), Positives = 55/97 (56%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
++ E +K+ + E++ EE+ ++ +EK+E EE K+K KE +K+ +E E+
Sbjct: 302 YEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERH 361
Query: 78 KKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+ E+ + KKEE ++ KK +K EK+ ++ E+
Sbjct: 362 ELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEK 398
>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 28.2 bits (63), Expect = 2.6
Identities = 16/63 (25%), Positives = 30/63 (47%), Gaps = 3/63 (4%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE---EKDKKKEEKKQKKKEEK 90
+E K+ K +K + + + K + E ++ +EE DK E ++Q +KE
Sbjct: 129 EENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESS 188
Query: 91 KQK 93
K+K
Sbjct: 189 KKK 191
Score = 27.5 bits (61), Expect = 4.1
Identities = 14/66 (21%), Positives = 30/66 (45%), Gaps = 3/66 (4%)
Query: 45 KKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
++ KK K +K E + + +K + + +K +++ K K E +++ +K
Sbjct: 129 EENAKKGGKIDKMEADSTDLKARLRKAQILL---EGLQKNQEELFKLLDKYNELREQVQK 185
Query: 105 EEKKNK 110
E K K
Sbjct: 186 ESSKKK 191
Score = 27.1 bits (60), Expect = 5.6
Identities = 14/63 (22%), Positives = 30/63 (47%), Gaps = 2/63 (3%)
Query: 25 EENKKKQKKKEEKK--QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
EEN KK K ++ + + + +K + E ++ +E+ + + E + +KE
Sbjct: 129 EENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESS 188
Query: 83 KQK 85
K+K
Sbjct: 189 KKK 191
Score = 27.1 bits (60), Expect = 6.4
Identities = 13/63 (20%), Positives = 29/63 (46%)
Query: 57 EEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
EE +K + K + + K + K Q E ++ ++E K K E + + +E +
Sbjct: 129 EENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESS 188
Query: 117 RQQ 119
+++
Sbjct: 189 KKK 191
Score = 26.7 bits (59), Expect = 7.7
Identities = 14/64 (21%), Positives = 30/64 (46%), Gaps = 2/64 (3%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
KK K ++ + + + +K + + +K ++E K K E +++ ++E
Sbjct: 131 NAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEE--LFKLLDKYNELREQVQKESS 188
Query: 77 KKKE 80
KKKE
Sbjct: 189 KKKE 192
Score = 26.7 bits (59), Expect = 8.0
Identities = 13/64 (20%), Positives = 33/64 (51%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
EE +K + + + + + K +K + E +K +E+ K ++ + +++ +K+
Sbjct: 129 EENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESS 188
Query: 103 KKEE 106
KK+E
Sbjct: 189 KKKE 192
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 27.2 bits (61), Expect = 2.7
Identities = 9/24 (37%), Positives = 18/24 (75%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEE 74
+E+K+E+++EE+K++ EEE
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEEE 96
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 28.5 bits (64), Expect = 2.7
Identities = 23/90 (25%), Positives = 52/90 (57%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
+ E + K K+K K ++ ++K+ K+ E+E K+EK+ ++K + K+
Sbjct: 257 SAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIA 316
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
KE +++K +K+++++E+ +KKK + +K
Sbjct: 317 KEVAQKEKARARKKEQRKERGEKKKLKRRK 346
Score = 28.5 bits (64), Expect = 2.8
Identities = 19/79 (24%), Positives = 51/79 (64%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K + + K Q+ KE+++++ + E K++K+ KK+ + + K+ ++ +KE+ +
Sbjct: 268 PINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARA 327
Query: 77 KKKEEKKQKKKEEKKQKKK 95
+KKE++K++ +++K +++K
Sbjct: 328 RKKEQRKERGEKKKLKRRK 346
Score = 28.1 bits (63), Expect = 3.7
Identities = 23/101 (22%), Positives = 53/101 (52%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K+ K E++ ++ ++ +E+K +K E + + + EE + +EE + E +
Sbjct: 205 KEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFES 264
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+ E K K++ K + K+K++KE ++ K E+Q +++
Sbjct: 265 EYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKK 305
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 28.3 bits (64), Expect = 2.8
Identities = 27/88 (30%), Positives = 55/88 (62%), Gaps = 1/88 (1%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
KKE + K+E K + + ++E KE + E +++EK+ ++EE +K+E +KKEE
Sbjct: 50 LKKEALLEAKEEIHKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDES-LEKKEESL 108
Query: 92 QKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
++K++E ++++ E+K ++ EE +Q
Sbjct: 109 EEKEKELAARQQQLEEKEEELEELIEEQ 136
Score = 26.7 bits (60), Expect = 7.6
Identities = 23/91 (25%), Positives = 57/91 (62%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K + E E + K+++ E ++Q+K+ +K++ ++K+E EKKE+ EEK+++ + +
Sbjct: 63 HKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESLEKKEESLEEKEKELAARQQQL 122
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
+E++++ ++ ++Q+++ E+ EE K
Sbjct: 123 EEKEEELEELIEEQQQELERISGLTAEEAKE 153
>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
unknown].
Length = 622
Score = 28.6 bits (63), Expect = 2.8
Identities = 24/87 (27%), Positives = 45/87 (51%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ +++ ++E KK K + K K+E KK +K + E+ KEE + K +++D
Sbjct: 442 RERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHAIERTAASKEELELIKADDED 501
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKK 103
++ + K K +K K+ +K KKK
Sbjct: 502 DEQLDHFDMKSILKAEKFKKNRKLKKK 528
>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
Length = 567
Score = 28.7 bits (65), Expect = 2.8
Identities = 12/26 (46%), Positives = 20/26 (76%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKK 68
EE++++ +E E E++E+KKEEKK
Sbjct: 64 EEQRERLEELAPELLEEEEEKKEEKK 89
>gnl|CDD|233459 TIGR01541, tape_meas_lam_C, phage tail tape measure protein, lambda
family. This model represents a relatively
well-conserved region near the C-terminus of the tape
measure protein of a lambda and related phage. This
protein, which controls phage tail length, is typically
about 1000 residues in length. Both low-complexity
sequence and insertion/deletion events appear common in
this family. Mutational studies suggest a ruler or
template role in the determination of phage tail length.
Similar behavior is attributed to proteins from
distantly related or unrelated families in other phage
[Mobile and extrachromosomal element functions, Prophage
functions].
Length = 332
Score = 28.3 bits (63), Expect = 2.9
Identities = 13/66 (19%), Positives = 30/66 (45%), Gaps = 2/66 (3%)
Query: 37 KKQKKKEEKKQKKKEKKEEK--EEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
++ ++K KK +EK + + ++ K +K E+ ++K + KK E
Sbjct: 5 LLTQQIADRKLKKLNTADEKSLQSRSDEIIALIKLEKLLEEAERKALEALKKLAEATASI 64
Query: 95 KEEKKQ 100
+ + K+
Sbjct: 65 RAQNKR 70
Score = 27.9 bits (62), Expect = 4.2
Identities = 24/125 (19%), Positives = 44/125 (35%), Gaps = 19/125 (15%)
Query: 14 RIGKRKKKKKEEENKKK-QKKKEE-----KKQKKKEEKKQKKKEKKEEKEEKKEKKKEEK 67
+I RK KK ++K Q + +E K +K EE ++K E ++ E + +
Sbjct: 9 QIADRKLKKLNTADEKSLQSRSDEIIALIKLEKLLEEAERKALEALKKLAEATASIRAQN 68
Query: 68 KQ-------------KKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
K+ + + + +KQ++ K K KE+ KA
Sbjct: 69 KRQLDRFGLGDKQRERLDARLQIDRTFRKQQRDLNKAMTAKGLAGSDLYKEQLAAIKAAL 128
Query: 115 QNRQQ 119
Sbjct: 129 NEALA 133
>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 406
Score = 28.4 bits (64), Expect = 3.0
Identities = 10/74 (13%), Positives = 33/74 (44%), Gaps = 2/74 (2%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQK--KKKEEKK 108
E+ + +E + + Q+ + + K + +K+K +++ + + K+E +
Sbjct: 1 EELDALDEILGDLESYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDR 60
Query: 109 NKKAEEQNRQQGLY 122
++ E+ ++ L
Sbjct: 61 LREEIERLKEPPLI 74
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 27.7 bits (62), Expect = 3.0
Identities = 18/74 (24%), Positives = 44/74 (59%)
Query: 38 KQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEE 97
K + + ++K++EK+ EKEEK+ K+++EK+ K + +++E++ ++ ++ K +
Sbjct: 7 KSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARY 66
Query: 98 KKQKKKKEEKKNKK 111
+ EE K ++
Sbjct: 67 ADDEDYDEELKEQE 80
Score = 26.5 bits (59), Expect = 8.2
Identities = 25/98 (25%), Positives = 60/98 (61%), Gaps = 4/98 (4%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
RI ++K++E+E ++K+E++++++KE++ K +KEE+E++ E+ ++ K +
Sbjct: 10 RIIDIEEKREEKE----REKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLAR 65
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
D + +++ K++E + ++KK+K +KK K
Sbjct: 66 YADDEDYDEELKEQERWDDPMAQFLRKKKEKTDKKGKP 103
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 28.7 bits (64), Expect = 3.0
Identities = 18/97 (18%), Positives = 41/97 (42%), Gaps = 3/97 (3%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
E + + + + Q + E KQ + + +EK+ + E +++ ++ D +E+
Sbjct: 359 AERVHDLRSLECMLRYQSSQRELKQTEAAYCKRLDEKRLFEDEAEEEARQRLAD---DEE 415
Query: 83 KQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
+ + +E +++K Q KEE N Q
Sbjct: 416 EVRAGDEAREEKIAANSQVIDKEEVCNLYDRRDTAWQ 452
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 28.6 bits (65), Expect = 3.1
Identities = 22/111 (19%), Positives = 47/111 (42%), Gaps = 7/111 (6%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
+ + E+ ++++EE++E + ++ + E+ +K K KQ KK K Q+K
Sbjct: 181 DPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRKAQEK 240
Query: 95 KEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKR------TLIGYLRSVDQF 139
K E + + K+ K + + + + L K+ L + V
Sbjct: 241 KVEGRLAQHKKYAK-LREKLKEELKSLRLTSKQIDELVEQLRDINKRVRGQ 290
Score = 27.1 bits (61), Expect = 8.1
Identities = 21/91 (23%), Positives = 37/91 (40%), Gaps = 5/91 (5%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK-----KEEKKQKKKKE 105
EE + EE ++EE+++ + + E + +K K KQ KK
Sbjct: 176 PNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLR 235
Query: 106 EKKNKKAEEQNRQQGLYPNQKRTLIGYLRSV 136
+ + KK E + Q Y + L L+S+
Sbjct: 236 KAQEKKVEGRLAQHKKYAKLREKLKEELKSL 266
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 28.5 bits (63), Expect = 3.1
Identities = 24/90 (26%), Positives = 51/90 (56%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
E +++ Q+ + + +EE+K++ +E++EE EE + K E+K + ++ +KEEK+
Sbjct: 94 SEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKE 153
Query: 84 QKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
+ +EE+K K+ ++ + K K E
Sbjct: 154 PEPEEEEKPKRGSLEENNGEFMTHKLKHTE 183
Score = 27.7 bits (61), Expect = 5.3
Identities = 24/72 (33%), Positives = 52/72 (72%)
Query: 48 KKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
K+ EK ++K+++ + EE K+K+EE + +EE++++K+EE +K +EE+++++ KEE
Sbjct: 200 KEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEI 259
Query: 108 KNKKAEEQNRQQ 119
+ ++AE ++Q
Sbjct: 260 ERRRAEAAEKRQ 271
>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 379
Score = 28.4 bits (63), Expect = 3.2
Identities = 20/88 (22%), Positives = 30/88 (34%), Gaps = 2/88 (2%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKE-EKKQ 92
EE K + E + EK + EK + K + + KEE KK E K
Sbjct: 199 LEETKASEVEH-LSASLKTATEKVDALEKDLHAAQAKLDSGQALTKEELDAKKAELSKAL 257
Query: 93 KKKEEKKQKKKKEEKKNKKAEEQNRQQG 120
E + + ++ E R G
Sbjct: 258 AALEAANAADEDPQDRDAAVEAAARLMG 285
>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase. The model describes
alanine--tRNA ligase. This enzyme catalyzes the reaction
(tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
pyrophosphate + AMP) [Protein synthesis, tRNA
aminoacylation].
Length = 851
Score = 28.5 bits (64), Expect = 3.2
Identities = 20/79 (25%), Positives = 33/79 (41%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
K E KK ++ EE K +KE + K K ++ +E K+K + K E K
Sbjct: 718 KVTPNELPKKIERFFEEWKALQKELESLKLKIADQKIDELKDKAETINGVKVLVEVVDAK 777
Query: 80 EEKKQKKKEEKKQKKKEEK 98
+ K K ++ + K
Sbjct: 778 DMKSLKTMADRLKSKLGSA 796
>gnl|CDD|218397 pfam05044, Prox1, Homeobox prospero-like protein (PROX1). The
homeobox gene Prox1 is expressed in a subpopulation of
endothelial cells that, after budding from veins, gives
rise to the mammalian lymphatic system. Prox1 has been
found to be an early specific marker for the developing
liver and pancreas in the mammalian foregut endoderm.
This family contains an atypical homeobox domain.
Length = 908
Score = 28.5 bits (63), Expect = 3.2
Identities = 13/47 (27%), Positives = 32/47 (68%)
Query: 26 ENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
ENK+KQ ++++Q + +K++K+EE+ + K++ ++ +KQ ++
Sbjct: 271 ENKRKQMLPQQQQQSFDQLVSPRKEQKREERRQLKQQLRDMQKQLRQ 317
>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
This family consists of several eukaryotic ribosome
biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
protein that is essential for the maturation of 25 S
rRNA and the 60 S ribosomal subunit assembly in
Saccharomyces cerevisiae.
Length = 164
Score = 27.6 bits (62), Expect = 3.2
Identities = 26/89 (29%), Positives = 42/89 (47%), Gaps = 5/89 (5%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
K K K EK KKK KK+KK + ++ + + K+ ++ + E K
Sbjct: 76 KPKPPTKWEKFAKKKGIKKRKKSKLVYDEATGEWVPRWGYKRANKDLEKDWIIEVKPNDD 135
Query: 88 E-----EKKQKKKEEKKQKKKKEEKKNKK 111
KK+++K+E+ K +K E KNKK
Sbjct: 136 PKEDPFAKKRREKKERVAKNEKRELKNKK 164
>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 227 and
387 amino acids in length.
Length = 213
Score = 28.1 bits (63), Expect = 3.3
Identities = 6/41 (14%), Positives = 14/41 (34%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
EE+ + E + E + +++ E+K
Sbjct: 115 EEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEKK 155
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 28.5 bits (63), Expect = 3.4
Identities = 24/83 (28%), Positives = 38/83 (45%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
K EEE + ++ EE ++ KKE ++ E EEK + E+ K + K+
Sbjct: 239 KVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHDVKDRNIAGGFS 298
Query: 81 EKKQKKKEEKKQKKKEEKKQKKK 103
E K KK +K E+K+ K
Sbjct: 299 EFFSKLNPFKKDEKIEKKEISKI 321
Score = 27.0 bits (59), Expect = 9.3
Identities = 12/42 (28%), Positives = 25/42 (59%)
Query: 68 KQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
+++E E D++ EE ++ KKE ++ E ++K K +E+
Sbjct: 241 AEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIK 282
>gnl|CDD|234961 PRK01581, speE, spermidine synthase; Validated.
Length = 374
Score = 28.4 bits (63), Expect = 3.4
Identities = 13/49 (26%), Positives = 29/49 (59%)
Query: 52 KKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQ 100
KK+++ E K E ++Q E ++DK+ ++ KQ + + + K++ + Q
Sbjct: 19 KKDKRSELDSNKFELEQQLVENKQDKQGKQYKQDRGIQYAETKQDNQVQ 67
Score = 28.0 bits (62), Expect = 3.9
Identities = 10/49 (20%), Positives = 29/49 (59%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEE 97
KK+K+ E + K + +++ + K++++ K+ ++ + + E KQ + +
Sbjct: 19 KKDKRSELDSNKFELEQQLVENKQDKQGKQYKQDRGIQYAETKQDNQVQ 67
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 27.7 bits (62), Expect = 3.4
Identities = 20/68 (29%), Positives = 44/68 (64%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
+K+QK +E E+ E+ EK +EE+ +++E E++ + EE++Q+ E + Q ++ + K +
Sbjct: 3 DKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRA 62
Query: 104 KEEKKNKK 111
+ E +N +
Sbjct: 63 QAEFENLR 70
Score = 26.9 bits (60), Expect = 7.2
Identities = 22/85 (25%), Positives = 51/85 (60%), Gaps = 3/85 (3%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
K++K EE ++ EE ++ ++E+ EE+E ++E + EE++Q+ E + + ++ K K
Sbjct: 2 SDKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAE---LEAQLEELKDK 58
Query: 88 EEKKQKKKEEKKQKKKKEEKKNKKA 112
+ Q + E +++ ++E ++ KK
Sbjct: 59 YLRAQAEFENLRKRTEREREEAKKY 83
>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
[DNA replication, recombination, and repair].
Length = 663
Score = 28.3 bits (64), Expect = 3.5
Identities = 24/95 (25%), Positives = 50/95 (52%), Gaps = 10/95 (10%)
Query: 17 KRKKKKKEEENKK--------KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKK 68
+R+++ + N++ K+K ++ + +E++ + K EKK K KKE +K KK
Sbjct: 569 ERRREIQMAYNEEHGITPQTIKKKIRDILDGEYEEDEYKAKIEKKASKMSKKELEKLIKK 628
Query: 69 QKKEEEKDKKKE--EKKQKKKEEKKQKKKEEKKQK 101
+KE ++ K E+ + ++E K+ K+E +
Sbjct: 629 LEKEMKEAAKNLEFEEAARLRDEIKELKEELLGKS 663
Score = 27.6 bits (62), Expect = 6.1
Identities = 17/74 (22%), Positives = 41/74 (55%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKE 80
+K +E +++++ + ++ + KK+ ++ + + E+ + + K +K+ K KKE
Sbjct: 562 QKAIDETERRREIQMAYNEEHGITPQTIKKKIRDILDGEYEEDEYKAKIEKKASKMSKKE 621
Query: 81 EKKQKKKEEKKQKK 94
+K KK EK+ K+
Sbjct: 622 LEKLIKKLEKEMKE 635
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 28.0 bits (62), Expect = 3.5
Identities = 21/82 (25%), Positives = 39/82 (47%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
+ K + K+E ++ KEE +++ K + K E++ EK++ + D EE K+
Sbjct: 2 DPEKDLEDLKKEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKR 61
Query: 85 KKKEEKKQKKKEEKKQKKKKEE 106
+ K K KQK++ E
Sbjct: 62 RAAAAAKAKAAALAKQKREGTE 83
Score = 27.6 bits (61), Expect = 5.7
Identities = 23/90 (25%), Positives = 39/90 (43%), Gaps = 3/90 (3%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE 89
K++ K K KQK++ +E EE+K K K + K K KQK++
Sbjct: 60 KRRAAAAAKAKAAALAKQKREGTEEVTEEEKAKAK---AKAAAAAKAKAAALAKQKREGT 116
Query: 90 KKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
++ ++E+ K K KA +Q+
Sbjct: 117 EEVTEEEKAAAKAKAAAAAKAKAAALAKQK 146
Score = 27.6 bits (61), Expect = 6.0
Identities = 26/90 (28%), Positives = 42/90 (46%), Gaps = 3/90 (3%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
EE+ K K K K K KQK++ +E EE+K K + K K K
Sbjct: 88 EEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAA---KAKAAAAAKAKAAALAK 144
Query: 84 QKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
QK++ ++ ++EE+ K+K + K A+
Sbjct: 145 QKREGTEEVTEEEEETDKEKAKAKAAAAAK 174
>gnl|CDD|203489 pfam06644, ATP11, ATP11 protein. This family consists of several
eukaryotic ATP11 proteins. In Saccharomyces cerevisiae,
expression of functional F1-ATPase requires two proteins
encoded by the ATP11 and ATP12 genes. Atp11p is a
molecular chaperone of the mitochondrial matrix that
participates in the biogenesis pathway to form F1, the
catalytic unit of the ATP synthase.
Length = 250
Score = 28.1 bits (63), Expect = 3.6
Identities = 20/74 (27%), Positives = 33/74 (44%), Gaps = 1/74 (1%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
EK + K +K ++ + E+ +K K K EKK + +K K+ K + K K
Sbjct: 1 YEKYRSKLLQKAKESGLEFIERLKKALKDKIEKK-EFSAKKPPTGPSKQASKFKTLKPPK 59
Query: 95 KEEKKQKKKKEEKK 108
+KK+ K K
Sbjct: 60 PADKKKPFDKPFKP 73
Score = 27.7 bits (62), Expect = 4.5
Identities = 15/66 (22%), Positives = 28/66 (42%)
Query: 54 EEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAE 113
E+ K +K +E + E K ++K +KK+ K+ KQ K + K K
Sbjct: 2 EKYRSKLLQKAKESGLEFIERLKKALKDKIEKKEFSAKKPPTGPSKQASKFKTLKPPKPA 61
Query: 114 EQNRQQ 119
++ +
Sbjct: 62 DKKKPF 67
>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
function prediction only].
Length = 386
Score = 27.9 bits (62), Expect = 3.7
Identities = 10/32 (31%), Positives = 17/32 (53%)
Query: 76 DKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
KK E +K+K ++K + KK KK + +
Sbjct: 350 YPKKAEAARKRKGDRKGVSHKAKKGGKKNQAE 381
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 27.3 bits (61), Expect = 3.8
Identities = 21/41 (51%), Positives = 30/41 (73%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKK 63
EE+ KQ KE+ KQKK+++KK+KKK+KK K+ K+KK
Sbjct: 95 VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
>gnl|CDD|235246 PRK04184, PRK04184, DNA topoisomerase VI subunit B; Validated.
Length = 535
Score = 27.9 bits (63), Expect = 3.8
Identities = 25/76 (32%), Positives = 40/76 (52%), Gaps = 3/76 (3%)
Query: 19 KKKKKEEENKKKQKKKE---EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK 75
+K+KEEE KKK K E + +K E +K++ EE K KK E+++ +EEE+
Sbjct: 460 SRKRKEEEAKKKAKTFEKYIPEIARKLAEILEKEEPDIEEVLAKLIGKKLEEEEVEEEEE 519
Query: 76 DKKKEEKKQKKKEEKK 91
++ + K EK
Sbjct: 520 EEAVVVESAKNYTEKD 535
Score = 27.5 bits (62), Expect = 5.1
Identities = 20/80 (25%), Positives = 43/80 (53%), Gaps = 6/80 (7%)
Query: 41 KKEEKKQKKKEKKEEKEEKKEK------KKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
KK +++K+E+ ++K + EK +K + +KEE ++ K KK E+++ +
Sbjct: 456 KKYLSRKRKEEEAKKKAKTFEKYIPEIARKLAEILEKEEPDIEEVLAKLIGKKLEEEEVE 515
Query: 95 KEEKKQKKKKEEKKNKKAEE 114
+EE+++ E KN ++
Sbjct: 516 EEEEEEAVVVESAKNYTEKD 535
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 27.9 bits (62), Expect = 3.8
Identities = 20/95 (21%), Positives = 54/95 (56%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
I + + + +E + K +++ Q++ E+ + K ++K+++ EE +++ EE
Sbjct: 110 ILEIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELEKELEEL 169
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
+D+K +K +++EKK++ +E K++ ++ E+
Sbjct: 170 EDEKDLLEKLLEEKEKKKELEELKEELEELLEELE 204
Score = 26.8 bits (59), Expect = 7.0
Identities = 21/98 (21%), Positives = 57/98 (58%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEK 67
K +I + + ++ +E K K+ +E+ ++ K K++EK+E+ E+ ++E
Sbjct: 107 KKSILEIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELEKEL 166
Query: 68 KQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKE 105
++ ++E+ +K ++++KK+E ++ K+E ++ ++ E
Sbjct: 167 EELEDEKDLLEKLLEEKEKKKELEELKEELEELLEELE 204
>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
structure and dynamics].
Length = 395
Score = 28.2 bits (63), Expect = 3.8
Identities = 18/83 (21%), Positives = 33/83 (39%), Gaps = 3/83 (3%)
Query: 12 SPRIGKRKKKKKEEENKKKQKKK--EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQ 69
I K+KK+ E+ KK++K K + + + + E+E ++ + K Q
Sbjct: 60 GAAISIPKRKKQTEKGKKEKKPKVSDRMDLDNENVQLEMLYSISNEREIRQLRFGGSKVQ 119
Query: 70 K-KEEEKDKKKEEKKQKKKEEKK 91
E + K E K E +
Sbjct: 120 NPHEGARVKNINEIKLGNYEIEP 142
Score = 26.7 bits (59), Expect = 9.4
Identities = 14/74 (18%), Positives = 31/74 (41%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKK 91
K+K++ ++ KKEKK + ++ + E + + ++E ++ + K
Sbjct: 59 LGAAISIPKRKKQTEKGKKEKKPKVSDRMDLDNENVQLEMLYSISNEREIRQLRFGGSKV 118
Query: 92 QKKKEEKKQKKKKE 105
Q E + K E
Sbjct: 119 QNPHEGARVKNINE 132
>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 879
Score = 28.0 bits (63), Expect = 3.9
Identities = 14/47 (29%), Positives = 22/47 (46%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEK 67
K E K ++ EE K+ +KE ++ KKK E + K E+
Sbjct: 725 KVPPELLPKVERLLEELKELEKELERLKKKLAAAELADLLSNAKAEE 771
Score = 27.6 bits (62), Expect = 5.8
Identities = 13/47 (27%), Positives = 22/47 (46%)
Query: 20 KKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEE 66
K E K ++ +E K+ +K+ E+ +KK E + K EE
Sbjct: 725 KVPPELLPKVERLLEELKELEKELERLKKKLAAAELADLLSNAKAEE 771
Score = 26.8 bits (60), Expect = 9.7
Identities = 10/61 (16%), Positives = 29/61 (47%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
+E+E+ K+ K + K ++ E+ +E E++ E+ K++ + + + +
Sbjct: 711 QEQEDLLKEIASLLKVPPELLPKVERLLEELKELEKELERLKKKLAAAELADLLSNAKAE 770
Query: 83 K 83
+
Sbjct: 771 E 771
>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN. FtsN is a poorly
conserved protein active in cell division in a number of
Proteobacteria. The N-terminal 30 residue region tends
to by Lys/Arg-rich, and is followed by a
membrane-spanning region. This is followed by an acidic
low-complexity region of variable length and a
well-conserved C-terminal domain of two tandem regions
matched by pfam05036 (Sporulation related repeat), found
in several cell division and sporulation proteins. The
role of FtsN as a suppressor for other cell division
mutations is poorly understood; it may involve cell wall
hydrolysis [Cellular processes, Cell division].
Length = 298
Score = 27.7 bits (61), Expect = 4.0
Identities = 22/92 (23%), Positives = 41/92 (44%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
E + +++ +KK ++ + + +K + EK K +E KQK++ + E +
Sbjct: 140 PSEQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQKQKALPKQTAETQS 199
Query: 84 QKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
K E K + K K K +EK + A Q
Sbjct: 200 NSKPIETAPKADKADKTKPKPKEKAERAAALQ 231
Score = 27.0 bits (59), Expect = 7.2
Identities = 20/102 (19%), Positives = 44/102 (43%)
Query: 3 ISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEK 62
I+ + + + + + E+ +Q + + + +K ++ E ++ +
Sbjct: 96 INDPEEPSNGGGVEESAQLTAEQRQLLEQMQADMRAAEKVLATAPSEQTVAVEARKQTAE 155
Query: 63 KKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
KK +K + E +K + EK K +E KQK+K KQ +
Sbjct: 156 KKPQKARTAEAQKTPVETEKIASKVKEAKQKQKALPKQTAET 197
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 28.0 bits (63), Expect = 4.1
Identities = 20/64 (31%), Positives = 38/64 (59%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
E++ KKEK E +E+ E ++E + +EE D +++ K +K + Q+ KEE +Q +
Sbjct: 420 EREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQQIKEEIEQVRL 479
Query: 104 KEEK 107
+ E+
Sbjct: 480 ELEQ 483
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 27.3 bits (61), Expect = 4.1
Identities = 18/76 (23%), Positives = 44/76 (57%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
E K++ + + +++ EE +++ E E+ +++ E+ EE K + EEE ++ KE +
Sbjct: 50 ERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELERIKEAAEA 109
Query: 85 KKKEEKKQKKKEEKKQ 100
+ + EK++ +E + +
Sbjct: 110 EIEAEKERALEELRAE 125
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 26.6 bits (59), Expect = 4.1
Identities = 20/72 (27%), Positives = 35/72 (48%)
Query: 36 EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKK 95
KK K K E K+K + +++++ K K KE K +K + + +K + K+ KK
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61
Query: 96 EEKKQKKKKEEK 107
+ KK K +
Sbjct: 62 DCKKGCKWEGNT 73
Score = 25.9 bits (57), Expect = 8.0
Identities = 24/74 (32%), Positives = 34/74 (45%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKK 87
KK K K E K+K + K++ K + KE K +K Q E +K + K+ KK
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61
Query: 88 EEKKQKKKEEKKQK 101
+ KK K E K
Sbjct: 62 DCKKGCKWEGNTCK 75
>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
Length = 815
Score = 28.2 bits (63), Expect = 4.3
Identities = 19/60 (31%), Positives = 31/60 (51%)
Query: 38 KQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEE 97
++ K +KQK + K+ E E+ +K K+EKKQ + K E K+ EE + +E
Sbjct: 15 EEGKGGTRKQKAENKEHEGEQSPKKAKKEKKQDDSGNGNGKSAEDAVKEFEEFCKAIEEH 74
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 27.4 bits (61), Expect = 4.4
Identities = 25/66 (37%), Positives = 42/66 (63%)
Query: 39 QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEK 98
++ ++E+K+ KE +E++ K +K+K+EKK+KK EK KK+ K K KK KK
Sbjct: 93 EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTT 152
Query: 99 KQKKKK 104
K+++ K
Sbjct: 153 KKEEGK 158
Score = 27.0 bits (60), Expect = 4.7
Identities = 24/66 (36%), Positives = 42/66 (63%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEK 90
++ ++E+K+ KE ++++ + K+EK+EKKEKK EK KK+ K K KK KK
Sbjct: 93 EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTT 152
Query: 91 KQKKKE 96
K+++ +
Sbjct: 153 KKEEGK 158
Score = 26.6 bits (59), Expect = 7.2
Identities = 27/66 (40%), Positives = 42/66 (63%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
EE ++++KE +E EEK+ K +++K++K+E+K +K KK+ K K KK KK
Sbjct: 93 EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTT 152
Query: 103 KKEEKK 108
KKEE K
Sbjct: 153 KKEEGK 158
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 27.1 bits (61), Expect = 4.5
Identities = 13/35 (37%), Positives = 26/35 (74%)
Query: 62 KKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKE 96
K+K++KK+KK+E + ++++KK EE+K+ +K
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
Score = 26.4 bits (59), Expect = 8.1
Identities = 15/34 (44%), Positives = 25/34 (73%)
Query: 32 KKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE 65
+KK++KK+KK+ E ++KK+K EE+KE +K
Sbjct: 83 RKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
Score = 26.4 bits (59), Expect = 8.6
Identities = 13/39 (33%), Positives = 27/39 (69%)
Query: 40 KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K+K++KK+KKKE + +E+KK+ E++++ ++ D
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDAA 120
Score = 26.4 bits (59), Expect = 9.4
Identities = 12/35 (34%), Positives = 25/35 (71%)
Query: 70 KKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
K++++K KKK+E + ++++KK EE+K+ +K
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
>gnl|CDD|113815 pfam05058, ActA, ActA Protein. The ActA family is found in
Listeria and is associated with motility. ActA protein
acts as a scaffold to assemble and activate host cell
actin cytoskeletal factors at the bacterial surface,
resulting in directional actin polymerisation and
propulsion of the bacterium through the cytoplasm of the
host cell.
Length = 601
Score = 27.9 bits (61), Expect = 4.6
Identities = 28/104 (26%), Positives = 46/104 (44%), Gaps = 9/104 (8%)
Query: 3 ISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQK---------KKEEKKQKKKEKK 53
+ IAPK A P ++ E + +K+ E KQ +KE ++ K+E K
Sbjct: 466 VKIAPKLAELPATKPQETAIGENKAPFIEKQAETNKQPIDMPSLPVIQKEATERDKEEMK 525
Query: 54 EEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEE 97
+ EEK ++ E +++ EE K K + +K KEE
Sbjct: 526 PQTEEKMVEESEPANDANGKKRSAGIEEGKLIAKSAEDEKAKEE 569
>gnl|CDD|227708 COG5421, COG5421, Transposase [DNA replication, recombination, and
repair].
Length = 480
Score = 27.8 bits (62), Expect = 4.9
Identities = 22/80 (27%), Positives = 44/80 (55%), Gaps = 1/80 (1%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK-KEEKKQKKKE 88
K +KKEE+ + + EK+ +K EK EK + +E E+ + E+ K + +
Sbjct: 277 KLQKKEEQTLRTRIEKELEKAEKSLEKLKGREFNCEKDARIAAEKILKDYSSVEFLEVDF 336
Query: 89 EKKQKKKEEKKQKKKKEEKK 108
+ K+K++E K+ + +K+E+
Sbjct: 337 KSKKKREEAKRGRPRKDEEL 356
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 27.3 bits (61), Expect = 5.0
Identities = 15/69 (21%), Positives = 32/69 (46%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
KK+K +E + K+ + K+K ++K++ ++ E ++ K + E + K K
Sbjct: 5 KKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPG 64
Query: 79 KEEKKQKKK 87
K + K
Sbjct: 65 KFYVPAEHK 73
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 28.0 bits (62), Expect = 5.0
Identities = 15/84 (17%), Positives = 38/84 (45%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
N +K +E +++ + EK+ + EK + E + +E E +++ E+ + +
Sbjct: 337 NNEKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDEN 396
Query: 87 KEEKKQKKKEEKKQKKKKEEKKNK 110
K+ +E + + ++KN
Sbjct: 397 DHSKRICDDDELENHFRAADEKNS 420
>gnl|CDD|153366 cd07682, F-BAR_srGAP2, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Protein 2. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs. srGAP2 is expressed in zones
of neuronal differentiation. It plays a role in the
regeneration of neurons and axons. srGAP2 contains an
N-terminal F-BAR domain, a Rho GAP domain, and a
C-terminal SH3 domain. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
Length = 263
Score = 27.3 bits (60), Expect = 5.3
Identities = 20/66 (30%), Positives = 39/66 (59%), Gaps = 2/66 (3%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEK--DKKKEEKKQKKKEEKKQKKKEEKKQK 101
+ K K+ EK+EEK+ + ++E+++ + + + + EEK ++ KK +K +EK+Q
Sbjct: 153 QSKLKEAEKQEEKQMSRSVRQEDRQTPRSPDSTTNIRIEEKHVRRSSVKKIEKMKEKRQA 212
Query: 102 KKKEEK 107
K E K
Sbjct: 213 KYTENK 218
>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV.
Length = 336
Score = 27.5 bits (61), Expect = 5.4
Identities = 13/32 (40%), Positives = 22/32 (68%)
Query: 30 KQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKE 61
+ K E K +K K KK+KK+E+K+E +++ E
Sbjct: 22 RPVKDEAKPRKIKRVKKRKKREEKDELDDEVE 53
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 27.6 bits (61), Expect = 5.5
Identities = 24/91 (26%), Positives = 45/91 (49%), Gaps = 1/91 (1%)
Query: 26 ENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQK 85
EN+ QK+ E K+ + + E+K + E+E + + E + K EE +KK E + +
Sbjct: 74 ENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEV-RLKALEELEKKAENEAAE 132
Query: 86 KKEEKKQKKKEEKKQKKKKEEKKNKKAEEQN 116
+EE K K + + K + +K + +E
Sbjct: 133 AEEEAKLLKDKLDAESLKLQNEKEDQLKEAK 163
>gnl|CDD|215456 PLN02850, PLN02850, aspartate-tRNA ligase.
Length = 530
Score = 27.7 bits (62), Expect = 5.5
Identities = 11/39 (28%), Positives = 16/39 (41%)
Query: 51 EKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE 89
E+ EK KK KK K +K + K + E+
Sbjct: 7 EESGEKISKKAAKKAAAKAEKLRREATAKAAAASLEDED 45
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 27.5 bits (61), Expect = 5.5
Identities = 23/120 (19%), Positives = 52/120 (43%), Gaps = 4/120 (3%)
Query: 1 MKISIAPKGAISPRI-GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEK 59
+KI +A A + +++ ++ E +Q +EEK Q ++E + K E +E +
Sbjct: 267 VKIILAETEAEVAAWKAETRREAEQAEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGL 326
Query: 60 KEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
E++KE + + +E Q++ +E+ + + + E + A E +
Sbjct: 327 IERQKETELEPQERSYFI---NAAQRQAQEEAKAAANIAEAIGAQAEAAVETARETEEAE 383
Score = 27.1 bits (60), Expect = 8.7
Identities = 22/114 (19%), Positives = 50/114 (43%), Gaps = 5/114 (4%)
Query: 22 KKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEK-----KEKKKEEKKQKKEEEKD 76
+KE E + ++ K + + E++ K ++ +E K E + K + E +
Sbjct: 231 EKETEIAIAEANRDAKLVELEVEQQPAGKTAEQTREVKIILAETEAEVAAWKAETRREAE 290
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQKRTLI 130
+ + +Q +EEK Q ++E + K + + E+ ++ L P ++ I
Sbjct: 291 QAEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGLIERQKETELEPQERSYFI 344
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 27.6 bits (62), Expect = 5.7
Identities = 18/55 (32%), Positives = 38/55 (69%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEK 98
EK K EK EEK E++E+++ + +++EEE+D++++++K ++ K + +EK
Sbjct: 7 EKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEK 61
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 27.5 bits (62), Expect = 5.7
Identities = 9/32 (28%), Positives = 21/32 (65%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+E + + +EE++E+++EE+++ EEE
Sbjct: 289 KEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 26.8 bits (60), Expect = 8.5
Identities = 10/42 (23%), Positives = 25/42 (59%)
Query: 40 KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
K+ ++ KE + + ++EE+++++EEE++ +EE
Sbjct: 279 ADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 26.9 bits (60), Expect = 5.8
Identities = 20/66 (30%), Positives = 42/66 (63%)
Query: 45 KKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKK 104
KK K +KE EK++KKK ++ KK + + E+ +++ E++K+ +K +K+ K++
Sbjct: 65 KKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKFKRR 124
Query: 105 EEKKNK 110
+++K K
Sbjct: 125 QKEKEK 130
>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH. This
bacillus clade of FliH proteins is not found by the Pfam
FliH model pfam02108, but is closely related to the
sequences identified by that model. Sequences identified
by this model are observed in flagellar operons in an
analogous position relative to other flagellar operon
genes.
Length = 255
Score = 27.5 bits (61), Expect = 5.9
Identities = 16/82 (19%), Positives = 40/82 (48%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+ E+ ++++ E+ +K E + + E+ E + ++ E+++ + EEE+++
Sbjct: 25 VTDVPETEQELADEEQEFEQILEKAEAEAAQIIEQAEAQAAAIREQIEQERAQWEEERER 84
Query: 78 KKEEKKQKKKEEKKQKKKEEKK 99
+E KQ+ E Q + E
Sbjct: 85 LIQEAKQEGYEAGFQAGESEAL 106
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 27.7 bits (61), Expect = 6.0
Identities = 6/61 (9%), Positives = 12/61 (19%), Gaps = 1/61 (1%)
Query: 9 GAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKE-KKEEKEEKKEKKKEEK 67
G + IG E ++ E + + + K
Sbjct: 194 GTVLAIIGDANAAPAEPAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAP 253
Query: 68 K 68
Sbjct: 254 A 254
Score = 26.9 bits (59), Expect = 9.0
Identities = 5/44 (11%), Positives = 7/44 (15%)
Query: 53 KEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKE 96
+EE E E + K
Sbjct: 212 EEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAA 255
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 27.7 bits (61), Expect = 6.1
Identities = 24/98 (24%), Positives = 40/98 (40%), Gaps = 6/98 (6%)
Query: 8 KGAISPRIGKRKKKKKEEENKKKQKKKEEKKQK------KKEEKKQKKKEKKEEKEEKKE 61
K + P+ G + + KE E KK +E K KK E + K K+ +E KK
Sbjct: 538 KESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKP 597
Query: 62 KKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKK 99
K+ ++ + K E K ++ + K K+
Sbjct: 598 KRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKR 635
>gnl|CDD|224739 COG1826, TatA, Sec-independent protein secretion pathway components
[Intracellular trafficking and secretion].
Length = 94
Score = 26.2 bits (58), Expect = 6.3
Identities = 14/55 (25%), Positives = 28/55 (50%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
KK + K E E+ K E+ K++E + + K++ + + K++ KK+ K
Sbjct: 40 KKAASDVKNELDEELKLEELDDKKKELTAELQATKEELDQLASELKEDLKKKAKP 94
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 27.5 bits (61), Expect = 6.3
Identities = 19/84 (22%), Positives = 41/84 (48%)
Query: 35 EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKK 94
E+ + EE+ +++ E E + ++E+++ +E + K E K++K
Sbjct: 3 EQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREK 62
Query: 95 KEEKKQKKKKEEKKNKKAEEQNRQ 118
K+ KK+K+++ K E+QN
Sbjct: 63 ARLKELKKQKKQEIQKILEQQNAA 86
>gnl|CDD|153368 cd07684, F-BAR_srGAP3, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Protein 3. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs. srGAP3, also called MEGAP
(MEntal disorder associated GTPase-Activating Protein),
is a Rho GAP with activity towards Rac1 and Cdc42. It
impacts cell migration by regulating actin and
microtubule cytoskeletal dynamics. The association
between srGAP3 haploinsufficiency and mental retardation
is under debate. srGAP3 contains an N-terminal F-BAR
domain, a Rho GAP domain, and a C-terminal SH3 domain.
F-BAR domains form banana-shaped dimers with a
positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 253
Score = 27.4 bits (60), Expect = 6.4
Identities = 22/61 (36%), Positives = 31/61 (50%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
E K K+ EK+EEK+ K + EE ++ KK +K +EK+Q K E K K
Sbjct: 153 ESKLKEAEKQEEKQFNKSGDISSNLLRHEERPQRRSSVKKIEKMKEKRQAKYSENKLKCT 212
Query: 104 K 104
K
Sbjct: 213 K 213
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 27.6 bits (61), Expect = 6.4
Identities = 15/104 (14%), Positives = 46/104 (44%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
K K E + Q ++K+ +KE K ++ ++ ++ +
Sbjct: 188 KKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQ 247
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQ 118
K + +EE+ +K++ K+ + + E+ + ++ ++ ++ + R+
Sbjct: 248 KREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARK 291
>gnl|CDD|181442 PRK08476, PRK08476, F0F1 ATP synthase subunit B'; Validated.
Length = 141
Score = 26.6 bits (59), Expect = 6.5
Identities = 17/69 (24%), Positives = 37/69 (53%)
Query: 23 KEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEK 82
+EE NK +QK + K++ +++ + KK E + + E ++ +K++ KE+ + E K
Sbjct: 72 REEANKIRQKAIAKAKEEAEKKIEAKKAELESKYEAFAKQLANQKQELKEQLLSQMPEFK 131
Query: 83 KQKKKEEKK 91
+ + K
Sbjct: 132 EALNAKLSK 140
>gnl|CDD|221283 pfam11875, DUF3395, Domain of unknown function (DUF3395). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 147 to 176 amino acids in length. This domain
is found associated with pfam00226.
Length = 144
Score = 26.5 bits (59), Expect = 6.5
Identities = 17/49 (34%), Positives = 29/49 (59%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
K QKKKE+++Q++K + + KK+++ E E+K +EEEK
Sbjct: 3 NKAQKKKEQERQREKNKTEIAKKKQEAESAVSLMGDVVERKLTREEEKG 51
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This
family includes outer membrane proteins such as OmpH
among others. Skp (OmpH) has been characterized as a
molecular chaperone that interacts with unfolded
proteins as they emerge in the periplasm from the Sec
translocation machinery.
Length = 140
Score = 26.8 bits (60), Expect = 6.5
Identities = 23/76 (30%), Positives = 49/76 (64%)
Query: 18 RKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDK 77
+K+ +KE + ++ + +K EK+ +K +EK QK E +K++K+ +KK ++ + K +
Sbjct: 20 QKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQ 79
Query: 78 KKEEKKQKKKEEKKQK 93
K ++ QK+++E+ QK
Sbjct: 80 KLQQDLQKRQQEELQK 95
>gnl|CDD|234726 PRK00321, rdgC, recombination associated protein; Reviewed.
Length = 303
Score = 27.1 bits (61), Expect = 6.7
Identities = 18/43 (41%), Positives = 22/43 (51%), Gaps = 11/43 (25%)
Query: 41 KKEEK-------KQKKKEKKEEKEEK---KEKKKEEKKQKKEE 73
+KEEK KQ +EK E E + K KKE K + KEE
Sbjct: 68 RKEEKILPASVIKQALEEKVAEIEAEEGRKLGKKE-KDELKEE 109
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 27.2 bits (59), Expect = 6.9
Identities = 25/96 (26%), Positives = 44/96 (45%)
Query: 17 KRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKD 76
+ K+ E +K K +KK+E++ ++ K+ K E+ K K+E +K EK +++ +D
Sbjct: 133 RNKRNLMEHIHKVKNEKKKERQLAEQLAAKRLKDEQHRHKARKQELRKREKDRERARRED 192
Query: 77 KKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKA 112
++K KK KK K K A
Sbjct: 193 AAAAAAAKQKAAAKKAAAPSGKKSAKAAAPAKAAAA 228
Score = 26.8 bits (58), Expect = 9.6
Identities = 24/102 (23%), Positives = 53/102 (51%)
Query: 14 RIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEE 73
RI +R +K EE K + E K K + K+ E + K +KK+E++ ++
Sbjct: 100 RILRRLLRKYREEKKIDRHIYRELYVKAKGNVFRNKRNLMEHIHKVKNEKKKERQLAEQL 159
Query: 74 EKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+ K+E+ + K +++ +K+E+ +++ ++E+ A +Q
Sbjct: 160 AAKRLKDEQHRHKARKQELRKREKDRERARREDAAAAAAAKQ 201
>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b. This model
describes the F1/F0 ATP synthase b subunit in bacteria
only. Scoring just below the trusted cutoff are the
N-terminal domains of Mycobacterial b/delta fusion
proteins and a subunit from an archaeon, Methanosarcina
barkeri, in which the ATP synthase homolog differs in
architecture and is not experimentally confirmed. This
model helps resolve b from the related b' subunit.
Within the family is an example from a
sodium-translocating rather than proton-translocating
ATP synthase [Energy metabolism, ATP-proton motive force
interconversion].
Length = 147
Score = 26.6 bits (59), Expect = 7.1
Identities = 22/76 (28%), Positives = 42/76 (55%)
Query: 25 EENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQ 84
E KK+ ++K Q +E K + +E E ++ + EE K + EE++K K + +
Sbjct: 39 ERAKKEAALAQKKAQVILKEAKDEAQEIIENANKRGSEILEEAKAEAREEREKIKAQARA 98
Query: 85 KKKEEKKQKKKEEKKQ 100
+ + EK+Q ++E +KQ
Sbjct: 99 EIEAEKEQAREELRKQ 114
>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases.and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain.
Expression of a construct consisting of only the
N-terminal domain of Saccharomyces pombe Cdc37 results
in cellular viability. This indicates that interactions
with the cochaperone Hsp90 may not be essential for
Cdc37 function.
Length = 154
Score = 26.6 bits (59), Expect = 7.3
Identities = 18/78 (23%), Positives = 43/78 (55%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
NK+ K + ++++ + E E +++ +K+ EE E ++ K+ + + K
Sbjct: 62 NKRIDKLLKGLREEELSPETPTYNEMLAELQDQLKKELEEANGDSEGLLEELKKHRDKLK 121
Query: 87 KEEKKQKKKEEKKQKKKK 104
KE+K+ +KK ++ +K++K
Sbjct: 122 KEQKELRKKLDELEKEEK 139
>gnl|CDD|222863 PHA02540, 61, DNA primase; Provisional.
Length = 337
Score = 27.3 bits (61), Expect = 7.4
Identities = 14/53 (26%), Positives = 23/53 (43%)
Query: 34 KEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKK 86
+E ++ KE K + + K E K++KK +K E D E+ K
Sbjct: 80 REYIMERFKERGTGKGRPVPKPKFEFKKEKKVIEKLPFCERLDTLPEDHPIIK 132
>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
pre-ribosomes [Translation, ribosomal structure and
biogenesis / Intracellular trafficking and secretion].
Length = 657
Score = 27.3 bits (60), Expect = 7.4
Identities = 15/61 (24%), Positives = 28/61 (45%)
Query: 48 KKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEK 107
K K+ E+ E + + E D +++KQK EE+K + KQ+ E++
Sbjct: 79 KTKDNVVADVNNGEEFLSESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEKE 138
Query: 108 K 108
+
Sbjct: 139 R 139
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 27.3 bits (60), Expect = 7.6
Identities = 15/103 (14%), Positives = 51/103 (49%), Gaps = 2/103 (1%)
Query: 24 EEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKK 83
+ ++ + Q + + + +K+ + ++E + +K ++E++ ++E ++ K
Sbjct: 75 QLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAK 134
Query: 84 QKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQGLYPNQK 126
+++ + K+ ++ + + K ++ ++ E Q Q L +QK
Sbjct: 135 AQQELARLTKQAQDLQTRLKTLAEQRRQLEAQ--AQSLQASQK 175
>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB. Topo IB proteins include the
monomeric yeast and human topo I and heterodimeric topo
I from Leishmania donvanni. Topo I enzymes are divided
into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topos I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I have putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 217
Score = 26.8 bits (59), Expect = 7.8
Identities = 14/30 (46%), Positives = 20/30 (66%)
Query: 49 KKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K +EEKE+KK KEEK+ KK+E ++
Sbjct: 90 KNHLEEEKEKKKNLNKEEKEAKKKERAKRE 119
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is
functionally equivalent to L7/L12 in bacteria and the
P1 and P2 proteins in eukaryotes. L12p is homologous to
P1 and P2 but is not homologous to bacterial L7/L12. It
is located in the L12 stalk, with proteins L10, L11,
and 23S rRNA. L12p is the only protein in the ribosome
to occur as multimers, always appearing as sets of
dimers. Recent data indicate that most archaeal species
contain six copies of L12p (three homodimers), while
eukaryotes have four copies (two heterodimers), and
bacteria may have four or six copies (two or three
homodimers), depending on the species. The organization
of proteins within the stalk has been characterized
primarily in bacteria, where L7/L12 forms either two or
three homodimers and each homodimer binds to the
extended C-terminal helix of L10. L7/L12 is attached to
the ribosome through L10 and is the only ribosomal
protein that does not directly interact with rRNA.
Archaeal L12p is believed to function in a similar
fashion. However, hybrid ribosomes containing the large
subunit from E. coli with an archaeal stalk are able to
bind archaeal and eukaryotic elongation factors but not
bacterial elongation factors. In several mesophilic and
thermophilic archaeal species, the binding of 23S rRNA
to protein L11 and to the L10/L12p pentameric complex
was found to be temperature-dependent and cooperative.
Length = 106
Score = 25.9 bits (57), Expect = 7.8
Identities = 11/25 (44%), Positives = 22/25 (88%)
Query: 50 KEKKEEKEEKKEKKKEEKKQKKEEE 74
+++ +E+++EKKKEE+K+++EEE
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEE 97
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 26.8 bits (60), Expect = 7.8
Identities = 12/66 (18%), Positives = 31/66 (46%)
Query: 36 EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKK 95
K KEE+++ + +E E +KK + +E ++++ ++ +++ + K
Sbjct: 15 PKPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRRKRLAEMKA 74
Query: 96 EEKKQK 101
+K K
Sbjct: 75 LAEKSK 80
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 27.1 bits (60), Expect = 8.1
Identities = 10/37 (27%), Positives = 20/37 (54%)
Query: 28 KKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKK 64
+KK K K++ K + K KK+ K + ++ +K+
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784
Score = 27.1 bits (60), Expect = 9.2
Identities = 13/41 (31%), Positives = 21/41 (51%)
Query: 39 QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKK 79
K EKK K K+K + K K ++K + K ++ KK+
Sbjct: 743 PKTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783
>gnl|CDD|239571 cd03489, Topoisomer_IB_N_LdtopoI_like,
Topoisomer_IB_N_LdtopoI_like: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the heterodimeric topo I from
Leishmania donvanni. Topo I enzymes are divided into:
topo type IA (bacterial) and type IB (eukaryotic). Topo
I relaxes superhelical tension in duplex DNA by creating
a single-strand nick, the broken strand can then rotate
around the unbroken strand to remove DNA supercoils and,
the nick is religated, liberating topo I. These enzymes
regulate the topological changes that accompany DNA
replication, transcription and other nuclear processes.
Human topo I is the target of a diverse set of
anticancer drugs including camptothecins (CPTs). CPTs
bind to the topo I-DNA complex and inhibit re-ligation
of the single-strand nick, resulting in the accumulation
of topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topo I play putative roles in
organizing the kinetoplast DNA network unique to these
parasites. This family may represent more than one
structural domain.
Length = 212
Score = 26.8 bits (59), Expect = 8.1
Identities = 14/27 (51%), Positives = 20/27 (74%)
Query: 31 QKKKEEKKQKKKEEKKQKKKEKKEEKE 57
++KE+KK + KEEKK K+EK +E E
Sbjct: 92 LREKEKKKSRTKEEKKALKEEKDKEAE 118
>gnl|CDD|220303 pfam09592, DUF2031, Protein of unknown function (DUF2031). This
protein is expressed in Plasmodium; its function is
unknown. It may be the product of gene family pyst-b.
Length = 228
Score = 26.9 bits (60), Expect = 8.2
Identities = 24/85 (28%), Positives = 39/85 (45%), Gaps = 3/85 (3%)
Query: 22 KKEEENKKKQKKK---EEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKK 78
K+ EE KK+ K E Q ++++ KK E E + K+ E ++ E E ++
Sbjct: 119 KELEEVKKELDNKRNGELAIQPIQDKRIIKKDENSSVSEHEDFKQLENEENFLETEYNEI 178
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKK 103
KK + ++ KK EKK K
Sbjct: 179 TSSNNYKKLKINRKLKKAEKKLIKS 203
>gnl|CDD|227411 COG5079, SAC3, Nuclear protein export factor [Intracellular
trafficking and secretion / Cell division and chromosome
partitioning].
Length = 646
Score = 27.2 bits (60), Expect = 8.3
Identities = 7/67 (10%), Positives = 20/67 (29%)
Query: 40 KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKK 99
K+++ K + + +++ + + KK K+++ E K
Sbjct: 4 LGKQKRSHKARMQAPVSKDRFDLGMRVFMVLLFSSTLDTNPFELLKKLRAKERELLEAKS 63
Query: 100 QKKKKEE 106
Sbjct: 64 ANALPSA 70
>gnl|CDD|224403 COG1486, CelF, Alpha-galactosidases/6-phospho-beta-glucosidases,
family 4 of glycosyl hydrolases [Carbohydrate transport
and metabolism].
Length = 442
Score = 26.8 bits (60), Expect = 8.5
Identities = 14/51 (27%), Positives = 26/51 (50%)
Query: 58 EKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKK 108
+EKK + E+ + E +++KE + KK E K+K ++ EK+
Sbjct: 261 YYYPYDMKEKKPDEIEKFGTRANEVMKREKELFELYKKPELKEKPEELEKR 311
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 27.3 bits (61), Expect = 8.6
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 21 KKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKE-EKKQKK 71
+++ E NK K + K+E+K +E +K + +E+ +EKKEK E KK K+
Sbjct: 746 RERAELNKLKAQLKKERKGAMRELRKDNRFIAREKLKEKKEKDAEYHKKMKR 797
Score = 26.9 bits (60), Expect = 9.4
Identities = 30/123 (24%), Positives = 57/123 (46%), Gaps = 26/123 (21%)
Query: 18 RKKKKKE--EE--NKKKQKKKEEKKQKKKEE-------------------KKQKKKEKKE 54
RKK KKE +E K K K E +K K+++E K K
Sbjct: 171 RKKSKKEVMKEVIAKSKFYKAERQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKPPMT 230
Query: 55 EKEEKKE---KKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKK 111
+E+ E + +E ++ + D+ K E++ K+E ++ KK E ++ ++ + E+++ +
Sbjct: 231 PEEKDDEYDQRVRELTFDRRAQPTDRTKTEEELAKEEAERLKKLEAERLRRMRGEEEDDE 290
Query: 112 AEE 114
EE
Sbjct: 291 EEE 293
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This
is a family of uncharacterized proteins.
Length = 449
Score = 26.9 bits (59), Expect = 8.6
Identities = 11/49 (22%), Positives = 28/49 (57%)
Query: 33 KKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEE 81
KKE +K +E +++ E++++ EE + +E + +++ D+ E+
Sbjct: 33 KKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDED 81
Score = 26.9 bits (59), Expect = 9.7
Identities = 11/50 (22%), Positives = 28/50 (56%)
Query: 40 KKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEE 89
KK+ ++ KE +EE E+++ +E+ +E++D ++ + ++E
Sbjct: 33 KKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDE 82
>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
protein turnover, chaperones].
Length = 579
Score = 26.9 bits (60), Expect = 8.7
Identities = 26/119 (21%), Positives = 49/119 (41%), Gaps = 12/119 (10%)
Query: 1 MKISIAPKGAISPRIGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKK 60
I+I +S +R + E K+ ++ + + + E EK
Sbjct: 473 QSITIKASSGLSDEEIERMVEDAEAN-------AALDKKFRELVEARNEAESLIYSLEKA 525
Query: 61 EKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQNRQQ 119
K+ K E++K+K E+ EE + +KEE K K ++ ++ +K E+ QQ
Sbjct: 526 LKE-----IVKVSEEEKEKIEEAITDLEEALEGEKEEIKAKIEELQEVTQKLAEKKYQQ 579
>gnl|CDD|235026 PRK02292, PRK02292, V-type ATP synthase subunit E; Provisional.
Length = 188
Score = 26.5 bits (59), Expect = 8.7
Identities = 16/77 (20%), Positives = 39/77 (50%)
Query: 43 EEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
+E + + E + E +E+ E+ E + EE + ++ E +++ ++ ++Q+ K + K
Sbjct: 12 DEARARASEIRAEADEEAEEIIAEAEADAEEILEDREAEAEREIEQLREQELSSAKLEAK 71
Query: 103 KKEEKKNKKAEEQNRQQ 119
++ K+ E R Q
Sbjct: 72 RERLNARKEVLEDVRNQ 88
>gnl|CDD|234776 PRK00478, scpA, segregation and condensation protein A/unknown
domain fusion protein; Provisional.
Length = 505
Score = 26.8 bits (59), Expect = 8.9
Identities = 17/74 (22%), Positives = 40/74 (54%)
Query: 36 EKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKK 95
+++Q++++ K+ K+ +EE+ +K+EE +KK E +E+ ++ EEK +
Sbjct: 430 QRQQEQEQALKESIKQIQEERAISNFQKREEYLKKKYGEYYLSREQYQKLTPEEKINIRI 489
Query: 96 EEKKQKKKKEEKKN 109
++K K + +
Sbjct: 490 NQRKIDLIKLDDEQ 503
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 26.9 bits (59), Expect = 9.1
Identities = 22/102 (21%), Positives = 49/102 (48%), Gaps = 4/102 (3%)
Query: 17 KRKKKKKEEENKKKQKKKEEKK----QKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKE 72
K +K K K + K + K ++K Q+ K E+ + + E K+EE K +
Sbjct: 292 KISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQS 351
Query: 73 EEKDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEE 114
+ K+ +KQ E+ + +E+++ ++ +K N ++++
Sbjct: 352 NIDELHKQLRKQGISTEQFELMNQEREKLTRELDKINIQSDK 393
>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal. The C-terminal domain of
FCP-1 is required for interaction with the carboxy
terminal domain of RAP74. Interaction relies extensively
on van der Waals contacts between hydrophobic residues
situated within alpha-helices in both domains.
Length = 263
Score = 26.9 bits (59), Expect = 9.2
Identities = 15/95 (15%), Positives = 42/95 (44%), Gaps = 4/95 (4%)
Query: 19 KKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEK----KEKKKEEKKQKKEEE 74
+KKK +++++ + +KQ+ +++ + E + + ++K +E+E
Sbjct: 166 EKKKPPGNEEEQEQAPQPRKQQPPGPRREPDEGTPSSSERSNIGGRGPRGHKRKLNEEDE 225
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKN 109
+D + E + E+++ E + E + N
Sbjct: 226 EDAESESSFESSNEDEEGSSSEADEMAAALEAELN 260
>gnl|CDD|217451 pfam03249, TSA, Type specific antigen. There are several antigenic
variants in Rickettsia tsutsugamushi, and a
type-specific antigen (TSA) of 56-kilodaltons located on
the rickettsial surface is responsible for the
variation. TSA proteins are probably integral membrane
proteins.
Length = 502
Score = 26.8 bits (59), Expect = 9.2
Identities = 14/41 (34%), Positives = 20/41 (48%)
Query: 62 KKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKK 102
KK EK ++EE + KKK+ +K KE K K+
Sbjct: 360 KKAMEKLAAQQEEDAGNQGGGDCKKKQGASEKSKEGGKGKE 400
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 26.6 bits (59), Expect = 9.2
Identities = 15/85 (17%), Positives = 29/85 (34%), Gaps = 8/85 (9%)
Query: 27 NKKKQKKKEEKKQKKKEEKKQKKKEKKE--------EKEEKKEKKKEEKKQKKEEEKDKK 78
+ + +++K + KK+ K KK + + ++K K
Sbjct: 147 ERTEILAEKKKPRSKKKSSKNSKKLRIDLVGEWLSTAISPPDSSPVSLDAREKLRAYLPK 206
Query: 79 KEEKKQKKKEEKKQKKKEEKKQKKK 103
E K + KK K+ K + K
Sbjct: 207 WEGKTKNKKSLKEYKDLIKLLDSGK 231
>gnl|CDD|212465 cd01718, Sm_E, Sm protein E. The eukaryotic Sm proteins (B/B', D1,
D2, D3, E, F and G) assemble into a hetero-heptameric
ring around the Sm site of the 2,2,7-trimethyl guanosine
(m3G) capped U1, U2, U4 and U5 snRNAs (Sm snRNAs)
forming the core of the snRNP particle. The snRNP
particle, in turn, assembles with other components onto
the pre-mRNA to form the spliceosome which is
responsible for the excision of introns and the ligation
of exons. Members of this family share a highly
conserved Sm fold containing an N-terminal helix
followed by a strongly bent five-stranded antiparallel
beta-sheet. Sm subunit E binds subunits F and G to form
a trimer which then assembles onto snRNA along with the
D1/D2 and D3/B heterodimers forming a seven-membered
ring structure.
Length = 79
Score = 25.2 bits (56), Expect = 9.2
Identities = 15/57 (26%), Positives = 27/57 (47%), Gaps = 2/57 (3%)
Query: 121 LYPNQKRTLIGYLRSVDQFANLVLHKTIERIHVGNQYGDIPRGIFIIRGENVVLMGE 177
LY + G + D++ NLVL E +H+ P G +++G+N+ L+
Sbjct: 25 LYEQTDMRIEGKIIGFDEYMNLVLDDAEE-VHLKTN-TRKPLGRILLKGDNITLIQN 79
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 26.6 bits (59), Expect = 9.3
Identities = 23/101 (22%), Positives = 49/101 (48%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+ + ++ KK++ K K K+ KK+KKK+ K + + +KK ++ +
Sbjct: 78 LQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSP 137
Query: 75 KDKKKEEKKQKKKEEKKQKKKEEKKQKKKKEEKKNKKAEEQ 115
+ K + KE ++ KE + ++KK + K K+ E++
Sbjct: 138 RRKSSRSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKK 178
>gnl|CDD|211393 cd11381, NSA2, pre-ribosomal protein NSA2 (Nop seven-associated 2).
NSA2 appears to be a protein required for the
maturation of 27S pre-rRNA in yeast; it has been
characterized in mammalian cells as a nucleolar protein
that might play a role in the regulation of the cell
cycle and in cell proliferation.
Length = 257
Score = 26.8 bits (60), Expect = 9.3
Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 2/82 (2%)
Query: 44 EKKQKKKEKKEEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEE-KKQKK 102
E +K+ ++ + EE+K +KKE ++ K +K +K K K +K+ K+K + KK K
Sbjct: 5 ELHRKRHGRRLDHEERK-RKKEAREPHKRSKKAQKLRGLKAKLYNKKRYKEKIQMKKTIK 63
Query: 103 KKEEKKNKKAEEQNRQQGLYPN 124
EE+ K+ + +G P
Sbjct: 64 MHEERNVKQKVDDKVPEGAVPA 85
>gnl|CDD|212476 cd01729, LSm7, Like-Sm protein 7. The eukaryotic LSm proteins
(LSm2-8 or LSm1-7) assemble into a hetero-heptameric
ring around the 3'-terminus uridylation tag of the
gamma-methyl triphosphate (gamma-m-P3) capped U6 snRNA.
LSm2-8 form the core of the snRNP particle that, in
turn, assembles with other components onto the pre-mRNA
to form the spliceosome which is responsible for the
excision of introns and the ligation of exons. LSm1-7 is
involved in recognition of the 3' uridylation tag and
recruitment of the decapping machinery. LSm657 is
believed to be an assembly intermediate for both the
LSm1-7 and LSm2-8 rings. Members of this family share a
highly conserved Sm fold containing an N-terminal helix
followed by a strongly bent five-stranded antiparallel
beta-sheet.
Length = 89
Score = 25.6 bits (57), Expect = 9.3
Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 9/55 (16%)
Query: 127 RTLIGYLRSVDQFANLVLHKTIERI-------HVGNQYGDIPRGIFIIRGENVVL 174
R + G L+ DQ NLVL T+E + + ++ + G+ + RG +VVL
Sbjct: 23 REVTGILKGYDQLLNLVLDDTVEYLRDPEDPYKLTDETRSL--GLVVCRGTSVVL 75
>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45;
Provisional.
Length = 193
Score = 26.6 bits (58), Expect = 9.7
Identities = 20/77 (25%), Positives = 46/77 (59%)
Query: 15 IGKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEKEEKKEKKKEEKKQKKEEE 74
+G +K K +E K++ + +++K K+E ++ ++K E+ E+ E + EE +++EE
Sbjct: 1 MGNYCRKNKAKEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPEQEEEN 60
Query: 75 KDKKKEEKKQKKKEEKK 91
+++K +E+ +E K
Sbjct: 61 EEQKPKEEIDYPIQENK 77
>gnl|CDD|224236 COG1317, FliH, Flagellar biosynthesis/type III secretory pathway
protein [Cell motility and secretion / Intracellular
trafficking and secretion].
Length = 234
Score = 26.6 bits (59), Expect = 9.8
Identities = 15/50 (30%), Positives = 26/50 (52%)
Query: 54 EEKEEKKEKKKEEKKQKKEEEKDKKKEEKKQKKKEEKKQKKKEEKKQKKK 103
E KEE+ E +E ++ EE + EE Q EE ++ +EE + ++
Sbjct: 47 EAKEEELESAAQELQEGIEEGAREGYEEGFQLGYEEGFEEGQEEGRVLER 96
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 26.3 bits (58), Expect = 10.0
Identities = 13/41 (31%), Positives = 24/41 (58%)
Query: 16 GKRKKKKKEEENKKKQKKKEEKKQKKKEEKKQKKKEKKEEK 56
+ ++KK + KK+ + Q+ K++KK+K K+KK K
Sbjct: 64 RQAERKKGYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPK 104
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.301 0.124 0.316
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,476,143
Number of extensions: 1089038
Number of successful extensions: 50000
Number of sequences better than 10.0: 1
Number of HSP's gapped: 23118
Number of HSP's successfully gapped: 6927
Length of query: 213
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 120
Effective length of database: 6,812,680
Effective search space: 817521600
Effective search space used: 817521600
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.7 bits)
S2: 57 (25.6 bits)