RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5528
(182 letters)
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 39.1 bits (92), Expect = 6e-04
Identities = 18/60 (30%), Positives = 34/60 (56%)
Query: 123 MVRLFNQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ K T +++ + K +K +EE++EKKKK K+ +++K ++EEE+EE
Sbjct: 398 EIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457
Score = 34.9 bits (81), Expect = 0.015
Identities = 15/46 (32%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 138 RRRMRKKKKVEKEEE-EEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ R+++K EK+++ KKK + + + +++K EEEEE EE
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464
Score = 34.9 bits (81), Expect = 0.016
Identities = 18/47 (38%), Positives = 30/47 (63%)
Query: 136 SLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ + KK++ EK+E+++K K+K + K+KK EE+EEEEE
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE 460
Score = 32.2 bits (74), Expect = 0.14
Identities = 14/45 (31%), Positives = 30/45 (66%)
Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
++ +KKK +++EE++++ K K + ++++ EEE+EEEE
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEE 470
Score = 31.8 bits (73), Expect = 0.18
Identities = 14/45 (31%), Positives = 30/45 (66%)
Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
++ +KK K++EEE++++ ++K + +++ EE+EEEEE
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471
Score = 28.7 bits (65), Expect = 2.0
Identities = 12/46 (26%), Positives = 28/46 (60%)
Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
R +K+KK + ++K+++ + + ++ K+++ EE EEE+E
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKE 467
Score = 28.3 bits (64), Expect = 2.5
Identities = 15/54 (27%), Positives = 30/54 (55%)
Query: 129 QHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ K ++ + +KEEEEE++K+ K + K + ++ +EEEEE++
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 35.3 bits (82), Expect = 0.012
Identities = 12/48 (25%), Positives = 26/48 (54%), Gaps = 3/48 (6%)
Query: 138 RRRMRKKKKVEK---EEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
R+++RKK++ + E+EE +K K+K + K + E ++ +
Sbjct: 411 RKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDP 458
Score = 34.5 bits (80), Expect = 0.020
Identities = 12/46 (26%), Positives = 22/46 (47%)
Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
LR++ RK +K ++EE EK K+ K E ++ + +
Sbjct: 414 LRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 35.3 bits (82), Expect = 0.013
Identities = 14/38 (36%), Positives = 20/38 (52%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
K+ +KE+E+EK K KR N K++K EE
Sbjct: 397 AKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 32.6 bits (75), Expect = 0.10
Identities = 15/48 (31%), Positives = 22/48 (45%)
Query: 131 KIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE 178
K S + ++ +K EKE+E+ K KK R K +K EE
Sbjct: 387 KTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 32.2 bits (74), Expect = 0.11
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
K+ EK+E+E++K K K++ + N K++ EE
Sbjct: 396 LAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 31.1 bits (71), Expect = 0.28
Identities = 11/39 (28%), Positives = 20/39 (51%)
Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
K +++E+EK+K +K R+ N K+ + EE
Sbjct: 396 LAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 34.3 bits (79), Expect = 0.018
Identities = 13/41 (31%), Positives = 28/41 (68%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
++K K +K++ ++KK K+K K ++K +K +E+E E++
Sbjct: 81 KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDK 121
Score = 32.4 bits (74), Expect = 0.072
Identities = 12/41 (29%), Positives = 28/41 (68%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ +KV+KE EE++K K K+K + +K K ++++++++
Sbjct: 69 EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDK 109
Score = 31.6 bits (72), Expect = 0.14
Identities = 13/47 (27%), Positives = 31/47 (65%)
Query: 136 SLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+++ +K+K + ++++ KKKK+K K K+++ K +++E+E E
Sbjct: 73 KVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAE 119
Score = 30.0 bits (68), Expect = 0.49
Identities = 12/51 (23%), Positives = 30/51 (58%)
Query: 132 IYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
IY +KKK++ +E E+ KK+ +++ + K KK ++++++++
Sbjct: 52 IYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKK 102
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 33.5 bits (77), Expect = 0.022
Identities = 15/43 (34%), Positives = 29/43 (67%)
Query: 140 RMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ ++++K K+EE+ KK+ KR+ K+ KKKK ++ ++EE
Sbjct: 69 QQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEE 111
Score = 32.3 bits (74), Expect = 0.063
Identities = 14/53 (26%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 129 QHKIYTTSLRRRMRK-KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
+ + + + RK ++K K+ + +KKK K+K K+ KK E+E +
Sbjct: 64 EDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSK 116
Score = 28.9 bits (65), Expect = 0.83
Identities = 16/48 (33%), Positives = 33/48 (68%), Gaps = 1/48 (2%)
Query: 135 TSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
T+ +R R+KKK +K+++++K KK +K ++ + +++ +EEEE E
Sbjct: 83 TAKKRAKRQKKK-QKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGE 129
Score = 26.9 bits (60), Expect = 4.0
Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 5/50 (10%)
Query: 138 RRRMRKKKKVE-----KEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
++R KK+K E K + +KKK+ K+K K+ KK+ +E + E
Sbjct: 70 QKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSE 119
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 33.5 bits (76), Expect = 0.046
Identities = 9/49 (18%), Positives = 23/49 (46%)
Query: 134 TTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
S + R ++K + +++K ++K ++ KK + E++ E
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSE 156
>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
component YidC; Provisional.
Length = 304
Score = 33.1 bits (76), Expect = 0.051
Identities = 14/55 (25%), Positives = 24/55 (43%), Gaps = 18/55 (32%)
Query: 126 LFNQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRK-------GKRNNNNKKKK 173
QH ++ K++KEEE +K+ K +R+ G + +KKK
Sbjct: 260 FGQQHYVF-----------GKIDKEEEAKKQAKAERRAANAPKPGAKPKRSKKKA 303
>gnl|CDD|182885 PRK10991, fucI, L-fucose isomerase; Provisional.
Length = 588
Score = 33.1 bits (76), Expect = 0.063
Identities = 17/51 (33%), Positives = 28/51 (54%), Gaps = 6/51 (11%)
Query: 135 TSLRRRMRKKKKVEKEEEEEKK----KKNKRKGKRNNNNKKKKNEEEEEEE 181
T LRRR+ +K+ EEE E KKN ++G+ N + ++N E++
Sbjct: 209 TELRRRI--DQKIYDEEELEMALAWAKKNCKEGEDENAEQYQRNAEQKRAV 257
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 33.0 bits (75), Expect = 0.075
Identities = 9/49 (18%), Positives = 19/49 (38%)
Query: 134 TTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
T R ++K K + ++E K ++ K+ EE+ +
Sbjct: 790 TIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAP 838
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 32.2 bits (74), Expect = 0.089
Identities = 14/45 (31%), Positives = 24/45 (53%)
Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+R ++ KK E E+EK + +RK K KK+K ++ E +
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQ 157
Score = 26.4 bits (59), Expect = 8.6
Identities = 9/39 (23%), Positives = 21/39 (53%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
KK++ E+E+ +++ K K K +K + ++ E +
Sbjct: 120 KKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQH 158
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 31.9 bits (73), Expect = 0.16
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKK 173
KKK+ EK+ ++ KKKK ++K + K +K
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 31.1 bits (71), Expect = 0.32
Identities = 12/31 (38%), Positives = 22/31 (70%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKK 172
+KK++ +K ++ +KKKK K+KGK+ +K
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 31.1 bits (71), Expect = 0.32
Identities = 11/30 (36%), Positives = 19/30 (63%)
Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKK 173
KK +E++ +K+KK K++ K+ KKK
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRKKKG 412
Score = 30.3 bits (69), Expect = 0.49
Identities = 13/31 (41%), Positives = 20/31 (64%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKK 173
KKK E+++ +++KKK KRK K KK +
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 28.4 bits (64), Expect = 2.1
Identities = 12/36 (33%), Positives = 23/36 (63%)
Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKK 173
+ + K K ++EE++ +K+K K+K K+ +KKK
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKK 411
Score = 28.4 bits (64), Expect = 2.3
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKR 165
++R KK + K++++ KKK KRK K
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKKKG 412
Score = 26.9 bits (60), Expect = 7.5
Identities = 15/47 (31%), Positives = 27/47 (57%), Gaps = 5/47 (10%)
Query: 137 LRRRMRKKKKV-----EKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE 178
L +R+ + K+ +K+ EE+K +K K+K KR KK+K + +
Sbjct: 368 LNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 31.5 bits (71), Expect = 0.19
Identities = 12/46 (26%), Positives = 26/46 (56%)
Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
L+++ +++KV +EEE+ +K++ + R K++ EE E
Sbjct: 219 LKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRA 264
Score = 31.2 bits (70), Expect = 0.29
Identities = 12/45 (26%), Positives = 25/45 (55%)
Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
++R ++K +E+EE+ K+++ RK + ++ K E E E
Sbjct: 221 KKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAE 265
Score = 28.1 bits (62), Expect = 3.1
Identities = 13/38 (34%), Positives = 19/38 (50%)
Query: 131 KIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNN 168
K + R + +K EKE E E+++K KR NN
Sbjct: 134 KSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENN 171
Score = 26.5 bits (58), Expect = 9.6
Identities = 15/44 (34%), Positives = 25/44 (56%), Gaps = 4/44 (9%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE----EEEE 182
K+++ E EE KKK+ +R+ +++K EE + EEEE
Sbjct: 208 KQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEE 251
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 31.9 bits (73), Expect = 0.19
Identities = 8/42 (19%), Positives = 21/42 (50%), Gaps = 2/42 (4%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNN--NNKKKKNEEEEEEEE 182
+ K E+ + + K +++ +R NN++ +NE + +
Sbjct: 585 ETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDN 626
Score = 28.9 bits (65), Expect = 1.7
Identities = 8/41 (19%), Positives = 22/41 (53%), Gaps = 3/41 (7%)
Query: 145 KKVEKEEEEEKKKKNKRKGKRNNN---NKKKKNEEEEEEEE 182
+ K E + ++++++RK ++NN N+++ + E
Sbjct: 591 QPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRRE 631
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 30.8 bits (70), Expect = 0.19
Identities = 12/41 (29%), Positives = 27/41 (65%)
Query: 126 LFNQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRN 166
L + KI L+ + ++KK+ +K+++++KKK +K+ K+
Sbjct: 94 LVEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 25.8 bits (57), Expect = 9.0
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 145 KKVEKEEEEEKKKKNKRKGKRNNNNKKKK 173
K+V KE+ ++KK+K K+K K+ KK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKA 130
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 31.9 bits (73), Expect = 0.19
Identities = 12/35 (34%), Positives = 21/35 (60%)
Query: 146 KVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
+ EKEE+E K++ + + + KKKK E+ E+
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 27.7 bits (62), Expect = 4.7
Identities = 15/39 (38%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 144 KKKVEKEEEE-EKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
K+++++E+EE E K+ KR K +KKK E E+ E+
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
Toc86/159, G and M domains. The long precursor of the
86K protein originally described is proposed to have
three domains. The N-terminal A-domain is acidic,
repetitive, weakly conserved, readily removed by
proteolysis during chloroplast isolation, and not
required for protein translocation. The other domains
are designated G (GTPase) and M (membrane anchor); this
family includes most of the G domain and all of M
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 763
Score = 31.8 bits (72), Expect = 0.21
Identities = 15/52 (28%), Positives = 24/52 (46%)
Query: 129 QHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
Q K Y R++ +K + EE ++ K K+ GK +EE +EE
Sbjct: 428 QRKAYLEEYDYRVKLLQKKQWREELKRMKMMKKFGKEIGELPDGYSEEVDEE 479
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 31.2 bits (71), Expect = 0.21
Identities = 15/39 (38%), Positives = 27/39 (69%)
Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEE 176
++ +KKK+V+KE++E+K KK K + + KKKK ++
Sbjct: 154 KKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
Score = 29.3 bits (66), Expect = 0.90
Identities = 16/41 (39%), Positives = 26/41 (63%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ +K+ E EEEE+K+KK K++ K+ KK K E+ E +
Sbjct: 142 KVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKG 182
Score = 28.5 bits (64), Expect = 1.5
Identities = 13/39 (33%), Positives = 26/39 (66%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
+K+KK +KE ++EKK+K +K K KK ++++++
Sbjct: 154 KKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
Score = 27.8 bits (62), Expect = 2.6
Identities = 13/42 (30%), Positives = 24/42 (57%)
Query: 139 RRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
+ K ++ E EE++KK K+K K KK+K +++E+
Sbjct: 136 EKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177
Score = 26.6 bits (59), Expect = 6.2
Identities = 10/40 (25%), Positives = 26/40 (65%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+K+ K E+E + ++ ++K K+ KK+ +E+++++E
Sbjct: 136 EKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKE 175
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 30.3 bits (69), Expect = 0.22
Identities = 15/39 (38%), Positives = 21/39 (53%)
Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
K + E E+EKKKK KR GK+ + + E +E E
Sbjct: 70 KVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAE 108
Score = 30.3 bits (69), Expect = 0.29
Identities = 11/38 (28%), Positives = 22/38 (57%), Gaps = 6/38 (15%)
Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNE 175
+RR+ + + E+ +E +K+K RK N++KK +
Sbjct: 91 KRRIALRLRRERTKERAEKEKRTRK------NREKKFK 122
Score = 29.2 bits (66), Expect = 0.66
Identities = 12/37 (32%), Positives = 25/37 (67%), Gaps = 1/37 (2%)
Query: 137 LRRRMRKKKKVEKEEEEEKKKKN-KRKGKRNNNNKKK 172
+ R+R+++ E+ E+E++ +KN ++K KR K+K
Sbjct: 94 IALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
Score = 28.0 bits (63), Expect = 1.6
Identities = 10/38 (26%), Positives = 17/38 (44%), Gaps = 7/38 (18%)
Query: 138 RRRMRKKKKVEKEEEE-------EKKKKNKRKGKRNNN 168
R +R +++ KE E ++KK KR+ K
Sbjct: 93 RIALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 30.9 bits (70), Expect = 0.23
Identities = 12/40 (30%), Positives = 24/40 (60%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
++ +E +EE+K+ + K + + +K++EE EEE E
Sbjct: 57 QETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENE 96
Score = 29.7 bits (67), Expect = 0.73
Identities = 9/40 (22%), Positives = 25/40 (62%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ K ++E+EE +++N+ + + +++ +K+ EE+ E
Sbjct: 79 EDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNV 118
Score = 28.9 bits (65), Expect = 1.3
Identities = 8/38 (21%), Positives = 22/38 (57%)
Query: 145 KKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+KE++ + +K+++ + N ++ ++E E+E E
Sbjct: 74 NSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETE 111
Score = 28.5 bits (64), Expect = 1.7
Identities = 8/40 (20%), Positives = 23/40 (57%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ K +K + E++ ++++ + + + +NE+E EE+
Sbjct: 75 SEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKT 114
Score = 27.8 bits (62), Expect = 2.7
Identities = 13/46 (28%), Positives = 27/46 (58%), Gaps = 6/46 (13%)
Query: 143 KKKKVEKEEEEEK------KKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ ++E+ +EEEK K++K ++ + +++NEEE+EE
Sbjct: 58 ETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESS 103
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 31.6 bits (72), Expect = 0.24
Identities = 10/39 (25%), Positives = 18/39 (46%)
Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
K+ +EE+ G +NN ++K N+ E E+
Sbjct: 214 KEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELED 252
Score = 28.5 bits (64), Expect = 2.1
Identities = 10/39 (25%), Positives = 16/39 (41%)
Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
K K +EEK + N NNK+++ + E
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNEL 250
>gnl|CDD|130161 TIGR01089, fucI, L-fucose isomerase. This enzyme catalyzes the
first step in fucose metabolism, and has been
characterized in Escherichia coli and Bacteroides
thetaiotaomicron [Energy metabolism, Sugars].
Length = 587
Score = 31.5 bits (71), Expect = 0.24
Identities = 15/50 (30%), Positives = 25/50 (50%), Gaps = 6/50 (12%)
Query: 135 TSLRRRMRKKKKVEKEEEEEKK----KKNKRKGKRNNNNKKKKNEEEEEE 180
T +RRR+ +K+ EEE E K + G+ NN + ++N E+
Sbjct: 208 TEIRRRI--DQKIYDEEELEMALAWADKYCKYGEDENNKEYQRNAEQSRA 255
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 31.4 bits (71), Expect = 0.26
Identities = 18/41 (43%), Positives = 26/41 (63%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
KKKK EK +EE K +K K + K K+K+ E+E++ EE
Sbjct: 116 EKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE 156
Score = 30.2 bits (68), Expect = 0.61
Identities = 16/40 (40%), Positives = 26/40 (65%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
+KKK+ KEE +++K K + K KR K+K+ E++ EE
Sbjct: 118 KKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEP 157
Score = 29.9 bits (67), Expect = 0.71
Identities = 12/48 (25%), Positives = 27/48 (56%)
Query: 134 TTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
+ + + K++ +K++E+ K++ RK K K+ E+E+E+E
Sbjct: 104 KEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKE 151
Score = 29.5 bits (66), Expect = 1.0
Identities = 13/40 (32%), Positives = 21/40 (52%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
KK + ++ EEEKK++ R R KKK +++E
Sbjct: 152 KKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPP 191
Score = 29.1 bits (65), Expect = 1.6
Identities = 11/41 (26%), Positives = 19/41 (46%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
K +E + + K + K K +KKK +E+ +EE
Sbjct: 88 PAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEP 128
Score = 28.7 bits (64), Expect = 1.8
Identities = 15/40 (37%), Positives = 25/40 (62%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
K+ EKE+E+ K++K K+K K K +K +EE +E+
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140
Score = 28.7 bits (64), Expect = 1.8
Identities = 13/40 (32%), Positives = 23/40 (57%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
K K KE + E K+ +++ ++ KKKK E+ +EE +
Sbjct: 90 AKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPK 129
Score = 27.9 bits (62), Expect = 3.2
Identities = 8/42 (19%), Positives = 20/42 (47%), Gaps = 1/42 (2%)
Query: 142 RKKKKVEKEEEEEKKKK-NKRKGKRNNNNKKKKNEEEEEEEE 182
K+ EKE+E+EKK + + + + + + ++ +
Sbjct: 139 EKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPK 180
Score = 27.9 bits (62), Expect = 3.2
Identities = 12/52 (23%), Positives = 25/52 (48%)
Query: 131 KIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
++ + K K ++ + E K++ K K + KKKK + +EE ++
Sbjct: 79 RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKD 130
Score = 27.9 bits (62), Expect = 3.3
Identities = 11/38 (28%), Positives = 21/38 (55%)
Query: 145 KKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
K+VEK + K K + N + K++ +E+E+ +E
Sbjct: 78 KRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKE 115
Score = 27.9 bits (62), Expect = 3.6
Identities = 8/41 (19%), Positives = 23/41 (56%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
R K+ EKE+E++ ++ R+ ++ + K+ ++ ++
Sbjct: 141 RPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKK 181
Score = 27.5 bits (61), Expect = 4.5
Identities = 12/46 (26%), Positives = 25/46 (54%), Gaps = 1/46 (2%)
Query: 138 RRRMRKKKKVEKEEEE-EKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ K++K +K+E+ E+ K K K + K+ E+E+E++
Sbjct: 109 EKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKV 154
Score = 26.8 bits (59), Expect = 8.1
Identities = 15/40 (37%), Positives = 25/40 (62%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
K+++ EKE+ +E+KKK K K K ++K K E +E+
Sbjct: 104 KEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP 143
Score = 26.4 bits (58), Expect = 9.8
Identities = 9/41 (21%), Positives = 22/41 (53%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ K E + E K+++ +++ + KKK+ +EE ++
Sbjct: 91 KTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDR 131
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 30.4 bits (69), Expect = 0.26
Identities = 15/45 (33%), Positives = 25/45 (55%)
Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
R ++K+ EKE E+E+K++ + K K +K E E+ EE
Sbjct: 10 RIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEE 54
Score = 26.9 bits (60), Expect = 4.0
Identities = 14/54 (25%), Positives = 30/54 (55%), Gaps = 1/54 (1%)
Query: 129 QHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+I +R ++++K E++E +E+K+K KG ++K+ EE E+ +
Sbjct: 8 SGRIIDIEEKREEKEREK-EEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKN 60
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 31.0 bits (70), Expect = 0.27
Identities = 11/44 (25%), Positives = 19/44 (43%)
Query: 135 TSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE 178
+ + KK + K + K KK K+KG +N + K +
Sbjct: 143 EAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPR 186
>gnl|CDD|223440 COG0363, NagB, 6-phosphogluconolactonase/Glucosamine-6-phosphate
isomerase/deaminase [Carbohydrate transport and
metabolism].
Length = 238
Score = 30.7 bits (70), Expect = 0.29
Identities = 15/67 (22%), Positives = 20/67 (29%), Gaps = 14/67 (20%)
Query: 5 LHYTPSEDDIAHLLKEFTVDFLLKGY------------GRLVSQLYDLLVTDLHSQIDTS 52
L +++A E D L G LY+ LV Q+D S
Sbjct: 3 LIIFEDAEELAKAAAEIIADKLQAAKAERGRAVLALSGGSTPLALYEALVKLPQGQLDWS 62
Query: 53 --HFFWL 57
F L
Sbjct: 63 KVTIFNL 69
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 30.8 bits (70), Expect = 0.30
Identities = 12/53 (22%), Positives = 23/53 (43%)
Query: 128 NQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
++ K +++ +K+ K E E + K K K + K K NE+ +
Sbjct: 66 SKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNA 118
Score = 27.8 bits (62), Expect = 3.9
Identities = 10/39 (25%), Positives = 22/39 (56%), Gaps = 1/39 (2%)
Query: 145 KKVEKEEEEEKKKKNKR-KGKRNNNNKKKKNEEEEEEEE 182
+EE + K+ K + NN KKK+E+++++++
Sbjct: 40 STFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKK 78
Score = 26.6 bits (59), Expect = 7.7
Identities = 16/53 (30%), Positives = 27/53 (50%), Gaps = 4/53 (7%)
Query: 128 NQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
++K+ TTS +K KK +K E +KK + K+K K+ K + E +
Sbjct: 45 EENKVATTS----TKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGF 93
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 31.0 bits (70), Expect = 0.31
Identities = 10/46 (21%), Positives = 19/46 (41%), Gaps = 2/46 (4%)
Query: 139 RRMRKKKKVEKEEEEEKKK--KNKRKGKRNNNNKKKKNEEEEEEEE 182
++K + EK E + +KK KR+ N++ + E
Sbjct: 330 AEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMAR 375
>gnl|CDD|237603 PRK14083, PRK14083, HSP90 family protein; Provisional.
Length = 601
Score = 31.1 bits (71), Expect = 0.32
Identities = 12/48 (25%), Positives = 19/48 (39%), Gaps = 2/48 (4%)
Query: 13 DIAHLLKEFTVDFLLKGYGRLVSQLYDLLVTDLHSQIDTSHFFWLVTY 60
D L+ TV+ L K YG L+ + V ++ + W Y
Sbjct: 167 DAEEWLERETVEELAKKYGSLLP--VPIRVEGEKGGVNETPPPWTRDY 212
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 30.7 bits (70), Expect = 0.33
Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 5/68 (7%)
Query: 120 IHTMVRLFNQ-----HKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKN 174
+ ++ L N K+ L + +K +EEEEEK K + ++ +KK+
Sbjct: 231 LEDLLPLLNLVLYLVDKLANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEE 290
Query: 175 EEEEEEEE 182
+++EE E
Sbjct: 291 KKKEEREA 298
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 30.4 bits (69), Expect = 0.41
Identities = 12/35 (34%), Positives = 19/35 (54%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEE 177
K ++E+E ++KKKK K+K K+NN
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNATGSSAEATV 206
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 30.2 bits (68), Expect = 0.42
Identities = 12/36 (33%), Positives = 17/36 (47%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEE 177
R+K E+++K K K K K + KKK E
Sbjct: 81 RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAREG 116
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 30.6 bits (69), Expect = 0.43
Identities = 18/55 (32%), Positives = 26/55 (47%), Gaps = 14/55 (25%)
Query: 142 RKKKKVEK-EEEEEKKKKNKRK-------------GKRNNNNKKKKNEEEEEEEE 182
RKKKK EK +E+E KK K +K G +KK+ + + E+E
Sbjct: 20 RKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVEDE 74
Score = 27.9 bits (62), Expect = 3.2
Identities = 14/38 (36%), Positives = 22/38 (57%)
Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
+KK+ EEE E+KKK + K K K K ++E + +
Sbjct: 9 EKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 30.6 bits (69), Expect = 0.44
Identities = 13/53 (24%), Positives = 24/53 (45%)
Query: 130 HKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
H T R+++ + KKV + +NK + N+NNK K+ + +
Sbjct: 15 HLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSKLYSDV 67
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 30.4 bits (68), Expect = 0.50
Identities = 10/41 (24%), Positives = 30/41 (73%)
Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEE 177
L R ++++EK++ E+KK++ +++G+R +++ +++E+
Sbjct: 84 LEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDED 124
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 30.5 bits (68), Expect = 0.54
Identities = 8/41 (19%), Positives = 19/41 (46%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+K + K E E + + ++ +KKK E +++ +
Sbjct: 1342 KKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA 1382
Score = 30.5 bits (68), Expect = 0.56
Identities = 13/41 (31%), Positives = 24/41 (58%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+K ++++K EEE K K + K + KK + ++ EE+E
Sbjct: 1647 KKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDE 1687
Score = 30.1 bits (67), Expect = 0.67
Identities = 21/57 (36%), Positives = 30/57 (52%), Gaps = 3/57 (5%)
Query: 129 QHKIYTTSLRRRMRKKKKVE---KEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ KI L++ +KKKVE K+E EEKKK + K N K E ++ EE+
Sbjct: 1617 EAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEED 1673
Score = 30.1 bits (67), Expect = 0.83
Identities = 16/51 (31%), Positives = 30/51 (58%), Gaps = 4/51 (7%)
Query: 136 SLRRRMRKKKKVE----KEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+L++ + KK E KE EE+KK + +K + N K ++ ++E EE++
Sbjct: 1693 ALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDK 1743
Score = 28.6 bits (63), Expect = 2.6
Identities = 17/64 (26%), Positives = 31/64 (48%)
Query: 119 IIHTMVRLFNQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE 178
I M + K+ ++ K K E+ ++ E++KK + K+ +KKK EE +
Sbjct: 1594 IEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELK 1653
Query: 179 EEEE 182
+ EE
Sbjct: 1654 KAEE 1657
Score = 28.6 bits (63), Expect = 2.6
Identities = 12/40 (30%), Positives = 23/40 (57%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
+K ++ +K EE+EKK K + K ++ +++E EE
Sbjct: 1675 KKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEE 1714
Score = 28.2 bits (62), Expect = 3.0
Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 4/50 (8%)
Query: 137 LRRRMRKKKKVEKEEEEEKKK----KNKRKGKRNNNNKKKKNEEEEEEEE 182
K + EK++EE KKK K K + K+ + KKK EE++++ +
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKAD 1408
Score = 28.2 bits (62), Expect = 3.0
Identities = 14/40 (35%), Positives = 21/40 (52%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
KKK E ++ EE +KK K+ KK E +++E E
Sbjct: 1674 KKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAE 1713
Score = 28.2 bits (62), Expect = 3.3
Identities = 12/47 (25%), Positives = 28/47 (59%), Gaps = 3/47 (6%)
Query: 139 RRMRKKKKVEKEEEEEKKKKNKRKG---KRNNNNKKKKNEEEEEEEE 182
++ ++ K++ EE +K +++K+K K+ ++KK E ++E E
Sbjct: 1653 KKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAE 1699
Score = 28.2 bits (62), Expect = 3.6
Identities = 15/47 (31%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 138 RRRMRKKKKVE--KEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
++ KKK E K+ EE++KK + K KK + +++E EE
Sbjct: 1668 KKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEE 1714
Score = 27.8 bits (61), Expect = 3.8
Identities = 13/41 (31%), Positives = 26/41 (63%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+K ++++K EEE K K + K + + KK + +++EEE+
Sbjct: 1716 KKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEK 1756
Score = 27.8 bits (61), Expect = 4.4
Identities = 8/41 (19%), Positives = 15/41 (36%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
KK + E E + +KK EE +++ +
Sbjct: 1341 AKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKAD 1381
Score = 27.8 bits (61), Expect = 4.7
Identities = 13/39 (33%), Positives = 19/39 (48%)
Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
K E ++ EE KKK + K + KK ++E EE
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEE 1700
Score = 27.4 bits (60), Expect = 5.6
Identities = 14/40 (35%), Positives = 21/40 (52%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
+K + K+E EE KK + K K KK + ++ EEE
Sbjct: 1688 KKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEE 1727
Score = 27.4 bits (60), Expect = 5.8
Identities = 15/41 (36%), Positives = 24/41 (58%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+KK + K+ EEE K K + K+ +KKK E ++ EE+
Sbjct: 1646 KKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEED 1686
Score = 27.4 bits (60), Expect = 6.0
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
K + K+ EE+KKK + K + K + ++E EE
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEE 1700
Score = 27.4 bits (60), Expect = 6.3
Identities = 9/42 (21%), Positives = 19/42 (45%)
Query: 141 MRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+KK + K+ E K + + ++K E+++EE
Sbjct: 1334 AKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE 1375
Score = 27.0 bits (59), Expect = 6.6
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
KKK E ++ +E KKK + K+ + KK +++ +E
Sbjct: 1476 KKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADE 1514
Score = 27.0 bits (59), Expect = 6.6
Identities = 13/40 (32%), Positives = 23/40 (57%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+KK E ++E ++ K + K+ +KKK EE ++ EE
Sbjct: 1687 EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEE 1726
Score = 27.0 bits (59), Expect = 7.1
Identities = 13/39 (33%), Positives = 23/39 (58%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
KKK EK++ +E KKK + K+ + KK +++ +E
Sbjct: 1384 KKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADE 1422
Score = 27.0 bits (59), Expect = 7.2
Identities = 14/39 (35%), Positives = 22/39 (56%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
KKK E ++ +E KKK + K+ + KKK E ++ E
Sbjct: 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAE 1346
Score = 27.0 bits (59), Expect = 7.7
Identities = 14/40 (35%), Positives = 22/40 (55%), Gaps = 2/40 (5%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
KKK E ++ +E KKK + + KKK EE ++ +E
Sbjct: 1437 KKKAEEAKKADEAKKKAEEA--KKAEEAKKKAEEAKKADE 1474
Score = 26.6 bits (58), Expect = 9.1
Identities = 15/45 (33%), Positives = 26/45 (57%), Gaps = 4/45 (8%)
Query: 142 RKKKKVEKEEEEEKKK----KNKRKGKRNNNNKKKKNEEEEEEEE 182
+K + +K+ EE+KKK K K+ + KKK EE+++ +E
Sbjct: 1391 KKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADE 1435
>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN). This family
consists of several eukaryotic survival motor neuron
(SMN) proteins. The Survival of Motor Neurons (SMN)
protein, the product of the spinal muscular
atrophy-determining gene, is part of a large
macromolecular complex (SMN complex) that functions in
the assembly of spliceosomal small nuclear
ribonucleoproteins (snRNPs). The SMN complex functions
as a specificity factor essential for the efficient
assembly of Sm proteins on U snRNAs and likely protects
cells from illicit, and potentially deleterious,
non-specific binding of Sm proteins to RNAs.
Length = 264
Score = 30.0 bits (67), Expect = 0.59
Identities = 10/40 (25%), Positives = 20/40 (50%)
Query: 135 TSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKN 174
S + ++ + +E +E+ KRK + N ++KK N
Sbjct: 23 ASFKNALKNEDCATPQENDEQNPGKKRKNNKKNRSRKKCN 62
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 30.0 bits (68), Expect = 0.60
Identities = 14/54 (25%), Positives = 33/54 (61%), Gaps = 8/54 (14%)
Query: 137 LRRRMRKKKKVEKEEEEEK--------KKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+R+++R +++E+EEEE++ + K + +++ +K K ++EE+EE
Sbjct: 112 VRKQLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 30.3 bits (68), Expect = 0.65
Identities = 10/32 (31%), Positives = 22/32 (68%)
Query: 147 VEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE 178
+K E+ ++K+K+K K+ NKKK +++++
Sbjct: 229 SDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKK 260
Score = 28.8 bits (64), Expect = 1.6
Identities = 6/48 (12%), Positives = 20/48 (41%)
Query: 133 YTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
+ + + + +E+EE + + G+ + K ++E++
Sbjct: 178 FQLMMMKAAKNGPAAFGDEDEETEGEKGGGGRGKDLKIKDLEGDDEDD 225
Score = 28.4 bits (63), Expect = 2.6
Identities = 12/38 (31%), Positives = 23/38 (60%), Gaps = 3/38 (7%)
Query: 148 EKEEEEEKKKKNKRKG---KRNNNNKKKKNEEEEEEEE 182
E +EE+ KKK K+ K+ +++KK K +++ +E
Sbjct: 234 EDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADE 271
Score = 28.0 bits (62), Expect = 3.1
Identities = 10/41 (24%), Positives = 20/41 (48%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+++K E+E KK K +K K N K + + ++ +
Sbjct: 323 SEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSD 363
Score = 28.0 bits (62), Expect = 3.2
Identities = 12/43 (27%), Positives = 24/43 (55%), Gaps = 5/43 (11%)
Query: 145 KKVEKEEEEEKKKKNK-----RKGKRNNNNKKKKNEEEEEEEE 182
++ E EE E++K + +KGK+ K KKN ++++ +
Sbjct: 315 EQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSD 357
>gnl|CDD|150820 pfam10204, DuoxA, Dual oxidase maturation factor. DuoxA (Dual
oxidase maturation factor) is the essential protein
necessary for the final release of DUOX2 (an NADPH:O2
oxidoreductase flavoprotein) from the endoplasmic
reticulum. Dual oxidases (DUOX1 and DUOX2) constitute
the catalytic core of the hydrogen peroxide generator,
which generates H2O2 at the apical membrane of thyroid
follicular cells, essential for iodination of
thyroglobulin by thyroid peroxidases. DuoxA carries five
membrane-integral regions including a reverse
signal-anchor with external N-terminus (type III) and
two N-glycosylation sites. It is conserved from
nematodes to humans.
Length = 281
Score = 29.8 bits (67), Expect = 0.70
Identities = 12/62 (19%), Positives = 21/62 (33%), Gaps = 4/62 (6%)
Query: 52 SHFFWLVTYFLKFVVQLELDIDHVSS----VISFDIISYLSFEGINLCEHLQIVKQHKGI 107
S FWLV L + + + +S Y +F L + + G+
Sbjct: 41 SRLFWLVRVLLSLFIGAVILAVNFTSDWHVARVTTKAPYKAFSRSKLPARVGLHIGLMGV 100
Query: 108 NL 109
N+
Sbjct: 101 NI 102
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 29.8 bits (67), Expect = 0.78
Identities = 8/44 (18%), Positives = 15/44 (34%)
Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
+ R + E ++ ++ +K KK E EE
Sbjct: 1179 REKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEE 1222
Score = 28.3 bits (63), Expect = 3.2
Identities = 9/44 (20%), Positives = 17/44 (38%)
Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
R ++++ + +K + K KK +E E EE
Sbjct: 1179 REKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEE 1222
Score = 27.5 bits (61), Expect = 4.4
Identities = 11/44 (25%), Positives = 14/44 (31%)
Query: 139 RRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
R ++ E KK + K KK E E EE
Sbjct: 1179 REKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEE 1222
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 29.7 bits (67), Expect = 0.84
Identities = 15/38 (39%), Positives = 23/38 (60%)
Query: 145 KKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
K+VEKE +E++++ K + K K EE +EEEE
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEE 66
Score = 27.4 bits (61), Expect = 4.4
Identities = 10/36 (27%), Positives = 21/36 (58%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE 178
++K +KEEE +++++ + K K+ K+ E E
Sbjct: 51 EEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86
Score = 27.4 bits (61), Expect = 5.4
Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 7/40 (17%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
K E+E +EE++K+ K+K K KK +E E E
Sbjct: 53 KTTDKEEEVDEEEEKEEKKK-------KTKKVKETTTEWE 85
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 29.4 bits (66), Expect = 0.85
Identities = 11/44 (25%), Positives = 15/44 (34%), Gaps = 4/44 (9%)
Query: 143 KKKKVEKEEEEEKKKK----NKRKGKRNNNNKKKKNEEEEEEEE 182
KK K+ K ++ KK K R +KK E
Sbjct: 117 KKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAA 160
>gnl|CDD|129780 TIGR00697, TIGR00697, conserved hypothetical integral membrane
protein. All known members of this family are proteins
or 210-250 amino acids in length. Conserved regions of
hydrophobicity suggest that all members of the family
are integral membrane proteins [Hypothetical proteins,
Conserved].
Length = 202
Score = 29.2 bits (66), Expect = 0.86
Identities = 29/117 (24%), Positives = 41/117 (35%), Gaps = 31/117 (26%)
Query: 1 MKALLHYTPSEDDIA--HLLKEFTVDFL-----LKGYGRLVSQLYDLLVTD--------- 44
+ L + PS D + H F+ L Y +VSQL D+ V
Sbjct: 76 TQLHLFFIPSPGDESQTHFEALFSSSPRIALASLVAY--IVSQLLDVKVFTFLKKRTPGR 133
Query: 45 ----------LHSQ-IDTSHFFWLVTYF-LKFVVQLELDIDHVSSVISFDIISYLSF 89
L SQ IDT F ++ Y F + L+I S +I I L+
Sbjct: 134 FLWLRSNGSTLFSQLIDTFIFSFIAFYGSPPFPAENLLNIMIFSYLIKG-TIGVLAT 189
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 28.0 bits (63), Expect = 0.89
Identities = 9/13 (69%), Positives = 12/13 (92%)
Query: 170 KKKKNEEEEEEEE 182
+++K EEEEEEEE
Sbjct: 67 EEEKKEEEEEEEE 79
Score = 27.2 bits (61), Expect = 1.6
Identities = 9/13 (69%), Positives = 12/13 (92%)
Query: 170 KKKKNEEEEEEEE 182
++KK EEEEEEE+
Sbjct: 68 EEKKEEEEEEEED 80
Score = 26.8 bits (60), Expect = 2.3
Identities = 8/13 (61%), Positives = 12/13 (92%)
Query: 170 KKKKNEEEEEEEE 182
+KK+ EEEEEE++
Sbjct: 69 EKKEEEEEEEEDD 81
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 29.0 bits (65), Expect = 0.98
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNN 168
+K KK + E+++E+KKK K K K+
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKR 168
Score = 28.3 bits (63), Expect = 1.7
Identities = 13/27 (48%), Positives = 19/27 (70%)
Query: 140 RMRKKKKVEKEEEEEKKKKNKRKGKRN 166
+ KKKK E ++E +KKKK K+K K+
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKR 168
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 29.4 bits (67), Expect = 1.1
Identities = 10/44 (22%), Positives = 29/44 (65%)
Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
+R ++K++ ++++ E +K++ + + K +K++ E++EEE
Sbjct: 89 KRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEE 132
Score = 27.0 bits (61), Expect = 6.2
Identities = 16/41 (39%), Positives = 26/41 (63%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
RK + +EK EEE +KK+ + + K+ KK++ EE EE+
Sbjct: 100 RKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQ 140
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 28.8 bits (65), Expect = 1.1
Identities = 14/37 (37%), Positives = 18/37 (48%), Gaps = 5/37 (13%)
Query: 145 KKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
KK EKEE E +K+ K + KK EE E +
Sbjct: 79 KKNEKEERELRKRAEKEA-----LEQAKKEEELREAK 110
>gnl|CDD|232969 TIGR00422, valS, valyl-tRNA synthetase. The valyl-tRNA synthetase
(ValS) is a class I amino acyl-tRNA ligase and is
particularly closely related to the isoleucyl tRNA
synthetase [Protein synthesis, tRNA aminoacylation].
Length = 861
Score = 29.3 bits (66), Expect = 1.2
Identities = 12/38 (31%), Positives = 23/38 (60%)
Query: 145 KKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+++K+ ++EKK+ + +GK N KK +E E+E
Sbjct: 819 ARLQKQLDKEKKEVIRIEGKLENEGFVKKAPKEVIEKE 856
>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain.
Length = 225
Score = 28.9 bits (65), Expect = 1.2
Identities = 15/54 (27%), Positives = 27/54 (50%), Gaps = 11/54 (20%)
Query: 140 RMRKKKKVEKEEEEEKKKK--NKRKGK-------RNNNNKKKKNEEE--EEEEE 182
R R+ ++E+ +E +KKK K + N N KK+K + + +E E+
Sbjct: 118 RERRDLRIEERDEASEKKKEELIEKAQKEIDDFYENYNEKKEKTKAQNRKEAEQ 171
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association with
pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 27.6 bits (62), Expect = 1.3
Identities = 11/31 (35%), Positives = 21/31 (67%), Gaps = 2/31 (6%)
Query: 136 SLRRRMRKKKKVEKEEE--EEKKKKNKRKGK 164
+ R++ +KKK +K +E KK++ +R+GK
Sbjct: 39 NRRKKKKKKKLKKKSKEWILRKKEQMRRRGK 69
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 28.9 bits (65), Expect = 1.4
Identities = 11/40 (27%), Positives = 19/40 (47%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
++K+ EK + K K K + +K K E+EE+
Sbjct: 43 EEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLR 82
Score = 28.5 bits (64), Expect = 1.8
Identities = 10/40 (25%), Positives = 19/40 (47%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+K++ + + + K K K K K K+ +EE+ E
Sbjct: 44 EKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRE 83
Score = 27.7 bits (62), Expect = 3.6
Identities = 9/40 (22%), Positives = 22/40 (55%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ ++ E+E+ + K +K + +K+K + E+EE+
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKG 80
Score = 26.9 bits (60), Expect = 5.2
Identities = 9/40 (22%), Positives = 17/40 (42%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
K+++ K + K KK + K++ EE+ E
Sbjct: 45 KEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84
Score = 26.5 bits (59), Expect = 8.1
Identities = 12/33 (36%), Positives = 16/33 (48%)
Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEE 176
K K EK + KKKK K K K N +++
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDD 226
Score = 26.2 bits (58), Expect = 9.6
Identities = 10/40 (25%), Positives = 20/40 (50%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+++K + + + KK K K + K++K E+ E E
Sbjct: 46 EEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELE 85
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 29.0 bits (65), Expect = 1.4
Identities = 13/41 (31%), Positives = 22/41 (53%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ K++ +K+ EEE K K + K+ KKK E E + +
Sbjct: 143 KAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKA 183
Score = 28.3 bits (63), Expect = 2.7
Identities = 11/46 (23%), Positives = 20/46 (43%)
Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ + + K + E E EKK K + K + K K E +++
Sbjct: 124 AKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAA 169
Score = 26.7 bits (59), Expect = 7.9
Identities = 12/42 (28%), Positives = 23/42 (54%), Gaps = 1/42 (2%)
Query: 142 RKKKKVEKEEEEEKKKKNK-RKGKRNNNNKKKKNEEEEEEEE 182
++ ++ +K+ EE K K+ K K +KK EE +++ E
Sbjct: 112 KQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAE 153
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 28.8 bits (65), Expect = 1.4
Identities = 20/56 (35%), Positives = 38/56 (67%), Gaps = 4/56 (7%)
Query: 124 VRLFNQHKIYTTSLRRRMRKKKKVEKE----EEEEKKKKNKRKGKRNNNNKKKKNE 175
V++ + K+ +L+R+ ++KKK +KE +E+ +KKK +R+ KR N KK+K++
Sbjct: 135 VKVKDDEKLLKKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDD 190
Score = 26.1 bits (58), Expect = 9.9
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+++ K E+ + +KK KRK + + +K EE + EE
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEE 47
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 28.9 bits (65), Expect = 1.5
Identities = 12/35 (34%), Positives = 18/35 (51%)
Query: 148 EKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ E K + KRK K N +K++ E E E +E
Sbjct: 264 SEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKE 298
Score = 28.5 bits (64), Expect = 1.8
Identities = 11/32 (34%), Positives = 21/32 (65%)
Query: 131 KIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRK 162
K T + R + +++K++E+E +EEK+ K K
Sbjct: 276 KRKTKAQRNKEKRRKELEREAKEEKQLKKKLA 307
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 28.0 bits (62), Expect = 1.5
Identities = 15/39 (38%), Positives = 21/39 (53%)
Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
K+K E EE+E K G N N ++ ++E EEEE
Sbjct: 17 KEKKEVVEEKENGKNAPANGNENEENGAQEGDDEMEEEE 55
Score = 26.8 bits (59), Expect = 3.5
Identities = 13/42 (30%), Positives = 22/42 (52%)
Query: 141 MRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+++KK+V +E+E K N ++ +E EEEEE
Sbjct: 16 LKEKKEVVEEKENGKNAPANGNENEENGAQEGDDEMEEEEEV 57
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 27.7 bits (62), Expect = 1.5
Identities = 10/35 (28%), Positives = 17/35 (48%), Gaps = 12/35 (34%)
Query: 148 EKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
EEE++++ +++ EEEEE EE
Sbjct: 72 AAAAEEEEEEE------------EEEEEEEEESEE 94
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 29.2 bits (65), Expect = 1.6
Identities = 14/35 (40%), Positives = 24/35 (68%), Gaps = 9/35 (25%)
Query: 148 EKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
E+EEEEE++++ + ++++ EEEEEEEE
Sbjct: 864 EEEEEEEEEEEEE---------EEEEEEEEEEEEE 889
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 28.2 bits (63), Expect = 1.6
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 146 KVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
+ E+E+ E K ++K K KR K + E+ + E
Sbjct: 108 ETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142
>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal. This domain
is specific to the N-terminal part of the prp1 splicing
factor, which is involved in mRNA splicing (and possibly
also poly(A)+ RNA nuclear export and cell cycle
progression). This domain is specific to the N terminus
of the RNA splicing factor encoded by prp1. It is
involved in mRNA splicing and possibly also poly(A)and
RNA nuclear export and cell cycle progression.
Length = 131
Score = 28.0 bits (63), Expect = 1.7
Identities = 10/45 (22%), Positives = 20/45 (44%), Gaps = 8/45 (17%)
Query: 143 KKKKVEKEEEE--------EKKKKNKRKGKRNNNNKKKKNEEEEE 179
K + E+EE +++ +RK +R K++ + EE
Sbjct: 52 SDGKYDDEDEEADRIYESIDERMDERRKKRREQKEKEEIEKYREE 96
Score = 26.1 bits (58), Expect = 7.8
Identities = 14/36 (38%), Positives = 23/36 (63%), Gaps = 3/36 (8%)
Query: 131 KIYTTSLRRRM--RKKKKVEKEEEEEKKKKNKRKGK 164
+IY S+ RM R+KK+ E++E+EE +K + K
Sbjct: 65 RIYE-SIDERMDERRKKRREQKEKEEIEKYREENPK 99
>gnl|CDD|178689 PLN03144, PLN03144, Carbon catabolite repressor protein 4 homolog;
Provisional.
Length = 606
Score = 28.9 bits (65), Expect = 1.8
Identities = 8/13 (61%), Positives = 11/13 (84%)
Query: 7 YTPSEDDIAHLLK 19
YTP+ DD+ H+LK
Sbjct: 183 YTPTADDVGHVLK 195
>gnl|CDD|237200 PRK12778, PRK12778, putative bifunctional 2-polyprenylphenol
hydroxylase/glutamate synthase subunit beta;
Provisional.
Length = 752
Score = 28.9 bits (65), Expect = 1.9
Identities = 15/66 (22%), Positives = 24/66 (36%), Gaps = 12/66 (18%)
Query: 127 FNQHKIYTTSLRRRMRKKKKVEKEE-----------EEEKKKKNKRKGKRNNNNKKKKNE 175
F+ H + + +RM K +E EE E +K K K +R + E
Sbjct: 243 FDGHLVDFDEMLKRMGAYKTIEGEELLKLEERTAAWRAELRKSMKPK-ERTAIERVPMPE 301
Query: 176 EEEEEE 181
+ E
Sbjct: 302 LDPEYR 307
>gnl|CDD|218270 pfam04795, PAPA-1, PAPA-1-like conserved region. Family of
proteins with a conserved region found in PAPA-1, a
PAP-1 binding protein.
Length = 89
Score = 27.3 bits (61), Expect = 1.9
Identities = 13/46 (28%), Positives = 25/46 (54%), Gaps = 5/46 (10%)
Query: 142 RKKKKVEKEEEEEKKKKNKR-----KGKRNNNNKKKKNEEEEEEEE 182
R+K + EK EEEK + + ++ +K KK +E++ +E+
Sbjct: 12 RRKNQSEKRAEEEKMETINKLLKKQARRKKREDKIKKGDEKKAQEK 57
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 28.6 bits (64), Expect = 2.2
Identities = 15/39 (38%), Positives = 24/39 (61%), Gaps = 7/39 (17%)
Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
++ + EEE E+K+K KRK + KK E+E+E+E
Sbjct: 278 GERSDSEEETEEKEKEKRK-------RLKKMMEDEDEDE 309
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 28.6 bits (64), Expect = 2.2
Identities = 16/64 (25%), Positives = 29/64 (45%), Gaps = 7/64 (10%)
Query: 125 RLFNQHKIYT------TSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE 178
RL N + L + + + K +K + ++K KK +K + ++ K EE E
Sbjct: 118 RLLNSKPVKNVLESIMEGLDKILGIETKAKKGKAKKKTKK-SKKKEAKESSDKDDEEESE 176
Query: 179 EEEE 182
E+E
Sbjct: 177 SEDE 180
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 28.6 bits (65), Expect = 2.3
Identities = 8/34 (23%), Positives = 15/34 (44%)
Query: 146 KVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEE 179
+++ EEE K+ +K+K E+ E
Sbjct: 509 RIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
>gnl|CDD|239618 cd03556, L-fucose_isomerase, L-fucose isomerase (FucIase); FucIase
converts L-fucose, an aldohexose, to its ketose form,
which prepares it for aldol cleavage (similar to the
isomerization of glucose during glycolysis). L-fucose
(or 6-deoxy-L-galactose) is found in blood group
determinants as well as in various oligo- and
polysaccharides, and glycosides in mammals, bacteria and
plants.
Length = 584
Score = 28.6 bits (64), Expect = 2.3
Identities = 14/51 (27%), Positives = 28/51 (54%), Gaps = 6/51 (11%)
Query: 135 TSLRRRMRKKKKVEKEEEEEKK----KKNKRKGKRNNNNKKKKNEEEEEEE 181
T + RRM +++ EE EK K+N ++G+ N +K E+++++
Sbjct: 205 TEIIRRM--DEEIYDPEEYEKALAWTKENCKEGEDENEKNDQKKREQKDKD 253
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 28.5 bits (63), Expect = 2.4
Identities = 16/47 (34%), Positives = 26/47 (55%)
Query: 136 SLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ + KK+KV++EEEE+K K K+ KK K ++EE+
Sbjct: 524 DVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQ 570
Score = 26.6 bits (58), Expect = 9.5
Identities = 15/47 (31%), Positives = 22/47 (46%), Gaps = 6/47 (12%)
Query: 139 RRMRKKKKVEKEEEEEKKKK------NKRKGKRNNNNKKKKNEEEEE 179
+ + KK+ EEEEEKK K ++K + K EE+ E
Sbjct: 526 NKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAE 572
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 28.1 bits (63), Expect = 2.6
Identities = 10/46 (21%), Positives = 25/46 (54%)
Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
L++ + + + + + K K + K + K+KK E+++EE++
Sbjct: 296 LKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKK 341
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 27.8 bits (62), Expect = 2.6
Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 7/50 (14%)
Query: 140 RMRKKKKVEKEEEEEKKKKNKRKGKR-------NNNNKKKKNEEEEEEEE 182
+ ++K + EEEE+ +K K+ + + + +K+EEEEEEEE
Sbjct: 134 KFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEE 183
>gnl|CDD|220644 pfam10238, Eapp_C, E2F-associated phosphoprotein. This entry
represents the conserved C-terminal portion of an E2F
binding protein. E2F transcription factors play an
essential role in cell proliferation and apoptosis and
their activity is frequently deregulated in human
cancers. E2F activity is regulated by a variety of
mechanisms, frequently mediated by proteins binding to
individual members or a subgroup of the family. EAPP
interacts with a subset of E2F factors and influences
E2F-dependent promoter activity. EAPP is present
throughout the cell cycle but disappears during mitosis.
Length = 128
Score = 27.3 bits (61), Expect = 2.6
Identities = 13/54 (24%), Positives = 23/54 (42%)
Query: 129 QHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+H+IY R KV++E+ K N++K +RN+ + E
Sbjct: 45 RHEIYKNQYRAMFAINCKVDEEKVLYSKPINRKKRRRNSKKNELNPNNTATSEI 98
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 27.7 bits (62), Expect = 2.7
Identities = 15/45 (33%), Positives = 30/45 (66%)
Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ R ++++ E+E EE KK + KR+G+R + ++ E+++EEE
Sbjct: 15 QARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEE 59
>gnl|CDD|219543 pfam07733, DNA_pol3_alpha, Bacterial DNA polymerase III alpha
subunit.
Length = 384
Score = 28.2 bits (64), Expect = 2.7
Identities = 11/19 (57%), Positives = 12/19 (63%)
Query: 137 LRRRMRKKKKVEKEEEEEK 155
LRR M KKK E E+ EK
Sbjct: 357 LRRAMGKKKPEEMEKLREK 375
>gnl|CDD|221144 pfam11595, DUF3245, Protein of unknown function (DUF3245). This is
a family of proteins conserved in fungi. The function is
not known, and there is no S. cerevisiae member.
Length = 145
Score = 27.6 bits (61), Expect = 2.8
Identities = 14/53 (26%), Positives = 29/53 (54%), Gaps = 7/53 (13%)
Query: 137 LRRRMRKK--KKVEKEEEEEKKKKNKRKGKR-----NNNNKKKKNEEEEEEEE 182
LRR++ K KKV E+E+ + KRK ++K +++++++E+
Sbjct: 76 LRRQLLGKNYKKVMAEKEKAEGGPVKRKAAVVAKEAKQSSKGVGDDDDDDDED 128
>gnl|CDD|222918 PHA02687, PHA02687, ORF061 late transcription factor VLTF-4;
Provisional.
Length = 231
Score = 27.7 bits (61), Expect = 2.9
Identities = 15/47 (31%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 138 RRRMRKKKKVEKEEEEEK--KKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+RR +KK K +K E+ K +K + + N +K+ EE + EE
Sbjct: 87 KRRTKKKAKADKPEKSPKAVEKLCPPDDRDDKNEEKEPTEEAQRNEE 133
>gnl|CDD|236637 PRK09970, PRK09970, xanthine dehydrogenase subunit XdhA;
Provisional.
Length = 759
Score = 28.1 bits (63), Expect = 3.0
Identities = 12/64 (18%), Positives = 27/64 (42%), Gaps = 5/64 (7%)
Query: 105 KGINLDPY-VRRLHLIIHTMVRLFNQHKIYTTSLRRRMRKKKKV----EKEEEEEKKKKN 159
+ +DP R + + +IY+ L + K +K+ ++ E + ++ N
Sbjct: 378 TALGIDPVEFRLRNAAREGDANPLSGKRIYSAGLPECLEKGRKIFEWDKRRAECKNQQGN 437
Query: 160 KRKG 163
R+G
Sbjct: 438 LRRG 441
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 27.4 bits (61), Expect = 3.1
Identities = 9/44 (20%), Positives = 22/44 (50%)
Query: 139 RRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
++ +KK+ + KK K+ ++ +KK ++++ EE
Sbjct: 49 KQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEE 92
Score = 27.4 bits (61), Expect = 3.6
Identities = 12/47 (25%), Positives = 24/47 (51%)
Query: 136 SLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
S ++ +K+ ++ KK K K K K KK ++++++ EE
Sbjct: 47 SPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEEN 93
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 27.8 bits (62), Expect = 3.3
Identities = 12/40 (30%), Positives = 18/40 (45%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
K K+ K E++ KK K K K K K K + ++
Sbjct: 83 KPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPS 122
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 27.4 bits (61), Expect = 3.8
Identities = 12/43 (27%), Positives = 25/43 (58%), Gaps = 2/43 (4%)
Query: 141 MRKKKKVEKEEEEEKKKKN--KRKGKRNNNNKKKKNEEEEEEE 181
+RKK+K +KE+ + +KK++ ++ G + +K K + E
Sbjct: 161 LRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSE 203
Score = 26.6 bits (59), Expect = 7.6
Identities = 10/32 (31%), Positives = 18/32 (56%)
Query: 151 EEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
EE EK ++ ++K K + + E+E+EE
Sbjct: 24 EEYEKLEELRKKLKAKKEDDSDDTDYEDEDEE 55
>gnl|CDD|233536 TIGR01695, mviN, integral membrane protein MviN. This model
represents MviN, a family of integral membrane proteins
predicted to have ten or more transmembrane regions.
Although frequently listed as a virulence protein, it is
not restricted to pathogens and it is an essential
protein in Sinorhizobium meliloti. In a number of
species its gene is adjacent to that of the
uridylyltransferase GlnD , the signal-transducing enzyme
that performs the key modification to the nitrogen
regulatory protein PII [Unknown function, General].
Length = 502
Score = 27.7 bits (62), Expect = 4.4
Identities = 23/75 (30%), Positives = 33/75 (44%), Gaps = 3/75 (4%)
Query: 18 LKEFTVDFLLKGYGRLVSQLYDLLVTDLHSQIDTSHFFWLVTYFLKFVVQLELDIDHVS- 76
LK F FL G SQ+ L+ T L S ++ L Y+ + QL L I +S
Sbjct: 222 LKRFLKLFLPTTLGSSASQITLLINTALASFLEIGSVSAL--YYANRIYQLPLGIFGISL 279
Query: 77 SVISFDIISYLSFEG 91
S + +S + EG
Sbjct: 280 STVLLPKLSRHASEG 294
>gnl|CDD|223587 COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA
replication, recombination, and repair / Transcription /
Translation, ribosomal structure and biogenesis].
Length = 513
Score = 27.4 bits (61), Expect = 4.5
Identities = 27/128 (21%), Positives = 44/128 (34%), Gaps = 21/128 (16%)
Query: 65 VVQLELDIDHVSSVISFDI----ISYLSFEG-----------INLCEHLQIVKQHKGINL 109
V LDI VS VI++D+ Y+ G I+ + VK+ K I
Sbjct: 332 VAARGLDIPDVSHVINYDLPLDPEDYVHRIGRTGRAGRKGVAISFVTEEEEVKKLKRI-- 389
Query: 110 DPYVRRLHLIIHTMVRLFNQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNN 169
+RL + + V L + L + R + E + +E KK K
Sbjct: 390 ---EKRLERKLPSAVLLPLD-EPEDAKLLKTTRPGLEEESDISDEIKKLKSSKKALLRGL 445
Query: 170 KKKKNEEE 177
+ +
Sbjct: 446 GVRFTLSK 453
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 26.3 bits (58), Expect = 4.5
Identities = 14/35 (40%), Positives = 20/35 (57%), Gaps = 11/35 (31%)
Query: 148 EKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
EE+ E+K++ +KKK EE+EEEEE
Sbjct: 73 AAEEKAEEKEE-----------EKKKEEEKEEEEE 96
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 27.5 bits (61), Expect = 4.8
Identities = 9/27 (33%), Positives = 14/27 (51%)
Query: 139 RRMRKKKKVEKEEEEEKKKKNKRKGKR 165
+ + KEE E+K+K K+ KR
Sbjct: 341 DDEDEDDEESKEEVEKKQKVKKKPRKR 367
>gnl|CDD|148072 pfam06244, DUF1014, Protein of unknown function (DUF1014). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 122
Score = 26.6 bits (59), Expect = 4.9
Identities = 12/45 (26%), Positives = 20/45 (44%)
Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
R ++++K+E EE+ KK + N N + EE E
Sbjct: 6 EERLEKEQRKLEAPEEKSKKNAAREVPLEENLNVNRNRLEEGTVE 50
>gnl|CDD|217838 pfam04004, Leo1, Leo1-like protein. Members of this family are
part of the Paf1/RNA polymerase II complex. The Paf1
complex probably functions during the elongation phase
of transcription. The Leo1 subunit of the yeast
Paf1-complex binds RNA and contributes to complex
recruitment. The subunit acts by co-ordinating
co-transcriptional chromain modifications and helping
recruitment of mRNA 3prime-end processing factors.
Length = 312
Score = 27.1 bits (60), Expect = 4.9
Identities = 14/68 (20%), Positives = 24/68 (35%), Gaps = 16/68 (23%)
Query: 128 NQHKIYTTSLRRRMRKKKKV----------------EKEEEEEKKKKNKRKGKRNNNNKK 171
H+ T +L R RK V EK+EE++ + + +R+ + NK
Sbjct: 207 FTHRKLTRNLATRQRKSGGVKITIVGKDPEHEKREREKKEEQKLRARRRRQNREKMKNKP 266
Query: 172 KKNEEEEE 179
Sbjct: 267 PNRPGHGS 274
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 27.3 bits (61), Expect = 5.0
Identities = 12/35 (34%), Positives = 19/35 (54%), Gaps = 3/35 (8%)
Query: 142 RKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEE 176
R +KK + +EE +KKK + KG + KK +
Sbjct: 247 RAEKKRKSKEEIKKKKPKESKGVK---ALKKVVAK 278
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 27.1 bits (60), Expect = 5.1
Identities = 13/35 (37%), Positives = 17/35 (48%)
Query: 145 KKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEE 179
K+V + E EK K NK KG NK ++ E
Sbjct: 10 KRVVRALESEKYKANKDKGNPEIYNKINSQDKAIE 44
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 27.0 bits (60), Expect = 5.1
Identities = 9/45 (20%), Positives = 23/45 (51%), Gaps = 4/45 (8%)
Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
+++ ++++VE+ E + + + K K K+ ++ E EE
Sbjct: 120 QKQKEEEERRVERRRELGLEDPEQLRLK----QKAKEEQKAESEE 160
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 27.5 bits (61), Expect = 5.2
Identities = 15/54 (27%), Positives = 29/54 (53%), Gaps = 3/54 (5%)
Query: 131 KIYTTSLRRRMRKKKKVEKEEEEEKKKKNK---RKGKRNNNNKKKKNEEEEEEE 181
K Y L+ +K +V+ E + ++ + K K K+ N KK+K + +E++E
Sbjct: 372 KEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKE 425
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 26.1 bits (58), Expect = 5.3
Identities = 8/34 (23%), Positives = 17/34 (50%), Gaps = 11/34 (32%)
Query: 148 EKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
EE+K+++ ++++ +EE EEE
Sbjct: 74 AAAAEEKKEEE-----------EEEEEKEESEEE 96
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 27.5 bits (61), Expect = 5.6
Identities = 12/52 (23%), Positives = 22/52 (42%)
Query: 128 NQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEE 179
N K + L + +K K + E+ ++K NK K+ K+E +
Sbjct: 101 NNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKY 152
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 26.6 bits (59), Expect = 5.6
Identities = 15/45 (33%), Positives = 27/45 (60%)
Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ R K K +KEE E++K+K +R K +K++ E E+++ E
Sbjct: 70 EKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAE 114
Score = 26.6 bits (59), Expect = 5.7
Identities = 18/50 (36%), Positives = 27/50 (54%), Gaps = 4/50 (8%)
Query: 137 LRRRMRKKKKVEKEEEEEKKKKNKRKG----KRNNNNKKKKNEEEEEEEE 182
LRR K KK E E+E+EK+++ + K +KKK E + +EE
Sbjct: 72 LRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKEE 121
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 27.0 bits (60), Expect = 5.9
Identities = 12/34 (35%), Positives = 17/34 (50%)
Query: 149 KEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
E K KK K+K K+ +K +E+E E E
Sbjct: 268 DVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEA 301
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 27.4 bits (61), Expect = 6.0
Identities = 12/44 (27%), Positives = 19/44 (43%)
Query: 134 TTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEE 177
RR++ KK E + +KK K K + + K KK +
Sbjct: 750 KGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAK 793
>gnl|CDD|233534 TIGR01693, UTase_glnD, [Protein-PII] uridylyltransferase. This
model describes GlnD, the
uridylyltransferase/uridylyl-removing enzyme for the
nitrogen regulatory protein PII. Not all homologs of
PII share the property of uridylyltransferase
modification on the characteristic Tyr residue (see
Prosite pattern PS00496 and document PDOC00439), but
the modification site is preserved in the PII homolog
of all species with a member of this family [Central
intermediary metabolism, Nitrogen metabolism,
Regulatory functions, Protein interactions].
Length = 850
Score = 27.4 bits (61), Expect = 6.2
Identities = 14/50 (28%), Positives = 19/50 (38%), Gaps = 7/50 (14%)
Query: 29 GYGR----LVSQLYDLLVTDLHSQIDTSHFFWLVTYFLKFVVQLELDIDH 74
GYGR S + DLL LH + FL + L ++ H
Sbjct: 51 GYGRGELAPYSDI-DLLF--LHDGKPAEEVEPKIERFLYPLWDLGFEVGH 97
>gnl|CDD|224652 COG1738, yhhQ, Uncharacterized member of the PurR regulon [General
function prediction only].
Length = 233
Score = 26.9 bits (60), Expect = 6.2
Identities = 29/103 (28%), Positives = 40/103 (38%), Gaps = 13/103 (12%)
Query: 1 MKALLHYTPSEDDIA--HLLKEFTVDFLL---KGYGRLVSQLYDLLVTDLHSQIDTSHFF 55
+ LH+ PS D A L F+ + +VSQL D+ V + +
Sbjct: 99 TQIALHFPPSGSDEAQEALAALFSFVPRIALASLLAYIVSQLLDVWVFNRLKRRTGGKSL 158
Query: 56 WLVTYFLKFVVQLELDIDHVSSVISFDIISYLSFEG--INLCE 96
WL V QL ID +VI F I Y+S +G E
Sbjct: 159 WLRPNASTLVSQL---ID---TVIFFFIAFYVSPDGFMAEWIE 195
>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
Length = 472
Score = 26.9 bits (60), Expect = 6.2
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 149 KEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEE 180
+ EE+EK+KK K++GK + EE E
Sbjct: 392 EREEKEKEKKAKKEGKEERRIHFQNKSIEEAE 423
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 27.1 bits (60), Expect = 6.3
Identities = 13/61 (21%), Positives = 35/61 (57%)
Query: 122 TMVRLFNQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEE 181
+V +N+ + S +R ++KK E+++ EE ++K + +R +K++ +E+++
Sbjct: 59 AVVEQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKK 118
Query: 182 E 182
+
Sbjct: 119 Q 119
>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal. The C-terminal domain of
FCP-1 is required for interaction with the carboxy
terminal domain of RAP74. Interaction relies extensively
on van der Waals contacts between hydrophobic residues
situated within alpha-helices in both domains.
Length = 263
Score = 26.9 bits (59), Expect = 6.4
Identities = 13/64 (20%), Positives = 25/64 (39%), Gaps = 20/64 (31%)
Query: 139 RRMRKKKKVEKEEEEEKKKKNKRKGKRNNN--------------------NKKKKNEEEE 178
+ + E++E+ + +K + G R +K+K NEE+E
Sbjct: 166 EKKKPPGNEEEQEQAPQPRKQQPPGPRREPDEGTPSSSERSNIGGRGPRGHKRKLNEEDE 225
Query: 179 EEEE 182
E+ E
Sbjct: 226 EDAE 229
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 27.2 bits (60), Expect = 6.5
Identities = 9/47 (19%), Positives = 22/47 (46%)
Query: 136 SLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+ +K K ++EE +++ K + + + +EE+ +EE
Sbjct: 788 KVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEE 834
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 27.1 bits (60), Expect = 6.9
Identities = 13/26 (50%), Positives = 18/26 (69%), Gaps = 1/26 (3%)
Query: 140 RMRKKKKVEKEEEEEKKKKNKRKGKR 165
RKKKK K+ +E+KKKKN+ K+
Sbjct: 682 GSRKKKKK-KKSKEKKKKKNREASKQ 706
>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family. This
family includes the HABP4 family of hyaluronan-binding
proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
HABP4 has been observed to bind hyaluronan (a
glucosaminoglycan), but it is not known whether this is
its primary role in vivo. It has also been observed to
bind RNA, but with a lower affinity than that for
hyaluronan. PAI-1 mRNA-binding protein specifically
binds the mRNA of type-1 plasminogen activator inhibitor
(PAI-1), and is thought to be involved in regulation of
mRNA stability. However, in both cases, the sequence
motifs predicted to be important for ligand binding are
not conserved throughout the family, so it is not known
whether members of this family share a common function.
Length = 106
Score = 25.8 bits (57), Expect = 6.9
Identities = 13/33 (39%), Positives = 19/33 (57%)
Query: 150 EEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
EEEE+ + + K+ + N + EEE EEEE
Sbjct: 43 EEEEKNEVEEKQAVEEEANKEGVVEEEEVEEEE 75
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region covers
both the Brf homology II and III regions. This region is
involved in binding TATA binding protein.
Length = 95
Score = 25.7 bits (57), Expect = 7.6
Identities = 11/36 (30%), Positives = 17/36 (47%), Gaps = 1/36 (2%)
Query: 137 LRRRMRKKKKVEKEEEEE-KKKKNKRKGKRNNNNKK 171
L + K+ K + +E KKK KRK K+ +
Sbjct: 29 LEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64
>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
peptidyl-prolyl isomerase. This family is only found at
the amino terminus of pfam00254. This domain is of
unknown function.
Length = 124
Score = 26.0 bits (58), Expect = 7.6
Identities = 10/41 (24%), Positives = 22/41 (53%), Gaps = 2/41 (4%)
Query: 134 TTSLRRRMRKKKKVEKEEEEEKKKKNKRKGK--RNNNNKKK 172
+L+ +K + ++ + E+K ++NK G+ N KK+
Sbjct: 71 QKALQAFQKKLQAKQQAKAEKKAEENKAAGEAFLAENAKKE 111
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 26.8 bits (60), Expect = 7.8
Identities = 13/45 (28%), Positives = 20/45 (44%)
Query: 138 RRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
K+ + EKE E +KK ++K K+ K + EE E
Sbjct: 74 YEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLE 118
>gnl|CDD|233221 TIGR00985, 3a0801s04tom, mitochondrial import receptor subunit
translocase of outer membrane 20 kDa subunit.
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 148
Score = 26.0 bits (57), Expect = 7.9
Identities = 9/26 (34%), Positives = 14/26 (53%)
Query: 133 YTTSLRRRMRKKKKVEKEEEEEKKKK 158
+ LRRR +K+ EK+ K+K
Sbjct: 37 FRKKLRRRRKKQAGAEKQYGGLAKEK 62
>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
Length = 1176
Score = 26.8 bits (60), Expect = 8.3
Identities = 13/50 (26%), Positives = 22/50 (44%)
Query: 133 YTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
Y S + K + E E + K + ++ NN + + EE+EEE
Sbjct: 829 YKKSKGYVLVIKLENGGGLELEGEFSEKEEAEKFYNNLIEVEKVEEKEEE 878
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 26.6 bits (59), Expect = 8.5
Identities = 10/31 (32%), Positives = 15/31 (48%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKK 173
KK+K +E + K+ K K K N +K
Sbjct: 5 KKRKARQELAVQVAKQAKAKKKANKKKRKIY 35
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 26.6 bits (59), Expect = 8.7
Identities = 14/40 (35%), Positives = 26/40 (65%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
K+K E +EEEE + + + K ++ N K++E+ ++EEE
Sbjct: 447 FKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEE 486
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 26.6 bits (59), Expect = 8.9
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 147 VEKEEEEEKKKKNKRKGKRNNNNKKKK 173
E+ E +KKK + K K + N+KK +
Sbjct: 146 CERTEILAEKKKPRSKKKSSKNSKKLR 172
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 26.0 bits (58), Expect = 9.2
Identities = 10/45 (22%), Positives = 23/45 (51%), Gaps = 3/45 (6%)
Query: 139 RRMRKKKKVEKEEEEEKK---KKNKRKGKRNNNNKKKKNEEEEEE 180
+ +++ + +++E K K K+ K +KKK ++EE +
Sbjct: 52 KDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96
Score = 26.0 bits (58), Expect = 9.3
Identities = 9/30 (30%), Positives = 15/30 (50%)
Query: 144 KKKVEKEEEEEKKKKNKRKGKRNNNNKKKK 173
K EK E++ +K K K + K+ + K
Sbjct: 68 LKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
Score = 26.0 bits (58), Expect = 9.7
Identities = 10/53 (18%), Positives = 23/53 (43%)
Query: 113 VRRLHLIIHTMVRLFNQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKR 165
+ L+ + + L+ + +KK EK + + +KKK K++ +
Sbjct: 44 ITDLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 26.2 bits (57), Expect = 9.6
Identities = 18/73 (24%), Positives = 32/73 (43%), Gaps = 9/73 (12%)
Query: 110 DPYVRRLHLIIHTMVRLFNQHKIYTTSLRRRMRKKKKVEKEEEEEKKKKNKRKGKRNNNN 169
P+ RLH N ++ +SL + K K ++E+ K + + +
Sbjct: 166 QPWPERLH---------NNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTE 216
Query: 170 KKKKNEEEEEEEE 182
+K+ EE+EEEE
Sbjct: 217 EKQPQEEQEEEEV 229
>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
family. Members of this protein family occur in
Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
related Mycoplasmas in small paralogous families that
may also include truncated forms and/or pseudogenes.
Members are predicted lipoproteins with a conserved
signal peptidase II processing and lipid attachment
site. Note that the name for certain characterized
members, p72, reflects an anomalous apparent molecular
weight, given a theoretical MW of about 61 kDa.
Length = 541
Score = 26.3 bits (58), Expect = 9.7
Identities = 12/40 (30%), Positives = 19/40 (47%)
Query: 143 KKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
KK +++ E K K + + NN K N E ++EE
Sbjct: 36 KKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEEN 75
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 26.3 bits (59), Expect = 9.8
Identities = 7/40 (17%), Positives = 19/40 (47%)
Query: 139 RRMRKKKKVEKEEEEEKKKKNKRKGKRNNNNKKKKNEEEE 178
+ M+++ K +++E K +K + K+ K + +
Sbjct: 50 KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89
>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 881
Score = 26.4 bits (58), Expect = 9.9
Identities = 10/30 (33%), Positives = 18/30 (60%)
Query: 153 EEKKKKNKRKGKRNNNNKKKKNEEEEEEEE 182
+E+ + + K N +KKK+ EE+ EE+
Sbjct: 13 DEQSQISPEKQTPNKKDKKKEEEEQLSEED 42
>gnl|CDD|206307 pfam14138, COX16, Cytochrome c oxidase assembly protein COX16.
This family represents homologues of COX16 which has
been shown to be involved in assembly of cytochrome
oxidase. Protein in this family are typically between
106 and 134 amino acids in length.
Length = 79
Score = 24.8 bits (55), Expect = 10.0
Identities = 12/28 (42%), Positives = 15/28 (53%)
Query: 135 TSLRRRMRKKKKVEKEEEEEKKKKNKRK 162
T LR R +K + EEEE K KR+
Sbjct: 22 TQLRYERRDRKVQQLEEEEALKLLKKRR 49
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 26.2 bits (58), Expect = 10.0
Identities = 9/21 (42%), Positives = 17/21 (80%)
Query: 138 RRRMRKKKKVEKEEEEEKKKK 158
R R+K++ +++EEEE+K+K
Sbjct: 199 REEERRKQRKKQQEEEERKQK 219
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.137 0.386
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,641,424
Number of extensions: 937985
Number of successful extensions: 6644
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5369
Number of HSP's successfully gapped: 759
Length of query: 182
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 91
Effective length of database: 6,901,388
Effective search space: 628026308
Effective search space used: 628026308
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 56 (25.2 bits)