RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14918
(463 letters)
>gnl|CDD|216557 pfam01534, Frizzled, Frizzled/Smoothened family membrane region.
This family contains the membrane spanning region of
frizzled and smoothened receptors. This membrane region
is predicted to contain seven transmembrane alpha
helices. Proteins related to Drosophila frizzled are
receptors for Wnt (mediating the beta-catenin signalling
pathway), but also the planar cell polarity (PCP)
pathway and the Wnt/calcium pathway. The predominantly
alpha-helical Cys-rich ligand-binding region (CRD) of
Frizzled is both necessary and sufficient for Wnt
binding. The smoothened receptor mediates hedgehog
signalling.
Length = 328
Score = 423 bits (1090), Expect = e-147
Identities = 170/331 (51%), Positives = 223/331 (67%), Gaps = 20/331 (6%)
Query: 47 FFSQPEREFASMWIFVMSALCCISTLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSIGY 106
FS+ E+ FA WI SALC +STL T+ TFLID +RF+YPERPI++LS CY +VS+GY
Sbjct: 2 LFSRDEKRFARSWIGWWSALCFLSTLFTVLTFLIDWKRFRYPERPIIYLSACYLIVSLGY 61
Query: 107 LIRVSVGHEEVAC-----EDQMVRYGASGPTPCTLVFLLVYFFGMSSYIWWVILSFTWFL 161
LI +G E VAC + V G++ CT++FLLVYFFGM+S +WWVIL+ TWFL
Sbjct: 62 LIGFVLGRESVACRKADGGMRTVTQGSTENLSCTVLFLLVYFFGMASSVWWVILTLTWFL 121
Query: 162 AAGLKWGNEAIASYSLYFHLAAWLIPTVKSVSVIIMSAVDGDTIAGICYVGNHNIHNLKY 221
AAGLKWG+EAI + S YFHL AW +P V +++V+ ++ VDGD ++GIC+VGN N+ L+
Sbjct: 122 AAGLKWGHEAIEAKSSYFHLVAWGLPAVLTITVLALNKVDGDELSGICFVGNLNLDALRG 181
Query: 222 FVVFPLIIYLILGIIFLFGGFISLFRIRNVIKNQGGISNRTKADKLEKLMIRIGIFSVLS 281
FV+ PL +YL++G FL GF+SLFRIR+VIK G T KLEKLM+RIG+FS+L
Sbjct: 182 FVLAPLCLYLVIGTSFLLAGFVSLFRIRSVIKTDG-----TNTSKLEKLMVRIGVFSILY 236
Query: 282 TLPASIVIGCLIYESYVYNEYVESIV-CPCSEKLDKDYFK---------PLHVVLLLKYF 331
T+PA IVI C YE +E+ S C C + +K P V +LKYF
Sbjct: 237 TVPALIVIACYFYEFANRDEWERSWRDCICCQYSIPCPYKDKSGDPEARPPLAVFMLKYF 296
Query: 332 MALAVGLTSGVWIWSGKTFSSWKKLWNRLFG 362
M+L VG+TSGVW+WS KT SW++ RL G
Sbjct: 297 MSLVVGITSGVWVWSKKTLESWRRFIRRLCG 327
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 50.9 bits (122), Expect = 2e-07
Identities = 26/80 (32%), Positives = 37/80 (46%), Gaps = 18/80 (22%)
Query: 394 PPPPNIPIPAVPVM-----GGGA-------------HSMTSSLKQNPYSSHVHKNERKKK 435
P P +P P M G GA S + + + V + E+K+K
Sbjct: 98 VPRPPVPQPKGLKMRFFPTGYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEK 157
Query: 436 KKKKKKKKKKKKKKKKKKKK 455
KKKK+ KK+KK+KK KK+K
Sbjct: 158 KKKKEVKKEKKEKKDKKEKM 177
Score = 31.2 bits (71), Expect = 0.74
Identities = 12/19 (63%), Positives = 12/19 (63%)
Query: 428 HKNERKKKKKKKKKKKKKK 446
K K KKKKKKKKKK
Sbjct: 175 EKMVEPKGSKKKKKKKKKK 193
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 42.4 bits (100), Expect = 2e-05
Identities = 15/23 (65%), Positives = 17/23 (73%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKK 455
K KK KKKKKKKKKK K K++
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEE 35
Score = 41.3 bits (97), Expect = 7e-05
Identities = 16/23 (69%), Positives = 16/23 (69%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKK 455
K K KK KKKKKKKKKK K K
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSK 33
Score = 40.1 bits (94), Expect = 2e-04
Identities = 14/23 (60%), Positives = 16/23 (69%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKK 455
KK KKKKKKKKKK K K++
Sbjct: 14 GKKIDVKKKKKKKKKKNKSKEEV 36
Score = 40.1 bits (94), Expect = 2e-04
Identities = 14/27 (51%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + KKKKKKKKKK K K++ +K++
Sbjct: 16 KIDVKKKKKKKKKKNKSKEEVVTEKEE 42
Score = 39.7 bits (93), Expect = 2e-04
Identities = 13/27 (48%), Positives = 18/27 (66%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++ KKKKKKKKKK K K++ +
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTE 39
Score = 39.3 bits (92), Expect = 3e-04
Identities = 16/26 (61%), Positives = 16/26 (61%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKKWS 457
K K K KK KKKKKKKKKK S
Sbjct: 7 GGKLKLKGKKIDVKKKKKKKKKKNKS 32
Score = 35.5 bits (82), Expect = 0.007
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
++KKKKKKKK K K++ +K++++
Sbjct: 18 DVKKKKKKKKKKNKSKEEVVTEKEEEE 44
Score = 33.2 bits (76), Expect = 0.050
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
++KKKKKKK K K++ +K++++K
Sbjct: 19 VKKKKKKKKKKNKSKEEVVTEKEEEEK 45
Score = 31.6 bits (72), Expect = 0.16
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++KKKKKK K K++ +K++++K
Sbjct: 20 KKKKKKKKKKNKSKEEVVTEKEEEEKS 46
Score = 28.9 bits (65), Expect = 1.5
Identities = 10/27 (37%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++KKKKK K K++ +K++++K
Sbjct: 21 KKKKKKKKKNKSKEEVVTEKEEEEKSS 47
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 44.6 bits (106), Expect = 3e-05
Identities = 18/33 (54%), Positives = 21/33 (63%), Gaps = 5/33 (15%)
Query: 428 HKNERKKK-----KKKKKKKKKKKKKKKKKKKK 455
K ER+KK KK+K KK KKKKK KKK +
Sbjct: 172 KKAERQKKREENLKKRKDDKKNKKKKKAKKKGR 204
Score = 41.1 bits (97), Expect = 5e-04
Identities = 13/25 (52%), Positives = 17/25 (68%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKK 453
+N +K+K KK KKKKK KKK +
Sbjct: 182 ENLKKRKDDKKNKKKKKAKKKGRIL 206
Score = 39.6 bits (93), Expect = 0.001
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKK 454
KK+K KK KKKKK KKK +
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206
Score = 38.8 bits (91), Expect = 0.003
Identities = 10/25 (40%), Positives = 22/25 (88%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
+R++K++++K +KK+K+K+ KKK+
Sbjct: 10 QRRRKREQRKARKKQKRKEAKKKED 34
Score = 38.4 bits (90), Expect = 0.003
Identities = 10/25 (40%), Positives = 23/25 (92%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E++++K++++K +KK+K+K+ KKK+
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKE 33
Score = 37.7 bits (88), Expect = 0.006
Identities = 10/27 (37%), Positives = 21/27 (77%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ R+K++++K +KK+K+K+ KKK+
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDA 35
Score = 37.3 bits (87), Expect = 0.009
Identities = 12/23 (52%), Positives = 17/23 (73%)
Query: 439 KKKKKKKKKKKKKKKKKWSFRFQ 461
KK K+K+K+KKK KK+W R +
Sbjct: 145 KKALKRKEKQKKKSKKEWKERKE 167
Score = 36.1 bits (84), Expect = 0.018
Identities = 9/27 (33%), Positives = 21/27 (77%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +++++K +KK+K+K+ KKK+ +K
Sbjct: 11 RRRKREQRKARKKQKRKEAKKKEDAQK 37
Score = 35.7 bits (83), Expect = 0.024
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++KK+++ KK+K KK KKKKK
Sbjct: 170 EKKKAERQKKREENLKKRKDDKKNKKKKK 198
Score = 35.7 bits (83), Expect = 0.029
Identities = 18/53 (33%), Positives = 21/53 (39%), Gaps = 29/53 (54%)
Query: 432 RKKKKKKKKKKK-----------------------------KKKKKKKKKKKK 455
RK+K+KKK KK+ KK KKKKK KKK
Sbjct: 150 RKEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKK 202
Score = 35.4 bits (82), Expect = 0.033
Identities = 18/41 (43%), Positives = 21/41 (51%), Gaps = 15/41 (36%)
Query: 430 NERK----KKKKKKKKK-----------KKKKKKKKKKKKK 455
ERK KKK +++KK KK KKKKK KKK
Sbjct: 163 KERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKKG 203
Score = 34.6 bits (80), Expect = 0.066
Identities = 14/32 (43%), Positives = 23/32 (71%), Gaps = 3/32 (9%)
Query: 427 VHKNE---RKKKKKKKKKKKKKKKKKKKKKKK 455
V +E +K K+K+K+KKK KK+ K++K+K
Sbjct: 137 VKDDEKLLKKALKRKEKQKKKSKKEWKERKEK 168
Score = 33.4 bits (77), Expect = 0.15
Identities = 10/25 (40%), Positives = 19/25 (76%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
+RK +KK+K+K+ KKK+ +K + +
Sbjct: 17 QRKARKKQKRKEAKKKEDAQKSEAE 41
Score = 33.0 bits (76), Expect = 0.22
Identities = 11/25 (44%), Positives = 19/25 (76%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKK 452
K E++K +KK+K+K+ KKK+ +K
Sbjct: 13 RKREQRKARKKQKRKEAKKKEDAQK 37
Score = 31.1 bits (71), Expect = 0.74
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
RKK+K+K+ KKK+ +K + ++ K
Sbjct: 20 ARKKQKRKEAKKKEDAQKSEAEEVK 44
Score = 31.1 bits (71), Expect = 0.89
Identities = 8/26 (30%), Positives = 18/26 (69%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
+ + +KK+K+K+ KKK+ +K + +
Sbjct: 16 EQRKARKKQKRKEAKKKEDAQKSEAE 41
Score = 30.4 bits (69), Expect = 1.5
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 10/41 (24%)
Query: 425 SHVHKNERKKKKKKKKKKKKKKKKKK----------KKKKK 455
S V + ++ KK K KKKKKKKK +KKK
Sbjct: 67 SKVEFADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEARKKK 107
Score = 30.4 bits (69), Expect = 1.5
Identities = 13/54 (24%), Positives = 23/54 (42%), Gaps = 7/54 (12%)
Query: 408 GGGAHSMTSSLKQNP-----YSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKW 456
G A K+ Y + K E +KKK ++ + K + ++K+KW
Sbjct: 74 GEQAKKDLKLKKKKKKKKTDYKQLLKKLE--ARKKKLEELDEDKAAEIEEKEKW 125
Score = 29.6 bits (67), Expect = 3.0
Identities = 9/27 (33%), Positives = 18/27 (66%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKK 454
K +K+K+K+ KKK+ +K + ++ K
Sbjct: 18 RKARKKQKRKEAKKKEDAQKSEAEEVK 44
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 43.1 bits (102), Expect = 4e-05
Identities = 19/27 (70%), Positives = 22/27 (81%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++KK+KKKKKKKKKKK KK KKK
Sbjct: 108 KAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 42.4 bits (100), Expect = 6e-05
Identities = 17/27 (62%), Positives = 21/27 (77%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ ++KK+KKKKKKKKKKK KK KK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKK 133
Score = 41.6 bits (98), Expect = 1e-04
Identities = 18/28 (64%), Positives = 21/28 (75%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
++K+KKKKKKKKKKK KK KKKK
Sbjct: 108 KAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 40.8 bits (96), Expect = 2e-04
Identities = 19/29 (65%), Positives = 22/29 (75%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
V K + K+KK+KKKKKKKKKKK KK K
Sbjct: 104 VLKEKAKQKKQKKKKKKKKKKKTSKKAAK 132
Score = 39.3 bits (92), Expect = 8e-04
Identities = 17/28 (60%), Positives = 22/28 (78%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++ K+K K+KK+KKKKKKKKKKK
Sbjct: 98 QKIVKQVLKEKAKQKKQKKKKKKKKKKK 125
Score = 39.3 bits (92), Expect = 8e-04
Identities = 16/24 (66%), Positives = 20/24 (83%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
+ K+K K+KK+KKKKKKKKKKK
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKT 126
Score = 34.3 bits (79), Expect = 0.041
Identities = 15/27 (55%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
E+K K+ K+K K+KK+KKKKKKK
Sbjct: 95 VEEQKIVKQVLKEKAKQKKQKKKKKKK 121
Score = 31.6 bits (72), Expect = 0.38
Identities = 15/31 (48%), Positives = 18/31 (58%)
Query: 418 LKQNPYSSHVHKNERKKKKKKKKKKKKKKKK 448
LK+ K ++KKKKKK KK KKKK
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 45.3 bits (108), Expect = 5e-05
Identities = 18/27 (66%), Positives = 22/27 (81%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E KK +K+KKKKK+KKK KK+KKK
Sbjct: 386 KREEKKPQKRKKKKKRKKKGKKRKKKG 412
Score = 45.0 bits (107), Expect = 6e-05
Identities = 16/27 (59%), Positives = 23/27 (85%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E+K +K+KKKKK+KKK KK+KKK +
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 43.4 bits (103), Expect = 2e-04
Identities = 15/27 (55%), Positives = 23/27 (85%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ ++ +K+KKKKK+KKK KK+KKK +K
Sbjct: 388 EEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 43.4 bits (103), Expect = 2e-04
Identities = 17/27 (62%), Positives = 24/27 (88%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +R++KK +K+KKKKK+KKK KK+KK
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKK 410
Score = 43.0 bits (102), Expect = 3e-04
Identities = 17/27 (62%), Positives = 22/27 (81%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +KK +K+KKKKK+KKK KK+KKK
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKKK 411
Score = 40.7 bits (96), Expect = 0.001
Identities = 14/23 (60%), Positives = 20/23 (86%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKK 455
KKK+++KK +K+KKKKK+KKK
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKG 405
Score = 39.2 bits (92), Expect = 0.003
Identities = 15/27 (55%), Positives = 22/27 (81%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + KKK+++KK +K+KKKKK+KKK
Sbjct: 378 KYPKPPKKKREEKKPQKRKKKKKRKKK 404
Score = 34.9 bits (81), Expect = 0.071
Identities = 13/25 (52%), Positives = 20/25 (80%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E+ K KKK+++KK +K+KKKKK+
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKKKKR 401
Score = 34.6 bits (80), Expect = 0.093
Identities = 12/25 (48%), Positives = 19/25 (76%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKKW 456
++K K KKK+++KK +K+KKKK
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKK 400
Score = 34.2 bits (79), Expect = 0.13
Identities = 13/25 (52%), Positives = 19/25 (76%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E K+K K KKK+++KK +K+KKK
Sbjct: 374 EIKEKYPKPPKKKREEKKPQKRKKK 398
Score = 29.2 bits (66), Expect = 5.4
Identities = 12/33 (36%), Positives = 19/33 (57%), Gaps = 4/33 (12%)
Query: 429 KNERKKK----KKKKKKKKKKKKKKKKKKKKWS 457
K E K+ K+K K KKK+++KK +K+
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKK 397
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 42.7 bits (101), Expect = 6e-05
Identities = 16/26 (61%), Positives = 21/26 (80%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
E+ KK+ K++KKK+KKKKKKK KK
Sbjct: 80 EEKTAKKRAKRQKKKQKKKKKKKAKK 105
Score = 42.3 bits (100), Expect = 7e-05
Identities = 16/27 (59%), Positives = 21/27 (77%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +K+ K++KKK+KKKKKKK KK K
Sbjct: 82 KTAKKRAKRQKKKQKKKKKKKAKKGNK 108
Score = 42.0 bits (99), Expect = 1e-04
Identities = 16/25 (64%), Positives = 19/25 (76%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKK 453
K K++KKK+KKKKKKK KK KK
Sbjct: 85 KKRAKRQKKKQKKKKKKKAKKGNKK 109
Score = 41.6 bits (98), Expect = 1e-04
Identities = 17/27 (62%), Positives = 22/27 (81%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K+E K KK+ K++KKK+KKKKKKK K
Sbjct: 78 KDEEKTAKKRAKRQKKKQKKKKKKKAK 104
Score = 40.4 bits (95), Expect = 4e-04
Identities = 15/24 (62%), Positives = 20/24 (83%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
++ K++KKK+KKKKKKK KK KK
Sbjct: 86 KRAKRQKKKQKKKKKKKAKKGNKK 109
Score = 39.6 bits (93), Expect = 6e-04
Identities = 16/26 (61%), Positives = 22/26 (84%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
+ +R+KKK+KKKKKKK KK KK++K
Sbjct: 87 RAKRQKKKQKKKKKKKAKKGNKKEEK 112
Score = 38.5 bits (90), Expect = 0.002
Identities = 16/33 (48%), Positives = 23/33 (69%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQ 461
K +++KKK+KKKKKKK KK KK++K +
Sbjct: 86 KRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSS 118
Score = 37.7 bits (88), Expect = 0.003
Identities = 14/27 (51%), Positives = 22/27 (81%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +RK ++K KK+ K++KKK+KKKKK
Sbjct: 74 EKKRKDEEKTAKKRAKRQKKKQKKKKK 100
Score = 36.6 bits (85), Expect = 0.007
Identities = 13/27 (48%), Positives = 22/27 (81%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E+K+K ++K KK+ K++KKK+KKK
Sbjct: 72 REEKKRKDEEKTAKKRAKRQKKKQKKK 98
Score = 34.6 bits (80), Expect = 0.033
Identities = 14/28 (50%), Positives = 21/28 (75%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E KK+K ++K KK+ K++KKK+KK
Sbjct: 70 QKREEKKRKDEEKTAKKRAKRQKKKQKK 97
Score = 30.4 bits (69), Expect = 1.0
Identities = 10/25 (40%), Positives = 21/25 (84%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
++K+++KK+K ++K KK+ K++KK
Sbjct: 69 QQKREEKKRKDEEKTAKKRAKRQKK 93
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 43.3 bits (102), Expect = 6e-05
Identities = 16/25 (64%), Positives = 21/25 (84%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E+K KKKK + K++KKKKK+KKKK
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKK 165
Score = 42.5 bits (100), Expect = 1e-04
Identities = 16/30 (53%), Positives = 24/30 (80%)
Query: 426 HVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
H K+++KK + K++KKKKK+KKKKKK+
Sbjct: 140 HEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169
Score = 41.7 bits (98), Expect = 2e-04
Identities = 18/36 (50%), Positives = 23/36 (63%)
Query: 420 QNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ P S K KK KKKK + K++KKKKK+KKK
Sbjct: 129 ETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKK 164
Score = 36.3 bits (84), Expect = 0.015
Identities = 17/40 (42%), Positives = 22/40 (55%), Gaps = 6/40 (15%)
Query: 422 PYSSHVHKNERKKKKKKKKKKK------KKKKKKKKKKKK 455
+ + + +KK KKKK KKKKK+KKKKKK
Sbjct: 128 EETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKK 167
Score = 29.0 bits (65), Expect = 3.5
Identities = 18/58 (31%), Positives = 21/58 (36%), Gaps = 28/58 (48%)
Query: 426 HVHKNERKKKKKKK-------------------------KKKKKKKK---KKKKKKKK 455
+H KKK K K KK KKKK K++KKKKK
Sbjct: 103 LMHIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKK 160
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 44.9 bits (107), Expect = 6e-05
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K KK+K+K+K K KK+ + K K
Sbjct: 398 KRAEKKEKEKEKPKVKKRHRDTKNIGK 424
Score = 44.5 bits (106), Expect = 7e-05
Identities = 13/39 (33%), Positives = 21/39 (53%)
Query: 417 SLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
S K+ S +R +KK+K+K+K K KK+ + K
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNI 422
Score = 44.2 bits (105), Expect = 1e-04
Identities = 10/27 (37%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E++K+K K KK+ + K K++K
Sbjct: 402 KKEKEKEKPKVKKRHRDTKNIGKRRKP 428
Score = 44.2 bits (105), Expect = 1e-04
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E+K+K+K+K K KK+ + K K+
Sbjct: 399 RAEKKEKEKEKPKVKKRHRDTKNIGKR 425
Score = 42.6 bits (101), Expect = 3e-04
Identities = 8/27 (29%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+++K+K+K K KK+ + K K++
Sbjct: 400 AEKKEKEKEKPKVKKRHRDTKNIGKRR 426
Score = 41.9 bits (99), Expect = 6e-04
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + K+K+K K KK+ + K K++K
Sbjct: 401 EKKEKEKEKPKVKKRHRDTKNIGKRRK 427
Score = 41.5 bits (98), Expect = 7e-04
Identities = 11/32 (34%), Positives = 17/32 (53%)
Query: 424 SSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
S V +KK+K+K+K K KK+ + K
Sbjct: 392 SKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIG 423
Score = 41.5 bits (98), Expect = 7e-04
Identities = 9/27 (33%), Positives = 15/27 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + K+K K KK+ + K K++K
Sbjct: 403 KEKEKEKPKVKKRHRDTKNIGKRRKPS 429
Score = 39.2 bits (92), Expect = 0.004
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +KK K KK K+ +KK+K+K
Sbjct: 380 TKAPSEKKTGKPSKKVLAKRAEKKEKEK 407
Score = 38.8 bits (91), Expect = 0.005
Identities = 8/27 (29%), Positives = 15/27 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E++K K KK+ + K K++K
Sbjct: 404 EKEKEKPKVKKRHRDTKNIGKRRKPSG 430
Score = 38.4 bits (90), Expect = 0.007
Identities = 8/27 (29%), Positives = 13/27 (48%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + K K KK+ + K K++K
Sbjct: 405 KEKEKPKVKKRHRDTKNIGKRRKPSGT 431
Score = 34.5 bits (80), Expect = 0.10
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + +KK K KK K+ +KK+K
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEK 405
Score = 29.5 bits (67), Expect = 3.7
Identities = 12/35 (34%), Positives = 16/35 (45%), Gaps = 11/35 (31%)
Query: 432 RKKKKKKKKKKKKK-----------KKKKKKKKKK 455
R K K +KK K KK+K+K+K K
Sbjct: 377 RPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPK 411
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 42.1 bits (100), Expect = 1e-04
Identities = 13/32 (40%), Positives = 20/32 (62%)
Query: 424 SSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
++ + K E K +K +KK +K K K +KKK K
Sbjct: 60 AALLDKKELKAWEKAEKKAEKAKAKAEKKKAK 91
Score = 38.3 bits (90), Expect = 0.002
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKK 454
K E+K +K K K +KKK KK++ K +
Sbjct: 72 EKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 38.3 bits (90), Expect = 0.002
Identities = 14/44 (31%), Positives = 21/44 (47%)
Query: 412 HSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ SL+ K K +KK +K K K +KKK KK++
Sbjct: 52 KDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEP 95
Score = 36.4 bits (85), Expect = 0.010
Identities = 12/25 (48%), Positives = 18/25 (72%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E+ +KK +K K K +KKK KK++ K
Sbjct: 72 EKAEKKAEKAKAKAEKKKAKKEEPK 96
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 41.4 bits (97), Expect = 1e-04
Identities = 21/46 (45%), Positives = 24/46 (52%)
Query: 409 GGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
G H TSS + + H + K KKKKKK KK KK KK KK
Sbjct: 75 GECHLTTSSGEPSHPDDHPPEPTEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 32.5 bits (74), Expect = 0.13
Identities = 15/31 (48%), Positives = 18/31 (58%)
Query: 425 SHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
SH + + +K KKKKKK KK KK KK
Sbjct: 87 SHPDDHPPEPTEKPKKKKKKSKKTKKPKKSS 117
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 40.0 bits (94), Expect = 1e-04
Identities = 13/27 (48%), Positives = 14/27 (51%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E KKKKKKKKKK + K
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKA 43
Score = 30.0 bits (68), Expect = 0.52
Identities = 10/24 (41%), Positives = 13/24 (54%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKK 452
K ++KKKKKK + K KK
Sbjct: 24 KKKKKKKKKKAEDTAATAKAKKAT 47
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain. This
presumed domain is found at the N-terminus of Ribosomal
L30 proteins and has been termed RL30NT or NUC018.
Length = 71
Score = 39.1 bits (92), Expect = 2e-04
Identities = 13/34 (38%), Positives = 19/34 (55%), Gaps = 3/34 (8%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQR 462
K K + K+ KK+ KK +KKK+K F+R
Sbjct: 7 KRNEKLRAKRAKKRAAKKAARKKKRKLI---FKR 37
Score = 34.9 bits (81), Expect = 0.008
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +R +K + K+ KK+ KK +KKK+
Sbjct: 5 KRKRNEKLRAKRAKKRAAKKAARKKKR 31
Score = 31.0 bits (71), Expect = 0.16
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
KK+ KK +KKK+K K+ +K
Sbjct: 17 AKKRAAKKAARKKKRKLIFKRAEK 40
Score = 29.5 bits (67), Expect = 0.51
Identities = 9/23 (39%), Positives = 15/23 (65%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKK 455
KK+K+ +K + K+ KK+ KK
Sbjct: 2 LLKKRKRNEKLRAKRAKKRAAKK 24
Score = 28.0 bits (63), Expect = 1.9
Identities = 8/23 (34%), Positives = 15/23 (65%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKK 453
+++ KK +KKK+K K+ +K
Sbjct: 18 KKRAAKKAARKKKRKLIFKRAEK 40
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 42.4 bits (100), Expect = 2e-04
Identities = 15/28 (53%), Positives = 18/28 (64%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ ER KKKK+ K K K+ KKKKKK
Sbjct: 79 QREERLKKKKRVKTKAYKEPTKKKKKKD 106
Score = 40.4 bits (95), Expect = 9e-04
Identities = 14/34 (41%), Positives = 19/34 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQR 462
+ E + KKKK+ K K K+ KKKKKK +
Sbjct: 79 QREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKS 112
Score = 38.5 bits (90), Expect = 0.004
Identities = 17/41 (41%), Positives = 27/41 (65%)
Query: 413 SMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKK 453
S SS QN ++H ER+ ++KK + K +K+K+KKK+K
Sbjct: 141 SSRSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEK 181
Score = 38.1 bits (89), Expect = 0.005
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ ++KK+ K K K+ KKKKKK
Sbjct: 82 ERLKKKKRVKTKAYKEPTKKKKKKDPTA 109
Score = 33.1 bits (76), Expect = 0.19
Identities = 10/24 (41%), Positives = 12/24 (50%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
+KKKKK K K + KKK
Sbjct: 100 KKKKKKDPTAAKSPKAAAPRPKKK 123
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 42.8 bits (101), Expect = 3e-04
Identities = 16/32 (50%), Positives = 20/32 (62%)
Query: 421 NPYSSHVHKNERKKKKKKKKKKKKKKKKKKKK 452
P S + +E K KK KKKKKKKKK++K
Sbjct: 261 TPASDYYDVSEMVKFKKPKKKKKKKKKRRKDL 292
Score = 41.7 bits (98), Expect = 7e-04
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKKW 456
+ K KK KKKKKKKKK++K
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRKDLD 293
Score = 41.3 bits (97), Expect = 0.001
Identities = 14/30 (46%), Positives = 18/30 (60%)
Query: 424 SSHVHKNERKKKKKKKKKKKKKKKKKKKKK 453
+S + K KK KKKKKKKKK++K
Sbjct: 263 ASDYYDVSEMVKFKKPKKKKKKKKKRRKDL 292
Score = 36.3 bits (84), Expect = 0.039
Identities = 17/87 (19%), Positives = 36/87 (41%), Gaps = 25/87 (28%)
Query: 374 TNKHRVLKQHDYLPPLAHGMPPPPNIPIPAVPVMGGGAHSMTS-----SLKQNPYSSHVH 428
TNK R G+ P +P+P GA ++ + +
Sbjct: 8 TNKLR----------AKLGLKP---LPVP-------GAKPGSTKESRDAAAYENWKKRQE 47
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E K+K+++ ++K K ++K+++ K
Sbjct: 48 EAEAKRKREELREKIAKAREKRERNSK 74
Score = 32.8 bits (75), Expect = 0.40
Identities = 9/27 (33%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +K++KKK+ ++KK +K+K++
Sbjct: 96 KKSKKRQKKKEAERKKALLLDEKEKER 122
>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein. This is a family of
121-amino acid secretory proteins. Laps functions in the
regulation of neuronal cell adhesion and/or movement and
synapse attachment. Laps binds to the ApC/EBP (Aplysia
CCAAT/enhancer binding protein) promoter and activates
the transcription of ApC/EBP mRNA.
Length = 124
Score = 39.8 bits (93), Expect = 4e-04
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
N+R+ KK KKK++KKKKKKK K
Sbjct: 99 NQRQAKKLKKKREKKKKKKKGAKNLA 124
Score = 33.7 bits (77), Expect = 0.049
Identities = 14/31 (45%), Positives = 21/31 (67%)
Query: 423 YSSHVHKNERKKKKKKKKKKKKKKKKKKKKK 453
Y + +++ + KK KKK++KKKKKKK K
Sbjct: 94 YPAWMNQRQAKKLKKKREKKKKKKKGAKNLA 124
Score = 33.7 bits (77), Expect = 0.061
Identities = 12/25 (48%), Positives = 18/25 (72%)
Query: 436 KKKKKKKKKKKKKKKKKKKKWSFRF 460
+++ KK KKK++KKKKKKK +
Sbjct: 99 NQRQAKKLKKKREKKKKKKKGAKNL 123
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 40.6 bits (96), Expect = 0.001
Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 4/44 (9%)
Query: 412 HSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
M LK + + K E K K +KKK+K++ K K K K
Sbjct: 50 KEMKEELKA----ALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89
Score = 33.7 bits (78), Expect = 0.19
Identities = 9/22 (40%), Positives = 15/22 (68%)
Query: 428 HKNERKKKKKKKKKKKKKKKKK 449
HK ++KK+K++ K K K K +
Sbjct: 70 HKAQKKKEKQEAKAAKAKSKPR 91
>gnl|CDD|220121 pfam09135, Alb1, Alb1. Alb1 is a nuclear shuttling factor involved
in ribosome biogenesis.
Length = 112
Score = 37.9 bits (88), Expect = 0.002
Identities = 13/28 (46%), Positives = 21/28 (75%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
KNE KKK++KKK K+ +K++K++K
Sbjct: 41 SKNEGVSKKKQRKKKPKRLTRKQRKRQK 68
Score = 31.3 bits (71), Expect = 0.30
Identities = 13/56 (23%), Positives = 22/56 (39%), Gaps = 16/56 (28%)
Query: 416 SSLKQNPYSSHVHKNERKKK----------------KKKKKKKKKKKKKKKKKKKK 455
S + S + R K+ KKK++KKK K+ +K++K
Sbjct: 10 PSPSIDLDKSSLTSLPRAKRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRK 65
Score = 29.0 bits (65), Expect = 1.9
Identities = 10/22 (45%), Positives = 17/22 (77%)
Query: 429 KNERKKKKKKKKKKKKKKKKKK 450
K +RKKK K+ +K++K++KK
Sbjct: 49 KKQRKKKPKRLTRKQRKRQKKG 70
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 39.6 bits (93), Expect = 0.002
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K K K KK K K ++K+K K++K+
Sbjct: 51 KVAAKAKAKKALKAKIEEKEKAKREKE 77
Score = 37.7 bits (88), Expect = 0.007
Identities = 10/27 (37%), Positives = 11/27 (40%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E+ K KKKK K K K
Sbjct: 196 KAEKAAKGGKKKKGKAKAKLNVGGAND 222
Score = 37.7 bits (88), Expect = 0.007
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
+ K KK K K ++K+K K++K++K
Sbjct: 55 KAKAKKALKAKIEEKEKAKREKEEK 79
Score = 36.9 bits (86), Expect = 0.014
Identities = 12/23 (52%), Positives = 14/23 (60%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKK 452
NE+ K +K K KKKK K K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAK 214
Score = 36.6 bits (85), Expect = 0.017
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + K K KK K K ++K+K K++
Sbjct: 49 KAKVAAKAKAKKALKAKIEEKEKAKRE 75
Score = 35.8 bits (83), Expect = 0.029
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E +K K K K KK K K ++K+
Sbjct: 44 EKEEEKAKVAAKAKAKKALKAKIEEKE 70
Score = 35.8 bits (83), Expect = 0.031
Identities = 10/29 (34%), Positives = 15/29 (51%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + K K K KK K K ++K+K K
Sbjct: 45 KEEEKAKVAAKAKAKKALKAKIEEKEKAK 73
Score = 35.4 bits (82), Expect = 0.036
Identities = 11/24 (45%), Positives = 13/24 (54%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
+K K +K K KKKK K K K
Sbjct: 191 INEKLKAEKAAKGGKKKKGKAKAK 214
Score = 35.0 bits (81), Expect = 0.053
Identities = 8/26 (30%), Positives = 9/26 (34%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K + KKKK K K K
Sbjct: 199 KAAKGGKKKKGKAKAKLNVGGANDDD 224
Score = 34.6 bits (80), Expect = 0.063
Identities = 8/27 (29%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+K K K ++K+K K++K++K +
Sbjct: 56 AKAKKALKAKIEEKEKAKREKEEKGLR 82
Score = 34.2 bits (79), Expect = 0.095
Identities = 10/25 (40%), Positives = 15/25 (60%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E ++K+++K K K K KK K K
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAK 65
Score = 33.9 bits (78), Expect = 0.13
Identities = 10/25 (40%), Positives = 13/25 (52%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKK 451
+ + K +K K KKKK K K K
Sbjct: 190 LINEKLKAEKAAKGGKKKKGKAKAK 214
Score = 33.5 bits (77), Expect = 0.15
Identities = 9/25 (36%), Positives = 11/25 (44%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +K K KKKK K K K
Sbjct: 195 LKAEKAAKGGKKKKGKAKAKLNVGG 219
Score = 33.1 bits (76), Expect = 0.24
Identities = 10/28 (35%), Positives = 16/28 (57%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
++E K+++K K K K KK K K +
Sbjct: 40 EEDEEKEEEKAKVAAKAKAKKALKAKIE 67
Score = 31.6 bits (72), Expect = 0.64
Identities = 7/27 (25%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ K K ++K+K K++K++K ++
Sbjct: 58 AKKALKAKIEEKEKAKREKEEKGLREL 84
Score = 30.4 bits (69), Expect = 1.5
Identities = 9/25 (36%), Positives = 15/25 (60%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E + ++K+++K K K K KK K
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAKKALK 63
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 40.0 bits (93), Expect = 0.002
Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 1/51 (1%)
Query: 411 AHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQ 461
H + K +P V E+K KK KKK+KK+K+K++ K KKK F+
Sbjct: 177 KHRNAETSK-SPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226
Score = 35.4 bits (81), Expect = 0.055
Identities = 11/34 (32%), Positives = 24/34 (70%)
Query: 422 PYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
P S K E +++ +++ +K K++KKK++K+K+
Sbjct: 76 PMSDQYVKLEEERRHRQRLEKDKREKKKREKEKR 109
Score = 35.0 bits (80), Expect = 0.076
Identities = 9/25 (36%), Positives = 21/25 (84%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
ER+ +++ +K K++KKK++K+K+ +
Sbjct: 87 ERRHRQRLEKDKREKKKREKEKRGR 111
Score = 34.3 bits (78), Expect = 0.14
Identities = 9/29 (31%), Positives = 22/29 (75%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWS 457
+ R++ +K K++KKK++K+K+ +++ S
Sbjct: 88 RRHRQRLEKDKREKKKREKEKRGRRRHHS 116
Score = 34.3 bits (78), Expect = 0.15
Identities = 14/27 (51%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K K KKKK++K+K++KKKKKK
Sbjct: 275 KKSPKHKKKKQRKEKEEKKKKKKHHHH 301
Score = 33.9 bits (77), Expect = 0.21
Identities = 18/53 (33%), Positives = 28/53 (52%), Gaps = 1/53 (1%)
Query: 403 AVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
A V G S K + + K+ + KKKK++K+K++KKKKKK +
Sbjct: 251 ANTVSGTAPDSEPDEPK-DAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302
Score = 33.9 bits (77), Expect = 0.22
Identities = 8/26 (30%), Positives = 19/26 (73%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKK 453
H+ +K K++KKK++K+K+ +++
Sbjct: 90 HRQRLEKDKREKKKREKEKRGRRRHH 115
Score = 33.1 bits (75), Expect = 0.35
Identities = 8/26 (30%), Positives = 19/26 (73%)
Query: 426 HVHKNERKKKKKKKKKKKKKKKKKKK 451
H + E+ K++KKK++K+K+ +++
Sbjct: 90 HRQRLEKDKREKKKREKEKRGRRRHH 115
Score = 33.1 bits (75), Expect = 0.36
Identities = 21/66 (31%), Positives = 31/66 (46%), Gaps = 6/66 (9%)
Query: 400 PIPAVPVMGGGAHSMTSSLKQNPYSSHVHKN------ERKKKKKKKKKKKKKKKKKKKKK 453
P A V S+ +++ S + E KK K KKKK++K+K++KKKK
Sbjct: 236 PASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKK 295
Query: 454 KKWSFR 459
KK
Sbjct: 296 KKHHHH 301
Score = 32.3 bits (73), Expect = 0.51
Identities = 14/36 (38%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 427 VHKNERKKKKK--KKKKKKKKKKKKKKKKKKWSFRF 460
V K +K KKK K+K+K++ K KKK+ + S
Sbjct: 195 VEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLL 230
Score = 30.0 bits (67), Expect = 3.4
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKW 456
K E+K+K+K++ K KKK+ + K
Sbjct: 204 KKEKKEKEKERDKDKKKEVEGFKSLLLA 231
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this family
(Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 36.9 bits (86), Expect = 0.002
Identities = 7/24 (29%), Positives = 19/24 (79%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKK 454
ERK+++K +++++ K+ K K+++
Sbjct: 2 ERKEEEKAQREEELKRLKNLKREE 25
Score = 32.6 bits (75), Expect = 0.084
Identities = 5/25 (20%), Positives = 19/25 (76%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +K+++K +++++ K+ K K+++
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREE 25
Score = 31.8 bits (73), Expect = 0.15
Identities = 7/26 (26%), Positives = 18/26 (69%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K E K +++++ K+ K K+++ ++K
Sbjct: 4 KEEEKAQREEELKRLKNLKREEIEEK 29
Score = 30.7 bits (70), Expect = 0.36
Identities = 8/28 (28%), Positives = 19/28 (67%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKW 456
K +R+++ K+ K K+++ ++K +K K
Sbjct: 8 KAQREEELKRLKNLKREEIEEKLEKIKK 35
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 118
Score = 37.2 bits (87), Expect = 0.002
Identities = 16/35 (45%), Positives = 23/35 (65%), Gaps = 3/35 (8%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWSF-RFQR 462
K R++ + K K+K+KKKKKKK+ + F RFQ
Sbjct: 64 KTAREEAVEAKAKEKEKKKKKKKELE--DFYRFQI 96
>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755). This
family is predominated by ORFs from Circoviridae. The
function of this family remains to be determined.
Length = 122
Score = 37.4 bits (87), Expect = 0.003
Identities = 12/34 (35%), Positives = 18/34 (52%)
Query: 424 SSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWS 457
SS HK +RKK + + KK KK+++ S
Sbjct: 74 SSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSS 107
Score = 30.9 bits (70), Expect = 0.57
Identities = 16/58 (27%), Positives = 25/58 (43%), Gaps = 17/58 (29%)
Query: 415 TSSLKQNPYSSHVHKNERKKK----------KKKKKKK-----KKKKKKKKKKKKKWS 457
TS L S + + +RK+K KK+KK + KK KK+++ S
Sbjct: 51 TSQLIHL--SQQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSS 106
Score = 27.4 bits (61), Expect = 8.2
Identities = 9/34 (26%), Positives = 14/34 (41%)
Query: 411 AHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKK 444
S + K+ S HV KK KK+++
Sbjct: 72 QKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSS 105
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 37.6 bits (88), Expect = 0.003
Identities = 10/27 (37%), Positives = 20/27 (74%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKK 454
ER +K+K+ +K ++KK K+++K+K
Sbjct: 102 RTKERAEKEKRTRKNREKKFKRRQKEK 128
Score = 37.3 bits (87), Expect = 0.003
Identities = 10/27 (37%), Positives = 21/27 (77%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +K+K+ +K ++KK K+++K+K+K
Sbjct: 104 KERAEKEKRTRKNREKKFKRRQKEKEK 130
Score = 36.1 bits (84), Expect = 0.008
Identities = 9/24 (37%), Positives = 19/24 (79%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKK 454
ER K++ +K+K+ +K ++KK K++
Sbjct: 101 ERTKERAEKEKRTRKNREKKFKRR 124
Score = 36.1 bits (84), Expect = 0.010
Identities = 8/27 (29%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
R+++ K++ +K+K+ +K ++KK K
Sbjct: 96 LRLRRERTKERAEKEKRTRKNREKKFK 122
Score = 34.9 bits (81), Expect = 0.023
Identities = 8/25 (32%), Positives = 19/25 (76%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
K++ +K+K+ +K ++KK K+++K
Sbjct: 102 RTKERAEKEKRTRKNREKKFKRRQK 126
Score = 34.9 bits (81), Expect = 0.025
Identities = 8/27 (29%), Positives = 21/27 (77%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + + +K+K+ +K ++KK K+++K+K
Sbjct: 102 RTKERAEKEKRTRKNREKKFKRRQKEK 128
Score = 34.9 bits (81), Expect = 0.025
Identities = 7/27 (25%), Positives = 21/27 (77%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +++ +K+K+ +K ++KK K+++K+
Sbjct: 101 ERTKERAEKEKRTRKNREKKFKRRQKE 127
Score = 34.9 bits (81), Expect = 0.025
Identities = 8/24 (33%), Positives = 19/24 (79%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
R++ K++ +K+K+ +K ++KK K+
Sbjct: 100 RERTKERAEKEKRTRKNREKKFKR 123
Score = 32.6 bits (75), Expect = 0.12
Identities = 8/24 (33%), Positives = 19/24 (79%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKK 451
+ E++ +K ++KK K+++K+K+K
Sbjct: 107 AEKEKRTRKNREKKFKRRQKEKEK 130
Score = 32.6 bits (75), Expect = 0.15
Identities = 13/45 (28%), Positives = 25/45 (55%), Gaps = 12/45 (26%)
Query: 430 NERKKKKKKKKKKK-----------KKKKKKKKKKKKWSFR-FQR 462
E+KKK+K+ KK+ K++ +K+K+ +K + F+R
Sbjct: 79 KEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKFKR 123
Score = 31.9 bits (73), Expect = 0.22
Identities = 6/26 (23%), Positives = 17/26 (65%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K R + ++++ K++ +K+K+ +K
Sbjct: 90 KKRRIALRLRRERTKERAEKEKRTRK 115
Score = 30.7 bits (70), Expect = 0.58
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 424 SSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
H K E++ ++K+KKKK+K+ KK++
Sbjct: 65 KKHNAKVEKELLREKEKKKKRKRPGKKRRIAL 96
Score = 30.7 bits (70), Expect = 0.73
Identities = 6/29 (20%), Positives = 19/29 (65%), Gaps = 2/29 (6%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKK--KKK 455
+ + ++++ K++ +K+K+ +K +KK
Sbjct: 92 RRIALRLRRERTKERAEKEKRTRKNREKK 120
Score = 29.5 bits (67), Expect = 1.6
Identities = 11/30 (36%), Positives = 21/30 (70%)
Query: 426 HVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ K+ K +K+ ++K+KKKK+K+ KK+
Sbjct: 63 DLKKHNAKVEKELLREKEKKKKRKRPGKKR 92
Score = 28.4 bits (64), Expect = 4.1
Identities = 9/27 (33%), Positives = 18/27 (66%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E ++K+KKKK+K+ KK++ +
Sbjct: 72 EKELLREKEKKKKRKRPGKKRRIALRL 98
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 38.5 bits (90), Expect = 0.003
Identities = 16/25 (64%), Positives = 17/25 (68%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E K+K K KKKK KKKK K K KK
Sbjct: 79 EEKQKWKWKKKKSKKKKDKDKDKKD 103
Score = 37.4 bits (87), Expect = 0.006
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + K KKKK KKKK K K KK KK
Sbjct: 81 KQKWKWKKKKSKKKKDKDKDKKDDKKD 107
Score = 37.4 bits (87), Expect = 0.006
Identities = 15/25 (60%), Positives = 17/25 (68%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E++K K KKKK KKKK K K KK
Sbjct: 80 EKQKWKWKKKKSKKKKDKDKDKKDD 104
Score = 36.6 bits (85), Expect = 0.011
Identities = 13/27 (48%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +KKK K K KK KK K +KK +
Sbjct: 89 KKSKKKKDKDKDKKDDKKDDKSEKKDE 115
Score = 36.6 bits (85), Expect = 0.012
Identities = 14/27 (51%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++KK KKKK K K KK KK K +
Sbjct: 85 KWKKKKSKKKKDKDKDKKDDKKDDKSE 111
Score = 36.2 bits (84), Expect = 0.015
Identities = 17/29 (58%), Positives = 19/29 (65%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
V K +K+K K KKKK KKKK K K KK
Sbjct: 74 VKKEYEEKQKWKWKKKKSKKKKDKDKDKK 102
Score = 36.2 bits (84), Expect = 0.015
Identities = 15/26 (57%), Positives = 17/26 (65%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
++K K KKKK KKKK K K KK K
Sbjct: 80 EKQKWKWKKKKSKKKKDKDKDKKDDK 105
Score = 36.2 bits (84), Expect = 0.015
Identities = 13/27 (48%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
++K KKKK K K KK KK K +K
Sbjct: 86 WKKKKSKKKKDKDKDKKDDKKDDKSEK 112
Score = 35.8 bits (83), Expect = 0.021
Identities = 14/27 (51%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++ KKKK K K KK KK K +KK
Sbjct: 87 KKKKSKKKKDKDKDKKDDKKDDKSEKK 113
Score = 34.3 bits (79), Expect = 0.073
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + K K KK KK K +KK +K+ +
Sbjct: 93 KKKDKDKDKKDDKKDDKSEKKDEKEAE 119
Score = 33.1 bits (76), Expect = 0.17
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K ++ K K KK KK K +KK +K+
Sbjct: 92 KKKKDKDKDKKDDKKDDKSEKKDEKE 117
Score = 32.7 bits (75), Expect = 0.21
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K++ K KK KK K +KK +K+ + K
Sbjct: 95 KDKDKDKKDDKKDDKSEKKDEKEAEDK 121
Score = 32.0 bits (73), Expect = 0.41
Identities = 10/27 (37%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K++ KK KK K +KK +K+ + K +
Sbjct: 97 KDKDKKDDKKDDKSEKKDEKEAEDKLE 123
Score = 30.4 bits (69), Expect = 1.2
Identities = 9/27 (33%), Positives = 15/27 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
++ KK KK K +KK +K+ + K
Sbjct: 96 DKDKDKKDDKKDDKSEKKDEKEAEDKL 122
Score = 30.0 bits (68), Expect = 1.7
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E+ KK+ ++K+K K KKKK KKKK
Sbjct: 72 EKVKKEYEEKQKWKWKKKKSKKKKD 96
Score = 29.7 bits (67), Expect = 2.0
Identities = 14/27 (51%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
E +K KK+ ++K+K K KKKK KKK
Sbjct: 68 AEEIEKVKKEYEEKQKWKWKKKKSKKK 94
Score = 29.3 bits (66), Expect = 3.1
Identities = 8/25 (32%), Positives = 15/25 (60%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
++K KK K +KK +K+ + K +
Sbjct: 100 DKKDDKKDDKSEKKDEKEAEDKLED 124
Score = 28.9 bits (65), Expect = 3.5
Identities = 9/27 (33%), Positives = 15/27 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + KK K +KK +K+ + K + K
Sbjct: 101 KKDDKKDDKSEKKDEKEAEDKLEDLTK 127
Score = 28.9 bits (65), Expect = 4.0
Identities = 8/26 (30%), Positives = 16/26 (61%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K+++ KK K +KK +K+ + K +
Sbjct: 99 KDKKDDKKDDKSEKKDEKEAEDKLED 124
Score = 28.9 bits (65), Expect = 4.1
Identities = 9/29 (31%), Positives = 18/29 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWS 457
K+++K K +KK +K+ + K + K +S
Sbjct: 102 KDDKKDDKSEKKDEKEAEDKLEDLTKSYS 130
Score = 28.9 bits (65), Expect = 4.3
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKKW 456
+ +K KK+ ++K+K K KKKK K
Sbjct: 67 LAEEIEKVKKEYEEKQKWKWKKKKSKK 93
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association with
pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 36.1 bits (84), Expect = 0.004
Identities = 15/33 (45%), Positives = 21/33 (63%)
Query: 423 YSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
S ++R ++KKKKKKK KKK K+ +KK
Sbjct: 29 ASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61
Score = 35.7 bits (83), Expect = 0.005
Identities = 12/27 (44%), Positives = 21/27 (77%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKKW 456
++ ++ ++ ++KKKKKKK KKK K+W
Sbjct: 30 SKVRRISQRNRRKKKKKKKLKKKSKEW 56
Score = 31.9 bits (73), Expect = 0.12
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
+ K ++ ++ ++KKKKKKK KKK
Sbjct: 28 DASKVRRISQRNRRKKKKKKKLKKKS 53
Score = 30.0 bits (68), Expect = 0.47
Identities = 12/31 (38%), Positives = 22/31 (70%), Gaps = 4/31 (12%)
Query: 429 KNERKKKKKKKKKKK----KKKKKKKKKKKK 455
+ ++KKKKK KKK K +KK++ +++ K+
Sbjct: 40 RRKKKKKKKLKKKSKEWILRKKEQMRRRGKE 70
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 39.1 bits (91), Expect = 0.004
Identities = 14/27 (51%), Positives = 23/27 (85%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
KNE K+++K+K++ K++KKKKK+K K
Sbjct: 99 KNESGKEEEKEKEQVKEEKKKKKEKPK 125
Score = 37.9 bits (88), Expect = 0.010
Identities = 13/44 (29%), Positives = 18/44 (40%)
Query: 419 KQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQR 462
+ P K + + K + KK KKK KKK+ QR
Sbjct: 154 VEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQR 197
Score = 37.2 bits (86), Expect = 0.018
Identities = 12/37 (32%), Positives = 23/37 (62%)
Query: 419 KQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
++ K + K++KKKKK+K K++ K +K K++
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEE 136
Score = 36.8 bits (85), Expect = 0.021
Identities = 13/46 (28%), Positives = 28/46 (60%)
Query: 409 GGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
S + K P +++ ++++K+K++ K++KKKKK+K K+
Sbjct: 81 EKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKE 126
Score = 36.8 bits (85), Expect = 0.022
Identities = 10/27 (37%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K+++K+K++ K++KKKKK+K
Sbjct: 98 PKNESGKEEEKEKEQVKEEKKKKKEKP 124
Score = 36.8 bits (85), Expect = 0.023
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
E K + K+++K+K++ K++KKKKK
Sbjct: 95 AKEPKNESGKEEEKEKEQVKEEKKKKK 121
Score = 36.8 bits (85), Expect = 0.024
Identities = 14/43 (32%), Positives = 26/43 (60%)
Query: 413 SMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ T K+ S + + K++ K++KKKKK+K K++ K +K
Sbjct: 90 AKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132
Score = 34.5 bits (79), Expect = 0.12
Identities = 13/28 (46%), Positives = 21/28 (75%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E+KKKK+K K++ K +K K++ K+K
Sbjct: 113 VKEEKKKKKEKPKEEPKDRKPKEEAKEK 140
Score = 33.7 bits (77), Expect = 0.19
Identities = 11/28 (39%), Positives = 20/28 (71%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + +KKKKK+K K++ K +K K++ K
Sbjct: 111 EQVKEEKKKKKEKPKEEPKDRKPKEEAK 138
Score = 33.7 bits (77), Expect = 0.19
Identities = 11/25 (44%), Positives = 20/25 (80%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
+RK K++ K+K+ K+K+K+K+KK
Sbjct: 130 DRKPKEEAKEKRPPKEKEKEKEKKV 154
Score = 32.9 bits (75), Expect = 0.35
Identities = 10/27 (37%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E K+K+ K+K+K+K+KK ++ + +
Sbjct: 134 KEEAKEKRPPKEKEKEKEKKVEEPRDR 160
Score = 32.9 bits (75), Expect = 0.38
Identities = 10/27 (37%), Positives = 21/27 (77%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K+ + K++ K+K+ K+K+K+K+KK +
Sbjct: 129 KDRKPKEEAKEKRPPKEKEKEKEKKVE 155
Score = 32.6 bits (74), Expect = 0.45
Identities = 9/24 (37%), Positives = 19/24 (79%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
K++ K+K+ K+K+K+K+KK ++
Sbjct: 133 PKEEAKEKRPPKEKEKEKEKKVEE 156
Score = 32.2 bits (73), Expect = 0.66
Identities = 9/28 (32%), Positives = 20/28 (71%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + + K+K+ K+K+K+K+KK ++ +
Sbjct: 131 RKPKEEAKEKRPPKEKEKEKEKKVEEPR 158
Score = 31.4 bits (71), Expect = 1.2
Identities = 9/45 (20%), Positives = 19/45 (42%), Gaps = 2/45 (4%)
Query: 411 AHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ S K KKK KKK+ +++K+++ ++
Sbjct: 160 REEEKKRERVRAKSRP--KKPPKKKPPNKKKEPPEEEKQRQAARE 202
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 36.8 bits (86), Expect = 0.005
Identities = 13/29 (44%), Positives = 22/29 (75%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKKWSFRFQ 461
+ K ++++K K+KKKKKKK+ + +RFQ
Sbjct: 68 RNKAAEERRKLKEKKKKKKKELENFYRFQ 96
Score = 34.5 bits (80), Expect = 0.030
Identities = 15/44 (34%), Positives = 26/44 (59%), Gaps = 7/44 (15%)
Query: 419 KQNPYSSHVHKNERKKKKKKKKKKKKKKKK-------KKKKKKK 455
K+ +S E ++K K+KKKKKKK+ + ++KKK++
Sbjct: 61 KRKAGASRNKAAEERRKLKEKKKKKKKELENFYRFQIREKKKEE 104
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 38.1 bits (89), Expect = 0.005
Identities = 11/33 (33%), Positives = 18/33 (54%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQR 462
+++ KK K KKKK K K K + ++ +R
Sbjct: 205 ELQRELMKKGKGKKKKIVKDKDGKVVYKWKKER 237
Score = 31.2 bits (71), Expect = 0.89
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + KKKK K K K K KK++K+
Sbjct: 213 KGKGKKKKIVKDKDGKVVYKWKKERKR 239
Score = 30.8 bits (70), Expect = 1.1
Identities = 12/30 (40%), Positives = 16/30 (53%)
Query: 426 HVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ K + KKKK K K K K KK++K
Sbjct: 209 ELMKKGKGKKKKIVKDKDGKVVYKWKKERK 238
Score = 28.5 bits (64), Expect = 5.9
Identities = 14/40 (35%), Positives = 24/40 (60%)
Query: 416 SSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
S L++ K + KKK+K K+ K++K+++KK KK
Sbjct: 160 SQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKK 199
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region covers
both the Brf homology II and III regions. This region is
involved in binding TATA binding protein.
Length = 95
Score = 36.1 bits (84), Expect = 0.005
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 426 HVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E K+K + KKKKK+K KKK+
Sbjct: 31 EQEEKELKQKADEGNNSGKKKKKRKAKKKR 60
Score = 35.7 bits (83), Expect = 0.006
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++K + KKKKK+K KKK+
Sbjct: 35 KELKQKADEGNNSGKKKKKRKAKKKRD 61
Score = 31.8 bits (73), Expect = 0.15
Identities = 13/33 (39%), Positives = 19/33 (57%), Gaps = 6/33 (18%)
Query: 429 KNERKKKKKKK------KKKKKKKKKKKKKKKK 455
+ E K+ K+K KKKKK+K KKK+ +
Sbjct: 31 EQEEKELKQKADEGNNSGKKKKKRKAKKKRDEA 63
Score = 30.7 bits (70), Expect = 0.38
Identities = 10/26 (38%), Positives = 14/26 (53%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKK 453
K + KKKKK+K KKK+ +
Sbjct: 39 QKADEGNNSGKKKKKRKAKKKRDEAG 64
Score = 30.7 bits (70), Expect = 0.41
Identities = 12/36 (33%), Positives = 23/36 (63%), Gaps = 6/36 (16%)
Query: 426 HVHKNERKKKKKKKKKKKK------KKKKKKKKKKK 455
++K+ +++++K+ K+K KKKKK+K KK
Sbjct: 23 ELNKDYLEEQEEKELKQKADEGNNSGKKKKKRKAKK 58
>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein.
Length = 105
Score = 36.2 bits (84), Expect = 0.006
Identities = 12/25 (48%), Positives = 14/25 (56%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKKWS 457
KK++KK K K KKKKWS
Sbjct: 4 KKQQKKAAKAAAASAGGKAKKKKWS 28
>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region. This
domain is the C-terminal end of ribosomal L18/L5
proteins.
Length = 93
Score = 35.2 bits (82), Expect = 0.009
Identities = 11/30 (36%), Positives = 16/30 (53%), Gaps = 3/30 (10%)
Query: 427 VHKNERK---KKKKKKKKKKKKKKKKKKKK 453
H R +KKK+KK+ K + K+ KK
Sbjct: 51 AHAAIRADPSRKKKEKKEVKAESKRYNAKK 80
Score = 29.8 bits (68), Expect = 0.79
Identities = 8/27 (29%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E+K+ K + K+ KK +++K +
Sbjct: 63 KKEKKEVKAESKRYNAKKLTLEQRKAR 89
Score = 28.7 bits (65), Expect = 1.7
Identities = 7/16 (43%), Positives = 13/16 (81%)
Query: 441 KKKKKKKKKKKKKKKW 456
+KKK+KK+ K + K++
Sbjct: 61 RKKKEKKEVKAESKRY 76
Score = 27.5 bits (62), Expect = 3.9
Identities = 8/29 (27%), Positives = 15/29 (51%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + KK+ K + K+ KK +++K
Sbjct: 60 SRKKKEKKEVKAESKRYNAKKLTLEQRKA 88
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 37.7 bits (88), Expect = 0.009
Identities = 12/27 (44%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +RK K++ KKKK K+ K K KK
Sbjct: 249 EKKRKSKEEIKKKKPKESKGVKALKKV 275
Score = 34.6 bits (80), Expect = 0.093
Identities = 11/27 (40%), Positives = 14/27 (51%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +KKK K+ K K KK K KK
Sbjct: 256 EEIKKKKPKESKGVKALKKVVAKGMKK 282
Score = 33.8 bits (78), Expect = 0.16
Identities = 13/27 (48%), Positives = 14/27 (51%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E KKKK K+ K K KK K K
Sbjct: 255 KEEIKKKKPKESKGVKALKKVVAKGMK 281
Score = 32.7 bits (75), Expect = 0.31
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E+K+K K++ KKKK K+ K K K
Sbjct: 247 RAEKKRKSKEEIKKKKPKESKGVKALK 273
Score = 32.3 bits (74), Expect = 0.42
Identities = 11/28 (39%), Positives = 12/28 (42%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
E KKKK K+ K K KK K
Sbjct: 253 KSKEEIKKKKPKESKGVKALKKVVAKGM 280
Score = 32.3 bits (74), Expect = 0.47
Identities = 14/43 (32%), Positives = 21/43 (48%)
Query: 413 SMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
S K+ V E + +KK+K K++ KKKK K+ K
Sbjct: 227 LKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGV 269
Score = 31.9 bits (73), Expect = 0.54
Identities = 12/30 (40%), Positives = 16/30 (53%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWSF 458
+ ++KK K+ K K KK K KK SF
Sbjct: 257 EIKKKKPKESKGVKALKKVVAKGMKKIDSF 286
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast) and
GNL3L (human) are putative GTPases which are required
for growth and play a role in processing of nucleolar
pre-rRNA. This family contains a potential nuclear
localisation signal.
Length = 80
Score = 34.9 bits (81), Expect = 0.010
Identities = 12/30 (40%), Positives = 17/30 (56%), Gaps = 2/30 (6%)
Query: 428 HKNERKKKKKKKKKKK--KKKKKKKKKKKK 455
+K E+K + +K +K KK K KKKK
Sbjct: 4 YKIEKKVAEHHRKLRKEAKKNPTWKSKKKK 33
Score = 31.8 bits (73), Expect = 0.10
Identities = 9/27 (33%), Positives = 14/27 (51%), Gaps = 2/27 (7%)
Query: 432 RKKKKKKKKKKKKKKKKKK--KKKKKW 456
R + K +KK + +K +K KK W
Sbjct: 1 RLRYKIEKKVAEHHRKLRKEAKKNPTW 27
Score = 28.0 bits (63), Expect = 2.2
Identities = 6/26 (23%), Positives = 21/26 (80%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
E+K+K++++K+++K+ +K ++ + +
Sbjct: 52 EEKKRKQEEEKERRKEARKAERAEAR 77
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 37.4 bits (87), Expect = 0.011
Identities = 9/29 (31%), Positives = 12/29 (41%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKKW 456
+ +K K K KK KK +KW
Sbjct: 187 LSSTASSGDEKSPKSKAAPKKAGKKMRKW 215
Score = 34.0 bits (78), Expect = 0.12
Identities = 7/27 (25%), Positives = 8/27 (29%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+K K K KK KK
Sbjct: 185 AKLSSTASSGDEKSPKSKAAPKKAGKK 211
Score = 31.3 bits (71), Expect = 0.94
Identities = 10/54 (18%), Positives = 15/54 (27%), Gaps = 1/54 (1%)
Query: 402 PAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ P + K+ P S K + +K K K KK
Sbjct: 157 SSTPNSSRPSTPHLLKAKEGP-SRRAKKAAKLSSTASSGDEKSPKSKAAPKKAG 209
Score = 29.3 bits (66), Expect = 3.7
Identities = 13/55 (23%), Positives = 19/55 (34%), Gaps = 11/55 (20%)
Query: 410 GAHSMTSSLKQNPYSSHVHKNE-----RKKKKKK---KKKKKKKKKKKKKKKKKW 456
+ + SS P + H+ K + R KK K +K K K K
Sbjct: 156 PSSTPNSSR---PSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKK 207
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 36.4 bits (85), Expect = 0.013
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKK 454
KKK KK KK+K+K++K+ +K
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGA 35
Score = 36.4 bits (85), Expect = 0.013
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 434 KKKKKKKKKKKKKKKKKKKKKK 455
KKK KK KK+K+K++K+ +K
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGA 35
Score = 34.5 bits (80), Expect = 0.061
Identities = 9/23 (39%), Positives = 18/23 (78%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKK 452
+++K KK KK+K+K++K+ +K
Sbjct: 13 DKKKAKKAKKEKRKQRKQARKGA 35
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 35.7 bits (83), Expect = 0.013
Identities = 14/35 (40%), Positives = 21/35 (60%), Gaps = 10/35 (28%)
Query: 431 ERKKKKKKKKKKKKK----------KKKKKKKKKK 455
ER K++KKK+ K+KK K++KKK K +
Sbjct: 96 ERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130
Score = 35.3 bits (82), Expect = 0.016
Identities = 14/31 (45%), Positives = 25/31 (80%), Gaps = 3/31 (9%)
Query: 428 HKNERKKKKKKKKKKKKKKK---KKKKKKKK 455
+K E+KK+ K+KK+++K++K K++KKK K
Sbjct: 98 NKQEKKKRSKEKKEEEKERKRQLKQQKKKAK 128
Score = 34.1 bits (79), Expect = 0.047
Identities = 11/26 (42%), Positives = 22/26 (84%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K E ++ K++KKK+ K+KK+++K++K
Sbjct: 92 KLEHERNKQEKKKRSKEKKEEEKERK 117
Score = 31.5 bits (72), Expect = 0.39
Identities = 8/24 (33%), Positives = 20/24 (83%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKK 452
K +++++K++K++ K++KKK K +
Sbjct: 107 KEKKEEEKERKRQLKQQKKKAKHR 130
Score = 30.7 bits (70), Expect = 0.73
Identities = 8/24 (33%), Positives = 19/24 (79%)
Query: 436 KKKKKKKKKKKKKKKKKKKKWSFR 459
++ K++KKK+ K+KK+++K+ +
Sbjct: 96 ERNKQEKKKRSKEKKEEEKERKRQ 119
Score = 28.4 bits (64), Expect = 4.5
Identities = 9/23 (39%), Positives = 19/23 (82%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKK 455
K + ++ K++KKK+ K+KK+++K
Sbjct: 92 KLEHERNKQEKKKRSKEKKEEEK 114
Score = 28.4 bits (64), Expect = 4.6
Identities = 9/28 (32%), Positives = 20/28 (71%)
Query: 434 KKKKKKKKKKKKKKKKKKKKKKWSFRFQ 461
K + ++ K++KKK+ K+KK+++ + Q
Sbjct: 92 KLEHERNKQEKKKRSKEKKEEEKERKRQ 119
>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
Length = 145
Score = 35.8 bits (83), Expect = 0.014
Identities = 16/31 (51%), Positives = 18/31 (58%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKKWSFRFQR 462
KKK KK K KK K KK+K + S RF R
Sbjct: 7 AKKKAKKAAKAAKKGVKVKKRKVRTSVRFFR 37
Score = 30.8 bits (70), Expect = 0.73
Identities = 12/31 (38%), Positives = 16/31 (51%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWSFR 459
K ++K KK K KK K KK+K + F
Sbjct: 6 KAKKKAKKAAKAAKKGVKVKKRKVRTSVRFF 36
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 36.1 bits (84), Expect = 0.014
Identities = 7/26 (26%), Positives = 18/26 (69%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
E++ K K++K+++ ++K+K+ K
Sbjct: 123 LLEKELAKLKREKRRENERKQKEILK 148
Score = 33.4 bits (77), Expect = 0.10
Identities = 7/23 (30%), Positives = 17/23 (73%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKK 455
K+ K K++K+++ ++K+K+ K
Sbjct: 126 KELAKLKREKRRENERKQKEILK 148
Score = 33.0 bits (76), Expect = 0.12
Identities = 7/25 (28%), Positives = 18/25 (72%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKK 454
E K K++K+++ ++K+K+ K++
Sbjct: 126 KELAKLKREKRRENERKQKEILKEQ 150
Score = 31.9 bits (73), Expect = 0.32
Identities = 6/23 (26%), Positives = 17/23 (73%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKK 455
+ +K+ K K++K+++ ++K+K
Sbjct: 122 ELLEKELAKLKREKRRENERKQK 144
Score = 31.1 bits (71), Expect = 0.69
Identities = 8/28 (28%), Positives = 19/28 (67%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K K K++K+++ ++K+K+ K++ K
Sbjct: 125 EKELAKLKREKRRENERKQKEILKEQMK 152
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 35.3 bits (82), Expect = 0.014
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 8/39 (20%)
Query: 432 RKKKKKK-------KKKKKKKKKKKKKKKKKWSF-RFQR 462
+K ++ K K +K+KKKKKKK+ +F RFQ
Sbjct: 57 KKGRRPKTARKESVAAKAAEKEKKKKKKKELKNFYRFQI 95
>gnl|CDD|222976 PHA03087, PHA03087, G protein-coupled chemokine receptor-like
protein; Provisional.
Length = 335
Score = 37.1 bits (86), Expect = 0.015
Identities = 31/141 (21%), Positives = 58/141 (41%), Gaps = 20/141 (14%)
Query: 184 WLIPTVKSVSVIIMSAVDGDTIAGICYVG-NHNIHNLKYFVVFPL-IIYLILGIIFLFGG 241
W+I +++ ++ + D IC + N+ N K F+ F + II +++ + L
Sbjct: 163 WIISIIETTPILFVYTTKKDHETLICCMFYNNKTMNWKLFINFEINIIGMLIPLTILL-- 220
Query: 242 FISLFRIRNVIKNQGGISNRTKADKLEKLMIRIGIFSVLSTLPASIVI------------ 289
+I +K GI+ K K KL++ I I V+ LP ++ +
Sbjct: 221 -YCYSKILITLK---GINKSKKNKKAIKLVLIIVILFVIFWLPFNVSVFVYSLHILHFKS 276
Query: 290 GCLIYESYVYNEYVESIVCPC 310
GC + Y +V I+
Sbjct: 277 GCKAVKYIQYALHVTEIISLS 297
>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated. This family of
proteins is associated with U3 snoRNA. U3 snoRNA is
required for nucleolar processing of pre-18S ribosomal
RNA.
Length = 87
Score = 33.9 bits (78), Expect = 0.020
Identities = 11/30 (36%), Positives = 17/30 (56%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKKWSFRFQ 461
+K+KKK K + KK +K+ K KK +
Sbjct: 29 KKQKKKNKLRFLKKLEKRPKDVKKGPVTVR 58
Score = 32.4 bits (74), Expect = 0.073
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 416 SSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
P S+ + +++KKK K + KK +K+ K KK
Sbjct: 14 DRPPTPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52
Score = 31.2 bits (71), Expect = 0.21
Identities = 11/33 (33%), Positives = 17/33 (51%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQRF 463
++KKK K + KK +K+ K KK + R
Sbjct: 30 KQKKKNKLRFLKKLEKRPKDVKKGPVTVRVLSD 62
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 36.8 bits (86), Expect = 0.020
Identities = 11/27 (40%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
ERKK +KK++K +KK +K++ +K
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAA 434
Score = 35.7 bits (83), Expect = 0.051
Identities = 12/27 (44%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K RKK++K +KK +K++ +K KKK
Sbjct: 412 KKLRKKQRKAEKKAEKEEAEKAAAKKK 438
Score = 32.6 bits (75), Expect = 0.44
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +K++K +KK +K++ +K KKK
Sbjct: 413 KLRKKQRKAEKKAEKEEAEKAAAKKKA 439
Score = 32.2 bits (74), Expect = 0.61
Identities = 9/26 (34%), Positives = 20/26 (76%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
+ ++KK +KK++K +KK +K++ +K
Sbjct: 407 SPAERKKLRKKQRKAEKKAEKEEAEK 432
Score = 31.1 bits (71), Expect = 1.4
Identities = 9/27 (33%), Positives = 15/27 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ KK +K++ +K KKK + KK
Sbjct: 419 RKAEKKAEKEEAEKAAAKKKAEAAAKK 445
Score = 31.1 bits (71), Expect = 1.5
Identities = 9/26 (34%), Positives = 18/26 (69%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
N ++KK +KK++K +KK +K++
Sbjct: 405 NLSPAERKKLRKKQRKAEKKAEKEEA 430
Score = 30.7 bits (70), Expect = 2.1
Identities = 8/27 (29%), Positives = 18/27 (66%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKK 454
+ +++K +KK +K++ +K KKK +
Sbjct: 414 LRKKQRKAEKKAEKEEAEKAAAKKKAE 440
Score = 30.3 bits (69), Expect = 2.1
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E+K +K++ +K KKK + KK
Sbjct: 420 KAEKKAEKEEAEKAAAKKKAEAAAKKA 446
Score = 29.5 bits (67), Expect = 4.2
Identities = 9/27 (33%), Positives = 14/27 (51%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E+++ +K KKK + KK K
Sbjct: 424 KAEKEEAEKAAAKKKAEAAAKKAKGPD 450
Score = 29.1 bits (66), Expect = 5.8
Identities = 8/28 (28%), Positives = 12/28 (42%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E +K KKK + KK K +
Sbjct: 425 AEKEEAEKAAAKKKAEAAAKKAKGPDGE 452
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 36.7 bits (85), Expect = 0.023
Identities = 12/30 (40%), Positives = 25/30 (83%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKKWS 457
+ ERK+K+ K++++KK+ +++KKKKK+ +
Sbjct: 278 LRTERKEKEAKEQQEKKELEQRKKKKKEMA 307
Score = 30.9 bits (70), Expect = 1.4
Identities = 14/48 (29%), Positives = 28/48 (58%)
Query: 416 SSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQRF 463
S+ K + ++K+ K++++KK+ +++KKKKK+ QRF
Sbjct: 267 STKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRF 314
Score = 30.9 bits (70), Expect = 1.4
Identities = 8/43 (18%), Positives = 25/43 (58%)
Query: 421 NPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQRF 463
P+ + ++++KK ++ K +++ K +KK++ + ++F
Sbjct: 7 QPFQMTKREKKKREKKSIRQSKLEEELNKLEKKEEEAECKKKF 49
Score = 28.6 bits (64), Expect = 6.6
Identities = 10/36 (27%), Positives = 19/36 (52%), Gaps = 1/36 (2%)
Query: 419 KQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
Q P+S + + E+KK +K + +++ KK K
Sbjct: 88 SQKPFSFYE-REEQKKAILPRKLRSSTSEREPKKFK 122
>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
domain [Chromatin structure and dynamics].
Length = 211
Score = 36.0 bits (83), Expect = 0.023
Identities = 16/68 (23%), Positives = 26/68 (38%), Gaps = 1/68 (1%)
Query: 383 HDYLPPLAHGMPPPPNI-PIPAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKK 441
+Y L + P P I P + G + +++ S + KKKK
Sbjct: 136 EEYNKKLPNKAPIGPFIENEPKIRPKVEGPSPDKALVEETKIISKAWSELDESKKKKYID 195
Query: 442 KKKKKKKK 449
K KK K++
Sbjct: 196 KYKKLKEE 203
>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p) and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
gene from Saccharomyces cerevisiae. It is an essential
yeast protein involved in pre-rRNA processing and
ribosome assembly. Rrp7p contains an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 128
Score = 34.9 bits (81), Expect = 0.024
Identities = 15/31 (48%), Positives = 23/31 (74%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQ 461
E K +++KK+K+KKKKKKK+ + +RFQ
Sbjct: 68 EEAGKAAEEEKKEKEKKKKKKKELEDFYRFQ 98
>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
Length = 381
Score = 36.6 bits (85), Expect = 0.025
Identities = 11/46 (23%), Positives = 21/46 (45%)
Query: 418 LKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQRF 463
L+ NP + + K ++K K +K K ++ +KK R +
Sbjct: 326 LRLNPAAKKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKV 371
>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16. Nop16 is a
protein involved in ribosome biogenesis.
Length = 173
Score = 35.5 bits (82), Expect = 0.027
Identities = 10/25 (40%), Positives = 15/25 (60%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
RKKKK + K +K+ K+K +K
Sbjct: 1 VRKKKKNRSSNYKVNRKRLKRKDRK 25
Score = 34.3 bits (79), Expect = 0.056
Identities = 9/24 (37%), Positives = 15/24 (62%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
+KKK + K +K+ K+K +KK
Sbjct: 3 KKKKNRSSNYKVNRKRLKRKDRKK 26
Score = 33.9 bits (78), Expect = 0.091
Identities = 9/24 (37%), Positives = 15/24 (62%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
+KK + K +K+ K+K +KKK
Sbjct: 4 KKKNRSSNYKVNRKRLKRKDRKKK 27
Score = 30.5 bits (69), Expect = 1.2
Identities = 9/26 (34%), Positives = 13/26 (50%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKK 453
KN K +K+ K+K +KKK
Sbjct: 5 KKNRSSNYKVNRKRLKRKDRKKKINI 30
Score = 30.5 bits (69), Expect = 1.3
Identities = 7/26 (26%), Positives = 14/26 (53%)
Query: 436 KKKKKKKKKKKKKKKKKKKKWSFRFQ 461
+KKKK + K +K+ K+ + +
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKK 27
Score = 29.7 bits (67), Expect = 2.4
Identities = 11/33 (33%), Positives = 15/33 (45%)
Query: 424 SSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKW 456
S +K RK+ K+K +KKK K W
Sbjct: 8 RSSNYKVNRKRLKRKDRKKKINIIGNPIIAKNW 40
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 35.0 bits (81), Expect = 0.028
Identities = 8/26 (30%), Positives = 24/26 (92%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKKWS 457
++K+++K+++K++K++K++K+K+W
Sbjct: 14 IEEKREEKEREKEEKERKEEKEKEWG 39
Score = 30.0 bits (68), Expect = 1.5
Identities = 8/27 (29%), Positives = 23/27 (85%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
E++++K+++K++K++K++K+K+ K
Sbjct: 14 IEEKREEKEREKEEKERKEEKEKEWGK 40
Score = 28.0 bits (63), Expect = 5.1
Identities = 6/29 (20%), Positives = 23/29 (79%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
V++++ + ++K+++K+++K++K++K
Sbjct: 3 VYRDKSGRIIDIEEKREEKEREKEEKERK 31
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation of
60S ribosomal subunits.
Length = 132
Score = 34.6 bits (80), Expect = 0.029
Identities = 9/32 (28%), Positives = 20/32 (62%)
Query: 424 SSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
S + K ++K K +K +KK K++ + +K++
Sbjct: 27 SKKLLKAKKKLKSEKLEKKAKRQLRAEKRQAL 58
Score = 33.9 bits (78), Expect = 0.054
Identities = 10/33 (30%), Positives = 19/33 (57%)
Query: 424 SSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKW 456
S + KKK K +K +KK K++ + +K++
Sbjct: 25 SRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQA 57
Score = 30.0 bits (68), Expect = 1.3
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 434 KKKKKKKKKKKKKKKKKKKKKKWSFRFQR 462
+ KK K KKK K +K +KK K R ++
Sbjct: 26 RSKKLLKAKKKLKSEKLEKKAKRQLRAEK 54
Score = 30.0 bits (68), Expect = 1.3
Identities = 8/27 (29%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K+E+ +KK K++ + +K++ +K + K
Sbjct: 38 KSEKLEKKAKRQLRAEKRQALEKGRVK 64
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 36.4 bits (84), Expect = 0.031
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + KK K KKK KK KKK+ K+ K
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDK 169
Score = 34.0 bits (78), Expect = 0.13
Identities = 13/24 (54%), Positives = 15/24 (62%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKK 454
+ K KK K KKK KK KKK+ K
Sbjct: 141 GIETKAKKGKAKKKTKKSKKKEAK 164
Score = 32.9 bits (75), Expect = 0.32
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
+K K KKK KK KKK+ K+ K
Sbjct: 147 KKGKAKKKTKKSKKKEAKESSDKD 170
Score = 32.9 bits (75), Expect = 0.35
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
E K KK K KKK KK KKK+ K+
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESS 167
Score = 32.1 bits (73), Expect = 0.62
Identities = 11/25 (44%), Positives = 15/25 (60%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
++ K KKK KK KKK+ K+ K
Sbjct: 147 KKGKAKKKTKKSKKKEAKESSDKDD 171
Score = 31.7 bits (72), Expect = 0.91
Identities = 15/52 (28%), Positives = 20/52 (38%), Gaps = 12/52 (23%)
Query: 419 KQNPYSSHVHKNERKKKKKKKKKKKKK------------KKKKKKKKKKWSF 458
P S + + K+ +KKK KK KKK K+KK F
Sbjct: 365 AAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSWEAKKKAKEKKANAKF 416
Score = 31.3 bits (71), Expect = 1.2
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKW 456
K + KKK KK KKK+ K+ K +++
Sbjct: 148 KGKAKKKTKKSKKKEAKESSDKDDEEES 175
Score = 31.0 bits (70), Expect = 1.4
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKKWS 457
+ KKK KK KKK+ K+ K ++ S
Sbjct: 149 GKAKKKTKKSKKKEAKESSDKDDEEES 175
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 36.5 bits (84), Expect = 0.032
Identities = 25/87 (28%), Positives = 35/87 (40%), Gaps = 22/87 (25%)
Query: 391 HGMPPPP--------NIPIPAVPVMGGGAHSMTSSL---------KQNPYSS----HVHK 429
H +P P N P A+ + A S + L K S K
Sbjct: 500 HFVPKPAREEVKIVSNTPAIAMEEVAPTAVSDAALLAPEEIYKKMKAIEKSKTELDRTDK 559
Query: 430 N-ERKKKKKKKKKKKKKKKKKKKKKKK 455
N ER+KKK+K++ KK K KK++K
Sbjct: 560 NRERRKKKRKRRAAKKAVTKAKKERKI 586
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 36.2 bits (84), Expect = 0.034
Identities = 13/39 (33%), Positives = 16/39 (41%)
Query: 417 SLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ SS + K K K K KKKKKK+K
Sbjct: 510 QDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDD 548
Score = 33.5 bits (77), Expect = 0.24
Identities = 11/26 (42%), Positives = 19/26 (73%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
+RKKK+K+K++ K + KK+K+K
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRK 652
Score = 33.5 bits (77), Expect = 0.28
Identities = 10/26 (38%), Positives = 18/26 (69%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K +KK+K+K++ K + KK+K+K
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRK 652
Score = 32.3 bits (74), Expect = 0.59
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +RK+K++ K + KK+K+K KK
Sbjct: 630 KKKRKRKRRFLTKIEGVKKEKRKDKKL 656
Score = 31.6 bits (72), Expect = 0.98
Identities = 10/27 (37%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++K++ K + KK+K+K KK K
Sbjct: 631 KKRKRKRRFLTKIEGVKKEKRKDKKLK 657
Score = 31.2 bits (71), Expect = 1.4
Identities = 10/27 (37%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++K+K+K++ K + KK+K+K K
Sbjct: 628 KRKKKRKRKRRFLTKIEGVKKEKRKDK 654
Score = 30.8 bits (70), Expect = 1.8
Identities = 9/28 (32%), Positives = 19/28 (67%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +++K+K++ K + KK+K+K KK
Sbjct: 628 KRKKKRKRKRRFLTKIEGVKKEKRKDKK 655
Score = 30.4 bits (69), Expect = 2.3
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E K K+ KK K KK + KK+K K
Sbjct: 217 EAKAKRIKKIKSKKYHRVHKKEKLK 241
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 35.2 bits (81), Expect = 0.036
Identities = 16/40 (40%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 417 SLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKW 456
S ++ P ++ V K ++K K KK K K KK + KKK ++ W
Sbjct: 79 SWRRKPATTAVEK-KKKGKSKKNKLKGKKDEDKKKAREGW 117
Score = 29.4 bits (66), Expect = 3.1
Identities = 13/50 (26%), Positives = 20/50 (40%)
Query: 401 IPAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKK 450
V+ G ++ K + K + KK K K KK + KKK +
Sbjct: 65 SHDYEVLVTGNTGISWRRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 34.7 bits (80), Expect = 0.040
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E K+ ++K+ K KK+KK+KK+KK
Sbjct: 99 QKEVSKETEEKEAIKAKKEKKEKKEKK 125
Score = 34.0 bits (78), Expect = 0.075
Identities = 16/63 (25%), Positives = 30/63 (47%), Gaps = 6/63 (9%)
Query: 399 IPIPAVPVMGGGAHSMTSSLKQ------NPYSSHVHKNERKKKKKKKKKKKKKKKKKKKK 452
+ P G T+S K+ YS ++ E +K++K+ K+ ++K+ K K
Sbjct: 56 FNVDKKPKRLGRFPGFTNSYKKAYVTLAEGYSINLFPEESEKEQKEVSKETEEKEAIKAK 115
Query: 453 KKK 455
K+K
Sbjct: 116 KEK 118
Score = 33.2 bits (76), Expect = 0.14
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E+K+ K+ ++K+ K KK+KK+KK
Sbjct: 96 EKEQKEVSKETEEKEAIKAKKEKKEKK 122
Score = 33.2 bits (76), Expect = 0.14
Identities = 10/27 (37%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + K+ ++K+ K KK+KK+KK+K
Sbjct: 98 EQKEVSKETEEKEAIKAKKEKKEKKEK 124
Score = 32.4 bits (74), Expect = 0.26
Identities = 17/40 (42%), Positives = 23/40 (57%)
Query: 416 SSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
S +Q S + E K KK+KK+KK+KK +K KKK
Sbjct: 95 SEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKK 134
Score = 29.7 bits (67), Expect = 1.7
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ KK+KK+KK+KK +K KKK K
Sbjct: 109 KEAIKAKKEKKEKKEKKVAEKLAKKKSTK 137
Score = 28.2 bits (63), Expect = 6.4
Identities = 10/28 (35%), Positives = 12/28 (42%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKK 454
K K K KK KK KK++ K
Sbjct: 131 AKKKSTKTTKNTTKKATKKTTTKKEEGK 158
Score = 27.4 bits (61), Expect = 9.5
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +++KK+KK+KK +K KKK K
Sbjct: 111 AIKAKKEKKEKKEKKVAEKLAKKKSTKTT 139
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 35.9 bits (84), Expect = 0.041
Identities = 7/29 (24%), Positives = 11/29 (37%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
KK K +K K++ +K K
Sbjct: 829 AKDPAGKKAAVKFSRKTKQQYVASEKDGK 857
Score = 35.5 bits (83), Expect = 0.064
Identities = 7/30 (23%), Positives = 11/30 (36%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKKWS 457
E+ +K + K KK K+S
Sbjct: 813 LLAEKPEKLRYLADAPAKDPAGKKAAVKFS 842
Score = 34.0 bits (79), Expect = 0.20
Identities = 6/24 (25%), Positives = 11/24 (45%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
+K K +K K++ +K K
Sbjct: 835 KKAAVKFSRKTKQQYVASEKDGKA 858
Score = 33.2 bits (77), Expect = 0.29
Identities = 6/24 (25%), Positives = 9/24 (37%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
K KK K +K K++
Sbjct: 825 ADAPAKDPAGKKAAVKFSRKTKQQ 848
Score = 33.2 bits (77), Expect = 0.36
Identities = 6/24 (25%), Positives = 8/24 (33%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
K KK K +K K+
Sbjct: 824 LADAPAKDPAGKKAAVKFSRKTKQ 847
Score = 33.2 bits (77), Expect = 0.36
Identities = 7/27 (25%), Positives = 8/27 (29%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K K KK K +K K
Sbjct: 820 KLRYLADAPAKDPAGKKAAVKFSRKTK 846
Score = 32.5 bits (75), Expect = 0.55
Identities = 6/28 (21%), Positives = 9/28 (32%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K KK K +K K++
Sbjct: 822 RYLADAPAKDPAGKKAAVKFSRKTKQQY 849
Score = 32.5 bits (75), Expect = 0.61
Identities = 6/24 (25%), Positives = 10/24 (41%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKKW 456
KK K +K K++ +K
Sbjct: 832 PAGKKAAVKFSRKTKQQYVASEKD 855
Score = 32.1 bits (74), Expect = 0.78
Identities = 6/25 (24%), Positives = 10/25 (40%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
+ K KK K +K K++
Sbjct: 826 DAPAKDPAGKKAAVKFSRKTKQQYV 850
Score = 32.1 bits (74), Expect = 0.83
Identities = 6/27 (22%), Positives = 9/27 (33%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E+ + K KK K +
Sbjct: 817 KPEKLRYLADAPAKDPAGKKAAVKFSR 843
Score = 31.7 bits (73), Expect = 0.95
Identities = 5/25 (20%), Positives = 10/25 (40%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKK 453
+ K +K K++ +K K
Sbjct: 834 GKKAAVKFSRKTKQQYVASEKDGKA 858
>gnl|CDD|218078 pfam04419, 4F5, 4F5 protein family. Members of this family are
short proteins that are rich in aspartate, glutamate,
lysine and arginine. Although the function of these
proteins is unknown, they are found to be ubiquitously
expressed.
Length = 38
Score = 31.8 bits (73), Expect = 0.045
Identities = 12/25 (48%), Positives = 13/25 (52%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
ER KK KKKKK K + K K
Sbjct: 12 ERNAKKAAKKKKKGAKSQLKAAAKA 36
Score = 29.9 bits (68), Expect = 0.23
Identities = 10/27 (37%), Positives = 17/27 (62%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKK 454
++ E+ +++ KK KKKKK K + K
Sbjct: 5 NQREQARERNAKKAAKKKKKGAKSQLK 31
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 33.8 bits (78), Expect = 0.053
Identities = 8/29 (27%), Positives = 23/29 (79%)
Query: 426 HVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
H K +R+KK +++ K+K+++++ +++K+
Sbjct: 28 HKRKQQRRKKAQEEAKEKEREERIEERKR 56
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 35.1 bits (81), Expect = 0.066
Identities = 8/27 (29%), Positives = 21/27 (77%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +RK K ++ K+K++K+ +++ K++K
Sbjct: 274 RPKRKTKAQRNKEKRRKELEREAKEEK 300
Score = 33.5 bits (77), Expect = 0.20
Identities = 10/26 (38%), Positives = 21/26 (80%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K +R K+K++K+ +++ K++K+ KKK
Sbjct: 280 KAQRNKEKRRKELEREAKEEKQLKKK 305
Score = 32.8 bits (75), Expect = 0.43
Identities = 7/27 (25%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
N+ + K+K K ++ K+K++K+ +++
Sbjct: 269 INKPVRPKRKTKAQRNKEKRRKELERE 295
Score = 32.4 bits (74), Expect = 0.46
Identities = 9/28 (32%), Positives = 19/28 (67%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKW 456
K R K+K K ++ K+K++K+ +++ K
Sbjct: 271 KPVRPKRKTKAQRNKEKRRKELEREAKE 298
Score = 31.6 bits (72), Expect = 0.99
Identities = 7/27 (25%), Positives = 15/27 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + K+K K ++ K+K++K+
Sbjct: 266 YEPINKPVRPKRKTKAQRNKEKRRKEL 292
Score = 31.2 bits (71), Expect = 1.2
Identities = 11/40 (27%), Positives = 20/40 (50%), Gaps = 7/40 (17%)
Query: 423 YSSHVHKNERKKKKKKKKKKKKKKKK-------KKKKKKK 455
Y K+K K ++ K+K++K+ +K+ KKK
Sbjct: 266 YEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKK 305
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 35.3 bits (81), Expect = 0.067
Identities = 15/25 (60%), Positives = 15/25 (60%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E K KKKKKK K KKK KK K
Sbjct: 238 EEKSKKKKKKLAKNKKKLDDDKKGK 262
Score = 33.4 bits (76), Expect = 0.25
Identities = 14/27 (51%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
E+ KKKKKK K KKK KK K+
Sbjct: 237 DEEKSKKKKKKLAKNKKKLDDDKKGKR 263
Score = 31.1 bits (70), Expect = 1.3
Identities = 13/27 (48%), Positives = 14/27 (51%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
KNE + KK KK KK K KK K
Sbjct: 327 KNEEEGGLSKKGKKLKKLKGKKNGLDK 353
Score = 30.7 bits (69), Expect = 2.0
Identities = 12/29 (41%), Positives = 12/29 (41%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWS 457
E KK KK KK K KK K S
Sbjct: 328 NEEEGGLSKKGKKLKKLKGKKNGLDKDDS 356
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 34.7 bits (80), Expect = 0.067
Identities = 8/30 (26%), Positives = 25/30 (83%)
Query: 426 HVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + E KK K++++K++++++K++KK+++
Sbjct: 183 RLQEWELKKLKQQQQKREEERRKQRKKQQE 212
Score = 33.2 bits (76), Expect = 0.21
Identities = 6/28 (21%), Positives = 26/28 (92%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +++++K++++++K++KK+++++++K
Sbjct: 190 KKLKQQQQKREEERRKQRKKQQEEEERK 217
Score = 31.2 bits (71), Expect = 0.84
Identities = 8/32 (25%), Positives = 26/32 (81%), Gaps = 6/32 (18%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKK------KW 456
++K++++++K++KK+++++++K+K KW
Sbjct: 196 QQKREEERRKQRKKQQEEEERKQKAEEAWQKW 227
Score = 31.2 bits (71), Expect = 0.97
Identities = 7/27 (25%), Positives = 26/27 (96%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +++K++++++K++KK+++++++K+K
Sbjct: 193 KQQQQKREEERRKQRKKQQEEEERKQK 219
Score = 30.5 bits (69), Expect = 1.5
Identities = 11/50 (22%), Positives = 30/50 (60%), Gaps = 5/50 (10%)
Query: 411 AHSMTSSLK-QNPYS----SHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ S++ S K + S + KK K++++K++++++K++KK++
Sbjct: 162 SSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQ 211
Score = 29.3 bits (66), Expect = 4.2
Identities = 6/25 (24%), Positives = 22/25 (88%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKKWS 457
++++K++KK+++++++K+K ++ W
Sbjct: 201 EERRKQRKKQQEEEERKQKAEEAWQ 225
>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal. The yeast member
of this family (Kri1p) is found to be required for 40S
ribosome biogenesis in the nucleolus. This is the
C-terminal domain of the family.
Length = 93
Score = 32.6 bits (75), Expect = 0.070
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKKWSF 458
R K++++K KKK KK + ++ KK F
Sbjct: 67 RDKEERRKDKKKYGKKARLREWKKKVF 93
Score = 31.4 bits (72), Expect = 0.20
Identities = 15/37 (40%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 417 SLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKK 453
SLK+ + + K ER+K KKK KK + ++ KKK
Sbjct: 59 SLKK--LAPYRDKEERRKDKKKYGKKARLREWKKKVF 93
Score = 29.9 bits (68), Expect = 0.64
Identities = 9/24 (37%), Positives = 18/24 (75%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKK 454
++++++K KKK KK + ++ KKK
Sbjct: 68 DKEERRKDKKKYGKKARLREWKKK 91
Score = 29.1 bits (66), Expect = 1.2
Identities = 9/25 (36%), Positives = 16/25 (64%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +++K KKK KK + ++ KKK
Sbjct: 69 KEERRKDKKKYGKKARLREWKKKVF 93
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 35.3 bits (82), Expect = 0.071
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ K K +KK++++KK+KKKK KKK
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKK 440
Score = 34.5 bits (80), Expect = 0.10
Identities = 13/29 (44%), Positives = 23/29 (79%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
V K E+K++++KK+KKKK KKK++++
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEE 444
Score = 33.7 bits (78), Expect = 0.22
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ KK KK +K +KK++++KK+KK
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKK 431
Score = 33.0 bits (76), Expect = 0.33
Identities = 9/33 (27%), Positives = 21/33 (63%)
Query: 423 YSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
++++++KK+KKKK KKK+++++
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEE 445
Score = 32.6 bits (75), Expect = 0.46
Identities = 11/27 (40%), Positives = 22/27 (81%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E+K+KKKK KKK+++++++K+K
Sbjct: 424 EEEKKEKKKKAFAGKKKEEEEEEEKEK 450
Score = 32.6 bits (75), Expect = 0.50
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKKWS 457
+K KK KK +K +KK++++KK+
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKK 431
Score = 32.2 bits (74), Expect = 0.56
Identities = 13/27 (48%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K KK KK +K +KK++++KK+KKK
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKK 432
Score = 32.2 bits (74), Expect = 0.56
Identities = 10/25 (40%), Positives = 21/25 (84%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
+R+++KK+KKKK KKK++++++
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEE 446
Score = 32.2 bits (74), Expect = 0.63
Identities = 11/27 (40%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ KK +K +KK++++KK+KKKK
Sbjct: 409 TKKIKKIVEKAEKKREEEKKEKKKKAF 435
Score = 31.8 bits (73), Expect = 0.75
Identities = 10/30 (33%), Positives = 20/30 (66%)
Query: 426 HVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++KK+KKKK KKK+++++++
Sbjct: 418 KAEKKREEEKKEKKKKAFAGKKKEEEEEEE 447
Score = 31.8 bits (73), Expect = 0.84
Identities = 10/27 (37%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++ +K +KK++++KK+KKKK
Sbjct: 411 KIKKIVEKAEKKREEEKKEKKKKAFAG 437
Score = 31.8 bits (73), Expect = 0.88
Identities = 12/27 (44%), Positives = 21/27 (77%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E +KK+KKKK KKK+++++++K
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEK 448
Score = 31.0 bits (71), Expect = 1.2
Identities = 11/27 (40%), Positives = 21/27 (77%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E KK+KKKK KKK+++++++K+
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKE 449
Score = 31.0 bits (71), Expect = 1.3
Identities = 10/29 (34%), Positives = 22/29 (75%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +++KKKK KKK+++++++K+KK
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKK 451
Score = 30.3 bits (69), Expect = 2.3
Identities = 10/27 (37%), Positives = 21/27 (77%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + KKKK KKK+++++++K+KK+
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKE 452
Score = 30.3 bits (69), Expect = 2.7
Identities = 11/27 (40%), Positives = 22/27 (81%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E+KKK KKK+++++++K+KK++
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEE 453
Score = 28.3 bits (64), Expect = 8.6
Identities = 9/27 (33%), Positives = 22/27 (81%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++KK KKK+++++++K+KK+++
Sbjct: 428 KEKKKKAFAGKKKEEEEEEEKEKKEEE 454
>gnl|CDD|222043 pfam13314, DUF4083, Domain of unknown function (DUF4083). This is
a family of very short, approximately 60 residue,
proteins from Firmicutes, that are all putatively
annotated as being MutT/Nudix. However, the
characteristic Nudix motif of GX(5)EX(7)REUXEE is
absent.
Length = 58
Score = 31.6 bits (72), Expect = 0.072
Identities = 13/54 (24%), Positives = 22/54 (40%)
Query: 218 NLKYFVVFPLIIYLILGIIFLFGGFISLFRIRNVIKNQGGISNRTKADKLEKLM 271
L + ++I LI+ I F FI + K Q K D++ +L+
Sbjct: 2 PLGTSIYLIVVIILIVLIGASFTLFIRRLLHTSAAKKQHSDPIEQKLDRIIQLL 55
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 34.6 bits (80), Expect = 0.074
Identities = 13/25 (52%), Positives = 14/25 (56%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E + KKKKKKKKKK KK
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSA 202
Score = 34.2 bits (79), Expect = 0.093
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E K + ++++ KKKKKKKKKK KK
Sbjct: 170 ELLKARLEEERAKKKKKKKKKKTKK 194
Score = 33.9 bits (78), Expect = 0.14
Identities = 14/26 (53%), Positives = 15/26 (57%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
+ ER KKKKKKKKKK KK
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSA 202
Score = 33.1 bits (76), Expect = 0.20
Identities = 13/25 (52%), Positives = 15/25 (60%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E ++ KKKKKKKKKK KK
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSS 201
Score = 31.5 bits (72), Expect = 0.67
Identities = 12/23 (52%), Positives = 13/23 (56%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKK 455
KKKKKKKKKK KK +
Sbjct: 182 KKKKKKKKKKTKKNNATGSSAEA 204
Score = 31.5 bits (72), Expect = 0.81
Identities = 13/25 (52%), Positives = 17/25 (68%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
R ++++ KKKKKKKKKK KK
Sbjct: 174 ARLEEERAKKKKKKKKKKTKKNNAT 198
Score = 31.2 bits (71), Expect = 1.0
Identities = 11/25 (44%), Positives = 13/25 (52%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
+KKKKKKKKK KK +
Sbjct: 181 AKKKKKKKKKKTKKNNATGSSAEAT 205
Score = 30.8 bits (70), Expect = 1.1
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
E + K + ++++ KKKKKKKKKK
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKK 191
Score = 30.4 bits (69), Expect = 1.7
Identities = 12/27 (44%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E + K + ++++ KKKKKKKKKK
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKT 192
Score = 28.1 bits (63), Expect = 9.9
Identities = 9/26 (34%), Positives = 12/26 (46%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
++KKKKKKK KK +
Sbjct: 182 KKKKKKKKKKTKKNNATGSSAEATVS 207
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 35.2 bits (81), Expect = 0.075
Identities = 17/54 (31%), Positives = 24/54 (44%)
Query: 402 PAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
PA V+G ++ + P + + K + KKK K K K KKK K K
Sbjct: 723 PACSVIGNSIDAVITKYAGTPKTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKS 776
Score = 32.5 bits (74), Expect = 0.51
Identities = 12/27 (44%), Positives = 14/27 (51%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKKWS 457
E+K K KKK K K K KKK +
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKA 774
Score = 30.2 bits (68), Expect = 3.1
Identities = 10/25 (40%), Positives = 12/25 (48%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKK 453
+ K KKK K K KK KK+
Sbjct: 760 TKGKAAKTVKKKSKAKSKKTTKKRA 784
Score = 29.8 bits (67), Expect = 3.3
Identities = 11/23 (47%), Positives = 12/23 (52%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKK 454
K K KKK K K KK KK+
Sbjct: 762 GKAAKTVKKKSKAKSKKTTKKRA 784
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 35.1 bits (81), Expect = 0.076
Identities = 16/37 (43%), Positives = 21/37 (56%), Gaps = 7/37 (18%)
Query: 428 HKNERKKKKKKKKKKKKKK-------KKKKKKKKKWS 457
K+E+KKKKKK+KK+ K + K KK KK
Sbjct: 68 KKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKK 104
Score = 32.8 bits (75), Expect = 0.35
Identities = 15/37 (40%), Positives = 20/37 (54%)
Query: 419 KQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
KN KKK +KKKKKKK+KK+ K + +
Sbjct: 52 TSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGE 88
Score = 30.1 bits (68), Expect = 2.6
Identities = 12/55 (21%), Positives = 17/55 (30%), Gaps = 8/55 (14%)
Query: 411 AHSMTSSLKQNPYS------SHVHKNERKK--KKKKKKKKKKKKKKKKKKKKKWS 457
T K P S + K +++ K KK KK+ K S
Sbjct: 12 FSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNES 66
Score = 30.1 bits (68), Expect = 3.0
Identities = 14/30 (46%), Positives = 20/30 (66%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWSF 458
+KK +KKKKKKK+KK+ K + + K F
Sbjct: 64 NESKKKSEKKKKKKKEKKEPKSEGETKLGF 93
Score = 28.1 bits (63), Expect = 9.8
Identities = 15/41 (36%), Positives = 18/41 (43%)
Query: 419 KQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFR 459
K+ P S K K KK KK KKK K K + +F
Sbjct: 80 KKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFN 120
>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain. This family
contains a C-terminal presumed domain in Sas10 which
hash been identified as a regulator of chromatin
silencing.
Length = 76
Score = 32.2 bits (74), Expect = 0.079
Identities = 9/28 (32%), Positives = 18/28 (64%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+N R KK+KK +K K+ K +++ ++
Sbjct: 27 RRNPRVKKRKKYEKAVKRLKSQRRVVRE 54
Score = 30.3 bits (69), Expect = 0.34
Identities = 10/24 (41%), Positives = 16/24 (66%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
RKK ++ + KK+KK +K K+ K
Sbjct: 23 RKKDRRNPRVKKRKKYEKAVKRLK 46
Score = 28.8 bits (65), Expect = 1.1
Identities = 9/24 (37%), Positives = 15/24 (62%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKK 451
K +R+ + KK+KK +K K+ K
Sbjct: 23 RKKDRRNPRVKKRKKYEKAVKRLK 46
Score = 28.8 bits (65), Expect = 1.1
Identities = 10/24 (41%), Positives = 17/24 (70%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
RK+KK ++ + KK+KK +K K+
Sbjct: 21 RKRKKDRRNPRVKKRKKYEKAVKR 44
Score = 27.2 bits (61), Expect = 4.0
Identities = 8/24 (33%), Positives = 16/24 (66%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
++KK ++ + KK+KK +K K+
Sbjct: 22 KRKKDRRNPRVKKRKKYEKAVKRL 45
Score = 26.4 bits (59), Expect = 7.3
Identities = 6/30 (20%), Positives = 16/30 (53%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKKWS 457
+RKK +K K+ K +++ +++ +
Sbjct: 31 RVKKRKKYEKAVKRLKSQRRVVREEDGGYG 60
>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020). This
family of fungal proteins is conserved towards the
C-terminus of HMG domains. The function is not known.
Length = 49
Score = 31.4 bits (71), Expect = 0.080
Identities = 10/28 (35%), Positives = 16/28 (57%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKW 456
K+ + + KKK KKK K+ K+K +
Sbjct: 17 KDNDLRSRVKKKAKKKFGKEDSKEKDAY 44
Score = 28.3 bits (63), Expect = 1.1
Identities = 13/35 (37%), Positives = 17/35 (48%), Gaps = 8/35 (22%)
Query: 429 KNERKKKKKKKKKKKKK--------KKKKKKKKKK 455
ERKKK ++ ++ K KKK KKK K
Sbjct: 1 NRERKKKWREANSERNKDNDLRSRVKKKAKKKFGK 35
>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11. The WW domain is a
small protein module with a triple-stranded beta-sheet
fold. This is a family of WW domain binding proteins.
Length = 78
Score = 32.2 bits (74), Expect = 0.080
Identities = 11/28 (39%), Positives = 17/28 (60%), Gaps = 3/28 (10%)
Query: 431 ERKKKKKKKKKKKKKKKKK---KKKKKK 455
RK++KKK+ KK K +++ K KK
Sbjct: 10 YRKEQKKKELKKNKAERQARREAKLAKK 37
Score = 31.1 bits (71), Expect = 0.16
Identities = 9/26 (34%), Positives = 18/26 (69%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K ++KK+ KK K +++ +++ K KK
Sbjct: 12 KEQKKKELKKNKAERQARREAKLAKK 37
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 34.1 bits (79), Expect = 0.093
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E +K+K ++++K K K +KK+
Sbjct: 121 KREAAGEKEKAPRRERKPKPKAPRKKR 147
Score = 33.4 bits (77), Expect = 0.18
Identities = 7/27 (25%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
++ KK++ +K+K ++++K K K
Sbjct: 115 AEQQAKKREAAGEKEKAPRRERKPKPK 141
Score = 33.0 bits (76), Expect = 0.19
Identities = 8/33 (24%), Positives = 20/33 (60%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQ 461
+ +K+K ++++K K K +KK+K + + +
Sbjct: 123 EAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155
Score = 33.0 bits (76), Expect = 0.22
Identities = 8/27 (29%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +K+K ++++K K K +KK+K
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRK 148
Score = 33.0 bits (76), Expect = 0.24
Identities = 7/27 (25%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+R+ +K+K ++++K K K +K
Sbjct: 119 AKKREAAGEKEKAPRRERKPKPKAPRK 145
Score = 32.6 bits (75), Expect = 0.28
Identities = 6/27 (22%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +R +++ KK++ +K+K ++++K
Sbjct: 111 QAQRAEQQAKKREAAGEKEKAPRRERK 137
Score = 32.2 bits (74), Expect = 0.41
Identities = 7/27 (25%), Positives = 18/27 (66%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E++ KK++ +K+K ++++K K
Sbjct: 114 RAEQQAKKREAAGEKEKAPRRERKPKP 140
Score = 31.8 bits (73), Expect = 0.44
Identities = 7/27 (25%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + KK++ +K+K ++++K K K
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPKA 142
Score = 31.4 bits (72), Expect = 0.71
Identities = 10/37 (27%), Positives = 20/37 (54%)
Query: 419 KQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K+ + K R+++K K K +KK+K + +K +
Sbjct: 120 KKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEP 156
Score = 30.7 bits (70), Expect = 1.1
Identities = 6/28 (21%), Positives = 19/28 (67%), Gaps = 3/28 (10%)
Query: 431 ERKKKKKKKKKK---KKKKKKKKKKKKK 455
++ +++ KK++ +K+K ++++K K
Sbjct: 112 AQRAEQQAKKREAAGEKEKAPRRERKPK 139
Score = 30.7 bits (70), Expect = 1.2
Identities = 8/27 (29%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
R++K K K +KK+K + +K + +
Sbjct: 131 APRRERKPKPKAPRKKRKPRAQKPEPQ 157
Score = 28.7 bits (65), Expect = 5.7
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ ERK K K +KK+K + +K + +
Sbjct: 133 RRERKPKPKAPRKKRKPRAQKPEPQHT 159
Score = 28.4 bits (64), Expect = 6.8
Identities = 4/25 (16%), Positives = 15/25 (60%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + ++ +++ KK++ +K+K
Sbjct: 108 AKVQAQRAEQQAKKREAAGEKEKAP 132
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The function
is unclear.
Length = 151
Score = 33.4 bits (76), Expect = 0.10
Identities = 13/25 (52%), Positives = 17/25 (68%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKK 453
K + +K KKKK+K +KK K KKK
Sbjct: 19 KTQSQKSDKKKKEKVSEKKGKSKKK 43
Score = 32.6 bits (74), Expect = 0.18
Identities = 15/35 (42%), Positives = 20/35 (57%)
Query: 419 KQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKK 453
KQ S + KKKK+K +KK K KKK++K
Sbjct: 12 KQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEK 46
Score = 32.3 bits (73), Expect = 0.28
Identities = 13/31 (41%), Positives = 21/31 (67%)
Query: 425 SHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ K+++KKK+K +KK K KKK++K K
Sbjct: 20 TQSQKSDKKKKEKVSEKKGKSKKKEEKPNGK 50
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 34.7 bits (80), Expect = 0.11
Identities = 15/42 (35%), Positives = 25/42 (59%), Gaps = 5/42 (11%)
Query: 427 VHKNERKK-----KKKKKKKKKKKKKKKKKKKKKWSFRFQRF 463
+ K E KK +++ ++ K KKKK+KKK+W +F+ F
Sbjct: 403 LEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEKFRWF 444
Score = 30.8 bits (70), Expect = 1.7
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKKWSF 458
E ++ K KKKK+KKK+ +K +W
Sbjct: 418 ELIEEGLLKSKKKKRKKKEWFEKFRWFV 445
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 33.3 bits (76), Expect = 0.12
Identities = 13/48 (27%), Positives = 27/48 (56%), Gaps = 5/48 (10%)
Query: 420 QNPYSSHVHK--NERKKKKKKK---KKKKKKKKKKKKKKKKWSFRFQR 462
+ V +RK +KKKK KK+ K++K+++++K+ + R +
Sbjct: 52 KESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKE 99
Score = 31.4 bits (71), Expect = 0.47
Identities = 12/52 (23%), Positives = 29/52 (55%)
Query: 403 AVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
AVP + SS + + +E+K+ K++K+++++K+ K++K+
Sbjct: 48 AVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKE 99
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 32.6 bits (75), Expect = 0.13
Identities = 9/23 (39%), Positives = 16/23 (69%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKK 455
K+K K +KK+++KK + +K K
Sbjct: 18 AKEKAKWEKKQEEKKSEAEKLAK 40
Score = 32.2 bits (74), Expect = 0.16
Identities = 8/27 (29%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
E+ K +KK+++KK + +K K +
Sbjct: 18 AKEKAKWEKKQEEKKSEAEKLAKMSAE 44
Score = 32.2 bits (74), Expect = 0.18
Identities = 8/26 (30%), Positives = 17/26 (65%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
++ K+K K +KK+++KK + +K
Sbjct: 14 DKAIAKEKAKWEKKQEEKKSEAEKLA 39
Score = 31.8 bits (73), Expect = 0.23
Identities = 8/27 (29%), Positives = 15/27 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E KK + +K K ++K + + +K
Sbjct: 27 KQEEKKSEAEKLAKMSAEEKAEYELEK 53
Score = 31.8 bits (73), Expect = 0.25
Identities = 7/26 (26%), Positives = 16/26 (61%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K+E +K K ++K + + +K +K+
Sbjct: 32 KSEAEKLAKMSAEEKAEYELEKLEKE 57
Score = 31.5 bits (72), Expect = 0.30
Identities = 7/27 (25%), Positives = 15/27 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
KK+++KK + +K K ++K +
Sbjct: 22 AKWEKKQEEKKSEAEKLAKMSAEEKAE 48
Score = 30.7 bits (70), Expect = 0.65
Identities = 8/28 (28%), Positives = 17/28 (60%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + +KK+++KK + +K K ++K
Sbjct: 20 EKAKWEKKQEEKKSEAEKLAKMSAEEKA 47
Score = 30.3 bits (69), Expect = 0.73
Identities = 8/27 (29%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E+K+++KK + +K K ++K +
Sbjct: 23 KWEKKQEEKKSEAEKLAKMSAEEKAEY 49
Score = 29.5 bits (67), Expect = 1.3
Identities = 6/27 (22%), Positives = 14/27 (51%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +K K ++K + + +K +K+
Sbjct: 31 KKSEAEKLAKMSAEEKAEYELEKLEKE 57
>gnl|CDD|217754 pfam03836, RasGAP_C, RasGAP C-terminus.
Length = 139
Score = 33.0 bits (76), Expect = 0.13
Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 11/58 (18%)
Query: 411 AHSMTSSLKQNPYSSHVH---KNERKKKKKKK------KKKKKKKKKKKKKKKKWSFR 459
+ S L Y S++ +KKKK K K +KK KK K K+S +
Sbjct: 67 NKYLESQLDY--YKSYLDNVLSTLQKKKKFSKQKFSVKKLQKKHGKKPKFGSYKYSAK 122
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many hypothetical
proteins.
Length = 98
Score = 32.1 bits (73), Expect = 0.14
Identities = 11/34 (32%), Positives = 18/34 (52%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQRF 463
+ KK+KKKKK+ K + K KK + + +
Sbjct: 19 SNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDE 52
Score = 31.3 bits (71), Expect = 0.23
Identities = 12/31 (38%), Positives = 15/31 (48%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKKWSFRFQR 462
RKKKKK+ K + K KK +KK
Sbjct: 24 RKKKKKRTAKTARPKATKKGQKKDKKKDEFP 54
Score = 30.9 bits (70), Expect = 0.31
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ ++KK+ K + K KK +KK KKK
Sbjct: 23 KRKKKKKRTAKTARPKATKKGQKKDKKK 50
Score = 29.7 bits (67), Expect = 0.88
Identities = 12/27 (44%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
N +K+KKKKK+ K + K KK +K
Sbjct: 19 SNIKKRKKKKKRTAKTARPKATKKGQK 45
Score = 29.4 bits (66), Expect = 1.2
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +KKKK+ K + K KK +KK K
Sbjct: 22 KKRKKKKKRTAKTARPKATKKGQKKDK 48
Score = 29.4 bits (66), Expect = 1.3
Identities = 12/31 (38%), Positives = 16/31 (51%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWSFR 459
K +R K + K KK +KK KKK + F
Sbjct: 27 KKKRTAKTARPKATKKGQKKDKKKDEFPEFP 57
Score = 29.0 bits (65), Expect = 1.5
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++K+ K + K KK +KK KKK
Sbjct: 25 KKKKKRTAKTARPKATKKGQKKDKKKD 51
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 33.6 bits (77), Expect = 0.15
Identities = 17/62 (27%), Positives = 20/62 (32%), Gaps = 13/62 (20%)
Query: 394 PPPPNIPIPAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKK 453
P P P P K K ++ K K K K K K K K + K
Sbjct: 71 TEPETPPEPTPP-------------KPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKP 117
Query: 454 KK 455
KK
Sbjct: 118 KK 119
Score = 31.7 bits (72), Expect = 0.62
Identities = 18/70 (25%), Positives = 21/70 (30%), Gaps = 17/70 (24%)
Query: 390 AHGMPPPPNIPI-----PAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKK 444
P P P P P + K + K +KK KK K K
Sbjct: 53 TEEPQPEPEPPEEQPKPPTEPE------------TPPEPTPPKPKEKPKPEKKPKKPKPK 100
Query: 445 KKKKKKKKKK 454
K K K K K
Sbjct: 101 PKPKPKPKPK 110
Score = 29.4 bits (66), Expect = 3.2
Identities = 18/69 (26%), Positives = 22/69 (31%), Gaps = 14/69 (20%)
Query: 387 PPLAHGMPPPPNIPIPAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKK 446
PP PP P P P ++ P + K K K K K K +
Sbjct: 69 PPTEPETPPEPTPPKPKEKP----------KPEKKP----KKPKPKPKPKPKPKPKVKPQ 114
Query: 447 KKKKKKKKK 455
K KK K
Sbjct: 115 PKPKKPPSK 123
Score = 29.4 bits (66), Expect = 3.5
Identities = 11/27 (40%), Positives = 13/27 (48%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + K K K K K K + K KK K
Sbjct: 98 KPKPKPKPKPKPKVKPQPKPKKPPSKT 124
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 32.7 bits (75), Expect = 0.16
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKKWS 457
RKK K+K ++ K K KK++ +K+K
Sbjct: 65 RKKAKEKLRRDKLKAKKEEAEKEKEK 90
Score = 29.3 bits (66), Expect = 2.4
Identities = 9/27 (33%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++K ++ K K KK++ +K+K+K++
Sbjct: 66 KKAKEKLRRDKLKAKKEEAEKEKEKEE 92
Score = 28.9 bits (65), Expect = 2.8
Identities = 10/27 (37%), Positives = 19/27 (70%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKKWSFR 459
K+K ++ K K KK++ +K+K+K+ F
Sbjct: 69 KEKLRRDKLKAKKEEAEKEKEKEERFM 95
Score = 28.5 bits (64), Expect = 4.8
Identities = 7/28 (25%), Positives = 18/28 (64%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ ++ K KK++ +K+K+K+++ K
Sbjct: 72 LRRDKLKAKKEEAEKEKEKEERFMKALA 99
Score = 28.1 bits (63), Expect = 5.2
Identities = 8/25 (32%), Positives = 19/25 (76%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKK 453
K + ++ K K KK++ +K+K+K+++
Sbjct: 69 KEKLRRDKLKAKKEEAEKEKEKEER 93
Score = 28.1 bits (63), Expect = 6.4
Identities = 8/24 (33%), Positives = 19/24 (79%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKK 454
E+ ++ K K KK++ +K+K+K+++
Sbjct: 70 EKLRRDKLKAKKEEAEKEKEKEER 93
>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
Length = 64
Score = 30.6 bits (70), Expect = 0.17
Identities = 9/30 (30%), Positives = 19/30 (63%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKKW 456
+ + E +K +K+K+KK +K+++KK
Sbjct: 32 LRRREFYEKPSEKRKRKKAAARKRRRKKLA 61
Score = 30.2 bits (69), Expect = 0.30
Identities = 8/24 (33%), Positives = 17/24 (70%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKK 454
E+ +K+K+KK +K+++KK +
Sbjct: 39 EKPSEKRKRKKAAARKRRRKKLAR 62
Score = 29.8 bits (68), Expect = 0.40
Identities = 8/23 (34%), Positives = 16/23 (69%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKK 455
K +K+K+KK +K+++KK +
Sbjct: 40 KPSEKRKRKKAAARKRRRKKLAR 62
>gnl|CDD|222716 pfam14374, Ribos_L4_asso_C, 60S ribosomal protein L4 C-terminal
domain. This family is found at the very C-terminal of
60 ribosomal L4 proteins.
Length = 80
Score = 31.0 bits (71), Expect = 0.18
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 418 LKQNPYSSHVHKNERKKKKKKKKKKKKK 445
LK NPY+ + +KK+KK K +K
Sbjct: 53 LKLNPYAKVARRAALLAEKKRKKAKAEK 80
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 33.5 bits (77), Expect = 0.20
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ER KK++ + ++KKK+KK K +++
Sbjct: 365 KPERDKKERPGRYRRKKKEKKAKSERR 391
Score = 32.0 bits (73), Expect = 0.58
Identities = 9/26 (34%), Positives = 17/26 (65%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K ER + ++KKK+KK K +++ +
Sbjct: 370 KKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 32.0 bits (73), Expect = 0.74
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K K K ++ KK++ + ++KKK+KK
Sbjct: 358 PKPPTKAKPERDKKERPGRYRRKKKEKK 385
Score = 31.6 bits (72), Expect = 0.86
Identities = 8/27 (29%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + K K ++ KK++ + ++KKK
Sbjct: 356 KPPKPPTKAKPERDKKERPGRYRRKKK 382
Score = 30.8 bits (70), Expect = 1.5
Identities = 8/24 (33%), Positives = 17/24 (70%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
+ K ++ KK++ + ++KKK+KK
Sbjct: 363 KAKPERDKKERPGRYRRKKKEKKA 386
Score = 30.4 bits (69), Expect = 2.1
Identities = 6/27 (22%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + + ++KKK+KK K +++ +
Sbjct: 369 DKKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 30.4 bits (69), Expect = 2.2
Identities = 9/24 (37%), Positives = 17/24 (70%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
K ++ KK++ + ++KKK+KK K
Sbjct: 364 AKPERDKKERPGRYRRKKKEKKAK 387
Score = 30.1 bits (68), Expect = 2.4
Identities = 7/29 (24%), Positives = 16/29 (55%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + K K K ++ KK++ + ++KK
Sbjct: 353 LKEKPPKPPTKAKPERDKKERPGRYRRKK 381
Score = 29.3 bits (66), Expect = 4.5
Identities = 6/27 (22%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
++++++ + ++KKK+KK K +++ +
Sbjct: 368 RDKKERPGRYRRKKKEKKAKSERRGLQ 394
>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947). Family of
eukaryotic proteins with unknown function.
Length = 168
Score = 32.6 bits (75), Expect = 0.21
Identities = 11/31 (35%), Positives = 15/31 (48%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKKWSF 458
K E++ K+ KK KK + KK K F
Sbjct: 112 KKQEKELIKEGKKPYYLKKSEIKKLVLKKKF 142
Score = 31.5 bits (72), Expect = 0.45
Identities = 12/27 (44%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+E KK K+ K +KK+KK K+KK
Sbjct: 142 FDELKKSKQLDKALEKKRKKNAGKEKK 168
Score = 30.7 bits (70), Expect = 0.91
Identities = 14/47 (29%), Positives = 25/47 (53%), Gaps = 7/47 (14%)
Query: 412 HSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSF 458
SM S LK KN+ ++++ K+ KK++K+ K+ KK +
Sbjct: 89 QSMKSRLKT-------LKNKDREREILKEHKKQEKELIKEGKKPYYL 128
Score = 30.7 bits (70), Expect = 0.97
Identities = 16/42 (38%), Positives = 23/42 (54%), Gaps = 8/42 (19%)
Query: 420 QNPYSSHVHKNERKKKKKKKKKKKKKKKKK------KKKKKK 455
+ PY ++ K+E KK KKK + KK K+ KK+KK
Sbjct: 123 KKPY--YLKKSEIKKLVLKKKFDELKKSKQLDKALEKKRKKN 162
Score = 29.2 bits (66), Expect = 3.3
Identities = 13/31 (41%), Positives = 17/31 (54%), Gaps = 3/31 (9%)
Query: 428 HKNERKKKKKKKKKK---KKKKKKKKKKKKK 455
HK + K+ K+ KK KK + KK KKK
Sbjct: 111 HKKQEKELIKEGKKPYYLKKSEIKKLVLKKK 141
Score = 28.8 bits (65), Expect = 4.3
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + KK K+ K +KK+KK K+K
Sbjct: 141 KFDELKKSKQLDKALEKKRKKNAGKEK 167
Score = 28.4 bits (64), Expect = 6.2
Identities = 8/36 (22%), Positives = 19/36 (52%)
Query: 420 QNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ S + + K K ++++ K+ KK++K+ K
Sbjct: 85 KRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIK 120
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 33.8 bits (78), Expect = 0.21
Identities = 11/27 (40%), Positives = 21/27 (77%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E K+ K++K+ +K KK+++KKKK+
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKE 582
Score = 33.5 bits (77), Expect = 0.28
Identities = 10/27 (37%), Positives = 22/27 (81%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E+++K+ K++K+ +K KK+++KKK
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKK 580
Score = 32.3 bits (74), Expect = 0.63
Identities = 10/38 (26%), Positives = 25/38 (65%)
Query: 426 HVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQRF 463
+ + + +K+ K++K+ +K KK+++KKKK + ++
Sbjct: 552 ELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKA 589
Score = 31.9 bits (73), Expect = 0.76
Identities = 10/29 (34%), Positives = 20/29 (68%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWS 457
+ +K+ +K KK+++KKKK+ +K +K
Sbjct: 562 ALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
Score = 30.8 bits (70), Expect = 1.7
Identities = 9/26 (34%), Positives = 20/26 (76%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K E +++K++K+ K++K+ +K KK+
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQ 575
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 34.0 bits (77), Expect = 0.22
Identities = 6/31 (19%), Positives = 13/31 (41%), Gaps = 4/31 (12%)
Query: 429 KNERKKKK----KKKKKKKKKKKKKKKKKKK 455
K E KK K + + + + ++K +
Sbjct: 1337 KAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367
Score = 34.0 bits (77), Expect = 0.24
Identities = 3/27 (11%), Positives = 11/27 (40%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
E K + + + + ++K + +
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAE 1370
Score = 33.6 bits (76), Expect = 0.32
Identities = 6/32 (18%), Positives = 17/32 (53%)
Query: 424 SSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
++ K E + + + ++K + +KKK++
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE 1375
Score = 32.0 bits (72), Expect = 0.73
Identities = 6/26 (23%), Positives = 12/26 (46%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
+ KKK ++ KK + K + +
Sbjct: 1332 DAAKKKAEEAKKAAEAAKAEAEAAAD 1357
Score = 32.0 bits (72), Expect = 0.94
Identities = 7/27 (25%), Positives = 13/27 (48%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + KKK ++ KK + K + +
Sbjct: 1330 KADAAKKKAEEAKKAAEAAKAEAEAAA 1356
Score = 31.6 bits (71), Expect = 1.0
Identities = 6/27 (22%), Positives = 15/27 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E + + ++K + +KKK++ K
Sbjct: 1351 EAEAAADEAEAAEEKAEAAEKKKEEAK 1377
Score = 31.6 bits (71), Expect = 1.0
Identities = 3/26 (11%), Positives = 12/26 (46%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKKWS 457
+ + + + + ++K + +KK
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKE 1374
Score = 31.6 bits (71), Expect = 1.2
Identities = 5/24 (20%), Positives = 12/24 (50%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
+KK ++ KK + K + + +
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADE 1358
Score = 31.3 bits (70), Expect = 1.6
Identities = 11/31 (35%), Positives = 19/31 (61%), Gaps = 4/31 (12%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKK----KKK 455
+E + ++K + +KKK++ KKK KKK
Sbjct: 1356 ADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386
Score = 30.5 bits (68), Expect = 2.3
Identities = 4/27 (14%), Positives = 12/27 (44%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + ++ KK + K + + + +
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEA 1361
Score = 30.1 bits (67), Expect = 2.9
Identities = 14/27 (51%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E KKK KKK ++KKK + KKK
Sbjct: 1373 KEEAKKKADAAKKKAEEKKKADEAKKK 1399
Score = 30.1 bits (67), Expect = 3.1
Identities = 11/32 (34%), Positives = 18/32 (56%)
Query: 424 SSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ K + KKK ++ KK + KKK ++ KK
Sbjct: 1427 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1458
Score = 30.1 bits (67), Expect = 3.7
Identities = 12/26 (46%), Positives = 17/26 (65%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
+E KKK ++ KKK KKK ++ KK
Sbjct: 1318 DEAKKKAEEAKKKADAAKKKAEEAKK 1343
Score = 29.7 bits (66), Expect = 4.0
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E KKK + KKK ++ KK + KKK
Sbjct: 1297 KAEEKKKADEAKKKAEEAKKADEAKKK 1323
Score = 29.7 bits (66), Expect = 4.1
Identities = 12/26 (46%), Positives = 17/26 (65%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
+E KKK ++ KKK + KK + KKK
Sbjct: 1486 DEAKKKAEEAKKKADEAKKAAEAKKK 1511
Score = 29.7 bits (66), Expect = 4.1
Identities = 11/32 (34%), Positives = 18/32 (56%)
Query: 424 SSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ K + KKK ++ KK + KKK ++ KK
Sbjct: 1298 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1329
Score = 29.7 bits (66), Expect = 4.3
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
+E KKK ++ KK + KKK ++ KKK
Sbjct: 1305 DEAKKKAEEAKKADEAKKKAEEAKKK 1330
Score = 29.7 bits (66), Expect = 4.3
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
+E KKK ++ KK + KKK ++ KKK
Sbjct: 1473 DEAKKKAEEAKKADEAKKKAEEAKKK 1498
Score = 29.7 bits (66), Expect = 4.4
Identities = 10/25 (40%), Positives = 13/25 (52%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E KKK KKK ++ KK + K
Sbjct: 1326 EAKKKADAAKKKAEEAKKAAEAAKA 1350
Score = 29.7 bits (66), Expect = 4.5
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
+E KKK ++ KKK + KK KKK
Sbjct: 1394 DEAKKKAEEDKKKADELKKAAAAKKK 1419
Score = 29.7 bits (66), Expect = 4.6
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E KK + KKK ++ KK + KKK
Sbjct: 1465 KAEEAKKADEAKKKAEEAKKADEAKKK 1491
Score = 29.7 bits (66), Expect = 4.8
Identities = 11/27 (40%), Positives = 14/27 (51%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E KKK KKK ++ KK + K
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAK 1349
Score = 29.7 bits (66), Expect = 4.8
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E KKK + KKK ++ KK + KKK
Sbjct: 1426 KAEEKKKADEAKKKAEEAKKADEAKKK 1452
Score = 29.3 bits (65), Expect = 5.2
Identities = 12/38 (31%), Positives = 22/38 (57%)
Query: 418 LKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
LK+ + + E KK ++ KKK ++ KK ++ +KK
Sbjct: 1652 LKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689
Score = 29.3 bits (65), Expect = 5.4
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
KKK + KKK ++KKK + KKK
Sbjct: 1413 AAAAKKKADEAKKKAEEKKKADEAKKK 1439
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 33.6 bits (77), Expect = 0.23
Identities = 10/27 (37%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E +KK+K++ + K+K++K ++KK
Sbjct: 408 KQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 29.8 bits (67), Expect = 3.2
Identities = 9/27 (33%), Positives = 21/27 (77%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E+ K+++ +KK+K++ + K+K++K
Sbjct: 403 KEEKLKQEENEKKQKEQADEDKEKRQK 429
Score = 28.6 bits (64), Expect = 7.3
Identities = 7/27 (25%), Positives = 21/27 (77%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++++ +KK+K++ + K+K++K ++
Sbjct: 406 KLKQEENEKKQKEQADEDKEKRQKDER 432
>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2. This
family consists of several Eukaryotic rRNA processing
protein EBP2 sequences. Ebp2p is required for the
maturation of 25S rRNA and 60S subunit assembly. Ebp2p
may be one of the target proteins of Rrs1p for executing
the signal to regulate ribosome biogenesis. This family
also plays a role in chromosome segregation.
Length = 270
Score = 33.1 bits (76), Expect = 0.26
Identities = 9/30 (30%), Positives = 14/30 (46%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWSF 458
+KK +K + K K+K K K+ F
Sbjct: 196 GGSKKKGRKGGAARGKPNAKRKAKDAKFGF 225
Score = 29.6 bits (67), Expect = 3.4
Identities = 8/27 (29%), Positives = 12/27 (44%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ KKK +K + K K+K K
Sbjct: 193 AAKGGSKKKGRKGGAARGKPNAKRKAK 219
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 32.9 bits (75), Expect = 0.27
Identities = 26/101 (25%), Positives = 38/101 (37%), Gaps = 25/101 (24%)
Query: 375 NKHRVLKQHDY--LPPLAHGMPPPPNIPIPAVPVMGGGAHSM----TSSLKQNPY----- 423
N + + + + L P+ + +PP I +P G H + L NPY
Sbjct: 75 NNALMERDNKFAALQPVENSLPPGCGI-LPGTK----GIHHLQMLHHPQLGDNPYGTREM 129
Query: 424 ---------SSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
++ E KK KK + K K K KK KKK
Sbjct: 130 HHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKK 170
Score = 31.8 bits (72), Expect = 0.75
Identities = 15/79 (18%), Positives = 25/79 (31%), Gaps = 10/79 (12%)
Query: 377 HRVLKQHDYLPPLAHGMPPPPNIPIPAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKK 436
H + H L +G ++ +P + + K+ K + K
Sbjct: 110 HLQMLHHPQLGDNPYGTREMHHLEVPPIS-------TAPPEAKEVK---KPKKGQSPKVP 159
Query: 437 KKKKKKKKKKKKKKKKKKK 455
K K KK KKK +
Sbjct: 160 KAPKPKKPKKKGSVSNRSV 178
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 32.5 bits (74), Expect = 0.28
Identities = 15/23 (65%), Positives = 17/23 (73%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKK 455
KKKK +KKK KK KKKKK+K
Sbjct: 52 KKKKTTPRKKKTTKKTKKKKKEK 74
Score = 32.5 bits (74), Expect = 0.30
Identities = 15/26 (57%), Positives = 18/26 (69%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K +KKK +KKK KK KKKKK+K
Sbjct: 49 KTSKKKKTTPRKKKTTKKTKKKKKEK 74
Score = 32.1 bits (73), Expect = 0.35
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKKWSF 458
KKKK +KKK KK KKKKK+
Sbjct: 51 SKKKKTTPRKKKTTKKTKKKKKEKEE 76
Score = 32.1 bits (73), Expect = 0.38
Identities = 14/24 (58%), Positives = 16/24 (66%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
+ KKKK +KKK KK KKKKK
Sbjct: 49 KTSKKKKTTPRKKKTTKKTKKKKK 72
Score = 32.1 bits (73), Expect = 0.41
Identities = 14/23 (60%), Positives = 15/23 (65%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKK 455
KK KKKK +KKK KK KKK
Sbjct: 48 KKTSKKKKTTPRKKKTTKKTKKK 70
Score = 29.4 bits (66), Expect = 2.9
Identities = 13/24 (54%), Positives = 18/24 (75%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
+KK +KKK KK KKKKK+K++
Sbjct: 53 KKKTTPRKKKTTKKTKKKKKEKEE 76
Score = 28.6 bits (64), Expect = 5.7
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++ +KKK KK KKKKK+K++
Sbjct: 52 KKKKTTPRKKKTTKKTKKKKKEKEEVP 78
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 33.1 bits (76), Expect = 0.28
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKK 453
+ N K KKKKKKKKK KK K KK
Sbjct: 248 IRANPSKVKKKKKKKKKVVHKKYKTKK 274
Score = 30.8 bits (70), Expect = 1.4
Identities = 17/31 (54%), Positives = 17/31 (54%), Gaps = 3/31 (9%)
Query: 427 VHKNERK---KKKKKKKKKKKKKKKKKKKKK 454
H R K KKKKKKKKK KK K KK
Sbjct: 244 AHAAIRANPSKVKKKKKKKKKVVHKKYKTKK 274
Score = 30.0 bits (68), Expect = 2.7
Identities = 10/29 (34%), Positives = 19/29 (65%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
VHK + KK K++K + K KK +++++
Sbjct: 266 VHKKYKTKKLTGKQRKARVKAKKAQRRER 294
Score = 28.9 bits (65), Expect = 5.4
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ++KKKKK KK K KK K++K
Sbjct: 256 KKKKKKKKKVVHKKYKTKKLTGKQRKA 282
Score = 28.5 bits (64), Expect = 7.5
Identities = 11/30 (36%), Positives = 15/30 (50%)
Query: 426 HVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K KK K KK K++K + K KK +
Sbjct: 261 KKKKVVHKKYKTKKLTGKQRKARVKAKKAQ 290
Score = 28.5 bits (64), Expect = 8.2
Identities = 16/47 (34%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 418 LKQNPYSSHVHKNERKKKKKKKKKKKK--KKKKKKKKKKKWSFRFQR 462
++ NP K ++KK KK K KK K++K + K K + R +R
Sbjct: 248 IRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRER 294
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 33.6 bits (77), Expect = 0.28
Identities = 11/24 (45%), Positives = 13/24 (54%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
R +KK K K KK +K KK K
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAK 793
Score = 33.2 bits (76), Expect = 0.29
Identities = 11/27 (40%), Positives = 15/27 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E+K K K KK +K KK K +K
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQK 796
Score = 33.2 bits (76), Expect = 0.32
Identities = 12/41 (29%), Positives = 20/41 (48%)
Query: 422 PYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQR 462
P S+ + + K K KK +K KK K +K + + +R
Sbjct: 765 PDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKR 805
Score = 32.0 bits (73), Expect = 0.81
Identities = 12/45 (26%), Positives = 18/45 (40%)
Query: 409 GGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKK 453
G A +Q + + + +KK K K KK +K KK
Sbjct: 745 GDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKK 789
>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein. The PcfJ-like protein family
includes the E. faecalis PcfJ protein, which is
functionally uncharacterized. It is found in bacteria
and viruses, and is typically between 159 and 170 amino
acids in length. There is a conserved HCV sequence
motif.
Length = 169
Score = 32.3 bits (74), Expect = 0.29
Identities = 12/48 (25%), Positives = 23/48 (47%), Gaps = 7/48 (14%)
Query: 411 AHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSF 458
AH + Q K E++K +K +K+ + +K ++ K K+ F
Sbjct: 30 AHDRAVAKYQ-------AKKEKEKAEKMRKRYPELEKIFRELKSKFEF 70
>gnl|CDD|221161 pfam11661, DUF2986, Protein of unknown function (DUF2986). This
family of proteins has no known function.
Length = 44
Score = 29.6 bits (67), Expect = 0.30
Identities = 11/26 (42%), Positives = 13/26 (50%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
N +KK +K KKK KK K K
Sbjct: 2 NRKKKINQKLKKKAKKANAKLHPSNK 27
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 32.7 bits (75), Expect = 0.31
Identities = 15/39 (38%), Positives = 22/39 (56%)
Query: 417 SLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+K+ + K+ +++KKKKKKK KKKK KK
Sbjct: 162 DIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAA 200
Score = 32.3 bits (74), Expect = 0.37
Identities = 15/43 (34%), Positives = 18/43 (41%)
Query: 413 SMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + + E KKKKKKK KKKK KK K
Sbjct: 162 DIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMK 204
Score = 31.1 bits (71), Expect = 0.91
Identities = 14/25 (56%), Positives = 15/25 (60%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E+KKKKKK KKKK KK K
Sbjct: 181 EKKKKKKKSAKKKKLKKVAAVGMKA 205
Score = 29.6 bits (67), Expect = 2.8
Identities = 15/36 (41%), Positives = 20/36 (55%)
Query: 423 YSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSF 458
S++ + ++KKKKK KKKK KK K SF
Sbjct: 174 KSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKAISSF 209
Score = 29.2 bits (66), Expect = 4.0
Identities = 14/29 (48%), Positives = 17/29 (58%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWS 457
K +++KKKKKKK KKKK KK
Sbjct: 173 KKSNNAEEEKKKKKKKSAKKKKLKKVAAV 201
>gnl|CDD|236861 PRK11147, PRK11147, ABC transporter ATPase component; Reviewed.
Length = 635
Score = 33.4 bits (77), Expect = 0.32
Identities = 10/32 (31%), Positives = 18/32 (56%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQR 462
++ KKK++ K + K+ KK S++ QR
Sbjct: 537 KQPAVKKKEEAAAPKAETVKRSSKKLSYKLQR 568
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 32.4 bits (74), Expect = 0.32
Identities = 11/27 (40%), Positives = 15/27 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +KK+ +K KK KKK K+K
Sbjct: 49 KQPKKKRPTTPRKPATTKKSKKKDKEK 75
Score = 29.7 bits (67), Expect = 2.5
Identities = 11/29 (37%), Positives = 14/29 (48%)
Query: 424 SSHVHKNERKKKKKKKKKKKKKKKKKKKK 452
S K +R +K KK KKK K+K
Sbjct: 47 SPKQPKKKRPTTPRKPATTKKSKKKDKEK 75
Score = 29.0 bits (65), Expect = 3.7
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKK 454
++ KKK+ +K KK KKK
Sbjct: 48 PKQPKKKRPTTPRKPATTKKSKKK 71
>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
Length = 202
Score = 32.3 bits (74), Expect = 0.33
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 423 YSSHVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
Y+ V K E K+K++ KKK K +K K+
Sbjct: 148 YADVVAKLEEKRKERAAAYYKKKVKLRKAWKE 179
>gnl|CDD|220763 pfam10453, NUFIP1, Nuclear fragile X mental retardation-interacting
protein 1 (NUFIP1). Proteins in this family have been
implicated in the assembly of the large subunit of the
ribosome and in telomere maintenance. Some proteins in
this family contain a CCCH zinc finger. This family
contains a protein called human fragile X mental
retardation-interacting protein 1, which is known to
bind RNA and is phosphorylated upon DNA damage.
Length = 57
Score = 29.6 bits (67), Expect = 0.34
Identities = 10/30 (33%), Positives = 17/30 (56%), Gaps = 3/30 (10%)
Query: 429 KNERKKK---KKKKKKKKKKKKKKKKKKKK 455
ERKK K +KKKK ++++ K+ +
Sbjct: 28 IEERKKNYPTKANIEKKKKAEEERIKRGEL 57
>gnl|CDD|202819 pfam03939, Ribosomal_L23eN, Ribosomal protein L23, N-terminal
domain. The N-terminal domain appears to be specific to
the eukaryotic ribosomal proteins L25, L23, and L23a.
Length = 52
Score = 29.6 bits (67), Expect = 0.35
Identities = 12/30 (40%), Positives = 15/30 (50%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKKWSFRFQR 462
K K KK K KKK+K + S F+R
Sbjct: 3 AKALKAKKAVLKGVHGKKKRKIRTSVTFRR 32
>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain. This region has been
identified as the binding site of the SRP72 protein to
SRP RNA.
Length = 57
Score = 29.7 bits (67), Expect = 0.35
Identities = 13/26 (50%), Positives = 16/26 (61%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKK 453
K KKK ++KKKK KKK+K K
Sbjct: 5 PKKIAKKKPGTEQKKKKSAKKKRKPK 30
Score = 28.6 bits (64), Expect = 0.81
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
++ KKK ++KKKK KKK+K K
Sbjct: 6 KKIAKKKPGTEQKKKKSAKKKRKPK 30
Score = 28.6 bits (64), Expect = 1.1
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
KK KKK ++KKKK KKK+K
Sbjct: 5 PKKIAKKKPGTEQKKKKSAKKKRK 28
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 32.7 bits (75), Expect = 0.36
Identities = 14/43 (32%), Positives = 23/43 (53%)
Query: 412 HSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
S + S ++ K + K+KKK+KKK++KKKK+
Sbjct: 302 LFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQ 344
Score = 29.2 bits (66), Expect = 5.1
Identities = 14/27 (51%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K ER +K + K+KKK+KKK++KKK
Sbjct: 316 KFERDNEKLDAEVKEKKKEKKKEEKKK 342
Score = 28.9 bits (65), Expect = 6.5
Identities = 12/25 (48%), Positives = 20/25 (80%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +K + K+KKK+KKK++KKKK+
Sbjct: 320 DNEKLDAEVKEKKKEKKKEEKKKKQ 344
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 32.4 bits (74), Expect = 0.37
Identities = 14/51 (27%), Positives = 21/51 (41%), Gaps = 6/51 (11%)
Query: 411 AHSMTSSLKQNPYSSHVH------KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + K S+ V RK+K KK KK KK+K+K+
Sbjct: 198 LLKLNDNDKTAASSASVVPTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRS 248
>gnl|CDD|112890 pfam04094, DUF390, Protein of unknown function (DUF390). This is a
family of long proteins currently only found in the rice
genome. They have no known function. However they may be
some kind of transposable element.
Length = 843
Score = 32.9 bits (74), Expect = 0.37
Identities = 17/67 (25%), Positives = 25/67 (37%), Gaps = 7/67 (10%)
Query: 392 GMPPPPNIPIPAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKK 451
G PPP P GGGA +S + + + ERKKK+ K + +
Sbjct: 111 GTPPPS-------PPRGGGAVRASSRRPEGAAPTSQPEGERKKKRFCKMGEPEHAGGNLI 163
Query: 452 KKKKWSF 458
+ F
Sbjct: 164 RLPGGRF 170
>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
Length = 177
Score = 31.8 bits (73), Expect = 0.38
Identities = 15/57 (26%), Positives = 21/57 (36%), Gaps = 8/57 (14%)
Query: 407 MGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKK--------KKKKKKKKKKKKK 455
+ M S K+ ERK+KKK K K + K K + +KK
Sbjct: 6 SPKRSKGMAKSKKKTREELDAEARERKRKKKHKGLKSGSRHNEGNTQSKGKGQAQKK 62
Score = 31.4 bits (72), Expect = 0.60
Identities = 13/36 (36%), Positives = 16/36 (44%), Gaps = 12/36 (33%)
Query: 432 RKKKKKKKKKKKKKKKKKK------------KKKKK 455
+KKK K+ K K KKK K+KKK
Sbjct: 1 KKKKSSPKRSKGMAKSKKKTREELDAEARERKRKKK 36
Score = 30.3 bits (69), Expect = 1.2
Identities = 12/37 (32%), Positives = 17/37 (45%), Gaps = 10/37 (27%)
Query: 429 KNERKKKKKKKKKKKKKK----------KKKKKKKKK 455
K + K+ K K KKK ++K+KKK K
Sbjct: 2 KKKSSPKRSKGMAKSKKKTREELDAEARERKRKKKHK 38
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 32.9 bits (75), Expect = 0.41
Identities = 13/42 (30%), Positives = 18/42 (42%)
Query: 413 SMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
+ + S + K + K +K KKK K KKKKK
Sbjct: 101 NNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKV 142
Score = 31.3 bits (71), Expect = 1.1
Identities = 13/41 (31%), Positives = 18/41 (43%)
Query: 415 TSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
++ KQ + + K K +K KKK K KKKK
Sbjct: 100 KNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKK 140
Score = 30.6 bits (69), Expect = 2.3
Identities = 14/42 (33%), Positives = 19/42 (45%)
Query: 416 SSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWS 457
S P + + + + K +K KKK K KKKKK S
Sbjct: 103 SKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLS 144
Score = 29.4 bits (66), Expect = 4.7
Identities = 12/34 (35%), Positives = 13/34 (38%)
Query: 422 PYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E+ KKK K KKKKK K
Sbjct: 114 LMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKD 147
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 32.9 bits (75), Expect = 0.41
Identities = 14/31 (45%), Positives = 21/31 (67%), Gaps = 1/31 (3%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQ 461
ERKKKK++K K+K+ KK K +K+ + Q
Sbjct: 19 ERKKKKEEKAKEKELKKLKAAQKEA-KAKLQ 48
Score = 30.3 bits (68), Expect = 2.7
Identities = 13/40 (32%), Positives = 23/40 (57%)
Query: 416 SSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
S + + + E ++KKKK++K K+K+ KK K +K
Sbjct: 2 SRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQK 41
>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 158 and 187 amino
acids in length. This family includes the PcfB protein.
Length = 137
Score = 31.1 bits (71), Expect = 0.48
Identities = 13/28 (46%), Positives = 19/28 (67%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKK 454
V K +K K+++ KKK++K K K KKK
Sbjct: 110 VRKKLKKFKEEQAAKKKERKDKVKNKKK 137
Score = 30.7 bits (70), Expect = 0.67
Identities = 12/27 (44%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + KK K+++ KKK++K K K KKK
Sbjct: 111 RKKLKKFKEEQAAKKKERKDKVKNKKK 137
>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
Length = 212
Score = 31.6 bits (72), Expect = 0.58
Identities = 17/57 (29%), Positives = 27/57 (47%), Gaps = 5/57 (8%)
Query: 399 IPIPAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
IP P A +T + + + ++ R+ K K KK+KKK+ K+KKK
Sbjct: 161 IPFQRTPKREK-ARVITKEER----AFNAYRTLRQAKLNAKFVGKKEKKKQAKEKKK 212
>gnl|CDD|191968 pfam08213, DUF1713, Mitochondrial domain of unknown function
(DUF1713). This domain is found at the C terminal end
of mitochondrial proteins of unknown function.
Length = 34
Score = 28.5 bits (64), Expect = 0.59
Identities = 9/23 (39%), Positives = 16/23 (69%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKK 454
R++K KK K KK +K+ + ++K
Sbjct: 9 RRRKMKKHKYKKLRKRTRALRRK 31
Score = 28.5 bits (64), Expect = 0.64
Identities = 9/24 (37%), Positives = 17/24 (70%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
RK+++K KK K KK +K+ + ++
Sbjct: 7 RKRRRKMKKHKYKKLRKRTRALRR 30
Score = 28.1 bits (63), Expect = 0.91
Identities = 8/27 (29%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +K+++K KK K KK +K+ + +
Sbjct: 3 DSVLRKRRRKMKKHKYKKLRKRTRALR 29
Score = 27.7 bits (62), Expect = 1.0
Identities = 8/24 (33%), Positives = 15/24 (62%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKKW 456
+K+++K KK K KK +K+ +
Sbjct: 6 LRKRRRKMKKHKYKKLRKRTRALR 29
Score = 27.7 bits (62), Expect = 1.2
Identities = 10/27 (37%), Positives = 17/27 (62%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKK 453
V + R+K KK K KK +K+ + ++K
Sbjct: 5 VLRKRRRKMKKHKYKKLRKRTRALRRK 31
Score = 27.3 bits (61), Expect = 1.6
Identities = 8/24 (33%), Positives = 17/24 (70%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
++++K KK K KK +K+ + ++K
Sbjct: 8 KRRRKMKKHKYKKLRKRTRALRRK 31
Score = 26.5 bits (59), Expect = 3.4
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 424 SSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
S + K RK KK K KK +K+ + ++K K
Sbjct: 3 DSVLRKRRRKMKKHKYKKLRKRTRALRRKLDK 34
>gnl|CDD|217970 pfam04220, YihI, Der GTPase activator (YihI). YihI activates the
GTPase activity of Der, a 50S ribosomal subunit
stability factor. The stimulation is specific to Der as
YihI does not stimulate the GTPase activity of Era or
ObgE. The interaction of YihI with Der requires only the
C-terminal 78 amino acids of YihI. A yihI deletion
mutant is viable and shows a shorter lag period, but the
same post-lag growth rate as a wild-type strain. yihI is
expressed during the lag period. Overexpression of yihI
inhibits cell growth and biogenesis of the 50S ribosomal
subunit. YihI is an unusual, highly hydrophilic protein
with an uneven distribution of charged residues,
resulting in an N-terminal region with high pI and a
C-terminal region with low pI.
Length = 169
Score = 31.2 bits (71), Expect = 0.59
Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 10/45 (22%)
Query: 419 KQNPYSSHVHKNERKKKKKKKKKK----------KKKKKKKKKKK 453
K+ Y ERK+KKK+K K +K+K +KK
Sbjct: 16 KKTRYELDQEARERKRKKKRKGLKSGSRHNEESESQKQKGAAQKK 60
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 32.0 bits (73), Expect = 0.61
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWS 457
K E ++++K++KKKK KK K+ +W
Sbjct: 57 KEEEVDEEEEKEEKKKKTKKVKETTTEWE 85
Score = 28.6 bits (64), Expect = 8.0
Identities = 12/33 (36%), Positives = 20/33 (60%), Gaps = 6/33 (18%)
Query: 429 KNERKKKKKKK------KKKKKKKKKKKKKKKK 455
K E ++K K +++K++KKKK KK K+
Sbjct: 47 KKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKE 79
Score = 28.6 bits (64), Expect = 8.2
Identities = 12/33 (36%), Positives = 22/33 (66%), Gaps = 6/33 (18%)
Query: 429 KNERKKKKKKKKKK------KKKKKKKKKKKKK 455
K E+K++++K K +++K++KKKK KK
Sbjct: 44 KEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKK 76
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 32.3 bits (74), Expect = 0.63
Identities = 10/27 (37%), Positives = 11/27 (40%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ KKK K K K KKK K
Sbjct: 72 SDIPKKKTKTAAKAAAAKAPAKKKLKD 98
Score = 31.9 bits (73), Expect = 0.69
Identities = 9/27 (33%), Positives = 10/27 (37%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K K K KKK K + KK
Sbjct: 80 KTAAKAAAAKAPAKKKLKDELDSSKKA 106
Score = 31.1 bits (71), Expect = 1.5
Identities = 10/29 (34%), Positives = 12/29 (41%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K K KKK K + KK +KK
Sbjct: 82 AAKAAAAKAPAKKKLKDELDSSKKAEKKN 110
Score = 30.7 bits (70), Expect = 1.6
Identities = 9/27 (33%), Positives = 11/27 (40%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+KK K K K KKK K +
Sbjct: 73 DIPKKKTKTAAKAAAAKAPAKKKLKDE 99
Score = 30.3 bits (69), Expect = 2.1
Identities = 8/27 (29%), Positives = 11/27 (40%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
++K K K K KKK K +
Sbjct: 74 IPKKKTKTAAKAAAAKAPAKKKLKDEL 100
Score = 30.3 bits (69), Expect = 2.3
Identities = 8/27 (29%), Positives = 10/27 (37%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + K K K KKK K +
Sbjct: 76 KKKTKTAAKAAAAKAPAKKKLKDELDS 102
Score = 30.3 bits (69), Expect = 2.3
Identities = 10/25 (40%), Positives = 10/25 (40%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E KKK K K K KKK
Sbjct: 71 ESDIPKKKTKTAAKAAAAKAPAKKK 95
Score = 30.3 bits (69), Expect = 2.5
Identities = 9/27 (33%), Positives = 9/27 (33%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
KKK K K K KKK
Sbjct: 70 TESDIPKKKTKTAAKAAAAKAPAKKKL 96
Score = 30.0 bits (68), Expect = 3.4
Identities = 9/61 (14%), Positives = 13/61 (21%)
Query: 395 PPPNIPIPAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
+ + M S K + K K KKK K +
Sbjct: 43 ESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDS 102
Query: 455 K 455
Sbjct: 103 S 103
Score = 29.6 bits (67), Expect = 4.4
Identities = 9/27 (33%), Positives = 11/27 (40%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K KKK K + KK +KK
Sbjct: 85 AAAAKAPAKKKLKDELDSSKKAEKKNA 111
>gnl|CDD|219607 pfam07854, DUF1646, Protein of unknown function (DUF1646). Some of
the members of this family are hypothetical bacterial
and archaeal proteins, but others are annotated as being
cation transporters expressed by the archaebacterium
Methanosarcina mazei.
Length = 347
Score = 32.0 bits (73), Expect = 0.63
Identities = 30/145 (20%), Positives = 58/145 (40%), Gaps = 26/145 (17%)
Query: 136 LVFLLVYFFGMSSYIWWVILS------FTWFLAAGLKWGNEAIASYSLYFHLAAWLIPTV 189
FLL+ G+SS + I++ L K L A L P
Sbjct: 102 FAFLLIVILGLSSSVISAIVAAIILAEIANALPLSRKSKVRLTVIACFAIGLGAALTPIG 161
Query: 190 KSVSVIIMSAVDGDTIAGICYVGNHNIHNLKYFVVFPLI-IYLILGIIFLFGGFISLFRI 248
+ +S I +S ++ D +F +F L+ IY+I G+ L G + F +
Sbjct: 162 EPLSTIAVSKLNAD-----------------FFYLFDLLGIYIIPGVAAL--GLYAAFSL 202
Query: 249 RNVIKNQGGISNRTKADKLEKLMIR 273
+ + + G+ +++++++IR
Sbjct: 203 KRLSYREPGVEVVEYEEEIKEVVIR 227
>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
MW N terminal. This domain is found in eukaryotes. This
domain is about 90 amino acids in length. This domain is
found associated with pfam00076. This domain is part of
U1 snRNP, which is the pre-mRNA binding protein of the
penta-snRNP spliceosome complex. It extends over a
distance of 180 A from its RNA binding domain, wraps
around the core domain of U1 snRNP consisting of the
seven Sm proteins and finally contacts U1-C, which is
crucial for 5'-splice-site recognition.
Length = 94
Score = 29.9 bits (68), Expect = 0.64
Identities = 9/26 (34%), Positives = 21/26 (80%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKKW 456
K++++K++KK+K +KK +++ K+W
Sbjct: 63 LEKREREKREKKEKLEKKLEEELKEW 88
Score = 28.4 bits (64), Expect = 2.1
Identities = 15/68 (22%), Positives = 26/68 (38%), Gaps = 22/68 (32%)
Query: 384 DYLPPLAHGMPPPPNIPIPAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKK 443
YL PP P + + K ER+K++KK+K +K
Sbjct: 42 QYLSEFKDYKDEPPPEPTE--------------TWLE--------KREREKREKKEKLEK 79
Query: 444 KKKKKKKK 451
K +++ K+
Sbjct: 80 KLEEELKE 87
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 31.9 bits (73), Expect = 0.64
Identities = 21/72 (29%), Positives = 31/72 (43%), Gaps = 15/72 (20%)
Query: 387 PPLAHGMPPPPNIPIPAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKK 446
PP P P PA +HS ++ ++ S K+K++ KK+
Sbjct: 321 PPDTSDSDPSTETPPPASL-----SHSPPAAFERPLALS----------PKRKREGDKKQ 365
Query: 447 KKKKKKKKKWSF 458
KKKK KK K +F
Sbjct: 366 KKKKSKKLKLTF 377
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 31.6 bits (72), Expect = 0.68
Identities = 11/36 (30%), Positives = 19/36 (52%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQR 462
+ K + +++ + K+ K KKK KKK F+R
Sbjct: 3 LLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKR 38
Score = 30.4 bits (69), Expect = 1.4
Identities = 12/35 (34%), Positives = 19/35 (54%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQRF 463
K ++ + K+ K KKK KKK+K + R + F
Sbjct: 8 KARQELAVQVAKQAKAKKKANKKKRKIYFKRAESF 42
Score = 30.0 bits (68), Expect = 1.9
Identities = 11/33 (33%), Positives = 16/33 (48%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQ 461
+ + K+ K KKK KKK+K K + F
Sbjct: 11 QELAVQVAKQAKAKKKANKKKRKIYFKRAESFV 43
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 31.5 bits (72), Expect = 0.69
Identities = 10/31 (32%), Positives = 22/31 (70%)
Query: 425 SHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +++ ++K+KKK K++KK K++K+K
Sbjct: 88 TPIYQYFEEEKEKKKAMSKEEKKAIKEEKEK 118
Score = 30.3 bits (69), Expect = 1.8
Identities = 11/35 (31%), Positives = 22/35 (62%)
Query: 426 HVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFRF 460
+ + E K+KKK K++KK K++K+K + + +
Sbjct: 91 YQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEPYGY 125
Score = 29.2 bits (66), Expect = 3.8
Identities = 10/32 (31%), Positives = 19/32 (59%)
Query: 423 YSSHVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
Y + E+KK K++KK K++K+K ++
Sbjct: 91 YQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
>gnl|CDD|216298 pfam01101, HMG14_17, HMG14 and HMG17.
Length = 93
Score = 29.8 bits (67), Expect = 0.72
Identities = 14/27 (51%), Positives = 14/27 (51%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E K KK KKKK K KK K K
Sbjct: 33 KPEPKPKKAAAKKKKPAVKGKKGAKGK 59
>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
Length = 405
Score = 31.7 bits (72), Expect = 0.73
Identities = 13/43 (30%), Positives = 21/43 (48%), Gaps = 5/43 (11%)
Query: 418 LKQNPYSSHVHK-----NERKKKKKKKKKKKKKKKKKKKKKKK 455
LK NPY+ + + K KK+K KK+K+ K++
Sbjct: 331 LKLNPYAKTARRMALLAEAARVKAKKEKLAKKRKQVSKEEAAA 373
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 31.7 bits (72), Expect = 0.77
Identities = 5/27 (18%), Positives = 8/27 (29%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K + K K + K +K
Sbjct: 218 KAAAEAKAAAAKAAAEAKAAAEKAAAA 244
Score = 31.7 bits (72), Expect = 0.79
Identities = 6/25 (24%), Positives = 8/25 (32%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E K K + K +K K
Sbjct: 222 EAKAAAAKAAAEAKAAAEKAAAAKA 246
Score = 30.5 bits (69), Expect = 1.7
Identities = 5/30 (16%), Positives = 9/30 (30%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKKWSFRFQR 462
K K + K +K K + +
Sbjct: 223 AKAAAAKAAAEAKAAAEKAAAAKAAEKAAA 252
>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease. The Slx4 protein is a
heteromeric structure-specific endonuclease found in
fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
substrates, particularly simple-Y, 5'-flap, or
replication fork structures by cleaving the strand
bearing the 5' non-homologous arm at the branch junction
and thus generating ligatable nicked products from
5'-flap or replication fork substrates.
Length = 627
Score = 31.9 bits (72), Expect = 0.77
Identities = 16/42 (38%), Positives = 22/42 (52%)
Query: 413 SMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
TS + +S +R KKKK KK K +K+ K+K KK
Sbjct: 51 RFTSFESDDQANSGNVSGKRVPKKKKIKKPKLRKRTKRKNKK 92
Score = 30.3 bits (68), Expect = 2.4
Identities = 12/32 (37%), Positives = 16/32 (50%)
Query: 424 SSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
N K+ KKKK KK K +K+ K+K
Sbjct: 59 DQANSGNVSGKRVPKKKKIKKPKLRKRTKRKN 90
>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
structure and biogenesis].
Length = 67
Score = 29.2 bits (66), Expect = 0.77
Identities = 9/28 (32%), Positives = 18/28 (64%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKK 454
+ + E +K +K+K+KK +K+K K+
Sbjct: 32 MKEREFYEKPSEKRKRKKAAARKRKFKR 59
Score = 28.1 bits (63), Expect = 1.6
Identities = 8/24 (33%), Positives = 19/24 (79%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKK 454
E++K+KK +K+K K+ +K++++
Sbjct: 43 EKRKRKKAAARKRKFKRLRKEQQR 66
Score = 28.1 bits (63), Expect = 1.8
Identities = 9/25 (36%), Positives = 18/25 (72%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E+ +K+K+KK +K+K K+ +K+
Sbjct: 39 EKPSEKRKRKKAAARKRKFKRLRKE 63
Score = 27.7 bits (62), Expect = 2.4
Identities = 8/24 (33%), Positives = 18/24 (75%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
K+K+KK +K+K K+ +K++++
Sbjct: 43 EKRKRKKAAARKRKFKRLRKEQQR 66
Score = 27.7 bits (62), Expect = 2.8
Identities = 11/35 (31%), Positives = 21/35 (60%), Gaps = 3/35 (8%)
Query: 431 ERKKK---KKKKKKKKKKKKKKKKKKKKWSFRFQR 462
E K++ +K +K+K+KK +K+K K + Q+
Sbjct: 31 EMKEREFYEKPSEKRKRKKAAARKRKFKRLRKEQQ 65
>gnl|CDD|216554 pfam01529, zf-DHHC, DHHC palmitoyltransferase. This family
includes the well known DHHC zinc binding domain as well
as three of the four conserved transmembrane regions
found in this family of palmitoyltransferase enzymes.
Length = 167
Score = 30.9 bits (70), Expect = 0.84
Identities = 19/88 (21%), Positives = 39/88 (44%), Gaps = 15/88 (17%)
Query: 207 GICYVGNHNIHNLKYFVVFPLIIYLILGIIFLFGGFISLFRIRNVIKNQGGISNRTKADK 266
C +G N KYF++F ++YL L +I L +S + + +I+N +
Sbjct: 76 NNC-IGRRNH---KYFLLF--LLYLTLYLILLL--VLSFYYLVYLIRNI-------ELFF 120
Query: 267 LEKLMIRIGIFSVLSTLPASIVIGCLIY 294
L + I ++ +L + + L++
Sbjct: 121 FLILSLFSSIILLVLSLFFLLFLSFLLF 148
Score = 27.8 bits (62), Expect = 7.7
Identities = 8/48 (16%), Positives = 17/48 (35%)
Query: 208 ICYVGNHNIHNLKYFVVFPLIIYLILGIIFLFGGFISLFRIRNVIKNQ 255
L + +I+ ++ LF F+ F + ++KN
Sbjct: 111 YLIRNIELFFFLILSLFSSIILLVLSLFFLLFLSFLLFFHLYLILKNI 158
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 31.9 bits (72), Expect = 0.86
Identities = 8/43 (18%), Positives = 21/43 (48%), Gaps = 2/43 (4%)
Query: 413 SMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
T +L + + + + ++ K K++ KK + + K+K +
Sbjct: 181 EETENLAEL--IIDLEELKLQELKLKEQAKKALEYYQLKEKLE 221
Score = 30.3 bits (68), Expect = 2.5
Identities = 11/37 (29%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKK--KKWSFRFQRF 463
K ER K + KK++ +++KK+ ++ ++ RF+ F
Sbjct: 987 KEERYNKDELKKERLEEEKKELLREIIEETCQRFKEF 1023
>gnl|CDD|218842 pfam05987, DUF898, Bacterial protein of unknown function (DUF898).
This family consists of several bacterial proteins of
unknown function. Some of the family members are
described as putative membrane proteins.
Length = 339
Score = 31.4 bits (72), Expect = 0.90
Identities = 14/87 (16%), Positives = 30/87 (34%), Gaps = 6/87 (6%)
Query: 175 YSLYFHLAAWLIPTVKSVSVIIMSAVDGDTIAGICYVGNHNIHNLKYFVVFPLIIYLILG 234
Y Y + + +V ++++ G G V ++ YL+L
Sbjct: 183 YKAYLLALLVALLFL-AVLGLLLALAAGALAGAAPLGGFGL-----AAVGAIILGYLVLF 236
Query: 235 IIFLFGGFISLFRIRNVIKNQGGISNR 261
+ +L R+RN++ N +
Sbjct: 237 VGYLVAAAYYRARLRNLVWNNTTLDGE 263
>gnl|CDD|222466 pfam13945, NST1, Splicing factor, salt tolerance regulator. NST1
is a family of proteins that seem to be involved,
directly or indirectly, in the salt sensitivity of some
cellular functions in yeast. These proteins also
interact with the splicing factor Msl1p.
Length = 189
Score = 31.1 bits (70), Expect = 0.91
Identities = 10/33 (30%), Positives = 16/33 (48%)
Query: 416 SSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKK 448
S + ++ +H + K KKKKKK+ K
Sbjct: 13 SQQNTDELNTVIHNDSSSSKSKKKKKKRSKATS 45
Score = 29.5 bits (66), Expect = 2.8
Identities = 10/34 (29%), Positives = 17/34 (50%)
Query: 412 HSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKK 445
HS ++ + N + + + KKKKKK+ K
Sbjct: 12 HSQQNTDELNTVIHNDSSSSKSKKKKKKRSKATS 45
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 31.5 bits (72), Expect = 0.92
Identities = 13/44 (29%), Positives = 21/44 (47%), Gaps = 12/44 (27%)
Query: 428 HKNERKKKKKK------KKKKKKK------KKKKKKKKKKWSFR 459
+ + KK K+ K+K +KK KK+ + K+K S R
Sbjct: 257 FGSRKGKKDKEGKSTTNKEKARKKNFMMTLHKKRVRGKQKRSLR 300
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 31.2 bits (71), Expect = 0.93
Identities = 9/31 (29%), Positives = 19/31 (61%)
Query: 425 SHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + + +K++KK K++KK K +K+K
Sbjct: 88 TQMFAYFKAQKEEKKAMSKEEKKAIKAEKEK 118
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 31.3 bits (71), Expect = 0.96
Identities = 9/27 (33%), Positives = 14/27 (51%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKK 454
+ E + K K + KKK + KKK +
Sbjct: 150 KQAEEEAKAKAAAEAKKKAAEAKKKAE 176
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 31.3 bits (71), Expect = 0.97
Identities = 12/42 (28%), Positives = 24/42 (57%)
Query: 413 SMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
S++ L ++E ++ K++ +KK+K KKK +K+K
Sbjct: 327 SLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRK 368
Score = 31.3 bits (71), Expect = 1.1
Identities = 12/35 (34%), Positives = 23/35 (65%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQR 462
+++ + K++ +KK+K KKK +K+K S F+R
Sbjct: 344 DEDDEESKEEVEKKQKVKKKPRKRKVNPVSNNFRR 378
Score = 30.9 bits (70), Expect = 1.3
Identities = 10/40 (25%), Positives = 18/40 (45%)
Query: 414 MTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKK 453
+ +E K++ +KK+K KKK +K+K
Sbjct: 331 FLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370
Score = 30.5 bits (69), Expect = 1.7
Identities = 9/27 (33%), Positives = 18/27 (66%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ ++ K++ +KK+K KKK +K+K
Sbjct: 342 DEDEDDEESKEEVEKKQKVKKKPRKRK 368
Score = 29.4 bits (66), Expect = 3.9
Identities = 9/33 (27%), Positives = 21/33 (63%)
Query: 423 YSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
Y + +++ + ++ K++ +KK+K KKK +K
Sbjct: 334 YMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRK 366
Score = 29.4 bits (66), Expect = 4.6
Identities = 9/28 (32%), Positives = 18/28 (64%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ ++ K++ +KK+K KKK +K+K
Sbjct: 343 EDEDDEESKEEVEKKQKVKKKPRKRKVN 370
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 31.3 bits (71), Expect = 1.0
Identities = 14/41 (34%), Positives = 23/41 (56%)
Query: 415 TSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+SLK+NP + KKK K+KK+KK+ + K++
Sbjct: 204 AASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEES 244
Score = 29.8 bits (67), Expect = 2.9
Identities = 14/89 (15%), Positives = 21/89 (23%), Gaps = 2/89 (2%)
Query: 370 VLINTNKHRVLKQHDYLPPLAHGMPPPPNIPIPAVPVM-GGGAHSMTSSLKQNPYSSHVH 428
+ N N R A P S +K +
Sbjct: 129 PITNPNVKRRTGVGLPPVAPAASPALKPTANGKRPSSKPPKSIMSPEVKVKSAK-KTQDT 187
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWS 457
E +K + K K K+ KK +
Sbjct: 188 SKETTTEKTEGKTSVKAASLKRNPPKKSN 216
>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein. The PcfK-like protein family
includes the Enterococcus faecalis PcfK protein, which
is functionally uncharacterized. This family of proteins
is found in bacteria and viruses. Proteins in this
family are typically between 137 and 257 amino acids in
length. There are two completely conserved residues (D
and L) that may be functionally important.
Length = 136
Score = 30.0 bits (68), Expect = 1.0
Identities = 6/26 (23%), Positives = 15/26 (57%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKK 452
+ ++++ +K +K+ KK KK +
Sbjct: 100 KEALKAYQQEELRKIQKRSKKSKKAE 125
Score = 28.9 bits (65), Expect = 3.0
Identities = 8/24 (33%), Positives = 16/24 (66%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
RK+ K ++++ +K +K+ KK K
Sbjct: 99 RKEALKAYQQEELRKIQKRSKKSK 122
Score = 28.9 bits (65), Expect = 3.1
Identities = 8/27 (29%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E K ++++ +K +K+ KK KK +
Sbjct: 99 RKEALKAYQQEELRKIQKRSKKSKKAE 125
Score = 28.5 bits (64), Expect = 4.0
Identities = 8/27 (29%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E +K+ K ++++ +K +K+ KK
Sbjct: 95 KAEARKEALKAYQQEELRKIQKRSKKS 121
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 31.8 bits (72), Expect = 1.1
Identities = 15/23 (65%), Positives = 19/23 (82%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKK 454
RKKKKKKK K+KKKKK ++ K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 30.2 bits (68), Expect = 2.7
Identities = 14/24 (58%), Positives = 18/24 (75%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
+KKKKKKK K+KKKKK ++ K
Sbjct: 682 GSRKKKKKKKSKEKKKKKNREASK 705
Score = 30.2 bits (68), Expect = 2.9
Identities = 14/23 (60%), Positives = 19/23 (82%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKK 455
+KKKKKKK K+KKKKK ++ K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 29.1 bits (65), Expect = 5.9
Identities = 14/21 (66%), Positives = 18/21 (85%)
Query: 437 KKKKKKKKKKKKKKKKKKKWS 457
+KKKKKKK K+KKKKK ++ S
Sbjct: 684 RKKKKKKKSKEKKKKKNREAS 704
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 30.5 bits (69), Expect = 1.1
Identities = 9/42 (21%), Positives = 22/42 (52%)
Query: 413 SMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
++SS + + +E +++ + K + K+KKK++ K
Sbjct: 90 ELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPK 131
Score = 29.3 bits (66), Expect = 2.4
Identities = 8/32 (25%), Positives = 18/32 (56%)
Query: 424 SSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ + + + K + K+KKK++ K K +K+
Sbjct: 106 TDETEQEDPPETKTESKEKKKREVPKPKTEKE 137
Score = 29.3 bits (66), Expect = 2.4
Identities = 9/27 (33%), Positives = 18/27 (66%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
++ + K + K+KKK++ K K +K+K
Sbjct: 112 EDPPETKTESKEKKKREVPKPKTEKEK 138
Score = 28.2 bits (63), Expect = 5.7
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +R+ K K +K+K K + KK K K
Sbjct: 124 KKKREVPKPKTEKEKPKTEPKKPKPSK 150
>gnl|CDD|193414 pfam12941, HCV_NS5a_C, HCV NS5a protein C-terminal region. This is
a family of proteins found in the hepatitis C virus.
This family contains the C-terminal region of the NS5A
protein. CC The molecular function of the non-structural
5a protein is uncertain. The NS5a protein is
phosphorylated when expressed in mammalian cells. It is
thought to interact with the ds RNA dependent
(interferon inducible) kinase PKR.
Length = 244
Score = 30.9 bits (70), Expect = 1.1
Identities = 11/24 (45%), Positives = 13/24 (54%), Gaps = 2/24 (8%)
Query: 381 KQHDYLPPLAHG--MPPPPNIPIP 402
K DY PP HG +PP P+P
Sbjct: 125 KAPDYEPPTVHGCALPPAGLPPVP 148
>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain. This presumed domain is
found at the C-terminus of a family of FtsJ-like
methyltransferases. Members of this family are involved
in 60S ribosomal biogenesis.
Length = 212
Score = 30.7 bits (70), Expect = 1.1
Identities = 16/27 (59%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ER+K K+ KK KK KKKKKK+KK
Sbjct: 144 ISEREKAKQIKKLYKKAKKKKKKRKKT 170
Score = 30.3 bits (69), Expect = 1.5
Identities = 14/29 (48%), Positives = 17/29 (58%), Gaps = 5/29 (17%)
Query: 432 RKKKKKKKKKKK-----KKKKKKKKKKKK 455
+K KKKKKK+KK KK K K + K
Sbjct: 158 KKAKKKKKKRKKTYVVAKKGNKGKAGRPK 186
Score = 30.0 bits (68), Expect = 2.1
Identities = 13/36 (36%), Positives = 19/36 (52%), Gaps = 4/36 (11%)
Query: 429 KNERKKKKKKKK----KKKKKKKKKKKKKKKWSFRF 460
K ++KKKK+KK KK K K + K K ++
Sbjct: 159 KAKKKKKKRKKTYVVAKKGNKGKAGRPKGVKGKYKV 194
Score = 28.8 bits (65), Expect = 5.6
Identities = 13/30 (43%), Positives = 16/30 (53%), Gaps = 4/30 (13%)
Query: 429 KNERKKKKKKKKK----KKKKKKKKKKKKK 454
K +KKKKK+KK KK K K + K
Sbjct: 158 KKAKKKKKKRKKTYVVAKKGNKGKAGRPKG 187
Score = 28.0 bits (63), Expect = 7.9
Identities = 11/32 (34%), Positives = 15/32 (46%), Gaps = 4/32 (12%)
Query: 427 VHKNERKKKKK----KKKKKKKKKKKKKKKKK 454
K ++K+KK KK K K + K K K
Sbjct: 160 AKKKKKKRKKTYVVAKKGNKGKAGRPKGVKGK 191
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 30.9 bits (70), Expect = 1.1
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 422 PYSSHVHKNERKKKKKKKKKKKKKKKKKK 450
P + +K ++ KK KK KKKKKKK+
Sbjct: 189 PLYTWHYKGDKSSAKKDKKSKKKKKKKRS 217
Score = 29.0 bits (65), Expect = 4.9
Identities = 13/24 (54%), Positives = 14/24 (58%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKK 451
H K KK KK KKKKKKK+
Sbjct: 194 HYKGDKSSAKKDKKSKKKKKKKRS 217
Score = 28.2 bits (63), Expect = 7.7
Identities = 13/24 (54%), Positives = 14/24 (58%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKK 454
K K KK KK KKKKKKK+
Sbjct: 194 HYKGDKSSAKKDKKSKKKKKKKRS 217
Score = 27.8 bits (62), Expect = 8.9
Identities = 12/25 (48%), Positives = 14/25 (56%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKK 452
+ K KK KK KKKKKKK+
Sbjct: 193 WHYKGDKSSAKKDKKSKKKKKKKRS 217
Score = 27.8 bits (62), Expect = 9.3
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKK 453
+ + K KK KK KKKKKKK+
Sbjct: 194 HYKGDKSSAKKDKKSKKKKKKKRS 217
>gnl|CDD|234534 TIGR04286, MSEP-CTERM, MSEP-CTERM protein. Members of this protein
family average over 900 residues in length and appear to
have multiple membrane-spanning helices in the
N-terminal half. The extreme C-terminal region consists
of a motif with consensus sequence MSEP, then a
transmembrane alpha helix, then a short region with
several basic residues. This region, hereby dubbed
MSEP-CTERM, resembles other putative sorting signals
associated with the archaeosortase/exosortase protein
family (see TIGR04178). Genes for all members of this
family are found next to a gene for exosortase K.
Length = 920
Score = 31.6 bits (72), Expect = 1.2
Identities = 32/184 (17%), Positives = 62/184 (33%), Gaps = 36/184 (19%)
Query: 134 CTLVFLLVYFF-------GMSSYIW---WVILSFTWFLAAGLKWGNEAIASYSLYFHLAA 183
+ +FL++ F +S VIL F GL N + +
Sbjct: 185 ASFLFLILRFLYILITRRNLSFKNPELLIVILVGIIFPVLGLALNNNFGGIFGNFSSPWF 244
Query: 184 WLIPTVKSVSVIIMSAVDGD-------TIAGICYVGNHNIHNLKYFVVF-PLIIYLILGI 235
++I + + +++ + + L +F+VF PL+ + I
Sbjct: 245 YIIAILNGLLLLLPL-QKNVLLRLILFYGKSAGFP-----YILYFFIVFLPLLPLGFIAI 298
Query: 236 IFLFGGF-------ISLFRIRNVIKNQGGISNRTKADKLEKLMIRIGIFSVLSTLPASIV 288
I GF + ++R + + + + ++I GI +L TLP S
Sbjct: 299 IIYGLGFLMLTPAVAFILQLRTISREFKTLKKTYG----KPILILCGIAGLL-TLPVSFT 353
Query: 289 IGCL 292
C
Sbjct: 354 GNCR 357
>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
(ETRAMP). These sequences represent a family of
proteins from the malaria parasite Plasmodium
falciparum, several of which have been shown to be
expressed specifically in the ring stage as well as the
rodent parasite Plasmodium yoelii. A homologue from
Plasmodium chabaudi was localized to the parasitophorous
vacuole membrane. Members have an initial hydrophobic,
Phe/Tyr-rich, stretch long enough to span the membrane,
a highly charged region rich in Lys, a second putative
transmembrane region and a second highly charged, low
complexity sequence region. Some members have up to 100
residues of additional C-terminal sequence. These genes
have been shown to be found in the sub-telomeric regions
of both Plasmodium falciparum and P. yoelii chromosomes.
Length = 82
Score = 29.1 bits (66), Expect = 1.2
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 423 YSSHVHKNERKKKKKKKKKKKKKKKKKKKKK 453
+++V ++K KK KK +KKKK KKK
Sbjct: 23 LNNNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53
Score = 28.0 bits (63), Expect = 2.3
Identities = 14/33 (42%), Positives = 19/33 (57%)
Query: 422 PYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
P ++ ++KK KK KK +KKKK KKK
Sbjct: 21 PGLNNNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53
Score = 27.2 bits (61), Expect = 5.2
Identities = 14/32 (43%), Positives = 18/32 (56%)
Query: 424 SSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ ++ N KKKK KK KK +KKKK K
Sbjct: 20 APGLNNNVVAKKKKLKKLKKIDDDLEKKKKNK 51
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 31.0 bits (70), Expect = 1.2
Identities = 13/27 (48%), Positives = 13/27 (48%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K K KK KK KK KK K KK
Sbjct: 243 KTAAKAAKKAKKTAKKALKKAAKAVKK 269
Score = 30.2 bits (68), Expect = 2.3
Identities = 13/37 (35%), Positives = 13/37 (35%)
Query: 419 KQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K S K K K KK KKK K KK
Sbjct: 204 KTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKK 240
Score = 29.9 bits (67), Expect = 2.8
Identities = 12/27 (44%), Positives = 13/27 (48%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +K KK KK K KK KK K
Sbjct: 250 KKAKKTAKKALKKAAKAVKKAAKKAAK 276
Score = 29.5 bits (66), Expect = 3.7
Identities = 13/27 (48%), Positives = 14/27 (51%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +K KK KK KK K KK KK
Sbjct: 247 KAAKKAKKTAKKALKKAAKAVKKAAKK 273
Score = 29.5 bits (66), Expect = 3.9
Identities = 10/27 (37%), Positives = 11/27 (40%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +K K KK KK K K K
Sbjct: 258 KALKKAAKAVKKAAKKAAKAAAKAAKG 284
Score = 29.5 bits (66), Expect = 4.2
Identities = 14/41 (34%), Positives = 17/41 (41%)
Query: 415 TSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
S K+ ++ K K K KK KK KK KK K
Sbjct: 225 KVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAK 265
>gnl|CDD|227507 COG5180, PBP1, Protein interacting with poly(A)-binding protein
[RNA processing and modification].
Length = 654
Score = 31.2 bits (70), Expect = 1.2
Identities = 14/85 (16%), Positives = 34/85 (40%), Gaps = 3/85 (3%)
Query: 381 KQHDYLPPLAHGMP--PPPNIPIPAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKK 438
K++ P A P + + + G + + K S+ + + K +
Sbjct: 247 KENRQEKPAAAKQPHHMDDDGTKRKMVIEIEGLSLLENR-KPEAVSAPEAVSPQSKSEGP 305
Query: 439 KKKKKKKKKKKKKKKKKWSFRFQRF 463
++K+K+ K+KK + ++ +F
Sbjct: 306 SSGQEKEKQIKEKKSFSYGWKHTKF 330
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 31.1 bits (71), Expect = 1.2
Identities = 10/33 (30%), Positives = 22/33 (66%), Gaps = 8/33 (24%)
Query: 431 ERKKKKKKKKKKKKK--------KKKKKKKKKK 455
E+K++KKK++++ K +K ++K++KK
Sbjct: 286 EKKEEKKKEEREAKLAKLSPEEQRKLEEKERKK 318
Score = 30.3 bits (69), Expect = 2.3
Identities = 10/35 (28%), Positives = 22/35 (62%), Gaps = 8/35 (22%)
Query: 429 KNERKKKKKKKKK--------KKKKKKKKKKKKKK 455
K E+KK++++ K +K ++K++KK+ +K
Sbjct: 288 KEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
Score = 29.9 bits (68), Expect = 2.8
Identities = 9/29 (31%), Positives = 21/29 (72%)
Query: 426 HVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
+ ER+++ ++KK++KKK++++ K K
Sbjct: 274 KAAEEERQEEAQEKKEEKKKEEREAKLAK 302
>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
structure and biogenesis].
Length = 107
Score = 29.5 bits (66), Expect = 1.2
Identities = 13/24 (54%), Positives = 14/24 (58%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
KK +K K K KKK KKKKK
Sbjct: 10 EKKAEKAKAGTAKDKKKWSKKKKK 33
Score = 28.3 bits (63), Expect = 2.7
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKKWS 457
K + K+KK +K K K KKKWS
Sbjct: 4 KSQLSKEKKAEKAKAGTAKDKKKWS 28
Score = 28.3 bits (63), Expect = 3.1
Identities = 13/24 (54%), Positives = 15/24 (62%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKK 454
E+K +K K K KKK KKKKK
Sbjct: 10 EKKAEKAKAGTAKDKKKWSKKKKK 33
Score = 28.3 bits (63), Expect = 3.5
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K E+ K K KKK KKKKK++ +
Sbjct: 12 KAEKAKAGTAKDKKKWSKKKKKEEAR 37
Score = 28.0 bits (62), Expect = 4.0
Identities = 12/25 (48%), Positives = 14/25 (56%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
K+KK +K K K KKK KKK
Sbjct: 7 LSKEKKAEKAKAGTAKDKKKWSKKK 31
Score = 28.0 bits (62), Expect = 4.2
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 425 SHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
S + K ++ +K K K KKK KKKKK++
Sbjct: 5 SQLSKEKKAEKAKAGTAKDKKKWSKKKKKEE 35
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 30.8 bits (70), Expect = 1.3
Identities = 21/75 (28%), Positives = 27/75 (36%), Gaps = 22/75 (29%)
Query: 387 PPLAHGMPPPPNI-------PIPAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKK 439
PP A PP P + PIP P + K E K K K K
Sbjct: 60 PPQAVQPPPEPVVEPEPEPEPIPEPPK---------------EAPVVIPKPEPKPKPKPK 104
Query: 440 KKKKKKKKKKKKKKK 454
K K KK +++ K+
Sbjct: 105 PKPKPVKKVEEQPKR 119
>gnl|CDD|238175 cd00283, GIY-YIG_Cterm, GIYX(10-11)YIG family of class I homing
endonucleases C-terminus (GIY-YIG_Cterm). Homing
endonucleases promote the mobility of intron or intein
by recognizing and cleaving a homologous allele that
lacks the sequence. They catalyze a double-strand break
in the DNA near the insertion site of that element to
facilitate homing at that site. Class I homing
endonucleases are sorted into four families based on the
presence of these motifs in their respective N-termini:
LAGLIDADG, His-Cys box, HNH, and GIY-YIG. This CD
contains several but not all members of the GIY-YIG
family. The C-terminus of GIY-YIG is a DNA-binding
domain which is separated from the N-terminus by a long,
flexible linker. The DNA-binding domain consists of a
minor-groove binding alpha-helix, and a
helix-turn-helix. Some also contain a zinc finger (i.e.
I-TevI) which is not required for DNA binding or
catalysis, but is a component of the linker and directs
the catalytic domain to cleave the homing site at a
fixed distance from the intron insertion site.
Length = 113
Score = 29.4 bits (66), Expect = 1.3
Identities = 15/39 (38%), Positives = 18/39 (46%)
Query: 413 SMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKK 451
S S + NP H +E K K KKK +K KKK
Sbjct: 31 SKNRSGENNPMFGKKHSDETKAKISAKKKGRKNLNAKKK 69
>gnl|CDD|226613 COG4128, Zot, Zonula occludens toxin [General function prediction
only].
Length = 398
Score = 31.1 bits (70), Expect = 1.3
Identities = 10/29 (34%), Positives = 13/29 (44%), Gaps = 2/29 (6%)
Query: 136 LVFLLVYFFGMSSYIWWVILSFTWFLAAG 164
+ FLLV F M ++I SF G
Sbjct: 226 VRFLLVLFPLMFIGSSYLIYSF--PSTFG 252
>gnl|CDD|192537 pfam10329, DUF2417, Region of unknown function (DUF2417). This is
a region of a family of proteins conserved in fungi some
of whose members also have the Abhydrolase_1, pfam00561,
domain in their sequence. The function of this region is
not known.
Length = 227
Score = 30.3 bits (69), Expect = 1.5
Identities = 11/37 (29%), Positives = 19/37 (51%)
Query: 214 HNIHNLKYFVVFPLIIYLILGIIFLFGGFISLFRIRN 250
+I L++ V L+I I +I L F+S+ + N
Sbjct: 28 WSIRALRWISVILLLINFIWFVILLVSDFVSIPGLNN 64
>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
Length = 132
Score = 29.6 bits (67), Expect = 1.5
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
RK+ K ++KK + K+K K KKK
Sbjct: 109 RKELKNRRKKVRGKEKTKVSGKKK 132
Score = 29.2 bits (66), Expect = 2.1
Identities = 10/26 (38%), Positives = 16/26 (61%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K RK +K+ K ++KK + K+K K
Sbjct: 103 KAGRKARKELKNRRKKVRGKEKTKVS 128
Score = 28.8 bits (65), Expect = 2.7
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKK 452
+ E K ++KK + K+K K KKK
Sbjct: 109 RKELKNRRKKVRGKEKTKVSGKKK 132
Score = 28.8 bits (65), Expect = 2.9
Identities = 11/21 (52%), Positives = 13/21 (61%)
Query: 428 HKNERKKKKKKKKKKKKKKKK 448
KN RKK + K+K K KKK
Sbjct: 112 LKNRRKKVRGKEKTKVSGKKK 132
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 31.2 bits (71), Expect = 1.5
Identities = 14/41 (34%), Positives = 16/41 (39%)
Query: 415 TSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
LK + K KKK+KKKKK K KK
Sbjct: 1155 EQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVV 1195
Score = 30.4 bits (69), Expect = 2.3
Identities = 14/27 (51%), Positives = 17/27 (62%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K +R K K K K K +K K KKK+KK
Sbjct: 1154 KEQRLKSKTKGKASKLRKPKLKKKEKK 1180
Score = 29.6 bits (67), Expect = 4.2
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ ++ K K K K K +K K KKK+
Sbjct: 1152 IAKEQRLKSKTKGKASKLRKPKLKKKE 1178
Score = 29.6 bits (67), Expect = 5.0
Identities = 15/44 (34%), Positives = 19/44 (43%)
Query: 419 KQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQR 462
+ + RK K KKK+KKKKK K KK +R
Sbjct: 1157 RLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKR 1200
Score = 28.9 bits (65), Expect = 7.2
Identities = 11/27 (40%), Positives = 15/27 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K K+++ K K K K K +K K KK
Sbjct: 1150 KEIAKEQRLKSKTKGKASKLRKPKLKK 1176
Score = 28.5 bits (64), Expect = 9.8
Identities = 9/33 (27%), Positives = 15/33 (45%)
Query: 423 YSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
E++ K+++ K K K K K +K K
Sbjct: 1141 LEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPK 1173
>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694). Family of
hypothetical bacterial proteins.
Length = 278
Score = 30.7 bits (70), Expect = 1.5
Identities = 14/26 (53%), Positives = 22/26 (84%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKKW 456
+R ++ K+KKKKKKKKKKK++ ++W
Sbjct: 80 DRAEELLKEKKKKKKKKKKKEELREW 105
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 30.5 bits (69), Expect = 1.5
Identities = 10/23 (43%), Positives = 17/23 (73%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKK 451
K E +KK+K+KK++ K +KK+
Sbjct: 157 KLELLRKKEKEKKEQLKIQKKQS 179
Score = 29.3 bits (66), Expect = 4.1
Identities = 10/24 (41%), Positives = 17/24 (70%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
RKK+K+KK++ K +KK+ +K
Sbjct: 162 RKKEKEKKEQLKIQKKQSLLEKYG 185
Score = 28.9 bits (65), Expect = 4.6
Identities = 11/30 (36%), Positives = 20/30 (66%)
Query: 426 HVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
H+ N K + +KK+K+KK++ K +KK+
Sbjct: 150 HLQANPTKLELLRKKEKEKKEQLKIQKKQS 179
>gnl|CDD|100088 cd03086, PGM3, PGM3 (phosphoglucomutase 3), also known as PAGM
(phosphoacetylglucosamine mutase) and AGM1
(N-acetylglucosamine-phosphate mutase), is an essential
enzyme found in eukaryotes that reversibly catalyzes the
conversion of GlcNAc-6-phosphate into GlcNAc-1-phosphate
as part of the UDP-N-acetylglucosamine (UDP-GlcNAc)
biosynthetic pathway. UDP-GlcNAc is an essential
metabolite that serves as the biosynthetic precursor of
many glycoproteins and mucopolysaccharides. AGM1 is a
member of the alpha-D-phosphohexomutase superfamily,
which catalyzes the intramolecular phosphoryl transfer
of sugar substrates. The alpha-D-phosphohexomutases have
four domains with a centrally located active site formed
by four loops, one from each domain. All four domains
are included in this alignment model.
Length = 513
Score = 30.6 bits (70), Expect = 1.6
Identities = 13/34 (38%), Positives = 20/34 (58%), Gaps = 1/34 (2%)
Query: 261 RTKADKLEKLMIRIGIFSVL-STLPASIVIGCLI 293
RTKA+ L+ ++ R+GI + L S IG +I
Sbjct: 8 RTKAELLDSVVFRVGILAALRSKKLGGKTIGVMI 41
>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
This family consists of several eukaryotic ribosome
biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
protein that is essential for the maturation of 25 S
rRNA and the 60 S ribosomal subunit assembly in
Saccharomyces cerevisiae.
Length = 164
Score = 29.9 bits (68), Expect = 1.7
Identities = 9/27 (33%), Positives = 19/27 (70%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
++ KK+++KK++ K +K++ K KK
Sbjct: 138 EDPFAKKRREKKERVAKNEKRELKNKK 164
Score = 29.2 bits (66), Expect = 2.9
Identities = 10/32 (31%), Positives = 17/32 (53%)
Query: 419 KQNPYSSHVHKNERKKKKKKKKKKKKKKKKKK 450
+P K R+KK++ K +K++ K KK
Sbjct: 133 NDDPKEDPFAKKRREKKERVAKNEKRELKNKK 164
>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
Length = 822
Score = 30.7 bits (70), Expect = 1.7
Identities = 9/35 (25%), Positives = 18/35 (51%)
Query: 421 NPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
NP ++ K +++K+ K + + K K +K K
Sbjct: 759 NPKCNYTEKQKKEKESKSELEALKGVGAKTAEKLK 793
>gnl|CDD|216985 pfam02349, MSG, Major surface glycoprotein. This is a novel repeat
in Pneumocystis carinii Major surface glycoprotein (MSG)
some members of the alignment have up to nine repeats of
this family, the repeats containing several conserved
cysteines. The MSG of P. carinii is an important protein
in host-pathogen interactions. Surface glycoprotein A
from Pneumocystis carinii is a main target for the host
immune system, this protein is implicated in the
attachment of Pneumocystis carinii to the host alveolar
epithelial cells, alveolar macrophages, host surfactant
and possibly accounts in part for the hypoxia seen in
Pneumocystis carinii pneumonia (PCP).
Length = 81
Score = 28.6 bits (64), Expect = 1.8
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKKWS 457
K+ K + KKKKK KK K+K K
Sbjct: 52 KELKGELKKKKKCKKALKEKCTKLK 76
Score = 27.4 bits (61), Expect = 4.6
Identities = 14/30 (46%), Positives = 17/30 (56%)
Query: 426 HVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E KKKKK KK K+K K KK+ +
Sbjct: 52 KELKGELKKKKKCKKALKEKCTKLKKESNE 81
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 31.0 bits (70), Expect = 1.8
Identities = 9/27 (33%), Positives = 12/27 (44%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E KK ++ KK KK KK
Sbjct: 1188 RGESGAAKKVSRQAPKKPAPKKTTKKA 1214
Score = 30.6 bits (69), Expect = 2.3
Identities = 6/29 (20%), Positives = 12/29 (41%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWS 457
+ + + KK ++ KK KK +
Sbjct: 1183 QRAAARGESGAAKKVSRQAPKKPAPKKTT 1211
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 30.6 bits (70), Expect = 1.9
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKK 454
V ++ +K +K KKKKKKK K K K +
Sbjct: 674 VPLSDLEKIQKPKKKKKKKPKTVKPKPR 701
Score = 30.2 bits (69), Expect = 3.0
Identities = 12/32 (37%), Positives = 21/32 (65%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKKWSFR 459
H+ +K+ K +KK+KKKKK+K+K+ +
Sbjct: 607 HELIEARKRLNKANEKKEKKKKKQKEKQEELK 638
>gnl|CDD|220281 pfam09531, Ndc1_Nup, Nucleoporin protein Ndc1-Nup. Ndc1 is a
nucleoporin protein that is a component of the Nuclear
Pore Complex, and, in fungi, also of the Spindle Pole
Body. It consists of six transmembrane segments, three
lumenal loops, both concentrated at the N-terminus and
cytoplasmic domains largely at the C-terminus, all of
which are well conserved.
Length = 557
Score = 30.8 bits (70), Expect = 1.9
Identities = 23/148 (15%), Positives = 47/148 (31%), Gaps = 26/148 (17%)
Query: 60 IFVMSALCCISTLMTLTTFLIDTERFK--YPERPI-----VFLSGCYFMVSIGY------ 106
+V+SA + K +RP+ ++L + + Y
Sbjct: 98 TYVVSAWLFAWVYIWSLPTSYRLLWVKSEAHDRPLLNERALYLWLFGVFLGLYYSVQHLI 157
Query: 107 --LIRVSVGHEEVACEDQ-------MVRYGASGPTPCTLVFLLVYFFGMSSYIWWVILSF 157
R+S + + + ++ TL++ +VY+FG+ S IW + S
Sbjct: 158 FDRDRLSFPVIQQPPQKRLKERLPSLLYKSLILSLISTLLYPIVYWFGLRSPIWKLTNSL 217
Query: 158 T-WFLA---AGLKWGNEAIASYSLYFHL 181
FL + ++ L
Sbjct: 218 AKLFLDLDSSAPLPSFSGPLNFRLLLRT 245
>gnl|CDD|185443 PTZ00097, PTZ00097, 60S ribosomal protein L19; Provisional.
Length = 175
Score = 29.7 bits (67), Expect = 2.0
Identities = 12/34 (35%), Positives = 19/34 (55%), Gaps = 4/34 (11%)
Query: 426 HVHKNERKKKKKKKK----KKKKKKKKKKKKKKK 455
H KNE+ K+KK + ++ K K + K+K K
Sbjct: 141 HKTKNEKVKEKKIQDQLEARRAKAKALRNKRKAK 174
>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein. This is a
family of proteins which appears to have a highly
conserved zinc finger domain at the C terminal end,
described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
predicted to contain a coiled coil. Members are
annotated as being tumour-associated antigen HCA127 in
humans but this could not confirmed.
Length = 215
Score = 30.2 bits (68), Expect = 2.0
Identities = 22/94 (23%), Positives = 31/94 (32%), Gaps = 24/94 (25%)
Query: 381 KQHDYLPPLAHGMPPPPNIPIPAVPVMGGGA---------------HSMTSSLKQNP--- 422
+PP+ +P PP P A H M S +Q P
Sbjct: 123 PATSPVPPV--PLPDPPAFPSTLPANAAAAAAAQQQRDVPQNLINRHRMPPSFQQPPPMK 180
Query: 423 --YSSH--VHKNERKKKKKKKKKKKKKKKKKKKK 452
S H +H+N K K + + KK K+K
Sbjct: 181 NCLSCHQQIHRNAPICPLCKAKSRSRNPKKPKRK 214
>gnl|CDD|240309 PTZ00189, PTZ00189, 60S ribosomal protein L21; Provisional.
Length = 160
Score = 29.6 bits (67), Expect = 2.0
Identities = 9/37 (24%), Positives = 15/37 (40%), Gaps = 7/37 (18%)
Query: 426 HVHKN-------ERKKKKKKKKKKKKKKKKKKKKKKK 455
HV K+ +R K + KKK + K++
Sbjct: 95 HVRKSRCREDFLKRVAANDAIKAEAKKKGELPSTKRQ 131
>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein. This model
describes nucleoid occlusion protein, a close homolog to
ParB chromosome partitioning proteins including Spo0J in
Bacillus subtilis. Its gene often is located near the
gene for the Spo0J ortholog. This protein bind a
specific DNA sequence and blocks cytokinesis from
happening until chromosome segregation is complete.
Length = 255
Score = 30.2 bits (69), Expect = 2.2
Identities = 12/27 (44%), Positives = 17/27 (62%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKK 453
V + E KK +K +K KKKKK++K
Sbjct: 184 VKQTEELIKKLLEKPEKPKKKKKRRKG 210
Score = 27.9 bits (63), Expect = 9.9
Identities = 11/22 (50%), Positives = 16/22 (72%)
Query: 437 KKKKKKKKKKKKKKKKKKKWSF 458
KK +K +K KKKKK++K +S
Sbjct: 192 KKLLEKPEKPKKKKKRRKGFSK 213
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 30.4 bits (68), Expect = 2.4
Identities = 10/32 (31%), Positives = 18/32 (56%)
Query: 423 YSSHVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
YS ++ K K KK+K +++++KK K
Sbjct: 515 YSETSEADKDVNKSKNKKRKVDEEEEEKKLKM 546
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 30.5 bits (69), Expect = 2.4
Identities = 13/25 (52%), Positives = 21/25 (84%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKKKK 455
E K KKKKKK+KKK+++ K+++K +
Sbjct: 742 EEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 29.7 bits (67), Expect = 4.8
Identities = 23/110 (20%), Positives = 36/110 (32%), Gaps = 11/110 (10%)
Query: 360 LFGFNNNTTRVLINTNKHRVLKQHDYLPPLAHGMPPPPNIPIPAVPVMGGGAHSMTSSLK 419
L N N N NK K+ ++P N G G K
Sbjct: 1489 LLNLNGN-----ENVNKKINQKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKK 1543
Query: 420 QNPYSSH------VHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQRF 463
N S + ++ + KK+K KK+ K + + + R+ RF
Sbjct: 1544 GNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEAELDFFLKRYLRF 1593
Score = 29.7 bits (67), Expect = 4.9
Identities = 12/24 (50%), Positives = 21/24 (87%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
++K KKKKKK+KKK+++ K+++K
Sbjct: 741 VEEKTKKKKKKEKKKEEEYKREEK 764
>gnl|CDD|220633 pfam10214, Rrn6, RNA polymerase I-specific transcription-initiation
factor. RNA polymerase I-specific
transcription-initiation factor Rrn6 and Rrn7 represent
components of a multisubunit transcription factor
essential for the initiation of rDNA transcription by
Pol I. These proteins are found in fungi.
Length = 753
Score = 30.5 bits (69), Expect = 2.4
Identities = 12/58 (20%), Positives = 17/58 (29%), Gaps = 6/58 (10%)
Query: 387 PPLAHGMPPPPNIPIPAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKK 444
++P + M S + R KKKKKKK+K
Sbjct: 699 SSQPPTSQSSSDLPPSSSQAFSLSDLPMQSQSESGLSGG------RSASKKKKKKKRK 750
>gnl|CDD|188624 cd09904, H3TH_XPG, H3TH domain of Xeroderma pigmentosum
complementation group G (XPG) nuclease, a
structure-specific, divalent-metal-ion dependent, 5'
nuclease. The Xeroderma pigmentosum complementation
group G (XPG) nuclease plays a central role in
nucleotide excision repair (NER) in cleaving DNA bubble
structures or loops. XPG is a member of the
structure-specific, 5' nuclease family that catalyzes
hydrolysis of DNA duplex-containing nucleic acid
structures during DNA replication, repair, and
recombination. Members of this subgroup include the
H3TH (helix-3-turn-helix) domains of XPG and other
similar eukaryotic 5' nucleases. These nucleases contain
a PIN (PilT N terminus) domain with a helical arch/clamp
region/I domain (not included here) and inserted within
the PIN domain is an atypical helix-hairpin-helix-2
(HhH2)-like region. This atypical HhH2 region, the H3TH
domain, has an extended loop with at least three turns
between the first two helices, and only three of the
four helices appear to be conserved. Both the H3TH
domain and the helical arch/clamp region are involved in
DNA binding. Studies suggest that a glycine-rich loop
in the H3TH domain contacts the phosphate backbone of
the template strand in the downstream DNA duplex. These
nucleases have a carboxylate rich active site that is
involved in binding essential divalent metal ion
cofactors (Mg2+ or Mn2+) required for nuclease activity.
The first metal binding site is composed entirely of
Asp/Glu residues from the PIN domain, whereas, the
second metal binding site is composed generally of two
Asp residues from the PIN domain and one Asp residue
from the H3TH domain. Together with the helical arch and
network of amino acids interacting with metal binding
ions, the H3TH region defines a positively charged
active-site DNA-binding groove in structure-specific 5'
nucleases.
Length = 97
Score = 28.4 bits (64), Expect = 2.7
Identities = 6/32 (18%), Positives = 12/32 (37%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKKWSF 458
+ +K + K++ K+K K K
Sbjct: 46 WENAQPEKSEDSDNDKQEFKRKHKNYLKNLIL 77
>gnl|CDD|218845 pfam05991, NYN_YacP, YacP-like NYN domain. This family consists of
bacterial proteins related to YacP. This family is
uncharacterized functionally, but it has been suggested
that these proteins are nucleases due to them containing
a NYN domain. NYN (for N4BP1, YacP-like Nuclease)
domains were discovered by Anantharaman and Aravind.
Based on gene neighborhoods it was suggested that the
bacterial YacP proteins interact with the Ribonuclease
III and TrmH methylase in a processome complex that
catalyzes the maturation of rRNA and tRNA.
Length = 165
Score = 29.1 bits (66), Expect = 2.8
Identities = 14/59 (23%), Positives = 30/59 (50%), Gaps = 6/59 (10%)
Query: 406 VMGGGAHSMTS-SLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQRF 463
+ G GA ++S L + V + E+K +KK +K++K +KK + + + ++
Sbjct: 110 IFGRGALRISSRELAEE-----VKRAEKKIRKKAEKRRKSRKKYLDRLSDEVLEKLEKL 163
>gnl|CDD|225816 COG3277, GAR1, RNA-binding protein involved in rRNA processing
[Translation, ribosomal structure and biogenesis].
Length = 98
Score = 28.1 bits (63), Expect = 2.8
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKK 452
++ K +KK+K +KK++ KK K
Sbjct: 73 LYIPPDKLIRKKRKLPRKKRRPKKPK 98
Score = 28.1 bits (63), Expect = 3.4
Identities = 9/22 (40%), Positives = 14/22 (63%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKK 453
K +KK+K +KK++ KK K
Sbjct: 77 PDKLIRKKRKLPRKKRRPKKPK 98
Score = 27.7 bits (62), Expect = 4.5
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKK 454
K +KK+K +KK++ KK K
Sbjct: 76 PPDKLIRKKRKLPRKKRRPKKPK 98
Score = 27.7 bits (62), Expect = 4.6
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKK 455
K +KK+K +KK++ KK K
Sbjct: 76 PPDKLIRKKRKLPRKKRRPKKPK 98
>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e.
Length = 108
Score = 28.4 bits (64), Expect = 2.9
Identities = 12/30 (40%), Positives = 21/30 (70%), Gaps = 6/30 (20%)
Query: 432 RKKKKKKKKKKK------KKKKKKKKKKKK 455
+++KKKKKKK + KK+KK+ +++K
Sbjct: 38 KREKKKKKKKSEGEFFAEKKEKKEVSEQRK 67
Score = 28.4 bits (64), Expect = 3.3
Identities = 13/30 (43%), Positives = 20/30 (66%), Gaps = 6/30 (20%)
Query: 432 RKKKKKKKKKK------KKKKKKKKKKKKK 455
R+KKKKKKK + KK+KK+ +++K
Sbjct: 39 REKKKKKKKSEGEFFAEKKEKKEVSEQRKA 68
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 28.2 bits (63), Expect = 2.9
Identities = 10/22 (45%), Positives = 12/22 (54%)
Query: 435 KKKKKKKKKKKKKKKKKKKKKW 456
+ +K K KK KK KK KW
Sbjct: 48 ETTTEKCKGKKDKKDCKKGCKW 69
>gnl|CDD|218643 pfam05571, DUF766, Protein of unknown function (DUF766). This
family consists of several eukaryotic proteins of
unknown function.
Length = 296
Score = 29.7 bits (67), Expect = 3.0
Identities = 35/132 (26%), Positives = 59/132 (44%), Gaps = 22/132 (16%)
Query: 221 YFVVFPLIIYLILGIIFLFG-GFISLFRIRNVIKNQGGISNRTKADKLEKLMIRIGIFSV 279
V+PL Y I+ + +LF + L R V K + ++ + + F +
Sbjct: 141 QEAVYPL--YTIVFVFYLFCLVLMFLLRPILVCKILY------VSGNMKPIYAALYFFPI 192
Query: 280 LSTLPASIVIGCLIYESYVYNEYVESIVCPC---SEKLDKDY--------FKPLHVVLLL 328
L+ + A V+G LIY S+ Y V S+V S KLD+ P ++V+L
Sbjct: 193 LTVIHA--VLGGLIYYSFPYISLVISLVANAFHLSLKLDQPLKALIVRSVTSPTNLVILF 250
Query: 329 KYFMALAVGLTS 340
+++ LA G+ S
Sbjct: 251 IHWLLLAYGVIS 262
>gnl|CDD|192718 pfam11176, DUF2962, Protein of unknown function (DUF2962). This
eukaryotic family of proteins has no known function.
Length = 155
Score = 29.1 bits (66), Expect = 3.0
Identities = 10/42 (23%), Positives = 18/42 (42%), Gaps = 5/42 (11%)
Query: 427 VHKNERK-----KKKKKKKKKKKKKKKKKKKKKKWSFRFQRF 463
+H RK + + K +KKK ++ +K+ R F
Sbjct: 13 LHPKSRKAKQLNRATLRDDKLEKKKAERAEKRSLELDRLFFF 54
Score = 28.8 bits (65), Expect = 3.6
Identities = 8/33 (24%), Positives = 16/33 (48%), Gaps = 3/33 (9%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQ 461
R K +KKK ++ +K+ + + + FQ
Sbjct: 26 ATLRDDKLEKKKAERAEKRSLELDRLFF---FQ 55
Score = 28.0 bits (63), Expect = 6.3
Identities = 9/39 (23%), Positives = 17/39 (43%)
Query: 422 PYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFRF 460
P S + R + K +KKK ++ +K+ + F
Sbjct: 15 PKSRKAKQLNRATLRDDKLEKKKAERAEKRSLELDRLFF 53
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 30.3 bits (68), Expect = 3.2
Identities = 11/56 (19%), Positives = 19/56 (33%)
Query: 386 LPPLAHGMPPPPNIPIPAVPVMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKK 441
LPP H PPP P PA ++ + R ++ ++ +
Sbjct: 2617 LPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGR 2672
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 30.0 bits (67), Expect = 3.2
Identities = 14/41 (34%), Positives = 22/41 (53%)
Query: 415 TSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
TS++K N + KN+ K K KK + K+ K+ +K K
Sbjct: 211 TSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSK 251
Score = 28.8 bits (64), Expect = 7.2
Identities = 13/42 (30%), Positives = 23/42 (54%)
Query: 418 LKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFR 459
++++P ++ KN K K+K K+K K KK + K + R
Sbjct: 204 MRRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKR 245
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 28.7 bits (65), Expect = 3.3
Identities = 14/23 (60%), Positives = 19/23 (82%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKK 455
K+KKKKKKKKK+ + ++KKKK
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKY 104
Score = 27.9 bits (63), Expect = 6.3
Identities = 13/24 (54%), Positives = 19/24 (79%)
Query: 432 RKKKKKKKKKKKKKKKKKKKKKKK 455
++KKKKKKKKK+ + ++KKKK
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYV 105
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 29.4 bits (66), Expect = 3.3
Identities = 8/26 (30%), Positives = 16/26 (61%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K E+KK K+ K+++K++K+
Sbjct: 189 KEEKKKVKEAKRREKEEKRMAALVAA 214
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 30.0 bits (67), Expect = 3.6
Identities = 18/79 (22%), Positives = 36/79 (45%), Gaps = 13/79 (16%)
Query: 389 LAHGMPPPPNIPIPAVPVMGGGAHSMTSSLKQNP-------------YSSHVHKNERKKK 435
+A +PPPP VP M++ + +N + V +RK +
Sbjct: 1 MAPTIPPPPAATPVTVPADSSIWDRMSNWISENKTAVYTIAGVTVVATGAGVVYYQRKGQ 60
Query: 436 KKKKKKKKKKKKKKKKKKK 454
++++ K K KK+++K+K+
Sbjct: 61 QQRESKPKISKKERRKRKQ 79
>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein. NinG or Rap is
involved in recombination. Rap (recombination adept with
plasmid) increases lambda-by-plasmid recombination
catalyzed by Escherichia coli's RecBCD pathway.
Length = 188
Score = 29.2 bits (66), Expect = 3.6
Identities = 8/27 (29%), Positives = 20/27 (74%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
K E+ ++KK+K + + ++++ K +K+K
Sbjct: 35 KREKAQEKKRKAEAQAERRELKARKEK 61
Score = 28.5 bits (64), Expect = 5.3
Identities = 7/28 (25%), Positives = 18/28 (64%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKW 456
+ +RK + + ++++ K +K+K K + W
Sbjct: 41 EKKRKAEAQAERRELKARKEKLKTRSDW 68
Score = 28.5 bits (64), Expect = 5.6
Identities = 7/27 (25%), Positives = 18/27 (66%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
E+K+K + + ++++ K +K+K K +
Sbjct: 39 AQEKKRKAEAQAERRELKARKEKLKTR 65
Score = 28.1 bits (63), Expect = 6.8
Identities = 6/30 (20%), Positives = 19/30 (63%)
Query: 434 KKKKKKKKKKKKKKKKKKKKKKWSFRFQRF 463
K++K ++KK+K + + ++++ R ++
Sbjct: 33 ALKREKAQEKKRKAEAQAERRELKARKEKL 62
>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
Length = 1560
Score = 29.7 bits (67), Expect = 3.7
Identities = 10/25 (40%), Positives = 12/25 (48%)
Query: 131 PTPCTLVFLLVYFFGMSSYIWWVIL 155
PTP L L F + Y+WW I
Sbjct: 227 PTPKRLSLLRTLFAALPYYVWWQIP 251
>gnl|CDD|165177 PHA02834, PHA02834, chemokine receptor-like protein; Provisional.
Length = 323
Score = 29.5 bits (66), Expect = 3.7
Identities = 47/200 (23%), Positives = 88/200 (44%), Gaps = 35/200 (17%)
Query: 130 GPTPCTLVFLLVYFFGMSSYIWWVIL-SFTWFLAA--GLKWGNEAIASYSLYFHLAAWLI 186
G C LV L VYF G S +++V L S ++ K N++I S S+ +AAW+
Sbjct: 94 GEFMCKLV-LGVYFVGFFSNMFFVTLISIDRYILVVNATKIKNKSI-SLSVLLSVAAWVC 151
Query: 187 PTVKSVSVIIMSAVDG-DTIAGICYVGNHNIHNLKYFV-----VFPLIIYLILGIIFLFG 240
+ S+ +++ VD D + + H + F +F ++I LI+ I
Sbjct: 152 SVILSMPAMVLYYVDNTDNLKQCIFNDYHENFSWSAFFNFEINIFGIVIPLIILIYCYSK 211
Query: 241 GFISLFRIRNVIKNQGGISNRTKADKLEKLMIRIGIFSVLSTLPASIV------------ 288
+L +N N+T++ K+++ + F+V+ +P +IV
Sbjct: 212 ILYTLKNCKN--------KNKTRSI---KIILTVVTFTVVFWVPFNIVLFINSLQSVGLI 260
Query: 289 -IGCLIYESYVYNEYVESIV 307
IGC ++ VY+ + ++
Sbjct: 261 DIGCYHFKKIVYSIDIAELI 280
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 29.1 bits (66), Expect = 3.7
Identities = 10/31 (32%), Positives = 20/31 (64%)
Query: 425 SHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +++ +K+KKK K++KK K++K K
Sbjct: 89 TPIYEYFEAEKEKKKAMSKEEKKAIKEEKDK 119
Score = 28.7 bits (65), Expect = 5.2
Identities = 10/28 (35%), Positives = 19/28 (67%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E++KKK K++KK K++K K ++
Sbjct: 95 FEAEKEKKKAMSKEEKKAIKEEKDKLEE 122
Score = 28.7 bits (65), Expect = 5.9
Identities = 8/28 (28%), Positives = 17/28 (60%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKW 456
K ++K K++KK K++K K ++ +
Sbjct: 99 KEKKKAMSKEEKKAIKEEKDKLEEPYGY 126
Score = 27.9 bits (63), Expect = 9.2
Identities = 10/32 (31%), Positives = 18/32 (56%)
Query: 423 YSSHVHKNERKKKKKKKKKKKKKKKKKKKKKK 454
Y + E+KK K++KK K++K K ++
Sbjct: 92 YEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 29.6 bits (67), Expect = 3.8
Identities = 10/25 (40%), Positives = 10/25 (40%)
Query: 382 QHDYLPPLAHGMPPPPNIPIPAVPV 406
QH Y PP A P P P V
Sbjct: 89 QHQYQPPYASAQPRQPVQQPPEAQV 113
>gnl|CDD|223625 COG0551, TopA, Zn-finger domain associated with topoisomerase type
I [DNA replication, recombination, and repair].
Length = 140
Score = 28.7 bits (64), Expect = 3.9
Identities = 9/33 (27%), Positives = 16/33 (48%)
Query: 423 YSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
++ E+K K +K +KK+KKK +
Sbjct: 93 FTEKPKPKEKKCPKCGSRKLVEKKEKKKGVFLE 125
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 29.0 bits (65), Expect = 4.1
Identities = 12/19 (63%), Positives = 13/19 (68%)
Query: 429 KNERKKKKKKKKKKKKKKK 447
E K+KKKKK KKKK K
Sbjct: 86 NQENKQKKKKKDKKKKSPK 104
Score = 28.7 bits (64), Expect = 6.0
Identities = 14/37 (37%), Positives = 18/37 (48%), Gaps = 9/37 (24%)
Query: 428 HKNERKKKKKKKK---------KKKKKKKKKKKKKKK 455
K + KK+ + K+KKKKK KKKK K
Sbjct: 68 RKKGYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPK 104
>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3. The SLD3 DNA
replication regulator is required for loading and
maintenance of Cdc45 on chromatin during DNA
replication.
Length = 437
Score = 29.4 bits (66), Expect = 4.1
Identities = 12/30 (40%), Positives = 19/30 (63%), Gaps = 4/30 (13%)
Query: 430 NERKKKKKKKK----KKKKKKKKKKKKKKK 455
E KKK K+ + K++K++K+KK KK
Sbjct: 128 KEYKKKLLKRSQNLDRSKRRKRRKRKKNKK 157
>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
[DNA replication, recombination, and repair].
Length = 417
Score = 29.3 bits (66), Expect = 4.2
Identities = 11/29 (37%), Positives = 16/29 (55%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKKWS 457
K RK+K+ KK+ K+K+ KWS
Sbjct: 355 KENRKRKELALAKKQSSTTKRKENLPKWS 383
>gnl|CDD|220528 pfam10028, DUF2270, Predicted integral membrane protein (DUF2270).
This domain, found in various hypothetical bacterial
proteins, has no known function.
Length = 185
Score = 28.8 bits (65), Expect = 4.3
Identities = 12/59 (20%), Positives = 26/59 (44%), Gaps = 4/59 (6%)
Query: 99 YFMVSIGYLIRVSVGHEEVACEDQMVRYGASGPTPCTLVFLLVYFFGMSSYIWWVILSF 157
++ + +L+++++ +++ A GP P LV L V F Y + L+
Sbjct: 131 LAILLLAWLLKIAIHPTIAPSLSEVLERAAVGPIPGWLVLLGVAAF----YAALIALAL 185
>gnl|CDD|224444 COG1527, NtpC, Archaeal/vacuolar-type H+-ATPase subunit C [Energy
production and conversion].
Length = 346
Score = 29.2 bits (66), Expect = 4.4
Identities = 19/100 (19%), Positives = 35/100 (35%), Gaps = 16/100 (16%)
Query: 242 FISLFRIRNVIKN--QGGISNRTKADKLEKLMIRIGIFSVLSTLPA-----SIVIGCLIY 294
++ + I N IK + ++ + + L+I +G F L TL +V
Sbjct: 95 YLYKWDIEN-IKTLLRAKLAGDPEE--ISDLLIPLGDFETLLTLAEAKTMEEVVETLEG- 150
Query: 295 ESYVYNEYVESIVCPCSEKLDKDYFKPLHVVLLLKYFMAL 334
Y+ + + D +PL L Y+ L
Sbjct: 151 -----TTYLAPLEEALRDYEDTGDIEPLENALDKAYYEDL 185
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 29.3 bits (66), Expect = 4.4
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKK 453
+ K+ KKKKKKKKKKKKK
Sbjct: 48 QEAENKQNNKKKKKKKKKKKKKNLG 72
Score = 29.3 bits (66), Expect = 4.9
Identities = 13/26 (50%), Positives = 19/26 (73%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
+NE K++K++ K+K KKK KK KK
Sbjct: 7 ENEVKQQKQQNKQKGTKKKNKKSKKD 32
Score = 29.3 bits (66), Expect = 4.9
Identities = 14/28 (50%), Positives = 16/28 (57%)
Query: 428 HKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+K KKKKKKKKKKKKK +
Sbjct: 52 NKQNNKKKKKKKKKKKKKNLGEAYDLAY 79
Score = 29.3 bits (66), Expect = 5.1
Identities = 15/29 (51%), Positives = 20/29 (68%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ +N+ + K+ KKKKKKKKKKKKK
Sbjct: 44 ISENQEAENKQNNKKKKKKKKKKKKKNLG 72
Score = 28.9 bits (65), Expect = 6.8
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
E + K++K++ K+K KKK KK KK
Sbjct: 5 NGENEVKQQKQQNKQKGTKKKNKKSKK 31
Score = 28.5 bits (64), Expect = 8.8
Identities = 15/26 (57%), Positives = 17/26 (65%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKKK 455
E K+ KKKKKKKKKKKKK +
Sbjct: 50 AENKQNNKKKKKKKKKKKKKNLGEAY 75
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 29.3 bits (66), Expect = 4.5
Identities = 9/22 (40%), Positives = 12/22 (54%)
Query: 435 KKKKKKKKKKKKKKKKKKKKKW 456
+KKK + KKK K KK +
Sbjct: 153 AEKKKPRSKKKSSKNSKKLRID 174
Score = 28.5 bits (64), Expect = 7.8
Identities = 10/29 (34%), Positives = 14/29 (48%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+ E +KKK + KKK K KK +
Sbjct: 146 CERTEILAEKKKPRSKKKSSKNSKKLRID 174
>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
beta and delta phosphatidylinositol 3-kinases (PI3Ks).
PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
regulate cell processes such as cell growth,
differentiation, proliferation, and motility. PI3Ks
work on phosphorylation of phosphatidylinositol,
phosphatidylinositide (4)P (PtdIns (4)P),2 or
PtdIns(4,5)P2. Specifically they phosphorylate the D3
hydroxyl group of phosphoinositol lipids on the inositol
ring. There are 3 classes of PI3Ks based on structure,
regulation, and specificity. All classes contain a C2
domain, a PIK domain, and a kinase catalytic domain.
The members here are class I, beta and delta isoforms of
PI3Ks and contain both a Ras-binding domain and a
p85-binding domain. Class II PI3Ks contain both of
these as well as a PX domain, and a C-terminal C2 domain
containing a nuclear localization signal. C2 domains
fold into an 8-standed beta-sandwich that can adopt 2
structural arrangements: Type I and Type II,
distinguished by a circular permutation involving their
N- and C-terminal beta strands. Many C2 domains are
Ca2+-dependent membrane-targeting modules that bind a
wide variety of substances including bind phospholipids,
inositol polyphosphates, and intracellular proteins.
Most C2 domain proteins are either signal transduction
enzymes that contain a single C2 domain, such as protein
kinase C, or membrane trafficking proteins which contain
at least two C2 domains, such as synaptotagmin 1.
However, there are a few exceptions to this including
RIM isoforms and some splice variants of piccolo/aczonin
and intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. Members have a type-I
topology.
Length = 173
Score = 28.8 bits (65), Expect = 4.5
Identities = 12/22 (54%), Positives = 15/22 (68%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKK 452
E KK K K+ +K + KKKKKK
Sbjct: 84 EVSKKAKGKRSRKNQTKKKKKK 105
Score = 28.4 bits (64), Expect = 5.7
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 433 KKKKKKKKKKKKKKKKKKKKK 453
KK K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
Score = 28.4 bits (64), Expect = 5.7
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 434 KKKKKKKKKKKKKKKKKKKKK 454
KK K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
Score = 28.4 bits (64), Expect = 5.7
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 435 KKKKKKKKKKKKKKKKKKKKK 455
KK K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2). Surfeit locus
protein 2 is part of a group of at least six sequence
unrelated genes (Surf-1 to Surf-6). The six Surfeit
genes have been classified as housekeeping genes, being
expressed in all tissue types tested and not containing
a TATA box in their promoter region. The exact function
of SURF2 is unknown.
Length = 244
Score = 29.1 bits (65), Expect = 4.7
Identities = 15/37 (40%), Positives = 19/37 (51%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFRFQRF 463
V E ++K+ KK+ KK KK KKK FRF
Sbjct: 207 VESPELQQKRSKKQSGSLTKKFKKNHKKKGPFRFSSV 243
>gnl|CDD|206208 pfam14038, YqzE, YqzE-like protein. The YqzE-like protein family
includes the B. subtilis YqzE protein, which is
functionally uncharacterized. It is a part of the ComG
operon, which is regulated by the competence
transcription factor ComK. This family of proteins is
found in bacteria. Proteins in this family are typically
between 49 and 66 amino acids in length.
Length = 54
Score = 26.5 bits (59), Expect = 4.7
Identities = 8/21 (38%), Positives = 19/21 (90%)
Query: 439 KKKKKKKKKKKKKKKKKWSFR 459
K+++K+KK+++K++K +S+R
Sbjct: 21 KEERKEKKEERKEEKPPFSYR 41
>gnl|CDD|133045 cd06423, CESA_like, CESA_like is the cellulose synthase
superfamily. The cellulose synthase (CESA) superfamily
includes a wide variety of glycosyltransferase family 2
enzymes that share the common characteristic of
catalyzing the elongation of polysaccharide chains. The
members include cellulose synthase catalytic subunit,
chitin synthase, glucan biosynthesis protein and other
families of CESA-like proteins. Cellulose synthase
catalyzes the polymerization reaction of cellulose, an
aggregate of unbranched polymers of beta-1,4-linked
glucose residues in plants, most algae, some bacteria
and fungi, and even some animals. In bacteria, algae and
lower eukaryotes, there is a second unrelated type of
cellulose synthase (Type II), which produces acylated
cellulose, a derivative of cellulose. Chitin synthase
catalyzes the incorporation of GlcNAc from substrate
UDP-GlcNAc into chitin, which is a linear homopolymer of
beta-(1,4)-linked GlcNAc residues and Glucan
Biosynthesis protein catalyzes the elongation of
beta-1,2 polyglucose chains of Glucan.
Length = 180
Score = 28.7 bits (65), Expect = 4.7
Identities = 16/66 (24%), Positives = 25/66 (37%), Gaps = 9/66 (13%)
Query: 205 IAGICYVGNHNIHNL------KYFVVFPLIIYL--ILGIIFLFGGFISLFRIRNVIKNQG 256
+ G V N + + L +Y +F L LG + + G FR R ++ G
Sbjct: 112 VQGRVRVRNGSENLLTRLQAIEYLSIFRLGRRAQSALGGVLVLSGAFGAFR-REALREVG 170
Query: 257 GISNRT 262
G T
Sbjct: 171 GWDEDT 176
>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in
biomineralisation. They seem to regulate the formation
of crystallites during the secretory stage of tooth
enamel development. thought to play a major role in the
structural organisation and mineralisation of developing
enamel. They are found in the extracellular matrix.
Mutations in X-chromosomal amelogenin can cause
Amelogenesis imperfecta.
Length = 174
Score = 28.7 bits (64), Expect = 4.7
Identities = 13/54 (24%), Positives = 19/54 (35%), Gaps = 4/54 (7%)
Query: 377 HRVLKQHDYLPPLAHGMPPPPNIPIPAVPVMGGGAHSMTSSLKQNPYSSHVHKN 430
+ H L P + PP P+ VP G H M Q P+ ++
Sbjct: 59 QQQQSAHPKLTPHHQLLILPPQQPMMPVP----GHHPMVPMTGQQPHLQPPAQH 108
>gnl|CDD|220443 pfam09852, DUF2079, Predicted membrane protein (DUF2079). This
domain, found in various hypothetical prokaryotic
proteins, has no known function.
Length = 451
Score = 29.2 bits (66), Expect = 4.8
Identities = 37/202 (18%), Positives = 69/202 (34%), Gaps = 42/202 (20%)
Query: 71 TLMTLTTFLIDTERFKYPERPIVFLSGCYFMVSIGYLIRVSVGHEEVACEDQMVRYGASG 130
+ L +L+ R + ++ L +F++ LI + E MV Y G
Sbjct: 155 VVAGLGLWLLLRRRKRRWGLALLLLGLAWFLLVTNVLIPLFSSGESGR---FMVEYSYLG 211
Query: 131 ---------------------PTPCTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGN 169
TP L++LL+ + ++ S TW LAA
Sbjct: 212 GSTLEILLNLLLQPALLLQELFTPPKLLYLLLLLLPLG--FLPLLSSPTWLLAALPLLLL 269
Query: 170 EAIASY----SLYFHLAAWLIPTVKSVSVIIMSAVDGDTIAGICYVGNHNIHNLKYFVVF 225
++ SL + + L+P + ++A+DG +A + K ++
Sbjct: 270 NLLSGSPSYRSLGYQYSLPLVP------FLFLAAIDG--LARL----QSPRSRPKLKRIW 317
Query: 226 PLIIYLILGIIFLFGGFISLFR 247
+ ++L L F SL
Sbjct: 318 SVALFLALAKTLFFNPLRSLSF 339
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 29.2 bits (66), Expect = 5.0
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKKK 455
KN KK+ K K+ +++K K++ K
Sbjct: 10 KNALKKRLKAKQAEEEKAAKEEAKAAA 36
>gnl|CDD|241400 cd13246, PH_Scd1, Shape and Conjugation Deficiency 1 Pleckstrin
homology (PH) domain. Fission yeast Scd1 is an exchange
factor for Cdc42 and an effector of Ras1, the homolog of
the human H-Ras. Scd2/Bem1 mediates Cdc42 activation by
binding to Scd1/Cdc24 and to Cdc42. Ras1 regulates
Scd1/Cdc24/Ral1, which is a putative guanine nucleotide
exchange factor for Cdc42, a member of the Rho family of
Ras-like proteins. Cdc42 then activates the Shk1/Orb2
protein kinase. Scd1 interacts with Klp5 and Klp6
kinesins to mediate cytokinesis. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 148
Score = 28.0 bits (63), Expect = 5.6
Identities = 14/26 (53%), Positives = 15/26 (57%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K +KKKKK K K KKKKKK
Sbjct: 61 KEVKKKKKKSSLKSKSSSSSKKKKKK 86
>gnl|CDD|223605 COG0531, PotE, Amino acid transporters [Amino acid transport and
metabolism].
Length = 466
Score = 29.0 bits (65), Expect = 6.1
Identities = 29/164 (17%), Positives = 51/164 (31%), Gaps = 16/164 (9%)
Query: 135 TLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAIASYSLYFHLAAWLIPTVKSVSV 194
L I + L+ L G +A A + + +I +
Sbjct: 119 YLFPGPGLLSIGPLLIILIALALIALLTLLNLRGIKASAKINSIITILKIIIL----LIF 174
Query: 195 IIMSAVDGDTIAGICYVGNHNIHNLKYFVVFPLIIYLILGIIFLFGGFISLFRIRNVIKN 254
II+ G N F I+ IL F F GF ++ + +KN
Sbjct: 175 IILGLFAFGFSNG-----NLFAPFNPGGGSFGGILAAILLAFFAFTGFEAIATLAEEVKN 229
Query: 255 QGGISNRTKADKLEKLMIRIGIFSVLSTLPASIVIGCLIYESYV 298
+A +++ + I +L L A +++G L
Sbjct: 230 --PKRTIPRA-----IILSLLIVLILYILGALVIVGVLPAGDLA 266
>gnl|CDD|240380 PTZ00362, PTZ00362, hypothetical protein; Provisional.
Length = 479
Score = 29.0 bits (65), Expect = 6.2
Identities = 13/30 (43%), Positives = 16/30 (53%)
Query: 433 KKKKKKKKKKKKKKKKKKKKKKKWSFRFQR 462
KKKKKKKKK K+ + KK F +
Sbjct: 9 KKKKKKKKKASKESSNGSQNSKKTIFMYNL 38
>gnl|CDD|217911 pfam04123, DUF373, Domain of unknown function (DUF373). Archaeal
domain of unknown function. Predicted to be an integral
membrane protein with six transmembrane regions.
Length = 344
Score = 28.7 bits (65), Expect = 6.4
Identities = 18/126 (14%), Positives = 42/126 (33%), Gaps = 11/126 (8%)
Query: 136 LVFLLVYFFGMSSYIWWVILSF--TWFLAAGLKWGNEAIASYSLYFHLAAW-----LIPT 188
L++ L+ + +Y VIL + L G + + + + I
Sbjct: 171 LIYGLLALANLPAYALGVILLILGLYLLYRGFGLD-DFLRELLKRVRESLYEGRITFITY 229
Query: 189 VKSVSVIIMSAVDGDTIAGICYVGNHNIHNLKYFVVFPLIIYLILGIIFLFGG-FISLFR 247
V S+ +I++ + G + + + + + + +L L + F G I +
Sbjct: 230 VVSIILILIGIIYG--YLSLEKLSAISAGGVAATFLNGAVPWLALAALIAFLGKLIDEYL 287
Query: 248 IRNVIK 253
Sbjct: 288 NGRFRL 293
>gnl|CDD|153289 cd07605, I-BAR_IMD, Inverse (I)-BAR, also known as the IRSp53/MIM
homology Domain (IMD), a dimerization module that binds
and bends membranes. Inverse (I)-BAR (or IMD) is a
member of the Bin/Amphiphysin/Rvs (BAR) domain family.
It is a dimerization and lipid-binding module that bends
membranes and induces membrane protrusions in the
opposite direction compared to classical BAR and F-BAR
domains, which produce membrane invaginations. IMD
domains are found in Insulin Receptor tyrosine kinase
Substrate p53 (IRSp53), Missing in Metastasis (MIM), and
Brain-specific Angiogenesis Inhibitor 1-Associated
Protein 2-like (BAIAP2L) proteins. These are
multi-domain proteins that act as scaffolding proteins
and transducers of a variety of signaling pathways that
link membrane dynamics and the underlying actin
cytoskeleton. Most members contain an N-terminal IMD, an
SH3 domain, and a WASP homology 2 (WH2) actin-binding
motif at the C-terminus, exccept for MIM which does not
carry an SH3 domain. Some members contain additional
domains and motifs. The IMD domain binds and bundles
actin filaments, binds membranes and produces membrane
protrusions, and interacts with the small GTPase Rac.
Length = 223
Score = 28.5 bits (64), Expect = 6.7
Identities = 10/29 (34%), Positives = 18/29 (62%)
Query: 427 VHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
+K E K+K++ K + + KK +KK +K
Sbjct: 116 DYKKEYKQKREDLDKARSELKKLQKKSQK 144
>gnl|CDD|216245 pfam01015, Ribosomal_S3Ae, Ribosomal S3Ae family.
Length = 195
Score = 28.3 bits (64), Expect = 7.1
Identities = 8/18 (44%), Positives = 10/18 (55%)
Query: 439 KKKKKKKKKKKKKKKKKW 456
KK KKK K+K+W
Sbjct: 1 AGKKGKKKVVDPWKRKEW 18
>gnl|CDD|240272 PTZ00109, PTZ00109, DNA gyrase subunit b; Provisional.
Length = 903
Score = 28.7 bits (64), Expect = 7.5
Identities = 13/55 (23%), Positives = 22/55 (40%)
Query: 406 VMGGGAHSMTSSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKKWSFRF 460
V+ G + K + KN+ K KK K K K+ K+K+ + +
Sbjct: 191 VLHSGGKFQDTFPKNSRSDKSEDKNDTKSSKKGKSSHVKGPKEAKEKESSQMYEY 245
>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
topoisomerase IA [General function prediction only].
Length = 298
Score = 28.5 bits (64), Expect = 7.5
Identities = 10/23 (43%), Positives = 11/23 (47%)
Query: 431 ERKKKKKKKKKKKKKKKKKKKKK 453
ER+ K KK KK K KK
Sbjct: 254 ERRAKGGPGKKPAKKATAAKAKK 276
Score = 28.1 bits (63), Expect = 9.2
Identities = 11/26 (42%), Positives = 11/26 (42%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K K KK KK KK K KK
Sbjct: 268 KATAAKAKKTTAKKAAAKKAAKTKKA 293
>gnl|CDD|236295 PRK08570, rpl19e, 50S ribosomal protein L19e; Reviewed.
Length = 150
Score = 27.9 bits (63), Expect = 7.8
Identities = 12/37 (32%), Positives = 20/37 (54%), Gaps = 5/37 (13%)
Query: 425 SHVHKNERKKKKKKKKKKKKKKKKKKK-----KKKKW 456
S ER +K+KK +++ +K KK KK++W
Sbjct: 59 SRGRARERHEKRKKGRRRGPGSRKGKKGARTPKKERW 95
>gnl|CDD|219354 pfam07264, EI24, Etoposide-induced protein 2.4 (EI24). This family
contains a number of eukaryotic etoposide-induced 2.4
(EI24) proteins approximately 350 residues long as well
as bacterial CysZ proteins (formerly known as DUF540).
In cells treated with the cytotoxic drug etoposide, EI24
is induced by p53. It has been suggested to play an
important role in negative cell growth control.
Length = 205
Score = 27.9 bits (63), Expect = 7.9
Identities = 11/72 (15%), Positives = 25/72 (34%), Gaps = 3/72 (4%)
Query: 135 TLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAIASYSLYFHLAAWLIPTVKSVSV 194
L+ + + S + ++ + L G I + L + W + V V+
Sbjct: 24 NLLLFAGLLYFLFSLLGPLLDALLSLLWLGWLSWLLWILAVLLLL-VLLWFLFVV--VAS 80
Query: 195 IIMSAVDGDTIA 206
+I + +A
Sbjct: 81 LIAAPFWTPDLA 92
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 28.7 bits (65), Expect = 8.2
Identities = 15/35 (42%), Positives = 19/35 (54%)
Query: 416 SSLKQNPYSSHVHKNERKKKKKKKKKKKKKKKKKK 450
LK P + E+KK+KKK KKKK K + K
Sbjct: 386 KQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGK 420
>gnl|CDD|240527 cd12082, MATE_like, Multidrug and toxic compound extrusion family
and similar proteins. The integral membrane proteins
from the MATE family are involved in exporting
metabolites across the cell membrane and are responsible
for multidrug resistance (MDR) in many bacteria and
animals. MATE has also been identified as a large
multigene family in plants, where the proteins are
linked to disease resistance. A number of family members
are involved in the synthesis of peptidoglycan
components in bacteria.
Length = 420
Score = 28.4 bits (64), Expect = 8.3
Identities = 18/102 (17%), Positives = 35/102 (34%), Gaps = 8/102 (7%)
Query: 143 FFGMSSYIWWVILSFTWFLAAGLKWGNEAIASYSLYFHLAAWLIPTVKSVSVIIMSAVDG 202
+S + +L +AA G A+A+Y++ F +A+ ++ + V
Sbjct: 228 SAIQNSLLSLGLLIIVAIVAA--FGGAAALAAYTVAFRIASLAFMPALGLAQAALPVV-- 283
Query: 203 DTIAGICYVGNHNIHNLKYFVVFPLIIYLILGIIFLFGGFIS 244
G + + V LI LI ++ L
Sbjct: 284 ----GQNLGAGNFDRARRITWVALLIGLLIGAVLGLLILLFP 321
>gnl|CDD|217667 pfam03666, NPR3, Nitrogen Permease regulator of amino acid
transport activity 3. This family, also known in yeasts
as Rmd11, complexes with NPR2, pfam06218. This complex
heterodimer is responsible for inactivating TORC1. an
evolutionarily conserved protein complex that controls
cell size via nutritional input signals, specifically,
in response to amino acid starvation.
Length = 424
Score = 28.5 bits (64), Expect = 8.4
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 423 YSSHVHKNERKKKKKKKKKKKKKKKKK 449
+ HV +N +KKKKKK+KK +
Sbjct: 34 HPIHVDENGTWRKKKKKKQKKSDRADP 60
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 28.4 bits (64), Expect = 8.5
Identities = 7/25 (28%), Positives = 16/25 (64%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKK 454
+KKK++KK+++K+ K+ +
Sbjct: 175 PGHSSDSEKKKQRKKQRRKRSKELR 199
>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator. Shisa is a
transcription factor-type molecule that physically
interacts with immature forms of the Wnt receptor
Frizzled and the FGF receptor within the endoplasmic
reticulum to inhibit their post-translational maturation
and trafficking to the cell surface.
Length = 177
Score = 27.9 bits (62), Expect = 9.5
Identities = 12/63 (19%), Positives = 16/63 (25%)
Query: 366 NTTRVLINTNKHRVLKQHDYLPPLAHGMPPPPNIPIPAVPVMGGGAHSMTSSLKQNPYSS 425
TR T + L Q P G P+P P M +S+
Sbjct: 113 VMTRATSTTVQTTPLPQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPYSLQYPPPGLLQPQ 172
Query: 426 HVH 428
Sbjct: 173 GPP 175
>gnl|CDD|225105 COG2194, COG2194, Predicted membrane-associated, metal-dependent
hydrolase [General function prediction only].
Length = 555
Score = 28.5 bits (64), Expect = 9.6
Identities = 10/47 (21%), Positives = 14/47 (29%), Gaps = 6/47 (12%)
Query: 133 PCTLVFLLVYFFGMSSYIWWVILSFTWFLAAGLKWGNEAIASYSLYF 179
LVF + + S+ + A ASY YF
Sbjct: 50 LLALVFAFLLLLLLLSFPRLLKPLAGVLSLVS------AAASYFAYF 90
>gnl|CDD|205184 pfam13003, MRL1, Ribosomal protein L1. This family includes
putative ribosomal L1 and L10 proteins and fragments.
Length = 105
Score = 26.9 bits (58), Expect = 9.6
Identities = 14/34 (41%), Positives = 15/34 (44%)
Query: 422 PYSSHVHKNERKKKKKKKKKKKKKKKKKKKKKKK 455
P S + KKK K K KK K K KKK
Sbjct: 2 PRSVNDILPYYKKKVKTNDKTKKGTKTVKNDKKK 35
>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
II/NMEMI. CRISPR loci appear to be mobile elements with
a wide host range. This model represents a protein found
only in CRISPR-containing species, near other
CRISPR-associated proteins (cas), as part of the NMENI
subtype of CRISPR/Cas locus. The species range so far
for this protein is animal pathogens and commensals only
[Mobile and extrachromosomal element functions, Other].
Length = 805
Score = 28.5 bits (64), Expect = 9.7
Identities = 6/25 (24%), Positives = 12/25 (48%)
Query: 430 NERKKKKKKKKKKKKKKKKKKKKKK 454
+ K+ K++ KK + K K+
Sbjct: 526 EQGTNFGKRNSKERYKKNEDKIKEF 550
>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
pleckstrin homology (PH) domain. The BCR gene is one of
the two genes in the BCR-ABL complex, which is
associated with the Philadelphia chromosome, a product
of a reciprocal translocation between chromosomes 22 and
9. BCR is a GTPase-activating protein (GAP) for RAC1
(primarily) and CDC42. The Dbl region of BCR has the
most RhoGEF activity for Cdc42, and less activity
towards Rac and Rho. Since BCR possesses both GAP and
GEF activities, it may function to temporally regulate
the activity of these GTPases. It also displays
serine/threonine kinase activity. The BCR protein
contains multiple domains including an N-terminal kinase
domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
domain, and a C-terminal RhoGAP domain. ABR, a related
smaller protein, is structurally similar to BCR, but
lacks the N-terminal kinase domain and has GAP activity
for both Rac and Cdc42. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 179
Score = 27.7 bits (62), Expect = 9.7
Identities = 7/26 (26%), Positives = 13/26 (50%)
Query: 429 KNERKKKKKKKKKKKKKKKKKKKKKK 454
K +R ++KK + +K +KK
Sbjct: 104 KIKRDSAERKKAASSGSRAIEKLRKK 129
>gnl|CDD|219784 pfam08302, tRNA_lig_CPD, Fungal tRNA ligase phosphodiesterase
domain. This domain is found in fungal tRNA ligases and
has cyclic phosphodiesterase activity. tRNA ligases are
enzymes required for the splicing of precursor tRNA
molecules containing introns.
Length = 259
Score = 28.1 bits (63), Expect = 9.7
Identities = 9/27 (33%), Positives = 13/27 (48%)
Query: 426 HVHKNERKKKKKKKKKKKKKKKKKKKK 452
+ K K K+K + +KKKKK
Sbjct: 46 KIIGYGSKNNKNKQKGETPSPEKKKKK 72
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.325 0.139 0.441
Gapped
Lambda K H
0.267 0.0738 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,307,995
Number of extensions: 2447940
Number of successful extensions: 18729
Number of sequences better than 10.0: 1
Number of HSP's gapped: 15412
Number of HSP's successfully gapped: 1071
Length of query: 463
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 363
Effective length of database: 6,502,202
Effective search space: 2360299326
Effective search space used: 2360299326
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 61 (27.3 bits)