RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy680
(122 letters)
>gnl|CDD|214500 smart00065, GAF, Domain present in phytochromes and cGMP-specific
phosphodiesterases. Mutations within these domains in
PDE6B result in autosomal recessive inheritance of
retinitis pigmentosa.
Length = 149
Score = 43.5 bits (103), Expect = 2e-06
Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 8/79 (10%)
Query: 12 DLKSLSKKTVDNVSVLLDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEV 71
DL+ L + ++ + LL AD ++ V+ G LV D + L +
Sbjct: 1 DLEELLQTILEELRQLLGADRVLIYLVDENDRG--ELVLVAADGLTLPTL------GIRF 52
Query: 72 PWGVGILGHVAVTGETVNL 90
P G+ G VA TG +N+
Sbjct: 53 PLDEGLAGRVAETGRPLNI 71
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 40.3 bits (95), Expect = 8e-05
Identities = 13/32 (40%), Positives = 22/32 (68%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
L K+ EKK+K+K+K K KK+ + K +R++
Sbjct: 396 LAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRK 427
Score = 37.6 bits (88), Expect = 6e-04
Identities = 13/43 (30%), Positives = 21/43 (48%)
Query: 80 HVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
A + + KK K+ +KK+K+K+K K KK + K
Sbjct: 380 TKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNI 422
Score = 37.6 bits (88), Expect = 7e-04
Identities = 11/31 (35%), Positives = 18/31 (58%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K +KK+K+K+K K KK+ + + KR
Sbjct: 395 VLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425
Score = 34.5 bits (80), Expect = 0.008
Identities = 8/26 (30%), Positives = 14/26 (53%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAE 116
+K+K K KK+ + K K++K
Sbjct: 406 EKEKPKVKKRHRDTKNIGKRRKPSGT 431
Score = 32.2 bits (74), Expect = 0.047
Identities = 7/27 (25%), Positives = 13/27 (48%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIE 118
+K + KK+ + K K++K E
Sbjct: 408 EKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 31.8 bits (73), Expect = 0.061
Identities = 8/26 (30%), Positives = 14/26 (53%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAE 116
+K K KK+ + K K++K +E
Sbjct: 408 EKPKVKKRHRDTKNIGKRRKPSGTSE 433
Score = 30.3 bits (69), Expect = 0.19
Identities = 6/26 (23%), Positives = 12/26 (46%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAE 116
+ K +K+ + K K++K E
Sbjct: 409 KPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 28.0 bits (63), Expect = 1.3
Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 3/33 (9%)
Query: 92 KKKEKKKKKKKKKKKK---KKKKKKKAEIERKR 121
K K +KK K KK K+ +KK+ E E+ +
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPK 411
Score = 25.7 bits (57), Expect = 7.7
Identities = 9/32 (28%), Positives = 17/32 (53%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
L+ K + +KK K KK K+ + E+++
Sbjct: 376 LRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEK 407
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 38.4 bits (90), Expect = 3e-04
Identities = 16/30 (53%), Positives = 25/30 (83%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KK+E+KK +K+KKKKK+KKK KK + + ++
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 38.0 bits (89), Expect = 5e-04
Identities = 16/30 (53%), Positives = 23/30 (76%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KKK ++KK +K+KKKKK+KKK K ++ R
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 35.7 bits (83), Expect = 0.003
Identities = 14/30 (46%), Positives = 24/30 (80%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KK++++KK +K+KKKKK+KKK + ++K
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRKKKG 412
Score = 35.7 bits (83), Expect = 0.003
Identities = 16/24 (66%), Positives = 22/24 (91%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKK 114
++KK +K+KKKKK+KKK KK+KKK
Sbjct: 388 EEKKPQKRKKKKKRKKKGKKRKKK 411
Score = 34.9 bits (81), Expect = 0.006
Identities = 15/24 (62%), Positives = 20/24 (83%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKK 114
+ +K KKKKK+KKK KK+KKK +K
Sbjct: 391 KPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 34.9 bits (81), Expect = 0.006
Identities = 16/24 (66%), Positives = 21/24 (87%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKK 114
+KK +K+KKKKK+KKK KK+KKK
Sbjct: 389 EKKPQKRKKKKKRKKKGKKRKKKG 412
Score = 34.2 bits (79), Expect = 0.010
Identities = 14/24 (58%), Positives = 21/24 (87%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKK 114
+K +++KKKKK+KKK KK+KKK +
Sbjct: 390 KKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 33.0 bits (76), Expect = 0.027
Identities = 14/24 (58%), Positives = 22/24 (91%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKK 114
+++K+ +K+KKKKK+KKK KK+KK
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKK 410
Score = 32.6 bits (75), Expect = 0.034
Identities = 14/31 (45%), Positives = 25/31 (80%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ K+K+++KK +K+KKKKK+KKK + +K+
Sbjct: 381 KPPKKKREEKKPQKRKKKKKRKKKGKKRKKK 411
Score = 28.8 bits (65), Expect = 0.72
Identities = 12/23 (52%), Positives = 18/23 (78%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKK 114
K+K K KKK+++KK +K+KKK
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKK 398
Score = 28.8 bits (65), Expect = 0.76
Identities = 12/30 (40%), Positives = 21/30 (70%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+K K KKK+++KK +K+KKKK ++ +
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKKKKRKKKGK 406
Score = 28.4 bits (64), Expect = 0.94
Identities = 13/30 (43%), Positives = 22/30 (73%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K + KKK+++KK +K+KKKKK + + K+
Sbjct: 378 KYPKPPKKKREEKKPQKRKKKKKRKKKGKK 407
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 37.9 bits (89), Expect = 3e-04
Identities = 14/31 (45%), Positives = 18/31 (58%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K EK +KK +K K K +KKK K E + R
Sbjct: 68 LKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 33.3 bits (77), Expect = 0.011
Identities = 12/30 (40%), Positives = 18/30 (60%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+ K +K +KK +K K K +KKK + E K
Sbjct: 67 ELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96
Score = 28.3 bits (64), Expect = 0.69
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 94 KEKKKKKKKKKKKKKKKKKKKAEIERK 120
K++ K +K +KK +K K K + + K
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAK 91
Score = 27.5 bits (62), Expect = 1.5
Identities = 11/35 (31%), Positives = 20/35 (57%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
E++ +K+ K +K +KK +K K KAE ++
Sbjct: 56 ESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKA 90
Score = 27.1 bits (61), Expect = 1.8
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKK 114
+K ++K +K K K +KKK KK++
Sbjct: 71 WEKAEKKAEKAKAKAEKKKAKKEEP 95
>gnl|CDD|164595 MTH00022, CYTB, cytochrome b; Validated.
Length = 379
Score = 36.2 bits (84), Expect = 0.002
Identities = 23/76 (30%), Positives = 32/76 (42%), Gaps = 26/76 (34%)
Query: 3 IVRDIARDLDLKSLSKKTVDNVSVLLDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLV 62
I+RD+ L+ L A+GASLFF+ CL +H G GL
Sbjct: 68 IMRDVNYGFLLRYLH------------ANGASLFFL--------CLY-----IHIGRGLY 102
Query: 63 YSTADEVEVPWGVGIL 78
Y + V W VG++
Sbjct: 103 YGGYLKFHV-WNVGVV 117
>gnl|CDD|177162 MTH00100, CYTB, cytochrome b; Provisional.
Length = 379
Score = 35.0 bits (81), Expect = 0.005
Identities = 19/51 (37%), Positives = 26/51 (50%), Gaps = 14/51 (27%)
Query: 28 LDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
L A+GAS+FF+ CL LH G GL Y + +E W +GI+
Sbjct: 82 LHANGASMFFI--------CLF-----LHVGRGLYYGSYLFLET-WNIGII 118
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 34.8 bits (81), Expect = 0.006
Identities = 12/32 (37%), Positives = 17/32 (53%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
L KKE K K +KKK+K++ K + R
Sbjct: 60 LLDKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 33.3 bits (77), Expect = 0.021
Identities = 12/33 (36%), Positives = 21/33 (63%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
L KK+ K K +KKK+K++ K KA+ + + +
Sbjct: 61 LDKKELKAWHKAQKKKEKQEAKAAKAKSKPRLF 93
Score = 29.8 bits (68), Expect = 0.27
Identities = 10/36 (27%), Positives = 19/36 (52%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
E + +K+ K K +KKK+K++ KA + +
Sbjct: 54 EELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89
>gnl|CDD|177109 MTH00034, CYTB, cytochrome b; Validated.
Length = 379
Score = 34.7 bits (80), Expect = 0.006
Identities = 17/49 (34%), Positives = 24/49 (48%), Gaps = 14/49 (28%)
Query: 30 ADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
A+GASLFF+ CL H G GL Y + +E W +G++
Sbjct: 84 ANGASLFFI--------CLY-----FHIGRGLYYGSYVNIET-WNIGVI 118
>gnl|CDD|177239 MTH00191, CYTB, cytochrome b; Provisional.
Length = 365
Score = 34.1 bits (79), Expect = 0.009
Identities = 17/49 (34%), Positives = 23/49 (46%), Gaps = 14/49 (28%)
Query: 30 ADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
A+GAS FF+ C+ LH G GL Y + E W VG++
Sbjct: 81 ANGASFFFI--------CIY-----LHIGRGLYYGSYLNKET-WNVGVI 115
>gnl|CDD|177102 MTH00016, CYTB, cytochrome b; Validated.
Length = 378
Score = 33.9 bits (78), Expect = 0.013
Identities = 17/51 (33%), Positives = 24/51 (47%), Gaps = 14/51 (27%)
Query: 27 LLDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGI 77
L A+GAS FF+ CL LH G G+ Y + +E W +G+
Sbjct: 82 NLHANGASFFFI--------CLY-----LHIGRGIYYGSYFLMET-WNIGV 118
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 32.9 bits (75), Expect = 0.014
Identities = 15/21 (71%), Positives = 15/21 (71%)
Query: 94 KEKKKKKKKKKKKKKKKKKKK 114
K KKKKKK KK KK KK KK
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKK 119
Score = 32.5 bits (74), Expect = 0.019
Identities = 14/23 (60%), Positives = 16/23 (69%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKK 113
+K K+KKKK KK KK KK KK
Sbjct: 98 EKPKKKKKKSKKTKKPKKSSKKD 120
Score = 31.4 bits (71), Expect = 0.046
Identities = 14/22 (63%), Positives = 15/22 (68%)
Query: 95 EKKKKKKKKKKKKKKKKKKKAE 116
EK KKKKKK KK KK KK +
Sbjct: 98 EKPKKKKKKSKKTKKPKKSSKK 119
Score = 31.4 bits (71), Expect = 0.047
Identities = 14/22 (63%), Positives = 15/22 (68%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKK 114
K +KKKKK KK KK KK KK
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 30.6 bits (69), Expect = 0.081
Identities = 14/28 (50%), Positives = 15/28 (53%)
Query: 84 TGETVNLQKKKEKKKKKKKKKKKKKKKK 111
E KKK+KK KK KK KK KK
Sbjct: 93 PPEPTEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 30.2 bits (68), Expect = 0.11
Identities = 13/20 (65%), Positives = 14/20 (70%)
Query: 97 KKKKKKKKKKKKKKKKKKAE 116
KKKKKK KK KK KK K +
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120
Score = 29.5 bits (66), Expect = 0.26
Identities = 13/22 (59%), Positives = 16/22 (72%)
Query: 97 KKKKKKKKKKKKKKKKKKAEIE 118
K KKKKKK KK KK KK ++ +
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 27.9 bits (62), Expect = 0.87
Identities = 13/37 (35%), Positives = 17/37 (45%)
Query: 84 TGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+ E +K KKKKKK KK KK K ++
Sbjct: 84 GEPSHPDDHPPEPTEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 25.6 bits (56), Expect = 6.1
Identities = 12/21 (57%), Positives = 14/21 (66%)
Query: 101 KKKKKKKKKKKKKKAEIERKR 121
K KKKKKK KK KK + K+
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKK 119
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 33.5 bits (77), Expect = 0.014
Identities = 13/30 (43%), Positives = 16/30 (53%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K KE+ KKKK K+ K K KK K+
Sbjct: 253 KSKEEIKKKKPKESKGVKALKKVVAKGMKK 282
Score = 30.4 bits (69), Expect = 0.16
Identities = 11/29 (37%), Positives = 14/29 (48%)
Query: 84 TGETVNLQKKKEKKKKKKKKKKKKKKKKK 112
E K++ KKKK K+ K K KK
Sbjct: 247 RAEKKRKSKEEIKKKKPKESKGVKALKKV 275
Score = 29.6 bits (67), Expect = 0.31
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
++K +++ KKKK K+ K K KK
Sbjct: 246 SRAEKKRKSKEEIKKKKPKESKGVKALKK 274
Score = 29.2 bits (66), Expect = 0.43
Identities = 11/26 (42%), Positives = 14/26 (53%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKK 114
++KKK K+ K K KK K KK
Sbjct: 257 EIKKKKPKESKGVKALKKVVAKGMKK 282
Score = 28.8 bits (65), Expect = 0.57
Identities = 10/25 (40%), Positives = 12/25 (48%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKA 115
KKK+ K+ K K KK K K
Sbjct: 258 IKKKKPKESKGVKALKKVVAKGMKK 282
Score = 28.1 bits (63), Expect = 1.1
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKA 115
KK++ K++ KKKK K+ K K
Sbjct: 250 KKRKSKEEIKKKKPKESKGVKALK 273
Score = 27.3 bits (61), Expect = 2.2
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEI 117
+K++K K++ KKKK K+ K KA
Sbjct: 249 EKKRKSKEEIKKKKPKESKGVKALK 273
Score = 25.4 bits (56), Expect = 9.2
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAE 116
+ + +KK+K K++ KKKK K+ K
Sbjct: 244 AESRAEKKRKSKEEIKKKKPKESKGV 269
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 32.9 bits (75), Expect = 0.020
Identities = 15/25 (60%), Positives = 21/25 (84%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKA 115
+KK E K++KKKKK+KKKKKK+ +
Sbjct: 146 KKKHEDDKERKKKKKEKKKKKKRHS 170
Score = 32.9 bits (75), Expect = 0.023
Identities = 14/25 (56%), Positives = 20/25 (80%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAE 116
KK +KKK + K++KKKKK+KKK +
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKK 166
Score = 32.5 bits (74), Expect = 0.028
Identities = 14/28 (50%), Positives = 21/28 (75%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
+K ++ K++KKKKK+KKKKKK+ E
Sbjct: 147 KKHEDDKERKKKKKEKKKKKKRHSPEHP 174
Score = 32.5 bits (74), Expect = 0.031
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
E + +KK KKKK + K++KKKKK+KK + +R
Sbjct: 135 EGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHS 170
Score = 32.1 bits (73), Expect = 0.033
Identities = 14/31 (45%), Positives = 21/31 (67%)
Query: 84 TGETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
+ + KK+K + K++KKKKK+KKKKK
Sbjct: 136 GLKGHEKKHKKKKHEDDKERKKKKKEKKKKK 166
Score = 31.3 bits (71), Expect = 0.060
Identities = 16/30 (53%), Positives = 22/30 (73%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+KKK + K++KKKKK+KKKKKK+ E
Sbjct: 145 KKKKHEDDKERKKKKKEKKKKKKRHSPEHP 174
>gnl|CDD|164624 MTH00053, CYTB, cytochrome b; Provisional.
Length = 381
Score = 32.8 bits (75), Expect = 0.026
Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 14/51 (27%)
Query: 28 LDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
L A+GAS+FF+ C V+ H G G+ Y + ++ V W VG+L
Sbjct: 83 LHANGASMFFL--------C----VY-FHIGRGIYYGSYTKIIV-WNVGVL 119
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 32.6 bits (75), Expect = 0.032
Identities = 15/39 (38%), Positives = 24/39 (61%), Gaps = 2/39 (5%)
Query: 85 GETVNLQKKKEKKKKKKKKKKKKKKKK--KKKAEIERKR 121
GE NL + KK +KK++K +KK +K +KA ++K
Sbjct: 401 GENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKA 439
Score = 32.6 bits (75), Expect = 0.039
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
Q+K EKK +K++ +K KKK + A + K
Sbjct: 416 KKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKG 448
Score = 31.4 bits (72), Expect = 0.072
Identities = 10/31 (32%), Positives = 22/31 (70%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
++KK +KK++K +KK +K++ +K A ++
Sbjct: 410 ERKKLRKKQRKAEKKAEKEEAEKAAAKKKAE 440
Score = 29.5 bits (67), Expect = 0.42
Identities = 13/27 (48%), Positives = 20/27 (74%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
L+KK+ K +KK +K++ +K KKKAE
Sbjct: 414 LRKKQRKAEKKAEKEEAEKAAAKKKAE 440
Score = 29.1 bits (66), Expect = 0.47
Identities = 9/32 (28%), Positives = 19/32 (59%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+K+++ +KK +K++ +K KKK +K
Sbjct: 415 RKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446
Score = 27.6 bits (62), Expect = 1.8
Identities = 11/29 (37%), Positives = 16/29 (55%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+K+E +K KKK + KK K + E K
Sbjct: 426 EKEEAEKAAAKKKAEAAAKKAKGPDGETK 454
Score = 27.2 bits (61), Expect = 2.2
Identities = 9/25 (36%), Positives = 13/25 (52%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAE 116
+KE+ +K KKK + KK K
Sbjct: 425 AEKEEAEKAAAKKKAEAAAKKAKGP 449
Score = 26.4 bits (59), Expect = 4.3
Identities = 10/40 (25%), Positives = 20/40 (50%)
Query: 82 AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
GE + ++KK +KK++K +KK + E ++
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEK 432
Score = 26.4 bits (59), Expect = 4.4
Identities = 9/31 (29%), Positives = 15/31 (48%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+K++ +K KKK + KK K +K
Sbjct: 425 AEKEEAEKAAAKKKAEAAAKKAKGPDGETKK 455
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 32.7 bits (75), Expect = 0.036
Identities = 12/33 (36%), Positives = 22/33 (66%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+K+ K++K+ +K KK+++KKKK+ E K
Sbjct: 557 KEEKEALKEQKRLRKLKKQEEKKKKELEKLEKA 589
Score = 32.3 bits (74), Expect = 0.045
Identities = 10/33 (30%), Positives = 24/33 (72%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
++K+EK+ K++K+ +K KK+++K + E ++
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKL 586
Score = 31.5 bits (72), Expect = 0.073
Identities = 10/32 (31%), Positives = 23/32 (71%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
++K+ K++K+ +K KK+++KKKK + ++
Sbjct: 558 EEKEALKEQKRLRKLKKQEEKKKKELEKLEKA 589
Score = 31.5 bits (72), Expect = 0.076
Identities = 10/30 (33%), Positives = 23/30 (76%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+ ++EK++K+ K++K+ +K KK+ E ++K
Sbjct: 552 ELQREKEEKEALKEQKRLRKLKKQEEKKKK 581
Score = 29.6 bits (67), Expect = 0.40
Identities = 12/34 (35%), Positives = 24/34 (70%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
++K+ +K KK+++KKKK+ +K +KA+I +
Sbjct: 563 LKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEF 596
Score = 27.3 bits (61), Expect = 2.6
Identities = 11/37 (29%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 87 TVNLQKKKEKKKKKKKKKKKKKKK--KKKKAEIERKR 121
L K+E +++K++K+ K++K +K K + E+K+
Sbjct: 544 VWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKK 580
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 32.0 bits (72), Expect = 0.051
Identities = 16/35 (45%), Positives = 26/35 (74%)
Query: 85 GETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIER 119
G+ ++KK +K KKK+KK+K+K++ K KK E+E
Sbjct: 190 GDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEG 224
Score = 29.3 bits (65), Expect = 0.52
Identities = 10/34 (29%), Positives = 23/34 (67%)
Query: 88 VNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
V L++++ +++ +K K++KKK++K+K R
Sbjct: 82 VKLEEERRHRQRLEKDKREKKKREKEKRGRRRHH 115
Score = 28.9 bits (64), Expect = 0.63
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIE 118
KK +KK+KK+K+K++ K KKK+ +
Sbjct: 200 KKPKKKEKKEKEKERDKDKKKEVEGFK 226
Score = 28.5 bits (63), Expect = 0.96
Identities = 12/25 (48%), Positives = 21/25 (84%)
Query: 97 KKKKKKKKKKKKKKKKKKAEIERKR 121
+KK KK KKK+KK+K+K+ + ++K+
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKK 220
Score = 28.5 bits (63), Expect = 1.0
Identities = 12/30 (40%), Positives = 22/30 (73%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
+K+ KK KKK+KK+K+K++ K + E + +
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGF 225
Score = 27.0 bits (59), Expect = 2.8
Identities = 10/26 (38%), Positives = 20/26 (76%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAE 116
+ KK++KK+K+K++ K KKK+ + +
Sbjct: 201 KPKKKEKKEKEKERDKDKKKEVEGFK 226
Score = 27.0 bits (59), Expect = 3.3
Identities = 16/28 (57%), Positives = 21/28 (75%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIER 119
KK K KKKK++K+K++KKKKKK R
Sbjct: 275 KKSPKHKKKKQRKEKEEKKKKKKHHHHR 302
Score = 26.6 bits (58), Expect = 4.0
Identities = 9/29 (31%), Positives = 19/29 (65%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
L+K K +KKK++K+K+ +++ E +
Sbjct: 94 LEKDKREKKKREKEKRGRRRHHSLGTESD 122
Score = 26.6 bits (58), Expect = 4.1
Identities = 9/30 (30%), Positives = 24/30 (80%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
++E++ +++ +K K++KKK++K + R+R+
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRH 114
Score = 26.6 bits (58), Expect = 4.4
Identities = 9/36 (25%), Positives = 25/36 (69%)
Query: 80 HVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
+V + E + Q+ ++ K++KKK++K+K+ +++ +
Sbjct: 81 YVKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHS 116
Score = 26.6 bits (58), Expect = 4.5
Identities = 14/29 (48%), Positives = 20/29 (68%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
ET K K+KK++K+K++KKKKKK
Sbjct: 273 ETKKSPKHKKKKQRKEKEEKKKKKKHHHH 301
Score = 26.6 bits (58), Expect = 4.5
Identities = 11/38 (28%), Positives = 26/38 (68%)
Query: 83 VTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
V G ++ Q K +++++ +++ +K K++KKK E E++
Sbjct: 72 VPGLPMSDQYVKLEEERRHRQRLEKDKREKKKREKEKR 109
Score = 25.8 bits (56), Expect = 6.6
Identities = 20/76 (26%), Positives = 32/76 (42%), Gaps = 4/76 (5%)
Query: 50 SKVFDLHSGAGLVYSTADEVEVPWGVGILGHVAVTGETVNLQKKKEKKKK----KKKKKK 105
S + L S A+ E + G + + E+ KK KKKK++
Sbjct: 227 SLLLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQR 286
Query: 106 KKKKKKKKKAEIERKR 121
K+K++KKKK + R
Sbjct: 287 KEKEEKKKKKKHHHHR 302
Score = 25.4 bits (55), Expect = 9.6
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIER 119
+KKK++K+K++KKKKKK +
Sbjct: 281 KKKKQRKEKEEKKKKKKHHHHRCHHSDGG 309
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 32.2 bits (74), Expect = 0.052
Identities = 12/35 (34%), Positives = 23/35 (65%)
Query: 87 TVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
T ++K EK +KK++++KK+KKKK + + +
Sbjct: 409 TKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEE 443
Score = 31.8 bits (73), Expect = 0.054
Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 5/56 (8%)
Query: 65 TADEVEVPWGVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
T +E+E + G T + + +K EKK++++KK+KKKK KK E E +
Sbjct: 395 TEEEIEF-----LTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEE 445
Score = 30.3 bits (69), Expect = 0.22
Identities = 12/30 (40%), Positives = 23/30 (76%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+KKEKKKK KKK+++++++K + E ++
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Score = 29.9 bits (68), Expect = 0.28
Identities = 14/30 (46%), Positives = 21/30 (70%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+KK +++KK+KKKK KKK++ E E K
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKE 449
Score = 29.9 bits (68), Expect = 0.31
Identities = 13/30 (43%), Positives = 23/30 (76%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+K++E+KK+KKKK KKK++++ E + K
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEK 450
Score = 29.9 bits (68), Expect = 0.33
Identities = 11/30 (36%), Positives = 20/30 (66%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KK KK KK +K +KK++++KK + ++
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAF 435
Score = 29.5 bits (67), Expect = 0.36
Identities = 12/31 (38%), Positives = 21/31 (67%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
KK +K +KK++++KK+KKKK ++K
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEE 442
Score = 29.5 bits (67), Expect = 0.44
Identities = 10/32 (31%), Positives = 21/32 (65%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
L K+ KK KK +K +KK++++ + ++K+
Sbjct: 402 LTGSKKATKKIKKIVEKAEKKREEEKKEKKKK 433
Score = 28.0 bits (63), Expect = 1.3
Identities = 11/29 (37%), Positives = 23/29 (79%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
K+K+KK KKK+++++++K+K E E++
Sbjct: 428 KEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456
Score = 27.6 bits (62), Expect = 2.0
Identities = 8/29 (27%), Positives = 22/29 (75%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
+++++K+KKKK KKK++++++ + +
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEK 450
Score = 27.2 bits (61), Expect = 2.7
Identities = 9/32 (28%), Positives = 23/32 (71%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+++K++KKKK KKK+++++++ E + +
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Score = 26.8 bits (60), Expect = 3.7
Identities = 11/35 (31%), Positives = 24/35 (68%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+ +KK++KKK KKK+++++++K+ + E K
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Score = 26.4 bits (59), Expect = 5.0
Identities = 11/35 (31%), Positives = 24/35 (68%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
E +KK KKK+++++++K+KK+++ E E +
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE 460
Score = 26.0 bits (58), Expect = 6.3
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 94 KEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
KK KK KK +K +KK++ E + K+
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKK 432
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 31.7 bits (72), Expect = 0.057
Identities = 15/30 (50%), Positives = 19/30 (63%)
Query: 87 TVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
T ++KKK+ K KK K K KK + KKK E
Sbjct: 86 TTAVEKKKKGKSKKNKLKGKKDEDKKKARE 115
Score = 29.8 bits (67), Expect = 0.26
Identities = 13/25 (52%), Positives = 15/25 (60%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAE 116
KKK K KK K K KK + KKK +
Sbjct: 92 KKKGKSKKNKLKGKKDEDKKKAREG 116
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 32.1 bits (74), Expect = 0.058
Identities = 8/29 (27%), Positives = 13/29 (44%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
K KK K +K K++ +E + K
Sbjct: 829 AKDPAGKKAAVKFSRKTKQQYVASEKDGK 857
Score = 30.5 bits (70), Expect = 0.17
Identities = 8/24 (33%), Positives = 12/24 (50%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKA 115
KK K +K K++ +K KA
Sbjct: 835 KKAAVKFSRKTKQQYVASEKDGKA 858
Score = 30.5 bits (70), Expect = 0.20
Identities = 6/25 (24%), Positives = 9/25 (36%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKA 115
K KK K +K K++
Sbjct: 829 AKDPAGKKAAVKFSRKTKQQYVASE 853
Score = 30.1 bits (69), Expect = 0.26
Identities = 6/27 (22%), Positives = 10/27 (37%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
L+ + K KK K +K +
Sbjct: 821 LRYLADAPAKDPAGKKAAVKFSRKTKQ 847
Score = 29.4 bits (67), Expect = 0.39
Identities = 7/26 (26%), Positives = 10/26 (38%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKA 115
L K KK K +K K++
Sbjct: 824 LADAPAKDPAGKKAAVKFSRKTKQQY 849
Score = 29.0 bits (66), Expect = 0.58
Identities = 5/26 (19%), Positives = 11/26 (42%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAE 116
+ KK K +K K++ + +
Sbjct: 830 KDPAGKKAAVKFSRKTKQQYVASEKD 855
Score = 29.0 bits (66), Expect = 0.65
Identities = 6/24 (25%), Positives = 10/24 (41%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKA 115
K K +K K++ +K K
Sbjct: 836 KAAVKFSRKTKQQYVASEKDGKAT 859
Score = 28.6 bits (65), Expect = 0.84
Identities = 6/29 (20%), Positives = 11/29 (37%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+ KK K +K K++ E+
Sbjct: 827 APAKDPAGKKAAVKFSRKTKQQYVASEKD 855
Score = 28.6 bits (65), Expect = 0.87
Identities = 9/31 (29%), Positives = 12/31 (38%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
L +K EK + K KK + RK
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVKFSRK 844
Score = 26.7 bits (60), Expect = 4.3
Identities = 8/30 (26%), Positives = 12/30 (40%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
E + L +K +K + K KK A
Sbjct: 809 EALELLAEKPEKLRYLADAPAKDPAGKKAA 838
Score = 25.5 bits (57), Expect = 8.2
Identities = 4/29 (13%), Positives = 8/29 (27%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+ K KK K + ++
Sbjct: 821 LRYLADAPAKDPAGKKAAVKFSRKTKQQY 849
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 31.8 bits (72), Expect = 0.061
Identities = 14/30 (46%), Positives = 23/30 (76%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+++ E+KKKK++K K+K+ KK K A+ E K
Sbjct: 15 EEELERKKKKEEKAKEKELKKLKAAQKEAK 44
Score = 31.4 bits (71), Expect = 0.081
Identities = 15/37 (40%), Positives = 26/37 (70%)
Query: 84 TGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
T E + +KKKE+K K+K+ KK K +K+ KA+++ +
Sbjct: 14 TEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQ 50
Score = 27.2 bits (60), Expect = 3.1
Identities = 13/30 (43%), Positives = 22/30 (73%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
E L +++ ++KKKK++K K+K+ KK KA
Sbjct: 9 EKKILTEEELERKKKKEEKAKEKELKKLKA 38
>gnl|CDD|214425 MTH00119, CYTB, cytochrome b; Provisional.
Length = 380
Score = 31.8 bits (73), Expect = 0.067
Identities = 17/51 (33%), Positives = 24/51 (47%), Gaps = 14/51 (27%)
Query: 28 LDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
L A+GAS+FF+ C+ LH G GL Y + E W G++
Sbjct: 83 LHANGASMFFI--------CIY-----LHIGRGLYYGSYLYKET-WNTGVI 119
>gnl|CDD|214440 MTH00156, CYTB, cytochrome b; Provisional.
Length = 356
Score = 31.7 bits (73), Expect = 0.072
Identities = 15/51 (29%), Positives = 22/51 (43%), Gaps = 14/51 (27%)
Query: 28 LDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
L A+GAS FF+ C+ LH G G+ Y + W G++
Sbjct: 72 LHANGASFFFI--------CIY-----LHIGRGIYYGSYKLKHT-WMSGVI 108
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 31.6 bits (72), Expect = 0.073
Identities = 15/33 (45%), Positives = 26/33 (78%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
+ + ++K KKKKKK+KKK+++ K+++KA IE
Sbjct: 736 KISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768
Score = 31.6 bits (72), Expect = 0.082
Identities = 14/39 (35%), Positives = 22/39 (56%), Gaps = 5/39 (12%)
Query: 89 NLQKKKEK---KKKKKKKKKKKKKKKKKKAEIE--RKRY 122
N +K E+ + KK+K KK+ K +AE++ KRY
Sbjct: 1552 NQEKNIEEDYAESDIKKRKNKKQYKSNTEAELDFFLKRY 1590
Score = 31.2 bits (71), Expect = 0.13
Identities = 13/27 (48%), Positives = 23/27 (85%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEI 117
+K K+KKKK+KKK+++ K+++K + EI
Sbjct: 743 EKTKKKKKKEKKKEEEYKREEKARIEI 769
Score = 28.9 bits (65), Expect = 0.70
Identities = 13/30 (43%), Positives = 21/30 (70%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K + ++K KKKKKK+KKK+++ + E K
Sbjct: 736 KISDSVEEKTKKKKKKEKKKEEEYKREEKA 765
Score = 28.5 bits (64), Expect = 0.86
Identities = 14/25 (56%), Positives = 20/25 (80%)
Query: 95 EKKKKKKKKKKKKKKKKKKKAEIER 119
E+K KKKKKK+KKK+++ K+ E R
Sbjct: 742 EEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 28.1 bits (63), Expect = 1.3
Identities = 11/31 (35%), Positives = 23/31 (74%)
Query: 83 VTGETVNLQKKKEKKKKKKKKKKKKKKKKKK 113
++V + KK+KKK+KKK+++ K+++K +
Sbjct: 736 KISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 30.9 bits (70), Expect = 0.084
Identities = 9/31 (29%), Positives = 17/31 (54%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
Q+ + K + K+KKK++ K K + + K
Sbjct: 111 QEDPPETKTESKEKKKREVPKPKTEKEKPKT 141
Score = 26.6 bits (59), Expect = 3.1
Identities = 8/31 (25%), Positives = 20/31 (64%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ ++E + K + K+KKK++ K + E+++
Sbjct: 108 ETEQEDPPETKTESKEKKKREVPKPKTEKEK 138
Score = 26.3 bits (58), Expect = 4.3
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ K + K+KKK++ K K +K+K + E K+
Sbjct: 115 PETKTESKEKKKREVPKPKTEKEKPKTEPKK 145
Score = 25.5 bits (56), Expect = 6.4
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKA 115
KK+E K K +K+K K + KK K
Sbjct: 125 KKREVPKPKTEKEKPKTEPKKPKP 148
>gnl|CDD|177203 MTH00145, CYTB, cytochrome b; Provisional.
Length = 379
Score = 31.4 bits (72), Expect = 0.094
Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 14/51 (27%)
Query: 28 LDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
L A+GAS FF+ C+ LH G GL Y + W +G+
Sbjct: 83 LHANGASFFFI--------CIY-----LHIGRGLYYGSYLMQHT-WNIGVT 119
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 30.4 bits (69), Expect = 0.11
Identities = 19/31 (61%), Positives = 26/31 (83%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
L++K ++KK+KKKKKKKKKKK KKA ++K
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 29.7 bits (67), Expect = 0.19
Identities = 18/27 (66%), Positives = 20/27 (74%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKA 115
QKK++KKKKKKKKKK KK KKK
Sbjct: 109 AKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 29.3 bits (66), Expect = 0.26
Identities = 19/30 (63%), Positives = 23/30 (76%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
Q KEK K+KK+KKKKKKKKKKK ++ K
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTSKKAAK 132
Score = 28.9 bits (65), Expect = 0.48
Identities = 17/35 (48%), Positives = 24/35 (68%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
E + K+ K+K K+KK+KKKKKKKKKK ++
Sbjct: 96 EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKA 130
Score = 27.7 bits (62), Expect = 1.1
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKK 112
QKKK+KKKKKKK KK KKKK
Sbjct: 109 AKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 27.0 bits (60), Expect = 2.2
Identities = 17/34 (50%), Positives = 24/34 (70%)
Query: 88 VNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
V K++ K+KK+KKKKKKKKKKK + +K+
Sbjct: 101 VKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 30.8 bits (70), Expect = 0.11
Identities = 19/38 (50%), Positives = 26/38 (68%)
Query: 83 VTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
V ++KKEKKKKK+ KK+KK+KK KK+ +E K
Sbjct: 144 EKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPK 181
Score = 27.0 bits (60), Expect = 2.3
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKK 114
++KK+KK+K + K KKKKKKKK
Sbjct: 167 KKEKKDKKEKMVEPKGSKKKKKKKK 191
Score = 27.0 bits (60), Expect = 2.7
Identities = 14/23 (60%), Positives = 17/23 (73%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKK 114
K K++K + K KKKKKKKKKK
Sbjct: 171 KDKKEKMVEPKGSKKKKKKKKKK 193
Score = 26.6 bits (59), Expect = 2.8
Identities = 14/33 (42%), Positives = 22/33 (66%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ KKEKK+KK KK+K + K KK + ++K+
Sbjct: 160 KKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 30.9 bits (70), Expect = 0.12
Identities = 10/27 (37%), Positives = 26/27 (96%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
Q+K+E++++K++KK+++++++K+KAE
Sbjct: 195 QQQKREEERRKQRKKQQEEEERKQKAE 221
Score = 30.1 bits (68), Expect = 0.21
Identities = 11/29 (37%), Positives = 25/29 (86%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
KK K++++K++++++K++KK++ E ERK
Sbjct: 189 LKKLKQQQQKREEERRKQRKKQQEEEERK 217
Score = 27.0 bits (60), Expect = 2.2
Identities = 8/29 (27%), Positives = 24/29 (82%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K+ K++++K++++++K++KK+ E E ++
Sbjct: 189 LKKLKQQQQKREEERRKQRKKQQEEEERK 217
Score = 25.8 bits (57), Expect = 6.1
Identities = 9/27 (33%), Positives = 24/27 (88%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEI 117
QK++E+++K++KK+++++++K+K E
Sbjct: 197 QKREEERRKQRKKQQEEEERKQKAEEA 223
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 30.8 bits (70), Expect = 0.13
Identities = 17/37 (45%), Positives = 22/37 (59%)
Query: 84 TGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
T E V L K + ++++ KKKKKKKKKK KK
Sbjct: 165 TEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSS 201
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 30.8 bits (70), Expect = 0.13
Identities = 15/38 (39%), Positives = 24/38 (63%)
Query: 84 TGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
E N + KK+ +KKKKKKK+KK+ K + + ++ K
Sbjct: 58 KKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKT 95
Score = 27.0 bits (60), Expect = 2.4
Identities = 14/32 (43%), Positives = 20/32 (62%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+KK EKKKKKKK+KK+ K + + K +
Sbjct: 65 ESKKKSEKKKKKKKEKKEPKSEGETKLGFKTP 96
Score = 27.0 bits (60), Expect = 2.7
Identities = 16/34 (47%), Positives = 22/34 (64%)
Query: 83 VTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
V + KK++K + KKK +KKKKKKK+K E
Sbjct: 49 VATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKE 82
Score = 25.4 bits (56), Expect = 9.3
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
KKK+KKK+KK+ K + + K K + K
Sbjct: 72 KKKKKKKEKKEPKSEGETKLGFKTPKKSK 100
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 30.7 bits (70), Expect = 0.14
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
+++KEKKK K++KK K++K+K E
Sbjct: 94 FEEEKEKKKAMSKEEKKAIKEEKEKLE 120
Score = 29.5 bits (67), Expect = 0.36
Identities = 11/28 (39%), Positives = 20/28 (71%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEIERK 120
++EK+KKK K++KK K++K ++E
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
Score = 27.2 bits (61), Expect = 1.8
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 94 KEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
+E+K+KKK K++KK K++ E + Y
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEPY 123
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 30.7 bits (70), Expect = 0.15
Identities = 14/35 (40%), Positives = 25/35 (71%), Gaps = 3/35 (8%)
Query: 90 LQKKKEKKKKKKKKKKKKK---KKKKKKAEIERKR 121
Q+KKE+KKK++++ K K ++++K E ERK+
Sbjct: 284 AQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKK 318
Score = 27.6 bits (62), Expect = 1.7
Identities = 8/32 (25%), Positives = 22/32 (68%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+ K E++++++ ++KK++KKK+++ K
Sbjct: 271 KILKAAEEERQEEAQEKKEEKKKEEREAKLAK 302
Score = 25.7 bits (57), Expect = 7.8
Identities = 11/40 (27%), Positives = 23/40 (57%), Gaps = 5/40 (12%)
Query: 87 TVNLQKKKEKKKKKKKKKKK-----KKKKKKKKAEIERKR 121
++++E+K K ++++ +KK++KKK E E K
Sbjct: 261 VDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKL 300
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 30.0 bits (68), Expect = 0.15
Identities = 16/28 (57%), Positives = 22/28 (78%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIER 119
KK+ K++KKK+KKKKKKK KK + E+
Sbjct: 85 KKRAKRQKKKQKKKKKKKAKKGNKKEEK 112
Score = 29.6 bits (67), Expect = 0.21
Identities = 15/30 (50%), Positives = 23/30 (76%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
++K KK+ K++KKK+KKKKKKK + +K
Sbjct: 80 EEKTAKKRAKRQKKKQKKKKKKKAKKGNKK 109
Score = 27.7 bits (62), Expect = 1.1
Identities = 14/30 (46%), Positives = 23/30 (76%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
++K E+K KK+ K++KKK+KKKK + +K
Sbjct: 76 KRKDEEKTAKKRAKRQKKKQKKKKKKKAKK 105
Score = 27.3 bits (61), Expect = 1.4
Identities = 17/28 (60%), Positives = 21/28 (75%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEIERK 120
KK K++KKK+KKKKKKK KK + E K
Sbjct: 85 KKRAKRQKKKQKKKKKKKAKKGNKKEEK 112
Score = 26.2 bits (58), Expect = 3.6
Identities = 16/22 (72%), Positives = 19/22 (86%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKK 112
QKKK+KKKKKKK KK KK++K
Sbjct: 91 QKKKQKKKKKKKAKKGNKKEEK 112
Score = 25.8 bits (57), Expect = 5.6
Identities = 14/28 (50%), Positives = 21/28 (75%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
+K +EK KK+ K++KKK+KKKKK +
Sbjct: 75 KKRKDEEKTAKKRAKRQKKKQKKKKKKK 102
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 30.2 bits (69), Expect = 0.18
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
+ +KEKKK K++KK K++K K E
Sbjct: 95 FEAEKEKKKAMSKEEKKAIKEEKDKLE 121
Score = 27.5 bits (62), Expect = 1.3
Identities = 11/28 (39%), Positives = 19/28 (67%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+ EK+KKK K++KK K++K ++E
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 30.6 bits (69), Expect = 0.19
Identities = 14/20 (70%), Positives = 18/20 (90%)
Query: 96 KKKKKKKKKKKKKKKKKKKA 115
+KKKKKKK K+KKKKK ++A
Sbjct: 684 RKKKKKKKSKEKKKKKNREA 703
Score = 29.5 bits (66), Expect = 0.47
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 98 KKKKKKKKKKKKKKKKKAEIERKRY 122
+KKKKKKK K+KKKKK E ++ +
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIH 708
Score = 29.5 bits (66), Expect = 0.47
Identities = 13/23 (56%), Positives = 19/23 (82%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKK 114
+KK+KKKK K+KKKKK ++ K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 28.7 bits (64), Expect = 0.86
Identities = 14/29 (48%), Positives = 21/29 (72%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
+KKK+KKK K+KKKKK ++ K+ +E
Sbjct: 683 SRKKKKKKKSKEKKKKKNREASKQIHALE 711
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 30.4 bits (69), Expect = 0.19
Identities = 12/38 (31%), Positives = 22/38 (57%)
Query: 84 TGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K K ++ KK++ + ++KKK+KKA+ ER+
Sbjct: 355 EKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRG 392
Score = 29.3 bits (66), Expect = 0.40
Identities = 10/30 (33%), Positives = 17/30 (56%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K E+ KK++ + ++KKK+KK R
Sbjct: 364 AKPERDKKERPGRYRRKKKEKKAKSERRGL 393
Score = 27.0 bits (60), Expect = 2.5
Identities = 9/29 (31%), Positives = 18/29 (62%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
+ KKE+ + ++KKK+KK K +++
Sbjct: 367 ERDKKERPGRYRRKKKEKKAKSERRGLQN 395
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 30.5 bits (69), Expect = 0.20
Identities = 12/31 (38%), Positives = 15/31 (48%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K K KK +K KK K +K A + KR
Sbjct: 775 AKPKAAKKDARKAKKPSAKTQKIAAATKAKR 805
Score = 29.3 bits (66), Expect = 0.51
Identities = 11/29 (37%), Positives = 16/29 (55%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ EKK K K KK +K KK A+ ++
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQKIA 798
Score = 27.4 bits (61), Expect = 2.5
Identities = 11/29 (37%), Positives = 14/29 (48%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+KK K K KK +K KK K +I
Sbjct: 772 EKKAKPKAAKKDARKAKKPSAKTQKIAAA 800
Score = 26.2 bits (58), Expect = 6.3
Identities = 10/31 (32%), Positives = 15/31 (48%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KK+ +K KK K +K KA+ K+
Sbjct: 779 AAKKDARKAKKPSAKTQKIAAATKAKRAAKK 809
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 30.2 bits (68), Expect = 0.22
Identities = 12/26 (46%), Positives = 13/26 (50%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAE 116
+ K EK KKK K KKKKK
Sbjct: 119 KPKTEKLKKKITVNKSTNKKKKKVLS 144
Score = 27.9 bits (62), Expect = 1.5
Identities = 11/30 (36%), Positives = 14/30 (46%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K + +K KKK K KKKK + K
Sbjct: 118 PKPKTEKLKKKITVNKSTNKKKKKVLSSKD 147
Score = 27.1 bits (60), Expect = 2.4
Identities = 11/30 (36%), Positives = 14/30 (46%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
+K K+K K KKKKK K E+
Sbjct: 121 KTEKLKKKITVNKSTNKKKKKVLSSKDELI 150
Score = 25.9 bits (57), Expect = 6.7
Identities = 10/31 (32%), Positives = 11/31 (35%)
Query: 88 VNLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
KKK K KKKKK K +
Sbjct: 122 TEKLKKKITVNKSTNKKKKKVLSSKDELIKY 152
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region covers
both the Brf homology II and III regions. This region is
involved in binding TATA binding protein.
Length = 95
Score = 29.1 bits (66), Expect = 0.24
Identities = 13/32 (40%), Positives = 20/32 (62%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
Q++KE K+K + KKKKK+KA+ +R
Sbjct: 32 QEEKELKQKADEGNNSGKKKKKRKAKKKRDEA 63
>gnl|CDD|177145 MTH00074, CYTB, cytochrome b; Provisional.
Length = 380
Score = 30.0 bits (68), Expect = 0.27
Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 14/51 (27%)
Query: 28 LDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
+ A+GAS FF+ C+ LH G GL Y + E W +G++
Sbjct: 83 IHANGASFFFI--------CIY-----LHIGRGLYYGSYMYKET-WNIGVI 119
>gnl|CDD|133672 MTH00033, CYTB, cytochrome b; Provisional.
Length = 383
Score = 29.9 bits (67), Expect = 0.29
Identities = 18/49 (36%), Positives = 23/49 (46%), Gaps = 14/49 (28%)
Query: 30 ADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
A+GASLFF+ C+ H G GL Y V + W VG+L
Sbjct: 81 ANGASLFFI--------CVYC-----HIGRGLYYGGYSRV-LTWIVGVL 115
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 30.1 bits (68), Expect = 0.29
Identities = 15/28 (53%), Positives = 18/28 (64%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
+ K KK KKKKKKKKK++K E E
Sbjct: 269 VSEMVKFKKPKKKKKKKKKRRKDLDEDE 296
Score = 29.0 bits (65), Expect = 0.66
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEIE 118
K KK KKKKKKKKK++K + E+E
Sbjct: 273 VKFKKPKKKKKKKKKRRKDLDEDELE 298
Score = 28.6 bits (64), Expect = 0.84
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
+ K KK KKKKKKKKK++K +
Sbjct: 267 YDVSEMVKFKKPKKKKKKKKKRRKDLD 293
Score = 25.9 bits (57), Expect = 7.1
Identities = 13/25 (52%), Positives = 17/25 (68%)
Query: 87 TVNLQKKKEKKKKKKKKKKKKKKKK 111
V+ K +K KKKKKKKKK++K
Sbjct: 268 DVSEMVKFKKPKKKKKKKKKRRKDL 292
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 29.6 bits (67), Expect = 0.31
Identities = 16/29 (55%), Positives = 19/29 (65%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
E++KKKKKKK KKKK KK A + K
Sbjct: 177 NAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
Score = 28.4 bits (64), Expect = 0.79
Identities = 14/35 (40%), Positives = 16/35 (45%)
Query: 81 VAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
+KKK+KKK KKKK KK KA
Sbjct: 171 DKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
Score = 26.5 bits (59), Expect = 3.3
Identities = 15/25 (60%), Positives = 17/25 (68%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAE 116
+E+KKKKKKK KKKK KK A
Sbjct: 177 NAEEEKKKKKKKSAKKKKLKKVAAV 201
Score = 25.3 bits (56), Expect = 8.4
Identities = 13/32 (40%), Positives = 19/32 (59%)
Query: 82 AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKK 113
++ N +++K+KKKKK KKKK KK
Sbjct: 170 RDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAV 201
>gnl|CDD|177189 MTH00131, CYTB, cytochrome b; Provisional.
Length = 380
Score = 29.6 bits (67), Expect = 0.31
Identities = 17/51 (33%), Positives = 24/51 (47%), Gaps = 14/51 (27%)
Query: 28 LDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
L A+GAS FF+ C+ LH G GL Y + E W +G++
Sbjct: 82 LHANGASFFFI--------CIY-----LHIGRGLYYGSYLYKET-WNIGVV 118
>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
TopJ. This model describes TopJ (MG_200, CbpA), a DnaJ
homolog and probable assembly protein of the Mycoplasma
terminal organelle. The terminal organelle is involved
in both cytadherence and gliding motility [Cellular
processes, Chemotaxis and motility].
Length = 871
Score = 29.8 bits (66), Expect = 0.35
Identities = 16/45 (35%), Positives = 22/45 (48%), Gaps = 3/45 (6%)
Query: 81 VAVTGETVNLQKKKEKKKKKKKKKKKK---KKKKKKKAEIERKRY 122
+ +GE N+ KKKKKKKKK K K++ IE +
Sbjct: 91 INKSGEFDNITDDDTPSKKKKKKKKKGWFWAKSKQESKTIETEEI 135
>gnl|CDD|216187 pfam00913, Trypan_glycop, Trypanosome variant surface glycoprotein
(A-type). The trypanosome parasite expresses these
proteins to evade the immune response. This family
includes a variety of surface proteins such as
Trypanosoma brucei VSGs such as expression site
associated gene (ESAG) 6 and 7.
Length = 370
Score = 29.7 bits (67), Expect = 0.35
Identities = 18/79 (22%), Positives = 23/79 (29%), Gaps = 4/79 (5%)
Query: 43 GGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGILGHVAVTGETVNLQKKKEKKKKKKK 102
G KKC ++ +G GL V W G L V +G T+ L
Sbjct: 198 GSKKC---RLTQGGTGNGLGKGGNLTANVKWAGGYLT-VTGSGATITLADLTALATAATG 253
Query: 103 KKKKKKKKKKKKAEIERKR 121
K KK
Sbjct: 254 AAKAWKKAAHAAKAKPTAT 272
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 29.7 bits (66), Expect = 0.36
Identities = 8/26 (30%), Positives = 25/26 (96%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAE 116
QK +E+++++K+K+K+++++++++AE
Sbjct: 594 QKAREEREREKEKEKEREREREREAE 619
>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
Length = 977
Score = 29.6 bits (66), Expect = 0.37
Identities = 13/21 (61%), Positives = 15/21 (71%)
Query: 97 KKKKKKKKKKKKKKKKKKAEI 117
K KK KKKK KKK+ KA+I
Sbjct: 122 KDKKNKKKKSAKKKEAHKAQI 142
Score = 26.9 bits (59), Expect = 3.0
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIE 118
K + + K KK KKKK KKK+ + +
Sbjct: 115 KNRVESWKDKKNKKKKSAKKKEAHKAQ 141
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 28.5 bits (64), Expect = 0.37
Identities = 14/27 (51%), Positives = 16/27 (59%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIE 118
KK+KKKKKKK + K KKA E
Sbjct: 23 VKKKKKKKKKKAEDTAATAKAKKATAE 49
Score = 26.2 bits (58), Expect = 2.1
Identities = 12/25 (48%), Positives = 14/25 (56%)
Query: 94 KEKKKKKKKKKKKKKKKKKKKAEIE 118
KKKKKKKKKK + KA+
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAKKA 46
Score = 26.2 bits (58), Expect = 2.5
Identities = 13/27 (48%), Positives = 15/27 (55%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
L KKK+KKKKKK + K KK
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAKKATA 48
Score = 26.2 bits (58), Expect = 2.5
Identities = 13/30 (43%), Positives = 16/30 (53%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KKK+KKKKK + K KK AE +
Sbjct: 25 KKKKKKKKKAEDTAATAKAKKATAEDVSEG 54
Score = 25.4 bits (56), Expect = 4.3
Identities = 13/24 (54%), Positives = 14/24 (58%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAE 116
K E KKKKKKKKKK + A
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAAT 40
Score = 25.0 bits (55), Expect = 6.2
Identities = 10/26 (38%), Positives = 13/26 (50%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEIE 118
+KKKKKKKKK + K +
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAKKAT 47
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 29.7 bits (66), Expect = 0.37
Identities = 8/32 (25%), Positives = 16/32 (50%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+KK E+ KK + K + + +AE ++
Sbjct: 1334 AKKKAEEAKKAAEAAKAEAEAAADEAEAAEEK 1365
Score = 29.0 bits (64), Expect = 0.64
Identities = 13/32 (40%), Positives = 18/32 (56%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+KK E+ KKK KKK ++ KK AE +
Sbjct: 1320 AKKKAEEAKKKADAAKKKAEEAKKAAEAAKAE 1351
Score = 29.0 bits (64), Expect = 0.78
Identities = 7/31 (22%), Positives = 15/31 (48%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+K E K + + + + ++KAE K+
Sbjct: 1342 KKAAEAAKAEAEAAADEAEAAEEKAEAAEKK 1372
Score = 28.2 bits (62), Expect = 1.3
Identities = 6/31 (19%), Positives = 16/31 (51%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ + E + + ++K + +KK E +K+
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379
Score = 27.8 bits (61), Expect = 1.7
Identities = 14/30 (46%), Positives = 21/30 (70%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
++K E +KKK++ KKK KKKAE ++K
Sbjct: 1363 EEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392
Score = 27.8 bits (61), Expect = 1.9
Identities = 10/31 (32%), Positives = 20/31 (64%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ E ++K + +KKK++ KKKA+ +K+
Sbjct: 1356 ADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386
Score = 27.4 bits (60), Expect = 2.2
Identities = 5/31 (16%), Positives = 15/31 (48%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K + + + + ++K + +K + E K+
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKK 1378
Score = 26.3 bits (57), Expect = 5.0
Identities = 15/30 (50%), Positives = 20/30 (66%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
KKK + KKK ++KKK + KKKAE +K
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKAEEAKK 1445
Score = 26.3 bits (57), Expect = 5.7
Identities = 13/31 (41%), Positives = 21/31 (67%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+KK E+KKK + KKK ++ KKK E+++
Sbjct: 1383 AKKKAEEKKKADEAKKKAEEDKKKADELKKA 1413
Score = 26.3 bits (57), Expect = 5.7
Identities = 13/31 (41%), Positives = 18/31 (58%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+KK E+ KKK + KK KKK E ++K
Sbjct: 1396 AKKKAEEDKKKADELKKAAAAKKKADEAKKK 1426
Score = 26.3 bits (57), Expect = 5.8
Identities = 15/31 (48%), Positives = 21/31 (67%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KKK KKK ++KKK + KKKAE ++K+
Sbjct: 1376 AKKKADAAKKKAEEKKKADEAKKKAEEDKKK 1406
Score = 26.3 bits (57), Expect = 6.4
Identities = 13/30 (43%), Positives = 20/30 (66%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KKK ++ KK + KKK ++ KKKA+ +K
Sbjct: 1476 KKKAEEAKKADEAKKKAEEAKKKADEAKKA 1505
Score = 26.3 bits (57), Expect = 6.5
Identities = 14/29 (48%), Positives = 18/29 (62%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
KKK + KK KKK + KKKAE ++K
Sbjct: 1404 KKKADELKKAAAAKKKADEAKKKAEEKKK 1432
Score = 25.9 bits (56), Expect = 7.1
Identities = 13/30 (43%), Positives = 21/30 (70%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KKK ++ KK + KKK ++ KKKA+ +K+
Sbjct: 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKK 1337
Score = 25.9 bits (56), Expect = 7.1
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
KKK ++ KKK + KK + KKKA+ +K
Sbjct: 1489 KKKAEEAKKKADEAKKAAEAKKKADEAKK 1517
Score = 25.9 bits (56), Expect = 7.8
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
KK KKK + KKK ++KKK E ++K
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKKK 1439
Score = 25.9 bits (56), Expect = 7.9
Identities = 14/30 (46%), Positives = 20/30 (66%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+KKK + KKK ++ KK + KKKAE +K
Sbjct: 1429 EKKKADEAKKKAEEAKKADEAKKKAEEAKK 1458
Score = 25.9 bits (56), Expect = 8.2
Identities = 7/31 (22%), Positives = 13/31 (41%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+K +E KK + K + + + E K
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEKA 1366
Score = 25.9 bits (56), Expect = 8.4
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+KK E+ KK + KKK ++ KKK ++K
Sbjct: 1307 AKKKAEEAKKADEAKKKAEEAKKKADAAKKK 1337
Score = 25.9 bits (56), Expect = 8.8
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KK E+ +KK + KK+ ++ KKAE +K+
Sbjct: 1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKK 1710
Score = 25.9 bits (56), Expect = 8.9
Identities = 10/31 (32%), Positives = 16/31 (51%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+KK + KKK ++ KK + K +AE
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358
Score = 25.5 bits (55), Expect = 9.2
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+KK E+ KK + KKK ++ KKK E ++
Sbjct: 1475 AKKKAEEAKKADEAKKKAEEAKKKADEAKKA 1505
Score = 25.5 bits (55), Expect = 9.6
Identities = 13/30 (43%), Positives = 19/30 (63%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KKK ++ KKK + KK KKKA+ +K+
Sbjct: 1397 KKKAEEDKKKADELKKAAAAKKKADEAKKK 1426
Score = 25.5 bits (55), Expect = 9.7
Identities = 14/31 (45%), Positives = 19/31 (61%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KK ++ KKK ++ KKK KKKAE +K
Sbjct: 1314 AKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 29.3 bits (66), Expect = 0.41
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 90 LQKKKEKKKKKKKKKKKKKK 109
++K+E KK+KKKK KK K
Sbjct: 355 PKRKREGDKKQKKKKSKKLK 374
Score = 27.7 bits (62), Expect = 1.5
Identities = 11/21 (52%), Positives = 15/21 (71%)
Query: 95 EKKKKKKKKKKKKKKKKKKKA 115
K+K++ KK+KKKK KK K
Sbjct: 355 PKRKREGDKKQKKKKSKKLKL 375
Score = 25.8 bits (57), Expect = 7.3
Identities = 11/24 (45%), Positives = 17/24 (70%)
Query: 88 VNLQKKKEKKKKKKKKKKKKKKKK 111
+ L K++++ KK+KKKK KK K
Sbjct: 351 LALSPKRKREGDKKQKKKKSKKLK 374
Score = 25.4 bits (56), Expect = 8.8
Identities = 11/19 (57%), Positives = 14/19 (73%)
Query: 94 KEKKKKKKKKKKKKKKKKK 112
K K++ KK+KKKK KK K
Sbjct: 356 KRKREGDKKQKKKKSKKLK 374
>gnl|CDD|220636 pfam10222, DUF2152, Uncharacterized conserved protein (DUF2152).
This is a family of proteins conserved from worms to
humans. Its function is unknown.
Length = 603
Score = 29.5 bits (66), Expect = 0.43
Identities = 12/56 (21%), Positives = 18/56 (32%), Gaps = 9/56 (16%)
Query: 6 DIARDLDLKSL--SKKTVDNVSVLLDADGASLFFVERRRGGKK-------CLVSKV 52
+ R + + NV++LLD D V +K CL V
Sbjct: 444 VLHRSYAFRGINYGNMDHINVAILLDTDNKPYLHVSVDSSDQKLYACDAGCLDEPV 499
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association with
pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 28.0 bits (63), Expect = 0.47
Identities = 14/30 (46%), Positives = 21/30 (70%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+ ++ ++ ++KKKKKKK KKK K I RK
Sbjct: 31 KVRRISQRNRRKKKKKKKLKKKSKEWILRK 60
Score = 24.6 bits (54), Expect = 9.2
Identities = 17/50 (34%), Positives = 24/50 (48%), Gaps = 2/50 (4%)
Query: 74 GVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKK--KKKKAEIERKR 121
G + V ++ ++KKKKKKK KKK K +K E R+R
Sbjct: 18 EEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRR 67
>gnl|CDD|222174 pfam13492, GAF_3, GAF domain.
Length = 129
Score = 28.5 bits (64), Expect = 0.49
Identities = 14/78 (17%), Positives = 32/78 (41%), Gaps = 10/78 (12%)
Query: 12 DLKSLSKKTVDNVSVLLDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEV 71
D L ++ ++ ++ +L AD A+L+ ++ L +G+G ++ +
Sbjct: 1 DPDELLERLLELLAEILGADRAALYLLDEDGL--------ELRLVAGSGGEPRLSE--SL 50
Query: 72 PWGVGILGHVAVTGETVN 89
P + GE V+
Sbjct: 51 PEDSPLAQWALEKGEPVS 68
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 28.9 bits (65), Expect = 0.49
Identities = 11/31 (35%), Positives = 19/31 (61%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K K KK K K ++K+K K++K+ + R+
Sbjct: 54 AKAKAKKALKAKIEEKEKAKREKEEKGLREL 84
Score = 28.9 bits (65), Expect = 0.53
Identities = 12/27 (44%), Positives = 14/27 (51%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKA 115
L +K K +K K KKKK K K K
Sbjct: 189 VLINEKLKAEKAAKGGKKKKGKAKAKL 215
Score = 27.3 bits (61), Expect = 1.7
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+K K K K KK K K ++K+KA+ E++
Sbjct: 48 EKAKVAAKAKAKKALKAKIEEKEKAKREKEE 78
Score = 27.3 bits (61), Expect = 1.8
Identities = 10/31 (32%), Positives = 19/31 (61%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+++K K K K KK K K ++K + +R++
Sbjct: 46 EEEKAKVAAKAKAKKALKAKIEEKEKAKREK 76
Score = 27.3 bits (61), Expect = 2.1
Identities = 9/29 (31%), Positives = 15/29 (51%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K K K KK K K ++K+K + ++
Sbjct: 51 KVAAKAKAKKALKAKIEEKEKAKREKEEK 79
Score = 27.3 bits (61), Expect = 2.1
Identities = 13/32 (40%), Positives = 16/32 (50%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
K K K KK K K ++K+K K E E K
Sbjct: 50 AKVAAKAKAKKALKAKIEEKEKAKREKEEKGL 81
Score = 26.5 bits (59), Expect = 3.6
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+ +EK+++K K K K KK KA+IE K
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEK 69
Score = 25.8 bits (57), Expect = 5.7
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+K++EK K K K KK K K + + + KR
Sbjct: 44 EKEEEKAKVAAKAKAKKALKAKIEEKEKAKR 74
Score = 25.8 bits (57), Expect = 6.8
Identities = 13/32 (40%), Positives = 17/32 (53%)
Query: 81 VAVTGETVNLQKKKEKKKKKKKKKKKKKKKKK 112
V+ T + +K K +K K KKKK K K K
Sbjct: 183 VSSTLNVLINEKLKAEKAAKGGKKKKGKAKAK 214
Score = 25.4 bits (56), Expect = 10.0
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
++KE++K K K K KK K K E + K
Sbjct: 43 EEKEEEKAKVAAKAKAKKALKAKIEEKEK 71
>gnl|CDD|197876 smart00792, Agouti, Agouti protein. The agouti protein regulates
pigmentation in the mouse hair follicle producing a
black hair with a subapical yellow band. A highly
homologous protein agouti signal protein (ASIP) is
present in humans and is expressed at highest levels in
adipose tissue where it may play a role in energy
homeostasis and possibly human pigmentation.
Length = 124
Score = 28.7 bits (64), Expect = 0.49
Identities = 12/37 (32%), Positives = 19/37 (51%), Gaps = 1/37 (2%)
Query: 86 ETVNLQKKKEK-KKKKKKKKKKKKKKKKKKAEIERKR 121
V L KK +K ++ +KK +KK+KK + R
Sbjct: 46 SIVGLNKKPKKISAEEAEKKLLQKKEKKALTNVLRPE 82
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 28.9 bits (65), Expect = 0.50
Identities = 10/30 (33%), Positives = 16/30 (53%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIER 119
+ K K K K K K KK +++ K E++
Sbjct: 94 KPEPKPKPKPKPKPKPVKKVEEQPKREVKP 123
Score = 27.3 bits (61), Expect = 2.2
Identities = 8/26 (30%), Positives = 15/26 (57%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAE 116
+ K + K K KK +++ K++ K E
Sbjct: 100 KPKPKPKPKPVKKVEEQPKREVKPVE 125
Score = 27.0 bits (60), Expect = 2.7
Identities = 10/28 (35%), Positives = 15/28 (53%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIER 119
K K K K K KK +++ K++ K R
Sbjct: 100 KPKPKPKPKPVKKVEEQPKREVKPVEPR 127
Score = 26.6 bits (59), Expect = 3.4
Identities = 12/36 (33%), Positives = 18/36 (50%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
V + K + K K K K K K KK +++ + E K
Sbjct: 88 APVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKP 123
>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
SecD/SecF; Reviewed.
Length = 755
Score = 29.0 bits (66), Expect = 0.51
Identities = 11/23 (47%), Positives = 18/23 (78%)
Query: 94 KEKKKKKKKKKKKKKKKKKKKAE 116
++++ KKKKK+KK KK + +K E
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755
Score = 29.0 bits (66), Expect = 0.54
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 88 VNLQKKKEKKKKKKKKKKKKKKKK 111
++L+K++ KKKKK+KK KK + +K
Sbjct: 730 LDLEKRRLKKKKKRKKVKKWEVEK 753
Score = 28.7 bits (65), Expect = 0.90
Identities = 12/23 (52%), Positives = 18/23 (78%)
Query: 96 KKKKKKKKKKKKKKKKKKKAEIE 118
+K++ KKKKK+KK KK + +IE
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755
Score = 27.9 bits (63), Expect = 1.6
Identities = 11/21 (52%), Positives = 16/21 (76%)
Query: 92 KKKEKKKKKKKKKKKKKKKKK 112
+K+ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 26.3 bits (59), Expect = 5.4
Identities = 11/21 (52%), Positives = 16/21 (76%)
Query: 93 KKEKKKKKKKKKKKKKKKKKK 113
+K + KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 26.0 bits (58), Expect = 6.2
Identities = 12/21 (57%), Positives = 18/21 (85%)
Query: 99 KKKKKKKKKKKKKKKKAEIER 119
+K++ KKKKK+KK KK E+E+
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 29.0 bits (65), Expect = 0.54
Identities = 14/31 (45%), Positives = 17/31 (54%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ K EKK KK K K K K K K K + + K
Sbjct: 87 KPKPEKKPKKPKPKPKPKPKPKPKVKPQPKP 117
Score = 27.8 bits (62), Expect = 1.5
Identities = 13/35 (37%), Positives = 15/35 (42%)
Query: 81 VAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
T K+K K +KK KK K K K K K
Sbjct: 74 ETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPK 108
Score = 26.7 bits (59), Expect = 2.9
Identities = 13/27 (48%), Positives = 14/27 (51%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEI 117
K KEK K +KK KK K K K K
Sbjct: 82 PKPKEKPKPEKKPKKPKPKPKPKPKPK 108
Score = 25.9 bits (57), Expect = 5.3
Identities = 11/31 (35%), Positives = 13/31 (41%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
+ K K K K K K K K + K KK
Sbjct: 92 KKPKKPKPKPKPKPKPKPKVKPQPKPKKPPS 122
Score = 25.9 bits (57), Expect = 5.3
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
++K + +KK KK K K K K K K +++
Sbjct: 85 KEKPKPEKKPKKPKPKPKPKPKPKPKVK 112
Score = 25.9 bits (57), Expect = 6.3
Identities = 12/33 (36%), Positives = 17/33 (51%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
E +KK +K K K K K K K K K + + +
Sbjct: 86 EKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPK 118
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 29.2 bits (66), Expect = 0.56
Identities = 8/25 (32%), Positives = 17/25 (68%)
Query: 97 KKKKKKKKKKKKKKKKKKAEIERKR 121
++KKK KK +K+ + + +++KR
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKR 108
Score = 27.6 bits (62), Expect = 2.0
Identities = 12/45 (26%), Positives = 24/45 (53%), Gaps = 5/45 (11%)
Query: 82 AVTGETVNLQKKKEKKKKKKKKKKK-----KKKKKKKKAEIERKR 121
+ T ++KK+ KK +K+ + KK+K+ +A E++R
Sbjct: 75 IIFSTTTYFREKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQR 119
>gnl|CDD|146077 pfam03263, Cucumo_2B, Cucumovirus protein 2B. This protein may be
a viral movement protein.
Length = 103
Score = 28.0 bits (62), Expect = 0.60
Identities = 12/41 (29%), Positives = 25/41 (60%)
Query: 81 VAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
AVT + L + EKK+++++ +K ++++ K+ ER R
Sbjct: 2 GAVTNVELQLARLVEKKRQRRRSHRKNRRERGHKSPSERAR 42
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 28.7 bits (65), Expect = 0.64
Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 7/37 (18%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKK-------KAEIERKR 121
KKK KK KK+K+K++K+ +K K E +
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAK 50
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 28.8 bits (65), Expect = 0.66
Identities = 13/46 (28%), Positives = 25/46 (54%), Gaps = 6/46 (13%)
Query: 74 GVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIER 119
G G G V V G + +KKK++KK+++K+ K+ + + +
Sbjct: 165 GGGTWGLVGVPG------HSSDSEKKKQRKKQRRKRSKELREDDDD 204
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 28.9 bits (65), Expect = 0.72
Identities = 15/35 (42%), Positives = 20/35 (57%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
E + K+ KKKKKKKKKKKKK + ++
Sbjct: 46 ENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYD 80
Score = 28.5 bits (64), Expect = 0.84
Identities = 16/35 (45%), Positives = 21/35 (60%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
E ++ ++ E K+ KKKKKKKKKKKKK
Sbjct: 42 ELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYD 76
Score = 28.1 bits (63), Expect = 1.0
Identities = 15/32 (46%), Positives = 20/32 (62%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ ++ + K+ KKKKKKKKKKKKK E
Sbjct: 45 SENQEAENKQNNKKKKKKKKKKKKKNLGEAYD 76
Score = 28.1 bits (63), Expect = 1.2
Identities = 17/32 (53%), Positives = 20/32 (62%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
Q+ + K+ KKKKKKKKKKKKK E Y
Sbjct: 48 QEAENKQNNKKKKKKKKKKKKKNLGEAYDLAY 79
Score = 27.0 bits (60), Expect = 2.8
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 84 TGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
E + K++K++ K+K KKK KK KK +
Sbjct: 2 MNENGENEVKQQKQQNKQKGTKKKNKKSKKDVD 34
Score = 26.2 bits (58), Expect = 5.9
Identities = 16/32 (50%), Positives = 18/32 (56%)
Query: 84 TGETVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
E N Q K+KKKKKKKKKKK + A
Sbjct: 47 NQEAENKQNNKKKKKKKKKKKKKNLGEAYDLA 78
>gnl|CDD|221550 pfam12366, Casc1, Cancer susceptibility candidate 1. This domain
family is found in eukaryotes, and is typically between
216 and 263 amino acids in length. Casc1 has many SNPs
associated with cancer susceptibility.
Length = 227
Score = 28.5 bits (64), Expect = 0.73
Identities = 6/30 (20%), Positives = 12/30 (40%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIER 119
++ + + K+ K K K EI +
Sbjct: 72 AEEAEGARGYVTDIPKEYKAKPVKYLEIPK 101
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 28.6 bits (64), Expect = 0.74
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+KK + KKK K K K KKK++ + K
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSK 777
Score = 28.2 bits (63), Expect = 1.0
Identities = 13/30 (43%), Positives = 15/30 (50%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KKK K K K KKK K K + +KR
Sbjct: 754 KKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783
Score = 27.9 bits (62), Expect = 1.4
Identities = 13/24 (54%), Positives = 14/24 (58%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKA 115
K K K KKK K K KK KK+A
Sbjct: 761 KGKAAKTVKKKSKAKSKKTTKKRA 784
Score = 27.9 bits (62), Expect = 1.4
Identities = 10/30 (33%), Positives = 13/30 (43%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K K+K K K K KKK K + +
Sbjct: 752 KAKKKSASTKGKAAKTVKKKSKAKSKKTTK 781
Score = 27.5 bits (61), Expect = 2.0
Identities = 12/30 (40%), Positives = 14/30 (46%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+K K K K KKK K K KK +R
Sbjct: 755 KKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784
Score = 27.1 bits (60), Expect = 2.4
Identities = 11/30 (36%), Positives = 15/30 (50%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+KK K K K KKK K K K +++
Sbjct: 754 KKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783
Score = 26.3 bits (58), Expect = 5.4
Identities = 12/33 (36%), Positives = 14/33 (42%)
Query: 82 AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
AV + K +KK K KKK K K K
Sbjct: 734 AVITKYAGTPKTPYEKKTKAKKKSASTKGKAAK 766
Score = 25.6 bits (56), Expect = 9.0
Identities = 10/24 (41%), Positives = 12/24 (50%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKA 115
K + KKK K K KK KK+
Sbjct: 763 KAAKTVKKKSKAKSKKTTKKRAGP 786
>gnl|CDD|218675 pfam05648, PEX11, Peroxisomal biogenesis factor 11 (PEX11). This
family consists of several peroxisomal biogenesis factor
11 (PEX11) proteins from several eukaryotic species. The
PEX11 peroxisomal membrane proteins promote peroxisome
division in multiple eukaryotes.
Length = 225
Score = 28.4 bits (64), Expect = 0.78
Identities = 15/51 (29%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 73 WGVGILGHVAVT-GETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
W G+L + E LQ+K++K KK+K + + ++K +I +KR
Sbjct: 129 WLFGLLLSLVRDLRELRQLQEKEKKLKKEKDSEGESTSERKLLKKILKKRP 179
>gnl|CDD|216599 pfam01603, B56, Protein phosphatase 2A regulatory B subunit (B56
family). Protein phosphatase 2A (PP2A) is a major
intracellular protein phosphatase that regulates
multiple aspects of cell growth and metabolism. The
ability of this widely distributed heterotrimeric enzyme
to act on a diverse array of substrates is largely
controlled by the nature of its regulatory B subunit.
There are multiple families of B subunits (See also
pfam01240), this family is called the B56 family.
Length = 405
Score = 28.4 bits (64), Expect = 0.85
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 94 KEKKKKKKKKKKKKKKKKKKKAEIERK 120
+ K+K+ K+KKK+KK+KK A I
Sbjct: 377 NKYKEKEAKEKKKEKKRKKTWARIAEA 403
Score = 28.0 bits (63), Expect = 1.3
Identities = 14/28 (50%), Positives = 19/28 (67%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEI 117
K KEK+ K+KKK+KK+KK + AE
Sbjct: 376 GNKYKEKEAKEKKKEKKRKKTWARIAEA 403
>gnl|CDD|235627 PRK05854, PRK05854, short chain dehydrogenase; Provisional.
Length = 313
Score = 28.5 bits (64), Expect = 0.85
Identities = 10/22 (45%), Positives = 14/22 (63%)
Query: 64 STADEVEVPWGVGILGHVAVTG 85
+TAD E+ +G LGH A+T
Sbjct: 111 TTADGFELQFGTNHLGHFALTA 132
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 28.1 bits (63), Expect = 0.88
Identities = 16/26 (61%), Positives = 17/26 (65%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAE 116
QK K KKKK KKKK K K KK K +
Sbjct: 82 QKWKWKKKKSKKKKDKDKDKKDDKKD 107
Score = 28.1 bits (63), Expect = 1.1
Identities = 11/32 (34%), Positives = 18/32 (56%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
K K+KK KK K +KK +K+ + ++E
Sbjct: 96 DKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTK 127
Score = 27.7 bits (62), Expect = 1.3
Identities = 15/30 (50%), Positives = 18/30 (60%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+ K +KKK KKKK K K KK KK + K
Sbjct: 83 KWKWKKKKSKKKKDKDKDKKDDKKDDKSEK 112
Score = 27.3 bits (61), Expect = 1.5
Identities = 16/31 (51%), Positives = 19/31 (61%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
K K+KK KKKK K K KK KK + E+K
Sbjct: 85 KWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115
Score = 27.3 bits (61), Expect = 1.6
Identities = 11/30 (36%), Positives = 16/30 (53%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KK + K KK KK K +KK +K ++
Sbjct: 94 KKDKDKDKKDDKKDDKSEKKDEKEAEDKLE 123
Score = 27.3 bits (61), Expect = 2.0
Identities = 15/31 (48%), Positives = 19/31 (61%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
KKK+ KKKK K K KK KK K+E + +
Sbjct: 85 KWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115
Score = 27.0 bits (60), Expect = 2.2
Identities = 16/30 (53%), Positives = 20/30 (66%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+K+K K KKKK KKKK K K KK + + K
Sbjct: 80 EKQKWKWKKKKSKKKKDKDKDKKDDKKDDK 109
Score = 27.0 bits (60), Expect = 2.3
Identities = 14/28 (50%), Positives = 19/28 (67%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
++K++ K KKKK KKKK K K KK +
Sbjct: 77 EYEEKQKWKWKKKKSKKKKDKDKDKKDD 104
Score = 27.0 bits (60), Expect = 2.3
Identities = 15/29 (51%), Positives = 17/29 (58%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
KKK KKKK K K KK KK K + + K
Sbjct: 88 KKKSKKKKDKDKDKKDDKKDDKSEKKDEK 116
Score = 27.0 bits (60), Expect = 2.3
Identities = 14/29 (48%), Positives = 17/29 (58%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
KK+K K K KK KK K +KK E E +
Sbjct: 91 SKKKKDKDKDKKDDKKDDKSEKKDEKEAE 119
Score = 27.0 bits (60), Expect = 2.6
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAE 116
+K K+K KK KK K +KK +K+AE
Sbjct: 94 KKDKDKDKKDDKKDDKSEKKDEKEAE 119
Score = 26.2 bits (58), Expect = 4.9
Identities = 14/27 (51%), Positives = 20/27 (74%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
++K+ E+K+K K KKKK KKKK K +
Sbjct: 74 VKKEYEEKQKWKWKKKKSKKKKDKDKD 100
Score = 25.8 bits (57), Expect = 5.3
Identities = 13/30 (43%), Positives = 18/30 (60%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KK +KKK K K KK KK K + + E++
Sbjct: 89 KKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118
Score = 25.0 bits (55), Expect = 9.9
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
+K K+KK K K KK KK K +K + +
Sbjct: 87 KKKKSKKKKDKDKDKKDDKKDDKSEKKDEK 116
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 28.5 bits (64), Expect = 0.92
Identities = 15/30 (50%), Positives = 19/30 (63%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
K K K +K K KKK+KKKKK A+ +K
Sbjct: 1162 TKGKASKLRKPKLKKKEKKKKKSSADKSKK 1191
Score = 28.1 bits (63), Expect = 1.1
Identities = 16/33 (48%), Positives = 18/33 (54%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
L+K K KKK+KKKKK K KK KR
Sbjct: 1168 KLRKPKLKKKEKKKKKSSADKSKKASVVGNSKR 1200
Score = 26.9 bits (60), Expect = 2.9
Identities = 12/31 (38%), Positives = 14/31 (45%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K K K KKK+KKKKK K +
Sbjct: 1166 ASKLRKPKLKKKEKKKKKSSADKSKKASVVG 1196
Score = 26.9 bits (60), Expect = 2.9
Identities = 14/32 (43%), Positives = 20/32 (62%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
K K K K +K K KKK+KKKKK+ ++ +
Sbjct: 1160 SKTKGKASKLRKPKLKKKEKKKKKSSADKSKK 1191
Score = 26.5 bits (59), Expect = 4.6
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KKK KK+ + KKKKK +K +K+
Sbjct: 1307 KKKVKKRLEGSLAALKKKKKSEKKTARKKK 1336
Score = 25.8 bits (57), Expect = 7.5
Identities = 11/31 (35%), Positives = 19/31 (61%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
++++ K K K K K +K K KKK + ++K
Sbjct: 1153 AKEQRLKSKTKGKASKLRKPKLKKKEKKKKK 1183
Score = 25.4 bits (56), Expect = 9.4
Identities = 10/34 (29%), Positives = 18/34 (52%)
Query: 88 VNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K+++ K K K K K +K K K + ++K+
Sbjct: 1149 EKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKK 1182
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 28.3 bits (63), Expect = 0.92
Identities = 12/37 (32%), Positives = 17/37 (45%)
Query: 84 TGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
E ++K K+ + K K K KK KKK + R
Sbjct: 141 PPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRS 177
Score = 28.3 bits (63), Expect = 0.93
Identities = 11/30 (36%), Positives = 15/30 (50%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KK +K + K K K KK KKK + +
Sbjct: 148 KKPKKGQSPKVPKAPKPKKPKKKGSVSNRS 177
Score = 27.5 bits (61), Expect = 1.6
Identities = 12/33 (36%), Positives = 14/33 (42%)
Query: 82 AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
T + KK KK + K K K KK KK
Sbjct: 137 ISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKK 169
Score = 26.8 bits (59), Expect = 2.9
Identities = 10/30 (33%), Positives = 12/30 (40%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+ KK + K K K KK KKK
Sbjct: 149 KPKKGQSPKVPKAPKPKKPKKKGSVSNRSV 178
Score = 26.8 bits (59), Expect = 3.7
Identities = 9/31 (29%), Positives = 11/31 (35%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K + K KK KKK + K R
Sbjct: 156 PKVPKAPKPKKPKKKGSVSNRSVKMPGIDPR 186
Score = 26.4 bits (58), Expect = 4.1
Identities = 9/31 (29%), Positives = 11/31 (35%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K K KK KKK + K R +
Sbjct: 158 VPKAPKPKKPKKKGSVSNRSVKMPGIDPRSK 188
Score = 26.0 bits (57), Expect = 5.1
Identities = 9/31 (29%), Positives = 14/31 (45%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
++ K+ KK + K K K KK K +
Sbjct: 140 APPEAKEVKKPKKGQSPKVPKAPKPKKPKKK 170
Score = 25.6 bits (56), Expect = 7.6
Identities = 8/28 (28%), Positives = 9/28 (32%)
Query: 88 VNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
K K K KK KKK +
Sbjct: 152 KGQSPKVPKAPKPKKPKKKGSVSNRSVK 179
Score = 25.6 bits (56), Expect = 8.6
Identities = 11/33 (33%), Positives = 13/33 (39%)
Query: 82 AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
+ V KK + K K K KK KKK
Sbjct: 141 PPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSV 173
>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region. This
domain is the C-terminal end of ribosomal L18/L5
proteins.
Length = 93
Score = 27.5 bits (62), Expect = 0.93
Identities = 13/32 (40%), Positives = 21/32 (65%), Gaps = 3/32 (9%)
Query: 92 KKKEKKKKKKKKKKKKKKK---KKKKAEIERK 120
KKKEKK+ K + K+ KK +++KA + +K
Sbjct: 62 KKKEKKEVKAESKRYNAKKLTLEQRKARVAQK 93
Score = 25.6 bits (57), Expect = 3.9
Identities = 10/26 (38%), Positives = 18/26 (69%)
Query: 96 KKKKKKKKKKKKKKKKKKKAEIERKR 121
KKK+KK+ K + K+ KK +E+++
Sbjct: 62 KKKEKKEVKAESKRYNAKKLTLEQRK 87
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 28.1 bits (63), Expect = 1.0
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+ +KE+KK K++KK K +K+K E E
Sbjct: 94 FKAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124
>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
structure and dynamics].
Length = 395
Score = 28.2 bits (63), Expect = 1.0
Identities = 12/34 (35%), Positives = 21/34 (61%)
Query: 83 VTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
+T + +NL K+KK+ +K KK+KK K ++
Sbjct: 52 ITADLINLGAAISIPKRKKQTEKGKKEKKPKVSD 85
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 28.3 bits (63), Expect = 1.1
Identities = 13/31 (41%), Positives = 23/31 (74%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ KE++K+K++ K++KKKKK+K E + R
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKDR 131
Score = 27.5 bits (61), Expect = 1.9
Identities = 16/39 (41%), Positives = 24/39 (61%)
Query: 83 VTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
E ++ KE+KKKKK+K K++ K +K K E + KR
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKR 141
Score = 26.8 bits (59), Expect = 3.1
Identities = 15/36 (41%), Positives = 26/36 (72%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
E N K+E+K+K++ K++KKKKK+K K E + ++
Sbjct: 97 EPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132
Score = 26.4 bits (58), Expect = 5.2
Identities = 9/31 (29%), Positives = 22/31 (70%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
++ K + K+++K+K++ K++KKK + + K
Sbjct: 96 KEPKNESGKEEEKEKEQVKEEKKKKKEKPKE 126
Score = 25.6 bits (56), Expect = 8.1
Identities = 14/31 (45%), Positives = 22/31 (70%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
Q K+EKKKKK+K K++ K +K K+ E++
Sbjct: 112 QVKEEKKKKKEKPKEEPKDRKPKEEAKEKRP 142
Score = 25.6 bits (56), Expect = 8.7
Identities = 18/67 (26%), Positives = 33/67 (49%)
Query: 55 LHSGAGLVYSTADEVEVPWGVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
L A S+ + V+ G G A T + + K+++K+K++ K++KKKKK+
Sbjct: 63 LAKCAESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKE 122
Query: 115 AEIERKR 121
E +
Sbjct: 123 KPKEEPK 129
Score = 25.6 bits (56), Expect = 9.8
Identities = 11/31 (35%), Positives = 23/31 (74%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ K+++ K+K+K+K+KK ++ + E E+KR
Sbjct: 136 EAKEKRPPKEKEKEKEKKVEEPRDREEEKKR 166
>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
Members of the NSCC2 family have been sequenced from
various yeast, fungal and animals species including
Saccharomyces cerevisiae, Drosophila melanogaster and
Homo sapiens. These proteins are the Sec62 proteins,
believed to be associated with the Sec61 and Sec63
constituents of the general protein secretary systems of
yeast microsomes. They are also the non-selective cation
(NS) channels of the mammalian cytoplasmic membrane. The
yeast Sec62 protein has been shown to be essential for
cell growth. The mammalian NS channel proteins has been
implicated in platelet derived growth factor(PGDF)
dependent single channel current in fibroblasts. These
channels are essentially closed in serum deprived
tissue-culture cells and are specifically opened by
exposure to PDGF. These channels are reported to exhibit
equal selectivity for Na+, K+ and Cs+ with low
permeability to Ca2+, and no permeability to anions
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 232
Score = 28.0 bits (62), Expect = 1.1
Identities = 13/25 (52%), Positives = 17/25 (68%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKA 115
+K K KKK K KK KKK+ K++A
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKREA 231
Score = 26.8 bits (59), Expect = 2.6
Identities = 13/23 (56%), Positives = 15/23 (65%)
Query: 94 KEKKKKKKKKKKKKKKKKKKKAE 116
K+K KKK K KK KKK+ K E
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKRE 230
Score = 26.0 bits (57), Expect = 5.7
Identities = 14/24 (58%), Positives = 16/24 (66%)
Query: 95 EKKKKKKKKKKKKKKKKKKKAEIE 118
EK K KKK K KK KKK+A+ E
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKRE 230
Score = 26.0 bits (57), Expect = 5.9
Identities = 11/23 (47%), Positives = 13/23 (56%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKK 111
K KKK K KK KKK+ K+
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 27.9 bits (62), Expect = 1.1
Identities = 12/19 (63%), Positives = 14/19 (73%)
Query: 93 KKEKKKKKKKKKKKKKKKK 111
+E K+KKKKK KKKK K
Sbjct: 86 NQENKQKKKKKDKKKKSPK 104
Score = 27.1 bits (60), Expect = 2.0
Identities = 12/18 (66%), Positives = 14/18 (77%)
Query: 91 QKKKEKKKKKKKKKKKKK 108
Q+ K+KKKKK KKKK K
Sbjct: 87 QENKQKKKKKDKKKKSPK 104
Score = 26.3 bits (58), Expect = 3.7
Identities = 11/19 (57%), Positives = 14/19 (73%)
Query: 92 KKKEKKKKKKKKKKKKKKK 110
++ K+KKKKK KKKK K
Sbjct: 86 NQENKQKKKKKDKKKKSPK 104
Score = 26.3 bits (58), Expect = 3.7
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKK 112
ET + ++ K+KKKKK KKKK K
Sbjct: 78 ETEDWFSPNQENKQKKKKKDKKKKSPK 104
Score = 26.3 bits (58), Expect = 4.7
Identities = 11/19 (57%), Positives = 14/19 (73%)
Query: 96 KKKKKKKKKKKKKKKKKKK 114
++ K+KKKKK KKKK K
Sbjct: 86 NQENKQKKKKKDKKKKSPK 104
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 27.6 bits (62), Expect = 1.3
Identities = 7/33 (21%), Positives = 21/33 (63%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ + EK+ K K++K+++ ++K+K ++ +
Sbjct: 119 QIDELLEKELAKLKREKRRENERKQKEILKEQM 151
Score = 26.1 bits (58), Expect = 4.7
Identities = 8/28 (28%), Positives = 19/28 (67%)
Query: 94 KEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KE K K++K+++ ++K+K+ + + K
Sbjct: 126 KELAKLKREKRRENERKQKEILKEQMKM 153
>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
MW N terminal. This domain is found in eukaryotes. This
domain is about 90 amino acids in length. This domain is
found associated with pfam00076. This domain is part of
U1 snRNP, which is the pre-mRNA binding protein of the
penta-snRNP spliceosome complex. It extends over a
distance of 180 A from its RNA binding domain, wraps
around the core domain of U1 snRNP consisting of the
seven Sm proteins and finally contacts U1-C, which is
crucial for 5'-splice-site recognition.
Length = 94
Score = 27.2 bits (61), Expect = 1.3
Identities = 10/29 (34%), Positives = 21/29 (72%)
Query: 87 TVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
T +K+E++K++KK+K +KK +++ K
Sbjct: 59 TETWLEKREREKREKKEKLEKKLEEELKE 87
>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
beta and delta phosphatidylinositol 3-kinases (PI3Ks).
PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
regulate cell processes such as cell growth,
differentiation, proliferation, and motility. PI3Ks
work on phosphorylation of phosphatidylinositol,
phosphatidylinositide (4)P (PtdIns (4)P),2 or
PtdIns(4,5)P2. Specifically they phosphorylate the D3
hydroxyl group of phosphoinositol lipids on the inositol
ring. There are 3 classes of PI3Ks based on structure,
regulation, and specificity. All classes contain a C2
domain, a PIK domain, and a kinase catalytic domain.
The members here are class I, beta and delta isoforms of
PI3Ks and contain both a Ras-binding domain and a
p85-binding domain. Class II PI3Ks contain both of
these as well as a PX domain, and a C-terminal C2 domain
containing a nuclear localization signal. C2 domains
fold into an 8-standed beta-sandwich that can adopt 2
structural arrangements: Type I and Type II,
distinguished by a circular permutation involving their
N- and C-terminal beta strands. Many C2 domains are
Ca2+-dependent membrane-targeting modules that bind a
wide variety of substances including bind phospholipids,
inositol polyphosphates, and intracellular proteins.
Most C2 domain proteins are either signal transduction
enzymes that contain a single C2 domain, such as protein
kinase C, or membrane trafficking proteins which contain
at least two C2 domains, such as synaptotagmin 1.
However, there are a few exceptions to this including
RIM isoforms and some splice variants of piccolo/aczonin
and intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. Members have a type-I
topology.
Length = 173
Score = 27.7 bits (62), Expect = 1.3
Identities = 11/20 (55%), Positives = 14/20 (70%)
Query: 96 KKKKKKKKKKKKKKKKKKKA 115
KK K K+ +K + KKKKKK
Sbjct: 87 KKAKGKRSRKNQTKKKKKKD 106
Score = 26.5 bits (59), Expect = 2.8
Identities = 10/21 (47%), Positives = 14/21 (66%)
Query: 93 KKEKKKKKKKKKKKKKKKKKK 113
+K K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
Score = 26.5 bits (59), Expect = 3.0
Identities = 10/20 (50%), Positives = 13/20 (65%)
Query: 95 EKKKKKKKKKKKKKKKKKKK 114
E KK K K+ +K + KKKK
Sbjct: 84 EVSKKAKGKRSRKNQTKKKK 103
Score = 25.7 bits (57), Expect = 5.5
Identities = 10/19 (52%), Positives = 13/19 (68%)
Query: 96 KKKKKKKKKKKKKKKKKKK 114
KK K K+ +K + KKKKK
Sbjct: 86 SKKAKGKRSRKNQTKKKKK 104
Score = 25.7 bits (57), Expect = 5.6
Identities = 11/20 (55%), Positives = 14/20 (70%)
Query: 92 KKKEKKKKKKKKKKKKKKKK 111
KK K K+ +K + KKKKKK
Sbjct: 86 SKKAKGKRSRKNQTKKKKKK 105
Score = 25.4 bits (56), Expect = 9.2
Identities = 10/21 (47%), Positives = 14/21 (66%)
Query: 92 KKKEKKKKKKKKKKKKKKKKK 112
K+ K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
>gnl|CDD|165586 PHA03342, PHA03342, US22 family homolog; Provisional.
Length = 511
Score = 28.0 bits (62), Expect = 1.4
Identities = 18/68 (26%), Positives = 32/68 (47%), Gaps = 3/68 (4%)
Query: 55 LHSGAGLVYSTADEVEVPWGVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
L+SGAGL AD V + +G+L + +G KE++K+K + + + K
Sbjct: 293 LYSGAGLTRRVADNVAEFFQIGMLKRLFSSGFEF---HAKERRKEKLEGAPRCPHRIADK 349
Query: 115 AEIERKRY 122
E+ +
Sbjct: 350 FELAEEHC 357
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 27.9 bits (63), Expect = 1.5
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ ++K+KKK+KKK KKKK ++ R +
Sbjct: 390 AQPVGVAPEQKEKKKEKKKNKKKKYKVPRGK 420
Score = 26.8 bits (60), Expect = 3.3
Identities = 13/37 (35%), Positives = 22/37 (59%)
Query: 81 VAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEI 117
+ V ++K++KK+KKK KKKK K + K ++
Sbjct: 388 LKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKV 424
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 27.6 bits (62), Expect = 1.6
Identities = 8/31 (25%), Positives = 18/31 (58%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ EK+K ++++K K K +KK + ++
Sbjct: 123 EAAGEKEKAPRRERKPKPKAPRKKRKPRAQK 153
Score = 26.4 bits (59), Expect = 3.5
Identities = 9/30 (30%), Positives = 18/30 (60%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+KEK ++++K K K +KK+K ++
Sbjct: 125 AGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154
Score = 26.0 bits (58), Expect = 5.0
Identities = 8/30 (26%), Positives = 18/30 (60%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K + ++ +++ KK++ +K+K ERK
Sbjct: 109 KVQAQRAEQQAKKREAAGEKEKAPRRERKP 138
Score = 26.0 bits (58), Expect = 5.2
Identities = 9/36 (25%), Positives = 20/36 (55%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
E ++K ++++K K K +KK+K + + E +
Sbjct: 123 EAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQH 158
Score = 25.7 bits (57), Expect = 7.5
Identities = 7/31 (22%), Positives = 21/31 (67%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+++ +K++ +K+K ++++K K + RK+
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPKAPRKK 146
Score = 25.7 bits (57), Expect = 7.7
Identities = 5/25 (20%), Positives = 17/25 (68%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKA 115
Q+ +++ KK++ +K+K ++++
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERK 137
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 27.6 bits (61), Expect = 1.7
Identities = 11/40 (27%), Positives = 19/40 (47%)
Query: 82 AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
AV T+ + ++K K+K K+ K + E E +R
Sbjct: 785 AVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVETEGER 824
>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
Length = 801
Score = 27.8 bits (62), Expect = 1.7
Identities = 13/30 (43%), Positives = 15/30 (50%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIER 119
L+KK KKK+ KK K KA ER
Sbjct: 607 LKKKALSDGGKKKQGKKAGGGGKSKAAAER 636
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 27.1 bits (61), Expect = 1.7
Identities = 15/29 (51%), Positives = 21/29 (72%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+KK+KKKKKK+ + ++KKKK ERK
Sbjct: 83 RKKKKKKKKKEGEFFEEKKKKYVVSEERK 111
>gnl|CDD|185727 cd08986, GH43_7, Glycosyl hydrolase family 43. This glycosyl
hydrolase family 43 (GH43) includes enzymes with
beta-1,4-xylosidase (xylan 1,4-beta-xylosidase; EC
3.2.1.37), beta-1,3-xylosidase (EC 3.2.1.-),
alpha-L-arabinofuranosidase (EC 3.2.1.55), arabinanase
(EC 3.2.1.99), xylanase (EC 3.2.1.8),
endo-alpha-L-arabinanase and galactan
1,3-beta-galactosidase (EC 3.2.1.145) activities. These
are inverting enzymes (i.e. they invert the
stereochemistry of the anomeric carbon atom of the
substrate) that have an aspartate as the catalytic
general base, a glutamate as the catalytic general acid
and another aspartate that is responsible for pKa
modulation and orienting the catalytic acid. Many of the
enzymes in this family display both
alpha-L-arabinofuranosidase and beta-D-xylosidase
activity using aryl-glycosides as substrates. A common
structural feature of GH43 enzymes is a 5-bladed
beta-propeller domain that contains the catalytic acid
and catalytic base. A long V-shaped groove, partially
enclosed at one end, forms a single extended
substrate-binding surface across the face of the
propeller.
Length = 269
Score = 27.4 bits (61), Expect = 1.8
Identities = 17/60 (28%), Positives = 25/60 (41%), Gaps = 13/60 (21%)
Query: 31 DGASLFFVERRRGGKKCLVSKVFDLHSGA----GLVYSTADEVEVPWG-----VGILGHV 81
+GA +F + GGK L + G L Y+ AD++ P+G LGH
Sbjct: 184 EGAFVF----KYGGKYYLFGTAWSTDKGRKGSYDLYYAVADKITGPYGARYFAGRFLGHN 239
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 27.3 bits (61), Expect = 2.0
Identities = 11/37 (29%), Positives = 21/37 (56%)
Query: 81 VAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEI 117
V T ++ + +K++ KK KK KK+K+K++
Sbjct: 214 VVPTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRSPP 250
>gnl|CDD|221659 pfam12597, DUF3767, Protein of unknown function (DUF3767). This
family of proteins is found in eukaryotes. Proteins in
this family are typically between 112 and 199 amino
acids in length.
Length = 119
Score = 26.9 bits (60), Expect = 2.1
Identities = 13/45 (28%), Positives = 21/45 (46%), Gaps = 1/45 (2%)
Query: 73 WGVGILGHVAVTG-ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
W VG ++ G E +++K + KK + +KK KK E
Sbjct: 72 WAVGTFALGSIAGWEQCRYRRRKSFQGMKKAVEAMAEKKAKKLRE 116
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 27.1 bits (60), Expect = 2.2
Identities = 10/32 (31%), Positives = 19/32 (59%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
N + KK + +K + K +KK+ +K+ ER+
Sbjct: 329 NAEIKKTRTAEKNEAKARKKEIAQKRRAAERE 360
Score = 26.0 bits (57), Expect = 7.4
Identities = 9/30 (30%), Positives = 18/30 (60%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ EK + K +KK+ +K++ + EI R+
Sbjct: 336 RTAEKNEAKARKKEIAQKRRAAEREINREA 365
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 27.2 bits (61), Expect = 2.3
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEIER 119
+ EKK +KK KK KK K ++A E
Sbjct: 89 EAEKKGLEKKAKKAIKKGKDEEALAEE 115
>gnl|CDD|188606 TIGR04091, LTA_dltB, D-alanyl-lipoteichoic acid biosynthesis
protein DltB. Members of this protein family are DltB,
part of a four-gene operon for D-alanyl-lipoteichoic
acid biosynthesis that is present in the vast majority
of low-GC Gram-positive organisms. This protein may be
involved in transport of D-alanine across the plasma
membrane [Cell envelope, Biosynthesis and degradation of
murein sacculus and peptidoglycan].
Length = 380
Score = 27.1 bits (61), Expect = 2.3
Identities = 8/14 (57%), Positives = 10/14 (71%)
Query: 95 EKKKKKKKKKKKKK 108
++K KK KK KK K
Sbjct: 342 QRKSKKYKKNKKNK 355
Score = 26.3 bits (59), Expect = 4.2
Identities = 8/14 (57%), Positives = 9/14 (64%)
Query: 102 KKKKKKKKKKKKKA 115
+K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNKW 356
Score = 26.3 bits (59), Expect = 4.8
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 97 KKKKKKKKKKKKK 109
+K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355
Score = 26.3 bits (59), Expect = 4.8
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 98 KKKKKKKKKKKKK 110
+K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355
Score = 26.3 bits (59), Expect = 4.8
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 99 KKKKKKKKKKKKK 111
+K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355
Score = 26.3 bits (59), Expect = 4.8
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 100 KKKKKKKKKKKKK 112
+K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355
Score = 26.3 bits (59), Expect = 4.8
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 101 KKKKKKKKKKKKK 113
+K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355
>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV.
Length = 336
Score = 27.1 bits (60), Expect = 2.4
Identities = 10/26 (38%), Positives = 20/26 (76%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEIE 118
K++ K +K K+ KK+KK+++K E++
Sbjct: 24 VKDEAKPRKIKRVKKRKKREEKDELD 49
Score = 25.2 bits (55), Expect = 10.0
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 97 KKKKKKKKKKKKKKKKKKAEIER 119
K + K +K K+ KK+KK E +
Sbjct: 24 VKDEAKPRKIKRVKKRKKREEKD 46
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 27.1 bits (61), Expect = 2.4
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIE 118
KE+ + K+ ++K+KK +K EI
Sbjct: 518 YIKEQMEGSAPKEPEEKEKKPEKPEIT 544
Score = 26.7 bits (60), Expect = 4.2
Identities = 8/33 (24%), Positives = 19/33 (57%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
E + K++ + K+ ++K+KK +K + I+
Sbjct: 514 EEIAYIKEQMEGSAPKEPEEKEKKPEKPEITID 546
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 26.8 bits (60), Expect = 2.4
Identities = 11/34 (32%), Positives = 19/34 (55%), Gaps = 1/34 (2%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKA-EIERKR 121
LQ K + KK+K + + + K+ + KA E E +
Sbjct: 71 ELQAKIARYKKEKARYRSEAKELEAKAKEAEAES 104
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 27.0 bits (60), Expect = 2.4
Identities = 11/30 (36%), Positives = 18/30 (60%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+++ +KKK+ K K K+ KKKKK +
Sbjct: 81 EERLKKKKRVKTKAYKEPTKKKKKKDPTAA 110
Score = 25.8 bits (57), Expect = 6.0
Identities = 13/30 (43%), Positives = 17/30 (56%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ K+KK+ K K K+ KKKKKK K
Sbjct: 83 RLKKKKRVKTKAYKEPTKKKKKKDPTAAKS 112
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 27.1 bits (60), Expect = 2.6
Identities = 9/24 (37%), Positives = 19/24 (79%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAE 116
K+E+ +KK+K++ + K+K++K E
Sbjct: 408 KQEENEKKQKEQADEDKEKRQKDE 431
Score = 26.7 bits (59), Expect = 3.3
Identities = 7/24 (29%), Positives = 19/24 (79%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKK 114
+ +K++K++ + K+K++K ++KK
Sbjct: 411 ENEKKQKEQADEDKEKRQKDERKK 434
Score = 26.7 bits (59), Expect = 3.3
Identities = 7/29 (24%), Positives = 21/29 (72%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
E + ++ ++K+K++ + K+K++K ++K
Sbjct: 405 EKLKQEENEKKQKEQADEDKEKRQKDERK 433
Score = 25.5 bits (56), Expect = 9.2
Identities = 9/23 (39%), Positives = 18/23 (78%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKK 111
N +K+KE+ + K+K++K ++KK
Sbjct: 412 NEKKQKEQADEDKEKRQKDERKK 434
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 26.9 bits (60), Expect = 2.7
Identities = 9/30 (30%), Positives = 14/30 (46%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
K KKK K + KK +KK +++
Sbjct: 87 AAKAPAKKKLKDELDSSKKAEKKNALDKDD 116
Score = 26.9 bits (60), Expect = 3.4
Identities = 10/29 (34%), Positives = 12/29 (41%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
K K KKK K + KKAE +
Sbjct: 83 AKAAAAKAPAKKKLKDELDSSKKAEKKNA 111
Score = 26.5 bits (59), Expect = 4.6
Identities = 13/52 (25%), Positives = 14/52 (26%)
Query: 64 STADEVEVPWGVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
S E V I V + KKK K K K KK
Sbjct: 44 SKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKK 95
Score = 26.1 bits (58), Expect = 6.3
Identities = 11/45 (24%), Positives = 15/45 (33%)
Query: 76 GILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+ G V T + K+K K K K KKK +
Sbjct: 58 FLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDS 102
Score = 26.1 bits (58), Expect = 6.4
Identities = 11/52 (21%), Positives = 17/52 (32%)
Query: 64 STADEVEVPWGVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
+ +V + + ++ KKK K K K KKK K
Sbjct: 48 TPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDE 99
Score = 25.3 bits (56), Expect = 9.1
Identities = 9/29 (31%), Positives = 13/29 (44%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
KK + K K KKK K + + +K
Sbjct: 77 KKTKTAAKAAAAKAPAKKKLKDELDSSKK 105
Score = 25.3 bits (56), Expect = 9.3
Identities = 9/29 (31%), Positives = 13/29 (44%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
K KKK K + KK +KK + +
Sbjct: 89 KAPAKKKLKDELDSSKKAEKKNALDKDDD 117
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 27.3 bits (61), Expect = 2.7
Identities = 11/33 (33%), Positives = 22/33 (66%)
Query: 85 GETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEI 117
+ + E+K KK++KK+K K+++K++ EI
Sbjct: 650 KKEFKILDYTEEKTKKEEKKEKNKREEKERIEI 682
Score = 26.1 bits (58), Expect = 5.2
Identities = 15/31 (48%), Positives = 20/31 (64%), Gaps = 3/31 (9%)
Query: 92 KKKEKKKKK---KKKKKKKKKKKKKKAEIER 119
KKKE K +K KK++KK+K K+ E ER
Sbjct: 649 KKKEFKILDYTEEKTKKEEKKEKNKREEKER 679
>gnl|CDD|240094 cd04743, NPD_PKS, 2-Nitropropane dioxygenase (NPD)-like domain,
associated with polyketide synthases (PKS). NPD is part
of the nitroalkaneoxidizing enzyme family, that
catalyzes oxidative denitrification of nitroalkanes to
their corresponding carbonyl compounds and nitrites.
NDPs are members of the NAD(P)H-dependent flavin
oxidoreductase family that reduce a range of
alternative electron acceptors. Most use FAD/FMN as a
cofactor and NAD(P)H as electron donor. Some contain
4Fe-4S cluster to transfer electron from FAD to FMN.
Length = 320
Score = 27.1 bits (60), Expect = 2.8
Identities = 12/24 (50%), Positives = 16/24 (66%), Gaps = 1/24 (4%)
Query: 59 AGLVYSTADEV-EVPWGVGILGHV 81
L+ TA+ + + PWGVGILG V
Sbjct: 43 KALLEETAELLGDKPWGVGILGFV 66
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 26.4 bits (59), Expect = 2.8
Identities = 11/31 (35%), Positives = 25/31 (80%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
+ K+EKKK+ K+KK+++K++K++ + ++K
Sbjct: 97 RNKQEKKKRSKEKKEEEKERKRQLKQQKKKA 127
Score = 24.9 bits (55), Expect = 9.2
Identities = 10/23 (43%), Positives = 20/23 (86%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKK 114
+KKE++K++K++ K++KKK K +
Sbjct: 108 EKKEEEKERKRQLKQQKKKAKHR 130
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 26.9 bits (60), Expect = 2.8
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKA 115
N KK+ K K+ +++K K++ K A
Sbjct: 11 NALKKRLKAKQAEEEKAAKEEAKAAAA 37
Score = 25.7 bits (57), Expect = 7.1
Identities = 5/25 (20%), Positives = 11/25 (44%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAE 116
+K K++ K K + +K+
Sbjct: 25 EKAAKEEAKAAAAAAAAKGRSRKSA 49
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 26.5 bits (59), Expect = 2.8
Identities = 8/27 (29%), Positives = 20/27 (74%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
++ KE+ +K+K+ +K ++KK K++ +
Sbjct: 100 RERTKERAEKEKRTRKNREKKFKRRQK 126
Score = 24.9 bits (55), Expect = 9.8
Identities = 9/23 (39%), Positives = 19/23 (82%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKK 113
+ +KEK+ +K ++KK K+++K+K
Sbjct: 106 RAEKEKRTRKNREKKFKRRQKEK 128
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 27.1 bits (60), Expect = 2.9
Identities = 8/29 (27%), Positives = 22/29 (75%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
++E++++K++ K K ++++K K E ++R
Sbjct: 13 EREQRERKQRAKLKLERERKAKEEAAKQR 41
>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
Length = 381
Score = 27.0 bits (60), Expect = 2.9
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEI 117
L ++K K +K K ++ +KK KKA +
Sbjct: 341 LAQEKATAKGAQKVKNRRARKKAKKARL 368
Score = 26.6 bits (59), Expect = 3.5
Identities = 9/32 (28%), Positives = 15/32 (46%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
+K K +K K ++ +KK KK + K
Sbjct: 344 EKATAKGAQKVKNRRARKKAKKARLAKVAKAL 375
>gnl|CDD|240380 PTZ00362, PTZ00362, hypothetical protein; Provisional.
Length = 479
Score = 27.0 bits (60), Expect = 3.0
Identities = 12/29 (41%), Positives = 16/29 (55%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKAEI 117
N+ KKK+KKKKK K+ + KK
Sbjct: 6 NIFKKKKKKKKKASKESSNGSQNSKKTIF 34
Score = 26.6 bits (59), Expect = 4.3
Identities = 11/23 (47%), Positives = 13/23 (56%)
Query: 99 KKKKKKKKKKKKKKKKAEIERKR 121
KKKKKKKKK K+ K+
Sbjct: 9 KKKKKKKKKASKESSNGSQNSKK 31
>gnl|CDD|235083 PRK02889, tolB, translocation protein TolB; Provisional.
Length = 427
Score = 26.9 bits (60), Expect = 3.2
Identities = 16/48 (33%), Positives = 21/48 (43%), Gaps = 5/48 (10%)
Query: 23 NVSVLLDADGASLFFVERRRGGKKCLVSKVFDLHSGA--GLVYSTADE 68
N S + DG L ++ R G K V DL +G L +T DE
Sbjct: 330 NTSPRISPDGKLLAYISRVGGAFKL---YVQDLATGQVTALTDTTRDE 374
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 26.7 bits (59), Expect = 3.2
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+ E KKK + KKK + + K KAE + K
Sbjct: 159 KAAAEAKKKAAEAKKKAEAEAKAKAEAKAK 188
Score = 26.3 bits (58), Expect = 4.0
Identities = 14/30 (46%), Positives = 18/30 (60%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
Q ++E K K + KKK + KKKAE E K
Sbjct: 151 QAEEEAKAKAAAEAKKKAAEAKKKAEAEAK 180
Score = 26.0 bits (57), Expect = 6.9
Identities = 12/30 (40%), Positives = 19/30 (63%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
Q + K K + + +KK K++ KK+AE E K
Sbjct: 128 QAAEAKAKAEAEAEKKAKEEAKKQAEEEAK 157
Score = 25.6 bits (56), Expect = 7.6
Identities = 9/35 (25%), Positives = 19/35 (54%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
E Q+ E+ ++K+ +++ +K K+AE K
Sbjct: 78 EEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK 112
>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694). Family of
hypothetical bacterial proteins.
Length = 278
Score = 26.8 bits (60), Expect = 3.2
Identities = 16/33 (48%), Positives = 21/33 (63%)
Query: 85 GETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEI 117
GE + E+ K+KKKKKKKKKKK++ E
Sbjct: 73 GELYERADRAEELLKEKKKKKKKKKKKEELREW 105
>gnl|CDD|173227 PRK14766, PRK14766, lipoprotein signal peptidase; Provisional.
Length = 201
Score = 26.8 bits (59), Expect = 3.2
Identities = 17/57 (29%), Positives = 27/57 (47%), Gaps = 4/57 (7%)
Query: 53 FDLHSGAGLVYSTAD-EVEVPWGVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKK 108
F L + V++ AD V + G+ IL + E V K K K KK+K + ++
Sbjct: 147 FSLLNSRDYVFNLADLYVNIAIGLTILFTII---ELVLYIKSKIKTKKEKIENEQNN 200
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 26.9 bits (60), Expect = 3.3
Identities = 13/34 (38%), Positives = 16/34 (47%)
Query: 87 TVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
T KK K K KKKKKK K+E ++
Sbjct: 349 TKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQE 382
Score = 25.4 bits (56), Expect = 9.6
Identities = 12/29 (41%), Positives = 15/29 (51%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
KK KK K K KKKKKK + + +
Sbjct: 350 KKPIKKSNSKAKLKKKKKKAGASAVPKSE 378
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 26.7 bits (59), Expect = 3.3
Identities = 9/32 (28%), Positives = 12/32 (37%)
Query: 85 GETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
+ KK ++ KK KK KKA
Sbjct: 1184 RAAARGESGAAKKVSRQAPKKPAPKKTTKKAS 1215
Score = 25.6 bits (56), Expect = 9.5
Identities = 10/30 (33%), Positives = 15/30 (50%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
KK ++ KK KK KK ++E + Y
Sbjct: 1195 KKVSRQAPKKPAPKKTTKKASESETTEETY 1224
>gnl|CDD|235040 PRK02463, PRK02463, OxaA-like protein precursor; Provisional.
Length = 307
Score = 26.6 bits (59), Expect = 3.3
Identities = 9/28 (32%), Positives = 13/28 (46%)
Query: 83 VTGETVNLQKKKEKKKKKKKKKKKKKKK 110
VT KKKKK + K++K+
Sbjct: 280 VTPSQNQAIASLPKKKKKNRNAGKQRKR 307
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 26.6 bits (59), Expect = 3.4
Identities = 12/35 (34%), Positives = 17/35 (48%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+K+ +K KK KKK K+K E E+K
Sbjct: 48 PKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKK 82
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 26.6 bits (59), Expect = 3.5
Identities = 9/30 (30%), Positives = 15/30 (50%)
Query: 88 VNLQKKKEKKKKKKKKKKKKKKKKKKKAEI 117
+ +KKK + KKK K KK + ++
Sbjct: 146 CERTEILAEKKKPRSKKKSSKNSKKLRIDL 175
Score = 26.2 bits (58), Expect = 4.6
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 97 KKKKKKKKKKKKKKKKKKAEIER 119
+KKK + KKK K KK I+
Sbjct: 153 AEKKKPRSKKKSSKNSKKLRIDL 175
>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB. Topo IB proteins include the
monomeric yeast and human topo I and heterodimeric topo
I from Leishmania donvanni. Topo I enzymes are divided
into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topos I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I have putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 217
Score = 26.4 bits (58), Expect = 3.5
Identities = 13/27 (48%), Positives = 21/27 (77%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
L+++KEKKK K++K+ KKK++ K E
Sbjct: 93 LEEEKEKKKNLNKEEKEAKKKERAKRE 119
Score = 26.0 bits (57), Expect = 5.0
Identities = 12/25 (48%), Positives = 20/25 (80%)
Query: 94 KEKKKKKKKKKKKKKKKKKKKAEIE 118
+EK+KKK K++K+ KKK++A+ E
Sbjct: 95 EEKEKKKNLNKEEKEAKKKERAKRE 119
>gnl|CDD|220633 pfam10214, Rrn6, RNA polymerase I-specific transcription-initiation
factor. RNA polymerase I-specific
transcription-initiation factor Rrn6 and Rrn7 represent
components of a multisubunit transcription factor
essential for the initiation of rDNA transcription by
Pol I. These proteins are found in fungi.
Length = 753
Score = 26.6 bits (59), Expect = 3.6
Identities = 8/14 (57%), Positives = 10/14 (71%)
Query: 93 KKEKKKKKKKKKKK 106
+ KKKKKKK+K
Sbjct: 737 GRSASKKKKKKKRK 750
Score = 25.9 bits (57), Expect = 7.8
Identities = 8/15 (53%), Positives = 10/15 (66%)
Query: 101 KKKKKKKKKKKKKKA 115
+ KKKKKKK+K
Sbjct: 737 GRSASKKKKKKKRKG 751
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 26.6 bits (59), Expect = 3.6
Identities = 9/33 (27%), Positives = 19/33 (57%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
+ L + ++ KKK++ + KKKK+ E++
Sbjct: 76 DEKELSASSLEAEQAKKKEEAEAKKKKEMEELK 108
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 26.6 bits (59), Expect = 3.7
Identities = 9/36 (25%), Positives = 25/36 (69%)
Query: 85 GETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
E +N + ++K K ++ K+K++K+ +++A+ E++
Sbjct: 266 YEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQ 301
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 26.7 bits (59), Expect = 3.7
Identities = 10/24 (41%), Positives = 18/24 (75%)
Query: 95 EKKKKKKKKKKKKKKKKKKKAEIE 118
+++++K++KKKK KK K+ E E
Sbjct: 62 DEEEEKEEKKKKTKKVKETTTEWE 85
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 26.5 bits (58), Expect = 4.0
Identities = 14/38 (36%), Positives = 25/38 (65%)
Query: 81 VAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
VA V Q+K +++++ K K KK+++K+K+AE E
Sbjct: 46 VATGAGVVYYQRKGQQQRESKPKISKKERRKRKQAEKE 83
Score = 26.1 bits (57), Expect = 6.0
Identities = 16/60 (26%), Positives = 31/60 (51%), Gaps = 3/60 (5%)
Query: 60 GLVYSTADEVEVPWGVGILGHV---AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
VY+ A V G G++ + E+ KKE++K+K+ +K+ + K ++KK+
Sbjct: 35 TAVYTIAGVTVVATGAGVVYYQRKGQQQRESKPKISKKERRKRKQAEKETEGKTEEKKST 94
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 26.3 bits (59), Expect = 4.4
Identities = 15/34 (44%), Positives = 19/34 (55%), Gaps = 2/34 (5%)
Query: 87 TVNLQ--KKKEKKKKKKKKKKKKKKKKKKKAEIE 118
V L +K +K KKKKKKK K K K + +E
Sbjct: 673 KVPLSDLEKIQKPKKKKKKKPKTVKPKPRTVSLE 706
Score = 26.3 bits (59), Expect = 4.5
Identities = 12/28 (42%), Positives = 21/28 (75%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
+ +K K +KK+KKKKK+K+K+ E++
Sbjct: 611 EARKRLNKANEKKEKKKKKQKEKQEELK 638
Score = 26.3 bits (59), Expect = 5.4
Identities = 12/33 (36%), Positives = 20/33 (60%)
Query: 88 VNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
V + E +K+ K +KK+KKKKK+ E + +
Sbjct: 604 VKAHELIEARKRLNKANEKKEKKKKKQKEKQEE 636
>gnl|CDD|205286 pfam13105, DUF3959, Protein of unknown function (DUF3959). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 260 amino acids in length.
Length = 242
Score = 26.4 bits (58), Expect = 4.5
Identities = 14/38 (36%), Positives = 21/38 (55%)
Query: 77 ILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
++G + G NL +KEK K+ K+KKK+KK
Sbjct: 136 LIGGLIFGGLLANLIHRKEKDDNKENINLFKQKKKRKK 173
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 26.3 bits (58), Expect = 4.6
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKA 115
+ +E K++ +KK+K KKK +K+K
Sbjct: 345 EDDEESKEEVEKKQKVKKKPRKRKV 369
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 26.5 bits (59), Expect = 4.7
Identities = 8/36 (22%), Positives = 22/36 (61%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
E + ++ + E+ KK+ KK ++K ++ + + E + +
Sbjct: 89 ENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEE 124
Score = 25.7 bits (57), Expect = 8.1
Identities = 7/32 (21%), Positives = 19/32 (59%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
L+K +EK ++ + + +KK+++ +K +
Sbjct: 104 LKKLEEKIEQLEAEIEKKEEELEKAKNKFLDK 135
Score = 25.4 bits (56), Expect = 9.9
Identities = 10/28 (35%), Positives = 20/28 (71%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
+ KKE KK ++K ++ + + +KK+ E+E
Sbjct: 99 ELKKELKKLEEKIEQLEAEIEKKEEELE 126
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 26.5 bits (58), Expect = 4.7
Identities = 6/20 (30%), Positives = 12/20 (60%)
Query: 99 KKKKKKKKKKKKKKKKAEIE 118
KK+ +K K+ +++A E
Sbjct: 33 KKENAIRKLGKEAEEEAMEE 52
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 26.3 bits (58), Expect = 4.7
Identities = 10/26 (38%), Positives = 21/26 (80%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEI 117
++KEK++KKK K+ K+++K++K+
Sbjct: 185 QQKEKEEKKKVKEAKRREKEEKRMAA 210
Score = 25.5 bits (56), Expect = 7.6
Identities = 11/21 (52%), Positives = 19/21 (90%)
Query: 91 QKKKEKKKKKKKKKKKKKKKK 111
QK+KE+KKK K+ K+++K++K
Sbjct: 186 QKEKEEKKKVKEAKRREKEEK 206
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 26.2 bits (58), Expect = 4.8
Identities = 15/38 (39%), Positives = 26/38 (68%), Gaps = 3/38 (7%)
Query: 85 GETVNLQ---KKKEKKKKKKKKKKKKKKKKKKKAEIER 119
G V+LQ K E +KK+K+KK++ K +KK++ +E+
Sbjct: 146 GGDVHLQANPTKLELLRKKEKEKKEQLKIQKKQSLLEK 183
>gnl|CDD|235222 PRK04125, PRK04125, murein hydrolase regulator LrgA; Provisional.
Length = 141
Score = 25.8 bits (57), Expect = 5.2
Identities = 12/35 (34%), Positives = 19/35 (54%)
Query: 82 AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
A TG K +K+K+ KK+KK K K ++ +
Sbjct: 107 ACTGLFSQFILGKTEKEKEDKKEKKVKIKGRQHGK 141
>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein. PACS-1 is a
cytosolic sorting protein that directs the localisation
of membrane proteins in the trans-Golgi network
(TGN)/endosomal system. PACS-1 connects the clathrin
adaptor AP-1 to acidic cluster sorting motifs contained
in the cytoplasmic domain of cargo proteins such as
furin, the cation-independent mannose-6-phosphate
receptor and in viral proteins such as human
immunodeficiency virus type 1 Nef.
Length = 413
Score = 26.3 bits (58), Expect = 5.3
Identities = 10/25 (40%), Positives = 13/25 (52%)
Query: 96 KKKKKKKKKKKKKKKKKKKAEIERK 120
K+K KK KK K+ E+E K
Sbjct: 331 TKEKNKKVPVMFLGKKPKEKEVESK 355
>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated. This domain
appears in eukaryotes as well as bacteria and tends to
be found near the C-terminus of the metalloprotease M16C
(pfam05193).
Length = 248
Score = 26.0 bits (58), Expect = 5.4
Identities = 8/32 (25%), Positives = 20/32 (62%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
L++K +++++K +KKK ++ K +I +
Sbjct: 6 TLEEKLNEEEREKLEKKKSSLSEEDKEKIIER 37
>gnl|CDD|215059 PLN00111, PLN00111, accumulation of photosystem one; Provisional.
Length = 399
Score = 26.1 bits (58), Expect = 5.6
Identities = 11/32 (34%), Positives = 19/32 (59%), Gaps = 4/32 (12%)
Query: 92 KKKEKKKK----KKKKKKKKKKKKKKKAEIER 119
K+EKK K +++ ++KKK +K + ER
Sbjct: 48 SKREKKPYPRPMKLLRREAREKKKLRKLQPER 79
>gnl|CDD|215877 pfam00360, PHY, Phytochrome region. Phytochromes are red/far-red
photochromic biliprotein photoreceptors which regulate
plant development. They are widely represented in both
photosynthetic and non-photosynthetic bacteria and are
known in a variety of fungi. Although sequence
similarities are low, this domain is structurally
related to pfam01590, which is generally located
immediately N-terminal to this domain. Compared with
pfam01590, this domain carries an additional
tongue-like hairpin loop between the fifth beta-sheet
and the sixth alpha-helix which functions to seal the
chromophore pocket and stabilise the photoactivated
far-red-absorbing state (Pfr). The tongue carries a
conserved PRxSF motif, from which an arginine finger
points into the chromophore pocket close to ring D
forming a salt bridge with a conserved aspartate
residue.
Length = 183
Score = 26.0 bits (58), Expect = 5.6
Identities = 10/35 (28%), Positives = 19/35 (54%)
Query: 3 IVRDIARDLDLKSLSKKTVDNVSVLLDADGASLFF 37
+ ++R+ DL N+ L+ ADGA+L++
Sbjct: 8 LCDMLSREADLPEGLVTQSPNLLDLVKADGAALYY 42
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 25.9 bits (57), Expect = 5.8
Identities = 10/30 (33%), Positives = 22/30 (73%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
+K++++ K+ ++K+ K KK+KK + E+K
Sbjct: 96 EKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125
>gnl|CDD|220644 pfam10238, Eapp_C, E2F-associated phosphoprotein. This entry
represents the conserved C-terminal portion of an E2F
binding protein. E2F transcription factors play an
essential role in cell proliferation and apoptosis and
their activity is frequently deregulated in human
cancers. E2F activity is regulated by a variety of
mechanisms, frequently mediated by proteins binding to
individual members or a subgroup of the family. EAPP
interacts with a subset of E2F factors and influences
E2F-dependent promoter activity. EAPP is present
throughout the cell cycle but disappears during mitosis.
Length = 128
Score = 25.4 bits (56), Expect = 5.8
Identities = 10/40 (25%), Positives = 16/40 (40%), Gaps = 1/40 (2%)
Query: 81 VAVTGETVNLQKKKEKKKKKKKKKK-KKKKKKKKKAEIER 119
V E V K +KK+++ KK + +EI
Sbjct: 61 CKVDEEKVLYSKPINRKKRRRNSKKNELNPNNTATSEIYH 100
>gnl|CDD|237262 PRK12997, PRK12997, PTS system ascorbate-specific transporter
subunit IIC; Reviewed.
Length = 466
Score = 26.0 bits (58), Expect = 5.8
Identities = 9/57 (15%), Positives = 21/57 (36%), Gaps = 4/57 (7%)
Query: 52 VFDLHSGAGLVYSTADEVEVPWGV--GILGHVAVTGETVNLQKKKEKKKKKKKKKKK 106
+ L G + + + W +L ++ + G + + + K K K+K
Sbjct: 412 LLGLAKYGG--WHGNTDWDTDWPAFGLLLKYIGLGGYLLLAIPQLQFAKAKDKEKYY 466
>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease. The Slx4 protein is a
heteromeric structure-specific endonuclease found in
fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
substrates, particularly simple-Y, 5'-flap, or
replication fork structures by cleaving the strand
bearing the 5' non-homologous arm at the branch junction
and thus generating ligatable nicked products from
5'-flap or replication fork substrates.
Length = 627
Score = 26.1 bits (57), Expect = 6.0
Identities = 12/27 (44%), Positives = 19/27 (70%)
Query: 88 VNLQKKKEKKKKKKKKKKKKKKKKKKK 114
V+ ++ +KKK KK K +K+ K+K KK
Sbjct: 66 VSGKRVPKKKKIKKPKLRKRTKRKNKK 92
>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
Length = 472
Score = 26.1 bits (58), Expect = 6.1
Identities = 10/28 (35%), Positives = 20/28 (71%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
+++KEK+KK KK+ K++++ + IE
Sbjct: 393 REEKEKEKKAKKEGKEERRIHFQNKSIE 420
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 25.6 bits (57), Expect = 6.3
Identities = 7/29 (24%), Positives = 19/29 (65%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKKAEIER 119
Q+ K+ +++K++ +K+ + + K IE+
Sbjct: 127 QEIKKLEEEKEELEKRVAELEAKLEAIEK 155
>gnl|CDD|173607 PTZ00417, PTZ00417, lysine-tRNA ligase; Provisional.
Length = 585
Score = 26.1 bits (57), Expect = 6.5
Identities = 17/75 (22%), Positives = 36/75 (48%), Gaps = 8/75 (10%)
Query: 50 SKVFDLHSGAGLVYSTADEVEVPWGVGILGHVAVTGETVNLQKKK--EKKKKKKKKKKKK 107
S + HS + ++ + + P H T++ +K+ E +KK + + K
Sbjct: 17 SHFYKQHSFSKIILNKNKNIICPV------HCKQCFVTMSEKKEHVMEGEKKVRSVQASK 70
Query: 108 KKKKKKKAEIERKRY 122
KKK+++AE++ + Y
Sbjct: 71 DKKKEEEAEVDPRLY 85
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 25.9 bits (57), Expect = 6.7
Identities = 8/27 (29%), Positives = 9/27 (33%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKA 115
L +K K K KK KK
Sbjct: 186 KLSSTASSGDEKSPKSKAAPKKAGKKM 212
Score = 25.5 bits (56), Expect = 9.4
Identities = 9/22 (40%), Positives = 10/22 (45%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKK 113
EK K K KK KK +K
Sbjct: 193 SGDEKSPKSKAAPKKAGKKMRK 214
>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
pleckstrin homology (PH) domain. The BCR gene is one of
the two genes in the BCR-ABL complex, which is
associated with the Philadelphia chromosome, a product
of a reciprocal translocation between chromosomes 22 and
9. BCR is a GTPase-activating protein (GAP) for RAC1
(primarily) and CDC42. The Dbl region of BCR has the
most RhoGEF activity for Cdc42, and less activity
towards Rac and Rho. Since BCR possesses both GAP and
GEF activities, it may function to temporally regulate
the activity of these GTPases. It also displays
serine/threonine kinase activity. The BCR protein
contains multiple domains including an N-terminal kinase
domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
domain, and a C-terminal RhoGAP domain. ABR, a related
smaller protein, is structurally similar to BCR, but
lacks the N-terminal kinase domain and has GAP activity
for both Rac and Cdc42. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 179
Score = 25.8 bits (57), Expect = 6.7
Identities = 9/30 (30%), Positives = 14/30 (46%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
+ E+KK + +K +KK AE E
Sbjct: 105 IKRDSAERKKAASSGSRAIEKLRKKLAEQE 134
>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1. TT virus (TTV), isolated
initially from a Japanese patient with hepatitis of
unknown aetiology, has since been found to infect both
healthy and diseased individuals and numerous prevalence
studies have raised questions about its role in
unexplained hepatitis. ORF1 is a large 750 residue
protein. The N-terminal half of this protein corresponds
to the capsid protein.
Length = 525
Score = 26.1 bits (58), Expect = 6.8
Identities = 9/24 (37%), Positives = 10/24 (41%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKK 113
L K K K + K KK K K
Sbjct: 187 LAKHKILIPSLKTRPKGKKYVKIK 210
>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
function prediction only].
Length = 386
Score = 26.0 bits (57), Expect = 6.8
Identities = 10/25 (40%), Positives = 13/25 (52%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAE 116
+K K +K K KK KK +AE
Sbjct: 357 ARKRKGDRKGVSHKAKKGGKKNQAE 381
>gnl|CDD|235737 PRK06197, PRK06197, short chain dehydrogenase; Provisional.
Length = 306
Score = 25.8 bits (57), Expect = 6.9
Identities = 14/31 (45%), Positives = 20/31 (64%), Gaps = 4/31 (12%)
Query: 59 AGLVY----STADEVEVPWGVGILGHVAVTG 85
AG++Y +TAD E+ +G LGH A+TG
Sbjct: 103 AGVMYTPKQTTADGFELQFGTNHLGHFALTG 133
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 25.9 bits (57), Expect = 7.0
Identities = 8/27 (29%), Positives = 13/27 (48%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIE 118
K+ E + K +K+KK+ K E
Sbjct: 429 KRPEPRAAADTKSAAEKQKKRAKEPFE 455
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 25.8 bits (57), Expect = 7.2
Identities = 8/29 (27%), Positives = 21/29 (72%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
+++KE++KK+ K++K+++K+ +K
Sbjct: 181 AERRRREEKERRKKQDKERKQREKETAEK 209
>gnl|CDD|216213 pfam00956, NAP, Nucleosome assembly protein (NAP). NAP proteins
are involved in moving histones into the nucleus,
nucleosome assembly and chromatin fluidity. They affect
the transcription of many genes.
Length = 236
Score = 25.4 bits (56), Expect = 7.6
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 89 NLQKKKEKKKKKKKKKKKKKKKKK 112
NL K KKK++ KK + + K
Sbjct: 156 NLTVKTVKKKQRNKKTGQTRTITK 179
>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein. This model
describes nucleoid occlusion protein, a close homolog to
ParB chromosome partitioning proteins including Spo0J in
Bacillus subtilis. Its gene often is located near the
gene for the Spo0J ortholog. This protein bind a
specific DNA sequence and blocks cytokinesis from
happening until chromosome segregation is complete.
Length = 255
Score = 25.5 bits (57), Expect = 7.8
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 91 QKKKEKKKKKKKKKKKKKK 109
+K EK +K KKKKK++K
Sbjct: 192 KKLLEKPEKPKKKKKRRKG 210
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 25.3 bits (56), Expect = 8.0
Identities = 9/30 (30%), Positives = 28/30 (93%)
Query: 84 TGETVNLQKKKEKKKKKKKKKKKKKKKKKK 113
+G +++++K+E+K+++K++K++K++K+K+
Sbjct: 8 SGRIIDIEEKREEKEREKEEKERKEEKEKE 37
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many hypothetical
proteins.
Length = 98
Score = 24.7 bits (54), Expect = 8.0
Identities = 11/31 (35%), Positives = 19/31 (61%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
KK++KKKK+ K + K KK + + ++K
Sbjct: 22 KKRKKKKKRTAKTARPKATKKGQKKDKKKDE 52
>gnl|CDD|151885 pfam11446, DUF2897, Protein of unknown function (DUF2897). This is
a bacterial family of uncharacterized proteins.
Length = 55
Score = 24.1 bits (53), Expect = 8.7
Identities = 12/40 (30%), Positives = 24/40 (60%)
Query: 77 ILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
I+G++AV + ++ + K+KKKK K+ ++ K A+
Sbjct: 16 IVGNLAVLKYSAKMKFPQFDKQKKKKAKRNAEQDAKNSAD 55
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 25.4 bits (56), Expect = 8.7
Identities = 13/29 (44%), Positives = 21/29 (72%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
K + K+KKK+KKK++KKKK+ +E +
Sbjct: 323 KLDAEVKEKKKEKKKEEKKKKQIERLEER 351
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 25.5 bits (56), Expect = 8.8
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 93 KKEKKKKKKKKKKKKKKKKKKK 114
K +K KK KK KKKKKKK+
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKRS 217
>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein. NinG or Rap is
involved in recombination. Rap (recombination adept with
plasmid) increases lambda-by-plasmid recombination
catalyzed by Escherichia coli's RecBCD pathway.
Length = 188
Score = 25.4 bits (56), Expect = 9.4
Identities = 7/27 (25%), Positives = 17/27 (62%)
Query: 94 KEKKKKKKKKKKKKKKKKKKKAEIERK 120
K++K ++KK+K + + +++ RK
Sbjct: 33 ALKREKAQEKKRKAEAQAERRELKARK 59
>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
(ETRAMP). These sequences represent a family of
proteins from the malaria parasite Plasmodium
falciparum, several of which have been shown to be
expressed specifically in the ring stage as well as the
rodent parasite Plasmodium yoelii. A homologue from
Plasmodium chabaudi was localized to the parasitophorous
vacuole membrane. Members have an initial hydrophobic,
Phe/Tyr-rich, stretch long enough to span the membrane,
a highly charged region rich in Lys, a second putative
transmembrane region and a second highly charged, low
complexity sequence region. Some members have up to 100
residues of additional C-terminal sequence. These genes
have been shown to be found in the sub-telomeric regions
of both Plasmodium falciparum and P. yoelii chromosomes.
Length = 82
Score = 24.5 bits (54), Expect = 9.5
Identities = 15/26 (57%), Positives = 16/26 (61%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEI 117
KKK KK KK +KKKK KKK I
Sbjct: 31 KKKLKKLKKIDDDLEKKKKNKKKIII 56
>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein. The PcfJ-like protein family
includes the E. faecalis PcfJ protein, which is
functionally uncharacterized. It is found in bacteria
and viruses, and is typically between 159 and 170 amino
acids in length. There is a conserved HCV sequence
motif.
Length = 169
Score = 25.0 bits (55), Expect = 9.5
Identities = 9/35 (25%), Positives = 20/35 (57%)
Query: 86 ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
V + K++K+K +K +K+ + +K E++ K
Sbjct: 33 RAVAKYQAKKEKEKAEKMRKRYPELEKIFRELKSK 67
>gnl|CDD|130283 TIGR01216, ATP_synt_epsi, ATP synthase, F1 epsilon subunit (delta
in mitochondria). This model describes one of the five
types of subunits in the F1 part of F1/F0 ATP synthases.
Members of this family are designated epsilon in
bacterial and chloroplast systems but designated delta
in mitochondria, where the counterpart of the bacterial
delta subunit is designated OSCP. In a few cases
(Propionigenium modestum, Acetobacterium woodii) scoring
above the trusted cutoff and designated here as
exceptions, Na+ replaces H+ for translocation [Energy
metabolism, ATP-proton motive force interconversion].
Length = 130
Score = 24.9 bits (55), Expect = 9.5
Identities = 10/50 (20%), Positives = 22/50 (44%), Gaps = 3/50 (6%)
Query: 75 VGILGHVAVTGETVN---LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
V IL AV + ++ +K E +K + + K + ++++ R
Sbjct: 74 VTILADGAVFADDIDEAEAEKALEAAEKLLESAEDDKDLAEALLKLKKAR 123
>gnl|CDD|240709 cd12263, RRM_ABT1_like, RNA recognition motif found in activator of
basal transcription 1 (ABT1) and similar proteins. This
subfamily corresponds to the RRM of novel nuclear
proteins termed ABT1 and its homologous counterpart,
pre-rRNA-processing protein ESF2 (eighteen S factor 2),
from yeast. ABT1 associates with the TATA-binding
protein (TBP) and enhances basal transcription activity
of class II promoters. Meanwhile, ABT1 could be a
transcription cofactor that can bind to DNA in a
sequence-independent manner. The yeast ABT1 homolog,
ESF2, is a component of 90S preribosomes and 5'
ETS-based RNPs. It is previously identified as a
putative partner of the TATA-element binding protein.
However, it is primarily localized to the nucleolus and
physically associates with pre-rRNA processing factors.
ESF2 may play a role in ribosome biogenesis. It is
required for normal pre-rRNA processing, as well as for
SSU processome assembly and function. Both ABT1 and ESF2
contain an RNA recognition motif (RRM), also termed RBD
(RNA binding domain) or RNP (ribonucleoprotein domain).
.
Length = 98
Score = 24.5 bits (54), Expect = 9.6
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 90 LQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
LQ + K+K++KKK KKKK + +E
Sbjct: 32 LQPEDPAKRKRRKKKGGNKKKKFTEGWVE 60
>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755). This
family is predominated by ORFs from Circoviridae. The
function of this family remains to be determined.
Length = 122
Score = 24.7 bits (54), Expect = 9.8
Identities = 9/24 (37%), Positives = 12/24 (50%)
Query: 91 QKKKEKKKKKKKKKKKKKKKKKKK 114
KKK KK + + KK KK+
Sbjct: 78 HKKKRKKSRHVSSRSAKKISAKKR 101
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 25.0 bits (55), Expect = 9.9
Identities = 16/53 (30%), Positives = 23/53 (43%), Gaps = 11/53 (20%)
Query: 59 AGLVYST---ADEVEVPWGVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKK 108
AGL Y+ A +E A E K++E + +KK+ KKKK
Sbjct: 152 AGLWYAEWKDAKLLEE--------FAAEEAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 25.2 bits (55), Expect = 9.9
Identities = 14/29 (48%), Positives = 18/29 (62%)
Query: 92 KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
KK KKKK +KKK KK KKK + + +
Sbjct: 48 KKTSKKKKTTPRKKKTTKKTKKKKKEKEE 76
>gnl|CDD|232985 TIGR00463, gltX_arch, glutamyl-tRNA synthetase, archaeal and
eukaryotic family. The glutamyl-tRNA synthetases of the
eukaryotic cytosol and of the Archaea are more similar
to glutaminyl-tRNA synthetases than to bacterial
glutamyl-tRNA synthetases. This model models just the
eukaryotic cytosolic and archaeal forms of the enzyme.
In some eukaryotes, the glutamyl-tRNA synthetase is part
of a longer, multifunctional aminoacyl-tRNA ligase. In
many species, the charging of tRNA(gln) proceeds first
through misacylation with Glu and then transamidation.
For this reason, glutamyl-tRNA synthetases, including
all known archaeal enzymes (as of 2010) may act on both
tRNA(gln) and tRNA(glu) [Protein synthesis, tRNA
aminoacylation].
Length = 556
Score = 25.6 bits (56), Expect = 10.0
Identities = 12/37 (32%), Positives = 20/37 (54%)
Query: 82 AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
A E +L +++K+ K+ KKK+KK+K E
Sbjct: 49 AAVEEVNSLSPEEQKELMKRLGLDIKKKEKKRKGLRE 85
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.313 0.132 0.357
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,268,187
Number of extensions: 581734
Number of successful extensions: 11752
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7534
Number of HSP's successfully gapped: 1931
Length of query: 122
Length of database: 10,937,602
Length adjustment: 84
Effective length of query: 38
Effective length of database: 7,211,866
Effective search space: 274050908
Effective search space used: 274050908
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 53 (24.0 bits)