RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15969
(111 letters)
>gnl|CDD|197696 smart00389, HOX, Homeodomain. DNA-binding factors that are
involved in the transcriptional regulation of key
developmental processes.
Length = 57
Score = 46.5 bits (111), Expect = 2e-08
Identities = 17/56 (30%), Positives = 29/56 (51%)
Query: 44 PKKPRLVFTDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPTTVGNFFMNARRRS 99
++ R FT Q L+ F++ PS+E + +A++LGL V +F N R +
Sbjct: 1 KRRKRTSFTPEQLEELEKEFQKNPYPSREEREELAKKLGLSERQVKVWFQNRRAKW 56
>gnl|CDD|238039 cd00086, homeodomain, Homeodomain; DNA binding domains involved
in the transcriptional regulation of key eukaryotic
developmental processes; may bind to DNA as monomers or
as homo- and/or heterodimers, in a sequence-specific
manner.
Length = 59
Score = 44.2 bits (105), Expect = 2e-07
Identities = 17/55 (30%), Positives = 29/55 (52%)
Query: 45 KKPRLVFTDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPTTVGNFFMNARRRS 99
++ R FT Q L+ F++ PS+E + +A++LGL V +F N R +
Sbjct: 1 RRKRTRFTPEQLEELEKEFEKNPYPSREEREELAKELGLTERQVKIWFQNRRAKL 55
>gnl|CDD|200956 pfam00046, Homeobox, Homeobox domain.
Length = 57
Score = 42.9 bits (102), Expect = 5e-07
Identities = 17/54 (31%), Positives = 29/54 (53%)
Query: 45 KKPRLVFTDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPTTVGNFFMNARRR 98
++ R FT Q L+ F++ + PS E + +A++LGL V +F N R +
Sbjct: 1 RRKRTTFTPEQLEELEKEFEKNRYPSAEEREELAKKLGLTERQVKVWFQNRRAK 54
>gnl|CDD|203350 pfam05920, Homeobox_KN, Homeobox KN domain. This is a homeobox
transcription factor KN domain conserved from fungi to
human and plants.
Length = 40
Score = 35.2 bits (82), Expect = 3e-04
Identities = 15/30 (50%), Positives = 21/30 (70%)
Query: 69 PSKEMQVTIARQLGLEPTTVGNFFMNARRR 98
PS+E + +ARQ GL + N+F+NARRR
Sbjct: 11 PSEEEKSELARQTGLSRKQIDNWFINARRR 40
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 38.0 bits (89), Expect = 3e-04
Identities = 13/36 (36%), Positives = 22/36 (61%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQD 39
+ KKEK+K+K K KK+ + K KR++ S++
Sbjct: 399 RAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 36.8 bits (86), Expect = 8e-04
Identities = 11/37 (29%), Positives = 24/37 (64%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQ 40
KR +K++K+K+K K KK+ + K ++ +P + ++
Sbjct: 398 KRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 34.9 bits (81), Expect = 0.004
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKT 27
++ EKK+K+K+K K KK+ + T
Sbjct: 395 VLAKRAEKKEKEKEKPKVKKRHRDT 419
Score = 33.8 bits (78), Expect = 0.010
Identities = 11/29 (37%), Positives = 18/29 (62%), Gaps = 2/29 (6%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
EK+ K+++K K KK+ + K K +RK
Sbjct: 401 EKKEKEKEKPKVKKRHRDTKNIGK--RRK 427
Score = 33.0 bits (76), Expect = 0.016
Identities = 10/29 (34%), Positives = 18/29 (62%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
K KK K+ +KK+K+K+K K + ++
Sbjct: 390 KPSKKVLAKRAEKKEKEKEKPKVKKRHRD 418
Score = 32.6 bits (75), Expect = 0.023
Identities = 10/28 (35%), Positives = 16/28 (57%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
K K +KK+K+K+K K KK+ +
Sbjct: 394 KVLAKRAEKKEKEKEKPKVKKRHRDTKN 421
Score = 32.2 bits (74), Expect = 0.032
Identities = 11/31 (35%), Positives = 17/31 (54%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
EK ++K K KK+ + K K++K EE
Sbjct: 404 EKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 31.8 bits (73), Expect = 0.040
Identities = 10/30 (33%), Positives = 18/30 (60%)
Query: 2 VEKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
K+ ++ +KK+K+K+K K KK+ K
Sbjct: 391 PSKKVLAKRAEKKEKEKEKPKVKKRHRDTK 420
Score = 31.8 bits (73), Expect = 0.044
Identities = 12/30 (40%), Positives = 18/30 (60%)
Query: 2 VEKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
EK+ K KK K+ +KK+K+K+ K K
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKVK 413
Score = 31.1 bits (71), Expect = 0.078
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPD 35
K K KK K+ +KK+K+K+K +K D
Sbjct: 387 KTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRD 418
Score = 29.1 bits (66), Expect = 0.36
Identities = 7/31 (22%), Positives = 17/31 (54%)
Query: 2 VEKRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
+++ K + KK+ + K K++K + +E
Sbjct: 404 EKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 28.8 bits (65), Expect = 0.57
Identities = 11/40 (27%), Positives = 20/40 (50%)
Query: 7 KKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKK 46
KK K KK K+ +KK+K ++ + +D + K+
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425
Score = 28.8 bits (65), Expect = 0.61
Identities = 11/42 (26%), Positives = 18/42 (42%)
Query: 7 KKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPR 48
+K+ K KK K+ +KK+ K K + + K R
Sbjct: 385 EKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRR 426
Score = 27.6 bits (62), Expect = 1.5
Identities = 10/29 (34%), Positives = 14/29 (48%)
Query: 5 RRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
+ K +KK K KK K+ K KE+
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEK 407
Score = 26.1 bits (58), Expect = 5.3
Identities = 8/28 (28%), Positives = 15/28 (53%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
K ++K K KK K+ +KK +++
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKE 408
Score = 25.7 bits (57), Expect = 6.7
Identities = 12/39 (30%), Positives = 19/39 (48%)
Query: 10 KKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPR 48
K K +KK K KK KR E+ + +++ K+ R
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHR 417
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 38.0 bits (89), Expect = 4e-04
Identities = 16/28 (57%), Positives = 24/28 (85%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
K++++EKK +K+KKKKK+KKK K K+K
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKK 411
Score = 34.9 bits (81), Expect = 0.003
Identities = 15/30 (50%), Positives = 24/30 (80%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
K+R+++K +K+KKKKK+KKK KK K+ +
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 34.2 bits (79), Expect = 0.008
Identities = 14/29 (48%), Positives = 24/29 (82%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
++K+E+KK +K+KKKKK+KKK ++K+
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRKKK 411
Score = 33.8 bits (78), Expect = 0.010
Identities = 14/28 (50%), Positives = 22/28 (78%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKR 30
+ +K +K+KKKKK+KKK KK+KK ++
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 33.0 bits (76), Expect = 0.016
Identities = 16/29 (55%), Positives = 22/29 (75%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
++ KK +K+KKKKK+KKK KK+K RK
Sbjct: 386 KREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 31.9 bits (73), Expect = 0.040
Identities = 16/28 (57%), Positives = 24/28 (85%)
Query: 6 RKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
+KK ++KK +K+KKKKK+KKK KRK++
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKK 411
Score = 29.2 bits (66), Expect = 0.40
Identities = 15/29 (51%), Positives = 21/29 (72%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
EK K KKK+++KK +K+KKKKK K+
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKKKKRKKKG 405
Score = 28.8 bits (65), Expect = 0.46
Identities = 12/27 (44%), Positives = 21/27 (77%)
Query: 7 KKEKKKKKKKKKKKKKKKKKTCKRKEE 33
K+K+++KK +K+KKKKK+K +K +
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRK 409
Score = 28.8 bits (65), Expect = 0.58
Identities = 11/31 (35%), Positives = 21/31 (67%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
K + + KKK+++KK +K+KKK ++K+
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGK 406
Score = 27.6 bits (62), Expect = 1.3
Identities = 14/31 (45%), Positives = 22/31 (70%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
E + K K KKK+++KK +K+KK KRK++
Sbjct: 374 EIKEKYPKPPKKKREEKKPQKRKKKKKRKKK 404
Score = 27.6 bits (62), Expect = 1.5
Identities = 12/25 (48%), Positives = 19/25 (76%)
Query: 10 KKKKKKKKKKKKKKKKKTCKRKEEP 34
KK+++KK +K+KKKKK+ K K+
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRK 409
Score = 26.9 bits (60), Expect = 2.6
Identities = 16/45 (35%), Positives = 25/45 (55%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPR 48
+R +E K+K K KKK+++KK KRK++ + KK R
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 25.7 bits (57), Expect = 5.5
Identities = 11/30 (36%), Positives = 22/30 (73%)
Query: 2 VEKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
+E+ ++K K KKK+++KK +K+K K++
Sbjct: 372 IEEIKEKYPKPPKKKREEKKPQKRKKKKKR 401
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 36.0 bits (84), Expect = 0.001
Identities = 12/28 (42%), Positives = 17/28 (60%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
K +K +KK +K K K +KKK K + K
Sbjct: 69 KAWEKAEKKAEKAKAKAEKKKAKKEEPK 96
Score = 35.2 bits (82), Expect = 0.002
Identities = 12/28 (42%), Positives = 15/28 (53%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
K EKK +K K K +KKK KK +
Sbjct: 70 AWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
Score = 33.3 bits (77), Expect = 0.008
Identities = 17/49 (34%), Positives = 24/49 (48%), Gaps = 12/49 (24%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRL 49
+ +K K +K +KK +K K K +KK K K+E KPRL
Sbjct: 63 LDKKELKAWEKAEKKAEKAKAKAEKK--KAKKE----------EPKPRL 99
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 36.5 bits (85), Expect = 0.001
Identities = 15/50 (30%), Positives = 25/50 (50%), Gaps = 2/50 (4%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTD 53
+ KK +KK KK KK K ++ + E + + + P +PRL+ D
Sbjct: 89 EAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEEPE--PPLRPRLIVND 136
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
Provisional.
Length = 330
Score = 35.6 bits (83), Expect = 0.002
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 7 KKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
KKE K K +KKK+K++ K K K +P
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKP 90
Score = 34.4 bits (80), Expect = 0.005
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKT 27
K +KKK+K++ K K K K
Sbjct: 68 AWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 34.0 bits (79), Expect = 0.007
Identities = 10/25 (40%), Positives = 14/25 (56%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTC 28
K K +KKK+K++ K K K K
Sbjct: 67 KAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 33.3 bits (77), Expect = 0.014
Identities = 10/31 (32%), Positives = 17/31 (54%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
+ +K K K +KKK+K++ K K K +
Sbjct: 61 LDKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 25.2 bits (56), Expect = 7.7
Identities = 12/39 (30%), Positives = 17/39 (43%), Gaps = 9/39 (23%)
Query: 11 KKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRL 49
KK+ K K +KKK+K + + KPRL
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAK---------SKPRL 92
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 33.4 bits (77), Expect = 0.015
Identities = 10/45 (22%), Positives = 29/45 (64%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
E++++K+KK KKK+++++++K K +E+ + ++ ++
Sbjct: 425 EEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469
Score = 32.2 bits (74), Expect = 0.033
Identities = 15/46 (32%), Positives = 29/46 (63%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPR 48
EK+R++EKK+KKKK KKK+++ + KE+ + +++ +
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465
Score = 31.4 bits (72), Expect = 0.056
Identities = 12/45 (26%), Positives = 27/45 (60%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
+ +K+E++KK+KKKK KKK+ + +E+ +++ ++
Sbjct: 418 KAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Score = 31.4 bits (72), Expect = 0.061
Identities = 14/46 (30%), Positives = 30/46 (65%)
Query: 2 VEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
++K +K +KK++++KK+KKKK K++EE + +++ K+
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457
Score = 28.7 bits (65), Expect = 0.50
Identities = 16/57 (28%), Positives = 32/57 (56%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRRTL 59
+ KKEKKKK KKK++++++ ++KEE +++ + +K ++ TL
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQATL 479
Score = 26.8 bits (60), Expect = 2.2
Identities = 9/44 (20%), Positives = 24/44 (54%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
+K KK KK +K +KK++++ ++K++ + + ++
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEE 447
Score = 26.4 bits (59), Expect = 3.6
Identities = 11/33 (33%), Positives = 19/33 (57%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
+E +K KK KK +K +KK+ ++KE+
Sbjct: 398 EIEFLTGSKKATKKIKKIVEKAEKKREEEKKEK 430
Score = 25.7 bits (57), Expect = 5.6
Identities = 9/29 (31%), Positives = 18/29 (62%)
Query: 5 RRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
K+ KK KK +K +KK++ K++++
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKK 431
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 32.2 bits (73), Expect = 0.041
Identities = 16/43 (37%), Positives = 28/43 (65%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKK 46
+R+KK+++K K+K+ KK K +K K K + + D ++ PKK
Sbjct: 19 ERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKK 61
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 31.8 bits (73), Expect = 0.052
Identities = 11/47 (23%), Positives = 20/47 (42%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
E+KK +KK++K +KK +K K + ++ K
Sbjct: 402 ENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKG 448
Score = 31.4 bits (72), Expect = 0.072
Identities = 11/33 (33%), Positives = 16/33 (48%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPD 35
EK+ +KE+ +K KKK + KK E
Sbjct: 422 EKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454
Score = 29.5 bits (67), Expect = 0.35
Identities = 12/44 (27%), Positives = 22/44 (50%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
K RKK++K +KK +K++ +K K + ++ KK
Sbjct: 413 KLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456
Score = 28.4 bits (64), Expect = 0.75
Identities = 12/52 (23%), Positives = 27/52 (51%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDL 54
+R+K KK++K +KK +K++ +K +K+ ++ + + V D
Sbjct: 409 AERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDP 460
>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
SecD/SecF; Reviewed.
Length = 755
Score = 31.4 bits (72), Expect = 0.064
Identities = 13/23 (56%), Positives = 18/23 (78%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKK 23
+EKRR K+KKK+KK KK + +K
Sbjct: 731 DLEKRRLKKKKKRKKVKKWEVEK 753
Score = 29.0 bits (66), Expect = 0.43
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 9 EKKKKKKKKKKKKKKKKKT 27
EK++ KKKKK+KK KK +
Sbjct: 733 EKRRLKKKKKRKKVKKWEV 751
Score = 28.7 bits (65), Expect = 0.66
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 7 KKEKKKKKKKKKKKKKKKKK 26
K+ KKKKK+KK KK + +K
Sbjct: 734 KRRLKKKKKRKKVKKWEVEK 753
Score = 28.3 bits (64), Expect = 0.72
Identities = 10/21 (47%), Positives = 17/21 (80%)
Query: 4 KRRKKEKKKKKKKKKKKKKKK 24
++R+ +KKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 27.9 bits (63), Expect = 0.97
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 11 KKKKKKKKKKKKKKKKTCKRKEE 33
+K++ KKKKK+KK KK K E
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755
Score = 27.9 bits (63), Expect = 1.3
Identities = 9/20 (45%), Positives = 15/20 (75%)
Query: 8 KEKKKKKKKKKKKKKKKKKT 27
++++ KKKKK+KK KK +
Sbjct: 733 EKRRLKKKKKRKKVKKWEVE 752
Score = 27.1 bits (61), Expect = 2.2
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 7 KKEKKKKKKKKKKKKKKKKKT 27
+K + KKKKK+KK KK + +
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
>gnl|CDD|227863 COG5576, COG5576, Homeodomain-containing transcription factor
[Transcription].
Length = 156
Score = 30.9 bits (70), Expect = 0.078
Identities = 20/63 (31%), Positives = 30/63 (47%)
Query: 37 SQDQSSTPKKPRLVFTDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPTTVGNFFMNAR 96
QD SS PK R TD Q L+ F+ PS ++ ++ L + P +V +F N R
Sbjct: 44 KQDGSSPPKSKRRRTTDEQLMVLEREFEINPYPSSITRIKLSLLLNMPPKSVQIWFQNKR 103
Query: 97 RRS 99
+
Sbjct: 104 AKE 106
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 31.2 bits (71), Expect = 0.086
Identities = 13/31 (41%), Positives = 24/31 (77%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
E +R+KE+K+ K++K+ +K KK+ K+K+E
Sbjct: 552 ELQREKEEKEALKEQKRLRKLKKQEEKKKKE 582
Score = 30.0 bits (68), Expect = 0.22
Identities = 10/32 (31%), Positives = 21/32 (65%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
EK K+ K++K+ +K KK+++KK + ++
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585
Score = 28.8 bits (65), Expect = 0.47
Identities = 10/31 (32%), Positives = 23/31 (74%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
E++ +++K+ +K KK+++KKKK ++ E+
Sbjct: 558 EEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 28.1 bits (63), Expect = 1.0
Identities = 12/30 (40%), Positives = 22/30 (73%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
+R K+EK+ K++K+ +K KK++ K+KE
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKEL 583
Score = 26.1 bits (58), Expect = 4.4
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
+KR +K KK+++KKKK+ +K +K E
Sbjct: 566 QKRLRKLKKQEEKKKKELEKLEKAKIPPAE 595
Score = 26.1 bits (58), Expect = 5.2
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
+ +K +K KK+++KKKK+ +K K K P
Sbjct: 562 ALKEQKRLRKLKKQEEKKKKELEKLEKAKIPP 593
Score = 25.8 bits (57), Expect = 5.6
Identities = 10/30 (33%), Positives = 21/30 (70%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
++ +KE K++K+ +K KK+++K K E+
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKELEK 585
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 30.4 bits (68), Expect = 0.14
Identities = 13/42 (30%), Positives = 26/42 (61%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTP 44
E+RR +++ +K K++KKK++K+K+ +R D+ P
Sbjct: 86 EERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAP 127
Score = 28.5 bits (63), Expect = 0.79
Identities = 17/72 (23%), Positives = 29/72 (40%), Gaps = 1/72 (1%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRRTLQ 60
VEK+ KK KKK+KK+K+K++ K KK + + +P V +
Sbjct: 194 AVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK-SLLLALDDSPASAASVAEADEASLAN 252
Query: 61 AIFKETKRPSKE 72
+ +
Sbjct: 253 TVSGTAPDSEPD 264
Score = 26.2 bits (57), Expect = 4.1
Identities = 14/46 (30%), Positives = 25/46 (54%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRL 49
K +KK+++K+K++KKKKKK C + Q+ ++ L
Sbjct: 279 KHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQNGAVEEEPL 324
>gnl|CDD|118696 pfam10168, Nup88, Nuclear pore component. Nup88 can be divided
into two structural domains; the N-terminal two-thirds
of the protein has no obvious structural motifs but is
the region for binding to Nup98, one of the components
of the nuclear pore. the C-terminal end is a predicted
coiled-coil domain. Nup88 is overexpressed in tumour
cells.
Length = 717
Score = 30.6 bits (69), Expect = 0.14
Identities = 16/66 (24%), Positives = 24/66 (36%), Gaps = 11/66 (16%)
Query: 14 KKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRRTLQAIFKETKRPSKEM 73
K+ KKKK ++ ++ PKK + Q +T+ I KE M
Sbjct: 656 KQVKKKKNYQRYHMASQES-----------PKKSSYTLPEKQHKTITEILKELGEHIDRM 704
Query: 74 QVTIAR 79
I R
Sbjct: 705 IKQIKR 710
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 30.5 bits (69), Expect = 0.15
Identities = 19/48 (39%), Positives = 28/48 (58%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLV 50
E ++K EKKKKKKK+KK+ K + +T + P S+ P KP+
Sbjct: 65 ESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPN 112
Score = 27.0 bits (60), Expect = 1.9
Identities = 15/39 (38%), Positives = 20/39 (51%)
Query: 8 KEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKK 46
E KKK +KKKKKKK+KK+ E + + K
Sbjct: 64 NESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKT 102
Score = 25.1 bits (55), Expect = 9.8
Identities = 11/32 (34%), Positives = 13/32 (40%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
K + K K KK KK KKK + E
Sbjct: 83 PKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 30.3 bits (68), Expect = 0.17
Identities = 17/88 (19%), Positives = 35/88 (39%), Gaps = 10/88 (11%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKE---EPDISQDQSSTPKKPRLVFTDL----- 54
R +EK +K+ +K + + K+K +KE ++ + L D
Sbjct: 13 NNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKST 72
Query: 55 --QRRTLQAIFKETKRPSKEMQVTIARQ 80
+ + L + K + KE+Q I ++
Sbjct: 73 KEESKQLLEVLKTKEEHQKEIQYEILQK 100
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 30.3 bits (68), Expect = 0.18
Identities = 13/54 (24%), Positives = 22/54 (40%), Gaps = 7/54 (12%)
Query: 4 KRRKKEK---KKKKKKKKKKKKKKKKTCKRKEEPDISQDQS----STPKKPRLV 50
K++ KEK K+ K + + + + C K+ + D KKP V
Sbjct: 798 KQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKV 851
Score = 27.6 bits (61), Expect = 1.3
Identities = 9/39 (23%), Positives = 15/39 (38%)
Query: 10 KKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPR 48
K + K+K K+K + E P + + K R
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQR 830
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 29.8 bits (67), Expect = 0.22
Identities = 13/25 (52%), Positives = 17/25 (68%)
Query: 2 VEKRRKKEKKKKKKKKKKKKKKKKK 26
VEK++K + KK K K KK + KKK
Sbjct: 89 VEKKKKGKSKKNKLKGKKDEDKKKA 113
Score = 27.5 bits (61), Expect = 1.3
Identities = 14/23 (60%), Positives = 15/23 (65%)
Query: 9 EKKKKKKKKKKKKKKKKKTCKRK 31
EKKKK K KK K K KK K+K
Sbjct: 90 EKKKKGKSKKNKLKGKKDEDKKK 112
Score = 27.5 bits (61), Expect = 1.4
Identities = 12/26 (46%), Positives = 15/26 (57%)
Query: 7 KKEKKKKKKKKKKKKKKKKKTCKRKE 32
KK+K K KK K K KK + K R+
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAREG 116
Score = 27.1 bits (60), Expect = 1.6
Identities = 12/26 (46%), Positives = 15/26 (57%)
Query: 8 KEKKKKKKKKKKKKKKKKKTCKRKEE 33
K+KK K KK K K KK + K +E
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAREG 116
Score = 26.3 bits (58), Expect = 3.8
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 11 KKKKKKKKKKKKKKKKTCKRKEEPD 35
KKKK K KK K K KK +K+ +
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKARE 115
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 29.8 bits (67), Expect = 0.23
Identities = 13/42 (30%), Positives = 19/42 (45%), Gaps = 3/42 (7%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPK 45
+ +K KKK K KKKKK + E + ++ PK
Sbjct: 120 PKTEKLKKKITVNKSTNKKKKKVLSSKDEL---IKYDNNKPK 158
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 29.3 bits (66), Expect = 0.24
Identities = 15/44 (34%), Positives = 24/44 (54%)
Query: 2 VEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPK 45
E+ E K + K+KKK++ K KT K K + + + + S PK
Sbjct: 109 TEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152
Score = 27.0 bits (60), Expect = 1.7
Identities = 8/41 (19%), Positives = 17/41 (41%)
Query: 7 KKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
+ E++ + K + K+KKK+ + + KP
Sbjct: 108 ETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148
Score = 25.9 bits (57), Expect = 4.9
Identities = 11/43 (25%), Positives = 25/43 (58%)
Query: 7 KKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRL 49
+++ + K + K+KKK++ K ++E ++ + P KP+L
Sbjct: 111 QEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPKL 153
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 30.0 bits (68), Expect = 0.24
Identities = 17/54 (31%), Positives = 28/54 (51%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQR 56
+++R K K K K K +K K KKK K+K+ ++S + V +D +R
Sbjct: 1154 KEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKR 1207
Score = 27.3 bits (61), Expect = 1.9
Identities = 14/51 (27%), Positives = 19/51 (37%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTD 53
+ R+ K KKK+KKKKK K KK + D+
Sbjct: 1168 KLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKS 1218
Score = 25.8 bits (57), Expect = 6.7
Identities = 13/47 (27%), Positives = 19/47 (40%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRL 49
K+R + KKKKK +KK + K K + S+ R
Sbjct: 1310 VKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRP 1356
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 29.7 bits (67), Expect = 0.25
Identities = 10/27 (37%), Positives = 14/27 (51%)
Query: 5 RRKKEKKKKKKKKKKKKKKKKKTCKRK 31
R +K+ K K KK +K KK +K
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQK 796
Score = 27.8 bits (62), Expect = 1.1
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 6 RKKEKKKKKKKKKKKKKKKKKTCKRKE 32
R ++K K K KK +K KK + K ++
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQK 796
Score = 27.8 bits (62), Expect = 1.2
Identities = 10/26 (38%), Positives = 13/26 (50%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCK 29
+K + K KK +K KK KT K
Sbjct: 771 GEKKAKPKAAKKDARKAKKPSAKTQK 796
Score = 25.5 bits (56), Expect = 7.2
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKK 26
EK+ K + KK +K KK K +
Sbjct: 772 EKKAKPKAAKKDARKAKKPSAKTQ 795
>gnl|CDD|218140 pfam04547, Anoctamin, Calcium-activated chloride channel. The
family carries eight putative transmembrane domains,
and, although it has no similarity to other known
channel proteins, it is clearly a calcium-activated
ionic channel. It is expressed in various secretory
epithelia, the retina and sensory neurons, and mediates
receptor-activated chloride currents in diverse
physiological processes.
Length = 449
Score = 29.5 bits (67), Expect = 0.33
Identities = 15/36 (41%), Positives = 23/36 (63%)
Query: 10 KKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPK 45
KKK +KKK KKKK+K+K + E + +++S P
Sbjct: 270 KKKIRKKKIKKKKEKEKKPEENEILERIENESELPP 305
Score = 26.0 bits (58), Expect = 5.3
Identities = 12/30 (40%), Positives = 22/30 (73%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
K++ ++KK KKKK+K+KK ++ + +R E
Sbjct: 270 KKKIRKKKIKKKKEKEKKPEENEILERIEN 299
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 29.5 bits (66), Expect = 0.34
Identities = 13/25 (52%), Positives = 19/25 (76%)
Query: 6 RKKEKKKKKKKKKKKKKKKKKTCKR 30
++KKKKKK K+KKKKK ++ K+
Sbjct: 682 GSRKKKKKKKSKEKKKKKNREASKQ 706
Score = 27.5 bits (61), Expect = 1.4
Identities = 13/22 (59%), Positives = 18/22 (81%)
Query: 5 RRKKEKKKKKKKKKKKKKKKKK 26
R+KK+KKK K+KKKKK ++ K
Sbjct: 684 RKKKKKKKSKEKKKKKNREASK 705
Score = 27.1 bits (60), Expect = 2.4
Identities = 12/21 (57%), Positives = 17/21 (80%)
Query: 5 RRKKEKKKKKKKKKKKKKKKK 25
+K+KKKKK K+KKKKK ++
Sbjct: 682 GSRKKKKKKKSKEKKKKKNRE 702
Score = 26.0 bits (57), Expect = 5.5
Identities = 11/23 (47%), Positives = 19/23 (82%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKK 26
+++KK+KK K+KKKKK ++ K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 25.2 bits (55), Expect = 9.6
Identities = 14/49 (28%), Positives = 27/49 (55%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVF 51
+K++KK+ K+KKKKK ++ K+ + EE + + + P+L
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKL 733
>gnl|CDD|227396 COG5064, SRP1, Karyopherin (importin) alpha [Intracellular
trafficking and secretion].
Length = 526
Score = 29.5 bits (66), Expect = 0.34
Identities = 20/76 (26%), Positives = 40/76 (52%), Gaps = 7/76 (9%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDIS-------QDQSSTPKKPRLVFTDLQR 56
+RR++++ + +K+K+++ K++ E S Q S P+ + +F+D
Sbjct: 27 RRREEQQVELRKQKREELLNKRRNLADVSEEAESSFIPMEQQFYSELPQLTQQLFSDDIE 86
Query: 57 RTLQAIFKETKRPSKE 72
+ LQA++K K SKE
Sbjct: 87 QQLQAVYKFRKLLSKE 102
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 29.4 bits (67), Expect = 0.35
Identities = 8/42 (19%), Positives = 11/42 (26%)
Query: 6 RKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
K EK + K KK + Q +S
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGK 857
Score = 28.2 bits (64), Expect = 0.81
Identities = 5/29 (17%), Positives = 12/29 (41%)
Query: 6 RKKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
+ KK K +K K++ ++ +
Sbjct: 830 KDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 27.1 bits (61), Expect = 2.0
Identities = 8/29 (27%), Positives = 8/29 (27%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
EK K K KK RK
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRK 844
Score = 27.1 bits (61), Expect = 2.4
Identities = 8/32 (25%), Positives = 9/32 (28%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPD 35
K KK K +K K E D
Sbjct: 824 LADAPAKDPAGKKAAVKFSRKTKQQYVASEKD 855
Score = 26.7 bits (60), Expect = 2.8
Identities = 7/24 (29%), Positives = 11/24 (45%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKT 27
K K +K K++ +K K T
Sbjct: 836 KAAVKFSRKTKQQYVASEKDGKAT 859
Score = 26.7 bits (60), Expect = 3.3
Identities = 6/24 (25%), Positives = 8/24 (33%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKT 27
+ K KK K +KT
Sbjct: 822 RYLADAPAKDPAGKKAAVKFSRKT 845
Score = 26.3 bits (59), Expect = 4.7
Identities = 4/29 (13%), Positives = 9/29 (31%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
++ +K + K KK +
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSR 843
Score = 25.9 bits (58), Expect = 5.0
Identities = 6/29 (20%), Positives = 11/29 (37%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
++ + K KK K + K K+
Sbjct: 819 EKLRYLADAPAKDPAGKKAAVKFSRKTKQ 847
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 29.1 bits (65), Expect = 0.36
Identities = 13/41 (31%), Positives = 17/41 (41%)
Query: 7 KKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
KE KK KK + K K K K K++ +S P
Sbjct: 144 AKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGID 184
Score = 28.3 bits (63), Expect = 0.79
Identities = 14/46 (30%), Positives = 18/46 (39%), Gaps = 2/46 (4%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCK--RKEEPDISQDQSSTPKKP 47
K+ KK + K K K KK KKK R + +S K
Sbjct: 148 KKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWKS 193
>gnl|CDD|132580 TIGR03541, reg_near_HchA, LuxR family transcriptional regulatory,
chaperone HchA-associated. Members of this protein
family belong to the LuxR transcriptional regulator
family, and contain both autoinducer binding (pfam03472)
and transcriptional regulator (pfam00196) domains.
Members, however, occur only in a few members of the
Gammaproteobacteria that have the
chaperone/aminopeptidase HchA, and are always encoded by
the adjacent gene.
Length = 232
Score = 29.0 bits (65), Expect = 0.36
Identities = 19/72 (26%), Positives = 29/72 (40%), Gaps = 18/72 (25%)
Query: 29 KRKEEPDISQDQS--STPKKPRLVFTDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPT 86
+R E + ++ S ++ L +T L RR Q IA LG+
Sbjct: 158 RRLLEAPLLREAGVLSEREREVLAWTALGRR----------------QADIAAILGISER 201
Query: 87 TVGNFFMNARRR 98
TV N +ARR+
Sbjct: 202 TVENHLRSARRK 213
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 29.2 bits (66), Expect = 0.37
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKKK 26
+KR+ KE+ KKKK K+ K K K
Sbjct: 248 AEKKRKSKEEIKKKKPKESKGVKALK 273
Score = 28.5 bits (64), Expect = 0.64
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 2 VEKRRKKEKKKKKKKKKKKKKKKKKT 27
EK+RK +++ KKKK K+ K K
Sbjct: 248 AEKKRKSKEEIKKKKPKESKGVKALK 273
>gnl|CDD|223522 COG0445, GidA, Flavin-dependent tRNA uridine
5-carboxymethylaminomethyl modification enzyme GidA
[Cell cycle control, cell division, chromosome
partitioning].
Length = 621
Score = 29.1 bits (66), Expect = 0.38
Identities = 13/55 (23%), Positives = 23/55 (41%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRR 57
++R ++ KKK+ +++ ++ K E P K R DL RR
Sbjct: 457 DERYERFLKKKENIEEEIERLKSTWVTPSEVAKELLALGGQPLKRRSSLYDLLRR 511
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 29.2 bits (66), Expect = 0.39
Identities = 17/41 (41%), Positives = 21/41 (51%)
Query: 6 RKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKK 46
KK KK K K KKKKKK + + SQ+ S+ KK
Sbjct: 349 TKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKK 389
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 28.9 bits (65), Expect = 0.42
Identities = 9/35 (25%), Positives = 19/35 (54%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDIS 37
+ K E+ KK++ + ++KKK+K K + +
Sbjct: 361 PTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 25.1 bits (55), Expect = 9.2
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 2 VEKRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
+R KKE+ + ++KKK+KK K + +
Sbjct: 365 KPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
>gnl|CDD|217667 pfam03666, NPR3, Nitrogen Permease regulator of amino acid
transport activity 3. This family, also known in yeasts
as Rmd11, complexes with NPR2, pfam06218. This complex
heterodimer is responsible for inactivating TORC1. an
evolutionarily conserved protein complex that controls
cell size via nutritional input signals, specifically,
in response to amino acid starvation.
Length = 424
Score = 28.9 bits (65), Expect = 0.45
Identities = 16/62 (25%), Positives = 23/62 (37%)
Query: 12 KKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRRTLQAIFKETKRPSK 71
+KKKKKK+KK + +E S+D SS + E + S
Sbjct: 45 RKKKKKKQKKSDRADPNDDREPSVDSEDSSSKSSSKSESGSLANSDPASDPSSEDSKSSM 104
Query: 72 EM 73
M
Sbjct: 105 TM 106
>gnl|CDD|235501 PRK05559, PRK05559, DNA topoisomerase IV subunit B; Reviewed.
Length = 631
Score = 28.5 bits (65), Expect = 0.63
Identities = 7/19 (36%), Positives = 10/19 (52%)
Query: 3 EKRRKKEKKKKKKKKKKKK 21
+ R + KK K+KKK
Sbjct: 380 AQARLRAAKKVKRKKKTSG 398
Score = 25.1 bits (56), Expect = 9.2
Identities = 7/27 (25%), Positives = 12/27 (44%)
Query: 5 RRKKEKKKKKKKKKKKKKKKKKTCKRK 31
+ + + + KK K+KKK K
Sbjct: 378 KAAQARLRAAKKVKRKKKTSGPALPGK 404
Score = 25.1 bits (56), Expect = 9.8
Identities = 7/24 (29%), Positives = 10/24 (41%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKK 24
K + + KK K+KKK
Sbjct: 375 KAIKAAQARLRAAKKVKRKKKTSG 398
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 28.4 bits (64), Expect = 0.64
Identities = 9/53 (16%), Positives = 24/53 (45%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQ 55
E++ KK + +K+K ++++K K +++ + P ++L
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELT 168
Score = 28.0 bits (63), Expect = 0.84
Identities = 9/44 (20%), Positives = 19/44 (43%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKK 46
++ K ++++K K K +KK+K ++ E S
Sbjct: 126 GEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTV 169
Score = 28.0 bits (63), Expect = 0.95
Identities = 10/55 (18%), Positives = 29/55 (52%), Gaps = 2/55 (3%)
Query: 2 VEKRRKKEKKKKKK--KKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDL 54
V+ +R +++ KK++ +K+K ++++ K K + ++ P+ +D+
Sbjct: 110 VQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDI 164
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 27.9 bits (62), Expect = 0.75
Identities = 14/23 (60%), Positives = 16/23 (69%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKK 25
EK +KK+KK KK KK KK KK
Sbjct: 98 EKPKKKKKKSKKTKKPKKSSKKD 120
Score = 27.5 bits (61), Expect = 0.85
Identities = 13/19 (68%), Positives = 14/19 (73%)
Query: 8 KEKKKKKKKKKKKKKKKKK 26
K+KKKK KK KK KK KK
Sbjct: 101 KKKKKKSKKTKKPKKSSKK 119
Score = 27.5 bits (61), Expect = 1.0
Identities = 13/20 (65%), Positives = 14/20 (70%)
Query: 7 KKEKKKKKKKKKKKKKKKKK 26
KK+KKK KK KK KK KK
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120
Score = 27.2 bits (60), Expect = 1.5
Identities = 13/24 (54%), Positives = 15/24 (62%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKK 26
+ +K KKKKKK KK KK KK
Sbjct: 94 PEPTEKPKKKKKKSKKTKKPKKSS 117
Score = 26.8 bits (59), Expect = 1.6
Identities = 14/24 (58%), Positives = 15/24 (62%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKK 26
E K +KKKKK KK KK KK K
Sbjct: 95 EPTEKPKKKKKKSKKTKKPKKSSK 118
Score = 26.8 bits (59), Expect = 1.7
Identities = 15/23 (65%), Positives = 17/23 (73%)
Query: 9 EKKKKKKKKKKKKKKKKKTCKRK 31
EK KKKKKK KK KK KK+ K+
Sbjct: 98 EKPKKKKKKSKKTKKPKKSSKKD 120
Score = 26.8 bits (59), Expect = 1.9
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 11 KKKKKKKKKKKKKKKKTCKRKE 32
K KKKKKK KK KK K +K+
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 24.8 bits (54), Expect = 8.9
Identities = 12/22 (54%), Positives = 14/22 (63%)
Query: 14 KKKKKKKKKKKKKTCKRKEEPD 35
K KKKKKK KK K K+ + D
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 28.0 bits (63), Expect = 0.86
Identities = 10/36 (27%), Positives = 14/36 (38%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQ 38
EK K+E K K + +K E D +Q
Sbjct: 25 EKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQ 60
Score = 27.3 bits (61), Expect = 1.7
Identities = 8/42 (19%), Positives = 18/42 (42%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTP 44
K ++ E++K K++ K R + + D++ P
Sbjct: 17 LKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDP 58
Score = 25.7 bits (57), Expect = 5.7
Identities = 9/40 (22%), Positives = 15/40 (37%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSST 43
K+ ++EK K++ K K RK + T
Sbjct: 20 KQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPT 59
>gnl|CDD|188606 TIGR04091, LTA_dltB, D-alanyl-lipoteichoic acid biosynthesis
protein DltB. Members of this protein family are DltB,
part of a four-gene operon for D-alanyl-lipoteichoic
acid biosynthesis that is present in the vast majority
of low-GC Gram-positive organisms. This protein may be
involved in transport of D-alanine across the plasma
membrane [Cell envelope, Biosynthesis and degradation of
murein sacculus and peptidoglycan].
Length = 380
Score = 28.3 bits (64), Expect = 0.86
Identities = 8/14 (57%), Positives = 10/14 (71%)
Query: 9 EKKKKKKKKKKKKK 22
++K KK KK KK K
Sbjct: 342 QRKSKKYKKNKKNK 355
Score = 27.1 bits (61), Expect = 1.9
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 11 KKKKKKKKKKKKK 23
+K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355
Score = 27.1 bits (61), Expect = 1.9
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 12 KKKKKKKKKKKKK 24
+K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355
Score = 27.1 bits (61), Expect = 1.9
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 13 KKKKKKKKKKKKK 25
+K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355
Score = 27.1 bits (61), Expect = 1.9
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 14 KKKKKKKKKKKKK 26
+K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355
>gnl|CDD|233723 TIGR02094, more_P_ylases, alpha-glucan phosphorylases. This family
consists of known phosphorylases, and homologs believed
to share the function of using inorganic phosphate to
cleave an alpha 1,4 linkage between the terminal glucose
residue and the rest of the polymer (maltodextrin,
glycogen, etc.). The name of the glucose storage polymer
substrate, and therefore the name of this enzyme,
depends on the chain lengths and branching patterns. A
number of the members of this family have been shown to
operate on small maltodextrins, as may be obtained by
utilization of exogenous sources. This family represents
a distinct clade from the related family modeled by
TIGR02093/pfam00343.
Length = 601
Score = 28.1 bits (63), Expect = 0.92
Identities = 11/28 (39%), Positives = 16/28 (57%), Gaps = 2/28 (7%)
Query: 42 STPKKPRLVFTDLQRRTLQAIFKETKRP 69
+T K+ L+F DL+R L I +RP
Sbjct: 399 ATYKRADLIFRDLER--LARILNNPERP 424
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 27.9 bits (62), Expect = 1.0
Identities = 16/29 (55%), Positives = 22/29 (75%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
EK+ KK+K + K++KKKKK+KKK KR
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKKKKRH 169
Score = 27.1 bits (60), Expect = 1.7
Identities = 15/31 (48%), Positives = 21/31 (67%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
K++K E K++KKKKK+KKKKKK +
Sbjct: 145 KKKKHEDDKERKKKKKEKKKKKKRHSPEHPG 175
Score = 26.3 bits (58), Expect = 3.0
Identities = 13/31 (41%), Positives = 23/31 (74%)
Query: 5 RRKKEKKKKKKKKKKKKKKKKKTCKRKEEPD 35
++ K+KK + K++KKKKK+KK K++ P+
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKRHSPE 172
Score = 25.9 bits (57), Expect = 4.3
Identities = 14/30 (46%), Positives = 21/30 (70%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
K +K+ KKKK + K++KKKKK K+K+
Sbjct: 137 LKGHEKKHKKKKHEDDKERKKKKKEKKKKK 166
>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are typically between 137 to
196 amino acids in length.
Length = 149
Score = 27.7 bits (62), Expect = 1.1
Identities = 15/60 (25%), Positives = 27/60 (45%), Gaps = 3/60 (5%)
Query: 1 MVEKRRKKE---KKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRR 57
M R++ K+ +KK K +K K K++ S+ +S P++V + RR
Sbjct: 1 MASSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASESGIPKVVSNRMARR 60
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 27.7 bits (62), Expect = 1.2
Identities = 12/18 (66%), Positives = 14/18 (77%)
Query: 6 RKKEKKKKKKKKKKKKKK 23
RK+E KK+KKKK KK K
Sbjct: 357 RKREGDKKQKKKKSKKLK 374
Score = 26.6 bits (59), Expect = 3.6
Identities = 10/18 (55%), Positives = 15/18 (83%)
Query: 4 KRRKKEKKKKKKKKKKKK 21
++R+ +KK+KKKK KK K
Sbjct: 357 RKREGDKKQKKKKSKKLK 374
Score = 25.8 bits (57), Expect = 5.8
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 3 EKRRKKEKKKKKKKKKKKKK 22
KR+++ KK+KKKK KK K
Sbjct: 355 PKRKREGDKKQKKKKSKKLK 374
Score = 25.4 bits (56), Expect = 7.5
Identities = 11/20 (55%), Positives = 14/20 (70%)
Query: 8 KEKKKKKKKKKKKKKKKKKT 27
K K++ KK+KKKK KK K
Sbjct: 356 KRKREGDKKQKKKKSKKLKL 375
>gnl|CDD|206687 cd01900, YchF, YchF GTPase. YchF is a member of the Obg family,
which includes four other subfamilies of GTPases: Obg,
DRG, Ygr210, and NOG1. Obg is an essential gene that is
involved in DNA replication in C. crescentus and
Streptomyces griseus and is associated with the
ribosome. Several members of the family, including YchF,
possess the TGS domain related to the RNA-binding
proteins. Experimental data and genomic analysis suggest
that YchF may be part of a nucleoprotein complex and may
function as a GTP-dependent translational factor.
Length = 274
Score = 27.4 bits (62), Expect = 1.4
Identities = 8/24 (33%), Positives = 15/24 (62%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKK 24
+EKR ++ +KK K K+ K++
Sbjct: 135 TIEKRLERLEKKAKSGDKEAKEEL 158
>gnl|CDD|180803 PRK07037, PRK07037, extracytoplasmic-function sigma-70 factor;
Validated.
Length = 163
Score = 27.3 bits (61), Expect = 1.5
Identities = 18/40 (45%), Positives = 22/40 (55%), Gaps = 3/40 (7%)
Query: 52 TDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPTTVGNF 91
++L RT A E R E Q IAR+LG+ PT V NF
Sbjct: 108 SELPARTRYAF--EMYRLHGETQKDIARELGVSPTLV-NF 144
>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 881
Score = 27.2 bits (60), Expect = 1.8
Identities = 20/95 (21%), Positives = 40/95 (42%), Gaps = 9/95 (9%)
Query: 1 MVEKRRKKEK--KKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVF------- 51
MV++ KK++ ++ + +K+ KK K++EE +S++ + L+
Sbjct: 1 MVDESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEEDAMLKGDLELLVERIQDPD 60
Query: 52 TDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPT 86
DLQ +L + + K + M L P
Sbjct: 61 IDLQNNSLNMLKEVIKSSTSSMTAVPKPLKFLRPN 95
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 27.3 bits (60), Expect = 1.9
Identities = 11/43 (25%), Positives = 27/43 (62%)
Query: 5 RRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
R+ +++++ K K KK+++K+K +++ E + +S+ PK
Sbjct: 57 RKGQQQRESKPKISKKERRKRKQAEKETEGKTEEKKSTAPKNA 99
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 26.9 bits (60), Expect = 1.9
Identities = 15/48 (31%), Positives = 24/48 (50%)
Query: 2 VEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRL 49
VE + + ++++ KKKKKKKKKK K + S+ P +
Sbjct: 169 VELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSS 216
Score = 25.0 bits (55), Expect = 9.8
Identities = 14/58 (24%), Positives = 23/58 (39%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRRTLQA 61
+ + +KKKKKKKKK KK + + S+ L+++ A
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIA 234
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 27.1 bits (60), Expect = 2.0
Identities = 13/28 (46%), Positives = 13/28 (46%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
K K K KKK K K KK T KR
Sbjct: 757 SASTKGKAAKTVKKKSKAKSKKTTKKRA 784
Score = 26.7 bits (59), Expect = 2.8
Identities = 11/32 (34%), Positives = 17/32 (53%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
EK+ K +KK K K K KKK+ + ++
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKT 779
Score = 25.9 bits (57), Expect = 5.7
Identities = 13/31 (41%), Positives = 16/31 (51%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
K + K K KKK K K KKT K++ P
Sbjct: 756 KSASTKGKAAKTVKKKSKAKSKKTTKKRAGP 786
>gnl|CDD|236583 PRK09601, PRK09601, GTP-binding protein YchF; Reviewed.
Length = 364
Score = 27.0 bits (61), Expect = 2.0
Identities = 9/24 (37%), Positives = 15/24 (62%)
Query: 2 VEKRRKKEKKKKKKKKKKKKKKKK 25
VEKR ++ +KK K K+ K + +
Sbjct: 140 VEKRLERLEKKAKGGDKEAKAELE 163
>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist.
LIN-8 is a nuclear protein, present at the sites of
transcriptional repressor complexes, which interacts
with LIN-35 Rb.Lin35 Rb is a product of the class B
synMuv gene lin-35 which silences genes required for
vulval specification through chromatin modification and
remodelling. The biological role of the interaction has
not yet been determined however predictions have been
made. The interaction shows that class A synMuv genes
control vulval induction through the transcriptional
regulation of gene expression. LIN-8 normally functions
as part of a protein complex however when the complex
is absent, other family members can partially replace
LIN-8 activity.
Length = 316
Score = 27.0 bits (60), Expect = 2.1
Identities = 19/77 (24%), Positives = 32/77 (41%), Gaps = 10/77 (12%)
Query: 2 VEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEE-PDI-SQDQSSTPKKPRLVFTDLQRRT- 58
+++ + EK K K K KK E+ P++ ++ KK +V ++ RT
Sbjct: 3 LKEYLELEKTKFGKYATKDVTLKKVVLSLLEKRPEMWKRNTRGQAKKWAIVGVEVYERTG 62
Query: 59 -------LQAIFKETKR 68
L+ IFK K
Sbjct: 63 KLVSVKDLRKIFKTAKD 79
>gnl|CDD|240334 PTZ00258, PTZ00258, GTP-binding protein; Provisional.
Length = 390
Score = 26.8 bits (60), Expect = 2.1
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKK 25
VEKR + KK+KKKKKKK++K +
Sbjct: 158 FVEKRLDELTKKRKKKKKKKEEKVE 182
>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
Length = 763
Score = 27.2 bits (60), Expect = 2.2
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKKKT 27
+ + + + E + K KK KK KKKK
Sbjct: 233 IEKPQAEAENEAGKSDKKDKKSKKKKV 259
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 27.0 bits (60), Expect = 2.4
Identities = 12/47 (25%), Positives = 21/47 (44%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
+ + + ++ + K+ KKKKKKKK K+K + P
Sbjct: 39 FLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVW 85
Score = 26.6 bits (59), Expect = 3.5
Identities = 12/35 (34%), Positives = 20/35 (57%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPD 35
M E + K++K++ K+K KKK K K+ + D
Sbjct: 2 MNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDD 36
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 26.5 bits (59), Expect = 2.4
Identities = 19/45 (42%), Positives = 27/45 (60%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
K+R K +KKK+KKKKKKK KK + KE S++ S ++
Sbjct: 84 AKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEG 128
Score = 25.4 bits (56), Expect = 5.9
Identities = 19/31 (61%), Positives = 23/31 (74%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
EK KK K++KKK+KKKKKKK K +KEE
Sbjct: 81 EKTAKKRAKRQKKKQKKKKKKKAKKGNKKEE 111
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 26.7 bits (60), Expect = 2.6
Identities = 15/72 (20%), Positives = 30/72 (41%), Gaps = 3/72 (4%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQD--QSSTPKKPRLVFTDLQRRTLQ 60
+++ KE + KK + +KKK+ K KE P + + + +R ++
Sbjct: 183 DEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEEITPMKEINEEERRVVVE 242
Query: 61 A-IFKETKRPSK 71
+FK + K
Sbjct: 243 GYVFKVEIKELK 254
>gnl|CDD|114472 pfam05748, Rubella_E1, Rubella membrane glycoprotein E1. Rubella
virus (RV), the sole member of the genus Rubivirus
within the family Togaviridae, is a small enveloped,
positive strand RNA virus. The nucleocapsid consists of
40S genomic RNA and a single species of capsid protein
which is enveloped within a host-derived lipid bilayer
containing two viral glycoproteins, E1 (58 kDa) and E2
(42-46 kDa). In virus infected cells, RV matures by
budding either at the plasma membrane, or at the
internal membranes depending on the cell type and enters
adjacent uninfected cells by a membrane fusion process
in the endosome, directed by E1-E2 heterodimers. The
heterodimer formation is crucial for E1 transport out of
the endoplasmic reticulum to the Golgi and plasma
membrane. In RV E1, a cysteine at position 82 is crucial
for the E1-E2 heterodimer formation and cell surface
expression of the two proteins. The E1 has been shown to
be a type 1 membrane protein, rich in cysteine residues
with extensive intramolecular disulfide bonds.
Length = 496
Score = 27.0 bits (59), Expect = 2.6
Identities = 11/26 (42%), Positives = 16/26 (61%), Gaps = 1/26 (3%)
Query: 28 CKRKEEPDISQDQSSTPKKPRLVFTD 53
C+R PD S+ +TP++PRL D
Sbjct: 250 CQR-HSPDCSRLVGATPERPRLRLVD 274
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 26.7 bits (60), Expect = 2.6
Identities = 12/25 (48%), Positives = 19/25 (76%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKT 27
E R++ K +KK+KKKKK+K+K+
Sbjct: 611 EARKRLNKANEKKEKKKKKQKEKQE 635
Score = 25.9 bits (58), Expect = 6.0
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 5 RRKKEKKKKKKKKKKKKKKKKKT 27
+ ++ KKKKKKK K K K +T
Sbjct: 680 EKIQKPKKKKKKKPKTVKPKPRT 702
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 26.9 bits (60), Expect = 2.7
Identities = 10/39 (25%), Positives = 20/39 (51%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQD 39
V +KKK++KK+++K+ K+ ++ D D
Sbjct: 173 GVPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDD 211
Score = 26.5 bits (59), Expect = 2.9
Identities = 10/40 (25%), Positives = 22/40 (55%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQ 40
+V +KKK++KK+++K+ K + ++ D +D
Sbjct: 171 LVGVPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDD 210
Score = 26.5 bits (59), Expect = 3.1
Identities = 9/42 (21%), Positives = 21/42 (50%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSS 42
+ EKKK++KK+++K+ K+ + +E + +
Sbjct: 174 VPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGG 215
Score = 25.7 bits (57), Expect = 6.1
Identities = 11/47 (23%), Positives = 26/47 (55%)
Query: 10 KKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQR 56
+KKK++KK+++K++ + +E+ D +D+ + + QR
Sbjct: 178 SSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEGGEERQR 224
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 27.0 bits (59), Expect = 2.8
Identities = 11/45 (24%), Positives = 32/45 (71%)
Query: 5 RRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRL 49
+R+ E+K +++++++K+K+K++ +R+ E + + SS+ + R+
Sbjct: 589 KREAEQKAREEREREKEKEKEREREREREAERAAKASSSSHESRM 633
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 26.6 bits (59), Expect = 2.8
Identities = 19/40 (47%), Positives = 25/40 (62%)
Query: 7 KKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKK 46
KKE ++K+K K KKKK KKK K K++ D +D S K
Sbjct: 75 KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 26.6 bits (59), Expect = 3.5
Identities = 13/32 (40%), Positives = 22/32 (68%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
+ + +K KKKKKK+KKK+++ K++ R E
Sbjct: 737 ISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768
Score = 25.8 bits (57), Expect = 5.7
Identities = 15/34 (44%), Positives = 23/34 (67%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDI 36
E + ++K KKKKKK+KKK++ KR+E+ I
Sbjct: 734 EFKISDSVEEKTKKKKKKEKKKEEEYKREEKARI 767
>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
Length = 977
Score = 26.5 bits (58), Expect = 3.5
Identities = 16/57 (28%), Positives = 25/57 (43%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRRTL 59
+ R + K KK KKKK KKK+ + E + + S+ +P + R L
Sbjct: 115 KNRVESWKDKKNKKKKSAKKKEAHKAQIPPEQQMEEKPSADAYEPLSRVIPIPRNKL 171
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 26.2 bits (58), Expect = 4.0
Identities = 13/45 (28%), Positives = 23/45 (51%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
E + K + K K K KK +++ K+ K E S +++ P +P
Sbjct: 96 EPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTAPARP 140
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 26.1 bits (58), Expect = 4.0
Identities = 8/45 (17%), Positives = 14/45 (31%)
Query: 2 VEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKK 46
+ K++ K K K KKK K ++ +
Sbjct: 74 IPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDL 118
Score = 25.7 bits (57), Expect = 5.8
Identities = 9/54 (16%), Positives = 15/54 (27%)
Query: 2 VEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQ 55
+ +KK K K K KKK + + + D+
Sbjct: 72 SDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDID 125
>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
Length = 472
Score = 26.1 bits (58), Expect = 4.2
Identities = 13/33 (39%), Positives = 22/33 (66%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDI 36
+R +KEK+KK KK+ K++++ K EE +I
Sbjct: 392 EREEKEKEKKAKKEGKEERRIHFQNKSIEEAEI 424
>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694). Family of
hypothetical bacterial proteins.
Length = 278
Score = 26.0 bits (58), Expect = 4.3
Identities = 14/24 (58%), Positives = 18/24 (75%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKT 27
R +E K+KKKKKKKKKKK++
Sbjct: 79 ADRAEELLKEKKKKKKKKKKKEEL 102
Score = 25.3 bits (56), Expect = 7.0
Identities = 14/26 (53%), Positives = 20/26 (76%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCK 29
+ KEKKKKKKKKKKK++ ++ C+
Sbjct: 83 EELLKEKKKKKKKKKKKEELREWMCE 108
>gnl|CDD|225767 COG3226, COG3226, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 204
Score = 26.0 bits (57), Expect = 4.5
Identities = 10/40 (25%), Positives = 15/40 (37%)
Query: 51 FTDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPTTVGN 90
FT++ R LQA F + +A G + T
Sbjct: 68 FTEIMARQLQAFFSDVSDDRGACAAVLADLYGSQVATPDR 107
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 26.1 bits (58), Expect = 4.6
Identities = 8/26 (30%), Positives = 15/26 (57%)
Query: 8 KEKKKKKKKKKKKKKKKKKTCKRKEE 33
+EKKK KK +K+ + + +K +
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKRK 109
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 26.0 bits (57), Expect = 4.7
Identities = 12/19 (63%), Positives = 14/19 (73%)
Query: 7 KKEKKKKKKKKKKKKKKKK 25
+E K+KKKKK KKKK K
Sbjct: 86 NQENKQKKKKKDKKKKSPK 104
>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV.
Length = 336
Score = 25.9 bits (57), Expect = 5.5
Identities = 11/44 (25%), Positives = 24/44 (54%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
R + + K +K K+ KK+KK+ K + + ++ +S P++
Sbjct: 20 GPRPVKDEAKPRKIKRVKKRKKREEKDELDDEVEFVRSFAPRRR 63
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 25.7 bits (57), Expect = 5.6
Identities = 10/24 (41%), Positives = 13/24 (54%)
Query: 5 RRKKEKKKKKKKKKKKKKKKKKTC 28
R K + ++ K KKK KKK C
Sbjct: 782 HRSKLRSSQRDKLKKKLLKKKDFC 805
>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
beta and delta phosphatidylinositol 3-kinases (PI3Ks).
PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
regulate cell processes such as cell growth,
differentiation, proliferation, and motility. PI3Ks
work on phosphorylation of phosphatidylinositol,
phosphatidylinositide (4)P (PtdIns (4)P),2 or
PtdIns(4,5)P2. Specifically they phosphorylate the D3
hydroxyl group of phosphoinositol lipids on the inositol
ring. There are 3 classes of PI3Ks based on structure,
regulation, and specificity. All classes contain a C2
domain, a PIK domain, and a kinase catalytic domain.
The members here are class I, beta and delta isoforms of
PI3Ks and contain both a Ras-binding domain and a
p85-binding domain. Class II PI3Ks contain both of
these as well as a PX domain, and a C-terminal C2 domain
containing a nuclear localization signal. C2 domains
fold into an 8-standed beta-sandwich that can adopt 2
structural arrangements: Type I and Type II,
distinguished by a circular permutation involving their
N- and C-terminal beta strands. Many C2 domains are
Ca2+-dependent membrane-targeting modules that bind a
wide variety of substances including bind phospholipids,
inositol polyphosphates, and intracellular proteins.
Most C2 domain proteins are either signal transduction
enzymes that contain a single C2 domain, such as protein
kinase C, or membrane trafficking proteins which contain
at least two C2 domains, such as synaptotagmin 1.
However, there are a few exceptions to this including
RIM isoforms and some splice variants of piccolo/aczonin
and intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. Members have a type-I
topology.
Length = 173
Score = 25.4 bits (56), Expect = 5.8
Identities = 10/22 (45%), Positives = 15/22 (68%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKK 24
E +K + K+ +K + KKKKKK
Sbjct: 84 EVSKKAKGKRSRKNQTKKKKKK 105
Score = 25.4 bits (56), Expect = 7.0
Identities = 11/20 (55%), Positives = 14/20 (70%)
Query: 6 RKKEKKKKKKKKKKKKKKKK 25
KK K K+ +K + KKKKKK
Sbjct: 86 SKKAKGKRSRKNQTKKKKKK 105
>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein. This model
describes nucleoid occlusion protein, a close homolog to
ParB chromosome partitioning proteins including Spo0J in
Bacillus subtilis. Its gene often is located near the
gene for the Spo0J ortholog. This protein bind a
specific DNA sequence and blocks cytokinesis from
happening until chromosome segregation is complete.
Length = 255
Score = 25.5 bits (57), Expect = 5.8
Identities = 11/23 (47%), Positives = 16/23 (69%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKK 25
E+ KK +K +K KKKKK++K
Sbjct: 188 EELIKKLLEKPEKPKKKKKRRKG 210
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 25.8 bits (57), Expect = 5.9
Identities = 11/28 (39%), Positives = 26/28 (92%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKR 30
E+RR++EK+++KK+ K++K+++K+T ++
Sbjct: 182 ERRRREEKERRKKQDKERKQREKETAEK 209
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 25.8 bits (57), Expect = 6.0
Identities = 10/23 (43%), Positives = 15/23 (65%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKK 26
++RK KK KK KK+K+K+
Sbjct: 225 RKRKAPKKVAKKVAAAKKRKQKR 247
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 26.0 bits (57), Expect = 6.0
Identities = 11/31 (35%), Positives = 18/31 (58%), Gaps = 3/31 (9%)
Query: 3 EKRRKKE---KKKKKKKKKKKKKKKKKTCKR 30
E+RR +E + K +K + K + K+KT K
Sbjct: 441 EERRLREEAIRAKAEKTARMKAEMKEKTLKM 471
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 25.5 bits (56), Expect = 6.6
Identities = 10/29 (34%), Positives = 22/29 (75%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
EK +++E +KK+K++ + K+K++ +RK
Sbjct: 405 EKLKQEENEKKQKEQADEDKEKRQKDERK 433
>gnl|CDD|217611 pfam03552, Cellulose_synt, Cellulose synthase. Cellulose, an
aggregate of unbranched polymers of beta-1,4-linked
glucose residues, is the major component of wood and
thus paper, and is synthesised by plants, most algae,
some bacteria and fungi, and even some animals. The
genes that synthesise cellulose in higher plants differ
greatly from the well-characterized genes found in
Acetobacter and Agrobacterium sp. More correctly
designated as 'cellulose synthase catalytic subunits',
plant cellulose synthase (CesA) proteins are integral
membrane proteins, approximately 1,000 amino acids in
length. There are a number of highly conserved residues,
including several motifs shown to be necessary for
processive glycosyltransferase activity.
Length = 716
Score = 25.9 bits (57), Expect = 6.7
Identities = 11/21 (52%), Positives = 13/21 (61%)
Query: 13 KKKKKKKKKKKKKKTCKRKEE 33
++KK K K KKK KRKE
Sbjct: 312 RRKKPKSASKAKKKDSKRKES 332
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 25.0 bits (55), Expect = 7.0
Identities = 16/27 (59%), Positives = 20/27 (74%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCK 29
K++K++KKKKKKKKKK KK K K
Sbjct: 109 AKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 25.0 bits (55), Expect = 8.3
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
K+ KEK K+KK+KKKKKKKKKK +K
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAK 132
Score = 24.6 bits (54), Expect = 9.9
Identities = 18/29 (62%), Positives = 23/29 (79%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
EK ++K++KKKKKKKKKKK KK K+K
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
>gnl|CDD|240399 PTZ00393, PTZ00393, protein tyrosine phosphatase; Provisional.
Length = 241
Score = 25.3 bits (55), Expect = 7.0
Identities = 15/32 (46%), Positives = 19/32 (59%), Gaps = 2/32 (6%)
Query: 2 VEKRRKKEKKKKKKK--KKKKKKKKKKTCKRK 31
+ RRK K++ + K KKKKKKK C RK
Sbjct: 206 IRDRRKGAINKRQLQFLKAYKKKKKKKNCLRK 237
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 25.4 bits (56), Expect = 7.5
Identities = 13/67 (19%), Positives = 22/67 (32%), Gaps = 4/67 (5%)
Query: 5 RRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRRTLQAIFK 64
KK+ + KKK K KK + +S S P + + R ++
Sbjct: 153 AEKKKPRSKKKSSKNSKKLRIDLVGEW----LSTAISPPDSSPVSLDAREKLRAYLPKWE 208
Query: 65 ETKRPSK 71
+ K
Sbjct: 209 GKTKNKK 215
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 25.3 bits (56), Expect = 8.2
Identities = 12/26 (46%), Positives = 17/26 (65%)
Query: 8 KEKKKKKKKKKKKKKKKKKTCKRKEE 33
+E+K+KKK K++KK K K K E
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLE 120
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 25.0 bits (55), Expect = 8.4
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 6 RKKEKKKKKKKKKKKKKKKKKTCKRK 31
++EKKKKKKK KKKK KK
Sbjct: 178 AEEEKKKKKKKSAKKKKLKKVAAVGM 203
Score = 25.0 bits (55), Expect = 9.1
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
++K +++KKKKKKK KKK K+
Sbjct: 168 AARDKKKSNNAEEEKKKKKKKSAKKKKLKKV 198
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 25.1 bits (55), Expect = 8.5
Identities = 12/38 (31%), Positives = 22/38 (57%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQ 40
+K+R +K KK KKK K+K + +++P+ D+
Sbjct: 52 KKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDK 89
>gnl|CDD|220633 pfam10214, Rrn6, RNA polymerase I-specific transcription-initiation
factor. RNA polymerase I-specific
transcription-initiation factor Rrn6 and Rrn7 represent
components of a multisubunit transcription factor
essential for the initiation of rDNA transcription by
Pol I. These proteins are found in fungi.
Length = 753
Score = 25.5 bits (56), Expect = 8.6
Identities = 8/14 (57%), Positives = 10/14 (71%)
Query: 7 KKEKKKKKKKKKKK 20
+ KKKKKKK+K
Sbjct: 737 GRSASKKKKKKKRK 750
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 25.4 bits (56), Expect = 8.7
Identities = 9/43 (20%), Positives = 15/43 (34%)
Query: 4 KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKK 46
K+EK K K+ K + + +D S P +
Sbjct: 217 NINKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLE 259
>gnl|CDD|239463 cd03347, eu_PheOH, Eukaryotic phenylalanine-4-hydroxylase
(eu_PheOH); a member of the biopterin-dependent
aromatic amino acid hydroxylase family of non-heme,
iron(II)-dependent enzymes that also includes
prokaryotic phenylalanine-4-hydroxylase (pro_PheOH),
eukaryotic tyrosine hydroxylase (TyrOH) and eukaryotic
tryptophan hydroxylase (TrpOH). PheOH catalyzes the
first and rate-limiting step in the metabolism of the
amino acid L-phenylalanine (L-Phe), the hydroxylation
of L-Phe to L-tyrosine (L-Tyr). It uses
(6R)-L-erythro-5,6,7,8-tetrahydrobiopterin (BH4) as the
physiological electron donor. The catalytic activity of
the tetrameric enzyme is tightly regulated by the
binding of L-Phe and BH4 as well as by phosphorylation.
Mutations in the human enzyme are linked to a severe
variant of phenylketonuria.
Length = 306
Score = 25.1 bits (55), Expect = 9.4
Identities = 12/40 (30%), Positives = 23/40 (57%)
Query: 29 KRKEEPDISQDQSSTPKKPRLVFTDLQRRTLQAIFKETKR 68
+RKE DI+ + PR+ +T+ +++T +F+E K
Sbjct: 39 RRKEFADIAYNYKHGQPIPRVEYTEEEKKTWGTVFRELKS 78
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 25.1 bits (55), Expect = 9.6
Identities = 13/23 (56%), Positives = 14/23 (60%)
Query: 3 EKRRKKEKKKKKKKKKKKKKKKK 25
K K KK KK KKKKKKK+
Sbjct: 195 YKGDKSSAKKDKKSKKKKKKKRS 217
>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
topoisomerase IA [General function prediction only].
Length = 298
Score = 25.1 bits (55), Expect = 9.8
Identities = 12/31 (38%), Positives = 16/31 (51%)
Query: 1 MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
++ +RR K KK KK K KKT +K
Sbjct: 251 LLAERRAKGGPGKKPAKKATAAKAKKTTAKK 281
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.311 0.126 0.352
Gapped
Lambda K H
0.267 0.0685 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,616,691
Number of extensions: 483117
Number of successful extensions: 9725
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6008
Number of HSP's successfully gapped: 1523
Length of query: 111
Length of database: 10,937,602
Length adjustment: 75
Effective length of query: 36
Effective length of database: 7,611,052
Effective search space: 273997872
Effective search space used: 273997872
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 53 (24.3 bits)