RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14571
(123 letters)
>gnl|CDD|200956 pfam00046, Homeobox, Homeobox domain.
Length = 57
Score = 51.3 bits (124), Expect = 4e-10
Identities = 21/40 (52%), Positives = 25/40 (62%)
Query: 80 RTNFTNKQLTELEKEFHFNKYLTRARRIEIASVLQLNETQ 119
RT FT +QL ELEKEF N+Y + R E+A L L E Q
Sbjct: 4 RTTFTPEQLEELEKEFEKNRYPSAEEREELAKKLGLTERQ 43
>gnl|CDD|197696 smart00389, HOX, Homeodomain. DNA-binding factors that are
involved in the transcriptional regulation of key
developmental processes.
Length = 57
Score = 46.5 bits (111), Expect = 3e-08
Identities = 22/40 (55%), Positives = 27/40 (67%)
Query: 80 RTNFTNKQLTELEKEFHFNKYLTRARRIEIASVLQLNETQ 119
RT+FT +QL ELEKEF N Y +R R E+A L L+E Q
Sbjct: 5 RTSFTPEQLEELEKEFQKNPYPSREEREELAKKLGLSERQ 44
>gnl|CDD|238039 cd00086, homeodomain, Homeodomain; DNA binding domains involved in
the transcriptional regulation of key eukaryotic
developmental processes; may bind to DNA as monomers or
as homo- and/or heterodimers, in a sequence-specific
manner.
Length = 59
Score = 44.2 bits (105), Expect = 2e-07
Identities = 22/40 (55%), Positives = 25/40 (62%)
Query: 80 RTNFTNKQLTELEKEFHFNKYLTRARRIEIASVLQLNETQ 119
RT FT +QL ELEKEF N Y +R R E+A L L E Q
Sbjct: 4 RTRFTPEQLEELEKEFEKNPYPSREEREELAKELGLTERQ 43
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 43.8 bits (104), Expect = 4e-06
Identities = 15/36 (41%), Positives = 23/36 (63%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELS 38
++ EKK+K+K+K K KK+ + K I K + PS S
Sbjct: 397 AKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTS 432
Score = 39.9 bits (94), Expect = 1e-04
Identities = 13/28 (46%), Positives = 20/28 (71%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
K+ K+ +KK+K+K+K K KK+ R TK
Sbjct: 393 KKVLAKRAEKKEKEKEKPKVKKRHRDTK 420
Score = 39.2 bits (92), Expect = 2e-04
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
R KKEK+K+K K KK+ + K +
Sbjct: 399 RAEKKEKEKEKPKVKKRHRDTKNIGK 424
Score = 37.6 bits (88), Expect = 5e-04
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
KK K+ +KK+K+K+K K KKR
Sbjct: 391 PSKKVLAKRAEKKEKEKEKPKVKKR 415
Score = 36.1 bits (84), Expect = 0.002
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+K K KK K+ +KK+K+K+K ++ K
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEKEKPKVKK 414
Score = 36.1 bits (84), Expect = 0.002
Identities = 9/25 (36%), Positives = 17/25 (68%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKK 26
++++KEK K KK+ + K K++K
Sbjct: 403 KEKEKEKPKVKKRHRDTKNIGKRRK 427
Score = 36.1 bits (84), Expect = 0.002
Identities = 8/26 (30%), Positives = 17/26 (65%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
+K K+++K K KK+ + K K+++
Sbjct: 402 KKEKEKEKPKVKKRHRDTKNIGKRRK 427
Score = 34.1 bits (79), Expect = 0.011
Identities = 12/29 (41%), Positives = 16/29 (55%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
KK K KK K+ +KK+K+K KV
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKV 412
Score = 33.4 bits (77), Expect = 0.020
Identities = 10/29 (34%), Positives = 17/29 (58%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
K+ + KK K+ +KK+K+K+K K
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEKEKPKVKK 414
Score = 33.0 bits (76), Expect = 0.021
Identities = 7/26 (26%), Positives = 15/26 (57%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKK 26
++++K K KK+ + K K++K
Sbjct: 404 EKEKEKPKVKKRHRDTKNIGKRRKPS 429
Score = 30.7 bits (70), Expect = 0.17
Identities = 6/25 (24%), Positives = 13/25 (52%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKK 26
+++ K KK+ + K K++K
Sbjct: 407 KEKPKVKKRHRDTKNIGKRRKPSGT 431
Score = 28.8 bits (65), Expect = 0.77
Identities = 9/32 (28%), Positives = 17/32 (53%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQI 33
+ ++K K KK K+ +KK+K K ++
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKV 412
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 39.6 bits (93), Expect = 1e-04
Identities = 17/25 (68%), Positives = 23/25 (92%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
K+K+E+KK +K+KKKKK+KKK KKR
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKR 408
Score = 39.6 bits (93), Expect = 1e-04
Identities = 15/26 (57%), Positives = 24/26 (92%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
+KR+++K +K+KKKKK+KKK KK+K+
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKK 410
Score = 39.2 bits (92), Expect = 2e-04
Identities = 17/26 (65%), Positives = 22/26 (84%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
R+ KK +K+KKKKK+KKK KK+KKK
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKG 412
Score = 38.0 bits (89), Expect = 4e-04
Identities = 15/26 (57%), Positives = 24/26 (92%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
+K+++EKK +K+KKKKK+KKK KK++
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRK 409
Score = 36.9 bits (86), Expect = 0.001
Identities = 14/25 (56%), Positives = 21/25 (84%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
+KK ++KK +K+KKKKK+KKK K+
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKK 407
Score = 36.9 bits (86), Expect = 0.001
Identities = 16/25 (64%), Positives = 22/25 (88%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
++K +K+KKKKK+KKK KK+KKK R
Sbjct: 389 EKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 36.9 bits (86), Expect = 0.001
Identities = 14/26 (53%), Positives = 22/26 (84%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
K+ +++KKKKK+KKK KK+KKK ++
Sbjct: 389 EKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 33.8 bits (78), Expect = 0.011
Identities = 14/26 (53%), Positives = 22/26 (84%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRITK 30
KK++++KK +K+KKKKK+KKK + K
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRK 409
Score = 32.2 bits (74), Expect = 0.039
Identities = 13/26 (50%), Positives = 20/26 (76%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
K K KKK+++KK +K+KKKKK++
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKKKKRK 402
Score = 32.2 bits (74), Expect = 0.041
Identities = 14/25 (56%), Positives = 21/25 (84%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
K K+K+++KK +K+KKKKK+KKK
Sbjct: 381 KPPKKKREEKKPQKRKKKKKRKKKG 405
Score = 31.1 bits (71), Expect = 0.10
Identities = 13/25 (52%), Positives = 19/25 (76%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
K K K KKK+++KK +K+KKKK+
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKK 400
Score = 30.7 bits (70), Expect = 0.16
Identities = 12/26 (46%), Positives = 21/26 (80%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
+++ + KKK+++KK +K+KKKKKR
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKR 401
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 35.2 bits (82), Expect = 0.003
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRI 28
+ KK +K K K +KKK KK++ K R+
Sbjct: 73 KAEKKAEKAKAKAEKKKAKKEEPKPRL 99
Score = 33.3 bits (77), Expect = 0.013
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKK 26
+ EK +KK +K K K +KKK K
Sbjct: 68 LKAWEKAEKKAEKAKAKAEKKKAK 91
Score = 32.9 bits (76), Expect = 0.018
Identities = 10/29 (34%), Positives = 17/29 (58%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
K +K +KK +K K K +KKK + + +
Sbjct: 68 LKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96
Score = 31.3 bits (72), Expect = 0.054
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
K ++ +KK +K K K +KKK KK K
Sbjct: 68 LKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96
Score = 31.3 bits (72), Expect = 0.066
Identities = 12/26 (46%), Positives = 16/26 (61%), Gaps = 2/26 (7%)
Query: 5 KKEKKK--KKKKKKKKKKKKKKKKRI 28
KKE K K +KK +K K K +KK+
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKA 90
Score = 31.3 bits (72), Expect = 0.067
Identities = 11/25 (44%), Positives = 15/25 (60%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
K K +K +KK +K K K +KKK
Sbjct: 66 KELKAWEKAEKKAEKAKAKAEKKKA 90
Score = 27.1 bits (61), Expect = 2.1
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 8 KKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
KK+ K +K +KK +K K + K + E
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKE 93
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
Provisional.
Length = 330
Score = 34.4 bits (80), Expect = 0.006
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRI 28
K + +KKK+K++ K K K K R+
Sbjct: 66 LKAWHKAQKKKEKQEAKAAKAKSKPRL 92
Score = 32.5 bits (75), Expect = 0.034
Identities = 10/22 (45%), Positives = 14/22 (63%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKK 26
KKE K K +KKK+K++ K
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAA 84
Score = 31.3 bits (72), Expect = 0.088
Identities = 10/25 (40%), Positives = 14/25 (56%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
K K K +KKK+K++ K K K
Sbjct: 64 KELKAWHKAQKKKEKQEAKAAKAKS 88
Score = 29.0 bits (66), Expect = 0.58
Identities = 8/22 (36%), Positives = 14/22 (63%)
Query: 6 KEKKKKKKKKKKKKKKKKKKKR 27
K++ K K +KKK+K++ K
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAA 84
Score = 28.3 bits (64), Expect = 0.91
Identities = 9/30 (30%), Positives = 18/30 (60%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
++ +KK+ K K +KKK+K++ + K
Sbjct: 56 LKAALLDKKELKAWHKAQKKKEKQEAKAAK 85
>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
Length = 977
Score = 34.2 bits (78), Expect = 0.012
Identities = 23/70 (32%), Positives = 33/70 (47%), Gaps = 6/70 (8%)
Query: 6 KEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLTALKYPPS 65
K + + K KK KKKK KKK K QIP E PS D PL+ + P
Sbjct: 115 KNRVESWKDKKNKKKKSAKKKEAHKAQIPPE------QQMEEKPSADAYEPLSRVIPIPR 168
Query: 66 AALSQHQSLL 75
L+ +++++
Sbjct: 169 NKLTPYRAVI 178
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 32.5 bits (74), Expect = 0.039
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 10 KKKKKKKKKKKKKKKKKRITKVQI 33
+KKKKKKK K+KKKKK R QI
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQI 707
Score = 31.4 bits (71), Expect = 0.095
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKK 26
RKK+KKKK K+KKKKK ++ K+
Sbjct: 682 GSRKKKKKKKSKEKKKKKNREASKQ 706
Score = 31.4 bits (71), Expect = 0.11
Identities = 14/24 (58%), Positives = 20/24 (83%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRI 28
+K+KKKKK K+KKKKK ++ K+I
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQI 707
Score = 30.6 bits (69), Expect = 0.20
Identities = 13/24 (54%), Positives = 19/24 (79%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKK 24
RK+KK+KK K+KKKKK ++ K+
Sbjct: 683 SRKKKKKKKSKEKKKKKNREASKQ 706
>gnl|CDD|236555 PRK09537, pylS, pyrolysyl-tRNA synthetase; Reviewed.
Length = 417
Score = 32.5 bits (74), Expect = 0.039
Identities = 19/86 (22%), Positives = 36/86 (41%), Gaps = 3/86 (3%)
Query: 11 KKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLTALKYPPSAALSQ 70
KKKK K + + + +P++ G P P+ P + A PS +
Sbjct: 101 PTKKKKAMPKSVVRAPKPLENPVPAQAESSGSKPVPSIPVSTPEVKAPAPALTPS-QKDR 159
Query: 71 HQSLLNNSGRTNFTNK--QLTELEKE 94
++LL+ + + ++ + ELE E
Sbjct: 160 LETLLSPKDKISLNSEKPKFKELESE 185
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 32.5 bits (75), Expect = 0.044
Identities = 11/37 (29%), Positives = 25/37 (67%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELS 38
K ++ ++ +++++KKKKK+K +K +V IP ++
Sbjct: 147 GKGRRRRRGRRRRRKKKKKQKPTEKIPREVVIPETIT 183
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 32.2 bits (74), Expect = 0.050
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
K+++E+KK+KKKK KKK++++ K + E
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Score = 30.7 bits (70), Expect = 0.14
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
K EKK++++KK+KKKK KK+ + + E
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKE 449
Score = 30.7 bits (70), Expect = 0.16
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
K+ EK +KK++++KK+KKKK K
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKE 441
Score = 30.3 bits (69), Expect = 0.21
Identities = 12/55 (21%), Positives = 24/55 (43%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTP 56
++K+E++KK+KKKK KKK+++ + + E +
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473
Score = 29.5 bits (67), Expect = 0.37
Identities = 10/37 (27%), Positives = 21/37 (56%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
K ++ +KK++++KK+KKKK + + + E
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEK 450
Score = 29.1 bits (66), Expect = 0.62
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
K KK +K +KK++++KK+KKKK K
Sbjct: 411 KIKKIVEKAEKKREEEKKEKKKKAFAGKK 439
Score = 28.7 bits (65), Expect = 0.79
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
KK KK KK +K +KK++++K+ K
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKK 431
Score = 28.7 bits (65), Expect = 0.83
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+K KK KK +K +KK++++KK K
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKK 432
Score = 28.3 bits (64), Expect = 0.97
Identities = 10/29 (34%), Positives = 20/29 (68%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+K +K +K +KK++++KK+KKK+
Sbjct: 408 ATKKIKKIVEKAEKKREEEKKEKKKKAFA 436
Score = 28.3 bits (64), Expect = 1.00
Identities = 10/38 (26%), Positives = 21/38 (55%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
+K K+ KK +K +KK++++KK+K+ +
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEE 443
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 31.4 bits (71), Expect = 0.050
Identities = 15/23 (65%), Positives = 16/23 (69%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKK 26
+K KKKKKK KK KK KK KK
Sbjct: 97 TEKPKKKKKKSKKTKKPKKSSKK 119
Score = 29.5 bits (66), Expect = 0.20
Identities = 13/20 (65%), Positives = 14/20 (70%)
Query: 6 KEKKKKKKKKKKKKKKKKKK 25
K+KKKK KK KK KK KK
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120
Score = 29.5 bits (66), Expect = 0.22
Identities = 14/22 (63%), Positives = 15/22 (68%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKK 26
K +KKKKK KK KK KK KK
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 29.5 bits (66), Expect = 0.23
Identities = 14/22 (63%), Positives = 15/22 (68%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKK 24
K KK+KKK KK KK KK KK
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 28.7 bits (64), Expect = 0.49
Identities = 13/19 (68%), Positives = 14/19 (73%)
Query: 9 KKKKKKKKKKKKKKKKKKR 27
KKKKKK KK KK KK K+
Sbjct: 101 KKKKKKSKKTKKPKKSSKK 119
Score = 27.2 bits (60), Expect = 1.7
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 7 EKKKKKKKKKKKKKKKKKKKRITK 30
E +K KKKKKK KK KK K+ +K
Sbjct: 95 EPTEKPKKKKKKSKKTKKPKKSSK 118
Score = 26.4 bits (58), Expect = 3.2
Identities = 14/21 (66%), Positives = 14/21 (66%)
Query: 10 KKKKKKKKKKKKKKKKKRITK 30
K KKKKKK KK KK KK K
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKK 119
Score = 25.6 bits (56), Expect = 4.9
Identities = 11/20 (55%), Positives = 14/20 (70%)
Query: 2 RKRKKEKKKKKKKKKKKKKK 21
+K+KK+ KK KK KK KK
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 32.1 bits (73), Expect = 0.053
Identities = 15/47 (31%), Positives = 18/47 (38%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTS 48
K + K KKK K K KK KKR + PS + P
Sbjct: 758 ASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSPALAAMIGAEPVG 804
Score = 29.0 bits (65), Expect = 0.61
Identities = 14/37 (37%), Positives = 18/37 (48%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
K+K K K K KKK K K K+ TK + +P
Sbjct: 754 KKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTP 790
Score = 27.1 bits (60), Expect = 3.0
Identities = 11/41 (26%), Positives = 14/41 (34%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHP 44
+ K+K K K K KKK K + K P
Sbjct: 752 KAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSP 792
Score = 26.7 bits (59), Expect = 3.7
Identities = 11/34 (32%), Positives = 17/34 (50%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
++K + KKK K K K KKK K + ++
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTK 781
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 31.9 bits (73), Expect = 0.057
Identities = 13/35 (37%), Positives = 26/35 (74%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
++ KE+K+ +K KK+++KKKK+ +++ K +IP
Sbjct: 560 KEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPA 594
Score = 30.0 bits (68), Expect = 0.27
Identities = 11/31 (35%), Positives = 24/31 (77%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
++ K+ K++K+ +K KK+++KKKK + K++
Sbjct: 557 KEEKEALKEQKRLRKLKKQEEKKKKELEKLE 587
Score = 30.0 bits (68), Expect = 0.29
Identities = 10/28 (35%), Positives = 22/28 (78%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+R+KE+K+ K++K+ +K KK++++ K
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKK 581
Score = 27.3 bits (61), Expect = 2.1
Identities = 8/27 (29%), Positives = 22/27 (81%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKR 27
R++++++ K++K+ +K KK+++KK+
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKK 580
Score = 25.8 bits (57), Expect = 8.7
Identities = 7/28 (25%), Positives = 20/28 (71%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
K+E +++K++K+ K++K+ +K + +
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEE 577
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 32.1 bits (73), Expect = 0.057
Identities = 15/53 (28%), Positives = 21/53 (39%)
Query: 6 KEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLT 58
K K K +K KKK K K ++ S L + + S +PLT
Sbjct: 115 MRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKPKSISIHSPLT 167
Score = 26.3 bits (58), Expect = 4.5
Identities = 11/36 (30%), Positives = 14/36 (38%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELS 38
K+K K KKKKK K + P +S
Sbjct: 126 KKKITVNKSTNKKKKKVLSSKDELIKYDNNKPKSIS 161
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 31.5 bits (72), Expect = 0.067
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
KRK +++ KKKK K+ K K KK + K
Sbjct: 251 KRKSKEEIKKKKPKESKGVKALKKVVAK 278
Score = 30.4 bits (69), Expect = 0.16
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKK 26
+++ KE+ KKKK K+ K K KK
Sbjct: 250 KKRKSKEEIKKKKPKESKGVKALKKV 275
Score = 28.5 bits (64), Expect = 0.81
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+ +++ KKKK K+ K K KK + K
Sbjct: 253 KSKEEIKKKKPKESKGVKALKKVVAKGMK 281
Score = 28.1 bits (63), Expect = 1.3
Identities = 13/30 (43%), Positives = 16/30 (53%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
R KK K K++ KKKK K+ K K V
Sbjct: 247 RAEKKRKSKEEIKKKKPKESKGVKALKKVV 276
Score = 27.7 bits (62), Expect = 1.6
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
K++K K++ KKKK K+ K K K++
Sbjct: 250 KKRKSKEEIKKKKPKESKGVKALKKVVAK 278
Score = 26.9 bits (60), Expect = 2.7
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
+ +K++K K++ KKKK K+ K K + KV
Sbjct: 245 ESRAEKKRKSKEEIKKKKPKESKGVKALKKV 275
Score = 26.9 bits (60), Expect = 3.2
Identities = 11/33 (33%), Positives = 19/33 (57%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPS 35
R ++K+K K++ KKKK K+ K + K +
Sbjct: 246 SRAEKKRKSKEEIKKKKPKESKGVKALKKVVAK 278
Score = 25.4 bits (56), Expect = 8.8
Identities = 10/24 (41%), Positives = 12/24 (50%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKK 24
+K+ KE K K KK K KK
Sbjct: 259 KKKKPKESKGVKALKKVVAKGMKK 282
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 31.4 bits (72), Expect = 0.083
Identities = 13/42 (30%), Positives = 19/42 (45%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVH 43
+K +KE+ +K KKK + KK K + PLG
Sbjct: 423 KKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDPLGEK 464
Score = 29.5 bits (67), Expect = 0.45
Identities = 10/24 (41%), Positives = 18/24 (75%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKKR 27
RKK++K +KK +K++ +K KK+
Sbjct: 415 RKKQRKAEKKAEKEEAEKAAAKKK 438
Score = 29.5 bits (67), Expect = 0.45
Identities = 10/25 (40%), Positives = 19/25 (76%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
+RKK +KK++K +KK +K++ +K
Sbjct: 410 ERKKLRKKQRKAEKKAEKEEAEKAA 434
Score = 29.1 bits (66), Expect = 0.60
Identities = 10/29 (34%), Positives = 19/29 (65%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+K++K +KK +K++ +K KKK + K
Sbjct: 416 KKQRKAEKKAEKEEAEKAAAKKKAEAAAK 444
Score = 28.0 bits (63), Expect = 1.3
Identities = 9/20 (45%), Positives = 15/20 (75%)
Query: 5 KKEKKKKKKKKKKKKKKKKK 24
E+KK +KK++K +KK +K
Sbjct: 408 PAERKKLRKKQRKAEKKAEK 427
Score = 28.0 bits (63), Expect = 1.3
Identities = 9/30 (30%), Positives = 19/30 (63%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+RK++++ +KK +K++ +K KKK
Sbjct: 414 LRKKQRKAEKKAEKEEAEKAAAKKKAEAAA 443
Score = 27.6 bits (62), Expect = 1.8
Identities = 8/26 (30%), Positives = 19/26 (73%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
RK+ ++K++K +KK +K++ +K +
Sbjct: 411 RKKLRKKQRKAEKKAEKEEAEKAAAK 436
Score = 26.4 bits (59), Expect = 4.4
Identities = 8/30 (26%), Positives = 19/30 (63%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
K +K+++K +KK +K++ +K K+ +
Sbjct: 413 KLRKKQRKAEKKAEKEEAEKAAAKKKAEAA 442
Score = 26.4 bits (59), Expect = 5.1
Identities = 8/20 (40%), Positives = 15/20 (75%)
Query: 7 EKKKKKKKKKKKKKKKKKKK 26
++KK +KK++K +KK +K
Sbjct: 408 PAERKKLRKKQRKAEKKAEK 427
Score = 25.7 bits (57), Expect = 8.9
Identities = 8/20 (40%), Positives = 15/20 (75%)
Query: 6 KEKKKKKKKKKKKKKKKKKK 25
++KK +KK++K +KK +K
Sbjct: 408 PAERKKLRKKQRKAEKKAEK 427
Score = 25.7 bits (57), Expect = 9.3
Identities = 7/20 (35%), Positives = 15/20 (75%)
Query: 8 KKKKKKKKKKKKKKKKKKKR 27
++KK +KK++K +KK ++
Sbjct: 408 PAERKKLRKKQRKAEKKAEK 427
>gnl|CDD|223557 COG0481, LepA, Membrane GTPase LepA [Cell envelope biogenesis,
outer membrane].
Length = 603
Score = 31.0 bits (71), Expect = 0.11
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 10 KKKKKKKKKKKKKKKKKRITKVQIPSE 36
+K+K +K+K+ KK+ K+I V+IP E
Sbjct: 566 RKRKLLEKQKEGKKRMKQIGNVEIPQE 592
>gnl|CDD|130460 TIGR01393, lepA, GTP-binding protein LepA. LepA (GUF1 in
Saccaromyces) is a GTP-binding membrane protein related
to EF-G and EF-Tu. Two types of phylogenetic tree,
rooted by other GTP-binding proteins, suggest that
eukaryotic homologs (including GUF1 of yeast) originated
within the bacterial LepA family. The function is
unknown [Unknown function, General].
Length = 595
Score = 31.1 bits (71), Expect = 0.12
Identities = 12/27 (44%), Positives = 21/27 (77%)
Query: 10 KKKKKKKKKKKKKKKKKRITKVQIPSE 36
+K+K +K+K+ KK+ K+I KV++P E
Sbjct: 560 RKRKLLEKQKEGKKRMKQIGKVEVPQE 586
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 31.0 bits (70), Expect = 0.15
Identities = 13/30 (43%), Positives = 22/30 (73%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
+RKK+K++K K+K+ KK K +K+ K+Q
Sbjct: 19 ERKKKKEEKAKEKELKKLKAAQKEAKAKLQ 48
Score = 30.3 bits (68), Expect = 0.22
Identities = 14/31 (45%), Positives = 21/31 (67%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
RK+KKE+K K+K+ KK K +K+ K + Q
Sbjct: 20 RKKKKEEKAKEKELKKLKAAQKEAKAKLQAQ 50
Score = 25.6 bits (56), Expect = 9.0
Identities = 11/31 (35%), Positives = 22/31 (70%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
+K E++ ++KKKK++K K+K+ K++ Q
Sbjct: 10 KKILTEEELERKKKKEEKAKEKELKKLKAAQ 40
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 30.4 bits (69), Expect = 0.16
Identities = 14/30 (46%), Positives = 20/30 (66%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
++ R +E++ KKKKKKKKKK KK +
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNATGSS 201
Score = 27.7 bits (62), Expect = 1.4
Identities = 12/39 (30%), Positives = 16/39 (41%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
+ + + +KKKKKKKKK KK V
Sbjct: 176 LEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTEL 214
Score = 26.5 bits (59), Expect = 4.0
Identities = 12/46 (26%), Positives = 16/46 (34%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTS 48
K+KK+KKKKK KK + + V V
Sbjct: 182 KKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKL 227
Score = 25.8 bits (57), Expect = 7.7
Identities = 15/47 (31%), Positives = 20/47 (42%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPT 47
+ KK+KKKKKKK KK + +P+ELS
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEA 224
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 30.0 bits (68), Expect = 0.17
Identities = 17/30 (56%), Positives = 23/30 (76%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
++++ K+KK+KKKKKKKKKKK KK K
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 29.3 bits (66), Expect = 0.30
Identities = 17/31 (54%), Positives = 22/31 (70%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQI 33
K+ ++K K+KK+KKKKKKKKKKK K
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAK 132
Score = 28.1 bits (63), Expect = 0.75
Identities = 17/28 (60%), Positives = 21/28 (75%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
K K++K+KKKKKKKKKKK KK + K
Sbjct: 108 KAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 28.1 bits (63), Expect = 0.86
Identities = 17/25 (68%), Positives = 21/25 (84%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKK 26
++ KEK K+KK+KKKKKKKKKKK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKT 126
Score = 28.1 bits (63), Expect = 0.89
Identities = 17/25 (68%), Positives = 21/25 (84%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKK 26
+K++K+KKKKKKKKK KK KKKK
Sbjct: 111 QKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 26.6 bits (59), Expect = 2.6
Identities = 17/25 (68%), Positives = 20/25 (80%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKK 25
+K+KK+KKKKKKKK KK KKKK
Sbjct: 111 QKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 25.4 bits (56), Expect = 6.2
Identities = 15/26 (57%), Positives = 21/26 (80%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
+ K++ K+KK+KKKKKKKKKKK +
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTSK 128
>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
SecD/SecF; Reviewed.
Length = 755
Score = 30.6 bits (70), Expect = 0.17
Identities = 12/23 (52%), Positives = 18/23 (78%)
Query: 7 EKKKKKKKKKKKKKKKKKKKRIT 29
EK++ KKKKK+KK KK + ++I
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755
Score = 29.0 bits (66), Expect = 0.52
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 10 KKKKKKKKKKKKKKKKKRITKV 31
+K++ KKKKK+KK KK + K+
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKI 754
Score = 29.0 bits (66), Expect = 0.58
Identities = 10/21 (47%), Positives = 17/21 (80%)
Query: 6 KEKKKKKKKKKKKKKKKKKKK 26
++++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 28.3 bits (64), Expect = 1.2
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKK 22
+ KR+ +KKKK+KK KK + +K
Sbjct: 732 LEKRRLKKKKKRKKVKKWEVEK 753
Score = 27.9 bits (63), Expect = 1.5
Identities = 10/21 (47%), Positives = 17/21 (80%)
Query: 3 KRKKEKKKKKKKKKKKKKKKK 23
++++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 27.9 bits (63), Expect = 1.6
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 5 KKEKKKKKKKKKKKKKKKKK 24
K+ KKKKK+KK KK + +K
Sbjct: 734 KRRLKKKKKRKKVKKWEVEK 753
Score = 27.5 bits (62), Expect = 2.0
Identities = 11/21 (52%), Positives = 16/21 (76%)
Query: 5 KKEKKKKKKKKKKKKKKKKKK 25
+K + KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
>gnl|CDD|235462 PRK05433, PRK05433, GTP-binding protein LepA; Provisional.
Length = 600
Score = 30.4 bits (70), Expect = 0.19
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 10 KKKKKKKKKKKKKKKKKRITKVQIPSE 36
+K+K +K+K+ KK+ K+I V+IP E
Sbjct: 563 RKRKLLEKQKEGKKRMKQIGNVEIPQE 589
>gnl|CDD|211392 cd11380, Ribosomal_S8e_like, Eukaryotic/archaeal ribosomal
protein S8e and similar proteins. This family contains
the eukaryotic/archaeal ribosomal protein S8, a
component of the small ribosomal subunits, as well as
the NSA2 gene product.
Length = 138
Score = 29.8 bits (68), Expect = 0.19
Identities = 11/29 (37%), Positives = 16/29 (55%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
KRK K K +KK+K + +K TK+
Sbjct: 6 KRKATGGKFKVVRKKRKYELGRKPANTKL 34
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 30.2 bits (68), Expect = 0.20
Identities = 15/27 (55%), Positives = 21/27 (77%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRI 28
+K+K E K++KKKKK+KKKKKK+
Sbjct: 145 KKKKHEDDKERKKKKKEKKKKKKRHSP 171
Score = 29.8 bits (67), Expect = 0.29
Identities = 15/26 (57%), Positives = 22/26 (84%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
+ +KK+ + K++KKKKK+KKKKKKR
Sbjct: 143 KHKKKKHEDDKERKKKKKEKKKKKKR 168
Score = 29.4 bits (66), Expect = 0.36
Identities = 15/26 (57%), Positives = 22/26 (84%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
+K KK+K + K++KKKKK+KKKKK+
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKK 167
Score = 28.3 bits (63), Expect = 0.93
Identities = 16/30 (53%), Positives = 21/30 (70%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRITKVQIP 34
KK KKKK + K++KKKKK+KK+ K P
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKRHSP 171
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 30.4 bits (68), Expect = 0.21
Identities = 17/48 (35%), Positives = 25/48 (52%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTD 52
KK KK KKK+KK+K+K++ K K+ S L L P + +
Sbjct: 197 KKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAE 244
Score = 28.1 bits (62), Expect = 1.2
Identities = 17/63 (26%), Positives = 29/63 (46%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLTALK 61
+K KK KKK+KK+K+K++ K KKK+ + L + + + + T
Sbjct: 197 KKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSG 256
Query: 62 YPP 64
P
Sbjct: 257 TAP 259
Score = 28.1 bits (62), Expect = 1.2
Identities = 9/37 (24%), Positives = 25/37 (67%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
+++ EK K++KKK++K+K+ +++ + +++P
Sbjct: 91 RQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAP 127
Score = 27.7 bits (61), Expect = 1.9
Identities = 9/25 (36%), Positives = 21/25 (84%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
+R +++ +K K++KKK++K+K+ +R
Sbjct: 88 RRHRQRLEKDKREKKKREKEKRGRR 112
Score = 26.6 bits (58), Expect = 4.1
Identities = 9/26 (34%), Positives = 21/26 (80%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
R R++ +K K++KKK++K+K+ +++
Sbjct: 89 RHRQRLEKDKREKKKREKEKRGRRRH 114
Score = 26.2 bits (57), Expect = 5.8
Identities = 10/34 (29%), Positives = 23/34 (67%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
+R+ ++ +K K++KKK++K+K+ R + +E
Sbjct: 87 ERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTE 120
Score = 26.2 bits (57), Expect = 6.5
Identities = 9/26 (34%), Positives = 21/26 (80%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
R+ ++ +K K++KKK++K+K+ ++R
Sbjct: 88 RRHRQRLEKDKREKKKREKEKRGRRR 113
Score = 25.4 bits (55), Expect = 9.1
Identities = 14/26 (53%), Positives = 20/26 (76%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
+K K KKKK++K+K++KKKKKK
Sbjct: 275 KKSPKHKKKKQRKEKEEKKKKKKHHH 300
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 30.5 bits (69), Expect = 0.22
Identities = 11/26 (42%), Positives = 13/26 (50%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
R KK K K KK +K KK K +
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQ 795
Score = 29.7 bits (67), Expect = 0.31
Identities = 9/28 (32%), Positives = 15/28 (53%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRIT 29
++K + K KK +K KK K ++I
Sbjct: 771 GEKKAKPKAAKKDARKAKKPSAKTQKIA 798
Score = 29.7 bits (67), Expect = 0.38
Identities = 10/28 (35%), Positives = 15/28 (53%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKKRITKV 31
R ++K K K KK +K KK + K+
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQKI 797
Score = 29.3 bits (66), Expect = 0.43
Identities = 11/25 (44%), Positives = 13/25 (52%)
Query: 8 KKKKKKKKKKKKKKKKKKKRITKVQ 32
+KK K K KK +K KK K Q
Sbjct: 771 GEKKAKPKAAKKDARKAKKPSAKTQ 795
Score = 26.6 bits (59), Expect = 3.9
Identities = 10/28 (35%), Positives = 13/28 (46%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+ K KK +K KK K +K TK
Sbjct: 775 AKPKAAKKDARKAKKPSAKTQKIAAATK 802
Score = 25.9 bits (57), Expect = 7.6
Identities = 10/33 (30%), Positives = 14/33 (42%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPS 35
K+ K KK K +K K KR K ++
Sbjct: 781 KKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813
>gnl|CDD|197876 smart00792, Agouti, Agouti protein. The agouti protein regulates
pigmentation in the mouse hair follicle producing a
black hair with a subapical yellow band. A highly
homologous protein agouti signal protein (ASIP) is
present in humans and is expressed at highest levels in
adipose tissue where it may play a role in energy
homeostasis and possibly human pigmentation.
Length = 124
Score = 29.5 bits (66), Expect = 0.23
Identities = 15/39 (38%), Positives = 22/39 (56%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
+ KK KK ++ +KK +KK+KK +T V P SP
Sbjct: 48 VGLNKKPKKISAEEAEKKLLQKKEKKALTNVLRPEPRSP 86
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 30.1 bits (69), Expect = 0.24
Identities = 8/30 (26%), Positives = 11/30 (36%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
++ + K KK K R TK Q
Sbjct: 819 EKLRYLADAPAKDPAGKKAAVKFSRKTKQQ 848
Score = 30.1 bits (69), Expect = 0.27
Identities = 5/28 (17%), Positives = 9/28 (32%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+ KK K +K K++
Sbjct: 829 AKDPAGKKAAVKFSRKTKQQYVASEKDG 856
Score = 29.8 bits (68), Expect = 0.30
Identities = 6/29 (20%), Positives = 10/29 (34%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
K KK K +K K++ +
Sbjct: 824 LADAPAKDPAGKKAAVKFSRKTKQQYVAS 852
Score = 29.4 bits (67), Expect = 0.42
Identities = 7/26 (26%), Positives = 9/26 (34%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
R K KK K +K K+
Sbjct: 822 RYLADAPAKDPAGKKAAVKFSRKTKQ 847
Score = 29.4 bits (67), Expect = 0.42
Identities = 6/24 (25%), Positives = 11/24 (45%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKK 26
+K K +K K++ +K K
Sbjct: 834 GKKAAVKFSRKTKQQYVASEKDGK 857
Score = 29.4 bits (67), Expect = 0.44
Identities = 5/30 (16%), Positives = 11/30 (36%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
+ KK K +K K++ ++
Sbjct: 825 ADAPAKDPAGKKAAVKFSRKTKQQYVASEK 854
Score = 29.0 bits (66), Expect = 0.69
Identities = 7/29 (24%), Positives = 9/29 (31%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
K +K + K KK K K
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRK 844
Score = 29.0 bits (66), Expect = 0.70
Identities = 6/30 (20%), Positives = 10/30 (33%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
+ + K KK K +K + V
Sbjct: 822 RYLADAPAKDPAGKKAAVKFSRKTKQQYVA 851
Score = 28.6 bits (65), Expect = 0.85
Identities = 6/24 (25%), Positives = 11/24 (45%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKK 26
K+ K +K K++ +K K
Sbjct: 835 KKAAVKFSRKTKQQYVASEKDGKA 858
Score = 28.2 bits (64), Expect = 1.1
Identities = 7/27 (25%), Positives = 12/27 (44%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRIT 29
KK K +K K++ +K + T
Sbjct: 833 AGKKAAVKFSRKTKQQYVASEKDGKAT 859
Score = 27.8 bits (63), Expect = 1.4
Identities = 7/28 (25%), Positives = 12/28 (42%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
K KK K +K K++ ++ K
Sbjct: 830 KDPAGKKAAVKFSRKTKQQYVASEKDGK 857
Score = 27.4 bits (62), Expect = 2.3
Identities = 7/34 (20%), Positives = 12/34 (35%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
K + K KK K ++ + + SE
Sbjct: 820 KLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASE 853
Score = 27.1 bits (61), Expect = 2.5
Identities = 5/24 (20%), Positives = 11/24 (45%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKK 25
+K + +K K++ +K K
Sbjct: 835 KKAAVKFSRKTKQQYVASEKDGKA 858
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 29.9 bits (68), Expect = 0.24
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 9 KKKKKKKKKKKKKKKKKKRITKVQIPSEL 37
KKK KK KK+K+K++K+ R EL
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDEL 42
Score = 28.0 bits (63), Expect = 0.94
Identities = 10/20 (50%), Positives = 17/20 (85%)
Query: 7 EKKKKKKKKKKKKKKKKKKK 26
+KK KK KK+K+K++K+ +K
Sbjct: 14 KKKAKKAKKEKRKQRKQARK 33
Score = 27.6 bits (62), Expect = 1.5
Identities = 10/21 (47%), Positives = 16/21 (76%)
Query: 5 KKEKKKKKKKKKKKKKKKKKK 25
KK KK KK+K+K++K+ +K
Sbjct: 15 KKAKKAKKEKRKQRKQARKGA 35
Score = 27.2 bits (61), Expect = 2.0
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 6 KEKKKKKKKKKKKKKKKKKKKRITKV 31
K+K KK KK+K+K++K+ +K
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGD 39
Score = 26.4 bits (59), Expect = 3.2
Identities = 10/22 (45%), Positives = 17/22 (77%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKK 26
KK+ KK KK+K+K++K+ +K
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGA 35
Score = 25.6 bits (57), Expect = 6.7
Identities = 8/19 (42%), Positives = 14/19 (73%)
Query: 1 MRKRKKEKKKKKKKKKKKK 19
+K KKEK+K++K+ +K
Sbjct: 17 AKKAKKEKRKQRKQARKGA 35
Score = 25.3 bits (56), Expect = 9.2
Identities = 9/20 (45%), Positives = 15/20 (75%)
Query: 3 KRKKEKKKKKKKKKKKKKKK 22
K KK KK+K+K++K+ +K
Sbjct: 16 KAKKAKKEKRKQRKQARKGA 35
>gnl|CDD|163257 TIGR03425, urea_degr_2, urea carboxylase-associated protein 2. A
number of bacteria degrade urea as a nitrogen source by
the urea carboxylase/allophanate hydrolase pathway,
which uses biotin and consumes ATP, rather than my means
of the nickel-dependent enzyme urease. This model
represents one of a pair of homologous, tandem
uncharacterized genes found together with the urea
carboxylase and allophanate hydrolase genes.
Length = 233
Score = 29.6 bits (67), Expect = 0.28
Identities = 14/27 (51%), Positives = 15/27 (55%), Gaps = 3/27 (11%)
Query: 44 PHPTSPSTDYT-TP--LTALKYPPSAA 67
PHP P DY P LTA + PP AA
Sbjct: 184 PHPLDPRPDYQPKPVRLTAWRAPPVAA 210
>gnl|CDD|203441 pfam06421, LepA_C, GTP-binding protein LepA C-terminus. This
family consists of the C-terminal region of several pro-
and eukaryotic GTP-binding LepA proteins.
Length = 108
Score = 29.1 bits (66), Expect = 0.29
Identities = 13/27 (48%), Positives = 20/27 (74%)
Query: 10 KKKKKKKKKKKKKKKKKRITKVQIPSE 36
+KKK +K+K+ KK+ K+I V+IP E
Sbjct: 74 RKKKLLEKQKEGKKRMKQIGNVEIPQE 100
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 29.7 bits (67), Expect = 0.31
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
K K E+ KK++ + ++KKK+KK +
Sbjct: 363 KAKPERDKKERPGRYRRKKKEKKAK 387
Score = 28.5 bits (64), Expect = 0.88
Identities = 8/30 (26%), Positives = 19/30 (63%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
+ K ++ KK++ + ++KKK+KK ++
Sbjct: 361 PTKAKPERDKKERPGRYRRKKKEKKAKSER 390
Score = 28.1 bits (63), Expect = 1.1
Identities = 7/26 (26%), Positives = 17/26 (65%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
+K + + ++KKK+KK K +++ +
Sbjct: 370 KKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 28.1 bits (63), Expect = 1.3
Identities = 9/24 (37%), Positives = 18/24 (75%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKKR 27
R K+++ + ++KKK+KK K ++R
Sbjct: 368 RDKKERPGRYRRKKKEKKAKSERR 391
Score = 27.0 bits (60), Expect = 2.7
Identities = 9/25 (36%), Positives = 18/25 (72%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
+ KKE+ + ++KKK+KK K +++
Sbjct: 368 RDKKERPGRYRRKKKEKKAKSERRG 392
Score = 27.0 bits (60), Expect = 2.9
Identities = 8/26 (30%), Positives = 17/26 (65%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
R +K+ + ++KKK+KK K +++
Sbjct: 368 RDKKERPGRYRRKKKEKKAKSERRGL 393
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 29.7 bits (67), Expect = 0.37
Identities = 13/27 (48%), Positives = 22/27 (81%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRI 28
+ K +KKKKK+KKK+++ K+++K RI
Sbjct: 741 VEEKTKKKKKKEKKKEEEYKREEKARI 767
Score = 27.4 bits (61), Expect = 2.7
Identities = 13/29 (44%), Positives = 23/29 (79%)
Query: 6 KEKKKKKKKKKKKKKKKKKKKRITKVQIP 34
+EK KKKKKK+KKK+++ K++ +++I
Sbjct: 742 EEKTKKKKKKEKKKEEEYKREEKARIEIA 770
Score = 26.6 bits (59), Expect = 3.9
Identities = 12/27 (44%), Positives = 21/27 (77%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRIT 29
K KK+KKK+KKK+++ K+++K + I
Sbjct: 744 KTKKKKKKEKKKEEEYKREEKARIEIA 770
>gnl|CDD|220440 pfam09848, DUF2075, Uncharacterized conserved protein (DUF2075).
This domain, found in various prokaryotic proteins
(including putative ATP/GTP binding proteins), has no
known function.
Length = 348
Score = 29.5 bits (67), Expect = 0.37
Identities = 8/29 (27%), Positives = 13/29 (44%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
K E K +K+K K+ + R+ K
Sbjct: 300 DGTKWEDKSAFQKRKDLKEDPEDADRLIK 328
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this
family (Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 28.4 bits (64), Expect = 0.38
Identities = 8/31 (25%), Positives = 23/31 (74%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
++RK+E+K +++++ K+ K K+++ K++
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREEIEEKLE 31
Score = 26.5 bits (59), Expect = 2.1
Identities = 9/29 (31%), Positives = 21/29 (72%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
K ++E++ K+ K K+++ ++K ++I KV
Sbjct: 8 KAQREEELKRLKNLKREEIEEKLEKIKKV 36
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 29.1 bits (66), Expect = 0.42
Identities = 12/38 (31%), Positives = 20/38 (52%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
R+ EK+K ++++K K K +KKR + Q P
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHT 159
Score = 28.7 bits (65), Expect = 0.55
Identities = 7/25 (28%), Positives = 16/25 (64%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
KR+ +K+K ++++K K K ++
Sbjct: 121 KREAAGEKEKAPRRERKPKPKAPRK 145
Score = 28.4 bits (64), Expect = 0.73
Identities = 12/41 (29%), Positives = 22/41 (53%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGV 42
K + ++K K K +KK+K + +K + S++S L V
Sbjct: 129 EKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTV 169
Score = 28.4 bits (64), Expect = 0.95
Identities = 10/54 (18%), Positives = 26/54 (48%), Gaps = 1/54 (1%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTS-PSTDYTT 55
+R +++ KK++ +K+K +++++ + P P P P +D +
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166
Score = 26.4 bits (59), Expect = 3.7
Identities = 7/35 (20%), Positives = 17/35 (48%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
++ +K+K ++++K K K +K+ E
Sbjct: 121 KREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 29.3 bits (66), Expect = 0.45
Identities = 20/66 (30%), Positives = 29/66 (43%), Gaps = 4/66 (6%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLTALKYPP 64
K K K K K K KK +++ KR K P SP T+P+ ++ TA P
Sbjct: 97 PKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPF----ENTAPARPTSSTATAAASKP 152
Query: 65 SAALSQ 70
++S
Sbjct: 153 VTSVSS 158
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation
factor 3 (eIF3). In yeast it is called Hcr1. The
Saccharomyces cerevisiae protein eIF3j (HCR1) has been
shown to be required for processing of 20S pre-rRNA and
binds to 18S rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 29.2 bits (66), Expect = 0.46
Identities = 9/38 (23%), Positives = 23/38 (60%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPL 40
+ K+ K K ++K+K K++K++K + +++ + L
Sbjct: 56 AKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDEL 93
Score = 28.5 bits (64), Expect = 0.68
Identities = 12/21 (57%), Positives = 12/21 (57%)
Query: 3 KRKKEKKKKKKKKKKKKKKKK 23
K K EK K KKKK K K K
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAK 214
Score = 28.1 bits (63), Expect = 1.2
Identities = 11/21 (52%), Positives = 12/21 (57%)
Query: 6 KEKKKKKKKKKKKKKKKKKKK 26
K K +K K KKKK K K K
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAK 214
Score = 26.9 bits (60), Expect = 2.9
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 8 KKKKKKKKKKKKKKKKKKKRITK 30
K K KK K K ++K+K KR +
Sbjct: 55 KAKAKKALKAKIEEKEKAKREKE 77
Score = 26.5 bits (59), Expect = 3.7
Identities = 11/21 (52%), Positives = 12/21 (57%)
Query: 6 KEKKKKKKKKKKKKKKKKKKK 26
EK K +K K KKKK K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAK 212
Score = 25.8 bits (57), Expect = 6.1
Identities = 11/36 (30%), Positives = 20/36 (55%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPL 40
K + KK K K ++K+K K++K ++ E +P
Sbjct: 55 KAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPE 90
Score = 25.8 bits (57), Expect = 6.5
Identities = 10/28 (35%), Positives = 14/28 (50%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+ +K K K K KK K K ++K K
Sbjct: 46 EEEKAKVAAKAKAKKALKAKIEEKEKAK 73
Score = 25.4 bits (56), Expect = 7.8
Identities = 9/25 (36%), Positives = 18/25 (72%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
K K +K K K ++K+K K++K+++
Sbjct: 55 KAKAKKALKAKIEEKEKAKREKEEK 79
Score = 25.4 bits (56), Expect = 7.9
Identities = 10/30 (33%), Positives = 16/30 (53%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
++++EK K K K KK K K + K +
Sbjct: 44 EKEEEKAKVAAKAKAKKALKAKIEEKEKAK 73
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 29.2 bits (66), Expect = 0.47
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+ +KEKKK K++KK K++K+K
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 28.8 bits (65), Expect = 0.47
Identities = 10/28 (35%), Positives = 19/28 (67%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
EK+ K K++K+++ ++K+K I K Q
Sbjct: 123 LLEKELAKLKREKRRENERKQKEILKEQ 150
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 29.0 bits (65), Expect = 0.54
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKK 26
KK+K K KK K K KK + KKK
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKK 112
Score = 28.3 bits (63), Expect = 0.87
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKK 26
K+KK K KK K K KK + KKK +
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAR 114
Score = 27.9 bits (62), Expect = 1.0
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKKR 27
+KK+ K KK K K KK + KKK R
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAR 114
Score = 27.5 bits (61), Expect = 1.7
Identities = 12/23 (52%), Positives = 15/23 (65%)
Query: 6 KEKKKKKKKKKKKKKKKKKKKRI 28
K+KK K KK K K KK + KK+
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKA 113
Score = 27.5 bits (61), Expect = 1.8
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKK 25
+K+K + KK K K KK + KKK +
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAR 114
Score = 27.1 bits (60), Expect = 2.6
Identities = 12/21 (57%), Positives = 14/21 (66%)
Query: 7 EKKKKKKKKKKKKKKKKKKKR 27
EKKKK K KK K K KK + +
Sbjct: 90 EKKKKGKSKKNKLKGKKDEDK 110
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 29.0 bits (65), Expect = 0.57
Identities = 14/23 (60%), Positives = 17/23 (73%)
Query: 6 KEKKKKKKKKKKKKKKKKKKKRI 28
E K KK KKKKKKKKK++K +
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRKDL 292
Score = 27.8 bits (62), Expect = 1.4
Identities = 13/23 (56%), Positives = 16/23 (69%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKR 27
+ K KK KKKKKKKKK++K
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRKDL 292
Score = 27.8 bits (62), Expect = 1.7
Identities = 14/21 (66%), Positives = 16/21 (76%)
Query: 1 MRKRKKEKKKKKKKKKKKKKK 21
M K KK KKKKKKKKK++K
Sbjct: 272 MVKFKKPKKKKKKKKKRRKDL 292
Score = 27.0 bits (60), Expect = 2.5
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 6 KEKKKKKKKKKKKKKKKKKKKR 27
+ K KK KKKKKKKKK+R
Sbjct: 268 DVSEMVKFKKPKKKKKKKKKRR 289
>gnl|CDD|188606 TIGR04091, LTA_dltB, D-alanyl-lipoteichoic acid biosynthesis
protein DltB. Members of this protein family are DltB,
part of a four-gene operon for D-alanyl-lipoteichoic
acid biosynthesis that is present in the vast majority
of low-GC Gram-positive organisms. This protein may be
involved in transport of D-alanine across the plasma
membrane [Cell envelope, Biosynthesis and degradation of
murein sacculus and peptidoglycan].
Length = 380
Score = 29.0 bits (66), Expect = 0.59
Identities = 9/17 (52%), Positives = 11/17 (64%)
Query: 15 KKKKKKKKKKKKRITKV 31
+K KK KK KK + TK
Sbjct: 343 RKSKKYKKNKKNKWTKA 359
Score = 27.5 bits (62), Expect = 2.0
Identities = 8/14 (57%), Positives = 10/14 (71%)
Query: 7 EKKKKKKKKKKKKK 20
++K KK KK KK K
Sbjct: 342 QRKSKKYKKNKKNK 355
Score = 26.7 bits (60), Expect = 3.4
Identities = 9/20 (45%), Positives = 11/20 (55%)
Query: 14 KKKKKKKKKKKKKRITKVQI 33
+K KK KK KK K + I
Sbjct: 343 RKSKKYKKNKKNKWTKALSI 362
Score = 26.3 bits (59), Expect = 4.1
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 9 KKKKKKKKKKKKK 21
+K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355
Score = 26.3 bits (59), Expect = 4.1
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 10 KKKKKKKKKKKKK 22
+K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355
Score = 26.3 bits (59), Expect = 4.1
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 11 KKKKKKKKKKKKK 23
+K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355
Score = 26.3 bits (59), Expect = 4.1
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 12 KKKKKKKKKKKKK 24
+K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355
Score = 26.3 bits (59), Expect = 4.1
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 13 KKKKKKKKKKKKK 25
+K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region
covers both the Brf homology II and III regions. This
region is involved in binding TATA binding protein.
Length = 95
Score = 28.0 bits (63), Expect = 0.61
Identities = 9/37 (24%), Positives = 18/37 (48%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
+ K+ K+K + KKKKK+K + + + +
Sbjct: 33 EEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTA 69
Score = 25.7 bits (57), Expect = 4.2
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
++ K++ + KKKKK+K KKKR
Sbjct: 35 KELKQKADEGNNSGKKKKKRKAKKKRDEA 63
Score = 25.7 bits (57), Expect = 4.6
Identities = 12/35 (34%), Positives = 18/35 (51%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
K+K ++ KKKKK+K KKK+ +E
Sbjct: 37 LKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAE 71
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 28.4 bits (64), Expect = 0.69
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRIT 29
++KK+KKK KKKK KK K I+
Sbjct: 181 EKKKKKKKSAKKKKLKKVAAVGMKAIS 207
Score = 28.0 bits (63), Expect = 1.1
Identities = 14/31 (45%), Positives = 21/31 (67%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
+ KK+ +++KKKKKKK KKK++ KV
Sbjct: 168 AARDKKKSNNAEEEKKKKKKKSAKKKKLKKV 198
Score = 26.5 bits (59), Expect = 3.3
Identities = 14/30 (46%), Positives = 18/30 (60%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+K +++KKKKKKK KKKK KK
Sbjct: 172 KKKSNNAEEEKKKKKKKSAKKKKLKKVAAV 201
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 28.5 bits (64), Expect = 0.70
Identities = 17/25 (68%), Positives = 20/25 (80%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKK 26
+KR K +KKK+KKKKKKK KK KK
Sbjct: 85 KKRAKRQKKKQKKKKKKKAKKGNKK 109
Score = 27.3 bits (61), Expect = 1.7
Identities = 14/29 (48%), Positives = 23/29 (79%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+++ +EK KK+ K++KKK+KKKKK+ K
Sbjct: 76 KRKDEEKTAKKRAKRQKKKQKKKKKKKAK 104
Score = 26.9 bits (60), Expect = 1.9
Identities = 16/35 (45%), Positives = 22/35 (62%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
K KK+ K++KKK+KKKKKKK + + E S
Sbjct: 82 KTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSK 116
Score = 26.9 bits (60), Expect = 2.3
Identities = 14/24 (58%), Positives = 20/24 (83%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKKR 27
+K+ K++KKK+KKKKKKK KK +
Sbjct: 85 KKRAKRQKKKQKKKKKKKAKKGNK 108
Score = 26.5 bits (59), Expect = 2.5
Identities = 16/25 (64%), Positives = 21/25 (84%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKK 26
KR+K+K+KKKKKKK KK KK++K
Sbjct: 88 AKRQKKKQKKKKKKKAKKGNKKEEK 112
Score = 26.5 bits (59), Expect = 2.7
Identities = 15/24 (62%), Positives = 20/24 (83%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKK 26
K K++ K++KKK+KKKKKKK KK
Sbjct: 82 KTAKKRAKRQKKKQKKKKKKKAKK 105
Score = 26.5 bits (59), Expect = 3.0
Identities = 15/23 (65%), Positives = 18/23 (78%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKK 26
KK K++KKK+KKKKKKK KK
Sbjct: 84 AKKRAKRQKKKQKKKKKKKAKKG 106
Score = 26.2 bits (58), Expect = 3.7
Identities = 14/25 (56%), Positives = 20/25 (80%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKK 26
K++ +++KKK+KKKKKKK KK K
Sbjct: 84 AKKRAKRQKKKQKKKKKKKAKKGNK 108
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 28.8 bits (65), Expect = 0.75
Identities = 10/35 (28%), Positives = 12/35 (34%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
KK+ K K K KKK K +E
Sbjct: 73 DIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAE 107
Score = 28.0 bits (63), Expect = 1.3
Identities = 10/37 (27%), Positives = 14/37 (37%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELS 38
+K K K K KKK K + K + + L
Sbjct: 77 KKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALD 113
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 28.5 bits (64), Expect = 0.77
Identities = 11/28 (39%), Positives = 15/28 (53%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRI 28
+ + +KKK + KKK K KK RI
Sbjct: 146 CERTEILAEKKKPRSKKKSSKNSKKLRI 173
Score = 26.6 bits (59), Expect = 3.2
Identities = 8/25 (32%), Positives = 13/25 (52%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKKRI 28
++KK + KKK K KK + +
Sbjct: 152 LAEKKKPRSKKKSSKNSKKLRIDLV 176
Score = 26.6 bits (59), Expect = 3.3
Identities = 10/25 (40%), Positives = 13/25 (52%)
Query: 6 KEKKKKKKKKKKKKKKKKKKKRITK 30
EKKK + KKK K KK + +
Sbjct: 153 AEKKKPRSKKKSSKNSKKLRIDLVG 177
>gnl|CDD|235501 PRK05559, PRK05559, DNA topoisomerase IV subunit B; Reviewed.
Length = 631
Score = 28.5 bits (65), Expect = 0.78
Identities = 7/19 (36%), Positives = 10/19 (52%)
Query: 1 MRKRKKEKKKKKKKKKKKK 19
+ R + KK K+KKK
Sbjct: 380 AQARLRAAKKVKRKKKTSG 398
Score = 25.8 bits (58), Expect = 8.3
Identities = 7/21 (33%), Positives = 11/21 (52%)
Query: 3 KRKKEKKKKKKKKKKKKKKKK 23
K + + + KK K+KKK
Sbjct: 378 KAAQARLRAAKKVKRKKKTSG 398
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 28.5 bits (64), Expect = 0.80
Identities = 13/37 (35%), Positives = 19/37 (51%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
K+KKEKK+ K + + K K KK + P + P
Sbjct: 75 KKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP 111
Score = 27.8 bits (62), Expect = 1.7
Identities = 15/28 (53%), Positives = 20/28 (71%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
K+K EKKKKKKK+KK+ K + + K K
Sbjct: 67 KKKSEKKKKKKKEKKEPKSEGETKLGFK 94
Score = 27.0 bits (60), Expect = 3.3
Identities = 12/38 (31%), Positives = 21/38 (55%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPL 40
++KK+KKK+KK+ K + + K K K + + P
Sbjct: 71 EKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPK 108
Score = 26.6 bits (59), Expect = 4.2
Identities = 13/39 (33%), Positives = 16/39 (41%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
+K KKE K + + K K KK KK K P
Sbjct: 76 KKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114
Score = 26.2 bits (58), Expect = 5.3
Identities = 13/32 (40%), Positives = 21/32 (65%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
KK+ +KKKKKKK+KK+ K + + + P +
Sbjct: 67 KKKSEKKKKKKKEKKEPKSEGETKLGFKTPKK 98
Score = 26.2 bits (58), Expect = 6.0
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIP 34
K +K+KKKKK+KK+ K + + K T +
Sbjct: 69 KSEKKKKKKKEKKEPKSEGETKLGFKTPKKSK 100
Score = 25.8 bits (57), Expect = 6.8
Identities = 13/37 (35%), Positives = 19/37 (51%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
K+K +KKKKKKK+KK+ K + + S
Sbjct: 64 NESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSK 100
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 28.7 bits (65), Expect = 0.80
Identities = 14/29 (48%), Positives = 21/29 (72%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
++++KKK+KKK KKKK K + +I KV
Sbjct: 396 APEQKEKKKEKKKNKKKKYKVPRGKIYKV 424
Score = 27.1 bits (61), Expect = 2.9
Identities = 13/33 (39%), Positives = 21/33 (63%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSEL 37
++K+KKK+KKK KKKK ++ + +I L
Sbjct: 393 VGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVL 425
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 28.6 bits (64), Expect = 0.81
Identities = 11/59 (18%), Positives = 23/59 (38%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLTAL 60
R ++ E + K +K+KK+ K ++ + L + P + T + L
Sbjct: 427 RTKRPEPRAAADTKSAAEKQKKRAKEPFEIDFGAPLEEIDFEVIFQPPKANSTLSKSKL 485
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 28.5 bits (64), Expect = 0.82
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKK 25
RKRK KK KK KK+K+K+
Sbjct: 225 RKRKAPKKVAKKVAAAKKRKQKRS 248
>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.
This domain is the C-terminal end of ribosomal L18/L5
proteins.
Length = 93
Score = 27.5 bits (62), Expect = 0.82
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 7 EKKKKKKKKKKKKKKKKKKKRITKVQ 32
+KKK+KK+ K + K+ K++T Q
Sbjct: 60 SRKKKEKKEVKAESKRYNAKKLTLEQ 85
Score = 24.4 bits (54), Expect = 9.3
Identities = 9/20 (45%), Positives = 14/20 (70%)
Query: 3 KRKKEKKKKKKKKKKKKKKK 22
++KKEKK+ K + K+ KK
Sbjct: 61 RKKKEKKEVKAESKRYNAKK 80
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 28.5 bits (64), Expect = 0.83
Identities = 13/19 (68%), Positives = 15/19 (78%)
Query: 3 KRKKEKKKKKKKKKKKKKK 21
KRK+E KK+KKKK KK K
Sbjct: 356 KRKREGDKKQKKKKSKKLK 374
Score = 27.3 bits (61), Expect = 2.1
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 9 KKKKKKKKKKKKKKKKKKRIT 29
K+K++ KK+KKKK KK ++T
Sbjct: 356 KRKREGDKKQKKKKSKKLKLT 376
Score = 27.3 bits (61), Expect = 2.2
Identities = 12/18 (66%), Positives = 15/18 (83%)
Query: 2 RKRKKEKKKKKKKKKKKK 19
RKR+ +KK+KKKK KK K
Sbjct: 357 RKREGDKKQKKKKSKKLK 374
Score = 26.6 bits (59), Expect = 4.0
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 7 EKKKKKKKKKKKKKKKKKKK 26
K+K++ KK+KKKK KK K
Sbjct: 355 PKRKREGDKKQKKKKSKKLK 374
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 28.5 bits (64), Expect = 0.90
Identities = 18/72 (25%), Positives = 32/72 (44%), Gaps = 6/72 (8%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVH--PHPTSPS----TDYTTPLT 58
++ + K+ KKKKKKKKKKKKK + + + P+ + ++
Sbjct: 48 QEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEW 107
Query: 59 ALKYPPSAALSQ 70
PP+ +S+
Sbjct: 108 KQTQPPTIPVSK 119
Score = 27.0 bits (60), Expect = 2.6
Identities = 17/64 (26%), Positives = 24/64 (37%), Gaps = 7/64 (10%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ-IPSELSPLGVHPHP------TSPSTDYT 54
++ KKKKKKKKKKKKK + + + + + H T
Sbjct: 51 ENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEWKQT 110
Query: 55 TPLT 58
P T
Sbjct: 111 QPPT 114
Score = 25.4 bits (56), Expect = 8.6
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRITKV 31
+ E K++K++ K+K KKK KK V
Sbjct: 7 ENEVKQQKQQNKQKGTKKKNKKSKKDV 33
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 28.0 bits (63), Expect = 1.2
Identities = 13/35 (37%), Positives = 24/35 (68%), Gaps = 6/35 (17%)
Query: 1 MRKRKKEK------KKKKKKKKKKKKKKKKKKRIT 29
+ K+K+ K +K KK++KK+K K+++K+RI
Sbjct: 647 IEKKKEFKILDYTEEKTKKEEKKEKNKREEKERIE 681
>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
Length = 381
Score = 28.1 bits (63), Expect = 1.2
Identities = 11/30 (36%), Positives = 17/30 (56%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
+ K K +K K ++ +KK KK R+ KV
Sbjct: 342 AQEKATAKGAQKVKNRRARKKAKKARLAKV 371
Score = 27.3 bits (61), Expect = 2.1
Identities = 7/27 (25%), Positives = 13/27 (48%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKR 27
K + +K K ++ +KK KK +
Sbjct: 343 QEKATAKGAQKVKNRRARKKAKKARLA 369
>gnl|CDD|236413 PRK09210, PRK09210, RNA polymerase sigma factor RpoD; Validated.
Length = 367
Score = 28.0 bits (63), Expect = 1.2
Identities = 11/44 (25%), Positives = 22/44 (50%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHP 44
+ +KKE ++ K+ +K KK+ +T +I +L P +
Sbjct: 1 KKTKKKETTTLEQVVKELIEKGKKRGTLTYDEIAEKLIPFELDS 44
>gnl|CDD|218140 pfam04547, Anoctamin, Calcium-activated chloride channel. The
family carries eight putative transmembrane domains,
and, although it has no similarity to other known
channel proteins, it is clearly a calcium-activated
ionic channel. It is expressed in various secretory
epithelia, the retina and sensory neurons, and mediates
receptor-activated chloride currents in diverse
physiological processes.
Length = 449
Score = 27.9 bits (63), Expect = 1.2
Identities = 17/39 (43%), Positives = 27/39 (69%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVH 43
KK +KKK KKKK+K+KK ++ + + +++ SEL P V
Sbjct: 271 KKIRKKKIKKKKEKEKKPEENEILERIENESELPPYDVF 309
>gnl|CDD|178450 PLN02859, PLN02859, glutamine-tRNA ligase.
Length = 788
Score = 28.2 bits (63), Expect = 1.4
Identities = 15/40 (37%), Positives = 20/40 (50%)
Query: 7 EKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHP 46
EK KKKK+K K ++KK EL+P + P P
Sbjct: 188 EKPVKKKKEKPAKVEEKKVAVAAAPPSEEELNPYSIFPQP 227
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 27.2 bits (61), Expect = 1.4
Identities = 10/25 (40%), Positives = 20/25 (80%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKK 26
R+R KE+ +K+K+ +K ++KK K++
Sbjct: 100 RERTKERAEKEKRTRKNREKKFKRR 124
Score = 26.5 bits (59), Expect = 2.7
Identities = 8/25 (32%), Positives = 19/25 (76%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKK 25
+R R++ K++ +K+K+ +K ++KK
Sbjct: 96 LRLRRERTKERAEKEKRTRKNREKK 120
Score = 26.5 bits (59), Expect = 3.2
Identities = 9/27 (33%), Positives = 20/27 (74%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKR 27
K + EK+K+ +K ++KK K+++K++
Sbjct: 102 RTKERAEKEKRTRKNREKKFKRRQKEK 128
Score = 25.3 bits (56), Expect = 6.4
Identities = 7/32 (21%), Positives = 21/32 (65%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
+ R + ++ K++ +K+K+ +K ++K+ + Q
Sbjct: 94 IALRLRRERTKERAEKEKRTRKNREKKFKRRQ 125
>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
beta and delta phosphatidylinositol 3-kinases (PI3Ks).
PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
regulate cell processes such as cell growth,
differentiation, proliferation, and motility. PI3Ks
work on phosphorylation of phosphatidylinositol,
phosphatidylinositide (4)P (PtdIns (4)P),2 or
PtdIns(4,5)P2. Specifically they phosphorylate the D3
hydroxyl group of phosphoinositol lipids on the inositol
ring. There are 3 classes of PI3Ks based on structure,
regulation, and specificity. All classes contain a C2
domain, a PIK domain, and a kinase catalytic domain.
The members here are class I, beta and delta isoforms of
PI3Ks and contain both a Ras-binding domain and a
p85-binding domain. Class II PI3Ks contain both of
these as well as a PX domain, and a C-terminal C2 domain
containing a nuclear localization signal. C2 domains
fold into an 8-standed beta-sandwich that can adopt 2
structural arrangements: Type I and Type II,
distinguished by a circular permutation involving their
N- and C-terminal beta strands. Many C2 domains are
Ca2+-dependent membrane-targeting modules that bind a
wide variety of substances including bind phospholipids,
inositol polyphosphates, and intracellular proteins.
Most C2 domain proteins are either signal transduction
enzymes that contain a single C2 domain, such as protein
kinase C, or membrane trafficking proteins which contain
at least two C2 domains, such as synaptotagmin 1.
However, there are a few exceptions to this including
RIM isoforms and some splice variants of piccolo/aczonin
and intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. Members have a type-I
topology.
Length = 173
Score = 27.7 bits (62), Expect = 1.4
Identities = 10/21 (47%), Positives = 14/21 (66%)
Query: 7 EKKKKKKKKKKKKKKKKKKKR 27
E KK K K+ +K + KKKK+
Sbjct: 84 EVSKKAKGKRSRKNQTKKKKK 104
Score = 27.3 bits (61), Expect = 1.6
Identities = 11/22 (50%), Positives = 14/22 (63%)
Query: 8 KKKKKKKKKKKKKKKKKKKRIT 29
KK K K+ +K + KKKKKK
Sbjct: 87 KKAKGKRSRKNQTKKKKKKDDN 108
Score = 26.5 bits (59), Expect = 3.0
Identities = 10/21 (47%), Positives = 14/21 (66%)
Query: 5 KKEKKKKKKKKKKKKKKKKKK 25
+K K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
Score = 26.1 bits (58), Expect = 5.0
Identities = 10/20 (50%), Positives = 14/20 (70%)
Query: 8 KKKKKKKKKKKKKKKKKKKR 27
KK K K+ +K + KKKKK+
Sbjct: 86 SKKAKGKRSRKNQTKKKKKK 105
Score = 26.1 bits (58), Expect = 5.1
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 3 KRKKEKKKKKKKKKKKKKKKK 23
KK K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 27.8 bits (62), Expect = 1.5
Identities = 8/27 (29%), Positives = 10/27 (37%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKR 27
+ +K K K KK KK R
Sbjct: 187 LSSTASSGDEKSPKSKAAPKKAGKKMR 213
>gnl|CDD|205286 pfam13105, DUF3959, Protein of unknown function (DUF3959). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 260 amino acids in length.
Length = 242
Score = 27.5 bits (61), Expect = 1.7
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKKRITKVQIPS 35
+KEK K+ K+KKK+KK K+++P
Sbjct: 151 HRKEKDDNKENINLFKQKKKRKKLSFKIRLPR 182
>gnl|CDD|240545 cd13140, MATE_like_1, Uncharacterized subfamily of the multidrug
and toxic compound extrusion (MATE) proteins. The
integral membrane proteins from the MATE family are
involved in exporting metabolites across the cell
membrane and are responsible for multidrug resistance
(MDR) in many bacteria and animals. MATE has also been
identified as a large multigene family in plants, where
the proteins are linked to disease resistance. A number
of family members are involved in the synthesis of
peptidoglycan components in bacteria.
Length = 435
Score = 27.5 bits (62), Expect = 1.7
Identities = 10/35 (28%), Positives = 17/35 (48%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPS 35
+ KK+K K + K+ +K K I K+ +P
Sbjct: 206 LYALKKKKGLFLKIRILKRPDRKYIKEILKLGLPV 240
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 27.6 bits (62), Expect = 1.8
Identities = 16/76 (21%), Positives = 25/76 (32%), Gaps = 24/76 (31%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKK---------------------RITKVQIPSELSPL 40
K+ +E+K K++ K K R+ KV+ L
Sbjct: 19 AKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRANRLKKVE---ALRAK 75
Query: 41 GVHPHPTSPSTDYTTP 56
GV P+P +T P
Sbjct: 76 GVEPYPYKFDVTHTAP 91
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 27.3 bits (61), Expect = 1.9
Identities = 15/33 (45%), Positives = 20/33 (60%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQI 33
K+KK+KKKK KK K KK K+R +V+
Sbjct: 254 KVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKA 286
Score = 27.0 bits (60), Expect = 2.9
Identities = 16/31 (51%), Positives = 19/31 (61%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
R + KKKKKKKKK KK K K++T Q
Sbjct: 249 RANPSKVKKKKKKKKKVVHKKYKTKKLTGKQ 279
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 27.2 bits (61), Expect = 1.9
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
+ +KEKKK K++KK K++K K
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKL 120
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 27.4 bits (61), Expect = 2.0
Identities = 15/25 (60%), Positives = 19/25 (76%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
K KK+KK+K + K KKKKKKKK+
Sbjct: 168 KEKKDKKEKMVEPKGSKKKKKKKKK 192
Score = 25.8 bits (57), Expect = 5.3
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKK 25
+K KKEK + K KKKKKKKKKK
Sbjct: 170 KKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 25.4 bits (56), Expect = 8.2
Identities = 16/25 (64%), Positives = 21/25 (84%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKK 26
+ KKEKKKKK+ KK+KK+KK KK+
Sbjct: 151 EEEKKEKKKKKEVKKEKKEKKDKKE 175
>gnl|CDD|222466 pfam13945, NST1, Splicing factor, salt tolerance regulator. NST1
is a family of proteins that seem to be involved,
directly or indirectly, in the salt sensitivity of some
cellular functions in yeast. These proteins also
interact with the splicing factor Msl1p.
Length = 189
Score = 27.2 bits (60), Expect = 2.0
Identities = 22/96 (22%), Positives = 33/96 (34%), Gaps = 3/96 (3%)
Query: 17 KKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLTALKYPPSAALSQHQSLLN 76
K KKKKKK+ + T + S PL+ YP + SQ ++ N
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNVMSTPSAILARPQPLS---YPFGSQQSQQNAVKN 88
Query: 77 NSGRTNFTNKQLTELEKEFHFNKYLTRARRIEIASV 112
+ + + E E F L R + V
Sbjct: 89 SKEKRIWNTSTQEERENIREFWLSLGEEERRSLVKV 124
>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV.
Length = 336
Score = 27.5 bits (61), Expect = 2.1
Identities = 9/26 (34%), Positives = 18/26 (69%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
+ K++ K +K K+ KK+KK+++K
Sbjct: 21 PRPVKDEAKPRKIKRVKKRKKREEKD 46
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 27.3 bits (61), Expect = 2.1
Identities = 12/35 (34%), Positives = 19/35 (54%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
++ ++ KKKK+ K K K+ KKKK+ S
Sbjct: 79 QREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSP 113
>gnl|CDD|240309 PTZ00189, PTZ00189, 60S ribosomal protein L21; Provisional.
Length = 160
Score = 26.9 bits (60), Expect = 2.1
Identities = 11/50 (22%), Positives = 18/50 (36%), Gaps = 5/50 (10%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQ-----IPSELSPLGVHPHPT 47
KR K + KKK + K++ + + P + V P P
Sbjct: 107 KRVAANDAIKAEAKKKGELPSTKRQPAQPRPAGFVKPKPTEVVTVRPLPF 156
Score = 25.3 bits (56), Expect = 8.1
Identities = 15/51 (29%), Positives = 19/51 (37%), Gaps = 1/51 (1%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRI-TKVQIPSELSPLGVHPHPTSPST 51
R R+ K+ K + KKK + TK Q V P PT T
Sbjct: 100 RCREDFLKRVAANDAIKAEAKKKGELPSTKRQPAQPRPAGFVKPKPTEVVT 150
>gnl|CDD|240334 PTZ00258, PTZ00258, GTP-binding protein; Provisional.
Length = 390
Score = 27.2 bits (61), Expect = 2.2
Identities = 15/32 (46%), Positives = 21/32 (65%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
+ KR E KK+KKKKKKK++K + + KV
Sbjct: 159 VEKRLDELTKKRKKKKKKKEEKVELDVLKKVL 190
>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
function prediction only].
Length = 386
Score = 27.1 bits (60), Expect = 2.4
Identities = 10/24 (41%), Positives = 14/24 (58%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKK 25
RKRK ++K K KK KK + +
Sbjct: 358 RKRKGDRKGVSHKAKKGGKKNQAE 381
Score = 26.0 bits (57), Expect = 6.3
Identities = 8/25 (32%), Positives = 11/25 (44%)
Query: 6 KEKKKKKKKKKKKKKKKKKKKRITK 30
K+K +K K KK KK +
Sbjct: 357 ARKRKGDRKGVSHKAKKGGKKNQAE 381
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 27.5 bits (61), Expect = 2.4
Identities = 10/37 (27%), Positives = 16/37 (43%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELS 38
R + + KK ++ KK K+ TK SE +
Sbjct: 1184 RAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETT 1220
>gnl|CDD|215059 PLN00111, PLN00111, accumulation of photosystem one; Provisional.
Length = 399
Score = 27.3 bits (61), Expect = 2.4
Identities = 11/36 (30%), Positives = 21/36 (58%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
K+EKK + K +++ ++KK++ K+Q L P
Sbjct: 48 SKREKKPYPRPMKLLRREAREKKKLRKLQPERPLDP 83
>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal. The yeast member
of this family (Kri1p) is found to be required for 40S
ribosome biogenesis in the nucleolus. This is the
C-terminal domain of the family.
Length = 93
Score = 26.4 bits (59), Expect = 2.5
Identities = 10/24 (41%), Positives = 17/24 (70%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKKR 27
R KE+++K KKK KK + ++ K+
Sbjct: 67 RDKEERRKDKKKYGKKARLREWKK 90
Score = 25.2 bits (56), Expect = 6.4
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKK 26
R +++ +K KKK KK + ++ KKK
Sbjct: 67 RDKEERRKDKKKYGKKARLREWKKK 91
>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694). Family of
hypothetical bacterial proteins.
Length = 278
Score = 27.2 bits (61), Expect = 2.5
Identities = 14/26 (53%), Positives = 19/26 (73%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRI 28
R +E K+KKKKKKKKKKK++ +
Sbjct: 80 DRAEELLKEKKKKKKKKKKKEELREW 105
Score = 26.8 bits (60), Expect = 2.9
Identities = 15/26 (57%), Positives = 20/26 (76%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
R + E+ K+KKKKKKKKKKK++ R
Sbjct: 78 RADRAEELLKEKKKKKKKKKKKEELR 103
Score = 26.0 bits (58), Expect = 5.2
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+R ++ K+KKKKKKKKKKK+ + +
Sbjct: 76 YERADRAEELLKEKKKKKKKKKKKEELRE 104
>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16. Nop16 is a
protein involved in ribosome biogenesis.
Length = 173
Score = 27.0 bits (60), Expect = 2.6
Identities = 12/33 (36%), Positives = 20/33 (60%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIP 34
RK+KK + K +K+ K+K +KK+I + P
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGNP 34
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 26.9 bits (60), Expect = 2.7
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
K +KE+KK K++KK K +K+K +
Sbjct: 95 KAQKEEKKAMSKEEKKAIKAEKEKLEEE 122
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association
with pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 26.1 bits (58), Expect = 2.7
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKK 26
K ++ ++ ++KKKKKKK KKK K
Sbjct: 29 ASKVRRISQRNRRKKKKKKKLKKKSK 54
Score = 25.3 bits (56), Expect = 4.6
Identities = 13/29 (44%), Positives = 21/29 (72%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+ R+ ++ ++KKKKKKK KKK K+ I +
Sbjct: 31 KVRRISQRNRRKKKKKKKLKKKSKEWILR 59
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 26.6 bits (59), Expect = 2.8
Identities = 20/61 (32%), Positives = 24/61 (39%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLTALK 61
K KK+K KKKK K K KK KK + K L S + + L K
Sbjct: 84 WKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRK 143
Query: 62 Y 62
Y
Sbjct: 144 Y 144
Score = 26.6 bits (59), Expect = 3.0
Identities = 15/25 (60%), Positives = 17/25 (68%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKK 26
+K K +KKK KKKK K K KK KK
Sbjct: 82 QKWKWKKKKSKKKKDKDKDKKDDKK 106
Score = 25.4 bits (56), Expect = 9.1
Identities = 14/25 (56%), Positives = 17/25 (68%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKK 26
+++ K KKKK KKKK K K KK K
Sbjct: 81 KQKWKWKKKKSKKKKDKDKDKKDDK 105
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 26.9 bits (60), Expect = 2.9
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKR 27
R + K K K K +K K KKK+KKK+
Sbjct: 1156 QRLKSKTKGKASKLRKPKLKKKEKKKK 1182
Score = 26.2 bits (58), Expect = 5.4
Identities = 13/40 (32%), Positives = 18/40 (45%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLG 41
+ RK + KKK+KKKKK K KK + +
Sbjct: 1168 KLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKR 1207
Score = 26.2 bits (58), Expect = 6.3
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKK 26
+ + K K +K K KKK+KKKKK
Sbjct: 1160 SKTKGKASKLRKPKLKKKEKKKKKSS 1185
Score = 25.8 bits (57), Expect = 7.0
Identities = 14/32 (43%), Positives = 16/32 (50%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKKRITKVQIPS 35
K K K KKK+KKKKK K + V S
Sbjct: 1167 SKLRKPKLKKKEKKKKKSSADKSKKASVVGNS 1198
Score = 25.8 bits (57), Expect = 7.2
Identities = 11/34 (32%), Positives = 18/34 (52%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
K+K +K+ + KKKKK +KK K + +
Sbjct: 1307 KKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTR 1340
Score = 25.8 bits (57), Expect = 9.0
Identities = 10/28 (35%), Positives = 15/28 (53%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
K+++ K K K K K +K K K+ K
Sbjct: 1152 IAKEQRLKSKTKGKASKLRKPKLKKKEK 1179
Score = 25.4 bits (56), Expect = 9.6
Identities = 12/50 (24%), Positives = 22/50 (44%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTD 52
+++ + K K K K +K K KKK++ K + V + +D
Sbjct: 1155 EQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSD 1204
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
This presumed domain is found at the N-terminus of
Ribosomal L30 proteins and has been termed RL30NT or
NUC018.
Length = 71
Score = 25.7 bits (57), Expect = 3.3
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
RKR ++ + K+ KK+ KK +KKKR
Sbjct: 6 RKRNEKLRAKRAKKRAAKKAARKKKR 31
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 25.8 bits (57), Expect = 3.5
Identities = 12/22 (54%), Positives = 14/22 (63%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKK 26
KK+KKKKKKK + K KK
Sbjct: 24 KKKKKKKKKKAEDTAATAKAKK 45
Score = 25.0 bits (55), Expect = 5.7
Identities = 11/23 (47%), Positives = 12/23 (52%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKK 26
KKKKKKKKKK + K
Sbjct: 20 ADLVKKKKKKKKKKAEDTAATAK 42
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 26.5 bits (59), Expect = 3.8
Identities = 15/33 (45%), Positives = 23/33 (69%), Gaps = 2/33 (6%)
Query: 2 RKRKKEKKKKKKKKKKKKKKK--KKKKRITKVQ 32
+R EK + K+KKK+KKK KKKK+I +++
Sbjct: 317 FERDNEKLDAEVKEKKKEKKKEEKKKKQIERLE 349
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 26.4 bits (58), Expect = 3.8
Identities = 11/25 (44%), Positives = 14/25 (56%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
+ KK KK + K K K KK KK+
Sbjct: 146 EVKKPKKGQSPKVPKAPKPKKPKKK 170
Score = 26.0 bits (57), Expect = 6.0
Identities = 10/35 (28%), Positives = 16/35 (45%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
K+ K+ + K K K KK KKK ++ +
Sbjct: 147 VKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMP 181
Score = 25.6 bits (56), Expect = 7.8
Identities = 11/29 (37%), Positives = 14/29 (48%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
K K+ KK + K K K KK K+ V
Sbjct: 145 KEVKKPKKGQSPKVPKAPKPKKPKKKGSV 173
>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein. This model
describes nucleoid occlusion protein, a close homolog to
ParB chromosome partitioning proteins including Spo0J in
Bacillus subtilis. Its gene often is located near the
gene for the Spo0J ortholog. This protein bind a
specific DNA sequence and blocks cytokinesis from
happening until chromosome segregation is complete.
Length = 255
Score = 26.3 bits (59), Expect = 3.9
Identities = 11/22 (50%), Positives = 16/22 (72%)
Query: 9 KKKKKKKKKKKKKKKKKKRITK 30
KK +K +K KKKKK++K +K
Sbjct: 192 KKLLEKPEKPKKKKKRRKGFSK 213
Score = 25.9 bits (58), Expect = 6.0
Identities = 13/26 (50%), Positives = 17/26 (65%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRI 28
K+ EK +K KKKKK++K K RI
Sbjct: 192 KKLLEKPEKPKKKKKRRKGFSKDVRI 217
Score = 25.2 bits (56), Expect = 9.6
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKKRIT 29
++ E+ KK +K +K KKKKK+R
Sbjct: 185 KQTEELIKKLLEKPEKPKKKKKRRKG 210
>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein. All proteins in this family
for which functions are known are used for the
recognition of DNA damage as part of nucleotide excision
repair. This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 172
Score = 26.3 bits (58), Expect = 4.0
Identities = 15/50 (30%), Positives = 27/50 (54%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTD 52
K ++E+ K++ K+KK +KK K+ +R + + VH H P T+
Sbjct: 100 KERREESKEEMKEKKFEKKLKELRRAVRSSEYTNKKEGRVHEHEFGPETN 149
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 26.4 bits (59), Expect = 4.0
Identities = 11/40 (27%), Positives = 23/40 (57%), Gaps = 10/40 (25%)
Query: 1 MRKRKKEKKKKKKKKK----------KKKKKKKKKKRITK 30
+ KKE+KKK++++ +K ++K++KK+ K
Sbjct: 283 EAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
>gnl|CDD|223448 COG0371, GldA, Glycerol dehydrogenase and related enzymes [Energy
production and conversion].
Length = 360
Score = 26.5 bits (59), Expect = 4.2
Identities = 8/20 (40%), Positives = 12/20 (60%), Gaps = 1/20 (5%)
Query: 46 PTSPSTD-YTTPLTALKYPP 64
PT STD T+P+ ++ Y
Sbjct: 115 PTIASTDAITSPVASVIYNG 134
>gnl|CDD|223966 COG1035, FrhB, Coenzyme F420-reducing hydrogenase, beta subunit
[Energy production and conversion].
Length = 332
Score = 26.6 bits (59), Expect = 4.3
Identities = 8/25 (32%), Positives = 14/25 (56%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKKRI 28
R EK KKK K+ K+ +++ +
Sbjct: 301 RLVEKLAKKKLKRAAKEPEERVEFA 325
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 26.3 bits (59), Expect = 4.4
Identities = 14/36 (38%), Positives = 26/36 (72%), Gaps = 1/36 (2%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRI-TKVQIPS 35
RKR + +KK+KKKKK+K+K+++ ++ +V+ S
Sbjct: 612 ARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLS 647
Score = 25.9 bits (58), Expect = 7.5
Identities = 12/23 (52%), Positives = 15/23 (65%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKK 23
+ K +K KKKKKKK K K K +
Sbjct: 679 LEKIQKPKKKKKKKPKTVKPKPR 701
Score = 25.6 bits (57), Expect = 8.6
Identities = 12/32 (37%), Positives = 17/32 (53%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
M+ + +K +K KKKKKKK K K +
Sbjct: 670 MKMKVPLSDLEKIQKPKKKKKKKPKTVKPKPR 701
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 26.1 bits (58), Expect = 4.5
Identities = 11/26 (42%), Positives = 20/26 (76%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKK 26
K KKE++K++K++ K++KKK K +
Sbjct: 105 RSKEKKEEEKERKRQLKQQKKKAKHR 130
Score = 24.9 bits (55), Expect = 8.9
Identities = 11/29 (37%), Positives = 23/29 (79%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+ K+EKKK+ K+KK+++K++K++ + K
Sbjct: 96 ERNKQEKKKRSKEKKEEEKERKRQLKQQK 124
>gnl|CDD|235798 PRK06416, PRK06416, dihydrolipoamide dehydrogenase; Reviewed.
Length = 462
Score = 26.3 bits (59), Expect = 4.5
Identities = 7/14 (50%), Positives = 10/14 (71%), Gaps = 1/14 (7%)
Query: 34 PSELSPLGVHPHPT 47
P +L+ +HPHPT
Sbjct: 431 PEDLALT-IHPHPT 443
>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
Members of the NSCC2 family have been sequenced from
various yeast, fungal and animals species including
Saccharomyces cerevisiae, Drosophila melanogaster and
Homo sapiens. These proteins are the Sec62 proteins,
believed to be associated with the Sec61 and Sec63
constituents of the general protein secretary systems of
yeast microsomes. They are also the non-selective cation
(NS) channels of the mammalian cytoplasmic membrane. The
yeast Sec62 protein has been shown to be essential for
cell growth. The mammalian NS channel proteins has been
implicated in platelet derived growth factor(PGDF)
dependent single channel current in fibroblasts. These
channels are essentially closed in serum deprived
tissue-culture cells and are specifically opened by
exposure to PDGF. These channels are reported to exhibit
equal selectivity for Na+, K+ and Cs+ with low
permeability to Ca2+, and no permeability to anions
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 232
Score = 26.4 bits (58), Expect = 4.7
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 6 KEKKKKKKKKKKKKKKKKKKKR 27
K+K KKK K KK KKK+ KR
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229
Score = 25.6 bits (56), Expect = 8.1
Identities = 12/22 (54%), Positives = 14/22 (63%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKK 24
K K KKK K KK KKK+ K+
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229
>gnl|CDD|180007 PRK05309, PRK05309, 30S ribosomal protein S11; Validated.
Length = 128
Score = 25.9 bits (58), Expect = 4.7
Identities = 10/16 (62%), Positives = 10/16 (62%)
Query: 14 KKKKKKKKKKKKKRIT 29
KKKK KKK KK I
Sbjct: 1 AKKKKTVKKKVKKNIP 16
>gnl|CDD|220692 pfam10324, 7TM_GPCR_Srw, Serpentine type 7TM GPCR chemoreceptor
Srw. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srw is a solo family
amongst the superfamilies of chemoreceptors.
Chemoperception is one of the central senses of soil
nematodes like C. elegans which are otherwise 'blind'
and 'deaf'. The genes encoding Srw do not appear to be
under as strong an adaptive evolutionary pressure as
those of Srz.
Length = 317
Score = 26.4 bits (59), Expect = 4.8
Identities = 9/25 (36%), Positives = 11/25 (44%)
Query: 6 KEKKKKKKKKKKKKKKKKKKKRITK 30
E +K KK +K K R TK
Sbjct: 217 IELRKAKKSRKNLSSSSNKSDRTTK 241
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 26.3 bits (58), Expect = 4.8
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKK 26
K +K KK KK KKKKKKK+
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKRS 217
Score = 25.5 bits (56), Expect = 8.6
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRITKVQI 33
KK ++KK++KKK KKKK ++ Q+
Sbjct: 59 KKLHSQEKKEEKKKPKKKKVPLQVNPAQL 87
>gnl|CDD|221659 pfam12597, DUF3767, Protein of unknown function (DUF3767). This
family of proteins is found in eukaryotes. Proteins in
this family are typically between 112 and 199 amino
acids in length.
Length = 119
Score = 25.7 bits (57), Expect = 5.0
Identities = 9/26 (34%), Positives = 14/26 (53%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
+R+K + KK + +KK KK R
Sbjct: 90 YRRRKSFQGMKKAVEAMAEKKAKKLR 115
>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
(ETRAMP). These sequences represent a family of
proteins from the malaria parasite Plasmodium
falciparum, several of which have been shown to be
expressed specifically in the ring stage as well as the
rodent parasite Plasmodium yoelii. A homologue from
Plasmodium chabaudi was localized to the
parasitophorous vacuole membrane. Members have an
initial hydrophobic, Phe/Tyr-rich, stretch long enough
to span the membrane, a highly charged region rich in
Lys, a second putative transmembrane region and a
second highly charged, low complexity sequence region.
Some members have up to 100 residues of additional
C-terminal sequence. These genes have been shown to be
found in the sub-telomeric regions of both Plasmodium
falciparum and P. yoelii chromosomes.
Length = 82
Score = 25.3 bits (56), Expect = 5.0
Identities = 15/26 (57%), Positives = 17/26 (65%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRI 28
K+KK KK KK +KKKK KKK I
Sbjct: 30 KKKKLKKLKKIDDDLEKKKKNKKKII 55
>gnl|CDD|241400 cd13246, PH_Scd1, Shape and Conjugation Deficiency 1 Pleckstrin
homology (PH) domain. Fission yeast Scd1 is an
exchange factor for Cdc42 and an effector of Ras1, the
homolog of the human H-Ras. Scd2/Bem1 mediates Cdc42
activation by binding to Scd1/Cdc24 and to Cdc42. Ras1
regulates Scd1/Cdc24/Ral1, which is a putative guanine
nucleotide exchange factor for Cdc42, a member of the
Rho family of Ras-like proteins. Cdc42 then activates
the Shk1/Orb2 protein kinase. Scd1 interacts with Klp5
and Klp6 kinesins to mediate cytokinesis. PH domains
have diverse functions, but in general are involved in
targeting proteins to the appropriate cellular location
or in the interaction with a binding partner. They
share little sequence conservation, but all have a
common fold, which is electrostatically polarized. Less
than 10% of PH domains bind phosphoinositide phosphates
(PIPs) with high affinity and specificity. PH domains
are distinguished from other PIP-binding domains by
their specific high-affinity binding to PIPs with two
vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
or PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved
across all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 148
Score = 25.7 bits (57), Expect = 5.2
Identities = 13/31 (41%), Positives = 17/31 (54%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQI 33
K K+KKKK K K KKKK+ +Q+
Sbjct: 61 KEVKKKKKKSSLKSKSSSSSKKKKKKGPLQL 91
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 26.3 bits (59), Expect = 5.3
Identities = 10/34 (29%), Positives = 20/34 (58%), Gaps = 5/34 (14%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRIT-----KVQI 33
K++ + K+ ++K+KK +K IT KV++
Sbjct: 520 KEQMEGSAPKEPEEKEKKPEKPEITIDDFDKVEL 553
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 26.0 bits (57), Expect = 5.6
Identities = 13/29 (44%), Positives = 15/29 (51%)
Query: 11 KKKKKKKKKKKKKKKKRITKVQIPSELSP 39
K+KKKKK KKKK K + E P
Sbjct: 90 KQKKKKKDKKKKSPKSLLLNFVTVEESKP 118
Score = 26.0 bits (57), Expect = 5.8
Identities = 12/19 (63%), Positives = 14/19 (73%)
Query: 5 KKEKKKKKKKKKKKKKKKK 23
+E K+KKKKK KKKK K
Sbjct: 86 NQENKQKKKKKDKKKKSPK 104
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called
The RES complex, pre-mRNA retention and splicing
complex. Subunits of this complex are not essential for
viability of yeasts but they are required for efficient
splicing in vitro and in vivo. Furthermore,
inactivation of this complex causes pre-mRNA leakage
from the nucleus. Bud13 contains a unique,
phylogenetically conserved C-terminal region of unknown
function.
Length = 141
Score = 25.7 bits (57), Expect = 5.8
Identities = 9/34 (26%), Positives = 25/34 (73%), Gaps = 2/34 (5%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITK--VQI 33
++++++K+++K++K++K++K+K K VQ
Sbjct: 12 IDIEEKREEKEREKEEKERKEEKEKEWGKGLVQK 45
Score = 25.7 bits (57), Expect = 6.3
Identities = 8/25 (32%), Positives = 21/25 (84%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKK 26
R E+K+++K+++K++K++K++K
Sbjct: 10 RIIDIEEKREEKEREKEEKERKEEK 34
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 26.1 bits (58), Expect = 6.4
Identities = 8/23 (34%), Positives = 16/23 (69%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKR 27
+KKK++KK+++K+ K+ R
Sbjct: 177 HSSDSEKKKQRKKQRRKRSKELR 199
Score = 25.7 bits (57), Expect = 7.0
Identities = 7/23 (30%), Positives = 17/23 (73%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKK 26
+ +KKK++KK+++K+ K+ +
Sbjct: 177 HSSDSEKKKQRKKQRRKRSKELR 199
Score = 25.4 bits (56), Expect = 9.5
Identities = 6/21 (28%), Positives = 14/21 (66%)
Query: 7 EKKKKKKKKKKKKKKKKKKKR 27
+KKK++KK+++K+ +
Sbjct: 176 GHSSDSEKKKQRKKQRRKRSK 196
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 25.8 bits (57), Expect = 6.5
Identities = 9/25 (36%), Positives = 11/25 (44%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKK 25
RK KK +K + KKKK
Sbjct: 172 ERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 25.7 bits (57), Expect = 6.7
Identities = 10/26 (38%), Positives = 14/26 (53%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKK 26
+ EKK +KK KK KK K ++
Sbjct: 85 KEIWEAEKKGLEKKAKKAIKKGKDEE 110
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 25.9 bits (57), Expect = 6.9
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKK 25
+RK++KEKK++ K +KK+ +K
Sbjct: 161 LRKKEKEKKEQLKIQKKQSLLEKYG 185
Score = 25.5 bits (56), Expect = 8.7
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKK 25
RKKEK+KK++ K +KK+ +K
Sbjct: 159 ELLRKKEKEKKEQLKIQKKQSLLEK 183
>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1. TT virus (TTV), isolated
initially from a Japanese patient with hepatitis of
unknown aetiology, has since been found to infect both
healthy and diseased individuals and numerous prevalence
studies have raised questions about its role in
unexplained hepatitis. ORF1 is a large 750 residue
protein. The N-terminal half of this protein corresponds
to the capsid protein.
Length = 525
Score = 25.7 bits (57), Expect = 7.0
Identities = 8/24 (33%), Positives = 10/24 (41%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKK 24
+ K K K + K KK K K
Sbjct: 187 LAKHKILIPSLKTRPKGKKYVKIK 210
>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal. This domain
is specific to the N-terminal part of the prp1 splicing
factor, which is involved in mRNA splicing (and possibly
also poly(A)+ RNA nuclear export and cell cycle
progression). This domain is specific to the N terminus
of the RNA splicing factor encoded by prp1. It is
involved in mRNA splicing and possibly also poly(A)and
RNA nuclear export and cell cycle progression.
Length = 131
Score = 25.3 bits (56), Expect = 7.1
Identities = 8/36 (22%), Positives = 24/36 (66%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPL 40
++ +++KK++++K+K++ +K R +I + + L
Sbjct: 72 ERMDERRKKRREQKEKEEIEKYREENPKIQQQFADL 107
>gnl|CDD|221266 pfam11851, DUF3371, Domain of unknown function (DUF3371). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 125 to 142 amino acids in length.
Length = 131
Score = 25.4 bits (56), Expect = 7.1
Identities = 12/75 (16%), Positives = 21/75 (28%), Gaps = 22/75 (29%)
Query: 34 PSELSPLGVHPHPTSPSTDYTTPLTALKYPPSAALSQHQSLLNNSGRTNFTNKQLTELEK 93
P +L P P P+T L N G F++
Sbjct: 38 PQDLYPHSSQPDLPPPTT----------------------LDLNDGTITFSDPLSHFTGS 75
Query: 94 EFHFNKYLTRARRIE 108
F ++ L +++
Sbjct: 76 PFSYSSPLKEESKLD 90
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 25.9 bits (57), Expect = 7.1
Identities = 10/30 (33%), Positives = 18/30 (60%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
E ++ KKK++ + KKKK+ + + VQ
Sbjct: 82 ASSLEAEQAKKKEEAEAKKKKEMEELKAVQ 111
>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein. The PcfK-like protein family
includes the Enterococcus faecalis PcfK protein, which
is functionally uncharacterized. This family of proteins
is found in bacteria and viruses. Proteins in this
family are typically between 137 and 257 amino acids in
length. There are two completely conserved residues (D
and L) that may be functionally important.
Length = 136
Score = 25.4 bits (56), Expect = 7.6
Identities = 9/34 (26%), Positives = 17/34 (50%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPS 35
RK + ++++ +K +K+ KK KK S
Sbjct: 99 RKEALKAYQQEELRKIQKRSKKSKKAEPVQGQLS 132
>gnl|CDD|221550 pfam12366, Casc1, Cancer susceptibility candidate 1. This domain
family is found in eukaryotes, and is typically between
216 and 263 amino acids in length. Casc1 has many SNPs
associated with cancer susceptibility.
Length = 227
Score = 25.4 bits (56), Expect = 7.8
Identities = 6/36 (16%), Positives = 12/36 (33%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPL 40
+ E + K+ K K K + + + L
Sbjct: 74 EAEGARGYVTDIPKEYKAKPVKYLEIPKPIKDPPEL 109
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 25.7 bits (56), Expect = 7.9
Identities = 6/35 (17%), Positives = 15/35 (42%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
+++ KEK K+ K + + + ++ E
Sbjct: 798 KQKPKEKGPNDKEIKIESPSVETEGERCTIKQREE 832
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The
family functions as a highly conserved exonuclease that
is required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing
and export.
Length = 134
Score = 25.4 bits (56), Expect = 7.9
Identities = 7/23 (30%), Positives = 20/23 (86%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKK 24
KRK++++KK +++ K+K+++++
Sbjct: 28 HKRKQQRRKKAQEEAKEKEREER 50
>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
Length = 763
Score = 25.6 bits (56), Expect = 8.2
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
+ K + E + + K KK KK KKKK + K
Sbjct: 233 IEKPQAEAENEAGKSDKKDKKSKKKKVLGK 262
>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824). This
is a repeating domain found in fungal proteins. It is
proline-rich, and the function is not known.
Length = 135
Score = 25.2 bits (55), Expect = 8.2
Identities = 7/50 (14%), Positives = 19/50 (38%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDY 53
K +++++KK ++++++ + P P P Y
Sbjct: 21 HKYSQRRERKKAERERERYRHDHDAYSDSYEEPYDPTPYPPSPPVSDPRY 70
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 25.7 bits (57), Expect = 8.3
Identities = 13/35 (37%), Positives = 16/35 (45%)
Query: 4 RKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELS 38
+K KK K K KKKKKK + K + E
Sbjct: 350 KKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAK 384
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 25.7 bits (57), Expect = 8.3
Identities = 9/31 (29%), Positives = 17/31 (54%), Gaps = 1/31 (3%)
Query: 1 MRKRKKEKKKKKKKKKKKKKK-KKKKKRITK 30
R++KK KK +K+ + + KK+K +
Sbjct: 83 FREKKKYKKDVEKRNRSYRLYLDKKRKELQA 113
>gnl|CDD|215314 PLN02576, PLN02576, protoporphyrinogen oxidase.
Length = 496
Score = 25.7 bits (57), Expect = 8.9
Identities = 7/25 (28%), Positives = 12/25 (48%)
Query: 3 KRKKEKKKKKKKKKKKKKKKKKKKR 27
K +E KK K + + + K K +
Sbjct: 205 KAIQEAKKNPKPEPRDPRLPKPKGQ 229
>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are typically between 137 to
196 amino acids in length.
Length = 149
Score = 25.0 bits (55), Expect = 9.0
Identities = 17/50 (34%), Positives = 21/50 (42%), Gaps = 13/50 (26%)
Query: 2 RKRKK-EKKKKKKKKKKK-----------KKKKKKKKRITKVQ-IPSELS 38
RKR E K+ +KK K K+KK K KR IP +S
Sbjct: 5 RKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASESGIPKVVS 54
>gnl|CDD|236950 PRK11664, PRK11664, ATP-dependent RNA helicase HrpB; Provisional.
Length = 812
Score = 25.7 bits (57), Expect = 9.2
Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 1/33 (3%)
Query: 57 LTALKYPPSAALSQHQSLLNNSGRTNFTNKQLT 89
L+ L PP+AAL+ + LL G + +LT
Sbjct: 375 LSWLDQPPAAALAAAKRLLQQLGALD-GQGRLT 406
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 25.4 bits (56), Expect = 9.3
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKR 27
RK ++E + K+ K KKK KKKR
Sbjct: 7 RKARQELAVQVAKQAKAKKKANKKKR 32
>gnl|CDD|216599 pfam01603, B56, Protein phosphatase 2A regulatory B subunit (B56
family). Protein phosphatase 2A (PP2A) is a major
intracellular protein phosphatase that regulates
multiple aspects of cell growth and metabolism. The
ability of this widely distributed heterotrimeric enzyme
to act on a diverse array of substrates is largely
controlled by the nature of its regulatory B subunit.
There are multiple families of B subunits (See also
pfam01240), this family is called the B56 family.
Length = 405
Score = 25.3 bits (56), Expect = 9.5
Identities = 13/24 (54%), Positives = 18/24 (75%)
Query: 5 KKEKKKKKKKKKKKKKKKKKKKRI 28
K K+K+ K+KKK+KK+KK RI
Sbjct: 377 NKYKEKEAKEKKKEKKRKKTWARI 400
>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and
chromosome partitioning].
Length = 324
Score = 25.6 bits (56), Expect = 9.6
Identities = 10/72 (13%), Positives = 22/72 (30%), Gaps = 5/72 (6%)
Query: 2 RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLTALK 61
RK+ + + K+ K K+ +V + + V +P +
Sbjct: 26 TSRKERSSYFRDRPLKRMKSKRDDDPYDEVADDEGVGEVRVVRKNEAPQFT-----QEHE 80
Query: 62 YPPSAALSQHQS 73
+ Q+Q
Sbjct: 81 AARQSPQHQYQP 92
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 25.4 bits (56), Expect = 9.7
Identities = 10/38 (26%), Positives = 22/38 (57%)
Query: 1 MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELS 38
M++ +KE ++ ++ KK KK ++K++ + EL
Sbjct: 81 MKEFQKEFREAQESGDMKKLKKLQEKQMEMMDDQRELM 118
>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein. PACS-1 is a
cytosolic sorting protein that directs the localisation
of membrane proteins in the trans-Golgi network
(TGN)/endosomal system. PACS-1 connects the clathrin
adaptor AP-1 to acidic cluster sorting motifs contained
in the cytoplasmic domain of cargo proteins such as
furin, the cation-independent mannose-6-phosphate
receptor and in viral proteins such as human
immunodeficiency virus type 1 Nef.
Length = 413
Score = 25.5 bits (56), Expect = 9.9
Identities = 11/35 (31%), Positives = 17/35 (48%)
Query: 6 KEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPL 40
++K KK KK K+K+ +K Q +S L
Sbjct: 331 TKEKNKKVPVMFLGKKPKEKEVESKSQCIEGISRL 365
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.308 0.123 0.331
Gapped
Lambda K H
0.267 0.0760 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,104,611
Number of extensions: 529982
Number of successful extensions: 10880
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6986
Number of HSP's successfully gapped: 1649
Length of query: 123
Length of database: 10,937,602
Length adjustment: 85
Effective length of query: 38
Effective length of database: 7,167,512
Effective search space: 272365456
Effective search space used: 272365456
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 53 (24.1 bits)