RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14571
         (123 letters)



>gnl|CDD|200956 pfam00046, Homeobox, Homeobox domain. 
          Length = 57

 Score = 51.3 bits (124), Expect = 4e-10
 Identities = 21/40 (52%), Positives = 25/40 (62%)

Query: 80  RTNFTNKQLTELEKEFHFNKYLTRARRIEIASVLQLNETQ 119
           RT FT +QL ELEKEF  N+Y +   R E+A  L L E Q
Sbjct: 4   RTTFTPEQLEELEKEFEKNRYPSAEEREELAKKLGLTERQ 43


>gnl|CDD|197696 smart00389, HOX, Homeodomain.  DNA-binding factors that are
           involved in the transcriptional regulation of key
           developmental processes.
          Length = 57

 Score = 46.5 bits (111), Expect = 3e-08
 Identities = 22/40 (55%), Positives = 27/40 (67%)

Query: 80  RTNFTNKQLTELEKEFHFNKYLTRARRIEIASVLQLNETQ 119
           RT+FT +QL ELEKEF  N Y +R  R E+A  L L+E Q
Sbjct: 5   RTSFTPEQLEELEKEFQKNPYPSREEREELAKKLGLSERQ 44


>gnl|CDD|238039 cd00086, homeodomain, Homeodomain;  DNA binding domains involved in
           the transcriptional regulation of key eukaryotic
           developmental processes; may bind to DNA as monomers or
           as homo- and/or heterodimers, in a sequence-specific
           manner.
          Length = 59

 Score = 44.2 bits (105), Expect = 2e-07
 Identities = 22/40 (55%), Positives = 25/40 (62%)

Query: 80  RTNFTNKQLTELEKEFHFNKYLTRARRIEIASVLQLNETQ 119
           RT FT +QL ELEKEF  N Y +R  R E+A  L L E Q
Sbjct: 4   RTRFTPEQLEELEKEFEKNPYPSREEREELAKELGLTERQ 43


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 43.8 bits (104), Expect = 4e-06
 Identities = 15/36 (41%), Positives = 23/36 (63%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELS 38
            ++ EKK+K+K+K K KK+ +  K I K + PS  S
Sbjct: 397 AKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTS 432



 Score = 39.9 bits (94), Expect = 1e-04
 Identities = 13/28 (46%), Positives = 20/28 (71%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
           K+   K+ +KK+K+K+K K KK+ R TK
Sbjct: 393 KKVLAKRAEKKEKEKEKPKVKKRHRDTK 420



 Score = 39.2 bits (92), Expect = 2e-04
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           R  KKEK+K+K K KK+ +  K   +
Sbjct: 399 RAEKKEKEKEKPKVKKRHRDTKNIGK 424



 Score = 37.6 bits (88), Expect = 5e-04
 Identities = 12/25 (48%), Positives = 17/25 (68%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKR 27
             KK   K+ +KK+K+K+K K KKR
Sbjct: 391 PSKKVLAKRAEKKEKEKEKPKVKKR 415



 Score = 36.1 bits (84), Expect = 0.002
 Identities = 11/29 (37%), Positives = 19/29 (65%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
           +K  K  KK   K+ +KK+K+K+K ++ K
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEKEKPKVKK 414



 Score = 36.1 bits (84), Expect = 0.002
 Identities = 9/25 (36%), Positives = 17/25 (68%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKK 26
           ++++KEK K KK+ +  K   K++K
Sbjct: 403 KEKEKEKPKVKKRHRDTKNIGKRRK 427



 Score = 36.1 bits (84), Expect = 0.002
 Identities = 8/26 (30%), Positives = 17/26 (65%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           +K K+++K K KK+ +  K   K+++
Sbjct: 402 KKEKEKEKPKVKKRHRDTKNIGKRRK 427



 Score = 34.1 bits (79), Expect = 0.011
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
             KK  K  KK   K+ +KK+K+K   KV
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKV 412



 Score = 33.4 bits (77), Expect = 0.020
 Identities = 10/29 (34%), Positives = 17/29 (58%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
           K+  +  KK   K+ +KK+K+K+K   K 
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEKEKPKVKK 414



 Score = 33.0 bits (76), Expect = 0.021
 Identities = 7/26 (26%), Positives = 15/26 (57%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKK 26
            ++++K K KK+ +  K   K++K  
Sbjct: 404 EKEKEKPKVKKRHRDTKNIGKRRKPS 429



 Score = 30.7 bits (70), Expect = 0.17
 Identities = 6/25 (24%), Positives = 13/25 (52%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKK 26
           +++ K KK+ +  K   K++K    
Sbjct: 407 KEKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 28.8 bits (65), Expect = 0.77
 Identities = 9/32 (28%), Positives = 17/32 (53%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQI 33
           +   ++K  K  KK   K+ +KK+K   K ++
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKV 412


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 39.6 bits (93), Expect = 1e-04
 Identities = 17/25 (68%), Positives = 23/25 (92%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKR 27
           K+K+E+KK +K+KKKKK+KKK KKR
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKR 408



 Score = 39.6 bits (93), Expect = 1e-04
 Identities = 15/26 (57%), Positives = 24/26 (92%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           +KR+++K +K+KKKKK+KKK KK+K+
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKK 410



 Score = 39.2 bits (92), Expect = 2e-04
 Identities = 17/26 (65%), Positives = 22/26 (84%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           R+ KK +K+KKKKK+KKK KK+KKK 
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKG 412



 Score = 38.0 bits (89), Expect = 4e-04
 Identities = 15/26 (57%), Positives = 24/26 (92%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           +K+++EKK +K+KKKKK+KKK KK++
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRK 409



 Score = 36.9 bits (86), Expect = 0.001
 Identities = 14/25 (56%), Positives = 21/25 (84%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKR 27
            +KK ++KK +K+KKKKK+KKK K+
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKK 407



 Score = 36.9 bits (86), Expect = 0.001
 Identities = 16/25 (64%), Positives = 22/25 (88%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKR 27
           ++K +K+KKKKK+KKK KK+KKK R
Sbjct: 389 EKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 36.9 bits (86), Expect = 0.001
 Identities = 14/26 (53%), Positives = 22/26 (84%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
            K+ +++KKKKK+KKK KK+KKK ++
Sbjct: 389 EKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 33.8 bits (78), Expect = 0.011
 Identities = 14/26 (53%), Positives = 22/26 (84%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKRITK 30
           KK++++KK +K+KKKKK+KKK +  K
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRK 409



 Score = 32.2 bits (74), Expect = 0.039
 Identities = 13/26 (50%), Positives = 20/26 (76%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
            K  K  KKK+++KK +K+KKKKK++
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKKKKRK 402



 Score = 32.2 bits (74), Expect = 0.041
 Identities = 14/25 (56%), Positives = 21/25 (84%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKR 27
           K  K+K+++KK +K+KKKKK+KKK 
Sbjct: 381 KPPKKKREEKKPQKRKKKKKRKKKG 405



 Score = 31.1 bits (71), Expect = 0.10
 Identities = 13/25 (52%), Positives = 19/25 (76%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKR 27
           K K  K  KKK+++KK +K+KKKK+
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKK 400



 Score = 30.7 bits (70), Expect = 0.16
 Identities = 12/26 (46%), Positives = 21/26 (80%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           +++  +  KKK+++KK +K+KKKKKR
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKR 401


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
          This domain is found to the N-terminus of bacterial
          signal peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 35.2 bits (82), Expect = 0.003
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKKRI 28
          +  KK +K K K +KKK KK++ K R+
Sbjct: 73 KAEKKAEKAKAKAEKKKAKKEEPKPRL 99



 Score = 33.3 bits (77), Expect = 0.013
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 3  KRKKEKKKKKKKKKKKKKKKKKKK 26
           +  EK +KK +K K K +KKK K
Sbjct: 68 LKAWEKAEKKAEKAKAKAEKKKAK 91



 Score = 32.9 bits (76), Expect = 0.018
 Identities = 10/29 (34%), Positives = 17/29 (58%)

Query: 4  RKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
           K  +K +KK +K K K +KKK +  + +
Sbjct: 68 LKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96



 Score = 31.3 bits (72), Expect = 0.054
 Identities = 12/29 (41%), Positives = 17/29 (58%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
           K  ++ +KK +K K K +KKK KK   K
Sbjct: 68 LKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96



 Score = 31.3 bits (72), Expect = 0.066
 Identities = 12/26 (46%), Positives = 16/26 (61%), Gaps = 2/26 (7%)

Query: 5  KKEKKK--KKKKKKKKKKKKKKKKRI 28
          KKE K   K +KK +K K K +KK+ 
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKA 90



 Score = 31.3 bits (72), Expect = 0.067
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 3  KRKKEKKKKKKKKKKKKKKKKKKKR 27
          K  K  +K +KK +K K K +KKK 
Sbjct: 66 KELKAWEKAEKKAEKAKAKAEKKKA 90



 Score = 27.1 bits (61), Expect = 2.1
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 8  KKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
          KK+ K  +K +KK +K K +  K +   E
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKE 93


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
          Provisional.
          Length = 330

 Score = 34.4 bits (80), Expect = 0.006
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKKRI 28
           K   + +KKK+K++ K  K K K R+
Sbjct: 66 LKAWHKAQKKKEKQEAKAAKAKSKPRL 92



 Score = 32.5 bits (75), Expect = 0.034
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 5  KKEKKKKKKKKKKKKKKKKKKK 26
          KKE K   K +KKK+K++ K  
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAA 84



 Score = 31.3 bits (72), Expect = 0.088
 Identities = 10/25 (40%), Positives = 14/25 (56%)

Query: 3  KRKKEKKKKKKKKKKKKKKKKKKKR 27
          K  K   K +KKK+K++ K  K K 
Sbjct: 64 KELKAWHKAQKKKEKQEAKAAKAKS 88



 Score = 29.0 bits (66), Expect = 0.58
 Identities = 8/22 (36%), Positives = 14/22 (63%)

Query: 6  KEKKKKKKKKKKKKKKKKKKKR 27
          K++ K   K +KKK+K++ K  
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAA 84



 Score = 28.3 bits (64), Expect = 0.91
 Identities = 9/30 (30%), Positives = 18/30 (60%)

Query: 1  MRKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
          ++    +KK+ K   K +KKK+K++ +  K
Sbjct: 56 LKAALLDKKELKAWHKAQKKKEKQEAKAAK 85


>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
          Length = 977

 Score = 34.2 bits (78), Expect = 0.012
 Identities = 23/70 (32%), Positives = 33/70 (47%), Gaps = 6/70 (8%)

Query: 6   KEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLTALKYPPS 65
           K + +  K KK KKKK  KKK   K QIP E            PS D   PL+ +   P 
Sbjct: 115 KNRVESWKDKKNKKKKSAKKKEAHKAQIPPE------QQMEEKPSADAYEPLSRVIPIPR 168

Query: 66  AALSQHQSLL 75
             L+ +++++
Sbjct: 169 NKLTPYRAVI 178


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 32.5 bits (74), Expect = 0.039
 Identities = 16/24 (66%), Positives = 18/24 (75%)

Query: 10  KKKKKKKKKKKKKKKKKRITKVQI 33
           +KKKKKKK K+KKKKK R    QI
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQI 707



 Score = 31.4 bits (71), Expect = 0.095
 Identities = 14/25 (56%), Positives = 19/25 (76%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKK 26
             RKK+KKKK K+KKKKK ++  K+
Sbjct: 682 GSRKKKKKKKSKEKKKKKNREASKQ 706



 Score = 31.4 bits (71), Expect = 0.11
 Identities = 14/24 (58%), Positives = 20/24 (83%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKRI 28
           +K+KKKKK K+KKKKK ++  K+I
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQI 707



 Score = 30.6 bits (69), Expect = 0.20
 Identities = 13/24 (54%), Positives = 19/24 (79%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKK 24
            RK+KK+KK K+KKKKK ++  K+
Sbjct: 683 SRKKKKKKKSKEKKKKKNREASKQ 706


>gnl|CDD|236555 PRK09537, pylS, pyrolysyl-tRNA synthetase; Reviewed.
          Length = 417

 Score = 32.5 bits (74), Expect = 0.039
 Identities = 19/86 (22%), Positives = 36/86 (41%), Gaps = 3/86 (3%)

Query: 11  KKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLTALKYPPSAALSQ 70
             KKKK   K   +  +  +  +P++    G  P P+ P +       A    PS    +
Sbjct: 101 PTKKKKAMPKSVVRAPKPLENPVPAQAESSGSKPVPSIPVSTPEVKAPAPALTPS-QKDR 159

Query: 71  HQSLLNNSGRTNFTNK--QLTELEKE 94
            ++LL+   + +  ++  +  ELE E
Sbjct: 160 LETLLSPKDKISLNSEKPKFKELESE 185


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 32.5 bits (75), Expect = 0.044
 Identities = 11/37 (29%), Positives = 25/37 (67%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELS 38
            K ++ ++ +++++KKKKK+K  +K   +V IP  ++
Sbjct: 147 GKGRRRRRGRRRRRKKKKKQKPTEKIPREVVIPETIT 183


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 32.2 bits (74), Expect = 0.050
 Identities = 13/36 (36%), Positives = 23/36 (63%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
             K+++E+KK+KKKK    KKK++++   K +   E
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454



 Score = 30.7 bits (70), Expect = 0.14
 Identities = 13/36 (36%), Positives = 21/36 (58%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
               K EKK++++KK+KKKK    KK+  + +   E
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKE 449



 Score = 30.7 bits (70), Expect = 0.16
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
           K+  EK +KK++++KK+KKKK      K 
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKE 441



 Score = 30.3 bits (69), Expect = 0.21
 Identities = 12/55 (21%), Positives = 24/55 (43%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTP 56
            ++K+E++KK+KKKK    KKK+++   + +   E               +    
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473



 Score = 29.5 bits (67), Expect = 0.37
 Identities = 10/37 (27%), Positives = 21/37 (56%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
           K  ++ +KK++++KK+KKKK    +  + +   E   
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEK 450



 Score = 29.1 bits (66), Expect = 0.62
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
           K KK  +K +KK++++KK+KKKK    K 
Sbjct: 411 KIKKIVEKAEKKREEEKKEKKKKAFAGKK 439



 Score = 28.7 bits (65), Expect = 0.79
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
              KK  KK KK  +K +KK++++K+  K
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKK 431



 Score = 28.7 bits (65), Expect = 0.83
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
            +K  KK KK  +K +KK++++KK   K
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKK 432



 Score = 28.3 bits (64), Expect = 0.97
 Identities = 10/29 (34%), Positives = 20/29 (68%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
             +K +K  +K +KK++++KK+KKK+   
Sbjct: 408 ATKKIKKIVEKAEKKREEEKKEKKKKAFA 436



 Score = 28.3 bits (64), Expect = 1.00
 Identities = 10/38 (26%), Positives = 21/38 (55%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
           +K  K+ KK  +K +KK++++KK+K+        +   
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEE 443


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 31.4 bits (71), Expect = 0.050
 Identities = 15/23 (65%), Positives = 16/23 (69%)

Query: 4   RKKEKKKKKKKKKKKKKKKKKKK 26
            +K KKKKKK KK KK KK  KK
Sbjct: 97  TEKPKKKKKKSKKTKKPKKSSKK 119



 Score = 29.5 bits (66), Expect = 0.20
 Identities = 13/20 (65%), Positives = 14/20 (70%)

Query: 6   KEKKKKKKKKKKKKKKKKKK 25
           K+KKKK KK KK KK  KK 
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120



 Score = 29.5 bits (66), Expect = 0.22
 Identities = 14/22 (63%), Positives = 15/22 (68%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKK 26
           K +KKKKK KK KK KK  KK 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 29.5 bits (66), Expect = 0.23
 Identities = 14/22 (63%), Positives = 15/22 (68%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKK 24
           K KK+KKK KK KK KK  KK 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 28.7 bits (64), Expect = 0.49
 Identities = 13/19 (68%), Positives = 14/19 (73%)

Query: 9   KKKKKKKKKKKKKKKKKKR 27
           KKKKKK KK KK KK  K+
Sbjct: 101 KKKKKKSKKTKKPKKSSKK 119



 Score = 27.2 bits (60), Expect = 1.7
 Identities = 14/24 (58%), Positives = 17/24 (70%)

Query: 7   EKKKKKKKKKKKKKKKKKKKRITK 30
           E  +K KKKKKK KK KK K+ +K
Sbjct: 95  EPTEKPKKKKKKSKKTKKPKKSSK 118



 Score = 26.4 bits (58), Expect = 3.2
 Identities = 14/21 (66%), Positives = 14/21 (66%)

Query: 10  KKKKKKKKKKKKKKKKKRITK 30
           K KKKKKK KK KK KK   K
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKK 119



 Score = 25.6 bits (56), Expect = 4.9
 Identities = 11/20 (55%), Positives = 14/20 (70%)

Query: 2   RKRKKEKKKKKKKKKKKKKK 21
           +K+KK+ KK KK KK  KK 
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 32.1 bits (73), Expect = 0.053
 Identities = 15/47 (31%), Positives = 18/47 (38%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTS 48
              K +  K  KKK K K KK  KKR   +  PS      +   P  
Sbjct: 758 ASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSPALAAMIGAEPVG 804



 Score = 29.0 bits (65), Expect = 0.61
 Identities = 14/37 (37%), Positives = 18/37 (48%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
           K+K    K K  K  KKK K K K+ TK +     +P
Sbjct: 754 KKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTP 790



 Score = 27.1 bits (60), Expect = 3.0
 Identities = 11/41 (26%), Positives = 14/41 (34%)

Query: 4   RKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHP 44
           + K+K    K K  K  KKK K +  K             P
Sbjct: 752 KAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSP 792



 Score = 26.7 bits (59), Expect = 3.7
 Identities = 11/34 (32%), Positives = 17/34 (50%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
           ++K + KKK    K K  K  KKK   K +  ++
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTK 781


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 31.9 bits (73), Expect = 0.057
 Identities = 13/35 (37%), Positives = 26/35 (74%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
           ++  KE+K+ +K KK+++KKKK+ +++ K +IP  
Sbjct: 560 KEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPA 594



 Score = 30.0 bits (68), Expect = 0.27
 Identities = 11/31 (35%), Positives = 24/31 (77%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
           ++ K+  K++K+ +K KK+++KKKK + K++
Sbjct: 557 KEEKEALKEQKRLRKLKKQEEKKKKELEKLE 587



 Score = 30.0 bits (68), Expect = 0.29
 Identities = 10/28 (35%), Positives = 22/28 (78%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
           +R+KE+K+  K++K+ +K KK++++  K
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKK 581



 Score = 27.3 bits (61), Expect = 2.1
 Identities = 8/27 (29%), Positives = 22/27 (81%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKR 27
            R++++++  K++K+ +K KK+++KK+
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKK 580



 Score = 25.8 bits (57), Expect = 8.7
 Identities = 7/28 (25%), Positives = 20/28 (71%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
           K+E +++K++K+  K++K+ +K   + +
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEE 577


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 32.1 bits (73), Expect = 0.057
 Identities = 15/53 (28%), Positives = 21/53 (39%)

Query: 6   KEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLT 58
             K K K +K KKK    K     K ++ S    L  + +    S    +PLT
Sbjct: 115 MRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKPKSISIHSPLT 167



 Score = 26.3 bits (58), Expect = 4.5
 Identities = 11/36 (30%), Positives = 14/36 (38%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELS 38
           K+K    K   KKKKK    K +        P  +S
Sbjct: 126 KKKITVNKSTNKKKKKVLSSKDELIKYDNNKPKSIS 161


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 31.5 bits (72), Expect = 0.067
 Identities = 13/28 (46%), Positives = 18/28 (64%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
           KRK +++ KKKK K+ K  K  KK + K
Sbjct: 251 KRKSKEEIKKKKPKESKGVKALKKVVAK 278



 Score = 30.4 bits (69), Expect = 0.16
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKK 26
            +++ KE+ KKKK K+ K  K  KK 
Sbjct: 250 KKRKSKEEIKKKKPKESKGVKALKKV 275



 Score = 28.5 bits (64), Expect = 0.81
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
           + +++ KKKK K+ K  K  KK   +  K
Sbjct: 253 KSKEEIKKKKPKESKGVKALKKVVAKGMK 281



 Score = 28.1 bits (63), Expect = 1.3
 Identities = 13/30 (43%), Positives = 16/30 (53%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
           R  KK K K++ KKKK K+ K  K     V
Sbjct: 247 RAEKKRKSKEEIKKKKPKESKGVKALKKVV 276



 Score = 27.7 bits (62), Expect = 1.6
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
           K++K K++ KKKK K+ K  K  K++   
Sbjct: 250 KKRKSKEEIKKKKPKESKGVKALKKVVAK 278



 Score = 26.9 bits (60), Expect = 2.7
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
             + +K++K K++ KKKK K+ K  K + KV
Sbjct: 245 ESRAEKKRKSKEEIKKKKPKESKGVKALKKV 275



 Score = 26.9 bits (60), Expect = 3.2
 Identities = 11/33 (33%), Positives = 19/33 (57%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPS 35
            R ++K+K K++ KKKK K+ K  +  K  +  
Sbjct: 246 SRAEKKRKSKEEIKKKKPKESKGVKALKKVVAK 278



 Score = 25.4 bits (56), Expect = 8.8
 Identities = 10/24 (41%), Positives = 12/24 (50%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKK 24
            +K+ KE K  K  KK   K  KK
Sbjct: 259 KKKKPKESKGVKALKKVVAKGMKK 282


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 31.4 bits (72), Expect = 0.083
 Identities = 13/42 (30%), Positives = 19/42 (45%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVH 43
           +K +KE+ +K   KKK +   KK K         +  PLG  
Sbjct: 423 KKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDPLGEK 464



 Score = 29.5 bits (67), Expect = 0.45
 Identities = 10/24 (41%), Positives = 18/24 (75%)

Query: 4   RKKEKKKKKKKKKKKKKKKKKKKR 27
           RKK++K +KK +K++ +K   KK+
Sbjct: 415 RKKQRKAEKKAEKEEAEKAAAKKK 438



 Score = 29.5 bits (67), Expect = 0.45
 Identities = 10/25 (40%), Positives = 19/25 (76%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKR 27
           +RKK +KK++K +KK +K++ +K  
Sbjct: 410 ERKKLRKKQRKAEKKAEKEEAEKAA 434



 Score = 29.1 bits (66), Expect = 0.60
 Identities = 10/29 (34%), Positives = 19/29 (65%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
           +K++K +KK +K++ +K   KKK +   K
Sbjct: 416 KKQRKAEKKAEKEEAEKAAAKKKAEAAAK 444



 Score = 28.0 bits (63), Expect = 1.3
 Identities = 9/20 (45%), Positives = 15/20 (75%)

Query: 5   KKEKKKKKKKKKKKKKKKKK 24
             E+KK +KK++K +KK +K
Sbjct: 408 PAERKKLRKKQRKAEKKAEK 427



 Score = 28.0 bits (63), Expect = 1.3
 Identities = 9/30 (30%), Positives = 19/30 (63%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
           +RK++++ +KK +K++ +K   KKK     
Sbjct: 414 LRKKQRKAEKKAEKEEAEKAAAKKKAEAAA 443



 Score = 27.6 bits (62), Expect = 1.8
 Identities = 8/26 (30%), Positives = 19/26 (73%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           RK+ ++K++K +KK +K++ +K   +
Sbjct: 411 RKKLRKKQRKAEKKAEKEEAEKAAAK 436



 Score = 26.4 bits (59), Expect = 4.4
 Identities = 8/30 (26%), Positives = 19/30 (63%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
           K +K+++K +KK +K++ +K   K+  +  
Sbjct: 413 KLRKKQRKAEKKAEKEEAEKAAAKKKAEAA 442



 Score = 26.4 bits (59), Expect = 5.1
 Identities = 8/20 (40%), Positives = 15/20 (75%)

Query: 7   EKKKKKKKKKKKKKKKKKKK 26
             ++KK +KK++K +KK +K
Sbjct: 408 PAERKKLRKKQRKAEKKAEK 427



 Score = 25.7 bits (57), Expect = 8.9
 Identities = 8/20 (40%), Positives = 15/20 (75%)

Query: 6   KEKKKKKKKKKKKKKKKKKK 25
             ++KK +KK++K +KK +K
Sbjct: 408 PAERKKLRKKQRKAEKKAEK 427



 Score = 25.7 bits (57), Expect = 9.3
 Identities = 7/20 (35%), Positives = 15/20 (75%)

Query: 8   KKKKKKKKKKKKKKKKKKKR 27
             ++KK +KK++K +KK ++
Sbjct: 408 PAERKKLRKKQRKAEKKAEK 427


>gnl|CDD|223557 COG0481, LepA, Membrane GTPase LepA [Cell envelope biogenesis,
           outer membrane].
          Length = 603

 Score = 31.0 bits (71), Expect = 0.11
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 10  KKKKKKKKKKKKKKKKKRITKVQIPSE 36
           +K+K  +K+K+ KK+ K+I  V+IP E
Sbjct: 566 RKRKLLEKQKEGKKRMKQIGNVEIPQE 592


>gnl|CDD|130460 TIGR01393, lepA, GTP-binding protein LepA.  LepA (GUF1 in
           Saccaromyces) is a GTP-binding membrane protein related
           to EF-G and EF-Tu. Two types of phylogenetic tree,
           rooted by other GTP-binding proteins, suggest that
           eukaryotic homologs (including GUF1 of yeast) originated
           within the bacterial LepA family. The function is
           unknown [Unknown function, General].
          Length = 595

 Score = 31.1 bits (71), Expect = 0.12
 Identities = 12/27 (44%), Positives = 21/27 (77%)

Query: 10  KKKKKKKKKKKKKKKKKRITKVQIPSE 36
           +K+K  +K+K+ KK+ K+I KV++P E
Sbjct: 560 RKRKLLEKQKEGKKRMKQIGKVEVPQE 586


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 31.0 bits (70), Expect = 0.15
 Identities = 13/30 (43%), Positives = 22/30 (73%)

Query: 3  KRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
          +RKK+K++K K+K+ KK K  +K+   K+Q
Sbjct: 19 ERKKKKEEKAKEKELKKLKAAQKEAKAKLQ 48



 Score = 30.3 bits (68), Expect = 0.22
 Identities = 14/31 (45%), Positives = 21/31 (67%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
          RK+KKE+K K+K+ KK K  +K+ K   + Q
Sbjct: 20 RKKKKEEKAKEKELKKLKAAQKEAKAKLQAQ 50



 Score = 25.6 bits (56), Expect = 9.0
 Identities = 11/31 (35%), Positives = 22/31 (70%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
          +K   E++ ++KKKK++K K+K+ K++   Q
Sbjct: 10 KKILTEEELERKKKKEEKAKEKELKKLKAAQ 40


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 30.4 bits (69), Expect = 0.16
 Identities = 14/30 (46%), Positives = 20/30 (66%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
           ++ R +E++ KKKKKKKKKK KK     + 
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNATGSS 201



 Score = 27.7 bits (62), Expect = 1.4
 Identities = 12/39 (30%), Positives = 16/39 (41%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
           + + + +KKKKKKKKK KK           V        
Sbjct: 176 LEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTEL 214



 Score = 26.5 bits (59), Expect = 4.0
 Identities = 12/46 (26%), Positives = 16/46 (34%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTS 48
           K+KK+KKKKK KK        +    + V          V      
Sbjct: 182 KKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKL 227



 Score = 25.8 bits (57), Expect = 7.7
 Identities = 15/47 (31%), Positives = 20/47 (42%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPT 47
             + KK+KKKKKKK KK        +      +P+ELS         
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEA 224


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 30.0 bits (68), Expect = 0.17
 Identities = 17/30 (56%), Positives = 23/30 (76%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
           ++++ K+KK+KKKKKKKKKKK  KK    K
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 29.3 bits (66), Expect = 0.30
 Identities = 17/31 (54%), Positives = 22/31 (70%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQI 33
           K+  ++K K+KK+KKKKKKKKKKK   K   
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAK 132



 Score = 28.1 bits (63), Expect = 0.75
 Identities = 17/28 (60%), Positives = 21/28 (75%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
           K K++K+KKKKKKKKKKK  KK  +  K
Sbjct: 108 KAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 28.1 bits (63), Expect = 0.86
 Identities = 17/25 (68%), Positives = 21/25 (84%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKK 26
           ++  KEK K+KK+KKKKKKKKKKK 
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKT 126



 Score = 28.1 bits (63), Expect = 0.89
 Identities = 17/25 (68%), Positives = 21/25 (84%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKK 26
           +K++K+KKKKKKKKK  KK  KKKK
Sbjct: 111 QKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 26.6 bits (59), Expect = 2.6
 Identities = 17/25 (68%), Positives = 20/25 (80%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKK 25
            +K+KK+KKKKKKKK  KK  KKKK
Sbjct: 111 QKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 25.4 bits (56), Expect = 6.2
 Identities = 15/26 (57%), Positives = 21/26 (80%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           +  K++ K+KK+KKKKKKKKKKK  +
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTSK 128


>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
           SecD/SecF; Reviewed.
          Length = 755

 Score = 30.6 bits (70), Expect = 0.17
 Identities = 12/23 (52%), Positives = 18/23 (78%)

Query: 7   EKKKKKKKKKKKKKKKKKKKRIT 29
           EK++ KKKKK+KK KK + ++I 
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755



 Score = 29.0 bits (66), Expect = 0.52
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 10  KKKKKKKKKKKKKKKKKRITKV 31
           +K++ KKKKK+KK KK  + K+
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKI 754



 Score = 29.0 bits (66), Expect = 0.58
 Identities = 10/21 (47%), Positives = 17/21 (80%)

Query: 6   KEKKKKKKKKKKKKKKKKKKK 26
           ++++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 28.3 bits (64), Expect = 1.2
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKK 22
           + KR+ +KKKK+KK KK + +K
Sbjct: 732 LEKRRLKKKKKRKKVKKWEVEK 753



 Score = 27.9 bits (63), Expect = 1.5
 Identities = 10/21 (47%), Positives = 17/21 (80%)

Query: 3   KRKKEKKKKKKKKKKKKKKKK 23
           ++++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 27.9 bits (63), Expect = 1.6
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 5   KKEKKKKKKKKKKKKKKKKK 24
           K+  KKKKK+KK KK + +K
Sbjct: 734 KRRLKKKKKRKKVKKWEVEK 753



 Score = 27.5 bits (62), Expect = 2.0
 Identities = 11/21 (52%), Positives = 16/21 (76%)

Query: 5   KKEKKKKKKKKKKKKKKKKKK 25
           +K + KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753


>gnl|CDD|235462 PRK05433, PRK05433, GTP-binding protein LepA; Provisional.
          Length = 600

 Score = 30.4 bits (70), Expect = 0.19
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 10  KKKKKKKKKKKKKKKKKRITKVQIPSE 36
           +K+K  +K+K+ KK+ K+I  V+IP E
Sbjct: 563 RKRKLLEKQKEGKKRMKQIGNVEIPQE 589


>gnl|CDD|211392 cd11380, Ribosomal_S8e_like, Eukaryotic/archaeal ribosomal
          protein S8e and similar proteins.  This family contains
          the eukaryotic/archaeal ribosomal protein S8, a
          component of the small ribosomal subunits, as well as
          the NSA2 gene product.
          Length = 138

 Score = 29.8 bits (68), Expect = 0.19
 Identities = 11/29 (37%), Positives = 16/29 (55%)

Query: 3  KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
          KRK    K K  +KK+K +  +K   TK+
Sbjct: 6  KRKATGGKFKVVRKKRKYELGRKPANTKL 34


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 30.2 bits (68), Expect = 0.20
 Identities = 15/27 (55%), Positives = 21/27 (77%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRI 28
           +K+K E  K++KKKKK+KKKKKK+   
Sbjct: 145 KKKKHEDDKERKKKKKEKKKKKKRHSP 171



 Score = 29.8 bits (67), Expect = 0.29
 Identities = 15/26 (57%), Positives = 22/26 (84%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           + +KK+ +  K++KKKKK+KKKKKKR
Sbjct: 143 KHKKKKHEDDKERKKKKKEKKKKKKR 168



 Score = 29.4 bits (66), Expect = 0.36
 Identities = 15/26 (57%), Positives = 22/26 (84%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           +K KK+K +  K++KKKKK+KKKKK+
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKK 167



 Score = 28.3 bits (63), Expect = 0.93
 Identities = 16/30 (53%), Positives = 21/30 (70%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKRITKVQIP 34
           KK KKKK +  K++KKKKK+KK+  K   P
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKRHSP 171


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 30.4 bits (68), Expect = 0.21
 Identities = 17/48 (35%), Positives = 25/48 (52%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTD 52
           KK KK KKK+KK+K+K++ K K+       S L  L   P   +   +
Sbjct: 197 KKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAE 244



 Score = 28.1 bits (62), Expect = 1.2
 Identities = 17/63 (26%), Positives = 29/63 (46%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLTALK 61
           +K KK KKK+KK+K+K++ K KKK+      +   L          + + + +   T   
Sbjct: 197 KKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSG 256

Query: 62  YPP 64
             P
Sbjct: 257 TAP 259



 Score = 28.1 bits (62), Expect = 1.2
 Identities = 9/37 (24%), Positives = 25/37 (67%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
           +++ EK K++KKK++K+K+ +++      +   +++P
Sbjct: 91  RQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAP 127



 Score = 27.7 bits (61), Expect = 1.9
 Identities = 9/25 (36%), Positives = 21/25 (84%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKR 27
           +R +++ +K K++KKK++K+K+ +R
Sbjct: 88  RRHRQRLEKDKREKKKREKEKRGRR 112



 Score = 26.6 bits (58), Expect = 4.1
 Identities = 9/26 (34%), Positives = 21/26 (80%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           R R++ +K K++KKK++K+K+ +++ 
Sbjct: 89  RHRQRLEKDKREKKKREKEKRGRRRH 114



 Score = 26.2 bits (57), Expect = 5.8
 Identities = 10/34 (29%), Positives = 23/34 (67%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
           +R+  ++ +K K++KKK++K+K+ R     + +E
Sbjct: 87  ERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTE 120



 Score = 26.2 bits (57), Expect = 6.5
 Identities = 9/26 (34%), Positives = 21/26 (80%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           R+ ++  +K K++KKK++K+K+ ++R
Sbjct: 88  RRHRQRLEKDKREKKKREKEKRGRRR 113



 Score = 25.4 bits (55), Expect = 9.1
 Identities = 14/26 (53%), Positives = 20/26 (76%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           +K  K KKKK++K+K++KKKKKK   
Sbjct: 275 KKSPKHKKKKQRKEKEEKKKKKKHHH 300


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 30.5 bits (69), Expect = 0.22
 Identities = 11/26 (42%), Positives = 13/26 (50%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           R  KK K K  KK  +K KK   K +
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQ 795



 Score = 29.7 bits (67), Expect = 0.31
 Identities = 9/28 (32%), Positives = 15/28 (53%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRIT 29
            ++K + K  KK  +K KK   K ++I 
Sbjct: 771 GEKKAKPKAAKKDARKAKKPSAKTQKIA 798



 Score = 29.7 bits (67), Expect = 0.38
 Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 4   RKKEKKKKKKKKKKKKKKKKKKKRITKV 31
           R ++K K K  KK  +K KK   +  K+
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQKI 797



 Score = 29.3 bits (66), Expect = 0.43
 Identities = 11/25 (44%), Positives = 13/25 (52%)

Query: 8   KKKKKKKKKKKKKKKKKKKRITKVQ 32
            +KK K K  KK  +K KK   K Q
Sbjct: 771 GEKKAKPKAAKKDARKAKKPSAKTQ 795



 Score = 26.6 bits (59), Expect = 3.9
 Identities = 10/28 (35%), Positives = 13/28 (46%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
            + K  KK  +K KK   K +K    TK
Sbjct: 775 AKPKAAKKDARKAKKPSAKTQKIAAATK 802



 Score = 25.9 bits (57), Expect = 7.6
 Identities = 10/33 (30%), Positives = 14/33 (42%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPS 35
           K+   K KK   K +K     K KR  K ++  
Sbjct: 781 KKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813


>gnl|CDD|197876 smart00792, Agouti, Agouti protein.  The agouti protein regulates
          pigmentation in the mouse hair follicle producing a
          black hair with a subapical yellow band. A highly
          homologous protein agouti signal protein (ASIP) is
          present in humans and is expressed at highest levels in
          adipose tissue where it may play a role in energy
          homeostasis and possibly human pigmentation.
          Length = 124

 Score = 29.5 bits (66), Expect = 0.23
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query: 1  MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
          +   KK KK   ++ +KK  +KK+KK +T V  P   SP
Sbjct: 48 VGLNKKPKKISAEEAEKKLLQKKEKKALTNVLRPEPRSP 86


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 30.1 bits (69), Expect = 0.24
 Identities = 8/30 (26%), Positives = 11/30 (36%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
           ++ +       K    KK   K  R TK Q
Sbjct: 819 EKLRYLADAPAKDPAGKKAAVKFSRKTKQQ 848



 Score = 30.1 bits (69), Expect = 0.27
 Identities = 5/28 (17%), Positives = 9/28 (32%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
            +    KK   K  +K K++        
Sbjct: 829 AKDPAGKKAAVKFSRKTKQQYVASEKDG 856



 Score = 29.8 bits (68), Expect = 0.30
 Identities = 6/29 (20%), Positives = 10/29 (34%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
                 K    KK   K  +K K++ +  
Sbjct: 824 LADAPAKDPAGKKAAVKFSRKTKQQYVAS 852



 Score = 29.4 bits (67), Expect = 0.42
 Identities = 7/26 (26%), Positives = 9/26 (34%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           R       K    KK   K  +K K+
Sbjct: 822 RYLADAPAKDPAGKKAAVKFSRKTKQ 847



 Score = 29.4 bits (67), Expect = 0.42
 Identities = 6/24 (25%), Positives = 11/24 (45%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKK 26
            +K   K  +K K++    +K  K
Sbjct: 834 GKKAAVKFSRKTKQQYVASEKDGK 857



 Score = 29.4 bits (67), Expect = 0.44
 Identities = 5/30 (16%), Positives = 11/30 (36%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
                +    KK   K  +K K++   ++ 
Sbjct: 825 ADAPAKDPAGKKAAVKFSRKTKQQYVASEK 854



 Score = 29.0 bits (66), Expect = 0.69
 Identities = 7/29 (24%), Positives = 9/29 (31%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
            K +K +       K    KK   K   K
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRK 844



 Score = 29.0 bits (66), Expect = 0.70
 Identities = 6/30 (20%), Positives = 10/30 (33%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
           +   +   K    KK   K  +K +   V 
Sbjct: 822 RYLADAPAKDPAGKKAAVKFSRKTKQQYVA 851



 Score = 28.6 bits (65), Expect = 0.85
 Identities = 6/24 (25%), Positives = 11/24 (45%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKK 26
           K+   K  +K K++    +K  K 
Sbjct: 835 KKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 28.2 bits (64), Expect = 1.1
 Identities = 7/27 (25%), Positives = 12/27 (44%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRIT 29
             KK   K  +K K++    +K  + T
Sbjct: 833 AGKKAAVKFSRKTKQQYVASEKDGKAT 859



 Score = 27.8 bits (63), Expect = 1.4
 Identities = 7/28 (25%), Positives = 12/28 (42%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
           K    KK   K  +K K++    ++  K
Sbjct: 830 KDPAGKKAAVKFSRKTKQQYVASEKDGK 857



 Score = 27.4 bits (62), Expect = 2.3
 Identities = 7/34 (20%), Positives = 12/34 (35%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
           K +       K    KK   K  ++  +  + SE
Sbjct: 820 KLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASE 853



 Score = 27.1 bits (61), Expect = 2.5
 Identities = 5/24 (20%), Positives = 11/24 (45%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKK 25
           +K   +  +K K++    +K  K 
Sbjct: 835 KKAAVKFSRKTKQQYVASEKDGKA 858


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
          (DUF2058).  This domain, found in various prokaryotic
          proteins, has no known function.
          Length = 177

 Score = 29.9 bits (68), Expect = 0.24
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 9  KKKKKKKKKKKKKKKKKKRITKVQIPSEL 37
          KKK KK KK+K+K++K+ R        EL
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDEL 42



 Score = 28.0 bits (63), Expect = 0.94
 Identities = 10/20 (50%), Positives = 17/20 (85%)

Query: 7  EKKKKKKKKKKKKKKKKKKK 26
          +KK KK KK+K+K++K+ +K
Sbjct: 14 KKKAKKAKKEKRKQRKQARK 33



 Score = 27.6 bits (62), Expect = 1.5
 Identities = 10/21 (47%), Positives = 16/21 (76%)

Query: 5  KKEKKKKKKKKKKKKKKKKKK 25
          KK KK KK+K+K++K+ +K  
Sbjct: 15 KKAKKAKKEKRKQRKQARKGA 35



 Score = 27.2 bits (61), Expect = 2.0
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 6  KEKKKKKKKKKKKKKKKKKKKRITKV 31
          K+K KK KK+K+K++K+ +K      
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGD 39



 Score = 26.4 bits (59), Expect = 3.2
 Identities = 10/22 (45%), Positives = 17/22 (77%)

Query: 5  KKEKKKKKKKKKKKKKKKKKKK 26
          KK+ KK KK+K+K++K+ +K  
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGA 35



 Score = 25.6 bits (57), Expect = 6.7
 Identities = 8/19 (42%), Positives = 14/19 (73%)

Query: 1  MRKRKKEKKKKKKKKKKKK 19
           +K KKEK+K++K+ +K  
Sbjct: 17 AKKAKKEKRKQRKQARKGA 35



 Score = 25.3 bits (56), Expect = 9.2
 Identities = 9/20 (45%), Positives = 15/20 (75%)

Query: 3  KRKKEKKKKKKKKKKKKKKK 22
          K KK KK+K+K++K+ +K  
Sbjct: 16 KAKKAKKEKRKQRKQARKGA 35


>gnl|CDD|163257 TIGR03425, urea_degr_2, urea carboxylase-associated protein 2.  A
           number of bacteria degrade urea as a nitrogen source by
           the urea carboxylase/allophanate hydrolase pathway,
           which uses biotin and consumes ATP, rather than my means
           of the nickel-dependent enzyme urease. This model
           represents one of a pair of homologous, tandem
           uncharacterized genes found together with the urea
           carboxylase and allophanate hydrolase genes.
          Length = 233

 Score = 29.6 bits (67), Expect = 0.28
 Identities = 14/27 (51%), Positives = 15/27 (55%), Gaps = 3/27 (11%)

Query: 44  PHPTSPSTDYT-TP--LTALKYPPSAA 67
           PHP  P  DY   P  LTA + PP AA
Sbjct: 184 PHPLDPRPDYQPKPVRLTAWRAPPVAA 210


>gnl|CDD|203441 pfam06421, LepA_C, GTP-binding protein LepA C-terminus.  This
           family consists of the C-terminal region of several pro-
           and eukaryotic GTP-binding LepA proteins.
          Length = 108

 Score = 29.1 bits (66), Expect = 0.29
 Identities = 13/27 (48%), Positives = 20/27 (74%)

Query: 10  KKKKKKKKKKKKKKKKKRITKVQIPSE 36
           +KKK  +K+K+ KK+ K+I  V+IP E
Sbjct: 74  RKKKLLEKQKEGKKRMKQIGNVEIPQE 100


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 29.7 bits (67), Expect = 0.31
 Identities = 10/25 (40%), Positives = 18/25 (72%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKR 27
           K K E+ KK++  + ++KKK+KK +
Sbjct: 363 KAKPERDKKERPGRYRRKKKEKKAK 387



 Score = 28.5 bits (64), Expect = 0.88
 Identities = 8/30 (26%), Positives = 19/30 (63%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
             + K ++ KK++  + ++KKK+KK  ++ 
Sbjct: 361 PTKAKPERDKKERPGRYRRKKKEKKAKSER 390



 Score = 28.1 bits (63), Expect = 1.1
 Identities = 7/26 (26%), Positives = 17/26 (65%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           +K +  + ++KKK+KK K +++  + 
Sbjct: 370 KKERPGRYRRKKKEKKAKSERRGLQN 395



 Score = 28.1 bits (63), Expect = 1.3
 Identities = 9/24 (37%), Positives = 18/24 (75%)

Query: 4   RKKEKKKKKKKKKKKKKKKKKKKR 27
           R K+++  + ++KKK+KK K ++R
Sbjct: 368 RDKKERPGRYRRKKKEKKAKSERR 391



 Score = 27.0 bits (60), Expect = 2.7
 Identities = 9/25 (36%), Positives = 18/25 (72%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKR 27
           + KKE+  + ++KKK+KK K +++ 
Sbjct: 368 RDKKERPGRYRRKKKEKKAKSERRG 392



 Score = 27.0 bits (60), Expect = 2.9
 Identities = 8/26 (30%), Positives = 17/26 (65%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           R +K+   + ++KKK+KK K +++  
Sbjct: 368 RDKKERPGRYRRKKKEKKAKSERRGL 393


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 29.7 bits (67), Expect = 0.37
 Identities = 13/27 (48%), Positives = 22/27 (81%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRI 28
            + K +KKKKK+KKK+++ K+++K RI
Sbjct: 741 VEEKTKKKKKKEKKKEEEYKREEKARI 767



 Score = 27.4 bits (61), Expect = 2.7
 Identities = 13/29 (44%), Positives = 23/29 (79%)

Query: 6   KEKKKKKKKKKKKKKKKKKKKRITKVQIP 34
           +EK KKKKKK+KKK+++ K++   +++I 
Sbjct: 742 EEKTKKKKKKEKKKEEEYKREEKARIEIA 770



 Score = 26.6 bits (59), Expect = 3.9
 Identities = 12/27 (44%), Positives = 21/27 (77%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRIT 29
           K KK+KKK+KKK+++ K+++K +  I 
Sbjct: 744 KTKKKKKKEKKKEEEYKREEKARIEIA 770


>gnl|CDD|220440 pfam09848, DUF2075, Uncharacterized conserved protein (DUF2075).
           This domain, found in various prokaryotic proteins
           (including putative ATP/GTP binding proteins), has no
           known function.
          Length = 348

 Score = 29.5 bits (67), Expect = 0.37
 Identities = 8/29 (27%), Positives = 13/29 (44%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
              K E K   +K+K  K+  +   R+ K
Sbjct: 300 DGTKWEDKSAFQKRKDLKEDPEDADRLIK 328


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this
          family (Kri1p) is found to be required for 40S ribosome
          biogenesis in the nucleolus.
          Length = 99

 Score = 28.4 bits (64), Expect = 0.38
 Identities = 8/31 (25%), Positives = 23/31 (74%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
          ++RK+E+K +++++ K+ K  K+++   K++
Sbjct: 1  KERKEEEKAQREEELKRLKNLKREEIEEKLE 31



 Score = 26.5 bits (59), Expect = 2.1
 Identities = 9/29 (31%), Positives = 21/29 (72%)

Query: 3  KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
          K ++E++ K+ K  K+++ ++K ++I KV
Sbjct: 8  KAQREEELKRLKNLKREEIEEKLEKIKKV 36


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 29.1 bits (66), Expect = 0.42
 Identities = 12/38 (31%), Positives = 20/38 (52%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
           R+   EK+K  ++++K K K  +KKR  + Q P     
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHT 159



 Score = 28.7 bits (65), Expect = 0.55
 Identities = 7/25 (28%), Positives = 16/25 (64%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKR 27
           KR+   +K+K  ++++K K K  ++
Sbjct: 121 KREAAGEKEKAPRRERKPKPKAPRK 145



 Score = 28.4 bits (64), Expect = 0.73
 Identities = 12/41 (29%), Positives = 22/41 (53%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGV 42
            K  + ++K K K  +KK+K + +K   +    S++S L V
Sbjct: 129 EKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTV 169



 Score = 28.4 bits (64), Expect = 0.95
 Identities = 10/54 (18%), Positives = 26/54 (48%), Gaps = 1/54 (1%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTS-PSTDYTT 55
           +R +++ KK++   +K+K  +++++        +  P    P P   P +D + 
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166



 Score = 26.4 bits (59), Expect = 3.7
 Identities = 7/35 (20%), Positives = 17/35 (48%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
           ++    +K+K  ++++K K K  +K+        E
Sbjct: 121 KREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 29.3 bits (66), Expect = 0.45
 Identities = 20/66 (30%), Positives = 29/66 (43%), Gaps = 4/66 (6%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLTALKYPP 64
            K K K K K K  KK +++ KR  K   P   SP       T+P+   ++  TA    P
Sbjct: 97  PKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPF----ENTAPARPTSSTATAAASKP 152

Query: 65  SAALSQ 70
             ++S 
Sbjct: 153 VTSVSS 158


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
          subunit.  This is a family of proteins which are
          subunits of the eukaryotic translation initiation
          factor 3 (eIF3). In yeast it is called Hcr1. The
          Saccharomyces cerevisiae protein eIF3j (HCR1) has been
          shown to be required for processing of 20S pre-rRNA and
          binds to 18S rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 29.2 bits (66), Expect = 0.46
 Identities = 9/38 (23%), Positives = 23/38 (60%)

Query: 3  KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPL 40
           + K+  K K ++K+K K++K++K + +++  +    L
Sbjct: 56 AKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDEL 93



 Score = 28.5 bits (64), Expect = 0.68
 Identities = 12/21 (57%), Positives = 12/21 (57%)

Query: 3   KRKKEKKKKKKKKKKKKKKKK 23
           K K EK  K  KKKK K K K
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAK 214



 Score = 28.1 bits (63), Expect = 1.2
 Identities = 11/21 (52%), Positives = 12/21 (57%)

Query: 6   KEKKKKKKKKKKKKKKKKKKK 26
           K K +K  K  KKKK K K K
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAK 214



 Score = 26.9 bits (60), Expect = 2.9
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 8  KKKKKKKKKKKKKKKKKKKRITK 30
          K K KK  K K ++K+K KR  +
Sbjct: 55 KAKAKKALKAKIEEKEKAKREKE 77



 Score = 26.5 bits (59), Expect = 3.7
 Identities = 11/21 (52%), Positives = 12/21 (57%)

Query: 6   KEKKKKKKKKKKKKKKKKKKK 26
            EK K +K  K  KKKK K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAK 212



 Score = 25.8 bits (57), Expect = 6.1
 Identities = 11/36 (30%), Positives = 20/36 (55%)

Query: 5  KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPL 40
          K + KK  K K ++K+K K++K    ++   E +P 
Sbjct: 55 KAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPE 90



 Score = 25.8 bits (57), Expect = 6.5
 Identities = 10/28 (35%), Positives = 14/28 (50%)

Query: 3  KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
          + +K K   K K KK  K K ++K   K
Sbjct: 46 EEEKAKVAAKAKAKKALKAKIEEKEKAK 73



 Score = 25.4 bits (56), Expect = 7.8
 Identities = 9/25 (36%), Positives = 18/25 (72%)

Query: 3  KRKKEKKKKKKKKKKKKKKKKKKKR 27
          K K +K  K K ++K+K K++K+++
Sbjct: 55 KAKAKKALKAKIEEKEKAKREKEEK 79



 Score = 25.4 bits (56), Expect = 7.9
 Identities = 10/30 (33%), Positives = 16/30 (53%)

Query: 3  KRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
          ++++EK K   K K KK  K K +   K +
Sbjct: 44 EKEEEKAKVAAKAKAKKALKAKIEEKEKAK 73


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 29.2 bits (66), Expect = 0.47
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
           + +KEKKK   K++KK  K++K+K    
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 28.8 bits (65), Expect = 0.47
 Identities = 10/28 (35%), Positives = 19/28 (67%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
             EK+  K K++K+++ ++K+K I K Q
Sbjct: 123 LLEKELAKLKREKRRENERKQKEILKEQ 150


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 29.0 bits (65), Expect = 0.54
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKK 26
           KK+K K KK K K KK + KKK
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKK 112



 Score = 28.3 bits (63), Expect = 0.87
 Identities = 13/24 (54%), Positives = 16/24 (66%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKK 26
           K+KK K KK K K KK + KKK +
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAR 114



 Score = 27.9 bits (62), Expect = 1.0
 Identities = 13/24 (54%), Positives = 16/24 (66%)

Query: 4   RKKEKKKKKKKKKKKKKKKKKKKR 27
           +KK+ K KK K K KK + KKK R
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAR 114



 Score = 27.5 bits (61), Expect = 1.7
 Identities = 12/23 (52%), Positives = 15/23 (65%)

Query: 6   KEKKKKKKKKKKKKKKKKKKKRI 28
           K+KK K KK K K KK + KK+ 
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKA 113



 Score = 27.5 bits (61), Expect = 1.8
 Identities = 11/24 (45%), Positives = 16/24 (66%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKK 25
           +K+K + KK K K KK + KKK +
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAR 114



 Score = 27.1 bits (60), Expect = 2.6
 Identities = 12/21 (57%), Positives = 14/21 (66%)

Query: 7   EKKKKKKKKKKKKKKKKKKKR 27
           EKKKK K KK K K KK + +
Sbjct: 90  EKKKKGKSKKNKLKGKKDEDK 110


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 29.0 bits (65), Expect = 0.57
 Identities = 14/23 (60%), Positives = 17/23 (73%)

Query: 6   KEKKKKKKKKKKKKKKKKKKKRI 28
            E  K KK KKKKKKKKK++K +
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRKDL 292



 Score = 27.8 bits (62), Expect = 1.4
 Identities = 13/23 (56%), Positives = 16/23 (69%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKR 27
            +  K KK KKKKKKKKK++K  
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRKDL 292



 Score = 27.8 bits (62), Expect = 1.7
 Identities = 14/21 (66%), Positives = 16/21 (76%)

Query: 1   MRKRKKEKKKKKKKKKKKKKK 21
           M K KK KKKKKKKKK++K  
Sbjct: 272 MVKFKKPKKKKKKKKKRRKDL 292



 Score = 27.0 bits (60), Expect = 2.5
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 6   KEKKKKKKKKKKKKKKKKKKKR 27
              +  K KK KKKKKKKKK+R
Sbjct: 268 DVSEMVKFKKPKKKKKKKKKRR 289


>gnl|CDD|188606 TIGR04091, LTA_dltB, D-alanyl-lipoteichoic acid biosynthesis
           protein DltB.  Members of this protein family are DltB,
           part of a four-gene operon for D-alanyl-lipoteichoic
           acid biosynthesis that is present in the vast majority
           of low-GC Gram-positive organisms. This protein may be
           involved in transport of D-alanine across the plasma
           membrane [Cell envelope, Biosynthesis and degradation of
           murein sacculus and peptidoglycan].
          Length = 380

 Score = 29.0 bits (66), Expect = 0.59
 Identities = 9/17 (52%), Positives = 11/17 (64%)

Query: 15  KKKKKKKKKKKKRITKV 31
           +K KK KK KK + TK 
Sbjct: 343 RKSKKYKKNKKNKWTKA 359



 Score = 27.5 bits (62), Expect = 2.0
 Identities = 8/14 (57%), Positives = 10/14 (71%)

Query: 7   EKKKKKKKKKKKKK 20
           ++K KK KK KK K
Sbjct: 342 QRKSKKYKKNKKNK 355



 Score = 26.7 bits (60), Expect = 3.4
 Identities = 9/20 (45%), Positives = 11/20 (55%)

Query: 14  KKKKKKKKKKKKKRITKVQI 33
           +K KK KK KK K    + I
Sbjct: 343 RKSKKYKKNKKNKWTKALSI 362



 Score = 26.3 bits (59), Expect = 4.1
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 9   KKKKKKKKKKKKK 21
           +K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355



 Score = 26.3 bits (59), Expect = 4.1
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 10  KKKKKKKKKKKKK 22
           +K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355



 Score = 26.3 bits (59), Expect = 4.1
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 11  KKKKKKKKKKKKK 23
           +K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355



 Score = 26.3 bits (59), Expect = 4.1
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 12  KKKKKKKKKKKKK 24
           +K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355



 Score = 26.3 bits (59), Expect = 4.1
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 13  KKKKKKKKKKKKK 25
           +K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region
          covers both the Brf homology II and III regions. This
          region is involved in binding TATA binding protein.
          Length = 95

 Score = 28.0 bits (63), Expect = 0.61
 Identities = 9/37 (24%), Positives = 18/37 (48%)

Query: 3  KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
          + K+ K+K  +     KKKKK+K +  + +     + 
Sbjct: 33 EEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTA 69



 Score = 25.7 bits (57), Expect = 4.2
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
          ++ K++  +     KKKKK+K KKKR   
Sbjct: 35 KELKQKADEGNNSGKKKKKRKAKKKRDEA 63



 Score = 25.7 bits (57), Expect = 4.6
 Identities = 12/35 (34%), Positives = 18/35 (51%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
           K+K ++     KKKKK+K KKK+         +E
Sbjct: 37 LKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAE 71


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 28.4 bits (64), Expect = 0.69
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRIT 29
           ++KK+KKK  KKKK KK      K I+
Sbjct: 181 EKKKKKKKSAKKKKLKKVAAVGMKAIS 207



 Score = 28.0 bits (63), Expect = 1.1
 Identities = 14/31 (45%), Positives = 21/31 (67%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
             + KK+    +++KKKKKKK  KKK++ KV
Sbjct: 168 AARDKKKSNNAEEEKKKKKKKSAKKKKLKKV 198



 Score = 26.5 bits (59), Expect = 3.3
 Identities = 14/30 (46%), Positives = 18/30 (60%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
            +K    +++KKKKKKK  KKKK KK    
Sbjct: 172 KKKSNNAEEEKKKKKKKSAKKKKLKKVAAV 201


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 28.5 bits (64), Expect = 0.70
 Identities = 17/25 (68%), Positives = 20/25 (80%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKK 26
           +KR K +KKK+KKKKKKK KK  KK
Sbjct: 85  KKRAKRQKKKQKKKKKKKAKKGNKK 109



 Score = 27.3 bits (61), Expect = 1.7
 Identities = 14/29 (48%), Positives = 23/29 (79%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
           +++ +EK  KK+ K++KKK+KKKKK+  K
Sbjct: 76  KRKDEEKTAKKRAKRQKKKQKKKKKKKAK 104



 Score = 26.9 bits (60), Expect = 1.9
 Identities = 16/35 (45%), Positives = 22/35 (62%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
           K  KK+ K++KKK+KKKKKKK +    +   E S 
Sbjct: 82  KTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSK 116



 Score = 26.9 bits (60), Expect = 2.3
 Identities = 14/24 (58%), Positives = 20/24 (83%)

Query: 4   RKKEKKKKKKKKKKKKKKKKKKKR 27
           +K+ K++KKK+KKKKKKK KK  +
Sbjct: 85  KKRAKRQKKKQKKKKKKKAKKGNK 108



 Score = 26.5 bits (59), Expect = 2.5
 Identities = 16/25 (64%), Positives = 21/25 (84%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKK 26
            KR+K+K+KKKKKKK KK  KK++K
Sbjct: 88  AKRQKKKQKKKKKKKAKKGNKKEEK 112



 Score = 26.5 bits (59), Expect = 2.7
 Identities = 15/24 (62%), Positives = 20/24 (83%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKK 26
           K  K++ K++KKK+KKKKKKK KK
Sbjct: 82  KTAKKRAKRQKKKQKKKKKKKAKK 105



 Score = 26.5 bits (59), Expect = 3.0
 Identities = 15/23 (65%), Positives = 18/23 (78%)

Query: 4   RKKEKKKKKKKKKKKKKKKKKKK 26
            KK  K++KKK+KKKKKKK KK 
Sbjct: 84  AKKRAKRQKKKQKKKKKKKAKKG 106



 Score = 26.2 bits (58), Expect = 3.7
 Identities = 14/25 (56%), Positives = 20/25 (80%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKK 26
            K++ +++KKK+KKKKKKK KK  K
Sbjct: 84  AKKRAKRQKKKQKKKKKKKAKKGNK 108


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 28.8 bits (65), Expect = 0.75
 Identities = 10/35 (28%), Positives = 12/35 (34%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
              KK+ K   K    K   KKK K        +E
Sbjct: 73  DIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAE 107



 Score = 28.0 bits (63), Expect = 1.3
 Identities = 10/37 (27%), Positives = 14/37 (37%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELS 38
           +K K   K    K   KKK K +     K +  + L 
Sbjct: 77  KKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALD 113


>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
           splicing MRS1.  Members of this family adopt a secondary
           structure consisting of two beta sheets and one alpha
           helix, arranged as a beta-alpha-beta motif. Each beta
           sheet has five strands, arranged in a 32145 order, with
           the second strand being antiparallel to the rest.
           Mitochondrial resolvase Ydc2 is capable of resolving
           Holliday junctions and cleaves DNA after 5'-CT-3' and
           5'-TT-3' sequences. This family also contains the
           mitochondrial RNA-splicing protein MRS1 which is
           involved in the excision of group I introns.
          Length = 254

 Score = 28.5 bits (64), Expect = 0.77
 Identities = 11/28 (39%), Positives = 15/28 (53%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRI 28
             + +   +KKK + KKK  K  KK RI
Sbjct: 146 CERTEILAEKKKPRSKKKSSKNSKKLRI 173



 Score = 26.6 bits (59), Expect = 3.2
 Identities = 8/25 (32%), Positives = 13/25 (52%)

Query: 4   RKKEKKKKKKKKKKKKKKKKKKKRI 28
             ++KK + KKK  K  KK +   +
Sbjct: 152 LAEKKKPRSKKKSSKNSKKLRIDLV 176



 Score = 26.6 bits (59), Expect = 3.3
 Identities = 10/25 (40%), Positives = 13/25 (52%)

Query: 6   KEKKKKKKKKKKKKKKKKKKKRITK 30
            EKKK + KKK  K  KK +  +  
Sbjct: 153 AEKKKPRSKKKSSKNSKKLRIDLVG 177


>gnl|CDD|235501 PRK05559, PRK05559, DNA topoisomerase IV subunit B; Reviewed.
          Length = 631

 Score = 28.5 bits (65), Expect = 0.78
 Identities = 7/19 (36%), Positives = 10/19 (52%)

Query: 1   MRKRKKEKKKKKKKKKKKK 19
            + R +  KK K+KKK   
Sbjct: 380 AQARLRAAKKVKRKKKTSG 398



 Score = 25.8 bits (58), Expect = 8.3
 Identities = 7/21 (33%), Positives = 11/21 (52%)

Query: 3   KRKKEKKKKKKKKKKKKKKKK 23
           K  + + +  KK K+KKK   
Sbjct: 378 KAAQARLRAAKKVKRKKKTSG 398


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 28.5 bits (64), Expect = 0.80
 Identities = 13/37 (35%), Positives = 19/37 (51%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
           K+KKEKK+ K + + K   K  KK     + P +  P
Sbjct: 75  KKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP 111



 Score = 27.8 bits (62), Expect = 1.7
 Identities = 15/28 (53%), Positives = 20/28 (71%)

Query: 3  KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
          K+K EKKKKKKK+KK+ K + + K   K
Sbjct: 67 KKKSEKKKKKKKEKKEPKSEGETKLGFK 94



 Score = 27.0 bits (60), Expect = 3.3
 Identities = 12/38 (31%), Positives = 21/38 (55%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPL 40
           ++KK+KKK+KK+ K + + K   K   K +   +  P 
Sbjct: 71  EKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPK 108



 Score = 26.6 bits (59), Expect = 4.2
 Identities = 13/39 (33%), Positives = 16/39 (41%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
            +K KKE K + + K   K  KK KK   K   P     
Sbjct: 76  KKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114



 Score = 26.2 bits (58), Expect = 5.3
 Identities = 13/32 (40%), Positives = 21/32 (65%)

Query: 5  KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
          KK+ +KKKKKKK+KK+ K + +     + P +
Sbjct: 67 KKKSEKKKKKKKEKKEPKSEGETKLGFKTPKK 98



 Score = 26.2 bits (58), Expect = 6.0
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIP 34
           K +K+KKKKK+KK+ K + + K    T  +  
Sbjct: 69  KSEKKKKKKKEKKEPKSEGETKLGFKTPKKSK 100



 Score = 25.8 bits (57), Expect = 6.8
 Identities = 13/37 (35%), Positives = 19/37 (51%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
              K+K +KKKKKKK+KK+ K +          + S 
Sbjct: 64  NESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSK 100


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 28.7 bits (65), Expect = 0.80
 Identities = 14/29 (48%), Positives = 21/29 (72%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
             ++++KKK+KKK KKKK K  + +I KV
Sbjct: 396 APEQKEKKKEKKKNKKKKYKVPRGKIYKV 424



 Score = 27.1 bits (61), Expect = 2.9
 Identities = 13/33 (39%), Positives = 21/33 (63%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSEL 37
                ++K+KKK+KKK KKKK ++ + +I   L
Sbjct: 393 VGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVL 425


>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2.  This family consists
           of several Barren protein homologues from several
           eukaryotic organisms. In Drosophila Barren (barr) is
           required for sister-chromatid segregation in mitosis.
           barr encodes a novel protein that is present in
           proliferating cells and has homologues in yeast and
           human. Mitotic defects in barr embryos become apparent
           during cycle 16, resulting in a loss of PNS and CNS
           neurons. Centromeres move apart at the
           metaphase-anaphase transition and Cyclin B is degraded,
           but sister chromatids remain connected, resulting in
           chromatin bridging. Barren protein localises to
           chromatin throughout mitosis. Colocalisation and
           biochemical experiments indicate that Barren associates
           with Topoisomerase II throughout mitosis and alters the
           activity of Topoisomerase II. It has been suggested that
           this association is required for proper chromosomal
           segregation by facilitating the decatenation of
           chromatids at anaphase. This family forms one of the
           three non-structural maintenance of chromosomes (SMC)
           subunits of the mitotic condensation complex along with
           Cnd1 and Cnd3.
          Length = 719

 Score = 28.6 bits (64), Expect = 0.81
 Identities = 11/59 (18%), Positives = 23/59 (38%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLTAL 60
           R ++ E +     K   +K+KK+ K   ++   + L  +        P  + T   + L
Sbjct: 427 RTKRPEPRAAADTKSAAEKQKKRAKEPFEIDFGAPLEEIDFEVIFQPPKANSTLSKSKL 485


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 28.5 bits (64), Expect = 0.82
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKK 25
           RKRK  KK  KK    KK+K+K+ 
Sbjct: 225 RKRKAPKKVAKKVAAAKKRKQKRS 248


>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.
          This domain is the C-terminal end of ribosomal L18/L5
          proteins.
          Length = 93

 Score = 27.5 bits (62), Expect = 0.82
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 7  EKKKKKKKKKKKKKKKKKKKRITKVQ 32
           +KKK+KK+ K + K+   K++T  Q
Sbjct: 60 SRKKKEKKEVKAESKRYNAKKLTLEQ 85



 Score = 24.4 bits (54), Expect = 9.3
 Identities = 9/20 (45%), Positives = 14/20 (70%)

Query: 3  KRKKEKKKKKKKKKKKKKKK 22
          ++KKEKK+ K + K+   KK
Sbjct: 61 RKKKEKKEVKAESKRYNAKK 80


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 28.5 bits (64), Expect = 0.83
 Identities = 13/19 (68%), Positives = 15/19 (78%)

Query: 3   KRKKEKKKKKKKKKKKKKK 21
           KRK+E  KK+KKKK KK K
Sbjct: 356 KRKREGDKKQKKKKSKKLK 374



 Score = 27.3 bits (61), Expect = 2.1
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 9   KKKKKKKKKKKKKKKKKKRIT 29
           K+K++  KK+KKKK KK ++T
Sbjct: 356 KRKREGDKKQKKKKSKKLKLT 376



 Score = 27.3 bits (61), Expect = 2.2
 Identities = 12/18 (66%), Positives = 15/18 (83%)

Query: 2   RKRKKEKKKKKKKKKKKK 19
           RKR+ +KK+KKKK KK K
Sbjct: 357 RKREGDKKQKKKKSKKLK 374



 Score = 26.6 bits (59), Expect = 4.0
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 7   EKKKKKKKKKKKKKKKKKKK 26
            K+K++  KK+KKKK KK K
Sbjct: 355 PKRKREGDKKQKKKKSKKLK 374


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 28.5 bits (64), Expect = 0.90
 Identities = 18/72 (25%), Positives = 32/72 (44%), Gaps = 6/72 (8%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVH--PHPTSPS----TDYTTPLT 58
           ++ + K+  KKKKKKKKKKKKK + +    +   P+         +       ++     
Sbjct: 48  QEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEW 107

Query: 59  ALKYPPSAALSQ 70
               PP+  +S+
Sbjct: 108 KQTQPPTIPVSK 119



 Score = 27.0 bits (60), Expect = 2.6
 Identities = 17/64 (26%), Positives = 24/64 (37%), Gaps = 7/64 (10%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ-IPSELSPLGVHPHP------TSPSTDYT 54
             ++  KKKKKKKKKKKKK   +   +     +    +    + H              T
Sbjct: 51  ENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEWKQT 110

Query: 55  TPLT 58
            P T
Sbjct: 111 QPPT 114



 Score = 25.4 bits (56), Expect = 8.6
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 5  KKEKKKKKKKKKKKKKKKKKKKRITKV 31
          + E K++K++ K+K  KKK KK    V
Sbjct: 7  ENEVKQQKQQNKQKGTKKKNKKSKKDV 33


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 28.0 bits (63), Expect = 1.2
 Identities = 13/35 (37%), Positives = 24/35 (68%), Gaps = 6/35 (17%)

Query: 1   MRKRKKEK------KKKKKKKKKKKKKKKKKKRIT 29
           + K+K+ K      +K KK++KK+K K+++K+RI 
Sbjct: 647 IEKKKEFKILDYTEEKTKKEEKKEKNKREEKERIE 681


>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
          Length = 381

 Score = 28.1 bits (63), Expect = 1.2
 Identities = 11/30 (36%), Positives = 17/30 (56%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
            + K   K  +K K ++ +KK KK R+ KV
Sbjct: 342 AQEKATAKGAQKVKNRRARKKAKKARLAKV 371



 Score = 27.3 bits (61), Expect = 2.1
 Identities = 7/27 (25%), Positives = 13/27 (48%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKR 27
             K   +  +K K ++ +KK KK +  
Sbjct: 343 QEKATAKGAQKVKNRRARKKAKKARLA 369


>gnl|CDD|236413 PRK09210, PRK09210, RNA polymerase sigma factor RpoD; Validated.
          Length = 367

 Score = 28.0 bits (63), Expect = 1.2
 Identities = 11/44 (25%), Positives = 22/44 (50%)

Query: 1  MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHP 44
           + +KKE    ++  K+  +K KK+  +T  +I  +L P  +  
Sbjct: 1  KKTKKKETTTLEQVVKELIEKGKKRGTLTYDEIAEKLIPFELDS 44


>gnl|CDD|218140 pfam04547, Anoctamin, Calcium-activated chloride channel.  The
           family carries eight putative transmembrane domains,
           and, although it has no similarity to other known
           channel proteins, it is clearly a calcium-activated
           ionic channel. It is expressed in various secretory
           epithelia, the retina and sensory neurons, and mediates
           receptor-activated chloride currents in diverse
           physiological processes.
          Length = 449

 Score = 27.9 bits (63), Expect = 1.2
 Identities = 17/39 (43%), Positives = 27/39 (69%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVH 43
           KK +KKK KKKK+K+KK ++ + + +++  SEL P  V 
Sbjct: 271 KKIRKKKIKKKKEKEKKPEENEILERIENESELPPYDVF 309


>gnl|CDD|178450 PLN02859, PLN02859, glutamine-tRNA ligase.
          Length = 788

 Score = 28.2 bits (63), Expect = 1.4
 Identities = 15/40 (37%), Positives = 20/40 (50%)

Query: 7   EKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHP 46
           EK  KKKK+K  K ++KK           EL+P  + P P
Sbjct: 188 EKPVKKKKEKPAKVEEKKVAVAAAPPSEEELNPYSIFPQP 227


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 27.2 bits (61), Expect = 1.4
 Identities = 10/25 (40%), Positives = 20/25 (80%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKK 26
           R+R KE+ +K+K+ +K ++KK K++
Sbjct: 100 RERTKERAEKEKRTRKNREKKFKRR 124



 Score = 26.5 bits (59), Expect = 2.7
 Identities = 8/25 (32%), Positives = 19/25 (76%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKK 25
           +R R++  K++ +K+K+ +K ++KK
Sbjct: 96  LRLRRERTKERAEKEKRTRKNREKK 120



 Score = 26.5 bits (59), Expect = 3.2
 Identities = 9/27 (33%), Positives = 20/27 (74%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKR 27
             K + EK+K+ +K ++KK K+++K++
Sbjct: 102 RTKERAEKEKRTRKNREKKFKRRQKEK 128



 Score = 25.3 bits (56), Expect = 6.4
 Identities = 7/32 (21%), Positives = 21/32 (65%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
           +  R + ++ K++ +K+K+ +K ++K+  + Q
Sbjct: 94  IALRLRRERTKERAEKEKRTRKNREKKFKRRQ 125


>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
           beta and delta phosphatidylinositol 3-kinases (PI3Ks).
           PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
           regulate cell processes such as cell growth,
           differentiation, proliferation, and motility.  PI3Ks
           work on phosphorylation of phosphatidylinositol,
           phosphatidylinositide (4)P (PtdIns (4)P),2 or
           PtdIns(4,5)P2. Specifically they phosphorylate the D3
           hydroxyl group of phosphoinositol lipids on the inositol
           ring. There are 3 classes of PI3Ks based on structure,
           regulation, and specificity. All classes contain a C2
           domain, a PIK domain, and a kinase catalytic domain.
           The members here are class I, beta and delta isoforms of
           PI3Ks and contain both a Ras-binding domain and a
           p85-binding domain.  Class II PI3Ks contain both of
           these as well as a PX domain, and a C-terminal C2 domain
           containing a nuclear localization signal.  C2 domains
           fold into an 8-standed beta-sandwich that can adopt 2
           structural arrangements: Type I and Type II,
           distinguished by a circular permutation involving their
           N- and C-terminal beta strands. Many C2 domains are
           Ca2+-dependent membrane-targeting modules that bind a
           wide variety of substances including bind phospholipids,
           inositol polyphosphates, and intracellular proteins.
           Most C2 domain proteins are either signal transduction
           enzymes that contain a single C2 domain, such as protein
           kinase C, or membrane trafficking proteins which contain
           at least two C2 domains, such as synaptotagmin 1.
           However, there are a few exceptions to this including
           RIM isoforms and some splice variants of piccolo/aczonin
           and intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions.  Members have a type-I
           topology.
          Length = 173

 Score = 27.7 bits (62), Expect = 1.4
 Identities = 10/21 (47%), Positives = 14/21 (66%)

Query: 7   EKKKKKKKKKKKKKKKKKKKR 27
           E  KK K K+ +K + KKKK+
Sbjct: 84  EVSKKAKGKRSRKNQTKKKKK 104



 Score = 27.3 bits (61), Expect = 1.6
 Identities = 11/22 (50%), Positives = 14/22 (63%)

Query: 8   KKKKKKKKKKKKKKKKKKKRIT 29
           KK K K+ +K + KKKKKK   
Sbjct: 87  KKAKGKRSRKNQTKKKKKKDDN 108



 Score = 26.5 bits (59), Expect = 3.0
 Identities = 10/21 (47%), Positives = 14/21 (66%)

Query: 5   KKEKKKKKKKKKKKKKKKKKK 25
             +K K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 26.1 bits (58), Expect = 5.0
 Identities = 10/20 (50%), Positives = 14/20 (70%)

Query: 8   KKKKKKKKKKKKKKKKKKKR 27
            KK K K+ +K + KKKKK+
Sbjct: 86  SKKAKGKRSRKNQTKKKKKK 105



 Score = 26.1 bits (58), Expect = 5.1
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 3   KRKKEKKKKKKKKKKKKKKKK 23
             KK K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 27.8 bits (62), Expect = 1.5
 Identities = 8/27 (29%), Positives = 10/27 (37%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKR 27
           +        +K  K K   KK  KK R
Sbjct: 187 LSSTASSGDEKSPKSKAAPKKAGKKMR 213


>gnl|CDD|205286 pfam13105, DUF3959, Protein of unknown function (DUF3959).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 260 amino acids in length.
          Length = 242

 Score = 27.5 bits (61), Expect = 1.7
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 4   RKKEKKKKKKKKKKKKKKKKKKKRITKVQIPS 35
            +KEK   K+     K+KKK+KK   K+++P 
Sbjct: 151 HRKEKDDNKENINLFKQKKKRKKLSFKIRLPR 182


>gnl|CDD|240545 cd13140, MATE_like_1, Uncharacterized subfamily of the multidrug
           and toxic compound extrusion (MATE) proteins.  The
           integral membrane proteins from the MATE family are
           involved in exporting metabolites across the cell
           membrane and are responsible for multidrug resistance
           (MDR) in many bacteria and animals. MATE has also been
           identified as a large multigene family in plants, where
           the proteins are linked to disease resistance. A number
           of family members are involved in the synthesis of
           peptidoglycan components in bacteria.
          Length = 435

 Score = 27.5 bits (62), Expect = 1.7
 Identities = 10/35 (28%), Positives = 17/35 (48%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPS 35
           +   KK+K    K +  K+  +K  K I K+ +P 
Sbjct: 206 LYALKKKKGLFLKIRILKRPDRKYIKEILKLGLPV 240


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 27.6 bits (62), Expect = 1.8
 Identities = 16/76 (21%), Positives = 25/76 (32%), Gaps = 24/76 (31%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKK---------------------RITKVQIPSELSPL 40
           K+ +E+K  K++ K        K                      R+ KV+    L   
Sbjct: 19 AKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRANRLKKVE---ALRAK 75

Query: 41 GVHPHPTSPSTDYTTP 56
          GV P+P      +T P
Sbjct: 76 GVEPYPYKFDVTHTAP 91


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 27.3 bits (61), Expect = 1.9
 Identities = 15/33 (45%), Positives = 20/33 (60%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQI 33
             K+KK+KKKK   KK K KK   K+R  +V+ 
Sbjct: 254 KVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKA 286



 Score = 27.0 bits (60), Expect = 2.9
 Identities = 16/31 (51%), Positives = 19/31 (61%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
           R    + KKKKKKKKK   KK K K++T  Q
Sbjct: 249 RANPSKVKKKKKKKKKVVHKKYKTKKLTGKQ 279


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 27.2 bits (61), Expect = 1.9
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKR 27
           + +KEKKK   K++KK  K++K K 
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKL 120


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 27.4 bits (61), Expect = 2.0
 Identities = 15/25 (60%), Positives = 19/25 (76%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKR 27
           K KK+KK+K  + K  KKKKKKKK+
Sbjct: 168 KEKKDKKEKMVEPKGSKKKKKKKKK 192



 Score = 25.8 bits (57), Expect = 5.3
 Identities = 16/24 (66%), Positives = 18/24 (75%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKK 25
           +K KKEK  + K  KKKKKKKKKK
Sbjct: 170 KKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 25.4 bits (56), Expect = 8.2
 Identities = 16/25 (64%), Positives = 21/25 (84%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKK 26
            + KKEKKKKK+ KK+KK+KK KK+
Sbjct: 151 EEEKKEKKKKKEVKKEKKEKKDKKE 175


>gnl|CDD|222466 pfam13945, NST1, Splicing factor, salt tolerance regulator.  NST1
           is a family of proteins that seem to be involved,
           directly or indirectly, in the salt sensitivity of some
           cellular functions in yeast. These proteins also
           interact with the splicing factor Msl1p.
          Length = 189

 Score = 27.2 bits (60), Expect = 2.0
 Identities = 22/96 (22%), Positives = 33/96 (34%), Gaps = 3/96 (3%)

Query: 17  KKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLTALKYPPSAALSQHQSLLN 76
           K KKKKKK+ + T     +            S       PL+   YP  +  SQ  ++ N
Sbjct: 32  KSKKKKKKRSKATSPSHNASDQSTNNVMSTPSAILARPQPLS---YPFGSQQSQQNAVKN 88

Query: 77  NSGRTNFTNKQLTELEKEFHFNKYLTRARRIEIASV 112
           +  +  +      E E    F   L    R  +  V
Sbjct: 89  SKEKRIWNTSTQEERENIREFWLSLGEEERRSLVKV 124


>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV. 
          Length = 336

 Score = 27.5 bits (61), Expect = 2.1
 Identities = 9/26 (34%), Positives = 18/26 (69%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           +  K++ K +K K+ KK+KK+++K 
Sbjct: 21 PRPVKDEAKPRKIKRVKKRKKREEKD 46


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 27.3 bits (61), Expect = 2.1
 Identities = 12/35 (34%), Positives = 19/35 (54%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
           ++ ++ KKKK+ K K  K+  KKKK+       S 
Sbjct: 79  QREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSP 113


>gnl|CDD|240309 PTZ00189, PTZ00189, 60S ribosomal protein L21; Provisional.
          Length = 160

 Score = 26.9 bits (60), Expect = 2.1
 Identities = 11/50 (22%), Positives = 18/50 (36%), Gaps = 5/50 (10%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQ-----IPSELSPLGVHPHPT 47
           KR       K + KKK +    K++  + +      P     + V P P 
Sbjct: 107 KRVAANDAIKAEAKKKGELPSTKRQPAQPRPAGFVKPKPTEVVTVRPLPF 156



 Score = 25.3 bits (56), Expect = 8.1
 Identities = 15/51 (29%), Positives = 19/51 (37%), Gaps = 1/51 (1%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRI-TKVQIPSELSPLGVHPHPTSPST 51
           R R+   K+       K + KKK +   TK Q         V P PT   T
Sbjct: 100 RCREDFLKRVAANDAIKAEAKKKGELPSTKRQPAQPRPAGFVKPKPTEVVT 150


>gnl|CDD|240334 PTZ00258, PTZ00258, GTP-binding protein; Provisional.
          Length = 390

 Score = 27.2 bits (61), Expect = 2.2
 Identities = 15/32 (46%), Positives = 21/32 (65%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
           + KR  E  KK+KKKKKKK++K +   + KV 
Sbjct: 159 VEKRLDELTKKRKKKKKKKEEKVELDVLKKVL 190


>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
           function prediction only].
          Length = 386

 Score = 27.1 bits (60), Expect = 2.4
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKK 25
           RKRK ++K    K KK  KK + +
Sbjct: 358 RKRKGDRKGVSHKAKKGGKKNQAE 381



 Score = 26.0 bits (57), Expect = 6.3
 Identities = 8/25 (32%), Positives = 11/25 (44%)

Query: 6   KEKKKKKKKKKKKKKKKKKKKRITK 30
             K+K  +K    K KK  KK   +
Sbjct: 357 ARKRKGDRKGVSHKAKKGGKKNQAE 381


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 27.5 bits (61), Expect = 2.4
 Identities = 10/37 (27%), Positives = 16/37 (43%)

Query: 2    RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELS 38
            R   + +    KK  ++  KK   K+ TK    SE +
Sbjct: 1184 RAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETT 1220


>gnl|CDD|215059 PLN00111, PLN00111, accumulation of photosystem one; Provisional.
          Length = 399

 Score = 27.3 bits (61), Expect = 2.4
 Identities = 11/36 (30%), Positives = 21/36 (58%)

Query: 4  RKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSP 39
           K+EKK   +  K  +++ ++KK++ K+Q    L P
Sbjct: 48 SKREKKPYPRPMKLLRREAREKKKLRKLQPERPLDP 83


>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal.  The yeast member
          of this family (Kri1p) is found to be required for 40S
          ribosome biogenesis in the nucleolus. This is the
          C-terminal domain of the family.
          Length = 93

 Score = 26.4 bits (59), Expect = 2.5
 Identities = 10/24 (41%), Positives = 17/24 (70%)

Query: 4  RKKEKKKKKKKKKKKKKKKKKKKR 27
          R KE+++K KKK  KK + ++ K+
Sbjct: 67 RDKEERRKDKKKYGKKARLREWKK 90



 Score = 25.2 bits (56), Expect = 6.4
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKK 26
          R +++ +K KKK  KK + ++ KKK
Sbjct: 67 RDKEERRKDKKKYGKKARLREWKKK 91


>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694).  Family of
           hypothetical bacterial proteins.
          Length = 278

 Score = 27.2 bits (61), Expect = 2.5
 Identities = 14/26 (53%), Positives = 19/26 (73%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRI 28
            R +E  K+KKKKKKKKKKK++ +  
Sbjct: 80  DRAEELLKEKKKKKKKKKKKEELREW 105



 Score = 26.8 bits (60), Expect = 2.9
 Identities = 15/26 (57%), Positives = 20/26 (76%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
           R  + E+  K+KKKKKKKKKKK++ R
Sbjct: 78  RADRAEELLKEKKKKKKKKKKKEELR 103



 Score = 26.0 bits (58), Expect = 5.2
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
            +R    ++  K+KKKKKKKKKKK+ + +
Sbjct: 76  YERADRAEELLKEKKKKKKKKKKKEELRE 104


>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16.  Nop16 is a
          protein involved in ribosome biogenesis.
          Length = 173

 Score = 27.0 bits (60), Expect = 2.6
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIP 34
          RK+KK +    K  +K+ K+K +KK+I  +  P
Sbjct: 2  RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGNP 34


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 26.9 bits (60), Expect = 2.7
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
           K +KE+KK   K++KK  K +K+K   +
Sbjct: 95  KAQKEEKKAMSKEEKKAIKAEKEKLEEE 122


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
          Williams-Beuren syndrome.  This domain family is found
          in eukaryotes, and is typically between 72 and 83 amino
          acids in length. The family is found in association
          with pfam08241. This family is made up of
          S-adenosylmethionine-dependent methyltransferases. The
          proteins are deleted in Williams-Beuren syndrome (WBS),
          a complex developmental disorder with multisystemic
          manifestations including supravalvular aortic stenosis
          (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 26.1 bits (58), Expect = 2.7
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 1  MRKRKKEKKKKKKKKKKKKKKKKKKK 26
            K ++  ++ ++KKKKKKK KKK K
Sbjct: 29 ASKVRRISQRNRRKKKKKKKLKKKSK 54



 Score = 25.3 bits (56), Expect = 4.6
 Identities = 13/29 (44%), Positives = 21/29 (72%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
          + R+  ++ ++KKKKKKK KKK K+ I +
Sbjct: 31 KVRRISQRNRRKKKKKKKLKKKSKEWILR 59


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 26.6 bits (59), Expect = 2.8
 Identities = 20/61 (32%), Positives = 24/61 (39%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLTALK 61
            K KK+K KKKK K K KK  KK  +  K         L       S +    + L   K
Sbjct: 84  WKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRK 143

Query: 62  Y 62
           Y
Sbjct: 144 Y 144



 Score = 26.6 bits (59), Expect = 3.0
 Identities = 15/25 (60%), Positives = 17/25 (68%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKK 26
           +K K +KKK KKKK K K KK  KK
Sbjct: 82  QKWKWKKKKSKKKKDKDKDKKDDKK 106



 Score = 25.4 bits (56), Expect = 9.1
 Identities = 14/25 (56%), Positives = 17/25 (68%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKK 26
           +++ K KKKK KKKK K K KK  K
Sbjct: 81  KQKWKWKKKKSKKKKDKDKDKKDDK 105


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 26.9 bits (60), Expect = 2.9
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 1    MRKRKKEKKKKKKKKKKKKKKKKKKKR 27
             R + K K K  K +K K KKK+KKK+
Sbjct: 1156 QRLKSKTKGKASKLRKPKLKKKEKKKK 1182



 Score = 26.2 bits (58), Expect = 5.4
 Identities = 13/40 (32%), Positives = 18/40 (45%)

Query: 2    RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLG 41
            + RK + KKK+KKKKK    K KK  +       +     
Sbjct: 1168 KLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKR 1207



 Score = 26.2 bits (58), Expect = 6.3
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 1    MRKRKKEKKKKKKKKKKKKKKKKKKK 26
             + + K  K +K K KKK+KKKKK  
Sbjct: 1160 SKTKGKASKLRKPKLKKKEKKKKKSS 1185



 Score = 25.8 bits (57), Expect = 7.0
 Identities = 14/32 (43%), Positives = 16/32 (50%)

Query: 4    RKKEKKKKKKKKKKKKKKKKKKKRITKVQIPS 35
             K  K K KKK+KKKKK    K +   V   S
Sbjct: 1167 SKLRKPKLKKKEKKKKKSSADKSKKASVVGNS 1198



 Score = 25.8 bits (57), Expect = 7.2
 Identities = 11/34 (32%), Positives = 18/34 (52%)

Query: 3    KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
            K+K +K+ +      KKKKK +KK   K +  + 
Sbjct: 1307 KKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTR 1340



 Score = 25.8 bits (57), Expect = 9.0
 Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 3    KRKKEKKKKKKKKKKKKKKKKKKKRITK 30
              K+++ K K K K  K +K K K+  K
Sbjct: 1152 IAKEQRLKSKTKGKASKLRKPKLKKKEK 1179



 Score = 25.4 bits (56), Expect = 9.6
 Identities = 12/50 (24%), Positives = 22/50 (44%)

Query: 3    KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTD 52
            +++ + K K K  K +K K KKK++  K     +     V  +     +D
Sbjct: 1155 EQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSD 1204


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
          This presumed domain is found at the N-terminus of
          Ribosomal L30 proteins and has been termed RL30NT or
          NUC018.
          Length = 71

 Score = 25.7 bits (57), Expect = 3.3
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKKR 27
          RKR ++ + K+ KK+  KK  +KKKR
Sbjct: 6  RKRNEKLRAKRAKKRAAKKAARKKKR 31


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
          non-specific RNA binding region part 2.  This is a
          region found N terminal to the catalytic domain of
          glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
          but not in Escherichia coli. This region is thought to
          bind RNA in a non-specific manner, enhancing
          interactions between the tRNA and enzyme, but is not
          essential for enzyme function.
          Length = 83

 Score = 25.8 bits (57), Expect = 3.5
 Identities = 12/22 (54%), Positives = 14/22 (63%)

Query: 5  KKEKKKKKKKKKKKKKKKKKKK 26
          KK+KKKKKKK +      K KK
Sbjct: 24 KKKKKKKKKKAEDTAATAKAKK 45



 Score = 25.0 bits (55), Expect = 5.7
 Identities = 11/23 (47%), Positives = 12/23 (52%)

Query: 4  RKKEKKKKKKKKKKKKKKKKKKK 26
              KKKKKKKKKK +      K
Sbjct: 20 ADLVKKKKKKKKKKAEDTAATAK 42


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 26.5 bits (59), Expect = 3.8
 Identities = 15/33 (45%), Positives = 23/33 (69%), Gaps = 2/33 (6%)

Query: 2   RKRKKEKKKKKKKKKKKKKKK--KKKKRITKVQ 32
            +R  EK   + K+KKK+KKK  KKKK+I +++
Sbjct: 317 FERDNEKLDAEVKEKKKEKKKEEKKKKQIERLE 349


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 26.4 bits (58), Expect = 3.8
 Identities = 11/25 (44%), Positives = 14/25 (56%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKR 27
           + KK KK +  K  K  K KK KK+
Sbjct: 146 EVKKPKKGQSPKVPKAPKPKKPKKK 170



 Score = 26.0 bits (57), Expect = 6.0
 Identities = 10/35 (28%), Positives = 16/35 (45%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
            K+ K+ +  K  K  K KK KKK  ++   +   
Sbjct: 147 VKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMP 181



 Score = 25.6 bits (56), Expect = 7.8
 Identities = 11/29 (37%), Positives = 14/29 (48%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKV 31
           K  K+ KK +  K  K  K KK K+   V
Sbjct: 145 KEVKKPKKGQSPKVPKAPKPKKPKKKGSV 173


>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein.  This model
           describes nucleoid occlusion protein, a close homolog to
           ParB chromosome partitioning proteins including Spo0J in
           Bacillus subtilis. Its gene often is located near the
           gene for the Spo0J ortholog. This protein bind a
           specific DNA sequence and blocks cytokinesis from
           happening until chromosome segregation is complete.
          Length = 255

 Score = 26.3 bits (59), Expect = 3.9
 Identities = 11/22 (50%), Positives = 16/22 (72%)

Query: 9   KKKKKKKKKKKKKKKKKKRITK 30
           KK  +K +K KKKKK++K  +K
Sbjct: 192 KKLLEKPEKPKKKKKRRKGFSK 213



 Score = 25.9 bits (58), Expect = 6.0
 Identities = 13/26 (50%), Positives = 17/26 (65%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRI 28
           K+  EK +K KKKKK++K   K  RI
Sbjct: 192 KKLLEKPEKPKKKKKRRKGFSKDVRI 217



 Score = 25.2 bits (56), Expect = 9.6
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 4   RKKEKKKKKKKKKKKKKKKKKKKRIT 29
           ++ E+  KK  +K +K KKKKK+R  
Sbjct: 185 KQTEELIKKLLEKPEKPKKKKKRRKG 210


>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein.  All proteins in this family
           for which functions are known are used for the
           recognition of DNA damage as part of nucleotide excision
           repair. This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 172

 Score = 26.3 bits (58), Expect = 4.0
 Identities = 15/50 (30%), Positives = 27/50 (54%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTD 52
           K ++E+ K++ K+KK +KK K+ +R  +    +      VH H   P T+
Sbjct: 100 KERREESKEEMKEKKFEKKLKELRRAVRSSEYTNKKEGRVHEHEFGPETN 149


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 26.4 bits (59), Expect = 4.0
 Identities = 11/40 (27%), Positives = 23/40 (57%), Gaps = 10/40 (25%)

Query: 1   MRKRKKEKKKKKKKKK----------KKKKKKKKKKRITK 30
             + KKE+KKK++++           +K ++K++KK+  K
Sbjct: 283 EAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322


>gnl|CDD|223448 COG0371, GldA, Glycerol dehydrogenase and related enzymes [Energy
           production and conversion].
          Length = 360

 Score = 26.5 bits (59), Expect = 4.2
 Identities = 8/20 (40%), Positives = 12/20 (60%), Gaps = 1/20 (5%)

Query: 46  PTSPSTD-YTTPLTALKYPP 64
           PT  STD  T+P+ ++ Y  
Sbjct: 115 PTIASTDAITSPVASVIYNG 134


>gnl|CDD|223966 COG1035, FrhB, Coenzyme F420-reducing hydrogenase, beta subunit
           [Energy production and conversion].
          Length = 332

 Score = 26.6 bits (59), Expect = 4.3
 Identities = 8/25 (32%), Positives = 14/25 (56%)

Query: 4   RKKEKKKKKKKKKKKKKKKKKKKRI 28
           R  EK  KKK K+  K+ +++ +  
Sbjct: 301 RLVEKLAKKKLKRAAKEPEERVEFA 325


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 26.3 bits (59), Expect = 4.4
 Identities = 14/36 (38%), Positives = 26/36 (72%), Gaps = 1/36 (2%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRI-TKVQIPS 35
            RKR  +  +KK+KKKKK+K+K+++ ++  +V+  S
Sbjct: 612 ARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLS 647



 Score = 25.9 bits (58), Expect = 7.5
 Identities = 12/23 (52%), Positives = 15/23 (65%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKK 23
           + K +K KKKKKKK K  K K +
Sbjct: 679 LEKIQKPKKKKKKKPKTVKPKPR 701



 Score = 25.6 bits (57), Expect = 8.6
 Identities = 12/32 (37%), Positives = 17/32 (53%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
           M+ +      +K +K KKKKKKK K    K +
Sbjct: 670 MKMKVPLSDLEKIQKPKKKKKKKPKTVKPKPR 701


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 26.1 bits (58), Expect = 4.5
 Identities = 11/26 (42%), Positives = 20/26 (76%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKK 26
             K KKE++K++K++ K++KKK K +
Sbjct: 105 RSKEKKEEEKERKRQLKQQKKKAKHR 130



 Score = 24.9 bits (55), Expect = 8.9
 Identities = 11/29 (37%), Positives = 23/29 (79%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
            + K+EKKK+ K+KK+++K++K++ +  K
Sbjct: 96  ERNKQEKKKRSKEKKEEEKERKRQLKQQK 124


>gnl|CDD|235798 PRK06416, PRK06416, dihydrolipoamide dehydrogenase; Reviewed.
          Length = 462

 Score = 26.3 bits (59), Expect = 4.5
 Identities = 7/14 (50%), Positives = 10/14 (71%), Gaps = 1/14 (7%)

Query: 34  PSELSPLGVHPHPT 47
           P +L+   +HPHPT
Sbjct: 431 PEDLALT-IHPHPT 443


>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
           Members of the NSCC2 family have been sequenced from
           various yeast, fungal and animals species including
           Saccharomyces cerevisiae, Drosophila melanogaster and
           Homo sapiens. These proteins are the Sec62 proteins,
           believed to be associated with the Sec61 and Sec63
           constituents of the general protein secretary systems of
           yeast microsomes. They are also the non-selective cation
           (NS) channels of the mammalian cytoplasmic membrane. The
           yeast Sec62 protein has been shown to be essential for
           cell growth. The mammalian NS channel proteins has been
           implicated in platelet derived growth factor(PGDF)
           dependent single channel current in fibroblasts. These
           channels are essentially closed in serum deprived
           tissue-culture cells and are specifically opened by
           exposure to PDGF. These channels are reported to exhibit
           equal selectivity for Na+, K+ and Cs+ with low
           permeability to Ca2+, and no permeability to anions
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 232

 Score = 26.4 bits (58), Expect = 4.7
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 6   KEKKKKKKKKKKKKKKKKKKKR 27
           K+K   KKK K KK KKK+ KR
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229



 Score = 25.6 bits (56), Expect = 8.1
 Identities = 12/22 (54%), Positives = 14/22 (63%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKK 24
           K K   KKK K KK KKK+ K+
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229


>gnl|CDD|180007 PRK05309, PRK05309, 30S ribosomal protein S11; Validated.
          Length = 128

 Score = 25.9 bits (58), Expect = 4.7
 Identities = 10/16 (62%), Positives = 10/16 (62%)

Query: 14 KKKKKKKKKKKKKRIT 29
           KKKK  KKK KK I 
Sbjct: 1  AKKKKTVKKKVKKNIP 16


>gnl|CDD|220692 pfam10324, 7TM_GPCR_Srw, Serpentine type 7TM GPCR chemoreceptor
           Srw.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srw is a solo family
           amongst the superfamilies of chemoreceptors.
           Chemoperception is one of the central senses of soil
           nematodes like C. elegans which are otherwise 'blind'
           and 'deaf'. The genes encoding Srw do not appear to be
           under as strong an adaptive evolutionary pressure as
           those of Srz.
          Length = 317

 Score = 26.4 bits (59), Expect = 4.8
 Identities = 9/25 (36%), Positives = 11/25 (44%)

Query: 6   KEKKKKKKKKKKKKKKKKKKKRITK 30
            E +K KK +K       K  R TK
Sbjct: 217 IELRKAKKSRKNLSSSSNKSDRTTK 241


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 26.3 bits (58), Expect = 4.8
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKK 26
           K +K   KK KK KKKKKKK+ 
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKRS 217



 Score = 25.5 bits (56), Expect = 8.6
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 5  KKEKKKKKKKKKKKKKKKKKKKRITKVQI 33
          KK   ++KK++KKK KKKK   ++   Q+
Sbjct: 59 KKLHSQEKKEEKKKPKKKKVPLQVNPAQL 87


>gnl|CDD|221659 pfam12597, DUF3767, Protein of unknown function (DUF3767).  This
           family of proteins is found in eukaryotes. Proteins in
           this family are typically between 112 and 199 amino
           acids in length.
          Length = 119

 Score = 25.7 bits (57), Expect = 5.0
 Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKR 27
            +R+K  +  KK  +   +KK KK R
Sbjct: 90  YRRRKSFQGMKKAVEAMAEKKAKKLR 115


>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
          (ETRAMP).  These sequences represent a family of
          proteins from the malaria parasite Plasmodium
          falciparum, several of which have been shown to be
          expressed specifically in the ring stage as well as the
          rodent parasite Plasmodium yoelii. A homologue from
          Plasmodium chabaudi was localized to the
          parasitophorous vacuole membrane. Members have an
          initial hydrophobic, Phe/Tyr-rich, stretch long enough
          to span the membrane, a highly charged region rich in
          Lys, a second putative transmembrane region and a
          second highly charged, low complexity sequence region.
          Some members have up to 100 residues of additional
          C-terminal sequence. These genes have been shown to be
          found in the sub-telomeric regions of both Plasmodium
          falciparum and P. yoelii chromosomes.
          Length = 82

 Score = 25.3 bits (56), Expect = 5.0
 Identities = 15/26 (57%), Positives = 17/26 (65%)

Query: 3  KRKKEKKKKKKKKKKKKKKKKKKKRI 28
          K+KK KK KK     +KKKK KKK I
Sbjct: 30 KKKKLKKLKKIDDDLEKKKKNKKKII 55


>gnl|CDD|241400 cd13246, PH_Scd1, Shape and Conjugation Deficiency 1 Pleckstrin
          homology (PH) domain.  Fission yeast Scd1 is an
          exchange factor for Cdc42 and an effector of Ras1, the
          homolog of the human H-Ras. Scd2/Bem1 mediates Cdc42
          activation by binding to Scd1/Cdc24 and to Cdc42. Ras1
          regulates Scd1/Cdc24/Ral1, which is a putative guanine
          nucleotide exchange factor for Cdc42, a member of the
          Rho family of Ras-like proteins. Cdc42 then activates
          the Shk1/Orb2 protein kinase. Scd1 interacts with Klp5
          and Klp6 kinesins to mediate cytokinesis. PH domains
          have diverse functions, but in general are involved in
          targeting proteins to the appropriate cellular location
          or in the interaction with a binding partner. They
          share little sequence conservation, but all have a
          common fold, which is electrostatically polarized. Less
          than 10% of PH domains bind phosphoinositide phosphates
          (PIPs) with high affinity and specificity. PH domains
          are distinguished from other PIP-binding domains by
          their specific high-affinity binding to PIPs with two
          vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
          or PtdIns(3,4,5)P3 which results in targeting some PH
          domain proteins to the plasma membrane. A few display
          strong specificity in lipid binding. Any specificity is
          usually determined by loop regions or insertions in the
          N-terminus of the domain, which are not conserved
          across all PH domains. PH domains are found in cellular
          signaling proteins such as serine/threonine kinase,
          tyrosine kinases, regulators of G-proteins, endocytotic
          GTPases, adaptors, as well as cytoskeletal associated
          molecules and in lipid associated enzymes.
          Length = 148

 Score = 25.7 bits (57), Expect = 5.2
 Identities = 13/31 (41%), Positives = 17/31 (54%)

Query: 3  KRKKEKKKKKKKKKKKKKKKKKKKRITKVQI 33
          K  K+KKKK   K K     KKKK+   +Q+
Sbjct: 61 KEVKKKKKKSSLKSKSSSSSKKKKKKGPLQL 91


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 26.3 bits (59), Expect = 5.3
 Identities = 10/34 (29%), Positives = 20/34 (58%), Gaps = 5/34 (14%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKRIT-----KVQI 33
           K++ +    K+ ++K+KK +K  IT     KV++
Sbjct: 520 KEQMEGSAPKEPEEKEKKPEKPEITIDDFDKVEL 553


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 26.0 bits (57), Expect = 5.6
 Identities = 13/29 (44%), Positives = 15/29 (51%)

Query: 11  KKKKKKKKKKKKKKKKRITKVQIPSELSP 39
           K+KKKKK KKKK  K  +       E  P
Sbjct: 90  KQKKKKKDKKKKSPKSLLLNFVTVEESKP 118



 Score = 26.0 bits (57), Expect = 5.8
 Identities = 12/19 (63%), Positives = 14/19 (73%)

Query: 5   KKEKKKKKKKKKKKKKKKK 23
            +E K+KKKKK KKKK  K
Sbjct: 86  NQENKQKKKKKDKKKKSPK 104


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
          entry is characterized by proteins with alternating
          conserved and low-complexity regions. Bud13 together
          with Snu17p and a newly identified factor,
          Pml1p/Ylr016c, form a novel trimeric complex. called
          The RES complex, pre-mRNA retention and splicing
          complex. Subunits of this complex are not essential for
          viability of yeasts but they are required for efficient
          splicing in vitro and in vivo. Furthermore,
          inactivation of this complex causes pre-mRNA leakage
          from the nucleus. Bud13 contains a unique,
          phylogenetically conserved C-terminal region of unknown
          function.
          Length = 141

 Score = 25.7 bits (57), Expect = 5.8
 Identities = 9/34 (26%), Positives = 25/34 (73%), Gaps = 2/34 (5%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKKRITK--VQI 33
             ++++++K+++K++K++K++K+K   K  VQ 
Sbjct: 12 IDIEEKREEKEREKEEKERKEEKEKEWGKGLVQK 45



 Score = 25.7 bits (57), Expect = 6.3
 Identities = 8/25 (32%), Positives = 21/25 (84%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKK 26
          R    E+K+++K+++K++K++K++K
Sbjct: 10 RIIDIEEKREEKEREKEEKERKEEK 34


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 26.1 bits (58), Expect = 6.4
 Identities = 8/23 (34%), Positives = 16/23 (69%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKR 27
                +KKK++KK+++K+ K+ R
Sbjct: 177 HSSDSEKKKQRKKQRRKRSKELR 199



 Score = 25.7 bits (57), Expect = 7.0
 Identities = 7/23 (30%), Positives = 17/23 (73%)

Query: 4   RKKEKKKKKKKKKKKKKKKKKKK 26
              + +KKK++KK+++K+ K+ +
Sbjct: 177 HSSDSEKKKQRKKQRRKRSKELR 199



 Score = 25.4 bits (56), Expect = 9.5
 Identities = 6/21 (28%), Positives = 14/21 (66%)

Query: 7   EKKKKKKKKKKKKKKKKKKKR 27
                 +KKK++KK+++K+ +
Sbjct: 176 GHSSDSEKKKQRKKQRRKRSK 196


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 25.8 bits (57), Expect = 6.5
 Identities = 9/25 (36%), Positives = 11/25 (44%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKK 25
            RK       KK    +K + KKKK
Sbjct: 172 ERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 25.7 bits (57), Expect = 6.7
 Identities = 10/26 (38%), Positives = 14/26 (53%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKK 26
               + EKK  +KK KK  KK K ++
Sbjct: 85  KEIWEAEKKGLEKKAKKAIKKGKDEE 110


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 25.9 bits (57), Expect = 6.9
 Identities = 10/25 (40%), Positives = 18/25 (72%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKK 25
           +RK++KEKK++ K +KK+   +K  
Sbjct: 161 LRKKEKEKKEQLKIQKKQSLLEKYG 185



 Score = 25.5 bits (56), Expect = 8.7
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKK 25
              RKKEK+KK++ K +KK+   +K
Sbjct: 159 ELLRKKEKEKKEQLKIQKKQSLLEK 183


>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1.  TT virus (TTV), isolated
           initially from a Japanese patient with hepatitis of
           unknown aetiology, has since been found to infect both
           healthy and diseased individuals and numerous prevalence
           studies have raised questions about its role in
           unexplained hepatitis. ORF1 is a large 750 residue
           protein. The N-terminal half of this protein corresponds
           to the capsid protein.
          Length = 525

 Score = 25.7 bits (57), Expect = 7.0
 Identities = 8/24 (33%), Positives = 10/24 (41%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKK 24
           + K K      K + K KK  K K
Sbjct: 187 LAKHKILIPSLKTRPKGKKYVKIK 210


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
           is specific to the N-terminal part of the prp1 splicing
           factor, which is involved in mRNA splicing (and possibly
           also poly(A)+ RNA nuclear export and cell cycle
           progression). This domain is specific to the N terminus
           of the RNA splicing factor encoded by prp1. It is
           involved in mRNA splicing and possibly also poly(A)and
           RNA nuclear export and cell cycle progression.
          Length = 131

 Score = 25.3 bits (56), Expect = 7.1
 Identities = 8/36 (22%), Positives = 24/36 (66%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPL 40
           ++  +++KK++++K+K++ +K R    +I  + + L
Sbjct: 72  ERMDERRKKRREQKEKEEIEKYREENPKIQQQFADL 107


>gnl|CDD|221266 pfam11851, DUF3371, Domain of unknown function (DUF3371).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 125 to 142 amino acids in length.
          Length = 131

 Score = 25.4 bits (56), Expect = 7.1
 Identities = 12/75 (16%), Positives = 21/75 (28%), Gaps = 22/75 (29%)

Query: 34  PSELSPLGVHPHPTSPSTDYTTPLTALKYPPSAALSQHQSLLNNSGRTNFTNKQLTELEK 93
           P +L P    P    P+T                      L  N G   F++        
Sbjct: 38  PQDLYPHSSQPDLPPPTT----------------------LDLNDGTITFSDPLSHFTGS 75

Query: 94  EFHFNKYLTRARRIE 108
            F ++  L    +++
Sbjct: 76  PFSYSSPLKEESKLD 90


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 25.9 bits (57), Expect = 7.1
 Identities = 10/30 (33%), Positives = 18/30 (60%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKRITKVQ 32
               E ++ KKK++ + KKKK+ + +  VQ
Sbjct: 82  ASSLEAEQAKKKEEAEAKKKKEMEELKAVQ 111


>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein.  The PcfK-like protein family
           includes the Enterococcus faecalis PcfK protein, which
           is functionally uncharacterized. This family of proteins
           is found in bacteria and viruses. Proteins in this
           family are typically between 137 and 257 amino acids in
           length. There are two completely conserved residues (D
           and L) that may be functionally important.
          Length = 136

 Score = 25.4 bits (56), Expect = 7.6
 Identities = 9/34 (26%), Positives = 17/34 (50%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPS 35
           RK   +  ++++ +K +K+ KK KK        S
Sbjct: 99  RKEALKAYQQEELRKIQKRSKKSKKAEPVQGQLS 132


>gnl|CDD|221550 pfam12366, Casc1, Cancer susceptibility candidate 1.  This domain
           family is found in eukaryotes, and is typically between
           216 and 263 amino acids in length. Casc1 has many SNPs
           associated with cancer susceptibility.
          Length = 227

 Score = 25.4 bits (56), Expect = 7.8
 Identities = 6/36 (16%), Positives = 12/36 (33%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPL 40
           + E  +       K+ K K  K +   +   +   L
Sbjct: 74  EAEGARGYVTDIPKEYKAKPVKYLEIPKPIKDPPEL 109


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 25.7 bits (56), Expect = 7.9
 Identities = 6/35 (17%), Positives = 15/35 (42%)

Query: 2   RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSE 36
           +++ KEK    K+ K +    + +     ++   E
Sbjct: 798 KQKPKEKGPNDKEIKIESPSVETEGERCTIKQREE 832


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
          family of proteins are part of the yeast nuclear pore
          complex-associated pre-60S ribosomal subunit. The
          family functions as a highly conserved exonuclease that
          is required for the 5'-end maturation of 5.8S and 25S
          rRNAs, demonstrating that 5'-end processing also has a
          redundant pathway. Nop25 binds late pre-60S ribosomes,
          accompanying them from the nucleolus to the nuclear
          periphery; and there is evidence for both physical and
          functional links between late 60S subunit processing
          and export.
          Length = 134

 Score = 25.4 bits (56), Expect = 7.9
 Identities = 7/23 (30%), Positives = 20/23 (86%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKK 24
           KRK++++KK +++ K+K+++++
Sbjct: 28 HKRKQQRRKKAQEEAKEKEREER 50


>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
          Length = 763

 Score = 25.6 bits (56), Expect = 8.2
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITK 30
           + K + E + +  K  KK KK KKKK + K
Sbjct: 233 IEKPQAEAENEAGKSDKKDKKSKKKKVLGK 262


>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824).  This
          is a repeating domain found in fungal proteins. It is
          proline-rich, and the function is not known.
          Length = 135

 Score = 25.2 bits (55), Expect = 8.2
 Identities = 7/50 (14%), Positives = 19/50 (38%)

Query: 4  RKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDY 53
           K  +++++KK ++++++ +               P    P P      Y
Sbjct: 21 HKYSQRRERKKAERERERYRHDHDAYSDSYEEPYDPTPYPPSPPVSDPRY 70


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 25.7 bits (57), Expect = 8.3
 Identities = 13/35 (37%), Positives = 16/35 (45%)

Query: 4   RKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELS 38
           +K  KK   K K KKKKKK     + K +   E  
Sbjct: 350 KKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAK 384


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 25.7 bits (57), Expect = 8.3
 Identities = 9/31 (29%), Positives = 17/31 (54%), Gaps = 1/31 (3%)

Query: 1   MRKRKKEKKKKKKKKKKKKKK-KKKKKRITK 30
            R++KK KK  +K+ +  +    KK+K +  
Sbjct: 83  FREKKKYKKDVEKRNRSYRLYLDKKRKELQA 113


>gnl|CDD|215314 PLN02576, PLN02576, protoporphyrinogen oxidase.
          Length = 496

 Score = 25.7 bits (57), Expect = 8.9
 Identities = 7/25 (28%), Positives = 12/25 (48%)

Query: 3   KRKKEKKKKKKKKKKKKKKKKKKKR 27
           K  +E KK  K + +  +  K K +
Sbjct: 205 KAIQEAKKNPKPEPRDPRLPKPKGQ 229


>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464).  This
          family of proteins are functionally uncharacterized.
          This protein is found in bacteria and eukaryotes.
          Proteins in this family are typically between 137 to
          196 amino acids in length.
          Length = 149

 Score = 25.0 bits (55), Expect = 9.0
 Identities = 17/50 (34%), Positives = 21/50 (42%), Gaps = 13/50 (26%)

Query: 2  RKRKK-EKKKKKKKKKKK-----------KKKKKKKKRITKVQ-IPSELS 38
          RKR   E K+ +KK  K            K+KK K KR      IP  +S
Sbjct: 5  RKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASESGIPKVVS 54


>gnl|CDD|236950 PRK11664, PRK11664, ATP-dependent RNA helicase HrpB; Provisional.
          Length = 812

 Score = 25.7 bits (57), Expect = 9.2
 Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 1/33 (3%)

Query: 57  LTALKYPPSAALSQHQSLLNNSGRTNFTNKQLT 89
           L+ L  PP+AAL+  + LL   G  +    +LT
Sbjct: 375 LSWLDQPPAAALAAAKRLLQQLGALD-GQGRLT 406


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
          describes the eukaryotic 60S (cytosolic) ribosomal
          protein L7 and paralogs that may or may not also be L7.
          Human, Drosophila, and Arabidopsis all have both a
          typical L7 and an L7-related paralog. This family is
          designated subfamily rather than equivalog to reflect
          these uncharacterized paralogs. Members of this family
          average ~ 250 residues in length, somewhat longer than
          the archaeal L30P/L7E homolog (~ 155 residues) and much
          longer than the related bacterial/organellar form (~ 60
          residues).
          Length = 235

 Score = 25.4 bits (56), Expect = 9.3
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKKR 27
          RK ++E   +  K+ K KKK  KKKR
Sbjct: 7  RKARQELAVQVAKQAKAKKKANKKKR 32


>gnl|CDD|216599 pfam01603, B56, Protein phosphatase 2A regulatory B subunit (B56
           family).  Protein phosphatase 2A (PP2A) is a major
           intracellular protein phosphatase that regulates
           multiple aspects of cell growth and metabolism. The
           ability of this widely distributed heterotrimeric enzyme
           to act on a diverse array of substrates is largely
           controlled by the nature of its regulatory B subunit.
           There are multiple families of B subunits (See also
           pfam01240), this family is called the B56 family.
          Length = 405

 Score = 25.3 bits (56), Expect = 9.5
 Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 5   KKEKKKKKKKKKKKKKKKKKKKRI 28
            K K+K+ K+KKK+KK+KK   RI
Sbjct: 377 NKYKEKEAKEKKKEKKRKKTWARI 400


>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and
          chromosome partitioning].
          Length = 324

 Score = 25.6 bits (56), Expect = 9.6
 Identities = 10/72 (13%), Positives = 22/72 (30%), Gaps = 5/72 (6%)

Query: 2  RKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPLGVHPHPTSPSTDYTTPLTALK 61
            RK+     + +  K+ K K+      +V     +  + V     +P           +
Sbjct: 26 TSRKERSSYFRDRPLKRMKSKRDDDPYDEVADDEGVGEVRVVRKNEAPQFT-----QEHE 80

Query: 62 YPPSAALSQHQS 73
              +   Q+Q 
Sbjct: 81 AARQSPQHQYQP 92


>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
          Length = 201

 Score = 25.4 bits (56), Expect = 9.7
 Identities = 10/38 (26%), Positives = 22/38 (57%)

Query: 1   MRKRKKEKKKKKKKKKKKKKKKKKKKRITKVQIPSELS 38
           M++ +KE ++ ++    KK KK ++K++  +    EL 
Sbjct: 81  MKEFQKEFREAQESGDMKKLKKLQEKQMEMMDDQRELM 118


>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein.  PACS-1 is a
           cytosolic sorting protein that directs the localisation
           of membrane proteins in the trans-Golgi network
           (TGN)/endosomal system. PACS-1 connects the clathrin
           adaptor AP-1 to acidic cluster sorting motifs contained
           in the cytoplasmic domain of cargo proteins such as
           furin, the cation-independent mannose-6-phosphate
           receptor and in viral proteins such as human
           immunodeficiency virus type 1 Nef.
          Length = 413

 Score = 25.5 bits (56), Expect = 9.9
 Identities = 11/35 (31%), Positives = 17/35 (48%)

Query: 6   KEKKKKKKKKKKKKKKKKKKKRITKVQIPSELSPL 40
            ++K KK       KK K+K+  +K Q    +S L
Sbjct: 331 TKEKNKKVPVMFLGKKPKEKEVESKSQCIEGISRL 365


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.308    0.123    0.331 

Gapped
Lambda     K      H
   0.267   0.0760    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,104,611
Number of extensions: 529982
Number of successful extensions: 10880
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6986
Number of HSP's successfully gapped: 1649
Length of query: 123
Length of database: 10,937,602
Length adjustment: 85
Effective length of query: 38
Effective length of database: 7,167,512
Effective search space: 272365456
Effective search space used: 272365456
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 53 (24.1 bits)