RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15969
         (111 letters)



>gnl|CDD|197696 smart00389, HOX, Homeodomain.  DNA-binding factors that are
          involved in the transcriptional regulation of key
          developmental processes.
          Length = 57

 Score = 46.5 bits (111), Expect = 2e-08
 Identities = 17/56 (30%), Positives = 29/56 (51%)

Query: 44 PKKPRLVFTDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPTTVGNFFMNARRRS 99
           ++ R  FT  Q   L+  F++   PS+E +  +A++LGL    V  +F N R + 
Sbjct: 1  KRRKRTSFTPEQLEELEKEFQKNPYPSREEREELAKKLGLSERQVKVWFQNRRAKW 56


>gnl|CDD|238039 cd00086, homeodomain, Homeodomain;  DNA binding domains involved
          in the transcriptional regulation of key eukaryotic
          developmental processes; may bind to DNA as monomers or
          as homo- and/or heterodimers, in a sequence-specific
          manner.
          Length = 59

 Score = 44.2 bits (105), Expect = 2e-07
 Identities = 17/55 (30%), Positives = 29/55 (52%)

Query: 45 KKPRLVFTDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPTTVGNFFMNARRRS 99
          ++ R  FT  Q   L+  F++   PS+E +  +A++LGL    V  +F N R + 
Sbjct: 1  RRKRTRFTPEQLEELEKEFEKNPYPSREEREELAKELGLTERQVKIWFQNRRAKL 55


>gnl|CDD|200956 pfam00046, Homeobox, Homeobox domain. 
          Length = 57

 Score = 42.9 bits (102), Expect = 5e-07
 Identities = 17/54 (31%), Positives = 29/54 (53%)

Query: 45 KKPRLVFTDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPTTVGNFFMNARRR 98
          ++ R  FT  Q   L+  F++ + PS E +  +A++LGL    V  +F N R +
Sbjct: 1  RRKRTTFTPEQLEELEKEFEKNRYPSAEEREELAKKLGLTERQVKVWFQNRRAK 54


>gnl|CDD|203350 pfam05920, Homeobox_KN, Homeobox KN domain.  This is a homeobox
          transcription factor KN domain conserved from fungi to
          human and plants.
          Length = 40

 Score = 35.2 bits (82), Expect = 3e-04
 Identities = 15/30 (50%), Positives = 21/30 (70%)

Query: 69 PSKEMQVTIARQLGLEPTTVGNFFMNARRR 98
          PS+E +  +ARQ GL    + N+F+NARRR
Sbjct: 11 PSEEEKSELARQTGLSRKQIDNWFINARRR 40


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 38.0 bits (89), Expect = 3e-04
 Identities = 13/36 (36%), Positives = 22/36 (61%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQD 39
           +  KKEK+K+K K KK+ +  K   KR++    S++
Sbjct: 399 RAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 36.8 bits (86), Expect = 8e-04
 Identities = 11/37 (29%), Positives = 24/37 (64%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQ 40
           KR +K++K+K+K K KK+ +  K   ++ +P  + ++
Sbjct: 398 KRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 34.9 bits (81), Expect = 0.004
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKT 27
              ++ EKK+K+K+K K KK+ + T
Sbjct: 395 VLAKRAEKKEKEKEKPKVKKRHRDT 419



 Score = 33.8 bits (78), Expect = 0.010
 Identities = 11/29 (37%), Positives = 18/29 (62%), Gaps = 2/29 (6%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
           EK+ K+++K K KK+ +  K   K  +RK
Sbjct: 401 EKKEKEKEKPKVKKRHRDTKNIGK--RRK 427



 Score = 33.0 bits (76), Expect = 0.016
 Identities = 10/29 (34%), Positives = 18/29 (62%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
           K  KK   K+ +KK+K+K+K K   + ++
Sbjct: 390 KPSKKVLAKRAEKKEKEKEKPKVKKRHRD 418



 Score = 32.6 bits (75), Expect = 0.023
 Identities = 10/28 (35%), Positives = 16/28 (57%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
           K   K  +KK+K+K+K K KK+    + 
Sbjct: 394 KVLAKRAEKKEKEKEKPKVKKRHRDTKN 421



 Score = 32.2 bits (74), Expect = 0.032
 Identities = 11/31 (35%), Positives = 17/31 (54%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
           EK ++K K KK+ +  K   K++K     EE
Sbjct: 404 EKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 31.8 bits (73), Expect = 0.040
 Identities = 10/30 (33%), Positives = 18/30 (60%)

Query: 2   VEKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
             K+   ++ +KK+K+K+K K KK+    K
Sbjct: 391 PSKKVLAKRAEKKEKEKEKPKVKKRHRDTK 420



 Score = 31.8 bits (73), Expect = 0.044
 Identities = 12/30 (40%), Positives = 18/30 (60%)

Query: 2   VEKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
            EK+  K  KK   K+ +KK+K+K+  K K
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKVK 413



 Score = 31.1 bits (71), Expect = 0.078
 Identities = 12/32 (37%), Positives = 18/32 (56%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPD 35
           K  K  KK   K+ +KK+K+K+K   +K   D
Sbjct: 387 KTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRD 418



 Score = 29.1 bits (66), Expect = 0.36
 Identities = 7/31 (22%), Positives = 17/31 (54%)

Query: 2   VEKRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
            +++ K + KK+ +  K   K++K +   +E
Sbjct: 404 EKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 28.8 bits (65), Expect = 0.57
 Identities = 11/40 (27%), Positives = 20/40 (50%)

Query: 7   KKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKK 46
           KK  K  KK   K+ +KK+K  ++ +     +D  +  K+
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425



 Score = 28.8 bits (65), Expect = 0.61
 Identities = 11/42 (26%), Positives = 18/42 (42%)

Query: 7   KKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPR 48
           +K+  K  KK   K+ +KK+  K K +       +    K R
Sbjct: 385 EKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRR 426



 Score = 27.6 bits (62), Expect = 1.5
 Identities = 10/29 (34%), Positives = 14/29 (48%)

Query: 5   RRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
           + K   +KK  K  KK   K+   K KE+
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEK 407



 Score = 26.1 bits (58), Expect = 5.3
 Identities = 8/28 (28%), Positives = 15/28 (53%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
           K   ++K  K  KK   K+ +KK  +++
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKE 408



 Score = 25.7 bits (57), Expect = 6.7
 Identities = 12/39 (30%), Positives = 19/39 (48%)

Query: 10  KKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPR 48
           K K   +KK  K  KK   KR E+ +  +++    K+ R
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHR 417


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 38.0 bits (89), Expect = 4e-04
 Identities = 16/28 (57%), Positives = 24/28 (85%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
           K++++EKK +K+KKKKK+KKK K  K+K
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKK 411



 Score = 34.9 bits (81), Expect = 0.003
 Identities = 15/30 (50%), Positives = 24/30 (80%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
           K+R+++K +K+KKKKK+KKK KK  K+  +
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 34.2 bits (79), Expect = 0.008
 Identities = 14/29 (48%), Positives = 24/29 (82%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
            ++K+E+KK +K+KKKKK+KKK   ++K+
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRKKK 411



 Score = 33.8 bits (78), Expect = 0.010
 Identities = 14/28 (50%), Positives = 22/28 (78%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKR 30
            + +K +K+KKKKK+KKK KK+KK  ++
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 33.0 bits (76), Expect = 0.016
 Identities = 16/29 (55%), Positives = 22/29 (75%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
           ++  KK +K+KKKKK+KKK KK+K   RK
Sbjct: 386 KREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 31.9 bits (73), Expect = 0.040
 Identities = 16/28 (57%), Positives = 24/28 (85%)

Query: 6   RKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
           +KK ++KK +K+KKKKK+KKK  KRK++
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKK 411



 Score = 29.2 bits (66), Expect = 0.40
 Identities = 15/29 (51%), Positives = 21/29 (72%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
           EK  K  KKK+++KK +K+KKKKK  K+ 
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKKKKRKKKG 405



 Score = 28.8 bits (65), Expect = 0.46
 Identities = 12/27 (44%), Positives = 21/27 (77%)

Query: 7   KKEKKKKKKKKKKKKKKKKKTCKRKEE 33
            K+K+++KK +K+KKKKK+K   +K +
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRK 409



 Score = 28.8 bits (65), Expect = 0.58
 Identities = 11/31 (35%), Positives = 21/31 (67%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
           K +  +  KKK+++KK +K+KKK  ++K+  
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGK 406



 Score = 27.6 bits (62), Expect = 1.3
 Identities = 14/31 (45%), Positives = 22/31 (70%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
           E + K  K  KKK+++KK +K+KK  KRK++
Sbjct: 374 EIKEKYPKPPKKKREEKKPQKRKKKKKRKKK 404



 Score = 27.6 bits (62), Expect = 1.5
 Identities = 12/25 (48%), Positives = 19/25 (76%)

Query: 10  KKKKKKKKKKKKKKKKKTCKRKEEP 34
           KK+++KK +K+KKKKK+  K K+  
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRK 409



 Score = 26.9 bits (60), Expect = 2.6
 Identities = 16/45 (35%), Positives = 25/45 (55%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPR 48
            +R +E K+K  K  KKK+++KK  KRK++    +      KK R
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 25.7 bits (57), Expect = 5.5
 Identities = 11/30 (36%), Positives = 22/30 (73%)

Query: 2   VEKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
           +E+ ++K  K  KKK+++KK +K+K  K++
Sbjct: 372 IEEIKEKYPKPPKKKREEKKPQKRKKKKKR 401


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
          This domain is found to the N-terminus of bacterial
          signal peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 36.0 bits (84), Expect = 0.001
 Identities = 12/28 (42%), Positives = 17/28 (60%)

Query: 4  KRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
          K  +K +KK +K K K +KKK K  + K
Sbjct: 69 KAWEKAEKKAEKAKAKAEKKKAKKEEPK 96



 Score = 35.2 bits (82), Expect = 0.002
 Identities = 12/28 (42%), Positives = 15/28 (53%)

Query: 4  KRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
             K EKK +K K K +KKK KK   + 
Sbjct: 70 AWEKAEKKAEKAKAKAEKKKAKKEEPKP 97



 Score = 33.3 bits (77), Expect = 0.008
 Identities = 17/49 (34%), Positives = 24/49 (48%), Gaps = 12/49 (24%)

Query: 1  MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRL 49
          + +K  K  +K +KK +K K K +KK  K K+E            KPRL
Sbjct: 63 LDKKELKAWEKAEKKAEKAKAKAEKK--KAKKE----------EPKPRL 99


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 36.5 bits (85), Expect = 0.001
 Identities = 15/50 (30%), Positives = 25/50 (50%), Gaps = 2/50 (4%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTD 53
           +  KK  +KK KK  KK K ++   +   E +  + +   P +PRL+  D
Sbjct: 89  EAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEEPE--PPLRPRLIVND 136


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
          Provisional.
          Length = 330

 Score = 35.6 bits (83), Expect = 0.002
 Identities = 13/28 (46%), Positives = 18/28 (64%)

Query: 7  KKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
          KKE K   K +KKK+K++ K  K K +P
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKP 90



 Score = 34.4 bits (80), Expect = 0.005
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 4  KRRKKEKKKKKKKKKKKKKKKKKT 27
             K +KKK+K++ K  K K K  
Sbjct: 68 AWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 34.0 bits (79), Expect = 0.007
 Identities = 10/25 (40%), Positives = 14/25 (56%)

Query: 4  KRRKKEKKKKKKKKKKKKKKKKKTC 28
          K   K +KKK+K++ K  K K K  
Sbjct: 67 KAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 33.3 bits (77), Expect = 0.014
 Identities = 10/31 (32%), Positives = 17/31 (54%)

Query: 1  MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
          + +K  K   K +KKK+K++ K  K   K +
Sbjct: 61 LDKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 25.2 bits (56), Expect = 7.7
 Identities = 12/39 (30%), Positives = 17/39 (43%), Gaps = 9/39 (23%)

Query: 11 KKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRL 49
          KK+ K   K +KKK+K   +  +            KPRL
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAK---------SKPRL 92


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 33.4 bits (77), Expect = 0.015
 Identities = 10/45 (22%), Positives = 29/45 (64%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
           E++++K+KK    KKK+++++++K  K +E+ +  ++     ++ 
Sbjct: 425 EEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469



 Score = 32.2 bits (74), Expect = 0.033
 Identities = 15/46 (32%), Positives = 29/46 (63%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPR 48
           EK+R++EKK+KKKK    KKK+++  + KE+ +  +++     +  
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465



 Score = 31.4 bits (72), Expect = 0.056
 Identities = 12/45 (26%), Positives = 27/45 (60%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
           +  +K+E++KK+KKKK    KKK+  + +E+    +++    ++ 
Sbjct: 418 KAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462



 Score = 31.4 bits (72), Expect = 0.061
 Identities = 14/46 (30%), Positives = 30/46 (65%)

Query: 2   VEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
           ++K  +K +KK++++KK+KKKK     K++EE +  +++    K+ 
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457



 Score = 28.7 bits (65), Expect = 0.50
 Identities = 16/57 (28%), Positives = 32/57 (56%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRRTL 59
            +  KKEKKKK    KKK++++++  ++KEE    +++ +  +K        ++ TL
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQATL 479



 Score = 26.8 bits (60), Expect = 2.2
 Identities = 9/44 (20%), Positives = 24/44 (54%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
             +K  KK KK  +K +KK++++  ++K++    + +    ++ 
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEE 447



 Score = 26.4 bits (59), Expect = 3.6
 Identities = 11/33 (33%), Positives = 19/33 (57%)

Query: 1   MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
            +E     +K  KK KK  +K +KK+  ++KE+
Sbjct: 398 EIEFLTGSKKATKKIKKIVEKAEKKREEEKKEK 430



 Score = 25.7 bits (57), Expect = 5.6
 Identities = 9/29 (31%), Positives = 18/29 (62%)

Query: 5   RRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
              K+  KK KK  +K +KK++  K++++
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKK 431


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 32.2 bits (73), Expect = 0.041
 Identities = 16/43 (37%), Positives = 28/43 (65%)

Query: 4  KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKK 46
          +R+KK+++K K+K+ KK K  +K  K K +   + D ++ PKK
Sbjct: 19 ERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKK 61


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 31.8 bits (73), Expect = 0.052
 Identities = 11/47 (23%), Positives = 20/47 (42%)

Query: 1   MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
                   E+KK +KK++K +KK +K    K       + ++   K 
Sbjct: 402 ENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKG 448



 Score = 31.4 bits (72), Expect = 0.072
 Identities = 11/33 (33%), Positives = 16/33 (48%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPD 35
           EK+ +KE+ +K   KKK +   KK      E  
Sbjct: 422 EKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454



 Score = 29.5 bits (67), Expect = 0.35
 Identities = 12/44 (27%), Positives = 22/44 (50%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
           K RKK++K +KK +K++ +K     K +     ++      KK 
Sbjct: 413 KLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456



 Score = 28.4 bits (64), Expect = 0.75
 Identities = 12/52 (23%), Positives = 27/52 (51%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDL 54
            +R+K  KK++K +KK +K++ +K   +K+    ++       + + V  D 
Sbjct: 409 AERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDP 460


>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
           SecD/SecF; Reviewed.
          Length = 755

 Score = 31.4 bits (72), Expect = 0.064
 Identities = 13/23 (56%), Positives = 18/23 (78%)

Query: 1   MVEKRRKKEKKKKKKKKKKKKKK 23
            +EKRR K+KKK+KK KK + +K
Sbjct: 731 DLEKRRLKKKKKRKKVKKWEVEK 753



 Score = 29.0 bits (66), Expect = 0.43
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 9   EKKKKKKKKKKKKKKKKKT 27
           EK++ KKKKK+KK KK + 
Sbjct: 733 EKRRLKKKKKRKKVKKWEV 751



 Score = 28.7 bits (65), Expect = 0.66
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 7   KKEKKKKKKKKKKKKKKKKK 26
           K+  KKKKK+KK KK + +K
Sbjct: 734 KRRLKKKKKRKKVKKWEVEK 753



 Score = 28.3 bits (64), Expect = 0.72
 Identities = 10/21 (47%), Positives = 17/21 (80%)

Query: 4   KRRKKEKKKKKKKKKKKKKKK 24
           ++R+ +KKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 27.9 bits (63), Expect = 0.97
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 11  KKKKKKKKKKKKKKKKTCKRKEE 33
           +K++ KKKKK+KK KK    K E
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755



 Score = 27.9 bits (63), Expect = 1.3
 Identities = 9/20 (45%), Positives = 15/20 (75%)

Query: 8   KEKKKKKKKKKKKKKKKKKT 27
           ++++ KKKKK+KK KK +  
Sbjct: 733 EKRRLKKKKKRKKVKKWEVE 752



 Score = 27.1 bits (61), Expect = 2.2
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 7   KKEKKKKKKKKKKKKKKKKKT 27
           +K + KKKKK+KK KK + + 
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753


>gnl|CDD|227863 COG5576, COG5576, Homeodomain-containing transcription factor
           [Transcription].
          Length = 156

 Score = 30.9 bits (70), Expect = 0.078
 Identities = 20/63 (31%), Positives = 30/63 (47%)

Query: 37  SQDQSSTPKKPRLVFTDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPTTVGNFFMNAR 96
            QD SS PK  R   TD Q   L+  F+    PS   ++ ++  L + P +V  +F N R
Sbjct: 44  KQDGSSPPKSKRRRTTDEQLMVLEREFEINPYPSSITRIKLSLLLNMPPKSVQIWFQNKR 103

Query: 97  RRS 99
            + 
Sbjct: 104 AKE 106


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 31.2 bits (71), Expect = 0.086
 Identities = 13/31 (41%), Positives = 24/31 (77%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
           E +R+KE+K+  K++K+ +K KK+  K+K+E
Sbjct: 552 ELQREKEEKEALKEQKRLRKLKKQEEKKKKE 582



 Score = 30.0 bits (68), Expect = 0.22
 Identities = 10/32 (31%), Positives = 21/32 (65%)

Query: 1   MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
             EK  K+  K++K+ +K KK+++KK  + ++
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585



 Score = 28.8 bits (65), Expect = 0.47
 Identities = 10/31 (32%), Positives = 23/31 (74%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
           E++   +++K+ +K KK+++KKKK  ++ E+
Sbjct: 558 EEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 28.1 bits (63), Expect = 1.0
 Identities = 12/30 (40%), Positives = 22/30 (73%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
           +R K+EK+  K++K+ +K KK++  K+KE 
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKEL 583



 Score = 26.1 bits (58), Expect = 4.4
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
           +KR +K KK+++KKKK+ +K +K      E
Sbjct: 566 QKRLRKLKKQEEKKKKELEKLEKAKIPPAE 595



 Score = 26.1 bits (58), Expect = 5.2
 Identities = 12/32 (37%), Positives = 20/32 (62%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
             + +K  +K KK+++KKKK+ +K  K K  P
Sbjct: 562 ALKEQKRLRKLKKQEEKKKKELEKLEKAKIPP 593



 Score = 25.8 bits (57), Expect = 5.6
 Identities = 10/30 (33%), Positives = 21/30 (70%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
           ++ +KE  K++K+ +K KK+++K  K  E+
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKELEK 585


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 30.4 bits (68), Expect = 0.14
 Identities = 13/42 (30%), Positives = 26/42 (61%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTP 44
           E+RR +++ +K K++KKK++K+K+  +R        D+   P
Sbjct: 86  EERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAP 127



 Score = 28.5 bits (63), Expect = 0.79
 Identities = 17/72 (23%), Positives = 29/72 (40%), Gaps = 1/72 (1%)

Query: 1   MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRRTLQ 60
            VEK+ KK KKK+KK+K+K++ K KK      +  +      +P     V    +     
Sbjct: 194 AVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK-SLLLALDDSPASAASVAEADEASLAN 252

Query: 61  AIFKETKRPSKE 72
            +         +
Sbjct: 253 TVSGTAPDSEPD 264



 Score = 26.2 bits (57), Expect = 4.1
 Identities = 14/46 (30%), Positives = 25/46 (54%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRL 49
           K +KK+++K+K++KKKKKK     C   +       Q+   ++  L
Sbjct: 279 KHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQNGAVEEEPL 324


>gnl|CDD|118696 pfam10168, Nup88, Nuclear pore component.  Nup88 can be divided
           into two structural domains; the N-terminal two-thirds
           of the protein has no obvious structural motifs but is
           the region for binding to Nup98, one of the components
           of the nuclear pore. the C-terminal end is a predicted
           coiled-coil domain. Nup88 is overexpressed in tumour
           cells.
          Length = 717

 Score = 30.6 bits (69), Expect = 0.14
 Identities = 16/66 (24%), Positives = 24/66 (36%), Gaps = 11/66 (16%)

Query: 14  KKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRRTLQAIFKETKRPSKEM 73
           K+ KKKK  ++     ++            PKK      + Q +T+  I KE       M
Sbjct: 656 KQVKKKKNYQRYHMASQES-----------PKKSSYTLPEKQHKTITEILKELGEHIDRM 704

Query: 74  QVTIAR 79
              I R
Sbjct: 705 IKQIKR 710


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 30.5 bits (69), Expect = 0.15
 Identities = 19/48 (39%), Positives = 28/48 (58%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLV 50
           E ++K EKKKKKKK+KK+ K + +T    + P  S+     P KP+  
Sbjct: 65  ESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPN 112



 Score = 27.0 bits (60), Expect = 1.9
 Identities = 15/39 (38%), Positives = 20/39 (51%)

Query: 8   KEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKK 46
            E KKK +KKKKKKK+KK+     E     +    + K 
Sbjct: 64  NESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKT 102



 Score = 25.1 bits (55), Expect = 9.8
 Identities = 11/32 (34%), Positives = 13/32 (40%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
            K   + K   K  KK KK KKK    +  E 
Sbjct: 83  PKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 30.3 bits (68), Expect = 0.17
 Identities = 17/88 (19%), Positives = 35/88 (39%), Gaps = 10/88 (11%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKE---EPDISQDQSSTPKKPRLVFTDL----- 54
              R +EK  +K+ +K + + K+K   +KE     ++         +  L   D      
Sbjct: 13  NNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKST 72

Query: 55  --QRRTLQAIFKETKRPSKEMQVTIARQ 80
             + + L  + K  +   KE+Q  I ++
Sbjct: 73  KEESKQLLEVLKTKEEHQKEIQYEILQK 100


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 30.3 bits (68), Expect = 0.18
 Identities = 13/54 (24%), Positives = 22/54 (40%), Gaps = 7/54 (12%)

Query: 4   KRRKKEK---KKKKKKKKKKKKKKKKTCKRKEEPDISQDQS----STPKKPRLV 50
           K++ KEK    K+ K +    + + + C  K+  +   D         KKP  V
Sbjct: 798 KQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKV 851



 Score = 27.6 bits (61), Expect = 1.3
 Identities = 9/39 (23%), Positives = 15/39 (38%)

Query: 10  KKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPR 48
           K   + K+K K+K       + E P +  +      K R
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQR 830


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 29.8 bits (67), Expect = 0.22
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 2   VEKRRKKEKKKKKKKKKKKKKKKKK 26
           VEK++K + KK K K KK + KKK 
Sbjct: 89  VEKKKKGKSKKNKLKGKKDEDKKKA 113



 Score = 27.5 bits (61), Expect = 1.3
 Identities = 14/23 (60%), Positives = 15/23 (65%)

Query: 9   EKKKKKKKKKKKKKKKKKTCKRK 31
           EKKKK K KK K K KK   K+K
Sbjct: 90  EKKKKGKSKKNKLKGKKDEDKKK 112



 Score = 27.5 bits (61), Expect = 1.4
 Identities = 12/26 (46%), Positives = 15/26 (57%)

Query: 7   KKEKKKKKKKKKKKKKKKKKTCKRKE 32
           KK+K K KK K K KK + K   R+ 
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAREG 116



 Score = 27.1 bits (60), Expect = 1.6
 Identities = 12/26 (46%), Positives = 15/26 (57%)

Query: 8   KEKKKKKKKKKKKKKKKKKTCKRKEE 33
           K+KK K KK K K KK +   K +E 
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAREG 116



 Score = 26.3 bits (58), Expect = 3.8
 Identities = 12/25 (48%), Positives = 15/25 (60%)

Query: 11  KKKKKKKKKKKKKKKKTCKRKEEPD 35
           KKKK K KK K K KK   +K+  +
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKARE 115


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 29.8 bits (67), Expect = 0.23
 Identities = 13/42 (30%), Positives = 19/42 (45%), Gaps = 3/42 (7%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPK 45
            + +K KKK    K   KKKKK    + E     +  ++ PK
Sbjct: 120 PKTEKLKKKITVNKSTNKKKKKVLSSKDEL---IKYDNNKPK 158


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 29.3 bits (66), Expect = 0.24
 Identities = 15/44 (34%), Positives = 24/44 (54%)

Query: 2   VEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPK 45
            E+    E K + K+KKK++  K KT K K + +  + + S PK
Sbjct: 109 TEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152



 Score = 27.0 bits (60), Expect = 1.7
 Identities = 8/41 (19%), Positives = 17/41 (41%)

Query: 7   KKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
           + E++   + K + K+KKK+   + +             KP
Sbjct: 108 ETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148



 Score = 25.9 bits (57), Expect = 4.9
 Identities = 11/43 (25%), Positives = 25/43 (58%)

Query: 7   KKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRL 49
           +++  + K + K+KKK++    K ++E   ++ +   P KP+L
Sbjct: 111 QEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPKL 153


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 30.0 bits (68), Expect = 0.24
 Identities = 17/54 (31%), Positives = 28/54 (51%)

Query: 3    EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQR 56
            +++R K K K K  K +K K KKK  K+K+       ++S     + V +D +R
Sbjct: 1154 KEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKR 1207



 Score = 27.3 bits (61), Expect = 1.9
 Identities = 14/51 (27%), Positives = 19/51 (37%)

Query: 3    EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTD 53
            + R+ K KKK+KKKKK    K KK         +  D+             
Sbjct: 1168 KLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKS 1218



 Score = 25.8 bits (57), Expect = 6.7
 Identities = 13/47 (27%), Positives = 19/47 (40%)

Query: 3    EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRL 49
             K+R +      KKKKK +KK  +  K K     +    S+    R 
Sbjct: 1310 VKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRP 1356


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 29.7 bits (67), Expect = 0.25
 Identities = 10/27 (37%), Positives = 14/27 (51%)

Query: 5   RRKKEKKKKKKKKKKKKKKKKKTCKRK 31
           R +K+ K K  KK  +K KK     +K
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQK 796



 Score = 27.8 bits (62), Expect = 1.1
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query: 6   RKKEKKKKKKKKKKKKKKKKKTCKRKE 32
           R ++K K K  KK  +K KK + K ++
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQK 796



 Score = 27.8 bits (62), Expect = 1.2
 Identities = 10/26 (38%), Positives = 13/26 (50%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCK 29
             +K + K  KK  +K KK   KT K
Sbjct: 771 GEKKAKPKAAKKDARKAKKPSAKTQK 796



 Score = 25.5 bits (56), Expect = 7.2
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKK 26
           EK+ K +  KK  +K KK   K +
Sbjct: 772 EKKAKPKAAKKDARKAKKPSAKTQ 795


>gnl|CDD|218140 pfam04547, Anoctamin, Calcium-activated chloride channel.  The
           family carries eight putative transmembrane domains,
           and, although it has no similarity to other known
           channel proteins, it is clearly a calcium-activated
           ionic channel. It is expressed in various secretory
           epithelia, the retina and sensory neurons, and mediates
           receptor-activated chloride currents in diverse
           physiological processes.
          Length = 449

 Score = 29.5 bits (67), Expect = 0.33
 Identities = 15/36 (41%), Positives = 23/36 (63%)

Query: 10  KKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPK 45
           KKK +KKK KKKK+K+K  +  E  +  +++S  P 
Sbjct: 270 KKKIRKKKIKKKKEKEKKPEENEILERIENESELPP 305



 Score = 26.0 bits (58), Expect = 5.3
 Identities = 12/30 (40%), Positives = 22/30 (73%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
           K++ ++KK KKKK+K+KK ++ +  +R E 
Sbjct: 270 KKKIRKKKIKKKKEKEKKPEENEILERIEN 299


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 29.5 bits (66), Expect = 0.34
 Identities = 13/25 (52%), Positives = 19/25 (76%)

Query: 6   RKKEKKKKKKKKKKKKKKKKKTCKR 30
             ++KKKKKK K+KKKKK ++  K+
Sbjct: 682 GSRKKKKKKKSKEKKKKKNREASKQ 706



 Score = 27.5 bits (61), Expect = 1.4
 Identities = 13/22 (59%), Positives = 18/22 (81%)

Query: 5   RRKKEKKKKKKKKKKKKKKKKK 26
           R+KK+KKK K+KKKKK ++  K
Sbjct: 684 RKKKKKKKSKEKKKKKNREASK 705



 Score = 27.1 bits (60), Expect = 2.4
 Identities = 12/21 (57%), Positives = 17/21 (80%)

Query: 5   RRKKEKKKKKKKKKKKKKKKK 25
             +K+KKKKK K+KKKKK ++
Sbjct: 682 GSRKKKKKKKSKEKKKKKNRE 702



 Score = 26.0 bits (57), Expect = 5.5
 Identities = 11/23 (47%), Positives = 19/23 (82%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKK 26
           +++KK+KK K+KKKKK ++  K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706



 Score = 25.2 bits (55), Expect = 9.6
 Identities = 14/49 (28%), Positives = 27/49 (55%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVF 51
           +K++KK+ K+KKKKK ++  K+    +  EE     +   + + P+L  
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKL 733


>gnl|CDD|227396 COG5064, SRP1, Karyopherin (importin) alpha [Intracellular
           trafficking and secretion].
          Length = 526

 Score = 29.5 bits (66), Expect = 0.34
 Identities = 20/76 (26%), Positives = 40/76 (52%), Gaps = 7/76 (9%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDIS-------QDQSSTPKKPRLVFTDLQR 56
           +RR++++ + +K+K+++   K++      E   S       Q  S  P+  + +F+D   
Sbjct: 27  RRREEQQVELRKQKREELLNKRRNLADVSEEAESSFIPMEQQFYSELPQLTQQLFSDDIE 86

Query: 57  RTLQAIFKETKRPSKE 72
           + LQA++K  K  SKE
Sbjct: 87  QQLQAVYKFRKLLSKE 102


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 29.4 bits (67), Expect = 0.35
 Identities = 8/42 (19%), Positives = 11/42 (26%)

Query: 6   RKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
            K EK +       K    KK   +       Q  +S     
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGK 857



 Score = 28.2 bits (64), Expect = 0.81
 Identities = 5/29 (17%), Positives = 12/29 (41%)

Query: 6   RKKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
           +    KK   K  +K K++    ++  + 
Sbjct: 830 KDPAGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 27.1 bits (61), Expect = 2.0
 Identities = 8/29 (27%), Positives = 8/29 (27%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
           EK  K         K    KK      RK
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRK 844



 Score = 27.1 bits (61), Expect = 2.4
 Identities = 8/32 (25%), Positives = 9/32 (28%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPD 35
                 K    KK   K  +K K      E D
Sbjct: 824 LADAPAKDPAGKKAAVKFSRKTKQQYVASEKD 855



 Score = 26.7 bits (60), Expect = 2.8
 Identities = 7/24 (29%), Positives = 11/24 (45%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKT 27
           K   K  +K K++    +K  K T
Sbjct: 836 KAAVKFSRKTKQQYVASEKDGKAT 859



 Score = 26.7 bits (60), Expect = 3.3
 Identities = 6/24 (25%), Positives = 8/24 (33%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKT 27
           +       K    KK   K  +KT
Sbjct: 822 RYLADAPAKDPAGKKAAVKFSRKT 845



 Score = 26.3 bits (59), Expect = 4.7
 Identities = 4/29 (13%), Positives = 9/29 (31%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
            ++ +K +       K    KK      +
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSR 843



 Score = 25.9 bits (58), Expect = 5.0
 Identities = 6/29 (20%), Positives = 11/29 (37%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
           ++ +       K    KK   K + K K+
Sbjct: 819 EKLRYLADAPAKDPAGKKAAVKFSRKTKQ 847


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 29.1 bits (65), Expect = 0.36
 Identities = 13/41 (31%), Positives = 17/41 (41%)

Query: 7   KKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
            KE KK KK +  K  K  K  K K++  +S      P   
Sbjct: 144 AKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGID 184



 Score = 28.3 bits (63), Expect = 0.79
 Identities = 14/46 (30%), Positives = 18/46 (39%), Gaps = 2/46 (4%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCK--RKEEPDISQDQSSTPKKP 47
           K+ KK +  K  K  K KK KKK     R  +      +S    K 
Sbjct: 148 KKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWKS 193


>gnl|CDD|132580 TIGR03541, reg_near_HchA, LuxR family transcriptional regulatory,
           chaperone HchA-associated.  Members of this protein
           family belong to the LuxR transcriptional regulator
           family, and contain both autoinducer binding (pfam03472)
           and transcriptional regulator (pfam00196) domains.
           Members, however, occur only in a few members of the
           Gammaproteobacteria that have the
           chaperone/aminopeptidase HchA, and are always encoded by
           the adjacent gene.
          Length = 232

 Score = 29.0 bits (65), Expect = 0.36
 Identities = 19/72 (26%), Positives = 29/72 (40%), Gaps = 18/72 (25%)

Query: 29  KRKEEPDISQDQS--STPKKPRLVFTDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPT 86
           +R  E  + ++    S  ++  L +T L RR                Q  IA  LG+   
Sbjct: 158 RRLLEAPLLREAGVLSEREREVLAWTALGRR----------------QADIAAILGISER 201

Query: 87  TVGNFFMNARRR 98
           TV N   +ARR+
Sbjct: 202 TVENHLRSARRK 213


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 29.2 bits (66), Expect = 0.37
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 1   MVEKRRKKEKKKKKKKKKKKKKKKKK 26
             +KR+ KE+ KKKK K+ K  K  K
Sbjct: 248 AEKKRKSKEEIKKKKPKESKGVKALK 273



 Score = 28.5 bits (64), Expect = 0.64
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 2   VEKRRKKEKKKKKKKKKKKKKKKKKT 27
            EK+RK +++ KKKK K+ K  K   
Sbjct: 248 AEKKRKSKEEIKKKKPKESKGVKALK 273


>gnl|CDD|223522 COG0445, GidA, Flavin-dependent tRNA uridine
           5-carboxymethylaminomethyl modification enzyme GidA  
           [Cell cycle control, cell division, chromosome
           partitioning].
          Length = 621

 Score = 29.1 bits (66), Expect = 0.38
 Identities = 13/55 (23%), Positives = 23/55 (41%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRR 57
           ++R ++  KKK+  +++ ++ K       E           P K R    DL RR
Sbjct: 457 DERYERFLKKKENIEEEIERLKSTWVTPSEVAKELLALGGQPLKRRSSLYDLLRR 511


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 29.2 bits (66), Expect = 0.39
 Identities = 17/41 (41%), Positives = 21/41 (51%)

Query: 6   RKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKK 46
            KK  KK   K K KKKKKK       + + SQ+  S+ KK
Sbjct: 349 TKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKK 389


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 28.9 bits (65), Expect = 0.42
 Identities = 9/35 (25%), Positives = 19/35 (54%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDIS 37
             + K E+ KK++  + ++KKK+K  K +     +
Sbjct: 361 PTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395



 Score = 25.1 bits (55), Expect = 9.2
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 2   VEKRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
             +R KKE+  + ++KKK+KK K +    + 
Sbjct: 365 KPERDKKERPGRYRRKKKEKKAKSERRGLQN 395


>gnl|CDD|217667 pfam03666, NPR3, Nitrogen Permease regulator of amino acid
           transport activity 3.  This family, also known in yeasts
           as Rmd11, complexes with NPR2, pfam06218. This complex
           heterodimer is responsible for inactivating TORC1. an
           evolutionarily conserved protein complex that controls
           cell size via nutritional input signals, specifically,
           in response to amino acid starvation.
          Length = 424

 Score = 28.9 bits (65), Expect = 0.45
 Identities = 16/62 (25%), Positives = 23/62 (37%)

Query: 12  KKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRRTLQAIFKETKRPSK 71
           +KKKKKK+KK  +      +E    S+D SS         +            E  + S 
Sbjct: 45  RKKKKKKQKKSDRADPNDDREPSVDSEDSSSKSSSKSESGSLANSDPASDPSSEDSKSSM 104

Query: 72  EM 73
            M
Sbjct: 105 TM 106


>gnl|CDD|235501 PRK05559, PRK05559, DNA topoisomerase IV subunit B; Reviewed.
          Length = 631

 Score = 28.5 bits (65), Expect = 0.63
 Identities = 7/19 (36%), Positives = 10/19 (52%)

Query: 3   EKRRKKEKKKKKKKKKKKK 21
            + R +  KK K+KKK   
Sbjct: 380 AQARLRAAKKVKRKKKTSG 398



 Score = 25.1 bits (56), Expect = 9.2
 Identities = 7/27 (25%), Positives = 12/27 (44%)

Query: 5   RRKKEKKKKKKKKKKKKKKKKKTCKRK 31
           +  + + +  KK K+KKK        K
Sbjct: 378 KAAQARLRAAKKVKRKKKTSGPALPGK 404



 Score = 25.1 bits (56), Expect = 9.8
 Identities = 7/24 (29%), Positives = 10/24 (41%)

Query: 1   MVEKRRKKEKKKKKKKKKKKKKKK 24
              K  +   +  KK K+KKK   
Sbjct: 375 KAIKAAQARLRAAKKVKRKKKTSG 398


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 28.4 bits (64), Expect = 0.64
 Identities = 9/53 (16%), Positives = 24/53 (45%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQ 55
           E++ KK +   +K+K  ++++K K    +++      +      P    ++L 
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELT 168



 Score = 28.0 bits (63), Expect = 0.84
 Identities = 9/44 (20%), Positives = 19/44 (43%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKK 46
            ++ K  ++++K K K  +KK+K   ++ E         S    
Sbjct: 126 GEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTV 169



 Score = 28.0 bits (63), Expect = 0.95
 Identities = 10/55 (18%), Positives = 29/55 (52%), Gaps = 2/55 (3%)

Query: 2   VEKRRKKEKKKKKK--KKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDL 54
           V+ +R +++ KK++   +K+K  ++++  K K      + ++  P+      +D+
Sbjct: 110 VQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDI 164


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 27.9 bits (62), Expect = 0.75
 Identities = 14/23 (60%), Positives = 16/23 (69%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKK 25
           EK +KK+KK KK KK KK  KK 
Sbjct: 98  EKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 27.5 bits (61), Expect = 0.85
 Identities = 13/19 (68%), Positives = 14/19 (73%)

Query: 8   KEKKKKKKKKKKKKKKKKK 26
           K+KKKK KK KK KK  KK
Sbjct: 101 KKKKKKSKKTKKPKKSSKK 119



 Score = 27.5 bits (61), Expect = 1.0
 Identities = 13/20 (65%), Positives = 14/20 (70%)

Query: 7   KKEKKKKKKKKKKKKKKKKK 26
           KK+KKK KK KK KK  KK 
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120



 Score = 27.2 bits (60), Expect = 1.5
 Identities = 13/24 (54%), Positives = 15/24 (62%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKK 26
            +  +K KKKKKK KK KK KK  
Sbjct: 94  PEPTEKPKKKKKKSKKTKKPKKSS 117



 Score = 26.8 bits (59), Expect = 1.6
 Identities = 14/24 (58%), Positives = 15/24 (62%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKK 26
           E   K +KKKKK KK KK KK  K
Sbjct: 95  EPTEKPKKKKKKSKKTKKPKKSSK 118



 Score = 26.8 bits (59), Expect = 1.7
 Identities = 15/23 (65%), Positives = 17/23 (73%)

Query: 9   EKKKKKKKKKKKKKKKKKTCKRK 31
           EK KKKKKK KK KK KK+ K+ 
Sbjct: 98  EKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 26.8 bits (59), Expect = 1.9
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 11  KKKKKKKKKKKKKKKKTCKRKE 32
           K KKKKKK KK KK K   +K+
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 24.8 bits (54), Expect = 8.9
 Identities = 12/22 (54%), Positives = 14/22 (63%)

Query: 14  KKKKKKKKKKKKKTCKRKEEPD 35
           K KKKKKK KK K  K+  + D
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 28.0 bits (63), Expect = 0.86
 Identities = 10/36 (27%), Positives = 14/36 (38%)

Query: 3  EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQ 38
          EK  K+E K        K + +K      E  D +Q
Sbjct: 25 EKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQ 60



 Score = 27.3 bits (61), Expect = 1.7
 Identities = 8/42 (19%), Positives = 18/42 (42%)

Query: 3  EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTP 44
           K ++ E++K  K++ K          R  +   + D++  P
Sbjct: 17 LKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDP 58



 Score = 25.7 bits (57), Expect = 5.7
 Identities = 9/40 (22%), Positives = 15/40 (37%)

Query: 4  KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSST 43
          K+ ++EK  K++ K        K   RK      +    T
Sbjct: 20 KQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPT 59


>gnl|CDD|188606 TIGR04091, LTA_dltB, D-alanyl-lipoteichoic acid biosynthesis
           protein DltB.  Members of this protein family are DltB,
           part of a four-gene operon for D-alanyl-lipoteichoic
           acid biosynthesis that is present in the vast majority
           of low-GC Gram-positive organisms. This protein may be
           involved in transport of D-alanine across the plasma
           membrane [Cell envelope, Biosynthesis and degradation of
           murein sacculus and peptidoglycan].
          Length = 380

 Score = 28.3 bits (64), Expect = 0.86
 Identities = 8/14 (57%), Positives = 10/14 (71%)

Query: 9   EKKKKKKKKKKKKK 22
           ++K KK KK KK K
Sbjct: 342 QRKSKKYKKNKKNK 355



 Score = 27.1 bits (61), Expect = 1.9
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 11  KKKKKKKKKKKKK 23
           +K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355



 Score = 27.1 bits (61), Expect = 1.9
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 12  KKKKKKKKKKKKK 24
           +K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355



 Score = 27.1 bits (61), Expect = 1.9
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 13  KKKKKKKKKKKKK 25
           +K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355



 Score = 27.1 bits (61), Expect = 1.9
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 14  KKKKKKKKKKKKK 26
           +K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355


>gnl|CDD|233723 TIGR02094, more_P_ylases, alpha-glucan phosphorylases.  This family
           consists of known phosphorylases, and homologs believed
           to share the function of using inorganic phosphate to
           cleave an alpha 1,4 linkage between the terminal glucose
           residue and the rest of the polymer (maltodextrin,
           glycogen, etc.). The name of the glucose storage polymer
           substrate, and therefore the name of this enzyme,
           depends on the chain lengths and branching patterns. A
           number of the members of this family have been shown to
           operate on small maltodextrins, as may be obtained by
           utilization of exogenous sources. This family represents
           a distinct clade from the related family modeled by
           TIGR02093/pfam00343.
          Length = 601

 Score = 28.1 bits (63), Expect = 0.92
 Identities = 11/28 (39%), Positives = 16/28 (57%), Gaps = 2/28 (7%)

Query: 42  STPKKPRLVFTDLQRRTLQAIFKETKRP 69
           +T K+  L+F DL+R  L  I    +RP
Sbjct: 399 ATYKRADLIFRDLER--LARILNNPERP 424


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 27.9 bits (62), Expect = 1.0
 Identities = 16/29 (55%), Positives = 22/29 (75%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
           EK+ KK+K +  K++KKKKK+KKK  KR 
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKKKKRH 169



 Score = 27.1 bits (60), Expect = 1.7
 Identities = 15/31 (48%), Positives = 21/31 (67%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
           K++K E  K++KKKKK+KKKKKK    +   
Sbjct: 145 KKKKHEDDKERKKKKKEKKKKKKRHSPEHPG 175



 Score = 26.3 bits (58), Expect = 3.0
 Identities = 13/31 (41%), Positives = 23/31 (74%)

Query: 5   RRKKEKKKKKKKKKKKKKKKKKTCKRKEEPD 35
           ++ K+KK +  K++KKKKK+KK  K++  P+
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKRHSPE 172



 Score = 25.9 bits (57), Expect = 4.3
 Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
            K  +K+ KKKK +  K++KKKKK  K+K+
Sbjct: 137 LKGHEKKHKKKKHEDDKERKKKKKEKKKKK 166


>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464).  This
          family of proteins are functionally uncharacterized.
          This protein is found in bacteria and eukaryotes.
          Proteins in this family are typically between 137 to
          196 amino acids in length.
          Length = 149

 Score = 27.7 bits (62), Expect = 1.1
 Identities = 15/60 (25%), Positives = 27/60 (45%), Gaps = 3/60 (5%)

Query: 1  MVEKRRKKE---KKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRR 57
          M   R++     K+ +KK  K  +K      K K++   S+  +S    P++V   + RR
Sbjct: 1  MASSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASESGIPKVVSNRMARR 60


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 27.7 bits (62), Expect = 1.2
 Identities = 12/18 (66%), Positives = 14/18 (77%)

Query: 6   RKKEKKKKKKKKKKKKKK 23
           RK+E  KK+KKKK KK K
Sbjct: 357 RKREGDKKQKKKKSKKLK 374



 Score = 26.6 bits (59), Expect = 3.6
 Identities = 10/18 (55%), Positives = 15/18 (83%)

Query: 4   KRRKKEKKKKKKKKKKKK 21
           ++R+ +KK+KKKK KK K
Sbjct: 357 RKREGDKKQKKKKSKKLK 374



 Score = 25.8 bits (57), Expect = 5.8
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 3   EKRRKKEKKKKKKKKKKKKK 22
            KR+++  KK+KKKK KK K
Sbjct: 355 PKRKREGDKKQKKKKSKKLK 374



 Score = 25.4 bits (56), Expect = 7.5
 Identities = 11/20 (55%), Positives = 14/20 (70%)

Query: 8   KEKKKKKKKKKKKKKKKKKT 27
           K K++  KK+KKKK KK K 
Sbjct: 356 KRKREGDKKQKKKKSKKLKL 375


>gnl|CDD|206687 cd01900, YchF, YchF GTPase.  YchF is a member of the Obg family,
           which includes four other subfamilies of GTPases: Obg,
           DRG, Ygr210, and NOG1. Obg is an essential gene that is
           involved in DNA replication in C. crescentus and
           Streptomyces griseus and is associated with the
           ribosome. Several members of the family, including YchF,
           possess the TGS domain related to the RNA-binding
           proteins. Experimental data and genomic analysis suggest
           that YchF may be part of a nucleoprotein complex and may
           function as a GTP-dependent translational factor.
          Length = 274

 Score = 27.4 bits (62), Expect = 1.4
 Identities = 8/24 (33%), Positives = 15/24 (62%)

Query: 1   MVEKRRKKEKKKKKKKKKKKKKKK 24
            +EKR ++ +KK K   K+ K++ 
Sbjct: 135 TIEKRLERLEKKAKSGDKEAKEEL 158


>gnl|CDD|180803 PRK07037, PRK07037, extracytoplasmic-function sigma-70 factor;
           Validated.
          Length = 163

 Score = 27.3 bits (61), Expect = 1.5
 Identities = 18/40 (45%), Positives = 22/40 (55%), Gaps = 3/40 (7%)

Query: 52  TDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPTTVGNF 91
           ++L  RT  A   E  R   E Q  IAR+LG+ PT V NF
Sbjct: 108 SELPARTRYAF--EMYRLHGETQKDIARELGVSPTLV-NF 144


>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
          [Posttranslational modification, protein turnover,
          chaperones].
          Length = 881

 Score = 27.2 bits (60), Expect = 1.8
 Identities = 20/95 (21%), Positives = 40/95 (42%), Gaps = 9/95 (9%)

Query: 1  MVEKRRKKEK--KKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVF------- 51
          MV++  KK++   ++ +   +K+   KK  K++EE  +S++ +       L+        
Sbjct: 1  MVDESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEEDAMLKGDLELLVERIQDPD 60

Query: 52 TDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPT 86
           DLQ  +L  + +  K  +  M         L P 
Sbjct: 61 IDLQNNSLNMLKEVIKSSTSSMTAVPKPLKFLRPN 95


>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
          receptor (72 kDa mitochondrial outermembrane protein)
          (mitochondrial import receptor for the ADP/ATP carrier)
          (translocase of outermembrane tom70).  [Transport and
          binding proteins, Amino acids, peptides and amines].
          Length = 615

 Score = 27.3 bits (60), Expect = 1.9
 Identities = 11/43 (25%), Positives = 27/43 (62%)

Query: 5  RRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
          R+ +++++ K K  KK+++K+K  +++ E    + +S+ PK  
Sbjct: 57 RKGQQQRESKPKISKKERRKRKQAEKETEGKTEEKKSTAPKNA 99


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 26.9 bits (60), Expect = 1.9
 Identities = 15/48 (31%), Positives = 24/48 (50%)

Query: 2   VEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRL 49
           VE  + + ++++ KKKKKKKKKK K          +   S+ P +   
Sbjct: 169 VELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSS 216



 Score = 25.0 bits (55), Expect = 9.8
 Identities = 14/58 (24%), Positives = 23/58 (39%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRRTLQA 61
           +  + +KKKKKKKKK KK     +            + S+          L+++   A
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIA 234


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 27.1 bits (60), Expect = 2.0
 Identities = 13/28 (46%), Positives = 13/28 (46%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
               K K  K  KKK K K KK T KR 
Sbjct: 757 SASTKGKAAKTVKKKSKAKSKKTTKKRA 784



 Score = 26.7 bits (59), Expect = 2.8
 Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
           EK+ K +KK    K K  K  KKK+  + ++ 
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKT 779



 Score = 25.9 bits (57), Expect = 5.7
 Identities = 13/31 (41%), Positives = 16/31 (51%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
           K    + K  K  KKK K K KKT K++  P
Sbjct: 756 KSASTKGKAAKTVKKKSKAKSKKTTKKRAGP 786


>gnl|CDD|236583 PRK09601, PRK09601, GTP-binding protein YchF; Reviewed.
          Length = 364

 Score = 27.0 bits (61), Expect = 2.0
 Identities = 9/24 (37%), Positives = 15/24 (62%)

Query: 2   VEKRRKKEKKKKKKKKKKKKKKKK 25
           VEKR ++ +KK K   K+ K + +
Sbjct: 140 VEKRLERLEKKAKGGDKEAKAELE 163


>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist.
          LIN-8 is a nuclear protein, present at the sites of
          transcriptional repressor complexes, which interacts
          with LIN-35 Rb.Lin35 Rb is a product of the class B
          synMuv gene lin-35 which silences genes required for
          vulval specification through chromatin modification and
          remodelling. The biological role of the interaction has
          not yet been determined however predictions have been
          made. The interaction shows that class A synMuv genes
          control vulval induction through the transcriptional
          regulation of gene expression. LIN-8 normally functions
          as part of a protein complex however when the complex
          is absent, other family members can partially replace
          LIN-8 activity.
          Length = 316

 Score = 27.0 bits (60), Expect = 2.1
 Identities = 19/77 (24%), Positives = 32/77 (41%), Gaps = 10/77 (12%)

Query: 2  VEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEE-PDI-SQDQSSTPKKPRLVFTDLQRRT- 58
          +++  + EK K  K   K    KK      E+ P++  ++     KK  +V  ++  RT 
Sbjct: 3  LKEYLELEKTKFGKYATKDVTLKKVVLSLLEKRPEMWKRNTRGQAKKWAIVGVEVYERTG 62

Query: 59 -------LQAIFKETKR 68
                 L+ IFK  K 
Sbjct: 63 KLVSVKDLRKIFKTAKD 79


>gnl|CDD|240334 PTZ00258, PTZ00258, GTP-binding protein; Provisional.
          Length = 390

 Score = 26.8 bits (60), Expect = 2.1
 Identities = 14/25 (56%), Positives = 19/25 (76%)

Query: 1   MVEKRRKKEKKKKKKKKKKKKKKKK 25
            VEKR  +  KK+KKKKKKK++K +
Sbjct: 158 FVEKRLDELTKKRKKKKKKKEEKVE 182


>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
          Length = 763

 Score = 27.2 bits (60), Expect = 2.2
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 1   MVEKRRKKEKKKKKKKKKKKKKKKKKT 27
           + + + + E +  K  KK KK KKKK 
Sbjct: 233 IEKPQAEAENEAGKSDKKDKKSKKKKV 259


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 27.0 bits (60), Expect = 2.4
 Identities = 12/47 (25%), Positives = 21/47 (44%)

Query: 1  MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
           + +   + ++ + K+  KKKKKKKK  K+K   +        P   
Sbjct: 39 FLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVW 85



 Score = 26.6 bits (59), Expect = 3.5
 Identities = 12/35 (34%), Positives = 20/35 (57%)

Query: 1  MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPD 35
          M E    + K++K++ K+K  KKK K  K+  + D
Sbjct: 2  MNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDD 36


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 26.5 bits (59), Expect = 2.4
 Identities = 19/45 (42%), Positives = 27/45 (60%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
            K+R K +KKK+KKKKKKK KK    + KE    S++ S   ++ 
Sbjct: 84  AKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEG 128



 Score = 25.4 bits (56), Expect = 5.9
 Identities = 19/31 (61%), Positives = 23/31 (74%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEE 33
           EK  KK  K++KKK+KKKKKKK K   +KEE
Sbjct: 81  EKTAKKRAKRQKKKQKKKKKKKAKKGNKKEE 111


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 26.7 bits (60), Expect = 2.6
 Identities = 15/72 (20%), Positives = 30/72 (41%), Gaps = 3/72 (4%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQD--QSSTPKKPRLVFTDLQRRTLQ 60
           +++  KE  +  KK + +KKK+ K    KE P               + +  + +R  ++
Sbjct: 183 DEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEEITPMKEINEEERRVVVE 242

Query: 61  A-IFKETKRPSK 71
             +FK   +  K
Sbjct: 243 GYVFKVEIKELK 254


>gnl|CDD|114472 pfam05748, Rubella_E1, Rubella membrane glycoprotein E1.  Rubella
           virus (RV), the sole member of the genus Rubivirus
           within the family Togaviridae, is a small enveloped,
           positive strand RNA virus. The nucleocapsid consists of
           40S genomic RNA and a single species of capsid protein
           which is enveloped within a host-derived lipid bilayer
           containing two viral glycoproteins, E1 (58 kDa) and E2
           (42-46 kDa). In virus infected cells, RV matures by
           budding either at the plasma membrane, or at the
           internal membranes depending on the cell type and enters
           adjacent uninfected cells by a membrane fusion process
           in the endosome, directed by E1-E2 heterodimers. The
           heterodimer formation is crucial for E1 transport out of
           the endoplasmic reticulum to the Golgi and plasma
           membrane. In RV E1, a cysteine at position 82 is crucial
           for the E1-E2 heterodimer formation and cell surface
           expression of the two proteins. The E1 has been shown to
           be a type 1 membrane protein, rich in cysteine residues
           with extensive intramolecular disulfide bonds.
          Length = 496

 Score = 27.0 bits (59), Expect = 2.6
 Identities = 11/26 (42%), Positives = 16/26 (61%), Gaps = 1/26 (3%)

Query: 28  CKRKEEPDISQDQSSTPKKPRLVFTD 53
           C+R   PD S+   +TP++PRL   D
Sbjct: 250 CQR-HSPDCSRLVGATPERPRLRLVD 274


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 26.7 bits (60), Expect = 2.6
 Identities = 12/25 (48%), Positives = 19/25 (76%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKT 27
           E R++  K  +KK+KKKKK+K+K+ 
Sbjct: 611 EARKRLNKANEKKEKKKKKQKEKQE 635



 Score = 25.9 bits (58), Expect = 6.0
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 5   RRKKEKKKKKKKKKKKKKKKKKT 27
            + ++ KKKKKKK K  K K +T
Sbjct: 680 EKIQKPKKKKKKKPKTVKPKPRT 702


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 26.9 bits (60), Expect = 2.7
 Identities = 10/39 (25%), Positives = 20/39 (51%)

Query: 1   MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQD 39
            V       +KKK++KK+++K+ K+      ++ D   D
Sbjct: 173 GVPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDD 211



 Score = 26.5 bits (59), Expect = 2.9
 Identities = 10/40 (25%), Positives = 22/40 (55%)

Query: 1   MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQ 40
           +V         +KKK++KK+++K+ K  +  ++ D  +D 
Sbjct: 171 LVGVPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDD 210



 Score = 26.5 bits (59), Expect = 3.1
 Identities = 9/42 (21%), Positives = 21/42 (50%)

Query: 1   MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSS 42
           +       EKKK++KK+++K+ K+ +     +E +    +  
Sbjct: 174 VPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGG 215



 Score = 25.7 bits (57), Expect = 6.1
 Identities = 11/47 (23%), Positives = 26/47 (55%)

Query: 10  KKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQR 56
               +KKK++KK+++K++ + +E+ D  +D+    +       + QR
Sbjct: 178 SSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEGGEERQR 224


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 27.0 bits (59), Expect = 2.8
 Identities = 11/45 (24%), Positives = 32/45 (71%)

Query: 5   RRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRL 49
           +R+ E+K +++++++K+K+K++  +R+ E + +   SS+  + R+
Sbjct: 589 KREAEQKAREEREREKEKEKEREREREREAERAAKASSSSHESRM 633


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 26.6 bits (59), Expect = 2.8
 Identities = 19/40 (47%), Positives = 25/40 (62%)

Query: 7   KKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKK 46
           KKE ++K+K K KKKK KKK  K K++ D  +D  S  K 
Sbjct: 75  KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 26.6 bits (59), Expect = 3.5
 Identities = 13/32 (40%), Positives = 22/32 (68%)

Query: 1   MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRKE 32
           + +   +K KKKKKK+KKK+++ K++   R E
Sbjct: 737 ISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768



 Score = 25.8 bits (57), Expect = 5.7
 Identities = 15/34 (44%), Positives = 23/34 (67%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDI 36
           E +     ++K KKKKKK+KKK++  KR+E+  I
Sbjct: 734 EFKISDSVEEKTKKKKKKEKKKEEEYKREEKARI 767


>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
          Length = 977

 Score = 26.5 bits (58), Expect = 3.5
 Identities = 16/57 (28%), Positives = 25/57 (43%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRRTL 59
           + R +  K KK KKKK  KKK+    +   E  + +  S+   +P      + R  L
Sbjct: 115 KNRVESWKDKKNKKKKSAKKKEAHKAQIPPEQQMEEKPSADAYEPLSRVIPIPRNKL 171


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 26.2 bits (58), Expect = 4.0
 Identities = 13/45 (28%), Positives = 23/45 (51%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
           E + K + K K K  KK +++ K+  K  E    S  +++ P +P
Sbjct: 96  EPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTAPARP 140


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 26.1 bits (58), Expect = 4.0
 Identities = 8/45 (17%), Positives = 14/45 (31%)

Query: 2   VEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKK 46
           + K++ K   K    K   KKK K      ++ +           
Sbjct: 74  IPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDL 118



 Score = 25.7 bits (57), Expect = 5.8
 Identities = 9/54 (16%), Positives = 15/54 (27%)

Query: 2   VEKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQ 55
            +  +KK K   K    K   KKK   +        +  +           D+ 
Sbjct: 72  SDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDID 125


>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
          Length = 472

 Score = 26.1 bits (58), Expect = 4.2
 Identities = 13/33 (39%), Positives = 22/33 (66%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDI 36
           +R +KEK+KK KK+ K++++     K  EE +I
Sbjct: 392 EREEKEKEKKAKKEGKEERRIHFQNKSIEEAEI 424


>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694).  Family of
           hypothetical bacterial proteins.
          Length = 278

 Score = 26.0 bits (58), Expect = 4.3
 Identities = 14/24 (58%), Positives = 18/24 (75%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKT 27
             R +E  K+KKKKKKKKKKK++ 
Sbjct: 79  ADRAEELLKEKKKKKKKKKKKEEL 102



 Score = 25.3 bits (56), Expect = 7.0
 Identities = 14/26 (53%), Positives = 20/26 (76%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCK 29
           +   KEKKKKKKKKKKK++ ++  C+
Sbjct: 83  EELLKEKKKKKKKKKKKEELREWMCE 108


>gnl|CDD|225767 COG3226, COG3226, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 204

 Score = 26.0 bits (57), Expect = 4.5
 Identities = 10/40 (25%), Positives = 15/40 (37%)

Query: 51  FTDLQRRTLQAIFKETKRPSKEMQVTIARQLGLEPTTVGN 90
           FT++  R LQA F +           +A   G +  T   
Sbjct: 68  FTEIMARQLQAFFSDVSDDRGACAAVLADLYGSQVATPDR 107


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 26.1 bits (58), Expect = 4.6
 Identities = 8/26 (30%), Positives = 15/26 (57%)

Query: 8   KEKKKKKKKKKKKKKKKKKTCKRKEE 33
           +EKKK KK  +K+ +  +    +K +
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKKRK 109


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 26.0 bits (57), Expect = 4.7
 Identities = 12/19 (63%), Positives = 14/19 (73%)

Query: 7   KKEKKKKKKKKKKKKKKKK 25
            +E K+KKKKK KKKK  K
Sbjct: 86  NQENKQKKKKKDKKKKSPK 104


>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV. 
          Length = 336

 Score = 25.9 bits (57), Expect = 5.5
 Identities = 11/44 (25%), Positives = 24/44 (54%)

Query: 4  KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKP 47
            R  + + K +K K+ KK+KK+  K + + ++   +S  P++ 
Sbjct: 20 GPRPVKDEAKPRKIKRVKKRKKREEKDELDDEVEFVRSFAPRRR 63


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 25.7 bits (57), Expect = 5.6
 Identities = 10/24 (41%), Positives = 13/24 (54%)

Query: 5   RRKKEKKKKKKKKKKKKKKKKKTC 28
            R K +  ++ K KKK  KKK  C
Sbjct: 782 HRSKLRSSQRDKLKKKLLKKKDFC 805


>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
           beta and delta phosphatidylinositol 3-kinases (PI3Ks).
           PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
           regulate cell processes such as cell growth,
           differentiation, proliferation, and motility.  PI3Ks
           work on phosphorylation of phosphatidylinositol,
           phosphatidylinositide (4)P (PtdIns (4)P),2 or
           PtdIns(4,5)P2. Specifically they phosphorylate the D3
           hydroxyl group of phosphoinositol lipids on the inositol
           ring. There are 3 classes of PI3Ks based on structure,
           regulation, and specificity. All classes contain a C2
           domain, a PIK domain, and a kinase catalytic domain.
           The members here are class I, beta and delta isoforms of
           PI3Ks and contain both a Ras-binding domain and a
           p85-binding domain.  Class II PI3Ks contain both of
           these as well as a PX domain, and a C-terminal C2 domain
           containing a nuclear localization signal.  C2 domains
           fold into an 8-standed beta-sandwich that can adopt 2
           structural arrangements: Type I and Type II,
           distinguished by a circular permutation involving their
           N- and C-terminal beta strands. Many C2 domains are
           Ca2+-dependent membrane-targeting modules that bind a
           wide variety of substances including bind phospholipids,
           inositol polyphosphates, and intracellular proteins.
           Most C2 domain proteins are either signal transduction
           enzymes that contain a single C2 domain, such as protein
           kinase C, or membrane trafficking proteins which contain
           at least two C2 domains, such as synaptotagmin 1.
           However, there are a few exceptions to this including
           RIM isoforms and some splice variants of piccolo/aczonin
           and intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions.  Members have a type-I
           topology.
          Length = 173

 Score = 25.4 bits (56), Expect = 5.8
 Identities = 10/22 (45%), Positives = 15/22 (68%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKK 24
           E  +K + K+ +K + KKKKKK
Sbjct: 84  EVSKKAKGKRSRKNQTKKKKKK 105



 Score = 25.4 bits (56), Expect = 7.0
 Identities = 11/20 (55%), Positives = 14/20 (70%)

Query: 6   RKKEKKKKKKKKKKKKKKKK 25
            KK K K+ +K + KKKKKK
Sbjct: 86  SKKAKGKRSRKNQTKKKKKK 105


>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein.  This model
           describes nucleoid occlusion protein, a close homolog to
           ParB chromosome partitioning proteins including Spo0J in
           Bacillus subtilis. Its gene often is located near the
           gene for the Spo0J ortholog. This protein bind a
           specific DNA sequence and blocks cytokinesis from
           happening until chromosome segregation is complete.
          Length = 255

 Score = 25.5 bits (57), Expect = 5.8
 Identities = 11/23 (47%), Positives = 16/23 (69%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKK 25
           E+  KK  +K +K KKKKK++K 
Sbjct: 188 EELIKKLLEKPEKPKKKKKRRKG 210


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 25.8 bits (57), Expect = 5.9
 Identities = 11/28 (39%), Positives = 26/28 (92%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKR 30
           E+RR++EK+++KK+ K++K+++K+T ++
Sbjct: 182 ERRRREEKERRKKQDKERKQREKETAEK 209


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 25.8 bits (57), Expect = 6.0
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKK 26
           ++RK  KK  KK    KK+K+K+
Sbjct: 225 RKRKAPKKVAKKVAAAKKRKQKR 247


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 26.0 bits (57), Expect = 6.0
 Identities = 11/31 (35%), Positives = 18/31 (58%), Gaps = 3/31 (9%)

Query: 3   EKRRKKE---KKKKKKKKKKKKKKKKKTCKR 30
           E+RR +E   + K +K  + K + K+KT K 
Sbjct: 441 EERRLREEAIRAKAEKTARMKAEMKEKTLKM 471


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 25.5 bits (56), Expect = 6.6
 Identities = 10/29 (34%), Positives = 22/29 (75%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
           EK +++E +KK+K++  + K+K++  +RK
Sbjct: 405 EKLKQEENEKKQKEQADEDKEKRQKDERK 433


>gnl|CDD|217611 pfam03552, Cellulose_synt, Cellulose synthase.  Cellulose, an
           aggregate of unbranched polymers of beta-1,4-linked
           glucose residues, is the major component of wood and
           thus paper, and is synthesised by plants, most algae,
           some bacteria and fungi, and even some animals. The
           genes that synthesise cellulose in higher plants differ
           greatly from the well-characterized genes found in
           Acetobacter and Agrobacterium sp. More correctly
           designated as 'cellulose synthase catalytic subunits',
           plant cellulose synthase (CesA) proteins are integral
           membrane proteins, approximately 1,000 amino acids in
           length. There are a number of highly conserved residues,
           including several motifs shown to be necessary for
           processive glycosyltransferase activity.
          Length = 716

 Score = 25.9 bits (57), Expect = 6.7
 Identities = 11/21 (52%), Positives = 13/21 (61%)

Query: 13  KKKKKKKKKKKKKKTCKRKEE 33
           ++KK K   K KKK  KRKE 
Sbjct: 312 RRKKPKSASKAKKKDSKRKES 332


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 25.0 bits (55), Expect = 7.0
 Identities = 16/27 (59%), Positives = 20/27 (74%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCK 29
            K++K++KKKKKKKKKK  KK  K  K
Sbjct: 109 AKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 25.0 bits (55), Expect = 8.3
 Identities = 18/31 (58%), Positives = 22/31 (70%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEP 34
           K+  KEK K+KK+KKKKKKKKKK   +K   
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAK 132



 Score = 24.6 bits (54), Expect = 9.9
 Identities = 18/29 (62%), Positives = 23/29 (79%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
           EK ++K++KKKKKKKKKKK  KK   K+K
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135


>gnl|CDD|240399 PTZ00393, PTZ00393, protein tyrosine phosphatase; Provisional.
          Length = 241

 Score = 25.3 bits (55), Expect = 7.0
 Identities = 15/32 (46%), Positives = 19/32 (59%), Gaps = 2/32 (6%)

Query: 2   VEKRRKKEKKKKKKK--KKKKKKKKKKTCKRK 31
           +  RRK    K++ +  K  KKKKKKK C RK
Sbjct: 206 IRDRRKGAINKRQLQFLKAYKKKKKKKNCLRK 237


>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
           splicing MRS1.  Members of this family adopt a secondary
           structure consisting of two beta sheets and one alpha
           helix, arranged as a beta-alpha-beta motif. Each beta
           sheet has five strands, arranged in a 32145 order, with
           the second strand being antiparallel to the rest.
           Mitochondrial resolvase Ydc2 is capable of resolving
           Holliday junctions and cleaves DNA after 5'-CT-3' and
           5'-TT-3' sequences. This family also contains the
           mitochondrial RNA-splicing protein MRS1 which is
           involved in the excision of group I introns.
          Length = 254

 Score = 25.4 bits (56), Expect = 7.5
 Identities = 13/67 (19%), Positives = 22/67 (32%), Gaps = 4/67 (5%)

Query: 5   RRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKKPRLVFTDLQRRTLQAIFK 64
             KK+ + KKK  K  KK +           +S   S     P  +    + R     ++
Sbjct: 153 AEKKKPRSKKKSSKNSKKLRIDLVGEW----LSTAISPPDSSPVSLDAREKLRAYLPKWE 208

Query: 65  ETKRPSK 71
              +  K
Sbjct: 209 GKTKNKK 215


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 25.3 bits (56), Expect = 8.2
 Identities = 12/26 (46%), Positives = 17/26 (65%)

Query: 8   KEKKKKKKKKKKKKKKKKKTCKRKEE 33
           +E+K+KKK   K++KK  K  K K E
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLE 120


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 25.0 bits (55), Expect = 8.4
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query: 6   RKKEKKKKKKKKKKKKKKKKKTCKRK 31
            ++EKKKKKKK  KKKK KK      
Sbjct: 178 AEEEKKKKKKKSAKKKKLKKVAAVGM 203



 Score = 25.0 bits (55), Expect = 9.1
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 1   MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
               ++K    +++KKKKKKK  KKK  K+ 
Sbjct: 168 AARDKKKSNNAEEEKKKKKKKSAKKKKLKKV 198


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 25.1 bits (55), Expect = 8.5
 Identities = 12/38 (31%), Positives = 22/38 (57%)

Query: 3  EKRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQ 40
          +K+R    +K    KK KKK K+K  + +++P+   D+
Sbjct: 52 KKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDK 89


>gnl|CDD|220633 pfam10214, Rrn6, RNA polymerase I-specific transcription-initiation
           factor.  RNA polymerase I-specific
           transcription-initiation factor Rrn6 and Rrn7 represent
           components of a multisubunit transcription factor
           essential for the initiation of rDNA transcription by
           Pol I. These proteins are found in fungi.
          Length = 753

 Score = 25.5 bits (56), Expect = 8.6
 Identities = 8/14 (57%), Positives = 10/14 (71%)

Query: 7   KKEKKKKKKKKKKK 20
            +   KKKKKKK+K
Sbjct: 737 GRSASKKKKKKKRK 750


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 25.4 bits (56), Expect = 8.7
 Identities = 9/43 (20%), Positives = 15/43 (34%)

Query: 4   KRRKKEKKKKKKKKKKKKKKKKKTCKRKEEPDISQDQSSTPKK 46
              K+EK           K  K+  K  +  +  +D S  P +
Sbjct: 217 NINKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLE 259


>gnl|CDD|239463 cd03347, eu_PheOH, Eukaryotic phenylalanine-4-hydroxylase
          (eu_PheOH); a member of the biopterin-dependent
          aromatic amino acid hydroxylase family of non-heme,
          iron(II)-dependent enzymes that also includes
          prokaryotic phenylalanine-4-hydroxylase (pro_PheOH),
          eukaryotic tyrosine hydroxylase (TyrOH) and eukaryotic
          tryptophan hydroxylase (TrpOH).  PheOH catalyzes the
          first and rate-limiting step in the metabolism of the
          amino acid L-phenylalanine (L-Phe), the hydroxylation
          of L-Phe to L-tyrosine (L-Tyr). It uses
          (6R)-L-erythro-5,6,7,8-tetrahydrobiopterin (BH4) as the
          physiological electron donor. The catalytic activity of
          the tetrameric enzyme is tightly regulated by the
          binding of L-Phe and BH4 as well as by phosphorylation.
          Mutations in the human enzyme are linked to a severe
          variant of phenylketonuria.
          Length = 306

 Score = 25.1 bits (55), Expect = 9.4
 Identities = 12/40 (30%), Positives = 23/40 (57%)

Query: 29 KRKEEPDISQDQSSTPKKPRLVFTDLQRRTLQAIFKETKR 68
          +RKE  DI+ +       PR+ +T+ +++T   +F+E K 
Sbjct: 39 RRKEFADIAYNYKHGQPIPRVEYTEEEKKTWGTVFRELKS 78


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 25.1 bits (55), Expect = 9.6
 Identities = 13/23 (56%), Positives = 14/23 (60%)

Query: 3   EKRRKKEKKKKKKKKKKKKKKKK 25
            K  K   KK KK KKKKKKK+ 
Sbjct: 195 YKGDKSSAKKDKKSKKKKKKKRS 217


>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
           topoisomerase IA [General function prediction only].
          Length = 298

 Score = 25.1 bits (55), Expect = 9.8
 Identities = 12/31 (38%), Positives = 16/31 (51%)

Query: 1   MVEKRRKKEKKKKKKKKKKKKKKKKKTCKRK 31
           ++ +RR K    KK  KK    K KKT  +K
Sbjct: 251 LLAERRAKGGPGKKPAKKATAAKAKKTTAKK 281


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.311    0.126    0.352 

Gapped
Lambda     K      H
   0.267   0.0685    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,616,691
Number of extensions: 483117
Number of successful extensions: 9725
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6008
Number of HSP's successfully gapped: 1523
Length of query: 111
Length of database: 10,937,602
Length adjustment: 75
Effective length of query: 36
Effective length of database: 7,611,052
Effective search space: 273997872
Effective search space used: 273997872
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 53 (24.3 bits)