RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy13676
         (200 letters)



>gnl|CDD|214857 smart00853, MutL_C, MutL C terminal dimerisation domain.  MutL and
           MutS are key components of the DNA repair machinery that
           corrects replication errors. MutS recognises mispaired
           or unpaired bases in a DNA duplex and in the presence of
           ATP, recruits MutL to form a DNA signaling complex for
           repair. The N terminal region of MutL contains the
           ATPase domain and the C terminal is involved in
           dimerisation.
          Length = 140

 Score = 76.2 bits (188), Expect = 5e-18
 Identities = 25/77 (32%), Positives = 45/77 (58%), Gaps = 2/77 (2%)

Query: 77  GQ-AEGFIIVKYDSDLFIIDQHATDEKYNFETLQKTTV-IKSQKLVVPQNLHLTKINQCI 134
           GQ A  +I+ + +  L+++DQHA  E+  +E L K    ++SQ L++P  L L+     +
Sbjct: 3   GQVAGTYILAEREDGLYLLDQHAAHERILYEQLLKQAGGLESQPLLIPVRLELSPQEAAL 62

Query: 135 LKDNLPVFYKNGFEFRI 151
           L+++L +  + GFE  I
Sbjct: 63  LEEHLELLRQLGFELEI 79


>gnl|CDD|219965 pfam08676, MutL_C, MutL C terminal dimerisation domain.  MutL and
           MutS are key components of the DNA repair machinery that
           corrects replication errors. MutS recognises mispaired
           or unpaired bases in a DNA duplex and in the presence of
           ATP, recruits MutL to form a DNA signaling complex for
           repair. The N terminal region of MutL contains the
           ATPase domain and the C terminal is involved in
           dimerisation.
          Length = 144

 Score = 66.1 bits (162), Expect = 3e-14
 Identities = 21/76 (27%), Positives = 41/76 (53%), Gaps = 3/76 (3%)

Query: 77  GQ-AEGFIIVKYDSDLFIIDQHATDEKYNFETLQKTT--VIKSQKLVVPQNLHLTKINQC 133
           GQ    +I+ + +  L+++DQHA  E+  +E L+       +SQ L++P  L L+     
Sbjct: 7   GQVHGTYILAENEDGLYLVDQHAAHERILYEKLKSKLKEGGQSQPLLIPLTLELSPEEAA 66

Query: 134 ILKDNLPVFYKNGFEF 149
           +L+++  +  + GFE 
Sbjct: 67  LLEEHKELLARLGFEL 82


>gnl|CDD|239566 cd03484, MutL_Trans_hPMS_2_like, MutL_Trans_hPMS2_like:
          transducer domain, having a ribosomal S5 domain 2-like
          fold, found in proteins similar to human PSM2 (hPSM2).
          hPSM2 belongs to the DNA mismatch repair
          (MutL/MLH1/PMS2) family.  This transducer domain is
          homologous to the second domain of the DNA gyrase B
          subunit, which is known to be important in nucleotide
          hydrolysis and the transduction of structural signals
          from ATP-binding site to the DNA breakage/reunion
          regions of the enzymes. Included in this group are
          proteins similar to yeast PMS1. The yeast MLH1-PMS1 and
          the human MLH1-PMS2 heterodimers play a role in
          meiosis. hMLH1-hPMS2 also participates in the repair of
          all DNA mismatch repair (MMR) substrates. Cells lacking
          hPMS2 have a strong mutator phenotype and display
          microsatellite instability (MSI). Mutation in hPMS2
          causes predisposition to HPNCC and Turcot syndrome.
          Length = 142

 Score = 65.8 bits (161), Expect = 5e-14
 Identities = 23/70 (32%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 8  ISQLINIIPITQSSPDASLLEEFRIPPD-ITAMFTLSGFISSAIHGHGRSTADRQFYFVN 66
             +  +IPI          EE     D   +   ++G+IS   HG GRS++DRQF+++N
Sbjct: 12 GKVIKGLIPINLELDVNPTKEELDSDEDLADSEVKITGYISKPSHGCGRSSSDRQFFYIN 71

Query: 67 SRPCEPSKVS 76
           RP +  KV+
Sbjct: 72 GRPVDLKKVA 81


>gnl|CDD|223400 COG0323, MutL, DNA mismatch repair enzyme (predicted ATPase) [DNA
           replication, recombination, and repair].
          Length = 638

 Score = 68.1 bits (167), Expect = 1e-13
 Identities = 22/79 (27%), Positives = 41/79 (51%), Gaps = 3/79 (3%)

Query: 74  KVSGQ-AEGFIIVKYDSDLFIIDQHATDEKYNFETLQKT--TVIKSQKLVVPQNLHLTKI 130
           +  GQ    +I+ +++  L ++DQHA  E+  +E L+     V + Q L++P  L L+  
Sbjct: 453 EAIGQVHGTYILAEHEDGLVLVDQHAAHERILYEKLKNELGNVGELQPLLIPIRLELSPE 512

Query: 131 NQCILKDNLPVFYKNGFEF 149
              +L+++     K GFE 
Sbjct: 513 EADVLEEHKEELEKLGFEI 531



 Score = 30.4 bits (69), Expect = 0.69
 Identities = 12/30 (40%), Positives = 17/30 (56%), Gaps = 2/30 (6%)

Query: 40  FTLSGFISSAIHGHGRSTADRQFYFVNSRP 69
             LSG++S       R++ D Q+ FVN RP
Sbjct: 235 LRLSGYVSLPEFT--RASRDYQYLFVNGRP 262


>gnl|CDD|234630 PRK00095, mutL, DNA mismatch repair protein; Reviewed.
          Length = 617

 Score = 48.3 bits (116), Expect = 8e-07
 Identities = 21/77 (27%), Positives = 41/77 (53%), Gaps = 4/77 (5%)

Query: 77  GQAEG-FIIVKYDSDLFIIDQHATDEKYNFETLQ---KTTVIKSQKLVVPQNLHLTKINQ 132
           GQ  G +I+ + +  L+++DQHA  E+  +E L+       + SQ L++P  L L++   
Sbjct: 435 GQLHGTYILAENEDGLYLVDQHAAHERLLYEQLKDKLAEVGLASQPLLIPLVLELSEDEA 494

Query: 133 CILKDNLPVFYKNGFEF 149
             L+++  +  + G E 
Sbjct: 495 DRLEEHKELLARLGLEL 511



 Score = 32.9 bits (76), Expect = 0.11
 Identities = 10/31 (32%), Positives = 15/31 (48%), Gaps = 4/31 (12%)

Query: 40  FTLSGFISS-AIHGHGRSTADRQFYFVNSRP 69
             LSG++    +    R+  D Q+ FVN R 
Sbjct: 230 LRLSGYVGLPTLS---RANRDYQYLFVNGRY 257


>gnl|CDD|238405 cd00782, MutL_Trans, MutL_Trans: transducer domain, having a
          ribosomal S5 domain 2-like fold, conserved in the
          C-terminal domain of DNA mismatch repair
          (MutL/MLH1/PMS2) family. This transducer domain is
          homologous to the second domain of the DNA gyrase B
          subunit, which is known to be important in nucleotide
          hydrolysis and the transduction of structural signals
          from ATP-binding site to the DNA breakage/reunion
          regions of the enzymes. Included in this group are
          proteins similar to human MLH1, hPMS2, hPMS1, hMLH3 and
          E. coli MutL,  MLH1 forms heterodimers with PMS2, PMS1
          and MLH3. These three complexes have distinct functions
          in meiosis. hMLH1-hPMS2 also participates in the repair
          of all DNA mismatch repair (MMR) substrates. Roles for
          hMLH1-hPMS1 or hMLH1-hMLH3 in MMR have not been
          established. Cells lacking either hMLH1 or hPMS2 have a
          strong mutator phenotype and display microsatellite
          instability (MSI). Mutation in hMLH1 causes
          predisposition to HNPCC, Muir-Torre syndrome and Turcot
          syndrome (HNPCC variant). Mutation in hPMS2 causes
          predisposition to HPNCC and Turcot syndrome. Mutation
          in hMLH1 accounts for a large fraction of HNPCC
          families. There is no convincing evidence to support
          hPMS1 having a role in HNPCC predisposition. It has
          been suggested that hMLH3 may be a low risk gene for
          colorectal cancer; however there is little evidence to
          support it having a role in classical HNPCC.  It has
          been suggested that during initiation of DNA mismatch
          repair in E. coli, the mismatch recognition protein
          MutS recruits MutL in the presence of ATP.  The
          MutS(ATP)-MutL ternary complex formed, then recruits
          the latent endonuclease MutH.
          Length = 122

 Score = 45.6 bits (109), Expect = 1e-06
 Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)

Query: 40 FTLSGFISSAIHGHGRSTADRQFYFVNSRPCEPSKVS 76
          F +SG+IS      GRS+ DRQF FVN RP     +S
Sbjct: 27 FRISGYISKP--DFGRSSKDRQFLFVNGRPVRDKLLS 61


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 46.5 bits (111), Expect = 3e-06
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R +KK+K+K+K K KK+ +  K   KR
Sbjct: 399 RAEKKEKEKEKPKVKKRHRDTKNIGKR 425



 Score = 44.2 bits (105), Expect = 2e-05
 Identities = 13/31 (41%), Positives = 20/31 (64%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +  + K  KK   K+ +KK+K+K+K K KKR
Sbjct: 385 EKKTGKPSKKVLAKRAEKKEKEKEKPKVKKR 415



 Score = 44.2 bits (105), Expect = 2e-05
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            +KK+K+K+K K KK+ +  K   K+R
Sbjct: 400 AEKKEKEKEKPKVKKRHRDTKNIGKRR 426



 Score = 43.0 bits (102), Expect = 5e-05
 Identities = 13/34 (38%), Positives = 22/34 (64%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           S+K   K+ +KK+K+K+K K KK+ +    I K+
Sbjct: 392 SKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425



 Score = 42.2 bits (100), Expect = 7e-05
 Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           +K+K+K+K K KK+ +  K   K++K
Sbjct: 402 KKEKEKEKPKVKKRHRDTKNIGKRRK 427



 Score = 41.9 bits (99), Expect = 9e-05
 Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           KK+K+K+K K KK+ +  K   K+++
Sbjct: 402 KKEKEKEKPKVKKRHRDTKNIGKRRK 427



 Score = 41.9 bits (99), Expect = 1e-04
 Identities = 12/28 (42%), Positives = 18/28 (64%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           S KK  K  KK   K+ +KK+K+K+K +
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPK 411



 Score = 41.9 bits (99), Expect = 1e-04
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           G   KK   K+ +KK+K+K+K K KK+ +
Sbjct: 389 GKPSKKVLAKRAEKKEKEKEKPKVKKRHR 417



 Score = 40.7 bits (96), Expect = 3e-04
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           T +  KK   K+ +KK+K+K+K K KK+ 
Sbjct: 388 TGKPSKKVLAKRAEKKEKEKEKPKVKKRH 416



 Score = 40.7 bits (96), Expect = 3e-04
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           ++K+K+K K KK+ +  K   K++K 
Sbjct: 403 KEKEKEKPKVKKRHRDTKNIGKRRKP 428



 Score = 39.9 bits (94), Expect = 4e-04
 Identities = 10/33 (30%), Positives = 20/33 (60%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            G  + +   K+ +KK+K+K+K K KK+ +  +
Sbjct: 388 TGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTK 420



 Score = 39.9 bits (94), Expect = 5e-04
 Identities = 9/25 (36%), Positives = 15/25 (60%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           K+K+K K KK+ +  K   K++K  
Sbjct: 405 KEKEKPKVKKRHRDTKNIGKRRKPS 429



 Score = 38.8 bits (91), Expect = 0.001
 Identities = 12/33 (36%), Positives = 21/33 (63%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKKH 199
            +KK  K  KK   K+ +KK+K+K++  + K+H
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRH 416



 Score = 38.8 bits (91), Expect = 0.001
 Identities = 8/27 (29%), Positives = 15/27 (55%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           ++K+K K KK+ +  K   K++K    
Sbjct: 405 KEKEKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 35.7 bits (83), Expect = 0.011
 Identities = 7/26 (26%), Positives = 13/26 (50%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKK 190
            ++K K KK+ +  K   K++K    
Sbjct: 406 EKEKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 34.9 bits (81), Expect = 0.019
 Identities = 9/29 (31%), Positives = 17/29 (58%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            ++   +KK  K  KK   K+ +KK+K++
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEK 407


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 43.3 bits (103), Expect = 1e-05
 Identities = 16/34 (47%), Positives = 23/34 (67%), Gaps = 2/34 (5%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYI--FK 197
            K +KK +K K K +KKK KK++ K RL++  FK
Sbjct: 72  EKAEKKAEKAKAKAEKKKAKKEEPKPRLFVLDFK 105



 Score = 34.8 bits (81), Expect = 0.011
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRL 193
           KK+ K  +K +KK +K K K +KK+ 
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKA 90


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 44.2 bits (105), Expect = 1e-05
 Identities = 16/27 (59%), Positives = 25/27 (92%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +K+++KK +K+KKKKK+KKK KK+KK+
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKKK 411



 Score = 43.0 bits (102), Expect = 4e-05
 Identities = 16/27 (59%), Positives = 25/27 (92%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +KK+++KK +K+KKKKK+KKK KK+K+
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKK 410



 Score = 42.6 bits (101), Expect = 5e-05
 Identities = 18/27 (66%), Positives = 24/27 (88%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R++KK +K+KKKKK+KKK KK+KKK R
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 42.6 bits (101), Expect = 5e-05
 Identities = 16/27 (59%), Positives = 24/27 (88%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           ++++KK +K+KKKKK+KKK KK+KKK 
Sbjct: 386 KREEKKPQKRKKKKKRKKKGKKRKKKG 412



 Score = 41.9 bits (99), Expect = 1e-04
 Identities = 17/26 (65%), Positives = 22/26 (84%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
            KK +K+KKKKK+KKK KK+KKK +K
Sbjct: 389 EKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 41.9 bits (99), Expect = 1e-04
 Identities = 16/26 (61%), Positives = 23/26 (88%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +KK +K+KKKKK+KKK KK+KKK ++
Sbjct: 389 EKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 41.5 bits (98), Expect = 1e-04
 Identities = 17/26 (65%), Positives = 23/26 (88%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
            KKK+++KK +K+KKKKK+KKK KKR
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKR 408



 Score = 38.8 bits (91), Expect = 0.001
 Identities = 15/26 (57%), Positives = 22/26 (84%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           K  KKK+++KK +K+KKKKK+KKK +
Sbjct: 381 KPPKKKREEKKPQKRKKKKKRKKKGK 406



 Score = 35.7 bits (83), Expect = 0.010
 Identities = 15/26 (57%), Positives = 21/26 (80%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           K+K  K  KKK+++KK +K+KKKKKR
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKR 401



 Score = 35.3 bits (82), Expect = 0.016
 Identities = 14/27 (51%), Positives = 21/27 (77%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            K  K  KKK+++KK +K+KKKKK+K+
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKKKKRKK 403



 Score = 34.9 bits (81), Expect = 0.017
 Identities = 13/27 (48%), Positives = 22/27 (81%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           ++K  K  KKK+++KK +K+KKKKK++
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRK 402



 Score = 29.2 bits (66), Expect = 1.6
 Identities = 11/21 (52%), Positives = 15/21 (71%)

Query: 162 DGTSRKKKKKKKKKKKKKKKK 182
               +KK+KKK KK+KKK +K
Sbjct: 394 KRKKKKKRKKKGKKRKKKGRK 414



 Score = 27.2 bits (61), Expect = 6.4
 Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            K+ ++ K+K  K  KKK+++KK +KR
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKR 395


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 41.4 bits (97), Expect = 7e-05
 Identities = 19/31 (61%), Positives = 24/31 (77%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            G  +K KKKK +  K++KKKKK+KKKKKKR
Sbjct: 138 KGHEKKHKKKKHEDDKERKKKKKEKKKKKKR 168



 Score = 39.8 bits (93), Expect = 2e-04
 Identities = 18/32 (56%), Positives = 23/32 (71%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           G     KK KKKK +  K++KKKKK+KKKKK+
Sbjct: 136 GLKGHEKKHKKKKHEDDKERKKKKKEKKKKKK 167



 Score = 33.6 bits (77), Expect = 0.028
 Identities = 14/31 (45%), Positives = 21/31 (67%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
               +  +KK KKKK +  K++KKKKK+KK+
Sbjct: 134 SEGLKGHEKKHKKKKHEDDKERKKKKKEKKK 164



 Score = 31.7 bits (72), Expect = 0.13
 Identities = 15/31 (48%), Positives = 21/31 (67%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           D    K  +KK KKKK +  K++KKKKK+K+
Sbjct: 133 DSEGLKGHEKKHKKKKHEDDKERKKKKKEKK 163



 Score = 26.7 bits (59), Expect = 7.0
 Identities = 12/17 (70%), Positives = 16/17 (94%)

Query: 167 KKKKKKKKKKKKKKKKK 183
           K++KKKKK+KKKKKK+ 
Sbjct: 153 KERKKKKKEKKKKKKRH 169



 Score = 26.7 bits (59), Expect = 7.0
 Identities = 12/17 (70%), Positives = 16/17 (94%)

Query: 168 KKKKKKKKKKKKKKKKK 184
           K++KKKKK+KKKKKK+ 
Sbjct: 153 KERKKKKKEKKKKKKRH 169



 Score = 26.3 bits (58), Expect = 8.3
 Identities = 13/19 (68%), Positives = 15/19 (78%)

Query: 162 DGTSRKKKKKKKKKKKKKK 180
           D   RKKKKK+KKKKKK+ 
Sbjct: 151 DDKERKKKKKEKKKKKKRH 169


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 42.1 bits (100), Expect = 8e-05
 Identities = 14/33 (42%), Positives = 20/33 (60%), Gaps = 2/33 (6%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRLYI--FK 197
            K   K +KKK+K++ K  K K K RL++  FK
Sbjct: 66  LKAWHKAQKKKEKQEAKAAKAKSKPRLFVLDFK 98



 Score = 26.7 bits (60), Expect = 9.1
 Identities = 7/18 (38%), Positives = 12/18 (66%)

Query: 166 RKKKKKKKKKKKKKKKKK 183
           +KK+K++ K  K K K +
Sbjct: 74  KKKEKQEAKAAKAKSKPR 91


>gnl|CDD|233035 TIGR00585, mutl, DNA mismatch repair protein MutL.  All proteins in
           this family for which the functions are known are
           involved in the process of generalized mismatch repair.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 312

 Score = 38.0 bits (89), Expect = 0.001
 Identities = 13/37 (35%), Positives = 16/37 (43%), Gaps = 1/37 (2%)

Query: 40  FTLSGFISSAIHGHGRSTADRQFYFVNSRPCEPSKVS 76
             L GFIS       R +   QF F+N RP E   + 
Sbjct: 235 LQLEGFISQPNVTRSRRSGW-QFLFINGRPVELKLLL 270


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 37.0 bits (86), Expect = 0.001
 Identities = 18/28 (64%), Positives = 22/28 (78%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            + K+KK+KKKKKKKKKKK  KK  KK+
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 37.0 bits (86), Expect = 0.002
 Identities = 19/28 (67%), Positives = 23/28 (82%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            +++KK+KKKKKKKKKKK  KK  KKKK
Sbjct: 108 KAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 35.8 bits (83), Expect = 0.003
 Identities = 18/25 (72%), Positives = 21/25 (84%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           K+K K+KK+KKKKKKKKKKK  KK 
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKA 130



 Score = 35.4 bits (82), Expect = 0.005
 Identities = 18/25 (72%), Positives = 21/25 (84%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
            K+K K+KK+KKKKKKKKKKK  KK
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKK 129



 Score = 35.0 bits (81), Expect = 0.007
 Identities = 17/26 (65%), Positives = 21/26 (80%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           +  K+K K+KK+KKKKKKKKKKK  K
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTSK 128



 Score = 34.3 bits (79), Expect = 0.015
 Identities = 17/25 (68%), Positives = 21/25 (84%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           K+  K+K K+KK+KKKKKKKKKKK 
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKT 126



 Score = 33.9 bits (78), Expect = 0.017
 Identities = 16/27 (59%), Positives = 22/27 (81%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           ++  K+K K+KK+KKKKKKKKKKK  +
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSK 128


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 36.4 bits (84), Expect = 0.002
 Identities = 16/32 (50%), Positives = 17/32 (53%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
             D      +K KKKKKK KK KK KK  KK 
Sbjct: 89  PDDHPPEPTEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 31.8 bits (72), Expect = 0.074
 Identities = 14/21 (66%), Positives = 15/21 (71%)

Query: 172 KKKKKKKKKKKKKKKKKKKKR 192
           K KKKKKK KK KK KK  K+
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKK 119



 Score = 30.6 bits (69), Expect = 0.24
 Identities = 13/21 (61%), Positives = 14/21 (66%)

Query: 173 KKKKKKKKKKKKKKKKKKKRL 193
           K KKKKKK KK KK KK  + 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKK 119



 Score = 27.9 bits (62), Expect = 1.7
 Identities = 10/24 (41%), Positives = 11/24 (45%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKK 182
                  +KK KK KK KK  KK 
Sbjct: 97  TEKPKKKKKKSKKTKKPKKSSKKD 120


>gnl|CDD|201604 pfam01119, DNA_mis_repair, DNA mismatch repair protein,
          C-terminal domain.  This family represents the
          C-terminal domain of the mutL/hexB/PMS1 family. This
          domain has a ribosomal S5 domain 2-like fold.
          Length = 119

 Score = 36.3 bits (85), Expect = 0.002
 Identities = 15/30 (50%), Positives = 16/30 (53%), Gaps = 2/30 (6%)

Query: 40 FTLSGFISSAIHGHGRSTADRQFYFVNSRP 69
            LSGFIS       RS  D Q+ FVN RP
Sbjct: 23 LRLSGFISKP--ELSRSNRDYQYLFVNGRP 50


>gnl|CDD|238202 cd00329, TopoII_MutL_Trans, MutL_Trans: transducer domain, having
          a ribosomal S5 domain 2-like fold, conserved in the
          C-terminal domain of type II DNA topoisomerases (Topo
          II) and DNA mismatch repair (MutL/MLH1/PMS2) proteins.
          This transducer domain is homologous to the second
          domain of the DNA gyrase B subunit, which is known to
          be important in nucleotide hydrolysis and the
          transduction of structural signals from ATP-binding
          site to the DNA breakage/reunion regions of the
          enzymes. The GyrB dimerizes in response to ATP binding,
          and is homologous to the N-terminal half of eukaryotic
          Topo II and the ATPase fragment of MutL. Type II DNA
          topoisomerases catalyze the ATP-dependent transport of
          one DNA duplex through another, in the process
          generating transient double strand breaks via covalent
          attachments to both DNA strands at the 5' positions.
          Included in this group are proteins similar to human
          MLH1 and PMS2.  MLH1 forms a heterodimer with PMS2
          which functions in meiosis and in DNA mismatch repair
          (MMR). Cells lacking either hMLH1 or hPMS2 have a
          strong mutator phenotype and display microsatellite
          instability (MSI). Mutation in hMLH1 accounts for a
          large fraction of Lynch syndrome (HNPCC) families.
          Length = 107

 Score = 35.7 bits (83), Expect = 0.003
 Identities = 16/35 (45%), Positives = 18/35 (51%), Gaps = 2/35 (5%)

Query: 40 FTLSGFISSAIHGHGRSTADRQFYFVNSRPCEPSK 74
          F + G IS      GRS+ DRQF FVN RP     
Sbjct: 27 FRVEGAISYP--DSGRSSKDRQFSFVNGRPVREGG 59


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 36.2 bits (84), Expect = 0.004
 Identities = 17/27 (62%), Positives = 22/27 (81%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           + KK+ K++KKK+KKKKKKK KK  KK
Sbjct: 83  TAKKRAKRQKKKQKKKKKKKAKKGNKK 109



 Score = 35.0 bits (81), Expect = 0.007
 Identities = 17/27 (62%), Positives = 23/27 (85%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKK 190
           T++K+ K++KKK+KKKKKKK KK  KK
Sbjct: 83  TAKKRAKRQKKKQKKKKKKKAKKGNKK 109



 Score = 33.9 bits (78), Expect = 0.018
 Identities = 16/25 (64%), Positives = 21/25 (84%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           +K  KK+ K++KKK+KKKKKKK KK
Sbjct: 81  EKTAKKRAKRQKKKQKKKKKKKAKK 105



 Score = 33.9 bits (78), Expect = 0.020
 Identities = 16/25 (64%), Positives = 20/25 (80%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           K  KK+ K++KKK+KKKKKKK KK 
Sbjct: 82  KTAKKRAKRQKKKQKKKKKKKAKKG 106



 Score = 33.9 bits (78), Expect = 0.022
 Identities = 15/24 (62%), Positives = 20/24 (83%)

Query: 169 KKKKKKKKKKKKKKKKKKKKKKKR 192
           KK+ K++KKK+KKKKKKK KK  +
Sbjct: 85  KKRAKRQKKKQKKKKKKKAKKGNK 108



 Score = 33.1 bits (76), Expect = 0.033
 Identities = 17/26 (65%), Positives = 22/26 (84%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           R K++KKK+KKKKKKK KK  KK++K
Sbjct: 87  RAKRQKKKQKKKKKKKAKKGNKKEEK 112



 Score = 31.2 bits (71), Expect = 0.15
 Identities = 16/27 (59%), Positives = 22/27 (81%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R+KKK+KKKKKKK KK  KK++K+  +
Sbjct: 90  RQKKKQKKKKKKKAKKGNKKEEKEGSK 116



 Score = 30.8 bits (70), Expect = 0.24
 Identities = 14/27 (51%), Positives = 23/27 (85%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +K+K ++K  KK+ K++KKK+KKKKK+
Sbjct: 75  KKRKDEEKTAKKRAKRQKKKQKKKKKK 101



 Score = 30.4 bits (69), Expect = 0.30
 Identities = 16/27 (59%), Positives = 22/27 (81%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           RK ++K  KK+ K++KKK+KKKKKKK 
Sbjct: 77  RKDEEKTAKKRAKRQKKKQKKKKKKKA 103



 Score = 30.4 bits (69), Expect = 0.32
 Identities = 16/26 (61%), Positives = 20/26 (76%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           KKK+KKKKKKK KK  KK++K+  K 
Sbjct: 92  KKKQKKKKKKKAKKGNKKEEKEGSKS 117



 Score = 28.9 bits (65), Expect = 0.95
 Identities = 15/33 (45%), Positives = 24/33 (72%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
            +KK+K ++K  KK+ K++KKK+KKK+    KK
Sbjct: 73  EEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKK 105



 Score = 28.9 bits (65), Expect = 1.1
 Identities = 15/26 (57%), Positives = 20/26 (76%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           +KK+KKKKKKK KK  KK++K+  K 
Sbjct: 92  KKKQKKKKKKKAKKGNKKEEKEGSKS 117



 Score = 28.1 bits (63), Expect = 2.1
 Identities = 14/26 (53%), Positives = 19/26 (73%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           +K+KKKKKKK KK  KK++K+  K  
Sbjct: 93  KKQKKKKKKKAKKGNKKEEKEGSKSS 118



 Score = 26.2 bits (58), Expect = 9.6
 Identities = 11/26 (42%), Positives = 22/26 (84%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           ++K+++KK+K ++K  KK+ K++KKK
Sbjct: 69  QQKREEKKRKDEEKTAKKRAKRQKKK 94


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 36.2 bits (84), Expect = 0.005
 Identities = 16/30 (53%), Positives = 20/30 (66%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
               ++KK KK+K  + K  KKKKKKKKKK
Sbjct: 164 KKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 34.3 bits (79), Expect = 0.023
 Identities = 16/34 (47%), Positives = 22/34 (64%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +  +    KK+KK KK+K  + K  KKKKKKKK+
Sbjct: 159 KKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192



 Score = 33.9 bits (78), Expect = 0.032
 Identities = 17/34 (50%), Positives = 24/34 (70%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
            +K+KKKKK+ KK+KK+KK KK+K  +     KK
Sbjct: 153 EKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKK 186



 Score = 33.5 bits (77), Expect = 0.033
 Identities = 16/29 (55%), Positives = 21/29 (72%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
             +K+KK KK+K  + K  KKKKKKKKK+
Sbjct: 165 KEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 33.5 bits (77), Expect = 0.035
 Identities = 16/29 (55%), Positives = 23/29 (79%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
              ++KK+KKKKK+ KK+KK+KK KK+K 
Sbjct: 149 VEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177



 Score = 32.0 bits (73), Expect = 0.12
 Identities = 15/28 (53%), Positives = 23/28 (82%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
              +++KK+KKKKK+ KK+KK+KK KK+
Sbjct: 148 EVEEEEKKEKKKKKEVKKEKKEKKDKKE 175



 Score = 30.5 bits (69), Expect = 0.41
 Identities = 13/30 (43%), Positives = 23/30 (76%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
               + ++++KK+KKKKK+ KK+KK+KK +
Sbjct: 144 EKEAEVEEEEKKEKKKKKEVKKEKKEKKDK 173



 Score = 28.5 bits (64), Expect = 1.5
 Identities = 12/28 (42%), Positives = 21/28 (75%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           TS K+   K +K+ + ++++KK+KKKKK
Sbjct: 134 TSEKETTAKVEKEAEVEEEEKKEKKKKK 161



 Score = 28.1 bits (63), Expect = 2.3
 Identities = 12/29 (41%), Positives = 25/29 (86%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           T++ +K+ + ++++KK+KKKKK+ KK+K+
Sbjct: 140 TAKVEKEAEVEEEEKKEKKKKKEVKKEKK 168



 Score = 28.1 bits (63), Expect = 2.3
 Identities = 12/29 (41%), Positives = 22/29 (75%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
              +K+ + ++++KK+KKKKK+ KK+KK 
Sbjct: 141 AKVEKEAEVEEEEKKEKKKKKEVKKEKKE 169



 Score = 28.1 bits (63), Expect = 2.7
 Identities = 10/33 (30%), Positives = 20/33 (60%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
                    K +K+ + ++++KK+KKKKK+ K+
Sbjct: 133 ETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKK 165



 Score = 27.8 bits (62), Expect = 2.9
 Identities = 11/38 (28%), Positives = 23/38 (60%)

Query: 156 ANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           +     +  +  K +K+ + ++++KK+KKKKK+ KK  
Sbjct: 130 SESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEK 167


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 37.0 bits (86), Expect = 0.005
 Identities = 17/39 (43%), Positives = 26/39 (66%), Gaps = 4/39 (10%)

Query: 161 GDGTSRKKKKKKKKKKKKKK----KKKKKKKKKKKRLYI 195
           GDG  ++KKK+K+K++   K    KK+K+K KK K + I
Sbjct: 623 GDGIKKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKNVII 661


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 36.4 bits (85), Expect = 0.006
 Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 156 ANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
               N     RKK +KK++K +KK +K++ +K   K
Sbjct: 401 GENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAK 436



 Score = 34.1 bits (79), Expect = 0.033
 Identities = 11/26 (42%), Positives = 19/26 (73%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           RKK++K +KK +K++ +K   KKK +
Sbjct: 415 RKKQRKAEKKAEKEEAEKAAAKKKAE 440



 Score = 32.6 bits (75), Expect = 0.13
 Identities = 11/35 (31%), Positives = 23/35 (65%)

Query: 157 NLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
              NG+ +  ++KK +KK++K +KK +K++ +K  
Sbjct: 400 EGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAA 434



 Score = 32.2 bits (74), Expect = 0.16
 Identities = 9/26 (34%), Positives = 19/26 (73%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
             ++KK +KK++K +KK +K++ +K 
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKA 433



 Score = 32.2 bits (74), Expect = 0.16
 Identities = 10/32 (31%), Positives = 20/32 (62%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
              S  ++KK +KK++K +KK +K++ +K   
Sbjct: 404 GNLSPAERKKLRKKQRKAEKKAEKEEAEKAAA 435



 Score = 32.2 bits (74), Expect = 0.16
 Identities = 9/28 (32%), Positives = 20/28 (71%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           + +KK +KK++K +KK +K++ +K   +
Sbjct: 409 AERKKLRKKQRKAEKKAEKEEAEKAAAK 436



 Score = 29.9 bits (68), Expect = 1.0
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           RK +KK +K++ +K   KKK +   K
Sbjct: 419 RKAEKKAEKEEAEKAAAKKKAEAAAK 444



 Score = 28.7 bits (65), Expect = 2.3
 Identities = 8/26 (30%), Positives = 14/26 (53%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           + +K++ +K   KKK +   KK K  
Sbjct: 424 KAEKEEAEKAAAKKKAEAAAKKAKGP 449



 Score = 28.4 bits (64), Expect = 2.9
 Identities = 8/26 (30%), Positives = 16/26 (61%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +K +KK +K++ +K   KKK +   +
Sbjct: 419 RKAEKKAEKEEAEKAAAKKKAEAAAK 444



 Score = 28.0 bits (63), Expect = 4.2
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +KK +K++ +K   KKK +   KK +
Sbjct: 422 EKKAEKEEAEKAAAKKKAEAAAKKAK 447



 Score = 27.2 bits (61), Expect = 6.7
 Identities = 8/25 (32%), Positives = 13/25 (52%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           +K++ +K   KKK +   KK K   
Sbjct: 426 EKEEAEKAAAKKKAEAAAKKAKGPD 450



 Score = 26.8 bits (60), Expect = 9.2
 Identities = 9/26 (34%), Positives = 11/26 (42%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
            K   KKK +   KK K    + KK 
Sbjct: 431 EKAAAKKKAEAAAKKAKGPDGETKKV 456


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
           Williams-Beuren syndrome.  This domain family is found
           in eukaryotes, and is typically between 72 and 83 amino
           acids in length. The family is found in association with
           pfam08241. This family is made up of
           S-adenosylmethionine-dependent methyltransferases. The
           proteins are deleted in Williams-Beuren syndrome (WBS),
           a complex developmental disorder with multisystemic
           manifestations including supravalvular aortic stenosis
           (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 34.2 bits (79), Expect = 0.006
 Identities = 14/38 (36%), Positives = 23/38 (60%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
            D      K ++  ++ ++KKKKKKK KKK + +I +K
Sbjct: 23  DDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRK 60



 Score = 30.7 bits (70), Expect = 0.13
 Identities = 14/34 (41%), Positives = 22/34 (64%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
                 S++ ++KKKKKKK KKK K+   +KK++
Sbjct: 30  SKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQ 63



 Score = 30.0 bits (68), Expect = 0.20
 Identities = 14/44 (31%), Positives = 22/44 (50%), Gaps = 4/44 (9%)

Query: 153 MMFANLRNGDGTSRKKKKKKKKKKKKKKKKKKKK----KKKKKR 192
                  +     R  ++ ++KKKKKKK KKK K    +KK++ 
Sbjct: 21  EEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQM 64



 Score = 28.4 bits (64), Expect = 0.65
 Identities = 14/35 (40%), Positives = 20/35 (57%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
                   ++KKKKKKK KKK K+   +KK++ R 
Sbjct: 32  VRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRR 66



 Score = 26.1 bits (58), Expect = 4.7
 Identities = 16/40 (40%), Positives = 23/40 (57%), Gaps = 6/40 (15%)

Query: 159 RNGDGTSRKKKKKKKKKKK------KKKKKKKKKKKKKKR 192
           R      RKKKKKKK KKK      +KK++ +++ K+ K 
Sbjct: 34  RISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRRGKEVKP 73


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 36.4 bits (85), Expect = 0.007
 Identities = 14/32 (43%), Positives = 22/32 (68%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           G   + KK KK  +K +KK++++KK+KKKK  
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAF 435



 Score = 35.3 bits (82), Expect = 0.015
 Identities = 14/35 (40%), Positives = 22/35 (62%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           T  KK  KK KK  +K +KK++++KK+K+   F  
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAG 437



 Score = 33.0 bits (76), Expect = 0.089
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            K +KK++++KK+KKKK    KKK++ 
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEE 443



 Score = 33.0 bits (76), Expect = 0.091
 Identities = 12/27 (44%), Positives = 19/27 (70%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
              +K +KK++++KK+KKKK    KKK
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKK 440



 Score = 32.2 bits (74), Expect = 0.15
 Identities = 11/27 (40%), Positives = 22/27 (81%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R+++KK+KKKK    KKK+++++++K 
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKE 449



 Score = 32.2 bits (74), Expect = 0.16
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            +K +KK++++KK+KKKK    KKK+ 
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEE 442



 Score = 31.8 bits (73), Expect = 0.19
 Identities = 11/26 (42%), Positives = 21/26 (80%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
            KK++++KK+KKKK    KKK+++++
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEE 445



 Score = 31.8 bits (73), Expect = 0.19
 Identities = 11/26 (42%), Positives = 19/26 (73%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           K  +K +KK++++KK+KKKK    K+
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKK 439



 Score = 31.8 bits (73), Expect = 0.20
 Identities = 11/28 (39%), Positives = 20/28 (71%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            +  +K +KK++++KK+KKKK    KK+
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKK 440



 Score = 31.0 bits (71), Expect = 0.34
 Identities = 11/26 (42%), Positives = 21/26 (80%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
            +KK++++KK+KKKK    KKK++++
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEE 444



 Score = 31.0 bits (71), Expect = 0.37
 Identities = 10/26 (38%), Positives = 22/26 (84%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           +K++++KK+KKKK    KKK++++++
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEE 446



 Score = 30.7 bits (70), Expect = 0.48
 Identities = 11/29 (37%), Positives = 22/29 (75%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
             +K+KKKK    KKK+++++++K+KK+ 
Sbjct: 425 EEKKEKKKKAFAGKKKEEEEEEEKEKKEE 453



 Score = 30.3 bits (69), Expect = 0.58
 Identities = 9/26 (34%), Positives = 22/26 (84%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           +++++KK+KKKK    KKK+++++++
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEE 447



 Score = 29.5 bits (67), Expect = 1.0
 Identities = 10/27 (37%), Positives = 22/27 (81%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           ++KKKK    KKK+++++++K+KK++ 
Sbjct: 428 KEKKKKAFAGKKKEEEEEEEKEKKEEE 454



 Score = 29.5 bits (67), Expect = 1.3
 Identities = 10/34 (29%), Positives = 24/34 (70%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +  +    KKKK    KKK+++++++K+KK++++
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455



 Score = 28.3 bits (64), Expect = 3.1
 Identities = 9/28 (32%), Positives = 22/28 (78%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
             +KK    KKK+++++++K+KK+++K+
Sbjct: 429 EKKKKAFAGKKKEEEEEEEKEKKEEEKE 456


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.  This
           presumed domain is found at the N-terminus of Ribosomal
           L30 proteins and has been termed RL30NT or NUC018.
          Length = 71

 Score = 33.7 bits (78), Expect = 0.007
 Identities = 14/33 (42%), Positives = 23/33 (69%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           ++K+ +K + K+ KK+  KK  +KKKR  IFK+
Sbjct: 5   KRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKR 37



 Score = 26.8 bits (60), Expect = 1.9
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKK 190
           R KK+  KK  +KKK+K   K+ +K
Sbjct: 16  RAKKRAAKKAARKKKRKLIFKRAEK 40



 Score = 26.4 bits (59), Expect = 3.4
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           ++ KK+  KK  +KKK+K   K+ +K
Sbjct: 15  KRAKKRAAKKAARKKKRKLIFKRAEK 40


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 36.3 bits (85), Expect = 0.009
 Identities = 7/32 (21%), Positives = 12/32 (37%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           +       K    KK   K  +K K+ Y+  +
Sbjct: 822 RYLADAPAKDPAGKKAAVKFSRKTKQQYVASE 853



 Score = 35.9 bits (84), Expect = 0.009
 Identities = 7/31 (22%), Positives = 13/31 (41%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            D  ++    KK   K  +K K++    +K 
Sbjct: 825 ADAPAKDPAGKKAAVKFSRKTKQQYVASEKD 855



 Score = 34.4 bits (80), Expect = 0.034
 Identities = 7/25 (28%), Positives = 11/25 (44%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
             KK   K  +K K++    +K  K
Sbjct: 833 AGKKAAVKFSRKTKQQYVASEKDGK 857



 Score = 34.4 bits (80), Expect = 0.035
 Identities = 7/25 (28%), Positives = 11/25 (44%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
            KK   K  +K K++    +K  K 
Sbjct: 834 GKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 34.0 bits (79), Expect = 0.049
 Identities = 7/28 (25%), Positives = 12/28 (42%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKK 189
           D   +K   K  +K K++    +K  K 
Sbjct: 831 DPAGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 33.6 bits (78), Expect = 0.060
 Identities = 6/37 (16%), Positives = 10/37 (27%)

Query: 155 FANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
               +            K    KK   K  +K K++ 
Sbjct: 813 LLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQY 849



 Score = 32.8 bits (76), Expect = 0.11
 Identities = 6/26 (23%), Positives = 11/26 (42%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
              KK   K  +K K++    +K  +
Sbjct: 832 PAGKKAAVKFSRKTKQQYVASEKDGK 857



 Score = 32.5 bits (75), Expect = 0.13
 Identities = 6/27 (22%), Positives = 9/27 (33%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           + +K +       K    KK   K  R
Sbjct: 817 KPEKLRYLADAPAKDPAGKKAAVKFSR 843


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 35.9 bits (83), Expect = 0.011
 Identities = 12/32 (37%), Positives = 17/32 (53%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           D   R +KK K K  KK  +K KK   K +++
Sbjct: 766 DSAFRGEKKAKPKAAKKDARKAKKPSAKTQKI 797



 Score = 30.9 bits (70), Expect = 0.45
 Identities = 12/36 (33%), Positives = 14/36 (38%), Gaps = 2/36 (5%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKK--KKKKKR 192
           R       K  KK  +K KK   K +K     K KR
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKR 805



 Score = 29.3 bits (66), Expect = 1.7
 Identities = 13/43 (30%), Positives = 18/43 (41%), Gaps = 5/43 (11%)

Query: 155 FANLRNGDGTSRKKKKKKKKKKKKKKKK-----KKKKKKKKKR 192
           F   +     + KK  +K KK   K +K     K K+  KKK 
Sbjct: 769 FRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKV 811


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 34.6 bits (80), Expect = 0.019
 Identities = 12/27 (44%), Positives = 19/27 (70%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           K K KK  K K ++K+K K++K++K L
Sbjct: 55  KAKAKKALKAKIEEKEKAKREKEEKGL 81



 Score = 33.1 bits (76), Expect = 0.057
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRLY 194
           K K KK  K K ++K+K K++K+++  
Sbjct: 55  KAKAKKALKAKIEEKEKAKREKEEKGL 81



 Score = 32.3 bits (74), Expect = 0.11
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           K KK  K K ++K+K K++K++K  R
Sbjct: 57  KAKKALKAKIEEKEKAKREKEEKGLR 82



 Score = 31.9 bits (73), Expect = 0.13
 Identities = 12/27 (44%), Positives = 15/27 (55%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           K K   K K KK  K K ++K+K KR 
Sbjct: 49  KAKVAAKAKAKKALKAKIEEKEKAKRE 75



 Score = 31.9 bits (73), Expect = 0.17
 Identities = 10/28 (35%), Positives = 19/28 (67%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           + KK  K K ++K+K K++K++K  + L
Sbjct: 57  KAKKALKAKIEEKEKAKREKEEKGLREL 84



 Score = 31.6 bits (72), Expect = 0.21
 Identities = 10/31 (32%), Positives = 17/31 (54%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +    K   K K KK  K K ++K+K K+++
Sbjct: 46  EEEKAKVAAKAKAKKALKAKIEEKEKAKREK 76



 Score = 30.8 bits (70), Expect = 0.40
 Identities = 9/29 (31%), Positives = 18/29 (62%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRLYI 195
           KK  K K ++K+K K++K++K  ++    
Sbjct: 59  KKALKAKIEEKEKAKREKEEKGLRELEED 87



 Score = 30.4 bits (69), Expect = 0.45
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           +++K K   K K KK  K K ++K++ 
Sbjct: 46  EEEKAKVAAKAKAKKALKAKIEEKEKA 72



 Score = 30.4 bits (69), Expect = 0.51
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +K+++K K   K K KK  K K +++
Sbjct: 44  EKEEEKAKVAAKAKAKKALKAKIEEK 69



 Score = 30.0 bits (68), Expect = 0.59
 Identities = 10/26 (38%), Positives = 15/26 (57%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
              K K KK  K K ++K+K K++K 
Sbjct: 52  VAAKAKAKKALKAKIEEKEKAKREKE 77



 Score = 29.6 bits (67), Expect = 0.76
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
             K+++K K   K K KK  K K ++K
Sbjct: 43  EEKEEEKAKVAAKAKAKKALKAKIEEK 69



 Score = 29.6 bits (67), Expect = 0.77
 Identities = 10/28 (35%), Positives = 14/28 (50%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
              +K K   K K KK  K K ++K+K 
Sbjct: 45  KEEEKAKVAAKAKAKKALKAKIEEKEKA 72



 Score = 29.6 bits (67), Expect = 0.93
 Identities = 11/24 (45%), Positives = 13/24 (54%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKK 189
             +K K +K  K  KKKK K K K
Sbjct: 191 INEKLKAEKAAKGGKKKKGKAKAK 214



 Score = 29.6 bits (67), Expect = 0.98
 Identities = 11/24 (45%), Positives = 14/24 (58%)

Query: 170 KKKKKKKKKKKKKKKKKKKKKKRL 193
            +K K +K  K  KKKK K K +L
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKL 215



 Score = 29.2 bits (66), Expect = 1.0
 Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKK 190
            +K K +K  K  KKKK K K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAK 214



 Score = 29.2 bits (66), Expect = 1.0
 Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 169 KKKKKKKKKKKKKKKKKKKKKKK 191
            +K K +K  K  KKKK K K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAK 214



 Score = 29.2 bits (66), Expect = 1.0
 Identities = 9/32 (28%), Positives = 11/32 (34%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRLYIFKKH 199
           K  K  KKKK K K K            +  +
Sbjct: 199 KAAKGGKKKKGKAKAKLNVGGANDDDDDYDGY 230



 Score = 28.5 bits (64), Expect = 2.2
 Identities = 9/25 (36%), Positives = 15/25 (60%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           + ++K+++K K   K K KK  K K
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALKAK 65



 Score = 28.5 bits (64), Expect = 2.2
 Identities = 9/25 (36%), Positives = 9/25 (36%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           K  K  KKKK K K K         
Sbjct: 199 KAAKGGKKKKGKAKAKLNVGGANDD 223



 Score = 28.5 bits (64), Expect = 2.3
 Identities = 9/29 (31%), Positives = 10/29 (34%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLY 194
           +  K  KKKK K K K            Y
Sbjct: 199 KAAKGGKKKKGKAKAKLNVGGANDDDDDY 227



 Score = 27.7 bits (62), Expect = 3.6
 Identities = 10/24 (41%), Positives = 11/24 (45%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKK 191
           K +K  K  KKKK K K K     
Sbjct: 196 KAEKAAKGGKKKKGKAKAKLNVGG 219



 Score = 27.3 bits (61), Expect = 5.8
 Identities = 8/26 (30%), Positives = 16/26 (61%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRL 193
           + ++K+++K K   K K KK  K ++
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALKAKI 66


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
           non-specific RNA binding region part 2.  This is a
           region found N terminal to the catalytic domain of
           glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
           but not in Escherichia coli. This region is thought to
           bind RNA in a non-specific manner, enhancing
           interactions between the tRNA and enzyme, but is not
           essential for enzyme function.
          Length = 83

 Score = 32.7 bits (75), Expect = 0.021
 Identities = 14/28 (50%), Positives = 15/28 (53%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           T     KKKKKKKKKK +      K KK
Sbjct: 18  TEADLVKKKKKKKKKKAEDTAATAKAKK 45



 Score = 31.6 bits (72), Expect = 0.064
 Identities = 13/25 (52%), Positives = 14/25 (56%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
            KKKKKKKKKK +      K KK  
Sbjct: 23  VKKKKKKKKKKAEDTAATAKAKKAT 47



 Score = 31.2 bits (71), Expect = 0.068
 Identities = 13/25 (52%), Positives = 14/25 (56%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
             KKKKKKKKKK +      K KK 
Sbjct: 22  LVKKKKKKKKKKAEDTAATAKAKKA 46



 Score = 27.7 bits (62), Expect = 1.5
 Identities = 11/25 (44%), Positives = 13/25 (52%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           K +    KKKKKKKKKK +      
Sbjct: 17  KTEADLVKKKKKKKKKKAEDTAATA 41



 Score = 26.9 bits (60), Expect = 2.3
 Identities = 12/27 (44%), Positives = 14/27 (51%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKK 188
           D   +KKKKKKKK +      K KK  
Sbjct: 21  DLVKKKKKKKKKKAEDTAATAKAKKAT 47


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 35.0 bits (80), Expect = 0.021
 Identities = 19/38 (50%), Positives = 27/38 (71%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFK 197
            GD  + +KK KK KKK+KK+K+K++ K KKK +  FK
Sbjct: 189 KGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226



 Score = 33.5 bits (76), Expect = 0.065
 Identities = 13/32 (40%), Positives = 23/32 (71%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIF 196
           S+K KKK+KK+K+K++ K KKK+ +  +  + 
Sbjct: 199 SKKPKKKEKKEKEKERDKDKKKEVEGFKSLLL 230



 Score = 32.0 bits (72), Expect = 0.19
 Identities = 8/27 (29%), Positives = 21/27 (77%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R +++ +K K++KKK++K+K+ +++  
Sbjct: 89  RHRQRLEKDKREKKKREKEKRGRRRHH 115



 Score = 32.0 bits (72), Expect = 0.22
 Identities = 15/31 (48%), Positives = 22/31 (70%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           + T +  K KKKK++K+K++KKKKKK    R
Sbjct: 272 EETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302



 Score = 31.2 bits (70), Expect = 0.33
 Identities = 15/34 (44%), Positives = 23/34 (67%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           ++ +    KK  K KKKK++K+K++KKKKKK   
Sbjct: 267 KDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHH 300



 Score = 30.8 bits (69), Expect = 0.52
 Identities = 8/27 (29%), Positives = 22/27 (81%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R+ +++ +K K++KKK++K+K+ +++ 
Sbjct: 88  RRHRQRLEKDKREKKKREKEKRGRRRH 114



 Score = 30.4 bits (68), Expect = 0.64
 Identities = 16/34 (47%), Positives = 23/34 (67%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
              D  + + KK  K KKKK++K+K++KKKKKK 
Sbjct: 265 EPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKH 298



 Score = 30.4 bits (68), Expect = 0.72
 Identities = 14/32 (43%), Positives = 20/32 (62%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKK 190
                 S K KKKK++K+K++KKKKKK    +
Sbjct: 271 AEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302



 Score = 29.6 bits (66), Expect = 1.3
 Identities = 8/27 (29%), Positives = 21/27 (77%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           +++ +K K++KKK++K+K+ +++   L
Sbjct: 91  RQRLEKDKREKKKREKEKRGRRRHHSL 117



 Score = 29.3 bits (65), Expect = 1.5
 Identities = 8/26 (30%), Positives = 22/26 (84%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +++ +++ +K K++KKK++K+K+ +R
Sbjct: 87  ERRHRQRLEKDKREKKKREKEKRGRR 112



 Score = 28.1 bits (62), Expect = 4.0
 Identities = 12/28 (42%), Positives = 18/28 (64%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
             KKKK++K+K++KKKKKK    +    
Sbjct: 279 KHKKKKQRKEKEEKKKKKKHHHHRCHHS 306



 Score = 27.7 bits (61), Expect = 4.1
 Identities = 8/26 (30%), Positives = 22/26 (84%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           ++++ +++ +K K++KKK++K+K+ R
Sbjct: 86  EERRHRQRLEKDKREKKKREKEKRGR 111



 Score = 27.7 bits (61), Expect = 4.3
 Identities = 8/25 (32%), Positives = 22/25 (88%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKR 192
           +++++ +++ +K K++KKK++K+KR
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKR 109



 Score = 27.7 bits (61), Expect = 5.0
 Identities = 10/34 (29%), Positives = 18/34 (52%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +      +KK++K+K++KKKKKK    +      
Sbjct: 275 KKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDG 308



 Score = 27.3 bits (60), Expect = 7.1
 Identities = 12/30 (40%), Positives = 21/30 (70%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYI 195
           +KK+KK+K+K++ K KKK+ +  K   L +
Sbjct: 203 KKKEKKEKEKERDKDKKKEVEGFKSLLLAL 232



 Score = 27.3 bits (60), Expect = 7.1
 Identities = 7/25 (28%), Positives = 22/25 (88%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           +++++ +++ +K K++KKK++K+K+
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKR 109



 Score = 27.0 bits (59), Expect = 7.8
 Identities = 8/34 (23%), Positives = 25/34 (73%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKKHT 200
           K +++++ +++ +K K++KKK++K++    + H+
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHS 116


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 34.4 bits (79), Expect = 0.038
 Identities = 15/33 (45%), Positives = 16/33 (48%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYI 195
             S K K  K  KKK K K KK  KK+   LY 
Sbjct: 757 SASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYT 789



 Score = 31.3 bits (71), Expect = 0.30
 Identities = 13/25 (52%), Positives = 13/25 (52%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           KK K KKK    K K  K  KKK K
Sbjct: 749 KKTKAKKKSASTKGKAAKTVKKKSK 773



 Score = 31.3 bits (71), Expect = 0.32
 Identities = 13/29 (44%), Positives = 15/29 (51%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           T  +KK K KKK    K K  K  KKK +
Sbjct: 745 TPYEKKTKAKKKSASTKGKAAKTVKKKSK 773



 Score = 31.3 bits (71), Expect = 0.34
 Identities = 14/28 (50%), Positives = 14/28 (50%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           T  KKK    K K  K  KKK K K KK
Sbjct: 751 TKAKKKSASTKGKAAKTVKKKSKAKSKK 778



 Score = 30.9 bits (70), Expect = 0.47
 Identities = 13/33 (39%), Positives = 15/33 (45%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           +K K KKK    K K  K  KKK K +     K
Sbjct: 749 KKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTK 781



 Score = 30.6 bits (69), Expect = 0.66
 Identities = 12/28 (42%), Positives = 14/28 (50%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKK 190
             ++KK    K K  K  KKK K K KK
Sbjct: 751 TKAKKKSASTKGKAAKTVKKKSKAKSKK 778



 Score = 29.4 bits (66), Expect = 1.5
 Identities = 12/30 (40%), Positives = 15/30 (50%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           GT +   +KK K KKK    K K  K  K+
Sbjct: 741 GTPKTPYEKKTKAKKKSASTKGKAAKTVKK 770



 Score = 28.2 bits (63), Expect = 3.3
 Identities = 13/32 (40%), Positives = 15/32 (46%)

Query: 169 KKKKKKKKKKKKKKKKKKKKKKKRLYIFKKHT 200
           +KK K KKK    K K  K  KK+     K T
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKT 779


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 32.6 bits (75), Expect = 0.047
 Identities = 9/27 (33%), Positives = 22/27 (81%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R K++ +K+K+ +K ++KK K+++K++
Sbjct: 102 RTKERAEKEKRTRKNREKKFKRRQKEK 128



 Score = 31.9 bits (73), Expect = 0.077
 Identities = 9/27 (33%), Positives = 21/27 (77%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R ++++ K++ +K+K+ +K ++KK KR
Sbjct: 97  RLRRERTKERAEKEKRTRKNREKKFKR 123



 Score = 31.5 bits (72), Expect = 0.14
 Identities = 8/26 (30%), Positives = 21/26 (80%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           R+++ K++ +K+K+ +K ++KK K++
Sbjct: 99  RRERTKERAEKEKRTRKNREKKFKRR 124



 Score = 31.1 bits (71), Expect = 0.14
 Identities = 10/36 (27%), Positives = 26/36 (72%)

Query: 157 NLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            LR     ++++ +K+K+ +K ++KK K+++K+K++
Sbjct: 95  ALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 29.9 bits (68), Expect = 0.35
 Identities = 8/31 (25%), Positives = 20/31 (64%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           KK++   + ++++ K++ +K+K+ R    KK
Sbjct: 90  KKRRIALRLRRERTKERAEKEKRTRKNREKK 120



 Score = 28.0 bits (63), Expect = 2.1
 Identities = 5/29 (17%), Positives = 20/29 (68%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLY 194
           R   + ++++ K++ +K+K+ +K +++ +
Sbjct: 93  RIALRLRRERTKERAEKEKRTRKNREKKF 121


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 34.1 bits (78), Expect = 0.051
 Identities = 16/24 (66%), Positives = 20/24 (83%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKK 188
           SRKKKKKKK K+KKKKK ++  K+
Sbjct: 683 SRKKKKKKKSKEKKKKKNREASKQ 706



 Score = 32.9 bits (75), Expect = 0.098
 Identities = 14/25 (56%), Positives = 21/25 (84%)

Query: 170 KKKKKKKKKKKKKKKKKKKKKKRLY 194
           +KKKKKKK K+KKKKK ++  K+++
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIH 708



 Score = 31.4 bits (71), Expect = 0.34
 Identities = 14/25 (56%), Positives = 19/25 (76%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKK 190
             +KKKKKKK K+KKKKK ++  K+
Sbjct: 682 GSRKKKKKKKSKEKKKKKNREASKQ 706



 Score = 31.0 bits (70), Expect = 0.48
 Identities = 14/23 (60%), Positives = 19/23 (82%)

Query: 169 KKKKKKKKKKKKKKKKKKKKKKK 191
           +KKKKKKK K+KKKKK ++  K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706



 Score = 30.2 bits (68), Expect = 0.77
 Identities = 14/25 (56%), Positives = 20/25 (80%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKK 187
           G+ +KKKKKK K+KKKKK ++  K+
Sbjct: 682 GSRKKKKKKKSKEKKKKKNREASKQ 706


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 32.7 bits (75), Expect = 0.063
 Identities = 16/27 (59%), Positives = 18/27 (66%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            ++K K KKKK KKKK K K KK  KK
Sbjct: 80  EKQKWKWKKKKSKKKKDKDKDKKDDKK 106



 Score = 32.4 bits (74), Expect = 0.096
 Identities = 14/28 (50%), Positives = 18/28 (64%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           S+KKK K K KK  KK  K +KK +K+ 
Sbjct: 91  SKKKKDKDKDKKDDKKDDKSEKKDEKEA 118



 Score = 32.4 bits (74), Expect = 0.097
 Identities = 16/26 (61%), Positives = 16/26 (61%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
            K KKKK KKKK K K KK  KK  K
Sbjct: 84  WKWKKKKSKKKKDKDKDKKDDKKDDK 109



 Score = 31.6 bits (72), Expect = 0.16
 Identities = 16/26 (61%), Positives = 17/26 (65%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           +K K KKKK KKKK K K KK  KK 
Sbjct: 82  QKWKWKKKKSKKKKDKDKDKKDDKKD 107



 Score = 31.2 bits (71), Expect = 0.24
 Identities = 14/27 (51%), Positives = 16/27 (59%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
             KKKK K K KK  KK  K +KK +K
Sbjct: 90  KSKKKKDKDKDKKDDKKDDKSEKKDEK 116



 Score = 30.8 bits (70), Expect = 0.29
 Identities = 15/26 (57%), Positives = 17/26 (65%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           +KKK KKKK K K KK  KK  K +K
Sbjct: 87  KKKKSKKKKDKDKDKKDDKKDDKSEK 112



 Score = 30.8 bits (70), Expect = 0.30
 Identities = 14/26 (53%), Positives = 15/26 (57%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           K KKKK K K KK  KK  K +KK  
Sbjct: 90  KSKKKKDKDKDKKDDKKDDKSEKKDE 115



 Score = 30.8 bits (70), Expect = 0.30
 Identities = 15/26 (57%), Positives = 18/26 (69%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +K+K K KKKK KKKK K K KK  +
Sbjct: 80  EKQKWKWKKKKSKKKKDKDKDKKDDK 105



 Score = 30.4 bits (69), Expect = 0.35
 Identities = 15/26 (57%), Positives = 16/26 (61%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           + KKKK KKKK K K KK  KK  K 
Sbjct: 85  KWKKKKSKKKKDKDKDKKDDKKDDKS 110



 Score = 30.4 bits (69), Expect = 0.39
 Identities = 15/26 (57%), Positives = 16/26 (61%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           KK KKKK K K KK  KK  K +KK 
Sbjct: 89  KKSKKKKDKDKDKKDDKKDDKSEKKD 114



 Score = 28.9 bits (65), Expect = 1.2
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           K KK  KK  K +KK +K+ + K + L
Sbjct: 99  KDKKDDKKDDKSEKKDEKEAEDKLEDL 125



 Score = 28.5 bits (64), Expect = 1.8
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRLY 194
            K K KK  KK  K +KK +K+ + +L 
Sbjct: 96  DKDKDKKDDKKDDKSEKKDEKEAEDKLE 123



 Score = 28.5 bits (64), Expect = 1.9
 Identities = 11/35 (31%), Positives = 16/35 (45%)

Query: 157 NLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
             +       K K KK  KK  K +KK +K+ + K
Sbjct: 87  KKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDK 121



 Score = 27.7 bits (62), Expect = 3.2
 Identities = 15/25 (60%), Positives = 19/25 (76%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           KK+ ++K+K K KKKK KKKK K K
Sbjct: 75  KKEYEEKQKWKWKKKKSKKKKDKDK 99



 Score = 27.7 bits (62), Expect = 3.6
 Identities = 14/27 (51%), Positives = 19/27 (70%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            +K+ ++K+K K KKKK KKKK K K 
Sbjct: 74  VKKEYEEKQKWKWKKKKSKKKKDKDKD 100



 Score = 26.6 bits (59), Expect = 7.1
 Identities = 9/27 (33%), Positives = 15/27 (55%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            + KK  KK  K +KK +K+ + K + 
Sbjct: 98  DKDKKDDKKDDKSEKKDEKEAEDKLED 124


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 32.9 bits (75), Expect = 0.067
 Identities = 15/31 (48%), Positives = 20/31 (64%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKRLY 194
           T+ +KKKK K KK K K KK + KKK +  +
Sbjct: 87  TAVEKKKKGKSKKNKLKGKKDEDKKKAREGW 117



 Score = 32.1 bits (73), Expect = 0.12
 Identities = 15/30 (50%), Positives = 18/30 (60%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRLYIF 196
           KKKK K KK K K KK + KKK ++    F
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAREGWNNF 120


>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16.  Nop16 is a
           protein involved in ribosome biogenesis.
          Length = 173

 Score = 32.8 bits (75), Expect = 0.067
 Identities = 12/27 (44%), Positives = 17/27 (62%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            RKKKK +    K  +K+ K+K +KKK
Sbjct: 1   VRKKKKNRSSNYKVNRKRLKRKDRKKK 27



 Score = 30.9 bits (70), Expect = 0.26
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           +KKKK +    K  +K+ K+K +KK++
Sbjct: 2   RKKKKNRSSNYKVNRKRLKRKDRKKKI 28



 Score = 30.1 bits (68), Expect = 0.46
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           +KKK +    K  +K+ K+K +KKK 
Sbjct: 3   KKKKNRSSNYKVNRKRLKRKDRKKKI 28



 Score = 29.3 bits (66), Expect = 0.81
 Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           +KK +    K  +K+ K+K +KKK   +
Sbjct: 4   KKKNRSSNYKVNRKRLKRKDRKKKINII 31



 Score = 26.2 bits (58), Expect = 9.4
 Identities = 8/27 (29%), Positives = 13/27 (48%)

Query: 174 KKKKKKKKKKKKKKKKKKRLYIFKKHT 200
           +KKKK +    K  +K+ +    KK  
Sbjct: 2   RKKKKNRSSNYKVNRKRLKRKDRKKKI 28


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 33.0 bits (76), Expect = 0.068
 Identities = 7/27 (25%), Positives = 18/27 (66%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R +++ KK++   +K+K  ++++K K 
Sbjct: 114 RAEQQAKKREAAGEKEKAPRRERKPKP 140



 Score = 31.8 bits (73), Expect = 0.14
 Identities = 8/27 (29%), Positives = 17/27 (62%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           + KK++   +K+K  ++++K K K  R
Sbjct: 118 QAKKREAAGEKEKAPRRERKPKPKAPR 144



 Score = 31.8 bits (73), Expect = 0.16
 Identities = 7/26 (26%), Positives = 17/26 (65%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           KK++   +K+K  ++++K K K  ++
Sbjct: 120 KKREAAGEKEKAPRRERKPKPKAPRK 145



 Score = 31.4 bits (72), Expect = 0.21
 Identities = 5/27 (18%), Positives = 19/27 (70%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           ++ +++ KK++   +K+K  ++++K +
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPK 139



 Score = 31.4 bits (72), Expect = 0.22
 Identities = 11/36 (30%), Positives = 19/36 (52%)

Query: 156 ANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           A  R   G   K  ++++K K K  +KK+K + +K 
Sbjct: 119 AKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154



 Score = 31.0 bits (71), Expect = 0.24
 Identities = 8/26 (30%), Positives = 18/26 (69%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           K+K  ++++K K K  +KK+K + ++
Sbjct: 128 KEKAPRRERKPKPKAPRKKRKPRAQK 153



 Score = 31.0 bits (71), Expect = 0.28
 Identities = 8/27 (29%), Positives = 18/27 (66%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R+   +K+K  ++++K K K  +KK++
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRK 148



 Score = 31.0 bits (71), Expect = 0.29
 Identities = 7/26 (26%), Positives = 17/26 (65%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           K++   +K+K  ++++K K K  +K+
Sbjct: 121 KREAAGEKEKAPRRERKPKPKAPRKK 146



 Score = 30.7 bits (70), Expect = 0.32
 Identities = 8/31 (25%), Positives = 18/31 (58%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
               ++K  ++++K K K  +KK+K + +K 
Sbjct: 124 AAGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154



 Score = 29.5 bits (67), Expect = 0.99
 Identities = 6/26 (23%), Positives = 18/26 (69%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +++ KK++   +K+K  ++++K K +
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPK 141



 Score = 29.1 bits (66), Expect = 1.2
 Identities = 5/26 (19%), Positives = 18/26 (69%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           + ++ +++ KK++   +K+K  +++R
Sbjct: 111 QAQRAEQQAKKREAAGEKEKAPRRER 136



 Score = 28.4 bits (64), Expect = 2.1
 Identities = 7/26 (26%), Positives = 17/26 (65%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           +  ++++K K K  +KK+K + +K +
Sbjct: 130 KAPRRERKPKPKAPRKKRKPRAQKPE 155



 Score = 28.0 bits (63), Expect = 2.5
 Identities = 7/27 (25%), Positives = 17/27 (62%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            ++++K K K  +KK+K + +K + + 
Sbjct: 132 PRRERKPKPKAPRKKRKPRAQKPEPQH 158



 Score = 28.0 bits (63), Expect = 3.2
 Identities = 7/27 (25%), Positives = 17/27 (62%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
             ++++K K K  +KK+K + +K + +
Sbjct: 131 APRRERKPKPKAPRKKRKPRAQKPEPQ 157



 Score = 27.6 bits (62), Expect = 4.0
 Identities = 8/31 (25%), Positives = 16/31 (51%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
               R++K K K  +KK+K + +K + +   
Sbjct: 130 KAPRRERKPKPKAPRKKRKPRAQKPEPQHTP 160



 Score = 27.6 bits (62), Expect = 4.3
 Identities = 4/27 (14%), Positives = 17/27 (62%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           + + ++ +++ KK++   +K+K  ++ 
Sbjct: 109 KVQAQRAEQQAKKREAAGEKEKAPRRE 135



 Score = 27.6 bits (62), Expect = 4.3
 Identities = 6/28 (21%), Positives = 18/28 (64%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           ++ K + ++ +++ KK++   +K+K  R
Sbjct: 106 AKAKVQAQRAEQQAKKREAAGEKEKAPR 133


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 33.1 bits (76), Expect = 0.075
 Identities = 13/35 (37%), Positives = 15/35 (42%)

Query: 156 ANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKK 190
           A  +       KKKK K+ K  K  KK   K  KK
Sbjct: 248 AEKKRKSKEEIKKKKPKESKGVKALKKVVAKGMKK 282



 Score = 31.9 bits (73), Expect = 0.19
 Identities = 13/28 (46%), Positives = 18/28 (64%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            + KK+K K++ KKKK K+ K  K  KK
Sbjct: 247 RAEKKRKSKEEIKKKKPKESKGVKALKK 274



 Score = 31.9 bits (73), Expect = 0.19
 Identities = 11/30 (36%), Positives = 16/30 (53%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           +  + KKKK K+ K  K  KK   K  K++
Sbjct: 254 SKEEIKKKKPKESKGVKALKKVVAKGMKKI 283



 Score = 31.5 bits (72), Expect = 0.20
 Identities = 12/25 (48%), Positives = 16/25 (64%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           K+K K++ KKKK K+ K  K  KK 
Sbjct: 251 KRKSKEEIKKKKPKESKGVKALKKV 275



 Score = 31.1 bits (71), Expect = 0.26
 Identities = 13/27 (48%), Positives = 18/27 (66%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           SR +KK+K K++ KKKK K+ K  K  
Sbjct: 246 SRAEKKRKSKEEIKKKKPKESKGVKAL 272



 Score = 30.8 bits (70), Expect = 0.41
 Identities = 11/26 (42%), Positives = 15/26 (57%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           + K++ KKKK K+ K  K  KK   K
Sbjct: 253 KSKEEIKKKKPKESKGVKALKKVVAK 278



 Score = 28.8 bits (65), Expect = 1.6
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRL 193
            + +KK+K K++ KKKK K+ K  K L
Sbjct: 246 SRAEKKRKSKEEIKKKKPKESKGVKAL 272



 Score = 27.7 bits (62), Expect = 4.0
 Identities = 10/28 (35%), Positives = 19/28 (67%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           + + + +KK+K K++ KKKK K+ K  +
Sbjct: 243 AAESRAEKKRKSKEEIKKKKPKESKGVK 270


>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.  This
           domain is the C-terminal end of ribosomal L18/L5
           proteins.
          Length = 93

 Score = 31.4 bits (72), Expect = 0.076
 Identities = 13/31 (41%), Positives = 21/31 (67%), Gaps = 2/31 (6%)

Query: 165 SRKKKKKKKKKKKKKKKKKKK--KKKKKKRL 193
           SRKKK+KK+ K + K+   KK   +++K R+
Sbjct: 60  SRKKKEKKEVKAESKRYNAKKLTLEQRKARV 90



 Score = 29.1 bits (66), Expect = 0.43
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKK 190
           R    +KKK+KK+ K + K+   KK
Sbjct: 56  RADPSRKKKEKKEVKAESKRYNAKK 80


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 33.2 bits (76), Expect = 0.079
 Identities = 13/29 (44%), Positives = 17/29 (58%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRLYIF 196
            +  K KK KKKKKKKKK++K      + 
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRKDLDEDELE 298



 Score = 32.8 bits (75), Expect = 0.11
 Identities = 14/28 (50%), Positives = 17/28 (60%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKK 189
           D     +  K KK KKKKKKKKK++K  
Sbjct: 265 DYYDVSEMVKFKKPKKKKKKKKKRRKDL 292



 Score = 32.4 bits (74), Expect = 0.13
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKK 186
            D +   K KK KKKKKKKKK++K  
Sbjct: 267 YDVSEMVKFKKPKKKKKKKKKRRKDL 292



 Score = 27.0 bits (60), Expect = 7.8
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           KK KK++KKK+ ++KK     +K+K 
Sbjct: 96  KKSKKRQKKKEAERKKALLLDEKEKE 121


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 32.8 bits (75), Expect = 0.087
 Identities = 9/27 (33%), Positives = 19/27 (70%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R KK++  + ++KKK+KK K +++  +
Sbjct: 368 RDKKERPGRYRRKKKEKKAKSERRGLQ 394



 Score = 32.8 bits (75), Expect = 0.11
 Identities = 10/35 (28%), Positives = 20/35 (57%)

Query: 158 LRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           L+        K K ++ KK++  + ++KKK+KK +
Sbjct: 353 LKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAK 387



 Score = 32.0 bits (73), Expect = 0.16
 Identities = 11/29 (37%), Positives = 20/29 (68%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           T  K ++ KK++  + ++KKK+KK K +R
Sbjct: 362 TKAKPERDKKERPGRYRRKKKEKKAKSER 390



 Score = 31.6 bits (72), Expect = 0.21
 Identities = 9/28 (32%), Positives = 18/28 (64%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           + K ++ KK++  + ++KKK+KK K   
Sbjct: 363 KAKPERDKKERPGRYRRKKKEKKAKSER 390



 Score = 30.4 bits (69), Expect = 0.58
 Identities = 9/26 (34%), Positives = 19/26 (73%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRL 193
           + KK++  + ++KKK+KK K +++ L
Sbjct: 368 RDKKERPGRYRRKKKEKKAKSERRGL 393



 Score = 30.1 bits (68), Expect = 0.73
 Identities = 8/26 (30%), Positives = 18/26 (69%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           KK++  + ++KKK+KK K +++  + 
Sbjct: 370 KKERPGRYRRKKKEKKAKSERRGLQN 395



 Score = 30.1 bits (68), Expect = 0.76
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRL 193
             K K ++ KK++  + ++KKK+KK  
Sbjct: 361 PTKAKPERDKKERPGRYRRKKKEKKAK 387



 Score = 30.1 bits (68), Expect = 0.87
 Identities = 9/27 (33%), Positives = 20/27 (74%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           + ++ KK++  + ++KKK+KK K ++R
Sbjct: 365 KPERDKKERPGRYRRKKKEKKAKSERR 391



 Score = 28.5 bits (64), Expect = 2.5
 Identities = 7/27 (25%), Positives = 18/27 (66%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            +K++  + ++KKK+KK K +++  + 
Sbjct: 369 DKKERPGRYRRKKKEKKAKSERRGLQN 395



 Score = 28.5 bits (64), Expect = 2.7
 Identities = 8/31 (25%), Positives = 16/31 (51%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           K   K K ++ KK++  + ++KKK      +
Sbjct: 359 KPPTKAKPERDKKERPGRYRRKKKEKKAKSE 389


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 32.7 bits (75), Expect = 0.090
 Identities = 14/26 (53%), Positives = 19/26 (73%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           R+++ KKKK+ K K  K+  KKKKKK
Sbjct: 80  REERLKKKKRVKTKAYKEPTKKKKKK 105



 Score = 32.4 bits (74), Expect = 0.11
 Identities = 14/27 (51%), Positives = 16/27 (59%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R KKKK+ K K  K+  KKKKKK    
Sbjct: 83  RLKKKKRVKTKAYKEPTKKKKKKDPTA 109



 Score = 32.0 bits (73), Expect = 0.15
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +++++ KKKK+ K K  K+  KKKKK+
Sbjct: 79  QREERLKKKKRVKTKAYKEPTKKKKKK 105



 Score = 32.0 bits (73), Expect = 0.15
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           ++ KKKK+ K K  K+  KKKKKK 
Sbjct: 82  ERLKKKKRVKTKAYKEPTKKKKKKD 106



 Score = 32.0 bits (73), Expect = 0.16
 Identities = 13/36 (36%), Positives = 18/36 (50%)

Query: 158 LRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           L+  +   +KK+ K K  K+  KKKKKK     K  
Sbjct: 78  LQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSP 113



 Score = 30.4 bits (69), Expect = 0.49
 Identities = 11/27 (40%), Positives = 14/27 (51%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +KKKKK     K  K    + KKK +R
Sbjct: 100 KKKKKKDPTAAKSPKAAAPRPKKKSER 126



 Score = 29.3 bits (66), Expect = 1.2
 Identities = 12/29 (41%), Positives = 22/29 (75%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
            R K+++ ++KK + K +K+K+KKK+K L
Sbjct: 155 ERLKEREIRRKKIQAKARKRKEKKKEKEL 183



 Score = 28.5 bits (64), Expect = 2.3
 Identities = 13/33 (39%), Positives = 14/33 (42%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRLYIFKKHT 200
           KKKKKK     K  K    + KKK   I    T
Sbjct: 100 KKKKKKDPTAAKSPKAAAPRPKKKSERISWAPT 132



 Score = 28.1 bits (63), Expect = 2.6
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +G    +++++ KKKK+ K K  K+  KKK+
Sbjct: 73  EGEKELQREERLKKKKRVKTKAYKEPTKKKK 103



 Score = 27.7 bits (62), Expect = 3.4
 Identities = 10/33 (30%), Positives = 15/33 (45%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIF 196
           T +KKKK     K  K    + KKK ++  +  
Sbjct: 99  TKKKKKKDPTAAKSPKAAAPRPKKKSERISWAP 131



 Score = 26.6 bits (59), Expect = 8.5
 Identities = 9/28 (32%), Positives = 17/28 (60%)

Query: 171 KKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
            +K+ +++++ KKKK+ K K      KK
Sbjct: 74  GEKELQREERLKKKKRVKTKAYKEPTKK 101



 Score = 26.6 bits (59), Expect = 9.4
 Identities = 10/27 (37%), Positives = 12/27 (44%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
              KKKKKK     K  K    + KK+
Sbjct: 97  EPTKKKKKKDPTAAKSPKAAAPRPKKK 123


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 32.9 bits (75), Expect = 0.10
 Identities = 14/28 (50%), Positives = 21/28 (75%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           RKKKK++K K+K+ KK K  +K+ K +L
Sbjct: 20  RKKKKEEKAKEKELKKLKAAQKEAKAKL 47



 Score = 29.1 bits (65), Expect = 1.7
 Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRLY 194
           ++KKKK++K K+K+ KK K  +K+   
Sbjct: 19  ERKKKKEEKAKEKELKKLKAAQKEAKA 45



 Score = 28.7 bits (64), Expect = 2.6
 Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           T  + ++KKKK++K K+K+ KK K  +K
Sbjct: 14  TEEELERKKKKEEKAKEKELKKLKAAQK 41



 Score = 27.6 bits (61), Expect = 5.7
 Identities = 11/31 (35%), Positives = 21/31 (67%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRLYIFK 197
           ++ ++KKKK++K K+K+ KK K  ++    K
Sbjct: 16  EELERKKKKEEKAKEKELKKLKAAQKEAKAK 46


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 32.7 bits (75), Expect = 0.12
 Identities = 11/29 (37%), Positives = 22/29 (75%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
             K+  K++K+ +K KK+++KKKK+ ++L
Sbjct: 558 EEKEALKEQKRLRKLKKQEEKKKKELEKL 586



 Score = 31.2 bits (71), Expect = 0.40
 Identities = 11/28 (39%), Positives = 23/28 (82%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           R+K++K+  K++K+ +K KK+++KKK+ 
Sbjct: 555 REKEEKEALKEQKRLRKLKKQEEKKKKE 582



 Score = 30.8 bits (70), Expect = 0.56
 Identities = 10/27 (37%), Positives = 22/27 (81%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +++K+  K++K+ +K KK+++KKKK+ 
Sbjct: 557 KEEKEALKEQKRLRKLKKQEEKKKKEL 583



 Score = 29.6 bits (67), Expect = 1.1
 Identities = 10/29 (34%), Positives = 19/29 (65%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLY 194
              K++K+ +K KK+++KKKK+ +K    
Sbjct: 561 EALKEQKRLRKLKKQEEKKKKELEKLEKA 589



 Score = 28.8 bits (65), Expect = 1.8
 Identities = 11/30 (36%), Positives = 22/30 (73%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRLY 194
           + K++K+ +K KK+++KKKK+ +K +K   
Sbjct: 562 ALKEQKRLRKLKKQEEKKKKELEKLEKAKI 591



 Score = 28.8 bits (65), Expect = 2.4
 Identities = 11/29 (37%), Positives = 22/29 (75%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRLYI 195
            K++K+ +K KK+++KKKK+ +K ++  I
Sbjct: 563 LKEQKRLRKLKKQEEKKKKELEKLEKAKI 591



 Score = 28.5 bits (64), Expect = 2.7
 Identities = 9/27 (33%), Positives = 23/27 (85%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +++K++K+  K++K+ +K KK+++KK+
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKK 580



 Score = 27.7 bits (62), Expect = 4.9
 Identities = 7/26 (26%), Positives = 21/26 (80%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRL 193
           +++K++K+  K++K+ +K KK++++ 
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKK 579


>gnl|CDD|239564 cd03482, MutL_Trans_MutL, MutL_Trans_MutL: transducer domain,
          having a ribosomal S5 domain 2-like fold, found in
          proteins similar to Escherichia coli MutL.  EcMutL
          belongs to the DNA mismatch repair (MutL/MLH1/PMS2)
          family.  This transducer domain is homologous to the
          second domain of the DNA gyrase B subunit, which is
          known to be important in nucleotide hydrolysis and the
          transduction of structural signals from the ATP-binding
          site to the DNA breakage/reunion regions of the
          enzymes.  It has been suggested that during initiation
          of DNA mismatch repair in E. coli, the mismatch
          recognition protein MutS recruits MutL in the presence
          of ATP.  The MutS(ATP)-MutL ternary complex formed,
          then recruits the latent endonuclease MutH. Prokaryotic
          MutS and MutL are homodimers.
          Length = 123

 Score = 31.4 bits (72), Expect = 0.12
 Identities = 13/29 (44%), Positives = 18/29 (62%), Gaps = 2/29 (6%)

Query: 40 FTLSGFISSAIHGHGRSTADRQFYFVNSR 68
            LSG+I  A+    RS AD Q+++VN R
Sbjct: 27 LRLSGWI--ALPTFARSQADIQYFYVNGR 53


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 30.9 bits (70), Expect = 0.13
 Identities = 14/32 (43%), Positives = 22/32 (68%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           G     KKKKKKKKKK K K++   +K+++++
Sbjct: 14  GKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEK 45



 Score = 30.5 bits (69), Expect = 0.14
 Identities = 15/25 (60%), Positives = 17/25 (68%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKR 192
           K KK   KKKKKKKKKK K K++  
Sbjct: 13  KGKKIDVKKKKKKKKKKNKSKEEVV 37



 Score = 30.5 bits (69), Expect = 0.14
 Identities = 15/24 (62%), Positives = 17/24 (70%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKK 190
           K KK   KKKKKKKKKK K K++ 
Sbjct: 13  KGKKIDVKKKKKKKKKKNKSKEEV 36



 Score = 30.5 bits (69), Expect = 0.17
 Identities = 16/25 (64%), Positives = 18/25 (72%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           K K KK   KKKKKKKKKK K K++
Sbjct: 11  KLKGKKIDVKKKKKKKKKKNKSKEE 35



 Score = 28.5 bits (64), Expect = 0.74
 Identities = 14/35 (40%), Positives = 23/35 (65%)

Query: 158 LRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           L+      +KKKKKKKKK K K++   +K++++K 
Sbjct: 12  LKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKS 46



 Score = 28.5 bits (64), Expect = 0.78
 Identities = 16/26 (61%), Positives = 16/26 (61%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
             K K K KK   KKKKKKKKKK K 
Sbjct: 7   GGKLKLKGKKIDVKKKKKKKKKKNKS 32



 Score = 25.8 bits (57), Expect = 6.9
 Identities = 14/23 (60%), Positives = 15/23 (65%)

Query: 170 KKKKKKKKKKKKKKKKKKKKKKR 192
             K K K KK   KKKKKKKKK+
Sbjct: 7   GGKLKLKGKKIDVKKKKKKKKKK 29


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region covers
           both the Brf homology II and III regions. This region is
           involved in binding TATA binding protein.
          Length = 95

 Score = 30.7 bits (70), Expect = 0.14
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            K+ K+K  +     KKKKK+K KKKR
Sbjct: 34  EKELKQKADEGNNSGKKKKKRKAKKKR 60



 Score = 29.1 bits (66), Expect = 0.55
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            ++++K+ K+K  +     KKKKK+K 
Sbjct: 30  EEQEEKELKQKADEGNNSGKKKKKRKA 56



 Score = 29.1 bits (66), Expect = 0.56
 Identities = 11/27 (40%), Positives = 15/27 (55%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
             K+K  +     KKKKK+K KKK+  
Sbjct: 36  ELKQKADEGNNSGKKKKKRKAKKKRDE 62



 Score = 28.8 bits (65), Expect = 0.61
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            ++ K+K  +     KKKKK+K KKK+
Sbjct: 34  EKELKQKADEGNNSGKKKKKRKAKKKR 60



 Score = 28.4 bits (64), Expect = 0.82
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           ++K+ K+K  +     KKKKK+K K+
Sbjct: 33  EEKELKQKADEGNNSGKKKKKRKAKK 58



 Score = 28.0 bits (63), Expect = 1.3
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +++K+ K+K  +     KKKKK+K +
Sbjct: 32  QEEKELKQKADEGNNSGKKKKKRKAK 57



 Score = 26.4 bits (59), Expect = 3.7
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           ++K  +     KKKKK+K KKK+ + 
Sbjct: 38  KQKADEGNNSGKKKKKRKAKKKRDEA 63



 Score = 26.4 bits (59), Expect = 3.8
 Identities = 10/26 (38%), Positives = 15/26 (57%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           +K  +     KKKKK+K KKK+ +  
Sbjct: 39  QKADEGNNSGKKKKKRKAKKKRDEAG 64


>gnl|CDD|206749 cd01856, YlqF, Circularly permuted YlqF GTPase.  Proteins of the
           YlqF family contain all sequence motifs typical of the
           vast class of P-loop-containing GTPases, but show a
           circular permutation, with a G4-G1-G3 pattern of motifs
           as opposed to the regular G1-G3-G4 pattern seen in most
           GTPases. The YlqF subfamily is represented in all
           eukaryotes as well as a phylogenetically diverse array
           of bacteria (including gram-positive bacteria,
           proteobacteria, Synechocystis, Borrelia, and
           Thermotoga).
          Length = 171

 Score = 31.3 bits (72), Expect = 0.16
 Identities = 14/42 (33%), Positives = 21/42 (50%)

Query: 154 MFANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYI 195
           +F N +NG G  +  KK KK  K+ +K K K    +  R  +
Sbjct: 79  LFVNAKNGKGVKKLLKKAKKLLKENEKLKAKGLLPRPLRAMV 120


>gnl|CDD|131231 TIGR02176, pyruv_ox_red, pyruvate:ferredoxin (flavodoxin)
          oxidoreductase, homodimeric.  This model represents a
          single chain form of pyruvate:ferredoxin (or
          flavodoxin) oxidoreductase. This enzyme may transfer
          electrons to nitrogenase in nitrogen-fixing species.
          Portions of this protein are homologous to gamma
          subunit of the four subunit pyruvate:ferredoxin
          (flavodoxin) oxidoreductase.
          Length = 1165

 Score = 32.4 bits (74), Expect = 0.17
 Identities = 15/60 (25%), Positives = 26/60 (43%), Gaps = 10/60 (16%)

Query: 2  GHVSLTISQLINIIPITQSSPDASLLEE---------FRIPPDITAMFTLSGFISSAIHG 52
           HV+   S++  I PIT SS     +++         F     +  M + +G  + A+HG
Sbjct: 11 AHVAYAFSEVAAIYPITPSSTMGEYVDDWAAQGRKNIFGQTVKVVEMQSEAG-AAGAVHG 69


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 31.4 bits (72), Expect = 0.20
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKK 188
           KKK KK KK+K+K++K+ +K  
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGA 35



 Score = 31.4 bits (72), Expect = 0.20
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKK 189
           KKK KK KK+K+K++K+ +K  
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGA 35



 Score = 31.4 bits (72), Expect = 0.20
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 169 KKKKKKKKKKKKKKKKKKKKKK 190
           KKK KK KK+K+K++K+ +K  
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGA 35



 Score = 31.4 bits (72), Expect = 0.20
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 170 KKKKKKKKKKKKKKKKKKKKKK 191
           KKK KK KK+K+K++K+ +K  
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGA 35



 Score = 30.3 bits (69), Expect = 0.42
 Identities = 10/24 (41%), Positives = 17/24 (70%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKK 187
             +KK KK KK+K+K++K+ +K  
Sbjct: 12  VDKKKAKKAKKEKRKQRKQARKGA 35



 Score = 29.9 bits (68), Expect = 0.67
 Identities = 10/20 (50%), Positives = 17/20 (85%)

Query: 173 KKKKKKKKKKKKKKKKKKKR 192
           KKK KK KK+K+K++K+ ++
Sbjct: 14  KKKAKKAKKEKRKQRKQARK 33



 Score = 27.6 bits (62), Expect = 3.2
 Identities = 11/29 (37%), Positives = 19/29 (65%)

Query: 158 LRNGDGTSRKKKKKKKKKKKKKKKKKKKK 186
           L+ G    +K KK KK+K+K++K+ +K  
Sbjct: 7   LKAGLVDKKKAKKAKKEKRKQRKQARKGA 35


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 31.8 bits (72), Expect = 0.22
 Identities = 12/30 (40%), Positives = 16/30 (53%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
             +++ KK KK +  K  K  K KK KKK 
Sbjct: 142 PEAKEVKKPKKGQSPKVPKAPKPKKPKKKG 171



 Score = 31.0 bits (70), Expect = 0.37
 Identities = 12/29 (41%), Positives = 14/29 (48%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKKKKK 189
            +    KK KK +  K  K  K KK KKK
Sbjct: 142 PEAKEVKKPKKGQSPKVPKAPKPKKPKKK 170



 Score = 29.1 bits (65), Expect = 1.4
 Identities = 12/29 (41%), Positives = 13/29 (44%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
               KK KK +  K  K  K KK KKK  
Sbjct: 144 AKEVKKPKKGQSPKVPKAPKPKKPKKKGS 172



 Score = 28.7 bits (64), Expect = 1.9
 Identities = 9/27 (33%), Positives = 10/27 (37%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
              K  K  K KK KKK     +  K 
Sbjct: 154 QSPKVPKAPKPKKPKKKGSVSNRSVKM 180



 Score = 28.3 bits (63), Expect = 2.6
 Identities = 8/31 (25%), Positives = 11/31 (35%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKK 189
                + +  K  K KK KKK     +  K 
Sbjct: 150 PKKGQSPKVPKAPKPKKPKKKGSVSNRSVKM 180



 Score = 28.3 bits (63), Expect = 2.6
 Identities = 11/28 (39%), Positives = 12/28 (42%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKK 190
           G S K  K  K KK KKK     +  K 
Sbjct: 153 GQSPKVPKAPKPKKPKKKGSVSNRSVKM 180



 Score = 28.3 bits (63), Expect = 2.8
 Identities = 11/29 (37%), Positives = 13/29 (44%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
              +K KK +  K  K  K KK KKK   
Sbjct: 145 KEVKKPKKGQSPKVPKAPKPKKPKKKGSV 173


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 31.5 bits (72), Expect = 0.22
 Identities = 10/30 (33%), Positives = 20/30 (66%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRLYIFK 197
           +++K+KKK   K++KK  K++K+K    + 
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEPYG 124



 Score = 28.0 bits (63), Expect = 2.6
 Identities = 11/31 (35%), Positives = 21/31 (67%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYIF 196
            ++K+KKK   K++KK  K++K+K +  Y +
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEPYGY 125


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 31.6 bits (72), Expect = 0.25
 Identities = 21/70 (30%), Positives = 32/70 (45%), Gaps = 3/70 (4%)

Query: 131 NQCILKDNLPVFYKNGFEFRITMMFANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKK 190
              + K+ L  F +   +   T      +  D  +  KKK +KKKKKKK+KK+ K + + 
Sbjct: 31  VLVLSKEILSTFSEEENKVA-TTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGET 89

Query: 191 KRLYIFKKHT 200
           K    FK   
Sbjct: 90  KL--GFKTPK 97



 Score = 28.9 bits (65), Expect = 2.0
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           + +KKKKKK+KK+ K + + K   K  K+
Sbjct: 70  SEKKKKKKKEKKEPKSEGETKLGFKTPKK 98



 Score = 28.5 bits (64), Expect = 2.6
 Identities = 11/29 (37%), Positives = 16/29 (55%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
              +KK+KK+ K + + K   K  KK KK
Sbjct: 73  KKKKKKEKKEPKSEGETKLGFKTPKKSKK 101



 Score = 28.1 bits (63), Expect = 3.5
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
               +KKKKK+KK+ K + + K   K  KK
Sbjct: 69  KSEKKKKKKKEKKEPKSEGETKLGFKTPKK 98



 Score = 27.8 bits (62), Expect = 4.3
 Identities = 14/34 (41%), Positives = 19/34 (55%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           S KKKKKKK+KK+ K + + K   K  +     K
Sbjct: 70  SEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTK 103



 Score = 27.4 bits (61), Expect = 5.6
 Identities = 19/52 (36%), Positives = 24/52 (46%), Gaps = 9/52 (17%)

Query: 156 ANLRNGDGTSRKKKKKKKKKKKKKK---------KKKKKKKKKKKRLYIFKK 198
            +  N      +KKKKKKK+KK+ K         K  KK KK KK+    K 
Sbjct: 60  EDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP 111


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 31.4 bits (71), Expect = 0.27
 Identities = 10/29 (34%), Positives = 21/29 (72%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           T   K+ K +  K+++K+K++ K++KKK+
Sbjct: 92  TKPAKEPKNESGKEEEKEKEQVKEEKKKK 120



 Score = 30.6 bits (69), Expect = 0.49
 Identities = 11/31 (35%), Positives = 23/31 (74%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +  +   K+++K+K++ K++KKKKK+K K+ 
Sbjct: 97  EPKNESGKEEEKEKEQVKEEKKKKKEKPKEE 127



 Score = 30.6 bits (69), Expect = 0.55
 Identities = 13/33 (39%), Positives = 24/33 (72%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           N  G   +K+K++ K++KKKKK+K K++ K ++
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132



 Score = 30.2 bits (68), Expect = 0.67
 Identities = 12/32 (37%), Positives = 23/32 (71%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKK 190
            +G    ++K++ K++KKKKK+K K++ K +K
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132



 Score = 29.5 bits (66), Expect = 1.4
 Identities = 10/34 (29%), Positives = 21/34 (61%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
                   K+ K +  K+++K+K++ K++KKKK+
Sbjct: 88  PAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKK 121



 Score = 29.5 bits (66), Expect = 1.4
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
             K++ K++KKKKK+K K++ K +K K
Sbjct: 108 KEKEQVKEEKKKKKEKPKEEPKDRKPK 134



 Score = 29.1 bits (65), Expect = 1.5
 Identities = 13/40 (32%), Positives = 26/40 (65%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKKHT 200
               S K+++K+K++ K++KKKKK+K K++ +    K+  
Sbjct: 98  PKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEA 137



 Score = 28.7 bits (64), Expect = 2.2
 Identities = 11/27 (40%), Positives = 21/27 (77%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           RK K++ K+K+  K+K+K+K+KK ++ 
Sbjct: 131 RKPKEEAKEKRPPKEKEKEKEKKVEEP 157



 Score = 27.9 bits (62), Expect = 3.8
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
               KKKKK+K K++ K +K K++ K+K+ 
Sbjct: 113 VKEEKKKKKEKPKEEPKDRKPKEEAKEKRP 142



 Score = 27.9 bits (62), Expect = 4.2
 Identities = 10/28 (35%), Positives = 20/28 (71%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
             + +K K++ K+K+  K+K+K+K+KK 
Sbjct: 127 EPKDRKPKEEAKEKRPPKEKEKEKEKKV 154



 Score = 27.5 bits (61), Expect = 4.6
 Identities = 9/28 (32%), Positives = 20/28 (71%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
             K++ K+K+  K+K+K+K+KK ++ + 
Sbjct: 132 KPKEEAKEKRPPKEKEKEKEKKVEEPRD 159



 Score = 27.5 bits (61), Expect = 5.4
 Identities = 11/34 (32%), Positives = 19/34 (55%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R    +  KK  KKK   KKK+  +++K+++  R
Sbjct: 168 RVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAR 201



 Score = 27.5 bits (61), Expect = 5.4
 Identities = 10/28 (35%), Positives = 20/28 (71%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
              K +K K++ K+K+  K+K+K+K+K+
Sbjct: 126 EEPKDRKPKEEAKEKRPPKEKEKEKEKK 153



 Score = 27.2 bits (60), Expect = 7.2
 Identities = 13/31 (41%), Positives = 22/31 (70%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +    +KKKKK+K K++ K +K K++ K+KR
Sbjct: 111 EQVKEEKKKKKEKPKEEPKDRKPKEEAKEKR 141



 Score = 27.2 bits (60), Expect = 8.1
 Identities = 11/26 (42%), Positives = 20/26 (76%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           K +K K++ K+K+  K+K+K+K+KK 
Sbjct: 129 KDRKPKEEAKEKRPPKEKEKEKEKKV 154


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 31.7 bits (72), Expect = 0.27
 Identities = 15/28 (53%), Positives = 16/28 (57%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           RK K K +K KKK    K   KKKKK L
Sbjct: 116 RKPKPKTEKLKKKITVNKSTNKKKKKVL 143



 Score = 30.6 bits (69), Expect = 0.52
 Identities = 14/37 (37%), Positives = 16/37 (43%)

Query: 155 FANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           FA         + K K +K KKK    K   KKKKK 
Sbjct: 106 FAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKV 142



 Score = 28.6 bits (64), Expect = 2.4
 Identities = 12/35 (34%), Positives = 17/35 (48%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
                +  +K K K +K KKK    K   KKKK++
Sbjct: 108 GPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKV 142


>gnl|CDD|223789 COG0717, Dcd, Deoxycytidine deaminase [Nucleotide transport and
           metabolism].
          Length = 183

 Score = 30.7 bits (70), Expect = 0.28
 Identities = 14/49 (28%), Positives = 18/49 (36%), Gaps = 12/49 (24%)

Query: 24  ASLLEEFRIPPDITAMFTLSGFISSAIHGHGRSTADRQFYFVNSRPCEP 72
           A  LE   IP D+ A  T            GRS+  R    V+    +P
Sbjct: 85  AVTLEYVEIPEDVAAFCT------------GRSSLARLGLIVHVGVIDP 121


>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain.  This family
           contains a C-terminal presumed domain in Sas10 which
           hash been identified as a regulator of chromatin
           silencing.
          Length = 76

 Score = 29.5 bits (67), Expect = 0.29
 Identities = 9/33 (27%), Positives = 18/33 (54%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRLY 194
               R KK+KK +K  K+ K +++  +++   Y
Sbjct: 27  RRNPRVKKRKKYEKAVKRLKSQRRVVREEDGGY 59



 Score = 25.7 bits (57), Expect = 6.6
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           G +RK+KK ++  + KK+KK +K  K+ K 
Sbjct: 18  GLTRKRKKDRRNPRVKKRKKYEKAVKRLKS 47


>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
          Length = 145

 Score = 30.4 bits (69), Expect = 0.30
 Identities = 14/38 (36%), Positives = 19/38 (50%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKKHT 200
              + KKK KK  K  KK  K KK+K +  +  F+  T
Sbjct: 3   PAKKAKKKAKKAAKAAKKGVKVKKRKVRTSVRFFRPKT 40



 Score = 29.3 bits (66), Expect = 0.82
 Identities = 15/29 (51%), Positives = 16/29 (55%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
              KK KKK KK  K  KK  K KK+K R
Sbjct: 2   APAKKAKKKAKKAAKAAKKGVKVKKRKVR 30


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 31.5 bits (71), Expect = 0.31
 Identities = 16/29 (55%), Positives = 17/29 (58%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKK 190
           DG   K KKKKKK  K KKK    KK K+
Sbjct: 235 DGDEEKSKKKKKKLAKNKKKLDDDKKGKR 263



 Score = 30.7 bits (69), Expect = 0.48
 Identities = 16/33 (48%), Positives = 18/33 (54%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            G     ++K KKKKKK  K KKK    KK KR
Sbjct: 231 KGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKR 263



 Score = 29.5 bits (66), Expect = 1.1
 Identities = 15/33 (45%), Positives = 19/33 (57%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           + G+    +K KKKKKK  K KKK    KK K+
Sbjct: 231 KGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKR 263


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this family
           (Kri1p) is found to be required for 40S ribosome
           biogenesis in the nucleolus.
          Length = 99

 Score = 29.9 bits (68), Expect = 0.31
 Identities = 9/35 (25%), Positives = 24/35 (68%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKKHT 200
           RK+++K +++++ K+ K  K+++ +++L   KK  
Sbjct: 3   RKEEEKAQREEELKRLKNLKREEIEEKLEKIKKVA 37



 Score = 28.4 bits (64), Expect = 0.96
 Identities = 6/25 (24%), Positives = 20/25 (80%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           K++K+++K +++++ K+ K  K+++
Sbjct: 1   KERKEEEKAQREEELKRLKNLKREE 25



 Score = 27.6 bits (62), Expect = 2.0
 Identities = 5/25 (20%), Positives = 20/25 (80%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKK 190
           +++K+++K +++++ K+ K  K+++
Sbjct: 1   KERKEEEKAQREEELKRLKNLKREE 25



 Score = 27.2 bits (61), Expect = 2.6
 Identities = 9/29 (31%), Positives = 21/29 (72%)

Query: 170 KKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           K++K+++K +++++ K+ K  KR  I +K
Sbjct: 1   KERKEEEKAQREEELKRLKNLKREEIEEK 29



 Score = 26.5 bits (59), Expect = 3.9
 Identities = 6/24 (25%), Positives = 18/24 (75%)

Query: 169 KKKKKKKKKKKKKKKKKKKKKKKR 192
           K++K+++K +++++ K+ K  K+ 
Sbjct: 1   KERKEEEKAQREEELKRLKNLKRE 24


>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN).  This family
           consists of several eukaryotic survival motor neuron
           (SMN) proteins. The Survival of Motor Neurons (SMN)
           protein, the product of the spinal muscular
           atrophy-determining gene, is part of a large
           macromolecular complex (SMN complex) that functions in
           the assembly of spliceosomal small nuclear
           ribonucleoproteins (snRNPs). The SMN complex functions
           as a specificity factor essential for the efficient
           assembly of Sm proteins on U snRNAs and likely protects
           cells from illicit, and potentially deleterious,
           non-specific binding of Sm proteins to RNAs.
          Length = 264

 Score = 31.1 bits (70), Expect = 0.32
 Identities = 10/32 (31%), Positives = 19/32 (59%)

Query: 158 LRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKK 189
           L+N D  + ++  ++   KK+K  KK + +KK
Sbjct: 29  LKNEDCATPQENDEQNPGKKRKNNKKNRSRKK 60


>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal.  The yeast member
           of this family (Kri1p) is found to be required for 40S
           ribosome biogenesis in the nucleolus. This is the
           C-terminal domain of the family.
          Length = 93

 Score = 29.1 bits (66), Expect = 0.43
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R K++++K KKK  KK + ++ KKK  
Sbjct: 67  RDKEERRKDKKKYGKKARLREWKKKVF 93



 Score = 26.8 bits (60), Expect = 3.6
 Identities = 9/25 (36%), Positives = 19/25 (76%)

Query: 170 KKKKKKKKKKKKKKKKKKKKKKRLY 194
           K++++K KKK  KK + ++ KK+++
Sbjct: 69  KEERRKDKKKYGKKARLREWKKKVF 93



 Score = 26.0 bits (58), Expect = 5.7
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKK 190
            +++++K KKK  KK + ++ KKK  
Sbjct: 68  DKEERRKDKKKYGKKARLREWKKKVF 93


>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
          Length = 381

 Score = 30.8 bits (70), Expect = 0.46
 Identities = 11/29 (37%), Positives = 16/29 (55%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLY 194
            K  +K K ++ +KK KK +  K  K LY
Sbjct: 348 AKGAQKVKNRRARKKAKKARLAKVAKALY 376



 Score = 30.0 bits (68), Expect = 0.68
 Identities = 11/33 (33%), Positives = 17/33 (51%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
             ++K   K  +K K ++ +KK KK RL    K
Sbjct: 341 LAQEKATAKGAQKVKNRRARKKAKKARLAKVAK 373



 Score = 27.7 bits (62), Expect = 3.8
 Identities = 11/35 (31%), Positives = 18/35 (51%)

Query: 156 ANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKK 190
           A L     T++  +K K ++ +KK KK +  K  K
Sbjct: 339 AVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAK 373


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 30.8 bits (70), Expect = 0.49
 Identities = 14/28 (50%), Positives = 24/28 (85%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRLYI 195
           ++K KKKKKK+KKK+++ K+++K R+ I
Sbjct: 742 EEKTKKKKKKEKKKEEEYKREEKARIEI 769



 Score = 28.5 bits (64), Expect = 3.1
 Identities = 9/27 (33%), Positives = 13/27 (48%)

Query: 173  KKKKKKKKKKKKKKKKKKKRLYIFKKH 199
            +   KK+K KK+ K   +  L  F K 
Sbjct: 1563 ESDIKKRKNKKQYKSNTEAELDFFLKR 1589


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 30.7 bits (70), Expect = 0.54
 Identities = 10/30 (33%), Positives = 13/30 (43%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
             T+ K    K   KKK K +    KK +K
Sbjct: 79  TKTAAKAAAAKAPAKKKLKDELDSSKKAEK 108



 Score = 30.3 bits (69), Expect = 0.63
 Identities = 9/28 (32%), Positives = 13/28 (46%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            ++    K   KKK K +    KK +KK
Sbjct: 82  AAKAAAAKAPAKKKLKDELDSSKKAEKK 109



 Score = 30.0 bits (68), Expect = 0.76
 Identities = 9/29 (31%), Positives = 10/29 (34%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           T    K    K   KKK K +    KK  
Sbjct: 79  TKTAAKAAAAKAPAKKKLKDELDSSKKAE 107



 Score = 30.0 bits (68), Expect = 0.88
 Identities = 8/28 (28%), Positives = 9/28 (32%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
              K   K    K   KKK K +    K
Sbjct: 77  KKTKTAAKAAAAKAPAKKKLKDELDSSK 104



 Score = 29.2 bits (66), Expect = 1.5
 Identities = 14/35 (40%), Positives = 14/35 (40%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
            S   KKK K   K    K   KKK K  L   KK
Sbjct: 71  ESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKK 105



 Score = 29.2 bits (66), Expect = 1.6
 Identities = 12/30 (40%), Positives = 12/30 (40%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           D T     KKK K   K    K   KKK K
Sbjct: 68  DATESDIPKKKTKTAAKAAAAKAPAKKKLK 97



 Score = 28.8 bits (65), Expect = 1.8
 Identities = 9/30 (30%), Positives = 11/30 (36%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           D   +K K   K    K   KKK K +   
Sbjct: 73  DIPKKKTKTAAKAAAAKAPAKKKLKDELDS 102



 Score = 28.8 bits (65), Expect = 2.1
 Identities = 10/30 (33%), Positives = 11/30 (36%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           D  +     KKK K   K    K   KKK 
Sbjct: 67  DDATESDIPKKKTKTAAKAAAAKAPAKKKL 96



 Score = 28.8 bits (65), Expect = 2.1
 Identities = 9/28 (32%), Positives = 11/28 (39%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
                 K   KKK K +    KK +KK 
Sbjct: 83  AKAAAAKAPAKKKLKDELDSSKKAEKKN 110



 Score = 28.8 bits (65), Expect = 2.2
 Identities = 9/30 (30%), Positives = 12/30 (40%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
              + K   KKK K +    KK +KK    
Sbjct: 84  KAAAAKAPAKKKLKDELDSSKKAEKKNALD 113



 Score = 27.7 bits (62), Expect = 5.0
 Identities = 9/37 (24%), Positives = 11/37 (29%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
              + +    KKK K   K    K   KKK       
Sbjct: 66  TDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDS 102



 Score = 27.7 bits (62), Expect = 5.1
 Identities = 10/33 (30%), Positives = 13/33 (39%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIF 196
              KKK K +    KK +KK    K     Y+ 
Sbjct: 90  APAKKKLKDELDSSKKAEKKNALDKDDDLNYVK 122



 Score = 26.9 bits (60), Expect = 8.4
 Identities = 9/29 (31%), Positives = 13/29 (44%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
             ++   KKK K +    KK +KK    K
Sbjct: 86  AAAKAPAKKKLKDELDSSKKAEKKNALDK 114



 Score = 26.9 bits (60), Expect = 9.6
 Identities = 8/31 (25%), Positives = 10/31 (32%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
             T    +    KKK K   K    K   K+
Sbjct: 64  KDTDDATESDIPKKKTKTAAKAAAAKAPAKK 94


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 30.0 bits (68), Expect = 0.55
 Identities = 14/27 (51%), Positives = 17/27 (62%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           ++KKKKKKK  KKKK KK      K +
Sbjct: 180 EEKKKKKKKSAKKKKLKKVAAVGMKAI 206



 Score = 29.2 bits (66), Expect = 1.0
 Identities = 14/27 (51%), Positives = 14/27 (51%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKK 190
              KKKKKKK  KKKK KK      K 
Sbjct: 179 EEEKKKKKKKSAKKKKLKKVAAVGMKA 205



 Score = 29.2 bits (66), Expect = 1.1
 Identities = 14/27 (51%), Positives = 15/27 (55%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
             +KKKKKKK  KKKK KK      K 
Sbjct: 179 EEEKKKKKKKSAKKKKLKKVAAVGMKA 205



 Score = 29.2 bits (66), Expect = 1.2
 Identities = 14/28 (50%), Positives = 17/28 (60%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            S   +++KKKKKKK  KKKK KK    
Sbjct: 174 KSNNAEEEKKKKKKKSAKKKKLKKVAAV 201



 Score = 28.8 bits (65), Expect = 1.7
 Identities = 14/31 (45%), Positives = 15/31 (48%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKK 189
            N     +KKKKKK  KKKK KK      K 
Sbjct: 175 SNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205



 Score = 27.7 bits (62), Expect = 3.7
 Identities = 14/32 (43%), Positives = 17/32 (53%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
             D       +++KKKKKKK  KKKK KK   
Sbjct: 169 ARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAA 200



 Score = 26.5 bits (59), Expect = 9.4
 Identities = 14/27 (51%), Positives = 19/27 (70%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           + + KKK    +++KKKKKKK  KKKK
Sbjct: 168 AARDKKKSNNAEEEKKKKKKKSAKKKK 194


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 29.5 bits (67), Expect = 0.55
 Identities = 12/31 (38%), Positives = 25/31 (80%)

Query: 169 KKKKKKKKKKKKKKKKKKKKKKKRLYIFKKH 199
           ++ K++KKK+ K+KK+++K++K++L   KK 
Sbjct: 96  ERNKQEKKKRSKEKKEEEKERKRQLKQQKKK 126



 Score = 29.1 bits (66), Expect = 0.86
 Identities = 12/34 (35%), Positives = 25/34 (73%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKKH 199
            + K++KKK+ K+KK+++K++K++ K+     KH
Sbjct: 96  ERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKH 129



 Score = 28.4 bits (64), Expect = 1.2
 Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 6/33 (18%)

Query: 166 RKKKKKKKKKKKKKKKKKKK------KKKKKKR 192
            K++KKK+ K+KK+++K++K      KKK K R
Sbjct: 98  NKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130



 Score = 28.4 bits (64), Expect = 1.3
 Identities = 11/27 (40%), Positives = 22/27 (81%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            R K+KK+++K++K++ K++KKK K +
Sbjct: 104 KRSKEKKEEEKERKRQLKQQKKKAKHR 130



 Score = 27.2 bits (61), Expect = 4.0
 Identities = 11/31 (35%), Positives = 23/31 (74%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           K + ++ K++KKK+ K+KK+++K+R    K+
Sbjct: 92  KLEHERNKQEKKKRSKEKKEEEKERKRQLKQ 122


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 30.4 bits (69), Expect = 0.55
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 158 LRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           L++      +K   + K+KKK+KKK++KKKK+ +RL
Sbjct: 313 LKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERL 348



 Score = 28.5 bits (64), Expect = 2.7
 Identities = 12/40 (30%), Positives = 20/40 (50%)

Query: 152 TMMFANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            M+       D   + K K ++  +K   + K+KKK+KKK
Sbjct: 298 KMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKK 337


>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694).  Family of
           hypothetical bacterial proteins.
          Length = 278

 Score = 30.3 bits (69), Expect = 0.55
 Identities = 16/44 (36%), Positives = 28/44 (63%), Gaps = 1/44 (2%)

Query: 156 ANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK-RLYIFKK 198
             L +  G   ++  + ++  K+KKKKKKKKKKK++ R ++ +K
Sbjct: 66  EKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREWMCEK 109


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 30.6 bits (69), Expect = 0.59
 Identities = 15/32 (46%), Positives = 17/32 (53%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           G  T  KK K KKK KK KKK+ K+   K   
Sbjct: 141 GIETKAKKGKAKKKTKKSKKKEAKESSDKDDE 172



 Score = 27.5 bits (61), Expect = 5.4
 Identities = 12/27 (44%), Positives = 16/27 (59%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +K K KKK KK KKK+ K+   K  + 
Sbjct: 147 KKGKAKKKTKKSKKKEAKESSDKDDEE 173


>gnl|CDD|221054 pfam11274, DUF3074, Protein of unknown function (DUF3074).  This
           eukaryotic family of proteins has no known function but
           appears to be part of the START superfamily.
          Length = 183

 Score = 30.0 bits (68), Expect = 0.59
 Identities = 11/42 (26%), Positives = 18/42 (42%), Gaps = 1/42 (2%)

Query: 48  SAIHGHGRSTADRQFYFVNSRPCEPSKVSGQAEGFIIVKYDS 89
           +A       +  R+F  V S P + S  +   +GF+   Y S
Sbjct: 88  TASLPEEAESNPREFMVV-SLPIDHSPDAPPRKGFVRGAYAS 128


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 30.1 bits (68), Expect = 0.59
 Identities = 8/24 (33%), Positives = 9/24 (37%)

Query: 169 KKKKKKKKKKKKKKKKKKKKKKKR 192
                 +K  K K   KK  KK R
Sbjct: 190 TASSGDEKSPKSKAAPKKAGKKMR 213



 Score = 29.7 bits (67), Expect = 0.96
 Identities = 9/31 (29%), Positives = 12/31 (38%)

Query: 158 LRNGDGTSRKKKKKKKKKKKKKKKKKKKKKK 188
                 T+    +K  K K   KK  KK +K
Sbjct: 184 AAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214



 Score = 28.9 bits (65), Expect = 1.7
 Identities = 7/38 (18%), Positives = 11/38 (28%)

Query: 156 ANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
                    + K         +K  K K   KK  K++
Sbjct: 175 EGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKM 212


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 30.1 bits (68), Expect = 0.61
 Identities = 13/27 (48%), Positives = 16/27 (59%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            K+K K +KK KK K K K K K K +
Sbjct: 84  PKEKPKPEKKPKKPKPKPKPKPKPKPK 110



 Score = 29.4 bits (66), Expect = 0.99
 Identities = 13/26 (50%), Positives = 15/26 (57%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +K K +KK KK K K K K K K K 
Sbjct: 86  EKPKPEKKPKKPKPKPKPKPKPKPKV 111



 Score = 28.6 bits (64), Expect = 2.0
 Identities = 13/26 (50%), Positives = 14/26 (53%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           KK KK K K K K K K K K + K 
Sbjct: 92  KKPKKPKPKPKPKPKPKPKVKPQPKP 117



 Score = 28.6 bits (64), Expect = 2.0
 Identities = 13/27 (48%), Positives = 14/27 (51%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            KK K K K K K K K K + K KK 
Sbjct: 94  PKKPKPKPKPKPKPKPKVKPQPKPKKP 120



 Score = 28.6 bits (64), Expect = 2.2
 Identities = 13/26 (50%), Positives = 15/26 (57%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           K +KK KK K K K K K K K K +
Sbjct: 89  KPEKKPKKPKPKPKPKPKPKPKVKPQ 114



 Score = 27.8 bits (62), Expect = 3.1
 Identities = 14/28 (50%), Positives = 16/28 (57%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           T  K K+K K +KK KK K K K K K 
Sbjct: 80  TPPKPKEKPKPEKKPKKPKPKPKPKPKP 107



 Score = 27.4 bits (61), Expect = 4.6
 Identities = 12/26 (46%), Positives = 15/26 (57%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +KK KK K K K K K K K K + +
Sbjct: 91  EKKPKKPKPKPKPKPKPKPKVKPQPK 116



 Score = 27.4 bits (61), Expect = 5.0
 Identities = 12/26 (46%), Positives = 14/26 (53%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
            +KK KK K K K K K K K K + 
Sbjct: 90  PEKKPKKPKPKPKPKPKPKPKVKPQP 115



 Score = 27.1 bits (60), Expect = 7.0
 Identities = 11/26 (42%), Positives = 13/26 (50%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           + K K K K K K K K + K KK  
Sbjct: 96  KPKPKPKPKPKPKPKVKPQPKPKKPP 121


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
           is an essential protein in yeast that is involved in
           pre-rRNA processing and ribosome assembly. It is
           speculated to be required for correct assembly of rpS27
           into the pre-ribosomal particle.
          Length = 131

 Score = 29.5 bits (67), Expect = 0.63
 Identities = 13/30 (43%), Positives = 21/30 (70%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKK 188
           +   G SR K  ++++K K+KKKKKKK+ +
Sbjct: 61  KRKAGASRNKAAEERRKLKEKKKKKKKELE 90



 Score = 28.7 bits (65), Expect = 1.2
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKK 189
           R     + + K  ++++K K+KKKKKKK+ +
Sbjct: 60  RKRKAGASRNKAAEERRKLKEKKKKKKKELE 90



 Score = 28.7 bits (65), Expect = 1.2
 Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 169 KKKKKKKKKKKKKKKKKKKKKKKRLYIF 196
           + K  ++++K K+KKKKKKK+ +  Y F
Sbjct: 68  RNKAAEERRKLKEKKKKKKKELENFYRF 95



 Score = 27.2 bits (61), Expect = 3.7
 Identities = 14/33 (42%), Positives = 20/33 (60%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
             G  RK    + K  ++++K K+KKKKKKK L
Sbjct: 57  RGGRKRKAGASRNKAAEERRKLKEKKKKKKKEL 89


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 30.4 bits (69), Expect = 0.64
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 164  TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
              ++ K K K K  K +K K KKK+KKK+
Sbjct: 1154 KEQRLKSKTKGKASKLRKPKLKKKEKKKK 1182



 Score = 27.7 bits (62), Expect = 4.9
 Identities = 14/27 (51%), Positives = 17/27 (62%)

Query: 165  SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
             + K K K  K +K K KKK+KKKKK 
Sbjct: 1158 LKSKTKGKASKLRKPKLKKKEKKKKKS 1184



 Score = 27.7 bits (62), Expect = 4.9
 Identities = 14/28 (50%), Positives = 17/28 (60%)

Query: 164  TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
             S+ K K  K +K K KKK+KKKKK   
Sbjct: 1159 KSKTKGKASKLRKPKLKKKEKKKKKSSA 1186



 Score = 27.7 bits (62), Expect = 5.2
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 164  TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
               K+++ K K K K  K +K K KKK++
Sbjct: 1151 EIAKEQRLKSKTKGKASKLRKPKLKKKEK 1179



 Score = 27.7 bits (62), Expect = 5.3
 Identities = 14/28 (50%), Positives = 15/28 (53%)

Query: 165  SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
             RK K KKK+KKKKK    K KK     
Sbjct: 1169 LRKPKLKKKEKKKKKSSADKSKKASVVG 1196



 Score = 27.3 bits (61), Expect = 7.5
 Identities = 13/35 (37%), Positives = 17/35 (48%)

Query: 164  TSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
             ++ K  K +K K KKK+KKKKK    K       
Sbjct: 1161 KTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVV 1195



 Score = 27.3 bits (61), Expect = 7.7
 Identities = 15/29 (51%), Positives = 18/29 (62%)

Query: 163  GTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            G + K +K K KKK+KKKKK    K KK 
Sbjct: 1164 GKASKLRKPKLKKKEKKKKKSSADKSKKA 1192



 Score = 27.3 bits (61), Expect = 7.8
 Identities = 14/30 (46%), Positives = 16/30 (53%)

Query: 162  DGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
                  K +K K KKK+KKKKK    K KK
Sbjct: 1162 TKGKASKLRKPKLKKKEKKKKKSSADKSKK 1191



 Score = 26.9 bits (60), Expect = 9.5
 Identities = 14/37 (37%), Positives = 19/37 (51%)

Query: 164  TSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKKHT 200
            T  K  K +K K KKK+KKKKK    K +      ++
Sbjct: 1162 TKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNS 1198


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p), ribosomal RNA-processing protein 7 homolog A
           (Rrp7A), and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
           YCL031C gene from Saccharomyces cerevisiae. It is an
           essential yeast protein involved in pre-rRNA processing
           and ribosome assembly, and is speculated to be required
           for correct assembly of rpS27 into the pre-ribosomal
           particle. Rrp7A, also termed gastric cancer antigen
           Zg14, is the Rrp7p homolog mainly found in Metazoans.
           The cellular function of Rrp7A remains unclear
           currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 118

 Score = 29.1 bits (66), Expect = 0.65
 Identities = 15/38 (39%), Positives = 22/38 (57%), Gaps = 6/38 (15%)

Query: 165 SRKKKKKK------KKKKKKKKKKKKKKKKKKKRLYIF 196
           +R  +K K       + K K+K+KKKKKKK+ +  Y F
Sbjct: 57  TRGGRKGKTAREEAVEAKAKEKEKKKKKKKELEDFYRF 94



 Score = 28.8 bits (65), Expect = 0.89
 Identities = 14/31 (45%), Positives = 20/31 (64%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
              RK K  +++  + K K+K+KKKKKKK L
Sbjct: 58  RGGRKGKTAREEAVEAKAKEKEKKKKKKKEL 88



 Score = 28.4 bits (64), Expect = 1.1
 Identities = 16/44 (36%), Positives = 23/44 (52%), Gaps = 9/44 (20%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKK---------KKKKRL 193
           R G     +  + K K+K+KKKKKKK+ +         KKK+ L
Sbjct: 61  RKGKTAREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEEL 104



 Score = 26.4 bits (59), Expect = 5.2
 Identities = 11/29 (37%), Positives = 19/29 (65%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           T   +K K  +++  + K K+K+KKKKK+
Sbjct: 57  TRGGRKGKTAREEAVEAKAKEKEKKKKKK 85



 Score = 26.1 bits (58), Expect = 8.3
 Identities = 13/41 (31%), Positives = 22/41 (53%), Gaps = 6/41 (14%)

Query: 156 ANLRNGDGTSRKKKKKKKKKKKKKKKK------KKKKKKKK 190
                 +    K K+K+KKKKKKK+ +       ++KKK++
Sbjct: 63  GKTAREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEE 103


>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
           splicing MRS1.  Members of this family adopt a secondary
           structure consisting of two beta sheets and one alpha
           helix, arranged as a beta-alpha-beta motif. Each beta
           sheet has five strands, arranged in a 32145 order, with
           the second strand being antiparallel to the rest.
           Mitochondrial resolvase Ydc2 is capable of resolving
           Holliday junctions and cleaves DNA after 5'-CT-3' and
           5'-TT-3' sequences. This family also contains the
           mitochondrial RNA-splicing protein MRS1 which is
           involved in the excision of group I introns.
          Length = 254

 Score = 29.6 bits (67), Expect = 0.78
 Identities = 11/28 (39%), Positives = 15/28 (53%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           R +   +KKK + KKK  K  KK +  L
Sbjct: 148 RTEILAEKKKPRSKKKSSKNSKKLRIDL 175



 Score = 28.5 bits (64), Expect = 1.9
 Identities = 8/36 (22%), Positives = 13/36 (36%)

Query: 149 FRITMMFANLRNGDGTSRKKKKKKKKKKKKKKKKKK 184
            R+   +          +K + KKK  K  KK +  
Sbjct: 139 KRMANYWCERTEILAEKKKPRSKKKSSKNSKKLRID 174



 Score = 28.5 bits (64), Expect = 1.9
 Identities = 10/28 (35%), Positives = 14/28 (50%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKK 189
           + T    +KKK + KKK  K  KK +  
Sbjct: 147 ERTEILAEKKKPRSKKKSSKNSKKLRID 174



 Score = 28.5 bits (64), Expect = 1.9
 Identities = 9/29 (31%), Positives = 12/29 (41%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKK 187
                   +KKK + KKK  K  KK +  
Sbjct: 146 CERTEILAEKKKPRSKKKSSKNSKKLRID 174



 Score = 27.7 bits (62), Expect = 3.6
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 170 KKKKKKKKKKKKKKKKKKKKKKRL 193
            +KKK + KKK  K  KK +   +
Sbjct: 153 AEKKKPRSKKKSSKNSKKLRIDLV 176


>gnl|CDD|218660 pfam05620, DUF788, Protein of unknown function (DUF788).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 166

 Score = 29.6 bits (67), Expect = 0.79
 Identities = 9/33 (27%), Positives = 19/33 (57%)

Query: 151 ITMMFANLRNGDGTSRKKKKKKKKKKKKKKKKK 183
           +   FA+L +  G    + K K+++K +K+ +K
Sbjct: 134 VLGPFASLPSSQGAETNETKSKRQEKLEKRGEK 166


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 29.6 bits (67), Expect = 0.80
 Identities = 14/32 (43%), Positives = 20/32 (62%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           N      +  K + ++++ KKKKKKKKKK KK
Sbjct: 163 NPTEEEVELLKARLEEERAKKKKKKKKKKTKK 194



 Score = 29.6 bits (67), Expect = 1.0
 Identities = 12/27 (44%), Positives = 14/27 (51%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
             + KKKKKKKKKK KK        + 
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSAEA 204



 Score = 29.2 bits (66), Expect = 1.4
 Identities = 13/25 (52%), Positives = 18/25 (72%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           K + ++++ KKKKKKKKKK KK   
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNA 197



 Score = 28.9 bits (65), Expect = 1.8
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           + + ++++ KKKKKKKKKK KK    
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNAT 198



 Score = 28.1 bits (63), Expect = 2.6
 Identities = 12/25 (48%), Positives = 16/25 (64%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           ++++ KKKKKKKKKK KK       
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSS 201



 Score = 28.1 bits (63), Expect = 2.9
 Identities = 13/26 (50%), Positives = 14/26 (53%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           R KKKKKKKKKK KK        +  
Sbjct: 180 RAKKKKKKKKKKTKKNNATGSSAEAT 205



 Score = 27.7 bits (62), Expect = 3.5
 Identities = 13/28 (46%), Positives = 17/28 (60%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           R ++++ KKKKKKKKKK KK        
Sbjct: 175 RLEEERAKKKKKKKKKKTKKNNATGSSA 202



 Score = 27.7 bits (62), Expect = 3.6
 Identities = 12/26 (46%), Positives = 15/26 (57%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +++ KKKKKKKKKK KK         
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSAE 203



 Score = 27.7 bits (62), Expect = 3.8
 Identities = 11/27 (40%), Positives = 14/27 (51%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           ++KKKKKKKKK KK        +    
Sbjct: 181 AKKKKKKKKKKTKKNNATGSSAEATVS 207



 Score = 26.9 bits (60), Expect = 7.6
 Identities = 10/27 (37%), Positives = 12/27 (44%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +KKKKKKKK KK        +      
Sbjct: 183 KKKKKKKKKTKKNNATGSSAEATVSSA 209


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation of
           60S ribosomal subunits.
          Length = 132

 Score = 29.3 bits (66), Expect = 0.80
 Identities = 12/34 (35%), Positives = 21/34 (61%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           S+K  K KKK K +K +KK K++ + ++    +K
Sbjct: 27  SKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEK 60



 Score = 28.1 bits (63), Expect = 1.5
 Identities = 13/28 (46%), Positives = 18/28 (64%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           SR KK  K KKK K +K +KK K++ + 
Sbjct: 25  SRSKKLLKAKKKLKSEKLEKKAKRQLRA 52



 Score = 28.1 bits (63), Expect = 1.6
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
              K KKK K +K +KK K++ + +K++ L
Sbjct: 29  KLLKAKKKLKSEKLEKKAKRQLRAEKRQAL 58



 Score = 26.2 bits (58), Expect = 7.6
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRL 193
           + KK  K KKK K +K +KK K++  
Sbjct: 26  RSKKLLKAKKKLKSEKLEKKAKRQLR 51


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 30.1 bits (68), Expect = 0.87
 Identities = 17/41 (41%), Positives = 22/41 (53%)

Query: 154 MFANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRLY 194
             A L + +  +  K+  KKKKKKKKKKKKK   +     Y
Sbjct: 39  FLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAY 79



 Score = 27.8 bits (62), Expect = 3.9
 Identities = 11/31 (35%), Positives = 19/31 (61%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
               + + + K++K++ K+K  KKK KK KK
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKK 31



 Score = 26.6 bits (59), Expect = 9.9
 Identities = 13/30 (43%), Positives = 20/30 (66%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKKKKKKKKKK 189
           N +G +  K++K++ K+K  KKK KK KK 
Sbjct: 3   NENGENEVKQQKQQNKQKGTKKKNKKSKKD 32


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
           (DUF1764).  This is a family of eukaryotic proteins of
           unknown function. This family contains many hypothetical
           proteins.
          Length = 98

 Score = 28.6 bits (64), Expect = 0.90
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           +     +KKKK+  K  + K  KK +KK KKK  
Sbjct: 19  SNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDE 52



 Score = 27.4 bits (61), Expect = 2.1
 Identities = 12/34 (35%), Positives = 18/34 (52%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
            +K+KKKKK+  K  + K  KK +KK +      
Sbjct: 21  IKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFP 54



 Score = 25.5 bits (56), Expect = 9.2
 Identities = 12/32 (37%), Positives = 18/32 (56%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYI 195
           T+R K  KK +KK KKK +  +  ++ KR   
Sbjct: 34  TARPKATKKGQKKDKKKDEFPEFPEESKRRRT 65


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 30.0 bits (68), Expect = 0.95
 Identities = 11/24 (45%), Positives = 17/24 (70%)

Query: 175 KKKKKKKKKKKKKKKKKRLYIFKK 198
           ++KKK KK  +K+ +  RLY+ KK
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKK 107



 Score = 28.4 bits (64), Expect = 3.5
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLY 194
           R+KKK KK  +K+ +  +    KK+K L 
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKKRKELQ 112



 Score = 26.9 bits (60), Expect = 9.9
 Identities = 9/31 (29%), Positives = 16/31 (51%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           ++KKK KK  +K+ +  +    KKR  +   
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKKRKELQAL 114


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
           region.  This highly conserved region is found towards
           the C-terminus of the transmembrane domain. The function
           is unclear.
          Length = 151

 Score = 29.2 bits (65), Expect = 0.96
 Identities = 16/36 (44%), Positives = 23/36 (63%)

Query: 156 ANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           A  R   G ++ +K  KKKK+K  +KK K KKK++K
Sbjct: 11  AKQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEK 46


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 29.4 bits (66), Expect = 0.99
 Identities = 16/34 (47%), Positives = 22/34 (64%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           S+K  KKKK   +KKK  KK KKKKK++  + + 
Sbjct: 47  SKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPEL 80



 Score = 28.2 bits (63), Expect = 2.0
 Identities = 15/28 (53%), Positives = 19/28 (67%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           + +  KKKK   +KKK  KK KKKKK+K
Sbjct: 47  SKKTSKKKKTTPRKKKTTKKTKKKKKEK 74



 Score = 28.2 bits (63), Expect = 2.5
 Identities = 16/27 (59%), Positives = 21/27 (77%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKK 190
           TS+KKK   +KKK  KK KKKKK+K++
Sbjct: 50  TSKKKKTTPRKKKTTKKTKKKKKEKEE 76


>gnl|CDD|187810 cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Multidomain protein with permuted HD nuclease domain,
           palm domain and Zn-ribbon; MTH326-like has inactivated
           polymerase catalytic domain; alr1562 and slr7011 -
           predicted only on the basis of size, presence of HD
           domain, and location with RAMPs in one operon; signature
           gene for type III; also known as Crm2 family.
          Length = 475

 Score = 29.7 bits (67), Expect = 1.1
 Identities = 13/68 (19%), Positives = 28/68 (41%), Gaps = 5/68 (7%)

Query: 133 CILKDNLPVFYKNGFEFRITMMFANLRNGDGTSRKKKKKKKKKKKKKKKKKK-----KKK 187
           C++K  LP   K        ++  + RN +    K +  +++ ++  +K+ +      K 
Sbjct: 260 CLIKRLLPERLKEKLVLIDEVIKRSKRNREEPRLKYENFREELRELLEKEPEIVYLFIKA 319

Query: 188 KKKKRLYI 195
           KK    Y 
Sbjct: 320 KKNPGPYY 327


>gnl|CDD|222928 PHA02778, PHA02778, major capsid L1 protein; Provisional.
          Length = 503

 Score = 29.4 bits (67), Expect = 1.2
 Identities = 6/29 (20%), Positives = 17/29 (58%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           G  R+ +++K+K       +  K++++K+
Sbjct: 475 GLRRRTRRRKRKSTSSSSSRSTKRRRRKR 503



 Score = 27.5 bits (62), Expect = 5.7
 Identities = 6/30 (20%), Positives = 14/30 (46%)

Query: 156 ANLRNGDGTSRKKKKKKKKKKKKKKKKKKK 185
           A LR      ++K       +  K++++K+
Sbjct: 474 AGLRRRTRRRKRKSTSSSSSRSTKRRRRKR 503



 Score = 27.1 bits (61), Expect = 6.7
 Identities = 5/29 (17%), Positives = 15/29 (51%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKKKKK 189
           G     +++K+K       +  K++++K+
Sbjct: 475 GLRRRTRRRKRKSTSSSSSRSTKRRRRKR 503



 Score = 26.7 bits (60), Expect = 8.6
 Identities = 6/27 (22%), Positives = 17/27 (62%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R++ +++K+K       +  K++++KR
Sbjct: 477 RRRTRRRKRKSTSSSSSRSTKRRRRKR 503


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 29.3 bits (66), Expect = 1.3
 Identities = 11/26 (42%), Positives = 15/26 (57%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           K + K K K K K K  KK +++ KR
Sbjct: 94  KPEPKPKPKPKPKPKPVKKVEEQPKR 119



 Score = 28.9 bits (65), Expect = 1.5
 Identities = 9/26 (34%), Positives = 15/26 (57%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           K K K K  KK +++ K++ K  + R
Sbjct: 102 KPKPKPKPVKKVEEQPKREVKPVEPR 127



 Score = 28.5 bits (64), Expect = 2.1
 Identities = 10/26 (38%), Positives = 15/26 (57%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           K K K K K K  KK +++ K++ K 
Sbjct: 98  KPKPKPKPKPKPVKKVEEQPKREVKP 123



 Score = 27.7 bits (62), Expect = 3.3
 Identities = 9/26 (34%), Positives = 15/26 (57%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
            K K K K K K  KK +++ K++ +
Sbjct: 97  PKPKPKPKPKPKPVKKVEEQPKREVK 122


>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
           (ETRAMP).  These sequences represent a family of
           proteins from the malaria parasite Plasmodium
           falciparum, several of which have been shown to be
           expressed specifically in the ring stage as well as the
           rodent parasite Plasmodium yoelii. A homologue from
           Plasmodium chabaudi was localized to the parasitophorous
           vacuole membrane. Members have an initial hydrophobic,
           Phe/Tyr-rich, stretch long enough to span the membrane,
           a highly charged region rich in Lys, a second putative
           transmembrane region and a second highly charged, low
           complexity sequence region. Some members have up to 100
           residues of additional C-terminal sequence. These genes
           have been shown to be found in the sub-telomeric regions
           of both Plasmodium falciparum and P. yoelii chromosomes.
          Length = 82

 Score = 27.6 bits (62), Expect = 1.3
 Identities = 15/28 (53%), Positives = 18/28 (64%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRLYI 195
            KKKK KK KK     +KKKK KK++ I
Sbjct: 29  AKKKKLKKLKKIDDDLEKKKKNKKKIII 56



 Score = 27.6 bits (62), Expect = 1.6
 Identities = 15/28 (53%), Positives = 17/28 (60%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRLY 194
            KKKK KK KK     +KKKK KKK + 
Sbjct: 29  AKKKKLKKLKKIDDDLEKKKKNKKKIII 56



 Score = 26.1 bits (58), Expect = 4.7
 Identities = 15/30 (50%), Positives = 19/30 (63%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKKKKKKKKKK 189
           N +  ++KKK KK KK     +KKKK KKK
Sbjct: 24  NNNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 28.8 bits (65), Expect = 1.3
 Identities = 17/29 (58%), Positives = 22/29 (75%), Gaps = 2/29 (6%)

Query: 166 RKKKKKK--KKKKKKKKKKKKKKKKKKKR 192
           R+KK+++  KK+K  KK KKKKK KKK R
Sbjct: 176 RQKKREENLKKRKDDKKNKKKKKAKKKGR 204



 Score = 28.4 bits (64), Expect = 1.9
 Identities = 11/26 (42%), Positives = 22/26 (84%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           R+K++++K +KK+K+K+ KKK+  +K
Sbjct: 12  RRKREQRKARKKQKRKEAKKKEDAQK 37



 Score = 28.4 bits (64), Expect = 2.1
 Identities = 10/27 (37%), Positives = 23/27 (85%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R++K++++K +KK+K+K+ KKK+  ++
Sbjct: 11  RRRKREQRKARKKQKRKEAKKKEDAQK 37



 Score = 28.0 bits (63), Expect = 2.6
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKK 188
           KK+K  KK KKKKK KKK +  
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206



 Score = 28.0 bits (63), Expect = 2.6
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKK 189
           KK+K  KK KKKKK KKK +  
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206



 Score = 28.0 bits (63), Expect = 2.6
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 169 KKKKKKKKKKKKKKKKKKKKKK 190
           KK+K  KK KKKKK KKK +  
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206



 Score = 27.7 bits (62), Expect = 3.3
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKK 187
           +K+K  KK KKKKK KKK +  
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206



 Score = 27.3 bits (61), Expect = 4.3
 Identities = 9/26 (34%), Positives = 22/26 (84%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           ++++K++++K +KK+K+K+ KKK+  
Sbjct: 10  QRRRKREQRKARKKQKRKEAKKKEDA 35



 Score = 27.3 bits (61), Expect = 5.5
 Identities = 9/26 (34%), Positives = 23/26 (88%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +++++K++++K +KK+K+K+ KKK+ 
Sbjct: 9   EQRRRKREQRKARKKQKRKEAKKKED 34



 Score = 26.9 bits (60), Expect = 6.5
 Identities = 9/27 (33%), Positives = 23/27 (85%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           ++++K++++K +KK+K+K+ KKK+  +
Sbjct: 10  QRRRKREQRKARKKQKRKEAKKKEDAQ 36



 Score = 26.5 bits (59), Expect = 7.8
 Identities = 15/27 (55%), Positives = 23/27 (85%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           RK+K+KKK KK+ K++K+K +KKK +R
Sbjct: 150 RKEKQKKKSKKEWKERKEKVEKKKAER 176


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 29.3 bits (66), Expect = 1.3
 Identities = 12/25 (48%), Positives = 16/25 (64%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKK 189
           SRK+K  KK  KK    KK+K+K+ 
Sbjct: 224 SRKRKAPKKVAKKVAAAKKRKQKRS 248



 Score = 29.3 bits (66), Expect = 1.4
 Identities = 12/32 (37%), Positives = 17/32 (53%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIF 196
           S  +K+K  KK  KK    KK+K+K+     F
Sbjct: 222 SPSRKRKAPKKVAKKVAAAKKRKQKRSPPPPF 253



 Score = 28.5 bits (64), Expect = 2.0
 Identities = 12/28 (42%), Positives = 16/28 (57%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           S    +K+K  KK  KK    KK+K+KR
Sbjct: 220 SSSPSRKRKAPKKVAKKVAAAKKRKQKR 247



 Score = 27.3 bits (61), Expect = 4.9
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           +S   +K+K  KK  KK    KK+K+K+
Sbjct: 220 SSSPSRKRKAPKKVAKKVAAAKKRKQKR 247



 Score = 27.3 bits (61), Expect = 5.4
 Identities = 10/28 (35%), Positives = 16/28 (57%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKK 190
            +  +K+K  KK  KK    KK+K+K+ 
Sbjct: 221 SSPSRKRKAPKKVAKKVAAAKKRKQKRS 248


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 29.0 bits (65), Expect = 1.4
 Identities = 11/30 (36%), Positives = 18/30 (60%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           + ++ KKK+    +K    KK KKK K++L
Sbjct: 47  SPKQPKKKRPTTPRKPATTKKSKKKDKEKL 76



 Score = 26.6 bits (59), Expect = 8.0
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKK 189
           T+ +K    KK KKK K+K  +++KK
Sbjct: 57  TTPRKPATTKKSKKKDKEKLTEEEKK 82


>gnl|CDD|177798 PLN00208, PLN00208, translation initiation factor (eIF);
           Provisional.
          Length = 145

 Score = 28.6 bits (64), Expect = 1.4
 Identities = 11/28 (39%), Positives = 15/28 (53%)

Query: 170 KKKKKKKKKKKKKKKKKKKKKKRLYIFK 197
            K K K  K +K+ K +   +KR  IFK
Sbjct: 2   PKNKGKGGKNRKRGKNEADDEKRELIFK 29


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 28.6 bits (64), Expect = 1.4
 Identities = 11/38 (28%), Positives = 25/38 (65%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
            + + +++K+  K+ ++K+  K KK+KK+KK   + +K
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEK 129



 Score = 26.6 bits (59), Expect = 6.5
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
                K+ ++K+  K KK+KK+KK+KK  ++L
Sbjct: 99  QKEVSKETEEKEAIKAKKEKKEKKEKKVAEKL 130



 Score = 26.3 bits (58), Expect = 8.6
 Identities = 16/37 (43%), Positives = 23/37 (62%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKKHT 200
           T  K+  K KK+KK+KK+KK  +K  KK+     K+T
Sbjct: 106 TEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNT 142


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 29.3 bits (65), Expect = 1.4
 Identities = 4/26 (15%), Positives = 13/26 (50%)

Query: 167  KKKKKKKKKKKKKKKKKKKKKKKKKR 192
             K + +    + +  ++K +  +KK+
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKK 1373



 Score = 29.3 bits (65), Expect = 1.5
 Identities = 5/26 (19%), Positives = 13/26 (50%)

Query: 166  RKKKKKKKKKKKKKKKKKKKKKKKKK 191
             K + +    + +  ++K +  +KKK
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKK 1373



 Score = 29.3 bits (65), Expect = 1.5
 Identities = 8/34 (23%), Positives = 18/34 (52%)

Query: 165  SRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
            +     + +  ++K +  +KKK++ KK+    KK
Sbjct: 1352 AEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385



 Score = 29.0 bits (64), Expect = 2.0
 Identities = 4/27 (14%), Positives = 11/27 (40%)

Query: 166  RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
             KK  +  K + +    + +  ++K  
Sbjct: 1341 AKKAAEAAKAEAEAAADEAEAAEEKAE 1367



 Score = 29.0 bits (64), Expect = 2.1
 Identities = 4/33 (12%), Positives = 16/33 (48%)

Query: 159  RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            +     ++ + +    + +  ++K +  +KKK+
Sbjct: 1342 KKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKE 1374



 Score = 28.6 bits (63), Expect = 2.6
 Identities = 3/27 (11%), Positives = 13/27 (48%)

Query: 165  SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            ++K  +  K + +    + +  ++K +
Sbjct: 1341 AKKAAEAAKAEAEAAADEAEAAEEKAE 1367



 Score = 28.2 bits (62), Expect = 4.1
 Identities = 5/33 (15%), Positives = 14/33 (42%)

Query: 159  RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            +  D   +K ++ KK  +  K + +    + + 
Sbjct: 1329 KKADAAKKKAEEAKKAAEAAKAEAEAAADEAEA 1361



 Score = 28.2 bits (62), Expect = 4.3
 Identities = 6/30 (20%), Positives = 13/30 (43%)

Query: 169  KKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
            KKK ++ KK  +  K + +         ++
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEE 1364



 Score = 27.8 bits (61), Expect = 5.0
 Identities = 4/36 (11%), Positives = 15/36 (41%)

Query: 165  SRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKKHT 200
            + + KK  +  K + +    + +  +++    +K  
Sbjct: 1338 AEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373



 Score = 27.8 bits (61), Expect = 5.8
 Identities = 5/27 (18%), Positives = 13/27 (48%)

Query: 166  RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            +KK ++ KK  +  K + +    + + 
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEA 1361


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 28.4 bits (64), Expect = 1.4
 Identities = 7/26 (26%), Positives = 19/26 (73%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRL 193
           +  +K+  K K++K+++ ++K+K+ L
Sbjct: 122 ELLEKELAKLKREKRRENERKQKEIL 147



 Score = 28.0 bits (63), Expect = 2.2
 Identities = 7/26 (26%), Positives = 20/26 (76%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRL 193
           K+  K K++K+++ ++K+K+  K+++
Sbjct: 126 KELAKLKREKRRENERKQKEILKEQM 151



 Score = 28.0 bits (63), Expect = 2.3
 Identities = 7/26 (26%), Positives = 19/26 (73%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           K+  K K++K+++ ++K+K+  K++ 
Sbjct: 126 KELAKLKREKRRENERKQKEILKEQM 151



 Score = 27.6 bits (62), Expect = 2.9
 Identities = 6/28 (21%), Positives = 21/28 (75%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           ++  K K++K+++ ++K+K+  K++ ++
Sbjct: 126 KELAKLKREKRRENERKQKEILKEQMKM 153



 Score = 27.6 bits (62), Expect = 3.2
 Identities = 6/24 (25%), Positives = 18/24 (75%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKK 190
           +  +K+  K K++K+++ ++K+K+
Sbjct: 122 ELLEKELAKLKREKRRENERKQKE 145



 Score = 26.5 bits (59), Expect = 6.7
 Identities = 8/27 (29%), Positives = 19/27 (70%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRL 193
             K K++K+++ ++K+K+  K++ K L
Sbjct: 128 LAKLKREKRRENERKQKEILKEQMKML 154


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 28.6 bits (64), Expect = 1.4
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRLYI 195
            +++  K K +K+K K + KK K  K +LYI
Sbjct: 125 KKREVPKPKTEKEKPKTEPKKPKPSKPKLYI 155



 Score = 28.6 bits (64), Expect = 1.5
 Identities = 10/32 (31%), Positives = 19/32 (59%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           + D T ++   + K + K+KKK++  K K +K
Sbjct: 105 STDETEQEDPPETKTESKEKKKREVPKPKTEK 136



 Score = 27.4 bits (61), Expect = 3.8
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           + K + K+KKK++  K K +K+K K 
Sbjct: 116 ETKTESKEKKKREVPKPKTEKEKPKT 141



 Score = 27.0 bits (60), Expect = 4.7
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRL 193
             + K + K+KKK++  K K +K+K  
Sbjct: 114 PPETKTESKEKKKREVPKPKTEKEKPK 140



 Score = 26.3 bits (58), Expect = 8.6
 Identities = 7/26 (26%), Positives = 16/26 (61%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           ++   + K + K+KKK++  K K ++
Sbjct: 111 QEDPPETKTESKEKKKREVPKPKTEK 136


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
           homolog A (Rrp7A) and similar proteins.  The family
           corresponds to the RRP7 domain of Rrp7A, also termed
           gastric cancer antigen Zg14, and similar proteins which
           are yeast ribosomal RNA-processing protein 7 (Rrp7p)
           homologs mainly found in Metazoans. The cellular
           function of Rrp7A remains unclear currently. Rrp7A
           harbors an N-terminal RNA recognition motif (RRM), also
           termed RBD (RNA binding domain) or RNP
           (ribonucleoprotein domain), and a C-terminal RRP7
           domain.
          Length = 129

 Score = 28.4 bits (64), Expect = 1.5
 Identities = 12/31 (38%), Positives = 17/31 (54%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
              R K  +K+    K  +K+KKKKKKK+  
Sbjct: 58  KGRRPKTARKESVAAKAAEKEKKKKKKKELK 88



 Score = 28.0 bits (63), Expect = 1.9
 Identities = 10/29 (34%), Positives = 18/29 (62%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           T + ++ K  +K+    K  +K+KKKKK+
Sbjct: 56  TKKGRRPKTARKESVAAKAAEKEKKKKKK 84


>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
           function prediction only].
          Length = 386

 Score = 29.0 bits (65), Expect = 1.5
 Identities = 9/28 (32%), Positives = 14/28 (50%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKK 189
              +RK+K  +K    K KK  KK + +
Sbjct: 354 AEAARKRKGDRKGVSHKAKKGGKKNQAE 381



 Score = 28.7 bits (64), Expect = 2.2
 Identities = 13/49 (26%), Positives = 23/49 (46%)

Query: 144 KNGFEFRITMMFANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           K+G +  +T+      + D   +K +  +K+K  +K    K KK  KK 
Sbjct: 330 KDGTQTELTLKEDGAFDVDAYPKKAEAARKRKGDRKGVSHKAKKGGKKN 378



 Score = 26.7 bits (59), Expect = 8.8
 Identities = 8/27 (29%), Positives = 11/27 (40%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKKK 187
                RK  +K    K KK  KK + +
Sbjct: 355 EAARKRKGDRKGVSHKAKKGGKKNQAE 381


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 28.9 bits (65), Expect = 1.7
 Identities = 15/27 (55%), Positives = 18/27 (66%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +KKKKKKKK   KK K KK   K++K 
Sbjct: 256 KKKKKKKKKVVHKKYKTKKLTGKQRKA 282



 Score = 28.9 bits (65), Expect = 1.8
 Identities = 17/28 (60%), Positives = 19/28 (67%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           S+ KKKKKKKKK   KK K KK   K+R
Sbjct: 253 SKVKKKKKKKKKVVHKKYKTKKLTGKQR 280



 Score = 28.5 bits (64), Expect = 2.3
 Identities = 16/34 (47%), Positives = 19/34 (55%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            N     +KKKKKKK   KK K KK   K++K R
Sbjct: 250 ANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKAR 283



 Score = 28.5 bits (64), Expect = 2.6
 Identities = 16/27 (59%), Positives = 16/27 (59%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R    K KKKKKKKKK   KK K KK 
Sbjct: 249 RANPSKVKKKKKKKKKVVHKKYKTKKL 275



 Score = 27.0 bits (60), Expect = 7.9
 Identities = 15/22 (68%), Positives = 16/22 (72%)

Query: 172 KKKKKKKKKKKKKKKKKKKKRL 193
           K KKKKKKKKK   KK K K+L
Sbjct: 254 KVKKKKKKKKKVVHKKYKTKKL 275


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
           describes the eukaryotic 60S (cytosolic) ribosomal
           protein L7 and paralogs that may or may not also be L7.
           Human, Drosophila, and Arabidopsis all have both a
           typical L7 and an L7-related paralog. This family is
           designated subfamily rather than equivalog to reflect
           these uncharacterized paralogs. Members of this family
           average ~ 250 residues in length, somewhat longer than
           the archaeal L30P/L7E homolog (~ 155 residues) and much
           longer than the related bacterial/organellar form (~ 60
           residues).
          Length = 235

 Score = 28.9 bits (65), Expect = 1.7
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           ++K +++   +  K+ K KKK  KKKR   FK+
Sbjct: 6   KRKARQELAVQVAKQAKAKKKANKKKRKIYFKR 38



 Score = 27.7 bits (62), Expect = 3.6
 Identities = 11/28 (39%), Positives = 14/28 (50%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
               +  K+ K KKK  KKK+K   KR 
Sbjct: 12  ELAVQVAKQAKAKKKANKKKRKIYFKRA 39


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 28.1 bits (63), Expect = 1.8
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           RKK K+K ++ K K KK++ +K+K+K 
Sbjct: 65  RKKAKEKLRRDKLKAKKEEAEKEKEKE 91



 Score = 27.7 bits (62), Expect = 2.5
 Identities = 12/43 (27%), Positives = 24/43 (55%), Gaps = 1/43 (2%)

Query: 156 ANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
             LR       KK++ +K+K+K+++  K   + +K+R  + KK
Sbjct: 70  EKLRRDKL-KAKKEEAEKEKEKEERFMKALAEAEKERAELEKK 111



 Score = 26.6 bits (59), Expect = 6.4
 Identities = 11/32 (34%), Positives = 23/32 (71%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           +G  + K+K ++ K K KK++ +K+K+K++R 
Sbjct: 63  EGRKKAKEKLRRDKLKAKKEEAEKEKEKEERF 94


>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein.  This is a family of
           121-amino acid secretory proteins. Laps functions in the
           regulation of neuronal cell adhesion and/or movement and
           synapse attachment. Laps binds to the ApC/EBP (Aplysia
           CCAAT/enhancer binding protein) promoter and activates
           the transcription of ApC/EBP mRNA.
          Length = 124

 Score = 27.9 bits (62), Expect = 1.8
 Identities = 14/25 (56%), Positives = 19/25 (76%)

Query: 169 KKKKKKKKKKKKKKKKKKKKKKKRL 193
            +++ KK KKK++KKKKKKK  K L
Sbjct: 99  NQRQAKKLKKKREKKKKKKKGAKNL 123



 Score = 27.5 bits (61), Expect = 2.6
 Identities = 14/25 (56%), Positives = 17/25 (68%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKK 189
            R+ KK KKK++KKKKKKK  K   
Sbjct: 100 QRQAKKLKKKREKKKKKKKGAKNLA 124



 Score = 27.1 bits (60), Expect = 3.2
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
            +++ KK KKK++KKKKKKK  K   
Sbjct: 99  NQRQAKKLKKKREKKKKKKKGAKNLA 124



 Score = 26.7 bits (59), Expect = 5.2
 Identities = 13/26 (50%), Positives = 19/26 (73%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKK 190
           ++++ KK KKK++KKKKKKK  K   
Sbjct: 99  NQRQAKKLKKKREKKKKKKKGAKNLA 124


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 28.9 bits (65), Expect = 1.8
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFK 197
           R++  ++   K KKKK+KKK+  +K R ++  
Sbjct: 416 REELIEEGLLKSKKKKRKKKEWFEKFRWFVSS 447


>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
          Length = 132

 Score = 28.1 bits (63), Expect = 1.8
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
             RK +K+ K ++KK + K+K K   KK+
Sbjct: 104 AGRKARKELKNRRKKVRGKEKTKVSGKKK 132



 Score = 27.3 bits (61), Expect = 3.3
 Identities = 11/34 (32%), Positives = 19/34 (55%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R G    +K  +K +K+ K ++KK + K+K K  
Sbjct: 95  RLGLIKKKKAGRKARKELKNRRKKVRGKEKTKVS 128



 Score = 26.5 bits (59), Expect = 6.7
 Identities = 13/36 (36%), Positives = 23/36 (63%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           G  +KKK  +K +K+ K ++KK + K+K ++   KK
Sbjct: 97  GLIKKKKAGRKARKELKNRRKKVRGKEKTKVSGKKK 132


>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein.  NinG or Rap is
           involved in recombination. Rap (recombination adept with
           plasmid) increases lambda-by-plasmid recombination
           catalyzed by Escherichia coli's RecBCD pathway.
          Length = 188

 Score = 28.5 bits (64), Expect = 2.0
 Identities = 8/33 (24%), Positives = 19/33 (57%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKKH 199
             K++K ++KK+K + + ++++ K R    K  
Sbjct: 33  ALKREKAQEKKRKAEAQAERRELKARKEKLKTR 65



 Score = 28.1 bits (63), Expect = 2.7
 Identities = 7/30 (23%), Positives = 22/30 (73%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
             +++K ++KK+K + + ++++ K +K++L
Sbjct: 33  ALKREKAQEKKRKAEAQAERRELKARKEKL 62



 Score = 27.3 bits (61), Expect = 4.4
 Identities = 7/29 (24%), Positives = 20/29 (68%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
             + K++K ++KK+K + + ++++ K +K
Sbjct: 31  ALALKREKAQEKKRKAEAQAERRELKARK 59



 Score = 26.1 bits (58), Expect = 9.4
 Identities = 8/33 (24%), Positives = 21/33 (63%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           + ++KK+K + + ++++ K +K+K K    + K
Sbjct: 38  KAQEKKRKAEAQAERRELKARKEKLKTRSDWLK 70


>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
           beta and delta phosphatidylinositol 3-kinases (PI3Ks).
           PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
           regulate cell processes such as cell growth,
           differentiation, proliferation, and motility.  PI3Ks
           work on phosphorylation of phosphatidylinositol,
           phosphatidylinositide (4)P (PtdIns (4)P),2 or
           PtdIns(4,5)P2. Specifically they phosphorylate the D3
           hydroxyl group of phosphoinositol lipids on the inositol
           ring. There are 3 classes of PI3Ks based on structure,
           regulation, and specificity. All classes contain a C2
           domain, a PIK domain, and a kinase catalytic domain.
           The members here are class I, beta and delta isoforms of
           PI3Ks and contain both a Ras-binding domain and a
           p85-binding domain.  Class II PI3Ks contain both of
           these as well as a PX domain, and a C-terminal C2 domain
           containing a nuclear localization signal.  C2 domains
           fold into an 8-standed beta-sandwich that can adopt 2
           structural arrangements: Type I and Type II,
           distinguished by a circular permutation involving their
           N- and C-terminal beta strands. Many C2 domains are
           Ca2+-dependent membrane-targeting modules that bind a
           wide variety of substances including bind phospholipids,
           inositol polyphosphates, and intracellular proteins.
           Most C2 domain proteins are either signal transduction
           enzymes that contain a single C2 domain, such as protein
           kinase C, or membrane trafficking proteins which contain
           at least two C2 domains, such as synaptotagmin 1.
           However, there are a few exceptions to this including
           RIM isoforms and some splice variants of piccolo/aczonin
           and intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions.  Members have a type-I
           topology.
          Length = 173

 Score = 28.4 bits (64), Expect = 2.0
 Identities = 11/22 (50%), Positives = 14/22 (63%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKK 185
              KK K K+ +K + KKKKKK
Sbjct: 84  EVSKKAKGKRSRKNQTKKKKKK 105



 Score = 28.0 bits (63), Expect = 2.3
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 173 KKKKKKKKKKKKKKKKKKKRLYIF 196
             KK K K+ +K + KKKK+    
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKKDDN 108



 Score = 28.0 bits (63), Expect = 2.4
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 167 KKKKKKKKKKKKKKKKKKKKK 187
             KK K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 28.0 bits (63), Expect = 2.4
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 168 KKKKKKKKKKKKKKKKKKKKK 188
             KK K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 28.0 bits (63), Expect = 2.4
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 169 KKKKKKKKKKKKKKKKKKKKK 189
             KK K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 28.0 bits (63), Expect = 2.4
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 170 KKKKKKKKKKKKKKKKKKKKK 190
             KK K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 28.0 bits (63), Expect = 2.4
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 171 KKKKKKKKKKKKKKKKKKKKK 191
             KK K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 27.3 bits (61), Expect = 4.7
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query: 172 KKKKKKKKKKKKKKKKKKKKRLYI 195
             KK K K+ +K + KKKKK+   
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKKDDN 108


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 28.7 bits (64), Expect = 2.0
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query: 173 KKKKKKKKKKKKKKKKKKKRLY 194
            ++ K+KKKKK KKKK  K L 
Sbjct: 86  NQENKQKKKKKDKKKKSPKSLL 107


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 28.5 bits (64), Expect = 2.1
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKK 185
             S K+K++  KK+KKKK KK K
Sbjct: 352 ALSPKRKREGDKKQKKKKSKKLK 374



 Score = 28.5 bits (64), Expect = 2.2
 Identities = 12/30 (40%), Positives = 16/30 (53%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
               R      K+K++  KK+KKKK KK K
Sbjct: 345 AAFERPLALSPKRKREGDKKQKKKKSKKLK 374



 Score = 27.7 bits (62), Expect = 4.1
 Identities = 11/20 (55%), Positives = 16/20 (80%)

Query: 174 KKKKKKKKKKKKKKKKKKRL 193
           K+K++  KK+KKKK KK +L
Sbjct: 356 KRKREGDKKQKKKKSKKLKL 375


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 28.8 bits (65), Expect = 2.2
 Identities = 9/36 (25%), Positives = 24/36 (66%), Gaps = 9/36 (25%)

Query: 166 RKKKKKKKKKKKK---------KKKKKKKKKKKKKR 192
           +K++KKK++++ K         +K ++K++KK+ ++
Sbjct: 287 KKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322



 Score = 28.4 bits (64), Expect = 2.3
 Identities = 10/37 (27%), Positives = 24/37 (64%), Gaps = 9/37 (24%)

Query: 166 RKKKKKKKKKKKKKKK---------KKKKKKKKKKRL 193
            ++KK++KKK++++ K         +K ++K++KK+ 
Sbjct: 284 AQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQA 320


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 28.3 bits (63), Expect = 2.2
 Identities = 10/32 (31%), Positives = 22/32 (68%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
               S K+K+K++ +  +K K+ ++KK+++KR
Sbjct: 159 HRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKR 190



 Score = 27.9 bits (62), Expect = 3.0
 Identities = 10/36 (27%), Positives = 22/36 (61%)

Query: 156 ANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            + +  D   RK+  K+K+K++ +  +K K+ ++KK
Sbjct: 150 LSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKK 185



 Score = 26.8 bits (59), Expect = 7.6
 Identities = 8/34 (23%), Positives = 20/34 (58%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R       +K+  K+K+K++ +  +K K+ ++K+
Sbjct: 152 RKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKK 185


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 28.3 bits (64), Expect = 2.3
 Identities = 10/30 (33%), Positives = 18/30 (60%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRLYIFK 197
           + +K+KKK   K++KK  K++K K    + 
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEPYG 125


>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein.  This model
           describes nucleoid occlusion protein, a close homolog to
           ParB chromosome partitioning proteins including Spo0J in
           Bacillus subtilis. Its gene often is located near the
           gene for the Spo0J ortholog. This protein bind a
           specific DNA sequence and blocks cytokinesis from
           happening until chromosome segregation is complete.
          Length = 255

 Score = 28.2 bits (64), Expect = 2.4
 Identities = 12/28 (42%), Positives = 18/28 (64%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRLYI 195
           KK  +K +K KKKKK++K   K  R+ +
Sbjct: 192 KKLLEKPEKPKKKKKRRKGFSKDVRIAV 219


>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain.  This region has been
           identified as the binding site of the SRP72 protein to
           SRP RNA.
          Length = 57

 Score = 26.3 bits (58), Expect = 2.4
 Identities = 14/28 (50%), Positives = 17/28 (60%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
              KK  KKK   ++KKKK  KKK+K K
Sbjct: 3   KKPKKIAKKKPGTEQKKKKSAKKKRKPK 30



 Score = 25.9 bits (57), Expect = 3.4
 Identities = 14/26 (53%), Positives = 18/26 (69%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRL 193
           KK  KKK   ++KKKK  KKK+K +L
Sbjct: 6   KKIAKKKPGTEQKKKKSAKKKRKPKL 31


>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
          Length = 405

 Score = 28.6 bits (64), Expect = 2.4
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           R K KK+K  KK+K+  K++    K  
Sbjct: 351 RVKAKKEKLAKKRKQVSKEEAAAIKAA 377



 Score = 27.1 bits (60), Expect = 7.1
 Identities = 9/27 (33%), Positives = 15/27 (55%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            + K KK+K  KK+K+  K++    K 
Sbjct: 350 ARVKAKKEKLAKKRKQVSKEEAAAIKA 376


>gnl|CDD|240239 PTZ00041, PTZ00041, 60S ribosomal protein L35a; Provisional.
          Length = 120

 Score = 27.3 bits (61), Expect = 2.5
 Identities = 11/21 (52%), Positives = 13/21 (61%)

Query: 174 KKKKKKKKKKKKKKKKKKRLY 194
             +  KKKKK  +KKK  RLY
Sbjct: 1   MHQLTKKKKKTNRKKKPVRLY 21


>gnl|CDD|220121 pfam09135, Alb1, Alb1.  Alb1 is a nuclear shuttling factor involved
           in ribosome biogenesis.
          Length = 112

 Score = 27.5 bits (61), Expect = 2.5
 Identities = 11/28 (39%), Positives = 19/28 (67%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
                KKK++KKK K+  +K++K++KK 
Sbjct: 43  NEGVSKKKQRKKKPKRLTRKQRKRQKKG 70



 Score = 26.3 bits (58), Expect = 6.4
 Identities = 12/26 (46%), Positives = 20/26 (76%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRL 193
             KKK++KKK K+  +K++K++KK L
Sbjct: 46  VSKKKQRKKKPKRLTRKQRKRQKKGL 71


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 28.2 bits (63), Expect = 2.6
 Identities = 7/35 (20%), Positives = 29/35 (82%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
             ++++K++++++K++KK+++++++K+K    ++K
Sbjct: 192 LKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQK 226



 Score = 28.2 bits (63), Expect = 2.7
 Identities = 7/27 (25%), Positives = 24/27 (88%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            KK K++++K++++++K++KK+++++ 
Sbjct: 189 LKKLKQQQQKREEERRKQRKKQQEEEE 215



 Score = 27.4 bits (61), Expect = 5.6
 Identities = 7/26 (26%), Positives = 23/26 (88%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
             KK K++++K++++++K++KK++++
Sbjct: 188 ELKKLKQQQQKREEERRKQRKKQQEE 213


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 28.5 bits (64), Expect = 2.6
 Identities = 7/33 (21%), Positives = 16/33 (48%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
             +       K  + K+K K ++ K+K++K+  
Sbjct: 261 GFESEYEPINKPVRPKRKTKAQRNKEKRRKELE 293



 Score = 28.1 bits (63), Expect = 2.9
 Identities = 8/28 (28%), Positives = 21/28 (75%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           RK K ++ K+K++K+ +++ K++K+ + 
Sbjct: 277 RKTKAQRNKEKRRKELEREAKEEKQLKK 304



 Score = 27.4 bits (61), Expect = 5.0
 Identities = 9/27 (33%), Positives = 21/27 (77%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKK 190
           T  ++ K+K++K+ +++ K++K+ KKK
Sbjct: 279 TKAQRNKEKRRKELEREAKEEKQLKKK 305



 Score = 27.4 bits (61), Expect = 5.4
 Identities = 7/34 (20%), Positives = 18/34 (52%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
             +     K  + K+K K ++ K+K++K+ ++  
Sbjct: 263 ESEYEPINKPVRPKRKTKAQRNKEKRRKELEREA 296



 Score = 27.4 bits (61), Expect = 5.6
 Identities = 8/33 (24%), Positives = 24/33 (72%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFK 197
           ++ ++ K+K++K+ +++ K++K+ KKK   + +
Sbjct: 279 TKAQRNKEKRRKELEREAKEEKQLKKKLAQLAR 311



 Score = 27.4 bits (61), Expect = 6.1
 Identities = 9/33 (27%), Positives = 20/33 (60%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
             + K+K K ++ K+K++K+ +++ K     KK
Sbjct: 272 PVRPKRKTKAQRNKEKRRKELEREAKEEKQLKK 304


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 27.8 bits (62), Expect = 2.6
 Identities = 12/27 (44%), Positives = 15/27 (55%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           K++K+ K   KK K  K KK  KK  L
Sbjct: 107 KEEKEGKVNAKKDKLSKAKKAAKKAAL 133


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 28.6 bits (64), Expect = 2.7
 Identities = 9/35 (25%), Positives = 21/35 (60%)

Query: 156 ANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKK 190
           A  +       + +KK+K++  + K+K++K ++KK
Sbjct: 400 AKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 27.1 bits (60), Expect = 6.8
 Identities = 8/25 (32%), Positives = 20/25 (80%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
           ++ +KK+K++  + K+K++K ++KK
Sbjct: 410 EENEKKQKEQADEDKEKRQKDERKK 434


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 28.1 bits (63), Expect = 2.7
 Identities = 13/30 (43%), Positives = 18/30 (60%)

Query: 169 KKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
            +++  KK K KKKK  K K  K +Y +KK
Sbjct: 206 LQRELMKKGKGKKKKIVKDKDGKVVYKWKK 235



 Score = 27.3 bits (61), Expect = 4.9
 Identities = 14/27 (51%), Positives = 15/27 (55%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            KK K KKKK  K K  K   K KK+R
Sbjct: 211 MKKGKGKKKKIVKDKDGKVVYKWKKER 237


>gnl|CDD|177283 PHA00451, PHA00451, protein kinase.
          Length = 362

 Score = 28.2 bits (63), Expect = 2.7
 Identities = 8/30 (26%), Positives = 18/30 (60%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKK 188
           R+ +GT +  +K   K++K+ +K + +  K
Sbjct: 256 RDPNGTFQVARKAAMKRRKRNRKLRARNAK 285


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 28.1 bits (63), Expect = 2.8
 Identities = 10/30 (33%), Positives = 18/30 (60%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRLYIFK 197
           K +K++KK   K++KK  K +K+K    + 
Sbjct: 95  KAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124


>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11.  The WW domain is a
           small protein module with a triple-stranded beta-sheet
           fold. This is a family of WW domain binding proteins.
          Length = 78

 Score = 26.5 bits (59), Expect = 2.8
 Identities = 10/27 (37%), Positives = 20/27 (74%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           RK++KKK+ KK K +++ +++ K  K+
Sbjct: 11  RKEQKKKELKKNKAERQARREAKLAKK 37


>gnl|CDD|240380 PTZ00362, PTZ00362, hypothetical protein; Provisional.
          Length = 479

 Score = 28.2 bits (63), Expect = 2.9
 Identities = 12/27 (44%), Positives = 15/27 (55%)

Query: 171 KKKKKKKKKKKKKKKKKKKKKRLYIFK 197
           KKKKKKKKK  K+     +  +  IF 
Sbjct: 9   KKKKKKKKKASKESSNGSQNSKKTIFM 35



 Score = 27.4 bits (61), Expect = 5.9
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 169 KKKKKKKKKKKKKKKKKKKKKKKRLY 194
           KKKKKKKKK  K+     +  KK ++
Sbjct: 9   KKKKKKKKKASKESSNGSQNSKKTIF 34


>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755).  This
           family is predominated by ORFs from Circoviridae. The
           function of this family remains to be determined.
          Length = 122

 Score = 27.4 bits (61), Expect = 3.0
 Identities = 9/29 (31%), Positives = 16/29 (55%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            + KKK+KK +    +  KK   KK+++ 
Sbjct: 76  QTHKKKRKKSRHVSSRSAKKISAKKRRRS 104



 Score = 26.6 bits (59), Expect = 4.6
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +S+  KKK+KK +    +  KK   KK+R
Sbjct: 74  SSQTHKKKRKKSRHVSSRSAKKISAKKRR 102



 Score = 26.6 bits (59), Expect = 5.4
 Identities = 9/29 (31%), Positives = 16/29 (55%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           T +KK+KK +    +  KK   KK+++  
Sbjct: 77  THKKKRKKSRHVSSRSAKKISAKKRRRSS 105


>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
           SecD/SecF; Reviewed.
          Length = 755

 Score = 28.3 bits (64), Expect = 3.1
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 167 KKKKKKKKKKKKKKKKKKKKK 187
           +K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 28.3 bits (64), Expect = 3.1
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 168 KKKKKKKKKKKKKKKKKKKKK 188
           +K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 28.3 bits (64), Expect = 3.1
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 169 KKKKKKKKKKKKKKKKKKKKK 189
           +K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 28.3 bits (64), Expect = 3.1
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 170 KKKKKKKKKKKKKKKKKKKKK 190
           +K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 28.3 bits (64), Expect = 3.1
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 171 KKKKKKKKKKKKKKKKKKKKK 191
           +K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 27.1 bits (61), Expect = 8.1
 Identities = 10/20 (50%), Positives = 15/20 (75%)

Query: 173 KKKKKKKKKKKKKKKKKKKR 192
           +K++ KKKKK+KK KK +  
Sbjct: 733 EKRRLKKKKKRKKVKKWEVE 752



 Score = 26.7 bits (60), Expect = 8.9
 Identities = 10/21 (47%), Positives = 17/21 (80%)

Query: 172 KKKKKKKKKKKKKKKKKKKKR 192
           +K++ KKKKK+KK KK + ++
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753


>gnl|CDD|216256 pfam01032, FecCD, FecCD transport family.  This is a sub-family
          of bacterial binding protein-dependent transport
          systems family. This Pfam entry contains the inner
          components of this multicomponent transport system.
          Length = 311

 Score = 28.2 bits (64), Expect = 3.1
 Identities = 11/50 (22%), Positives = 24/50 (48%), Gaps = 5/50 (10%)

Query: 2  GHVSLTISQLINIIPITQSSPDASLLEEFRIPPDITAMFT-----LSGFI 46
          G VS++ S + + +     S  A ++ + R+P  + A+       +SG +
Sbjct: 13 GSVSISPSDVFSALFGPDDSLAAFIVWDLRLPRVLAALLVGAALAVSGAL 62


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 28.5 bits (63), Expect = 3.1
 Identities = 11/43 (25%), Positives = 23/43 (53%), Gaps = 8/43 (18%)

Query: 158 LRNGDGTSRKKKKKKKKKKKKK--------KKKKKKKKKKKKR 192
           + N D    K+ KK  ++ ++K        +++K +KK +K+R
Sbjct: 99  VVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRR 141


>gnl|CDD|204301 pfam09735, Nckap1, Membrane-associated apoptosis protein.
           Expression of this protein was found to be markedly
           reduced in patients with Alzheimer's disease. It is
           involved in the regulation of actin polymerisation in
           the brain as part of a WAVE2 signalling complex.
          Length = 1118

 Score = 28.2 bits (63), Expect = 3.3
 Identities = 19/74 (25%), Positives = 32/74 (43%), Gaps = 9/74 (12%)

Query: 3   HVSLTISQLINIIPITQSSPDASLLEEFRIPPDITAMFT---LSGFISSAIHGHGRSTAD 59
           +VS+ I+++I  + + Q+ P  S  E       ITA++T   L   +     G     + 
Sbjct: 752 YVSMDITRVIRNVLLQQTQPLDSHGE-----GTITALYTNWYLDNLLRQVSGGG-IVFSP 805

Query: 60  RQFYFVNSRPCEPS 73
               FV+S P E  
Sbjct: 806 AMKAFVSSPPEELG 819


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 27.3 bits (61), Expect = 3.3
 Identities = 8/31 (25%), Positives = 25/31 (80%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           D + R    ++K+++K+++K++K++K++K++
Sbjct: 6   DKSGRIIDIEEKREEKEREKEEKERKEEKEK 36



 Score = 26.9 bits (60), Expect = 5.3
 Identities = 9/33 (27%), Positives = 26/33 (78%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           R+  G     ++K+++K+++K++K++K++K+K+
Sbjct: 5   RDKSGRIIDIEEKREEKEREKEEKERKEEKEKE 37


>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein.  The PcfK-like protein family
           includes the Enterococcus faecalis PcfK protein, which
           is functionally uncharacterized. This family of proteins
           is found in bacteria and viruses. Proteins in this
           family are typically between 137 and 257 amino acids in
           length. There are two completely conserved residues (D
           and L) that may be functionally important.
          Length = 136

 Score = 27.3 bits (61), Expect = 3.4
 Identities = 9/28 (32%), Positives = 18/28 (64%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +RK+  K  ++++ +K +K+ KK KK  
Sbjct: 98  ARKEALKAYQQEELRKIQKRSKKSKKAE 125


>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p) and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
           gene from Saccharomyces cerevisiae. It is an essential
           yeast protein involved in pre-rRNA processing and
           ribosome assembly. Rrp7p contains an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 128

 Score = 27.2 bits (61), Expect = 3.5
 Identities = 13/26 (50%), Positives = 20/26 (76%)

Query: 171 KKKKKKKKKKKKKKKKKKKKKRLYIF 196
           K  +++KK+K+KKKKKKK+ +  Y F
Sbjct: 72  KAAEEEKKEKEKKKKKKKELEDFYRF 97



 Score = 26.5 bits (59), Expect = 5.8
 Identities = 14/43 (32%), Positives = 25/43 (58%), Gaps = 6/43 (13%)

Query: 157 NLRNGDGTSRKKKKKKKKKKKKKKKKKK------KKKKKKKRL 193
                 G + +++KK+K+KKKKKKK+ +       ++KKK+  
Sbjct: 65  AAGEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEEQ 107



 Score = 26.5 bits (59), Expect = 6.1
 Identities = 13/32 (40%), Positives = 21/32 (65%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKK 190
           R G     +  K  +++KK+K+KKKKKKK+ +
Sbjct: 61  RKGPAAGEEAGKAAEEEKKEKEKKKKKKKELE 92


>gnl|CDD|218078 pfam04419, 4F5, 4F5 protein family.  Members of this family are
           short proteins that are rich in aspartate, glutamate,
           lysine and arginine. Although the function of these
           proteins is unknown, they are found to be ubiquitously
           expressed.
          Length = 38

 Score = 25.2 bits (56), Expect = 3.6
 Identities = 11/27 (40%), Positives = 15/27 (55%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           +R++  KK  KKKKK  K + K   K 
Sbjct: 10  ARERNAKKAAKKKKKGAKSQLKAAAKA 36


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 28.1 bits (63), Expect = 3.7
 Identities = 11/45 (24%), Positives = 19/45 (42%), Gaps = 5/45 (11%)

Query: 153 MMFANLRNGDGTSR-----KKKKKKKKKKKKKKKKKKKKKKKKKR 192
              A    G G                +KKK++KK+++K+ K+ R
Sbjct: 155 EKEAAGSGGGGGGTWGLVGVPGHSSDSEKKKQRKKQRRKRSKELR 199


>gnl|CDD|197876 smart00792, Agouti, Agouti protein.  The agouti protein regulates
           pigmentation in the mouse hair follicle producing a
           black hair with a subapical yellow band. A highly
           homologous protein agouti signal protein (ASIP) is
           present in humans and is expressed at highest levels in
           adipose tissue where it may play a role in energy
           homeostasis and possibly human pigmentation.
          Length = 124

 Score = 27.2 bits (60), Expect = 3.7
 Identities = 7/23 (30%), Positives = 12/23 (52%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKK 184
              S ++ +KK  +KK+KK    
Sbjct: 55  KKISAEEAEKKLLQKKEKKALTN 77



 Score = 26.8 bits (59), Expect = 4.3
 Identities = 10/28 (35%), Positives = 16/28 (57%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
            KK KK   ++ +KK  +KK+KK    +
Sbjct: 51  NKKPKKISAEEAEKKLLQKKEKKALTNV 78



 Score = 26.4 bits (58), Expect = 6.1
 Identities = 11/31 (35%), Positives = 18/31 (58%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           G ++K KK   ++ +KK  +KK+KK     L
Sbjct: 49  GLNKKPKKISAEEAEKKLLQKKEKKALTNVL 79


>gnl|CDD|215494 PLN02915, PLN02915, cellulose synthase A [UDP-forming], catalytic
           subunit.
          Length = 1044

 Score = 28.0 bits (62), Expect = 3.8
 Identities = 16/44 (36%), Positives = 22/44 (50%), Gaps = 6/44 (13%)

Query: 163 GTSRKKKKKKKKKKKKKKK------KKKKKKKKKKRLYIFKKHT 200
           G  R+ K KK KK KK ++      KK+KKK       + KK+ 
Sbjct: 604 GGGRRGKSKKSKKGKKGRRSLLGGLKKRKKKGGGGGSMMGKKYG 647


>gnl|CDD|221161 pfam11661, DUF2986, Protein of unknown function (DUF2986).  This
           family of proteins has no known function.
          Length = 44

 Score = 25.3 bits (56), Expect = 3.8
 Identities = 15/33 (45%), Positives = 18/33 (54%), Gaps = 1/33 (3%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFK 197
           +RKKK  +K KKK KK   K     K + YI K
Sbjct: 2   NRKKKINQKLKKKAKKANAKLHPSNKPK-YISK 33


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 27.3 bits (61), Expect = 3.8
 Identities = 8/28 (28%), Positives = 25/28 (89%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
           RK++++KK +++ K+K+++++ +++KR+
Sbjct: 30  RKQQRRKKAQEEAKEKEREERIEERKRI 57


>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
          Length = 64

 Score = 26.0 bits (58), Expect = 3.9
 Identities = 9/29 (31%), Positives = 21/29 (72%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLY 194
           R+++  +K  +K+K+KK   +K+++K+L 
Sbjct: 33  RRREFYEKPSEKRKRKKAAARKRRRKKLA 61



 Score = 25.2 bits (56), Expect = 7.6
 Identities = 9/25 (36%), Positives = 16/25 (64%)

Query: 170 KKKKKKKKKKKKKKKKKKKKKKRLY 194
           K  +K+K+KK   +K+++KK  R  
Sbjct: 40  KPSEKRKRKKAAARKRRRKKLAREE 64


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 27.9 bits (62), Expect = 4.0
 Identities = 12/27 (44%), Positives = 12/27 (44%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKK 190
              KK  KK  KK  K  KK  KK  K
Sbjct: 250 KKAKKTAKKALKKAAKAVKKAAKKAAK 276



 Score = 27.2 bits (60), Expect = 5.8
 Identities = 12/25 (48%), Positives = 12/25 (48%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
            KK  KK  KK  K  KK  KK  K
Sbjct: 252 AKKTAKKALKKAAKAVKKAAKKAAK 276



 Score = 27.2 bits (60), Expect = 6.9
 Identities = 12/28 (42%), Positives = 15/28 (53%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           ++K KK  KK  KK  K  KK  KK  +
Sbjct: 249 AKKAKKTAKKALKKAAKAVKKAAKKAAK 276



 Score = 27.2 bits (60), Expect = 7.1
 Identities = 11/32 (34%), Positives = 13/32 (40%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
               S+KK  K     KK  K   K  KK K+
Sbjct: 223 AKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKK 254



 Score = 26.8 bits (59), Expect = 8.0
 Identities = 10/27 (37%), Positives = 10/27 (37%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKK 191
               K K   KK  KKK  K     KK
Sbjct: 214 KASAKAKSAAKKVSKKKAAKTAVSAKK 240



 Score = 26.8 bits (59), Expect = 8.5
 Identities = 11/27 (40%), Positives = 13/27 (48%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKK 189
             ++K  KK  KK  K  KK  KK  K
Sbjct: 250 KKAKKTAKKALKKAAKAVKKAAKKAAK 276



 Score = 26.8 bits (59), Expect = 9.0
 Identities = 12/36 (33%), Positives = 14/36 (38%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKKHT 200
           ++     KK  K   K  KK KK  KK L    K  
Sbjct: 232 AKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAV 267



 Score = 26.8 bits (59), Expect = 9.1
 Identities = 12/28 (42%), Positives = 13/28 (46%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKK 190
             +  K  KK KK  KK  KK  K  KK
Sbjct: 242 AKTAAKAAKKAKKTAKKALKKAAKAVKK 269



 Score = 26.8 bits (59), Expect = 9.2
 Identities = 10/26 (38%), Positives = 10/26 (38%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           KK  K  KK  KK  K   K  K   
Sbjct: 261 KKAAKAVKKAAKKAAKAAAKAAKGAA 286



 Score = 26.8 bits (59), Expect = 9.6
 Identities = 10/28 (35%), Positives = 10/28 (35%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKK 190
                   KK  K   K  KK KK  KK
Sbjct: 231 AAKTAVSAKKAAKTAAKAAKKAKKTAKK 258



 Score = 26.8 bits (59), Expect = 9.8
 Identities = 12/24 (50%), Positives = 12/24 (50%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKK 191
            K  KK KK  KK  KK  K  KK
Sbjct: 246 AKAAKKAKKTAKKALKKAAKAVKK 269



 Score = 26.8 bits (59), Expect = 9.9
 Identities = 10/28 (35%), Positives = 11/28 (39%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKK 190
            +  KK   K K   KK  KKK  K   
Sbjct: 209 KSGAKKASAKAKSAAKKVSKKKAAKTAV 236



 Score = 26.8 bits (59), Expect = 9.9
 Identities = 10/30 (33%), Positives = 12/30 (40%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           G  +   K K   KK  KKK  K     K+
Sbjct: 211 GAKKASAKAKSAAKKVSKKKAAKTAVSAKK 240


>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
           topoisomerase IA [General function prediction only].
          Length = 298

 Score = 27.7 bits (62), Expect = 4.0
 Identities = 11/32 (34%), Positives = 12/32 (37%)

Query: 161 GDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           G    +  KK    K KK   KK   KK  K 
Sbjct: 259 GGPGKKPAKKATAAKAKKTTAKKAAAKKAAKT 290



 Score = 27.4 bits (61), Expect = 5.6
 Identities = 13/33 (39%), Positives = 13/33 (39%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           R   G   KK  KK    K KK   KK   KK 
Sbjct: 255 RRAKGGPGKKPAKKATAAKAKKTTAKKAAAKKA 287



 Score = 27.0 bits (60), Expect = 7.8
 Identities = 11/28 (39%), Positives = 13/28 (46%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
            ++K    K KK   KK   KK  K KK
Sbjct: 265 PAKKATAAKAKKTTAKKAAAKKAAKTKK 292



 Score = 26.6 bits (59), Expect = 8.3
 Identities = 13/27 (48%), Positives = 14/27 (51%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKK 190
           T+ K KK   KK   KK  K KK  KK
Sbjct: 270 TAAKAKKTTAKKAAAKKAAKTKKAAKK 296


>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
          Length = 177

 Score = 27.2 bits (61), Expect = 4.0
 Identities = 12/37 (32%), Positives = 18/37 (48%), Gaps = 9/37 (24%)

Query: 165 SRKKKKKKKKKKKKKKKKK---------KKKKKKKKR 192
            +KK   K+ K   K KKK         +++K+KKK 
Sbjct: 1   KKKKSSPKRSKGMAKSKKKTREELDAEARERKRKKKH 37



 Score = 27.2 bits (61), Expect = 4.2
 Identities = 17/50 (34%), Positives = 24/50 (48%), Gaps = 15/50 (30%)

Query: 159 RNGDGTSRKKKKKKKKKK------KKK---------KKKKKKKKKKKKRL 193
           R+ +G ++ K K + +KK      KK          K KKKK K KK +L
Sbjct: 45  RHNEGNTQSKGKGQAQKKDPRIGSKKPIPLGVEEKVKPKKKKPKSKKPKL 94



 Score = 26.4 bits (59), Expect = 9.2
 Identities = 11/37 (29%), Positives = 18/37 (48%), Gaps = 9/37 (24%)

Query: 165 SRKKKKKKKKKKKKKKKK---------KKKKKKKKKR 192
            +K   K+ K   K KKK         +++K+KKK +
Sbjct: 2   KKKSSPKRSKGMAKSKKKTREELDAEARERKRKKKHK 38


>gnl|CDD|215081 PLN00158, PLN00158, histone H2B; Provisional.
          Length = 116

 Score = 27.0 bits (59), Expect = 4.0
 Identities = 13/29 (44%), Positives = 17/29 (58%)

Query: 169 KKKKKKKKKKKKKKKKKKKKKKKRLYIFK 197
           KK  KK  K  KKK  K K +  ++YI+K
Sbjct: 7   KKPAKKAAKGAKKKGSKSKTETYKIYIYK 35


>gnl|CDD|241543 cd13392, PH_AKAP13, A-kinase anchoring protein 13 Pleckstrin
           homology (PH) domain.  The Rho-specific GEF activity of
           AKAP13 (also called Brx-1, AKAP-Lbc, and proto-Lbc)
           mediates signaling downstream of G-protein coupled
           receptors and Toll-like receptor 2. It plays a role in
           cell growth, cell development and actin fiber formation.
           Protein kinase A (PKA) binds and phosphorylates AKAP13,
           regulating its Rho-GEF activity. Alternative splicing of
           this gene in humans has at least 3 transcript variants
           encoding different isoforms (i.e. proto-/onco-Lymphoid
           blast crisis, Lbc and breast cancer nuclear
           receptor-binding auxiliary protein, Brx) containing a
           dbl oncogene homology (DH) domain and PH domain which
           are required for full transforming activity. The DH
           domain is associated with guanine nucleotide exchange
           activation while the PH domain has multiple functions
           including determine protein sub-cellular localisation
           via phosphoinositide interactions, while others bind
           protein partners. Other ligands include protein kinase C
           which is bound by the PH domain of AKAP13, serving to
           activate protein kinase D and mobilize a cardiac
           hypertrophy signaling pathway. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 157

 Score = 27.2 bits (60), Expect = 4.1
 Identities = 18/42 (42%), Positives = 26/42 (61%), Gaps = 2/42 (4%)

Query: 89  SDLFIIDQHATDEKYNFETL-QKTTVIKSQKLVVPQNLHLTK 129
           SD+ +  Q   D+KY F +L QK+TVI  +KL+V +  H  K
Sbjct: 79  SDILVFLQE-KDQKYVFASLDQKSTVISLKKLIVREVAHEEK 119


>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
           protein; Provisional.
          Length = 438

 Score = 27.8 bits (62), Expect = 4.1
 Identities = 16/36 (44%), Positives = 23/36 (63%), Gaps = 3/36 (8%)

Query: 160 NGDGTSRKKKKKKKKKKKKKK---KKKKKKKKKKKR 192
            G G ++K K KKK+KKKK+      + K+KKK+K 
Sbjct: 5   QGQGNNQKDKNKKKEKKKKESPPPPHEIKRKKKRKG 40


>gnl|CDD|240309 PTZ00189, PTZ00189, 60S ribosomal protein L21; Provisional.
          Length = 160

 Score = 27.3 bits (61), Expect = 4.2
 Identities = 6/27 (22%), Positives = 11/27 (40%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRL 193
            K+       K + KKK +    K++ 
Sbjct: 106 LKRVAANDAIKAEAKKKGELPSTKRQP 132


>gnl|CDD|215343 PLN02638, PLN02638, cellulose synthase A (UDP-forming), catalytic
           subunit.
          Length = 1079

 Score = 28.0 bits (62), Expect = 4.3
 Identities = 12/22 (54%), Positives = 12/22 (54%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKK 184
           G SRKK  K  KK   KKK  K
Sbjct: 660 GGSRKKSSKSSKKGSDKKKSGK 681


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 27.9 bits (62), Expect = 4.3
 Identities = 9/29 (31%), Positives = 13/29 (44%)

Query: 163  GTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
              +R +    KK  ++  KK   KK  KK
Sbjct: 1185 AAARGESGAAKKVSRQAPKKPAPKKTTKK 1213



 Score = 27.9 bits (62), Expect = 4.9
 Identities = 6/26 (23%), Positives = 13/26 (50%)

Query: 166  RKKKKKKKKKKKKKKKKKKKKKKKKK 191
            R+K ++   + +    KK  ++  KK
Sbjct: 1179 REKLQRAAARGESGAAKKVSRQAPKK 1204



 Score = 27.9 bits (62), Expect = 5.1
 Identities = 9/35 (25%), Positives = 12/35 (34%)

Query: 156  ANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKK 190
              L+           KK  ++  KK   KK  KK 
Sbjct: 1180 EKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKA 1214



 Score = 27.1 bits (60), Expect = 7.2
 Identities = 8/33 (24%), Positives = 11/33 (33%)

Query: 159  RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
                    +    KK  ++  KK   KK  KK 
Sbjct: 1182 LQRAAARGESGAAKKVSRQAPKKPAPKKTTKKA 1214


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 27.5 bits (61), Expect = 4.4
 Identities = 14/22 (63%), Positives = 16/22 (72%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKK 181
            GD +S KK KK KKKKKKK+ 
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKRS 217



 Score = 26.7 bits (59), Expect = 7.5
 Identities = 12/23 (52%), Positives = 13/23 (56%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKK 185
               K   KK KK KKKKKKK+ 
Sbjct: 195 YKGDKSSAKKDKKSKKKKKKKRS 217


>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
           This family consists of several eukaryotic ribosome
           biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
           protein that is essential for the maturation of 25 S
           rRNA and the 60 S ribosomal subunit assembly in
           Saccharomyces cerevisiae.
          Length = 164

 Score = 27.2 bits (61), Expect = 4.4
 Identities = 9/30 (30%), Positives = 18/30 (60%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKKKKKKKKKK 189
           +       KK+++KK++  K +K++ K KK
Sbjct: 135 DPKEDPFAKKRREKKERVAKNEKRELKNKK 164


>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein.  All proteins in this family
           for which functions are known are used for the
           recognition of DNA damage as part of nucleotide excision
           repair. This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 172

 Score = 27.1 bits (60), Expect = 4.5
 Identities = 11/35 (31%), Positives = 21/35 (60%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKKHT 200
           R++ K++ K+KK +KK K+ ++  +   Y  KK  
Sbjct: 103 REESKEEMKEKKFEKKLKELRRAVRSSEYTNKKEG 137


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 27.5 bits (61), Expect = 4.6
 Identities = 11/33 (33%), Positives = 19/33 (57%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           +K K++ KK+ +++ K K   + KKK     KK
Sbjct: 142 KKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKK 174



 Score = 27.5 bits (61), Expect = 5.6
 Identities = 10/28 (35%), Positives = 16/28 (57%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           ++KK  + KKK + + K K + K K K 
Sbjct: 164 AKKKAAEAKKKAEAEAKAKAEAKAKAKA 191



 Score = 26.7 bits (59), Expect = 8.3
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           +K+ +++ K K   + KKK  + KKK 
Sbjct: 149 KKQAEEEAKAKAAAEAKKKAAEAKKKA 175


>gnl|CDD|192718 pfam11176, DUF2962, Protein of unknown function (DUF2962).  This
           eukaryotic family of proteins has no known function.
          Length = 155

 Score = 27.2 bits (61), Expect = 4.6
 Identities = 8/34 (23%), Positives = 19/34 (55%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKKH 199
            +   +  K +KKK ++ +K+  +  RL+ F++ 
Sbjct: 24  NRATLRDDKLEKKKAERAEKRSLELDRLFFFQEQ 57



 Score = 26.4 bits (59), Expect = 7.0
 Identities = 10/28 (35%), Positives = 17/28 (60%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           SRK K+  +   +  K +KKK ++ +KR
Sbjct: 17  SRKAKQLNRATLRDDKLEKKKAERAEKR 44


>gnl|CDD|220440 pfam09848, DUF2075, Uncharacterized conserved protein (DUF2075).
           This domain, found in various prokaryotic proteins
           (including putative ATP/GTP binding proteins), has no
           known function.
          Length = 348

 Score = 27.6 bits (62), Expect = 4.6
 Identities = 8/37 (21%), Positives = 13/37 (35%)

Query: 159 RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYI 195
           R G         K + K   +K+K  K+  +     I
Sbjct: 291 RPGTDRIVFDGTKWEDKSAFQKRKDLKEDPEDADRLI 327


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 26.8 bits (60), Expect = 4.6
 Identities = 10/33 (30%), Positives = 20/33 (60%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           + K +KK+++KK + +K  K   ++K  Y  +K
Sbjct: 21  KAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEK 53


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 27.1 bits (61), Expect = 4.6
 Identities = 14/25 (56%), Positives = 20/25 (80%)

Query: 169 KKKKKKKKKKKKKKKKKKKKKKKRL 193
           K+KKKKKKKKK+ +  ++KKKK  +
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVV 106



 Score = 26.7 bits (60), Expect = 5.6
 Identities = 14/24 (58%), Positives = 19/24 (79%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKK 190
           K+KKKKKKKKK+ +  ++KKKK  
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYV 105



 Score = 26.7 bits (60), Expect = 5.6
 Identities = 14/24 (58%), Positives = 19/24 (79%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKK 191
           K+KKKKKKKKK+ +  ++KKKK  
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYV 105


>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria and eukaryotes.
           Proteins in this family are typically between 137 to 196
           amino acids in length.
          Length = 149

 Score = 26.9 bits (60), Expect = 4.7
 Identities = 14/28 (50%), Positives = 16/28 (57%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           SRKK  K  +K    K K K+KK K KR
Sbjct: 15  SRKKGVKALRKAAVAKSKDKQKKPKSKR 42



 Score = 26.9 bits (60), Expect = 5.8
 Identities = 11/28 (39%), Positives = 14/28 (50%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
            R +KK  K  +K    K K K+KK K 
Sbjct: 13  KRSRKKGVKALRKAAVAKSKDKQKKPKS 40


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 27.4 bits (61), Expect = 4.9
 Identities = 13/44 (29%), Positives = 25/44 (56%), Gaps = 4/44 (9%)

Query: 160 NGDGTSRKKKKKKKKKKKKKKKKKKKKK----KKKKRLYIFKKH 199
               +S + ++ KKK++ + KKKK+ ++    +KK   YI +K 
Sbjct: 79  ELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQ 122


>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV. 
          Length = 336

 Score = 27.5 bits (61), Expect = 5.3
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
            +  K + K +K K+ KK+KK+++K  L
Sbjct: 21  PRPVKDEAKPRKIKRVKKRKKREEKDEL 48



 Score = 27.1 bits (60), Expect = 5.9
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKKKK 188
           G    K + K +K K+ KK+KK+++K
Sbjct: 20  GPRPVKDEAKPRKIKRVKKRKKREEK 45


>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease.  The Slx4 protein is a
           heteromeric structure-specific endonuclease found in
           fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
           substrates, particularly simple-Y, 5'-flap, or
           replication fork structures by cleaving the strand
           bearing the 5' non-homologous arm at the branch junction
           and thus generating ligatable nicked products from
           5'-flap or replication fork substrates.
          Length = 627

 Score = 27.6 bits (61), Expect = 5.3
 Identities = 15/37 (40%), Positives = 23/37 (62%)

Query: 157 NLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRL 193
              +G+ + ++  KKKK KK K +K+ K+K KK K L
Sbjct: 60  QANSGNVSGKRVPKKKKIKKPKLRKRTKRKNKKIKSL 96


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 27.5 bits (62), Expect = 5.4
 Identities = 15/43 (34%), Positives = 24/43 (55%), Gaps = 3/43 (6%)

Query: 158 LRNGDGTSRKKKK---KKKKKKKKKKKKKKKKKKKKKRLYIFK 197
           L+ G   S K  +    +K+  K  +KK+KKKKK+K++    K
Sbjct: 596 LQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638


>gnl|CDD|236172 PRK08173, PRK08173, DNA topoisomerase III; Validated.
          Length = 862

 Score = 27.7 bits (62), Expect = 5.4
 Identities = 14/53 (26%), Positives = 18/53 (33%), Gaps = 5/53 (9%)

Query: 144 KNGFEF-----RITMMFANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           K GFEF     +              +  K +K   KK   KK   KK   +K
Sbjct: 808 KIGFEFEPREPKAAAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKTAARK 860


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 27.1 bits (60), Expect = 5.5
 Identities = 10/23 (43%), Positives = 21/23 (91%)

Query: 170 KKKKKKKKKKKKKKKKKKKKKKR 192
           ++K+K++KKK K+ K+++K++KR
Sbjct: 185 QQKEKEEKKKVKEAKRREKEEKR 207


>gnl|CDD|178587 PLN03013, PLN03013, cysteine synthase.
          Length = 429

 Score = 27.4 bits (60), Expect = 5.6
 Identities = 19/57 (33%), Positives = 32/57 (56%), Gaps = 2/57 (3%)

Query: 4  VSLTISQLINIIPITQSSPDASLLEEFRIPPDITAM-FTLSGFISSAIHGHGRSTAD 59
          ++ T S  + + P+T SS  +S L  FR  P+I+++ F+ +   S A+    RS AD
Sbjct: 34 MAATSSSALLLNPLTSSS-SSSTLRRFRCSPEISSLSFSSASDFSLAMKRQSRSFAD 89


>gnl|CDD|236295 PRK08570, rpl19e, 50S ribosomal protein L19e; Reviewed.
          Length = 150

 Score = 26.7 bits (60), Expect = 5.8
 Identities = 12/42 (28%), Positives = 22/42 (52%), Gaps = 7/42 (16%)

Query: 163 GTSRKKKKKKKKKKKK-------KKKKKKKKKKKKKRLYIFK 197
           G SR + +++ +K+KK        +K KK  +  KK  +I +
Sbjct: 57  GISRGRARERHEKRKKGRRRGPGSRKGKKGARTPKKERWINR 98


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 27.5 bits (62), Expect = 6.1
 Identities = 14/37 (37%), Positives = 20/37 (54%)

Query: 154 MFANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKK 190
               L+        ++K+KKK+KKK KKKK K  + K
Sbjct: 384 ELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGK 420



 Score = 26.8 bits (60), Expect = 8.7
 Identities = 13/30 (43%), Positives = 20/30 (66%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRLY 194
                ++K+KKK+KKK KKKK K  + ++Y
Sbjct: 393 VGVAPEQKEKKKEKKKNKKKKYKVPRGKIY 422


>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
           Members of the NSCC2 family have been sequenced from
           various yeast, fungal and animals species including
           Saccharomyces cerevisiae, Drosophila melanogaster and
           Homo sapiens. These proteins are the Sec62 proteins,
           believed to be associated with the Sec61 and Sec63
           constituents of the general protein secretary systems of
           yeast microsomes. They are also the non-selective cation
           (NS) channels of the mammalian cytoplasmic membrane. The
           yeast Sec62 protein has been shown to be essential for
           cell growth. The mammalian NS channel proteins has been
           implicated in platelet derived growth factor(PGDF)
           dependent single channel current in fibroblasts. These
           channels are essentially closed in serum deprived
           tissue-culture cells and are specifically opened by
           exposure to PDGF. These channels are reported to exhibit
           equal selectivity for Na+, K+ and Cs+ with low
           permeability to Ca2+, and no permeability to anions
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 232

 Score = 27.2 bits (60), Expect = 6.1
 Identities = 11/24 (45%), Positives = 13/24 (54%)

Query: 163 GTSRKKKKKKKKKKKKKKKKKKKK 186
               K   KKK K KK KKK+ K+
Sbjct: 206 HEKDKYSYKKKLKSKKLKKKQAKR 229



 Score = 26.8 bits (59), Expect = 6.9
 Identities = 13/22 (59%), Positives = 14/22 (63%)

Query: 171 KKKKKKKKKKKKKKKKKKKKKR 192
           K K   KKK K KK KKK+ KR
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229



 Score = 26.8 bits (59), Expect = 8.5
 Identities = 12/22 (54%), Positives = 14/22 (63%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKK 188
           K K   KKK K KK KKK+ K+
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229



 Score = 26.8 bits (59), Expect = 8.5
 Identities = 12/22 (54%), Positives = 14/22 (63%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKK 189
           K K   KKK K KK KKK+ K+
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229



 Score = 26.8 bits (59), Expect = 8.5
 Identities = 12/22 (54%), Positives = 14/22 (63%)

Query: 169 KKKKKKKKKKKKKKKKKKKKKK 190
           K K   KKK K KK KKK+ K+
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229



 Score = 26.8 bits (59), Expect = 8.5
 Identities = 12/22 (54%), Positives = 14/22 (63%)

Query: 170 KKKKKKKKKKKKKKKKKKKKKK 191
           K K   KKK K KK KKK+ K+
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229


>gnl|CDD|201249 pfam00472, RF-1, RF-1 domain.  This domain is found in peptide
           chain release factors such as RF-1 and RF-2, and a
           number of smaller proteins of unknown function. This
           domain contains the peptidyl-tRNA hydrolase activity.
           The domain contains a highly conserved motif GGQ, where
           the glutamine is thought to coordinate the water that
           mediates the hydrolysis.
          Length = 114

 Score = 26.4 bits (59), Expect = 6.1
 Identities = 9/31 (29%), Positives = 18/31 (58%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKRLYIF 196
             +KK++K K  +  + ++  +K+K R Y F
Sbjct: 75  ELQKKREKTKPTRASQVRRGDRKEKIRTYNF 105


>gnl|CDD|241400 cd13246, PH_Scd1, Shape and Conjugation Deficiency 1 Pleckstrin
           homology (PH) domain.  Fission yeast Scd1 is an exchange
           factor for Cdc42 and an effector of Ras1, the homolog of
           the human H-Ras. Scd2/Bem1 mediates Cdc42 activation by
           binding to Scd1/Cdc24 and to Cdc42. Ras1 regulates
           Scd1/Cdc24/Ral1, which is a putative guanine nucleotide
           exchange factor for Cdc42, a member of the Rho family of
           Ras-like proteins. Cdc42 then activates the Shk1/Orb2
           protein kinase. Scd1 interacts with Klp5 and Klp6
           kinesins to mediate cytokinesis. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 148

 Score = 26.5 bits (59), Expect = 6.2
 Identities = 15/29 (51%), Positives = 16/29 (55%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKRLYI 195
            KKKKKK   K K     KKKKKK  L +
Sbjct: 63  VKKKKKKSSLKSKSSSSSKKKKKKGPLQL 91


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 27.3 bits (60), Expect = 6.4
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKK 191
           R+KKK+K++  KK   K KK++K  K
Sbjct: 563 RRKKKRKRRAAKKAVTKAKKERKIGK 588



 Score = 27.3 bits (60), Expect = 7.2
 Identities = 11/26 (42%), Positives = 19/26 (73%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           K ++++KKK+K++  KK   K KK+R
Sbjct: 559 KNRERRKKKRKRRAAKKAVTKAKKER 584


>gnl|CDD|203441 pfam06421, LepA_C, GTP-binding protein LepA C-terminus.  This
           family consists of the C-terminal region of several pro-
           and eukaryotic GTP-binding LepA proteins.
          Length = 108

 Score = 26.0 bits (58), Expect = 6.4
 Identities = 13/28 (46%), Positives = 18/28 (64%), Gaps = 1/28 (3%)

Query: 156 ANLRNGDGTSRKKKKKKKKKKKKKKKKK 183
           A    GD  SRKKK  +K+K+ KK+ K+
Sbjct: 65  AKCYGGD-ISRKKKLLEKQKEGKKRMKQ 91


>gnl|CDD|217943 pfam04180, LTV, Low temperature viability protein.  The
           low-temperature viability protein LTV1 is involved in
           ribosome biogenesis 40S subunit production.
          Length = 426

 Score = 27.3 bits (60), Expect = 6.8
 Identities = 11/40 (27%), Positives = 23/40 (57%)

Query: 153 MMFANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           M+   L +     + +KK KK++ K+K+K   ++  K++R
Sbjct: 380 MLDEFLGSYLPKVQGRKKAKKREDKRKRKSALEELDKERR 419


>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
          Length = 212

 Score = 27.0 bits (60), Expect = 6.9
 Identities = 13/26 (50%), Positives = 17/26 (65%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKK 189
           T R+ K   K   KK+KKK+ K+KKK
Sbjct: 187 TLRQAKLNAKFVGKKEKKKQAKEKKK 212


>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2.  This
           family consists of several Eukaryotic rRNA processing
           protein EBP2 sequences. Ebp2p is required for the
           maturation of 25S rRNA and 60S subunit assembly. Ebp2p
           may be one of the target proteins of Rrs1p for executing
           the signal to regulate ribosome biogenesis. This family
           also plays a role in chromosome segregation.
          Length = 270

 Score = 26.9 bits (60), Expect = 7.1
 Identities = 9/33 (27%), Positives = 13/33 (39%)

Query: 157 NLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKK 189
           +    D      KKK +K    + K   K+K K
Sbjct: 187 DFLVDDAAKGGSKKKGRKGGAARGKPNAKRKAK 219


>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
          Length = 202

 Score = 26.6 bits (59), Expect = 7.2
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 164 TSRKKKKKKKKKKKKKKKKKKKKKKKK 190
             RK++     KKK K +K  K+ +KK
Sbjct: 157 EKRKERAAAYYKKKVKLRKAWKEARKK 183


>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
           protein 2.  THO and TREX form a eukaryotic complex which
           functions in messenger ribonucleoprotein metabolism and
           plays a role in preventing the transcription-associated
           genetic instability. Tho2, along with four other
           subunits forms THO.
          Length = 296

 Score = 26.9 bits (60), Expect = 7.6
 Identities = 16/55 (29%), Positives = 29/55 (52%)

Query: 144 KNGFEFRITMMFANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKK 198
           K  ++  I  +   ++  D +S    KKKK+KK+ K   KK +++ KK +   +K
Sbjct: 26  KELYDKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEK 80


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 27.1 bits (60), Expect = 8.0
 Identities = 5/26 (19%), Positives = 9/26 (34%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKKR 192
           K    K   + K   +K    K  ++
Sbjct: 224 KAAAAKAAAEAKAAAEKAAAAKAAEK 249



 Score = 26.7 bits (59), Expect = 8.3
 Identities = 5/25 (20%), Positives = 7/25 (28%)

Query: 167 KKKKKKKKKKKKKKKKKKKKKKKKK 191
            K    K   + K   +K    K  
Sbjct: 223 AKAAAAKAAAEAKAAAEKAAAAKAA 247


>gnl|CDD|239568 cd03486, MutL_Trans_MLH3, MutL_Trans_MLH3: transducer domain,
          having a ribosomal S5 domain 2-like fold, found in
          proteins similar to yeast and human MLH3 (MutL
          homologue 3). MLH3 belongs to the DNA mismatch repair
          (MutL/MLH1/PMS2) family. This transducer domain is
          homologous to the second domain of the DNA gyrase B
          subunit, which is known to be important in nucleotide
          hydrolysis and the transduction of structural signals
          from ATP-binding site to the DNA breakage/reunion
          regions of the enzymes. MLH1 forms heterodimers with
          MLH3. The MLH1-MLH3 complex plays a role in meiosis. A
          role for hMLH1-hMLH3 in DNA mismatch repair (MMR) has
          not been established. It has been suggested that hMLH3
          may be a low risk gene for colorectal cancer; however
          there is little evidence to support it having a role in
          classical HNPCC.
          Length = 141

 Score = 26.1 bits (58), Expect = 8.1
 Identities = 11/38 (28%), Positives = 18/38 (47%), Gaps = 4/38 (10%)

Query: 40 FTLSGFISSAIHGHGRSTADRQFYFVNSRPCEPSKVSG 77
          + +SG+ISS  H     +   QF +VN R    ++   
Sbjct: 28 YEVSGYISSEGHY----SKSFQFIYVNGRLYLKTRFHK 61


>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 158 and 187 amino
           acids in length. This family includes the PcfB protein.
          Length = 137

 Score = 26.1 bits (58), Expect = 8.2
 Identities = 14/27 (51%), Positives = 20/27 (74%)

Query: 166 RKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           RKK KK K+++  KKK++K K K KK+
Sbjct: 111 RKKLKKFKEEQAAKKKERKDKVKNKKK 137


>gnl|CDD|240407 PTZ00412, PTZ00412, leucyl aminopeptidase; Provisional.
          Length = 569

 Score = 26.9 bits (59), Expect = 9.0
 Identities = 11/44 (25%), Positives = 22/44 (50%)

Query: 149 FRITMMFANLRNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           F + ++    R+     RK  KK +  KK  K  +KK+  ++++
Sbjct: 525 FGVQLLVDYFRHNKPPKRKGIKKAQNAKKDLKVVQKKRHGRRRK 568


>gnl|CDD|143227 cd05750, Ig_Pro_neuregulin, Immunoglobulin (Ig)-like domain in
           neuregulins (NRGs).  Ig_Pro_neuregulin: immunoglobulin
           (Ig)-like domain in neuregulins (NRGs). NRGs are
           signaling molecules, which participate in cell-cell
           interactions in the nervous system, breast, heart, and
           other organ systems, and are implicated in the pathology
           of diseases including schizophrenia, multiple sclerosis,
           and breast cancer. There are four members of the
           neuregulin gene family (NRG1, -2, -3, and -4). The NRG-1
           protein, binds to and activates the tyrosine kinases
           receptors ErbB3 and ErbB4, initiating signaling
           cascades. The other NRGs proteins bind one or the other
           or both of these ErbBs. NRG-1 has multiple functions;
           for example, in the brain it regulates various processes
           such as radial glia formation and neuronal migration,
           dendritic development, and expression of
           neurotransmitters receptors; in the peripheral nervous
           system NRG-1 regulates processes such as target cell
           differentiation, and Schwann cell survival. There are
           many NRG-1 isoforms, which arise from the alternative
           splicing of mRNA. Less is known of the functions of the
           other NRGs. NRG-2 and -3 are expressed predominantly in
           the nervous system. NRG-2 is expressed by motor neurons
           and terminal Schwann cells, and is concentrated near
           synaptic sites and may be a signal that regulates
           synaptic differentiation. NRG-4 has been shown to direct
           pancreatic islet cell development towards the delta-cell
           lineage.
          Length = 75

 Score = 25.2 bits (55), Expect = 9.2
 Identities = 11/30 (36%), Positives = 15/30 (50%)

Query: 168 KKKKKKKKKKKKKKKKKKKKKKKKRLYIFK 197
           K  K+  +K K +  K + KKK   L I K
Sbjct: 20  KDGKELNRKNKPRNIKIRNKKKNSELQINK 49


>gnl|CDD|223747 COG0675, COG0675, Transposase and inactivated derivatives [DNA
           replication, recombination, and repair].
          Length = 364

 Score = 26.5 bits (58), Expect = 9.3
 Identities = 11/36 (30%), Positives = 20/36 (55%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIFKKHT 200
           SRKK + K++KK K K  + +++ + +R    K   
Sbjct: 217 SRKKSRSKRRKKAKLKLARLRERIRNRRKDFHKLAK 252


>gnl|CDD|222466 pfam13945, NST1, Splicing factor, salt tolerance regulator.  NST1
           is a family of proteins that seem to be involved,
           directly or indirectly, in the salt sensitivity of some
           cellular functions in yeast. These proteins also
           interact with the splicing factor Msl1p.
          Length = 189

 Score = 26.4 bits (58), Expect = 9.3
 Identities = 10/20 (50%), Positives = 12/20 (60%)

Query: 161 GDGTSRKKKKKKKKKKKKKK 180
            D +S K KKKKKK+ K   
Sbjct: 26  NDSSSSKSKKKKKKRSKATS 45


>gnl|CDD|217512 pfam03359, GKAP, Guanylate-kinase-associated protein (GKAP)
           protein. 
          Length = 342

 Score = 26.7 bits (59), Expect = 9.3
 Identities = 8/45 (17%), Positives = 11/45 (24%), Gaps = 4/45 (8%)

Query: 151 ITMMFANL----RNGDGTSRKKKKKKKKKKKKKKKKKKKKKKKKK 191
           +   F  L     N           KK  K    K    ++K   
Sbjct: 253 VDKKFDELQQLKANDWQKKVPPPVPKKPAKGPIVKPAISREKSLD 297


>gnl|CDD|188606 TIGR04091, LTA_dltB, D-alanyl-lipoteichoic acid biosynthesis
           protein DltB.  Members of this protein family are DltB,
           part of a four-gene operon for D-alanyl-lipoteichoic
           acid biosynthesis that is present in the vast majority
           of low-GC Gram-positive organisms. This protein may be
           involved in transport of D-alanine across the plasma
           membrane [Cell envelope, Biosynthesis and degradation of
           murein sacculus and peptidoglycan].
          Length = 380

 Score = 26.7 bits (60), Expect = 9.3
 Identities = 12/21 (57%), Positives = 13/21 (61%)

Query: 176 KKKKKKKKKKKKKKKKRLYIF 196
           +K KK KK KK K  K L IF
Sbjct: 343 RKSKKYKKNKKNKWTKALSIF 363


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 26.7 bits (59), Expect = 9.4
 Identities = 9/26 (34%), Positives = 18/26 (69%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKK 190
             +  ++ K++ +KK+K KKK +K+K
Sbjct: 343 EDEDDEESKEEVEKKQKVKKKPRKRK 368



 Score = 26.7 bits (59), Expect = 9.6
 Identities = 10/31 (32%), Positives = 19/31 (61%)

Query: 162 DGTSRKKKKKKKKKKKKKKKKKKKKKKKKKR 192
           D     +  ++ K++ +KK+K KKK +K+K 
Sbjct: 339 DEDDEDEDDEESKEEVEKKQKVKKKPRKRKV 369


>gnl|CDD|188624 cd09904, H3TH_XPG, H3TH domain of Xeroderma pigmentosum
           complementation group G (XPG) nuclease, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease.  The Xeroderma pigmentosum complementation
           group G (XPG) nuclease plays a central role in
           nucleotide excision repair (NER) in cleaving DNA bubble
           structures or loops. XPG is a member of the
           structure-specific, 5' nuclease family that catalyzes
           hydrolysis of DNA duplex-containing nucleic acid
           structures during DNA replication, repair, and
           recombination.  Members of this subgroup include the
           H3TH (helix-3-turn-helix) domains of XPG and other
           similar eukaryotic 5' nucleases. These nucleases contain
           a PIN (PilT N terminus) domain with a helical arch/clamp
           region/I domain (not included here) and inserted within
           the PIN domain is an atypical helix-hairpin-helix-2
           (HhH2)-like region. This atypical HhH2 region, the H3TH
           domain, has an extended loop with at least three turns
           between the first two helices, and only three of the
           four helices appear to be conserved. Both the H3TH
           domain and the helical arch/clamp region are involved in
           DNA binding.  Studies suggest that a glycine-rich loop
           in the H3TH domain contacts the phosphate backbone of
           the template strand in the downstream DNA duplex. These
           nucleases have a carboxylate rich active site that is
           involved in binding essential divalent metal ion
           cofactors (Mg2+ or Mn2+) required for nuclease activity.
           The first metal binding site is composed entirely of
           Asp/Glu residues from the PIN domain, whereas, the
           second metal binding site is composed generally of two
           Asp residues from the PIN domain and one Asp residue
           from the H3TH domain. Together with the helical arch and
           network of amino acids interacting with metal binding
           ions, the H3TH region defines a positively charged
           active-site DNA-binding groove in structure-specific 5'
           nucleases.
          Length = 97

 Score = 25.7 bits (57), Expect = 9.5
 Identities = 7/32 (21%), Positives = 14/32 (43%)

Query: 165 SRKKKKKKKKKKKKKKKKKKKKKKKKKRLYIF 196
              + +K +     K++ K+K K   K L + 
Sbjct: 47  ENAQPEKSEDSDNDKQEFKRKHKNYLKNLILP 78


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.134    0.378 

Gapped
Lambda     K      H
   0.267   0.0707    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,073,158
Number of extensions: 957624
Number of successful extensions: 15423
Number of sequences better than 10.0: 1
Number of HSP's gapped: 10375
Number of HSP's successfully gapped: 1651
Length of query: 200
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 108
Effective length of database: 6,857,034
Effective search space: 740559672
Effective search space used: 740559672
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 56 (25.4 bits)