RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7145
         (191 letters)



>gnl|CDD|215741 pfam00135, COesterase, Carboxylesterase family. 
          Length = 510

 Score = 99.3 bits (248), Expect = 1e-24
 Identities = 37/110 (33%), Positives = 46/110 (41%), Gaps = 20/110 (18%)

Query: 23  VARTKYGPLRGILIQ--NPPVEAYLGVPYATPPLGSLRYMPPVTPSTWRAPRFADTYSAV 80
           V  T YG +RG+ +     PV A+LG+PYA PP+G LR+  P  P  W     A  Y   
Sbjct: 2   VVETSYGKVRGLRVTVDGGPVYAFLGIPYAKPPVGELRFKKPQPPEPWTGVLDATKYGPA 61

Query: 81  CPQRLPDIGNRTEALLQLPRGRLVFLEKLLPLLSNQSEDCLYLNLYVPRP 130
           CPQ                                 SEDCLYLN+Y P+ 
Sbjct: 62  CPQNNDLGSEMWNKNT------------------GMSEDCLYLNVYTPKL 93


>gnl|CDD|238191 cd00312, Esterase_lipase, Esterases and lipases (includes fungal
           lipases, cholinesterases, etc.)  These enzymes act on
           carboxylic esters (EC: 3.1.1.-). The catalytic apparatus
           involves three residues (catalytic triad): a serine, a
           glutamate or aspartate and a histidine.These catalytic
           residues are responsible for the nucleophilic attack on
           the carbonyl carbon atom of the ester bond. In contrast
           with other alpha/beta hydrolase fold family members,
           p-nitrobenzyl esterase and acetylcholine esterase have a
           Glu instead of Asp at the active site carboxylate.
          Length = 493

 Score = 88.5 bits (220), Expect = 8e-21
 Identities = 34/109 (31%), Positives = 46/109 (42%), Gaps = 20/109 (18%)

Query: 23  VARTKYGPLRGILIQNPPVEAYLGVPYATPPLGSLRYMPPVTPSTWRAPRFADTYSAVCP 82
           +  T  G +RG+      V ++LG+PYA PP+G LR+  P     W     A +Y   C 
Sbjct: 1   LVVTPNGKVRGVD--EGGVYSFLGIPYAEPPVGDLRFKEPQPYEPWSDVLDATSYPPSCM 58

Query: 83  QRLPDIGNRTEALLQLPRGRLVFLEKLLPLLSNQSEDCLYLNLYVPRPV 131
           Q     G        L   +L             SEDCLYLN+Y P+  
Sbjct: 59  QWDQLGG-------GLWNAKLPG-----------SEDCLYLNVYTPKNT 89


>gnl|CDD|225181 COG2272, PnbA, Carboxylesterase type B [Lipid metabolism].
          Length = 491

 Score = 86.7 bits (215), Expect = 3e-20
 Identities = 36/109 (33%), Positives = 48/109 (44%), Gaps = 22/109 (20%)

Query: 20  STRVARTKYGPLRGILIQNPPVEAYLGVPYATPPLGSLRYMPPVTPSTWRAPRFADTYSA 79
              VA T  G + GI +    V ++LG+PYA PP+G LR+  PV P  W   R A  +  
Sbjct: 1   RAPVAETTTGKVEGITVNG--VHSWLGIPYAAPPVGELRFRRPVPPEPWSGVRDATQFGP 58

Query: 80  VCPQRLPDIGNRTEALLQLPRGRLVFLEKLLPLLSNQSEDCLYLNLYVP 128
            CPQ             ++  G               SEDCLYLN++ P
Sbjct: 59  ACPQ----------PFNRMGSGEDF----------TGSEDCLYLNIWAP 87


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 54.6 bits (132), Expect = 4e-09
 Identities = 19/56 (33%), Positives = 29/56 (51%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
           RP      +K+  K   K   K+ +KK+K+K+K K KK+ + TK       PSG +
Sbjct: 377 RPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTS 432



 Score = 48.4 bits (116), Expect = 5e-07
 Identities = 19/55 (34%), Positives = 28/55 (50%), Gaps = 1/55 (1%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKK-KKKKKKKKKTKKFLPPSDPS 181
           P+  A   +K  K   K   K+ +KK+K+K+K K KK+ +  K   K   PS  S
Sbjct: 378 PKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTS 432



 Score = 45.3 bits (108), Expect = 6e-06
 Identities = 13/41 (31%), Positives = 21/41 (51%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           L  + K   + K     KK   K+ +KK+K+K+K K K + 
Sbjct: 376 LRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRH 416



 Score = 44.9 bits (107), Expect = 7e-06
 Identities = 18/52 (34%), Positives = 27/52 (51%)

Query: 122 YLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           Y+   +   V  E   K K  ++ K  K  KK   K+ +KK+K+K+K K KK
Sbjct: 363 YIEEPLKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKK 414


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 45.0 bits (107), Expect = 7e-06
 Identities = 18/35 (51%), Positives = 26/35 (74%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           K+K     KKK+++KK +K+KKKKK+KKK K +K 
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKK 410



 Score = 43.8 bits (104), Expect = 2e-05
 Identities = 21/38 (55%), Positives = 28/38 (73%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E KEK     K K+++KK +K+KKKKK+KKK KK+ KK
Sbjct: 374 EIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKK 411



 Score = 43.0 bits (102), Expect = 3e-05
 Identities = 19/31 (61%), Positives = 25/31 (80%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           KKK + K  +K+KKKKK+KKK KK+KKK +K
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 41.9 bits (99), Expect = 7e-05
 Identities = 17/31 (54%), Positives = 26/31 (83%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
           KK++++ K + +KKKKK+KKK KK+KKK +K
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 41.9 bits (99), Expect = 8e-05
 Identities = 21/38 (55%), Positives = 28/38 (73%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           EK  K   K + +KK +K+KKKKK+KKK KK+KKK +K
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 40.3 bits (95), Expect = 3e-04
 Identities = 19/38 (50%), Positives = 28/38 (73%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           ++ E+ K K     KKK+++KK +K+KKKKK+KKK KK
Sbjct: 370 KRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKK 407



 Score = 39.6 bits (93), Expect = 5e-04
 Identities = 18/36 (50%), Positives = 27/36 (75%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           +E K+   K  KKK+++KK +K+KKKKK+KKK  K+
Sbjct: 373 EEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKR 408



 Score = 34.6 bits (80), Expect = 0.022
 Identities = 15/28 (53%), Positives = 20/28 (71%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKK 161
             EKK +K+ K K +KKK KK+KKK +K
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 34.2 bits (79), Expect = 0.035
 Identities = 17/35 (48%), Positives = 25/35 (71%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKK 162
           P+P   + E+K+ +K K K K+KKK KK+KKK +K
Sbjct: 380 PKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 31.9 bits (73), Expect = 0.19
 Identities = 16/49 (32%), Positives = 28/49 (57%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
           G++ +++ NK   + K+K  K  KKK+++KK +K+K KK         K
Sbjct: 361 GDELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRK 409



 Score = 29.2 bits (66), Expect = 1.3
 Identities = 12/37 (32%), Positives = 18/37 (48%)

Query: 153 KKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
           K+K  K  KKK+++KK + +K        GK   K  
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKG 412


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 43.6 bits (103), Expect = 2e-05
 Identities = 21/37 (56%), Positives = 29/37 (78%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           KK+KK++KN   KKK +KKKKKKK+KK+ K + +TK 
Sbjct: 55  KKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKL 91



 Score = 41.2 bits (97), Expect = 1e-04
 Identities = 21/47 (44%), Positives = 27/47 (57%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
           E +KK +KK K K +KK+ K + + K   K  KK KKTKK  P   P
Sbjct: 65  ESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP 111



 Score = 36.2 bits (84), Expect = 0.006
 Identities = 17/36 (47%), Positives = 26/36 (72%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
             +K K ++KN + KKK +KKKKKKK+KK+ K + +
Sbjct: 53  STKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGE 88



 Score = 35.8 bits (83), Expect = 0.009
 Identities = 21/54 (38%), Positives = 29/54 (53%), Gaps = 4/54 (7%)

Query: 124 NLYVPRPVALEGEKKEKKKNKNKNKKKKKKKK----KKKKKKKKKKKKKKKTKK 173
            L + + +     ++E K      KK KK+ K    KKK +KKKKKKK+KK  K
Sbjct: 31  VLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPK 84



 Score = 35.1 bits (81), Expect = 0.015
 Identities = 17/37 (45%), Positives = 22/37 (59%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           +KKEKK+ K++ + K   K  KK KK KKK  K K  
Sbjct: 76  KKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPN 112



 Score = 34.7 bits (80), Expect = 0.022
 Identities = 14/38 (36%), Positives = 18/38 (47%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           +KK++KK      + K   K  KK KK KKK  K    
Sbjct: 75  KKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPN 112



 Score = 33.9 bits (78), Expect = 0.040
 Identities = 13/41 (31%), Positives = 21/41 (51%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           + ++K++ K++ + K   K  KK KK KKK  K K      
Sbjct: 76  KKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVD 116



 Score = 32.8 bits (75), Expect = 0.093
 Identities = 15/38 (39%), Positives = 16/38 (42%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
             + EKKE K         K  KK KK KKK  K K  
Sbjct: 75  KKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPN 112



 Score = 32.0 bits (73), Expect = 0.15
 Identities = 14/54 (25%), Positives = 22/54 (40%), Gaps = 12/54 (22%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK------------KKKKKKTKKFL 175
           + E K + + K   K  KK KK KKK  K K             +  +K+  ++
Sbjct: 80  KKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNVYI 133


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 43.1 bits (102), Expect = 2e-05
 Identities = 20/62 (32%), Positives = 30/62 (48%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
           P + + E+ EK+  + +  KKKK+ K K  K+  KKKKKK       P   + +   KS 
Sbjct: 66  PESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSE 125

Query: 190 EQ 191
             
Sbjct: 126 RI 127



 Score = 34.3 bits (79), Expect = 0.025
 Identities = 19/58 (32%), Positives = 26/58 (44%), Gaps = 6/58 (10%)

Query: 130 PVALEGEKKEKKKNKNKNK--KKKKKKKKKKK----KKKKKKKKKKKTKKFLPPSDPS 181
              L+ E++ KKK + K K  K+  KKKKKK     K  K    + K K       P+
Sbjct: 75  EKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPT 132



 Score = 27.7 bits (62), Expect = 3.1
 Identities = 12/34 (35%), Positives = 23/34 (67%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           + KE    + K ++ ++KK + K +K+K+KKK+K
Sbjct: 148 QNKEATHERLKEREIRRKKIQAKARKRKEKKKEK 181


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 41.9 bits (99), Expect = 6e-05
 Identities = 19/72 (26%), Positives = 26/72 (36%)

Query: 102 RLVFLEKLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKK 161
            L  L        ++                   E + +KK K+K + KKKK K+ K  K
Sbjct: 211 LLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVK 270

Query: 162 KKKKKKKKKTKK 173
             KK   K  KK
Sbjct: 271 ALKKVVAKGMKK 282



 Score = 37.3 bits (87), Expect = 0.002
 Identities = 22/75 (29%), Positives = 38/75 (50%), Gaps = 7/75 (9%)

Query: 109 LLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNK-------NKNKKKKKKKKKKKKKK 161
           L P L  +    L +  + P    L+  KK+K++ +       ++ +KK+K K++ KKKK
Sbjct: 203 LPPDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKK 262

Query: 162 KKKKKKKKKTKKFLP 176
            K+ K  K  KK + 
Sbjct: 263 PKESKGVKALKKVVA 277



 Score = 26.9 bits (60), Expect = 6.4
 Identities = 15/53 (28%), Positives = 30/53 (56%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
           E K   K   ++KKKK++ ++  +  + + +KK+K+K+ +    P    GVK+
Sbjct: 219 EFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKA 271


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 41.6 bits (98), Expect = 7e-05
 Identities = 18/56 (32%), Positives = 31/56 (55%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           +++EK K   K K KK  K K ++K+K K++K++K  + L    P  +   K R +
Sbjct: 45  KEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLR 100



 Score = 38.9 bits (91), Expect = 7e-04
 Identities = 15/40 (37%), Positives = 25/40 (62%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E E +EK++ K K   K K KK  K K ++K+K K++ ++
Sbjct: 39  EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEE 78



 Score = 31.2 bits (71), Expect = 0.23
 Identities = 11/33 (33%), Positives = 12/33 (36%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
            EK K +   K  KKKK K K K          
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDDD 224



 Score = 31.2 bits (71), Expect = 0.27
 Identities = 11/33 (33%), Positives = 12/33 (36%)

Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
             K K +K  K  KKKK K K K          
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDDD 224



 Score = 30.8 bits (70), Expect = 0.35
 Identities = 10/37 (27%), Positives = 12/37 (32%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           K K +   K  KKKK K K K               +
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDDYDGY 230



 Score = 27.7 bits (62), Expect = 3.2
 Identities = 9/40 (22%), Positives = 19/40 (47%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
                A    K + KK      ++K+K K++K++K  ++ 
Sbjct: 45  KEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84



 Score = 26.9 bits (60), Expect = 7.1
 Identities = 13/34 (38%), Positives = 16/34 (47%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           KK +   N    +K K +K  K  KKKK K   K
Sbjct: 181 KKVSSTLNVLINEKLKAEKAAKGGKKKKGKAKAK 214



 Score = 26.5 bits (59), Expect = 7.9
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           E E  + K + ++ + ++K+++K K   K K KK  K K
Sbjct: 27  EDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAK 65


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 41.2 bits (97), Expect = 8e-05
 Identities = 24/40 (60%), Positives = 28/40 (70%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E EKKEKKK K   K+KK+KK KK+K  + K  KKKK KK
Sbjct: 151 EEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKK 190



 Score = 39.3 bits (92), Expect = 4e-04
 Identities = 20/47 (42%), Positives = 28/47 (59%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
                  E E++EKK+ K K + KK+KK+KK KK+K  + K  K KK
Sbjct: 141 AKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKK 187



 Score = 38.5 bits (90), Expect = 8e-04
 Identities = 20/64 (31%), Positives = 35/64 (54%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGV 186
                  E E   K + + + ++++KK+KKKKK+ KK+KK+KK  K+ +     S K   
Sbjct: 129 GSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188

Query: 187 KSRE 190
           K ++
Sbjct: 189 KKKK 192



 Score = 32.0 bits (73), Expect = 0.13
 Identities = 16/44 (36%), Positives = 27/44 (61%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           P  L  E +  +K      +K+ + ++++KK+KKKKK+ KK KK
Sbjct: 125 PSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKK 168



 Score = 29.7 bits (67), Expect = 0.77
 Identities = 17/46 (36%), Positives = 29/46 (63%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           P  +  E E  EK+      K+ + ++++KK+KKKKK+ KK+K +K
Sbjct: 125 PSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEK 170



 Score = 27.4 bits (61), Expect = 3.9
 Identities = 13/44 (29%), Positives = 23/44 (52%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           P     E   + +   K    K +K+ + ++++KK+KKKKK  K
Sbjct: 121 PDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVK 164


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 42.1 bits (100), Expect = 9e-05
 Identities = 18/78 (23%), Positives = 28/78 (35%), Gaps = 16/78 (20%)

Query: 97  QLPRG---RLVFLEKLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKK 153
            LP+G   R   +E+ L LL+              +P  L        K+    K   K 
Sbjct: 795 TLPKGRETRAPTVEEALELLAE-------------KPEKLRYLADAPAKDPAGKKAAVKF 841

Query: 154 KKKKKKKKKKKKKKKKKT 171
            +K K++    +K  K T
Sbjct: 842 SRKTKQQYVASEKDGKAT 859



 Score = 41.7 bits (99), Expect = 1e-04
 Identities = 11/43 (25%), Positives = 17/43 (39%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           + L  EK EK +       K    KK   K  +K K++    +
Sbjct: 811 LELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASE 853



 Score = 34.4 bits (80), Expect = 0.028
 Identities = 13/42 (30%), Positives = 17/42 (40%), Gaps = 3/42 (7%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKK---KTKKFLPPSDPSGKA 184
           K +K +       K    KK   K   KTK+    S+  GKA
Sbjct: 817 KPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 40.0 bits (94), Expect = 1e-04
 Identities = 20/49 (40%), Positives = 30/49 (61%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
            K E+K  K + K++KKK+KKKKKKK KK  KK++ +      + S + 
Sbjct: 77  RKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEE 125



 Score = 37.7 bits (88), Expect = 0.001
 Identities = 19/37 (51%), Positives = 27/37 (72%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           E+K++K  +   KK+ K++KKK+KKKKKKK KK   K
Sbjct: 73  EEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKK 109



 Score = 37.3 bits (87), Expect = 0.001
 Identities = 21/50 (42%), Positives = 32/50 (64%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
           +K+E+KK K++ K  KK+ K++KKK+KKKKKKK K        + S  + 
Sbjct: 70  QKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSE 119



 Score = 35.4 bits (82), Expect = 0.005
 Identities = 18/37 (48%), Positives = 31/37 (83%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           ++++++ K K+++K  KK+ K++KKK+KKKKKKK KK
Sbjct: 69  QQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKK 105



 Score = 34.6 bits (80), Expect = 0.011
 Identities = 17/38 (44%), Positives = 27/38 (71%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           ++K ++K +   +K  KK+ K++KKK+KKKKKKK  K 
Sbjct: 69  QQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKG 106



 Score = 33.1 bits (76), Expect = 0.034
 Identities = 17/40 (42%), Positives = 29/40 (72%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E E +E ++ + + K+K ++K  KK+ K++KKK+KKK KK
Sbjct: 62  ETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKK 101



 Score = 31.2 bits (71), Expect = 0.15
 Identities = 18/56 (32%), Positives = 24/56 (42%), Gaps = 3/56 (5%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
            K++KKK K   KKKKKK KK  KK++K+  K  +            K        
Sbjct: 88  AKRQKKKQK---KKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEIM 140


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 39.4 bits (93), Expect = 2e-04
 Identities = 15/34 (44%), Positives = 21/34 (61%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           KKE K  +   KK +K K K +KKK KK++ K +
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 38.3 bits (90), Expect = 7e-04
 Identities = 16/42 (38%), Positives = 25/42 (59%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           +LE    +KK+ K   K +KK +K K K +KKK KK++   +
Sbjct: 57  SLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 36.0 bits (84), Expect = 0.003
 Identities = 17/43 (39%), Positives = 25/43 (58%), Gaps = 3/43 (6%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           AL  +K+ K   K    +KK +K K K +KKK KK++ K + F
Sbjct: 61  ALLDKKELKAWEK---AEKKAEKAKAKAEKKKAKKEEPKPRLF 100



 Score = 35.6 bits (83), Expect = 0.006
 Identities = 12/33 (36%), Positives = 17/33 (51%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
           KK+ K  +K +KK +K K K +K K       P
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97



 Score = 26.3 bits (59), Expect = 6.9
 Identities = 9/33 (27%), Positives = 14/33 (42%)

Query: 152 KKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
           KK+ K  +K +KK +K K   +         K 
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 39.7 bits (93), Expect = 2e-04
 Identities = 21/40 (52%), Positives = 25/40 (62%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E E+K+K K K K  KKKK K K KK  KK  K +KK +K
Sbjct: 77  EYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEK 116



 Score = 38.9 bits (91), Expect = 5e-04
 Identities = 19/39 (48%), Positives = 24/39 (61%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            EK++ K  K K+KKKK K K KK  KK  K +KK  K+
Sbjct: 79  EEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKE 117



 Score = 37.4 bits (87), Expect = 0.002
 Identities = 21/52 (40%), Positives = 26/52 (50%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
           ++ E +EK+K K K KK KKKK K K KK  KK  K + K      D     
Sbjct: 74  VKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDL 125



 Score = 36.6 bits (85), Expect = 0.003
 Identities = 28/73 (38%), Positives = 32/73 (43%), Gaps = 10/73 (13%)

Query: 125 LYVPRPVALEGEKKEKKKNKNKNKK------KKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
           +Y       + +KKE  +   K KK      K K KKKK KKKK K K KK  KK     
Sbjct: 52  IYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKK----D 107

Query: 179 DPSGKAGVKSREQ 191
           D S K   K  E 
Sbjct: 108 DKSEKKDEKEAED 120



 Score = 35.8 bits (83), Expect = 0.005
 Identities = 16/52 (30%), Positives = 24/52 (46%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
            KK+K K K    K KK  KK  K +KK +K+ +   + L  S     + + 
Sbjct: 86  WKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLS 137



 Score = 35.4 bits (82), Expect = 0.007
 Identities = 15/39 (38%), Positives = 24/39 (61%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           +  KK+K K+K+K   KK  K +KK +K+ + K +  TK
Sbjct: 89  KKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTK 127



 Score = 35.4 bits (82), Expect = 0.007
 Identities = 15/40 (37%), Positives = 26/40 (65%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           + +K +KKK+K+K+KK  KK  K +KK +K+ + K +   
Sbjct: 87  KKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLT 126



 Score = 32.4 bits (74), Expect = 0.077
 Identities = 13/40 (32%), Positives = 23/40 (57%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             +KK+K K+K  +KK  K +KK +K+ + K +   K+  
Sbjct: 91  SKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYS 130



 Score = 31.2 bits (71), Expect = 0.18
 Identities = 11/39 (28%), Positives = 22/39 (56%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           + + K+KK +K  +K +KK +K+ + K +   K   +T 
Sbjct: 95  KDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETL 133


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 39.3 bits (92), Expect = 2e-04
 Identities = 24/41 (58%), Positives = 29/41 (70%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           +E +K  K+  K K K+KK+KKKKKKKKKKK  KK  K KK
Sbjct: 95  VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 33.1 bits (76), Expect = 0.034
 Identities = 18/34 (52%), Positives = 22/34 (64%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKK 162
           + V  E  K++K+K K K KKKKK  KK  KKKK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 31.6 bits (72), Expect = 0.10
 Identities = 18/32 (56%), Positives = 22/32 (68%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKK 163
            L+ + K+KK+ K K KKKKKK  KK  KKKK
Sbjct: 104 VLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 39.5 bits (93), Expect = 5e-04
 Identities = 17/59 (28%), Positives = 30/59 (50%), Gaps = 2/59 (3%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
             + ++K +KK + +  +K   KKK +   KK K    +TKK  P  DP G+   ++ +
Sbjct: 414 LRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDP--DPLGEKLARTED 470



 Score = 39.1 bits (92), Expect = 7e-04
 Identities = 13/37 (35%), Positives = 22/37 (59%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             E+KK + K +K +KK +K++ +K   KKK +   K
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAK 444



 Score = 34.1 bits (79), Expect = 0.031
 Identities = 17/54 (31%), Positives = 35/54 (64%), Gaps = 2/54 (3%)

Query: 121 LYLNLYVPRPVALEGEKKEKKKNK-NKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           +YL L+  +P+  EGE++E +    +  ++KK +KK++K +KK +K++ +K   
Sbjct: 383 IYLKLH-DKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAA 435


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 39.4 bits (92), Expect = 6e-04
 Identities = 20/53 (37%), Positives = 25/53 (47%), Gaps = 2/53 (3%)

Query: 136 EKKEKKKNKNKNKKKKKKK--KKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGV 186
           EKK K K K+ + K K  K  KKK K K KK  KK+    + P    +   G 
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSPALAAMIGA 800



 Score = 34.8 bits (80), Expect = 0.023
 Identities = 19/51 (37%), Positives = 21/51 (41%), Gaps = 4/51 (7%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK----KTKKFLPPSDPS 181
           G  K   + K K KKK    K K  K  KKK K K      K+  P   PS
Sbjct: 741 GTPKTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPS 791



 Score = 26.7 bits (59), Expect = 9.0
 Identities = 15/47 (31%), Positives = 16/47 (34%)

Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
            K     K   +KK K KKK    K K  K     S    K   K R
Sbjct: 737 TKYAGTPKTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 39.5 bits (93), Expect = 6e-04
 Identities = 19/58 (32%), Positives = 30/58 (51%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
           L G KK  KK K   +K +KK++++KK+KKKK    KK ++         +   +  E
Sbjct: 402 LTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459



 Score = 39.1 bits (92), Expect = 7e-04
 Identities = 17/60 (28%), Positives = 35/60 (58%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
           +    EK EKK+ + K +KKKK    KKK+++++++K+KK ++     + + +   +  E
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471



 Score = 39.1 bits (92), Expect = 8e-04
 Identities = 15/61 (24%), Positives = 37/61 (60%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
           V    +K+E++K + K K    KKK+++++++K+KK+++K ++     +   +   K ++
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475

Query: 191 Q 191
           Q
Sbjct: 476 Q 476



 Score = 38.7 bits (91), Expect = 0.001
 Identities = 14/57 (24%), Positives = 31/57 (54%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
           +  KK  +K + K +++KK+KKKK    KKK++++++ K+         +   +  +
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466



 Score = 38.0 bits (89), Expect = 0.002
 Identities = 17/55 (30%), Positives = 32/55 (58%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
             EK + K + +KK+KKKK    KKK+++++++K KK     +   +A  +  E+
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469



 Score = 38.0 bits (89), Expect = 0.002
 Identities = 16/56 (28%), Positives = 31/56 (55%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
            KK KK  +   KK++++KK+KKKK    KKK+++ ++     +   +   +  E+
Sbjct: 409 TKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464



 Score = 36.1 bits (84), Expect = 0.009
 Identities = 14/56 (25%), Positives = 33/56 (58%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           +K +K   K + K++++KK+KKKK    KKK++++ ++     +   +   ++ E+
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465



 Score = 35.7 bits (83), Expect = 0.009
 Identities = 14/60 (23%), Positives = 35/60 (58%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
             +E  +K++++ K + KKK    KKK+++++++K+KK++ K+         K   + ++
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473



 Score = 35.3 bits (82), Expect = 0.015
 Identities = 12/53 (22%), Positives = 30/53 (56%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
             ++   ++ +K + + KK+KKKK    KKK+++++++K+ K+     +    
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462



 Score = 34.5 bits (80), Expect = 0.025
 Identities = 15/46 (32%), Positives = 25/46 (54%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
           E E +    +K   KK KK  +K +KK++++KK+KKK        +
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKE 441



 Score = 32.2 bits (74), Expect = 0.13
 Identities = 14/53 (26%), Positives = 28/53 (52%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
           +E+ +    +KK  KK KK  +K +KK++++KK KK    +    +   +  +
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEK 448



 Score = 31.4 bits (72), Expect = 0.28
 Identities = 10/38 (26%), Positives = 32/38 (84%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
           + E++E++K K + +K++++++ +++K+++++KKKK+ 
Sbjct: 440 KEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477



 Score = 30.3 bits (69), Expect = 0.70
 Identities = 10/39 (25%), Positives = 29/39 (74%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
                E E++EK+K + + ++++++ +++K+++++KKKK
Sbjct: 437 GKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 38.0 bits (89), Expect = 0.001
 Identities = 14/45 (31%), Positives = 26/45 (57%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
           E++ KK+     K+K  ++++K K K  +KK+K + +K  P   P
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTP 160



 Score = 38.0 bits (89), Expect = 0.001
 Identities = 10/41 (24%), Positives = 23/41 (56%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
           + E++  K +   +K+K  ++++K K K  +KK+  +   P
Sbjct: 114 RAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154



 Score = 37.2 bits (87), Expect = 0.002
 Identities = 15/46 (32%), Positives = 26/46 (56%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
           +K+E    K K  ++++K K K  +KK+K + +K   +  P SD S
Sbjct: 120 KKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDIS 165



 Score = 36.4 bits (85), Expect = 0.004
 Identities = 11/49 (22%), Positives = 24/49 (48%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
           E K K + +   ++ KK++   +K+K  ++++K K K       P  + 
Sbjct: 105 EAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQK 153



 Score = 34.9 bits (81), Expect = 0.011
 Identities = 10/45 (22%), Positives = 24/45 (53%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
            ++ +++ K +    +K+K  ++++K K K  +KK K      +P
Sbjct: 112 AQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEP 156



 Score = 34.5 bits (80), Expect = 0.020
 Identities = 10/44 (22%), Positives = 23/44 (52%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
            +++ K +    +K+K  ++++K K K  +KK+K +   P    
Sbjct: 115 AEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQH 158



 Score = 33.4 bits (77), Expect = 0.043
 Identities = 10/45 (22%), Positives = 23/45 (51%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
           E + K+++    K K  ++++K K K  +KK+K + +  +     
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTP 160



 Score = 31.8 bits (73), Expect = 0.13
 Identities = 9/39 (23%), Positives = 20/39 (51%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           E   +++K  + + K K K  +KK+K + +K + +    
Sbjct: 123 EAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPV 161



 Score = 31.0 bits (71), Expect = 0.29
 Identities = 13/56 (23%), Positives = 27/56 (48%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
           A + E   +K+   + ++K K K  +KK+K + +K + +       S+ +    VK
Sbjct: 119 AKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQAVK 174



 Score = 30.7 bits (70), Expect = 0.38
 Identities = 9/50 (18%), Positives = 23/50 (46%)

Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           K K + ++ +++ KK++   +K+K  +++ K          K   +  E 
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEP 156



 Score = 30.3 bits (69), Expect = 0.40
 Identities = 12/45 (26%), Positives = 22/45 (48%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           R  A E EK  +++ K K K  +KK+K + +K + +        +
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 38.3 bits (90), Expect = 0.001
 Identities = 13/37 (35%), Positives = 22/37 (59%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
            L+    +KK+ K  +K +KKK+K++ K  K K K +
Sbjct: 55  ELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 36.7 bits (86), Expect = 0.004
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K + K   K +KKK+K++ K  K K+K 
Sbjct: 63  KKELKAWHKAQKKKEKQEAKAAKAKSKP 90



 Score = 36.0 bits (84), Expect = 0.007
 Identities = 13/37 (35%), Positives = 20/37 (54%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           K    +K + K   K +KKK+K++ K  K K K + F
Sbjct: 57  KAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPRLF 93



 Score = 34.8 bits (81), Expect = 0.017
 Identities = 13/35 (37%), Positives = 17/35 (48%), Gaps = 6/35 (17%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
           KKE K         K +KKK+K++ K  K K K  
Sbjct: 63  KKELKA------WHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 34.8 bits (81), Expect = 0.020
 Identities = 13/36 (36%), Positives = 19/36 (52%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           KE K+        KK+ K   K +KKK+K++ K  K
Sbjct: 50  KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAK 85



 Score = 34.0 bits (79), Expect = 0.027
 Identities = 15/38 (39%), Positives = 22/38 (57%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E KE+ K    +KK+ K   K +KKK+K++ K  K K 
Sbjct: 51  EMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKS 88



 Score = 33.7 bits (78), Expect = 0.042
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 149 KKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
           KK+ K   K +KKK+K++ K    K  P
Sbjct: 63  KKELKAWHKAQKKKEKQEAKAAKAKSKP 90



 Score = 29.4 bits (67), Expect = 1.2
 Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           ++ K+ K + K     KK+ K   K +KKK+K+  K
Sbjct: 47  EQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAK 82



 Score = 28.3 bits (64), Expect = 2.2
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 153 KKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
           KK+ K   K +KKK+K++ K     S P
Sbjct: 63  KKELKAWHKAQKKKEKQEAKAAKAKSKP 90


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 38.5 bits (90), Expect = 0.001
 Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 10/74 (13%)

Query: 110 LPLLSNQSEDC----LYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKK 165
           LP L  + ED         L     +  E E+KE  K      ++K+ +K KK+++KKKK
Sbjct: 528 LPNLGIRIEDKPDGPSVWKLDDKEELQREKEEKEALK------EQKRLRKLKKQEEKKKK 581

Query: 166 KKKKKTKKFLPPSD 179
           + +K  K  +PP++
Sbjct: 582 ELEKLEKAKIPPAE 595


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 38.1 bits (89), Expect = 0.002
 Identities = 17/72 (23%), Positives = 22/72 (30%)

Query: 114 SNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
               +  ++L+  V            KKK K   K    K   KKK K +    KK  KK
Sbjct: 50  EQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKK 109

Query: 174 FLPPSDPSGKAG 185
                D      
Sbjct: 110 NALDKDDDLNYV 121



 Score = 34.6 bits (80), Expect = 0.022
 Identities = 15/61 (24%), Positives = 20/61 (32%), Gaps = 1/61 (1%)

Query: 131 VALEGEKKEKKKNKNKNKKK-KKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
           V +      K  +        KKK K   K    K   KKK K  L  S  + K     +
Sbjct: 55  VLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDK 114

Query: 190 E 190
           +
Sbjct: 115 D 115



 Score = 33.8 bits (78), Expect = 0.038
 Identities = 11/52 (21%), Positives = 15/52 (28%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
                 + +   K        KKK K +    KK +KK    K        D
Sbjct: 72  SDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKD 123



 Score = 32.7 bits (75), Expect = 0.099
 Identities = 14/64 (21%), Positives = 19/64 (29%)

Query: 124 NLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
                +   +        K+ +   +    KKK K   K    K    KK     D S K
Sbjct: 46  KKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKK 105

Query: 184 AGVK 187
           A  K
Sbjct: 106 AEKK 109



 Score = 31.5 bits (72), Expect = 0.21
 Identities = 12/42 (28%), Positives = 18/42 (42%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
              K +   + + KKK KK   K K K    K++ K+ L   
Sbjct: 4   ASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESK 45



 Score = 31.5 bits (72), Expect = 0.23
 Identities = 14/46 (30%), Positives = 21/46 (45%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
               A E  KK+ KK   K+K K    K++ K+  + KKK  +   
Sbjct: 8   AELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQID 53



 Score = 30.3 bits (69), Expect = 0.52
 Identities = 9/45 (20%), Positives = 13/45 (28%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
            K   K K K++    KK +KK    K           +      
Sbjct: 88  AKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADD 132



 Score = 30.3 bits (69), Expect = 0.52
 Identities = 16/53 (30%), Positives = 24/53 (45%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
           E++ KKK K    K K K    K++ K+  + KKKT + +          VK 
Sbjct: 13  EEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKD 65



 Score = 30.0 bits (68), Expect = 0.70
 Identities = 10/37 (27%), Positives = 18/37 (48%)

Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
               + K +   +++ KKK KK   K K+K F+   +
Sbjct: 1   MTTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEE 37



 Score = 30.0 bits (68), Expect = 0.88
 Identities = 10/52 (19%), Positives = 15/52 (28%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
              K   K K K +    KK +KK    K       K +   + +       
Sbjct: 86  AAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDD 137



 Score = 29.6 bits (67), Expect = 0.92
 Identities = 11/35 (31%), Positives = 16/35 (45%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
                K +   +++ KKK KK   K K K   TK+
Sbjct: 2   TTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKE 36



 Score = 29.6 bits (67), Expect = 0.95
 Identities = 13/55 (23%), Positives = 22/55 (40%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
           +KK KK       K    K++ K+  + KKK  ++  + L       K    + E
Sbjct: 17  KKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATE 71



 Score = 28.0 bits (63), Expect = 3.3
 Identities = 13/55 (23%), Positives = 22/55 (40%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
           E   +++ K K KK   K K K    K++ K+  ++KK  P         +    
Sbjct: 9   ELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMV 63



 Score = 28.0 bits (63), Expect = 3.7
 Identities = 9/44 (20%), Positives = 12/44 (27%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
              +KK     +  KK +KK    K       K          D
Sbjct: 90  APAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDD 133


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 37.3 bits (87), Expect = 0.002
 Identities = 19/40 (47%), Positives = 22/40 (55%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E E    KK  N  +++KKKKKKK  KKKK KK      K
Sbjct: 165 EAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMK 204



 Score = 36.1 bits (84), Expect = 0.005
 Identities = 18/38 (47%), Positives = 23/38 (60%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             ++KKK+ N  ++KKKKKKK  KKKK KK      K 
Sbjct: 168 AARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205



 Score = 35.4 bits (82), Expect = 0.008
 Identities = 18/39 (46%), Positives = 20/39 (51%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           A     K+K  N  + KKKKKKK  KKKK KK      K
Sbjct: 166 AEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMK 204



 Score = 31.9 bits (73), Expect = 0.14
 Identities = 17/35 (48%), Positives = 23/35 (65%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            K+    ++KKK    +++KKKKKKK  KKKK KK
Sbjct: 163 IKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKK 197



 Score = 31.9 bits (73), Expect = 0.15
 Identities = 17/42 (40%), Positives = 26/42 (61%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            L+ ++ E  ++K K+   +++KKKKKKK  KKKK KK    
Sbjct: 160 ELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAV 201



 Score = 28.8 bits (65), Expect = 1.5
 Identities = 12/37 (32%), Positives = 15/37 (40%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKK 165
                +    E++K K K K  KKKK KK      K 
Sbjct: 169 ARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205



 Score = 27.7 bits (62), Expect = 3.1
 Identities = 17/47 (36%), Positives = 23/47 (48%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
           KK++   K     + KKK    +++KKKKKKK   KK L      G 
Sbjct: 157 KKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGM 203


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 37.9 bits (88), Expect = 0.002
 Identities = 15/48 (31%), Positives = 17/48 (35%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
           L      K K K +  KKK    K   KKKKK    K        + P
Sbjct: 110 LAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKP 157



 Score = 36.3 bits (84), Expect = 0.006
 Identities = 14/45 (31%), Positives = 18/45 (40%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
            ++ K    K KKK    K   KKKKK    K +  K+      S
Sbjct: 115 MRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKPKS 159



 Score = 34.4 bits (79), Expect = 0.028
 Identities = 17/56 (30%), Positives = 23/56 (41%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           + K+ KK K K KKK KKK             K  +K+F  P   S     K + +
Sbjct: 68  DLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTE 123



 Score = 33.7 bits (77), Expect = 0.045
 Identities = 16/37 (43%), Positives = 19/37 (51%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
             L  +  +K K  NK    K  KK K K+KKK KKK
Sbjct: 50  KKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKK 86



 Score = 33.7 bits (77), Expect = 0.048
 Identities = 18/34 (52%), Positives = 20/34 (58%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
            +K  KKNK  NK    K  KK K K+KKK KKK
Sbjct: 53  HEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKK 86



 Score = 31.0 bits (70), Expect = 0.39
 Identities = 18/38 (47%), Positives = 20/38 (52%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           KK  +K   KNKK  K    K  KK K K+KKK  KK 
Sbjct: 50  KKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKL 87



 Score = 27.9 bits (62), Expect = 3.8
 Identities = 14/42 (33%), Positives = 17/42 (40%), Gaps = 2/42 (4%)

Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKK--KKTKKFLPPSDPS 181
                +  +K K K +K KKK    K   KK KK L   D  
Sbjct: 108 GPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDEL 149



 Score = 27.1 bits (60), Expect = 6.8
 Identities = 15/36 (41%), Positives = 18/36 (50%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           E  KK+   NK+ NKKKKK    K +  K    K K
Sbjct: 123 EKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKPK 158


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 36.1 bits (84), Expect = 0.003
 Identities = 13/34 (38%), Positives = 27/34 (79%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
             K++KK ++K KK+++K++K++ K++KKK K +
Sbjct: 97  RNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130



 Score = 33.4 bits (77), Expect = 0.023
 Identities = 16/41 (39%), Positives = 30/41 (73%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           AL+ E +  K+ K K  K+KK+++K++K++ K++KKK K +
Sbjct: 90  ALKLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130



 Score = 32.6 bits (75), Expect = 0.040
 Identities = 15/37 (40%), Positives = 29/37 (78%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K E ++NK + KK+ K+KK+++K++K++ K++KK  K
Sbjct: 92  KLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAK 128



 Score = 31.8 bits (73), Expect = 0.072
 Identities = 12/35 (34%), Positives = 26/35 (74%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E+ K + K + K+KK+++K++K++ K++KKK   +
Sbjct: 96  ERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130



 Score = 27.6 bits (62), Expect = 2.4
 Identities = 14/45 (31%), Positives = 25/45 (55%), Gaps = 6/45 (13%)

Query: 137 KKEKKKNKNKN------KKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
            KE KK           K + ++ K++KKK+ K+KK+++K +K  
Sbjct: 75  AKEVKKPGISTKAQQALKLEHERNKQEKKKRSKEKKEEEKERKRQ 119


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 36.3 bits (84), Expect = 0.004
 Identities = 20/42 (47%), Positives = 26/42 (61%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
           + +  K   KK KKKK +  K++KKKKK+KKK KK   P  P
Sbjct: 133 DSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHP 174



 Score = 36.0 bits (83), Expect = 0.005
 Identities = 18/35 (51%), Positives = 23/35 (65%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSG 182
           K KKKK +  K++KKKKK+KKKK K+  P     G
Sbjct: 143 KHKKKKHEDDKERKKKKKEKKKKKKRHSPEHPGVG 177



 Score = 35.6 bits (82), Expect = 0.006
 Identities = 18/29 (62%), Positives = 23/29 (79%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
           EKK KKK    +K++KKKKK+KKKKKK+ 
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKKKKRH 169



 Score = 35.6 bits (82), Expect = 0.006
 Identities = 18/50 (36%), Positives = 30/50 (60%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
           +P     + E  +  + K+K KK +  K++KKKKK+KKKKKK+ + +   
Sbjct: 126 LPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHPG 175



 Score = 34.4 bits (79), Expect = 0.019
 Identities = 18/44 (40%), Positives = 24/44 (54%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
           +  K   K   KKK +  K++KKKKK+KKKKKK+     P    
Sbjct: 135 EGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHPGVGF 178



 Score = 32.9 bits (75), Expect = 0.058
 Identities = 16/41 (39%), Positives = 22/41 (53%)

Query: 126 YVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
             P         ++K K K     K++KKKKK+KKKKKK+ 
Sbjct: 129 ETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 37.0 bits (86), Expect = 0.004
 Identities = 22/72 (30%), Positives = 32/72 (44%), Gaps = 14/72 (19%)

Query: 134 EGEKKEKKKNKNK--------------NKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
           +G KK+ KK+K                N++ + K+  KKKKKKKKKKKKK   +    + 
Sbjct: 20  KGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAY 79

Query: 180 PSGKAGVKSREQ 191
                   +  Q
Sbjct: 80  DLPVVWSSAAFQ 91



 Score = 35.8 bits (83), Expect = 0.008
 Identities = 17/43 (39%), Positives = 21/43 (48%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
            L     E ++ +NK   KKKKKKKKKKKKK   +        
Sbjct: 39  FLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDL 81



 Score = 34.7 bits (80), Expect = 0.022
 Identities = 26/83 (31%), Positives = 31/83 (37%), Gaps = 27/83 (32%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKK---------------------------KKKKKK 164
             E  + E K+ K +NK+K  KKK KK                            KKKKK
Sbjct: 2   MNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKK 61

Query: 165 KKKKKKTKKFLPPSDPSGKAGVK 187
           KKKKKK K      D +    V 
Sbjct: 62  KKKKKKKKNLGEAYDLAYDLPVV 84



 Score = 33.1 bits (76), Expect = 0.073
 Identities = 18/53 (33%), Positives = 26/53 (49%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
            +   +  + +NK+  KKKKKKKKKKKKK   +     +  P   S  A   +
Sbjct: 41  AELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDN 93



 Score = 31.2 bits (71), Expect = 0.31
 Identities = 12/32 (37%), Positives = 16/32 (50%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
              E  E +  + K + K+K  KKK KK KK 
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKKD 32



 Score = 30.8 bits (70), Expect = 0.45
 Identities = 13/31 (41%), Positives = 22/31 (70%)

Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             N+N + + K++K++ K+K  KKK KK+KK
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKK 31



 Score = 30.5 bits (69), Expect = 0.58
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
                 +N+ K++K++ K+K  KKK KK KK
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKK 31



 Score = 29.7 bits (67), Expect = 1.1
 Identities = 15/58 (25%), Positives = 21/58 (36%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           E E K+  K K K KKKKKKK   +            +      +    K G    ++
Sbjct: 49  EAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQE 106



 Score = 28.9 bits (65), Expect = 1.8
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
             N+N   + K++K++ K+K  KKK KK K  
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKKD 32



 Score = 28.9 bits (65), Expect = 1.8
 Identities = 19/68 (27%), Positives = 25/68 (36%), Gaps = 2/68 (2%)

Query: 114 SNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           S +  D     L     ++   E + K+ NK K KKKKKKKKK   +             
Sbjct: 29  SKKDVDDDDAFL--AELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWS 86

Query: 174 FLPPSDPS 181
                D S
Sbjct: 87  SAAFQDNS 94



 Score = 28.1 bits (63), Expect = 2.7
 Identities = 10/32 (31%), Positives = 17/32 (53%)

Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
                 + + K++K++ K+K  KKK KK  K 
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKKD 32



 Score = 27.4 bits (61), Expect = 6.2
 Identities = 10/32 (31%), Positives = 16/32 (50%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
              +       ++K++ K+K  KKK KK KK 
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKKD 32


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 36.9 bits (86), Expect = 0.005
 Identities = 18/48 (37%), Positives = 20/48 (41%)

Query: 136  EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
            + K K K     K K KKK+KKKKK    K KK          D   K
Sbjct: 1159 KSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEK 1206



 Score = 36.6 bits (85), Expect = 0.006
 Identities = 22/58 (37%), Positives = 32/58 (55%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
            ALE +++ ++K   K ++ K K K K  K +K K KKK+ KK    +D S KA V   
Sbjct: 1140 ALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGN 1197



 Score = 36.2 bits (84), Expect = 0.008
 Identities = 20/58 (34%), Positives = 23/58 (39%)

Query: 133  LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
            L+ + K K     K K KKK+KKKKK    K KK           SD   K   K   
Sbjct: 1158 LKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDN 1215



 Score = 35.8 bits (83), Expect = 0.011
 Identities = 18/55 (32%), Positives = 22/55 (40%)

Query: 137  KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
             K K K K    +K K KKK+KKKKK    K KK                K  ++
Sbjct: 1158 LKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDK 1212



 Score = 35.4 bits (82), Expect = 0.016
 Identities = 15/58 (25%), Positives = 25/58 (43%)

Query: 133  LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
             + ++ + K     +K +K K KKK+KKKKK    K K    +  S        +  +
Sbjct: 1153 AKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLD 1210



 Score = 35.0 bits (81), Expect = 0.019
 Identities = 21/55 (38%), Positives = 27/55 (49%), Gaps = 1/55 (1%)

Query: 136  EKKEKKKNK-NKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
            E +EK+  K  + K K K K  K +K K KKK+KKK K     S  +   G   R
Sbjct: 1146 EVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKR 1200



 Score = 33.5 bits (77), Expect = 0.059
 Identities = 18/45 (40%), Positives = 23/45 (51%)

Query: 129  RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
                +  E++ K K K K  K +K K KKK+KKKKK    K  K 
Sbjct: 1148 EEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKA 1192



 Score = 33.5 bits (77), Expect = 0.067
 Identities = 22/58 (37%), Positives = 30/58 (51%)

Query: 134  EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
            E E  ++++ K+K K K  K +K K KKK+KKKKK    K    S       V S E+
Sbjct: 1149 EKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEK 1206



 Score = 32.7 bits (75), Expect = 0.10
 Identities = 12/45 (26%), Positives = 22/45 (48%)

Query: 130  PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
              AL+ +KK +KK   K K K + K+    +  +  ++ +K K  
Sbjct: 1318 LAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSD 1362



 Score = 30.8 bits (70), Expect = 0.54
 Identities = 14/61 (22%), Positives = 21/61 (34%), Gaps = 4/61 (6%)

Query: 134  EGEKKEKKKNKNKNKKKKKKKKKKKKKKKK----KKKKKKKTKKFLPPSDPSGKAGVKSR 189
                   KK   K  +      KKKKK +K    KKK K + K+             + +
Sbjct: 1300 SKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKK 1359

Query: 190  E 190
            +
Sbjct: 1360 K 1360



 Score = 29.6 bits (67), Expect = 1.2
 Identities = 13/47 (27%), Positives = 22/47 (46%)

Query: 133  LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
            L   KK+KK  K   +KKK K + K+    +  +  ++ +K    S 
Sbjct: 1318 LAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSS 1364



 Score = 28.5 bits (64), Expect = 2.9
 Identities = 10/47 (21%), Positives = 20/47 (42%)

Query: 133  LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
                 K+KKK++ K  +KKK K + K+    +  +  +  +      
Sbjct: 1317 SLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDS 1363



 Score = 27.7 bits (62), Expect = 4.7
 Identities = 13/49 (26%), Positives = 18/49 (36%)

Query: 136  EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
               E       +   KKK KK+ +      KKKKK++K       S   
Sbjct: 1292 PDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTR 1340



 Score = 27.3 bits (61), Expect = 5.8
 Identities = 15/46 (32%), Positives = 23/46 (50%)

Query: 128  PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            P     +  KK  + +    KKKKK +KK  +KKK K + K+ +  
Sbjct: 1302 PSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASAS 1347



 Score = 27.3 bits (61), Expect = 7.2
 Identities = 12/38 (31%), Positives = 21/38 (55%)

Query: 136  EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            E  E+++   + +  K+++ K K K K  K +K K KK
Sbjct: 1139 EALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKK 1176



 Score = 26.9 bits (60), Expect = 9.1
 Identities = 14/40 (35%), Positives = 19/40 (47%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
              +GE     K  +  KKK KK+ +      KKKKK +K 
Sbjct: 1291 RPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKK 1330


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 36.0 bits (83), Expect = 0.005
 Identities = 15/33 (45%), Positives = 19/33 (57%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           +K      +KKKK K KK K K KK + KKK +
Sbjct: 82  RKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114



 Score = 34.4 bits (79), Expect = 0.017
 Identities = 16/32 (50%), Positives = 20/32 (62%), Gaps = 1/32 (3%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKK 159
           P   A+E +KK K K KNK K KK + KKK +
Sbjct: 84  PATTAVEKKKKGKSK-KNKLKGKKDEDKKKAR 114



 Score = 31.0 bits (70), Expect = 0.29
 Identities = 13/33 (39%), Positives = 17/33 (51%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           ++       +KKKK K KK K K KK + K K 
Sbjct: 81  RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKA 113



 Score = 30.6 bits (69), Expect = 0.35
 Identities = 13/24 (54%), Positives = 15/24 (62%)

Query: 150 KKKKKKKKKKKKKKKKKKKKKTKK 173
           KKKK K KK K K KK + KK  +
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAR 114


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 34.9 bits (80), Expect = 0.006
 Identities = 15/23 (65%), Positives = 15/23 (65%)

Query: 145 NKNKKKKKKKKKKKKKKKKKKKK 167
            K KKKKKK KK KK KK  KK 
Sbjct: 98  EKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 34.9 bits (80), Expect = 0.007
 Identities = 14/27 (51%), Positives = 15/27 (55%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKK 165
            +   K K KKKK KK KK KK  KK 
Sbjct: 94  PEPTEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 33.7 bits (77), Expect = 0.017
 Identities = 15/23 (65%), Positives = 15/23 (65%)

Query: 147 NKKKKKKKKKKKKKKKKKKKKKK 169
            K KKKKKK KK KK KK  KK 
Sbjct: 98  EKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 32.9 bits (75), Expect = 0.035
 Identities = 15/22 (68%), Positives = 15/22 (68%)

Query: 149 KKKKKKKKKKKKKKKKKKKKKK 170
           K KKKKKK KK KK KK  KK 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 31.8 bits (72), Expect = 0.071
 Identities = 15/21 (71%), Positives = 16/21 (76%)

Query: 153 KKKKKKKKKKKKKKKKKKTKK 173
           K KKKKKK KK KK KK +KK
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKK 119



 Score = 31.0 bits (70), Expect = 0.13
 Identities = 14/22 (63%), Positives = 14/22 (63%)

Query: 152 KKKKKKKKKKKKKKKKKKKTKK 173
           K KKKKKK KK KK KK   K 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 30.2 bits (68), Expect = 0.25
 Identities = 15/32 (46%), Positives = 15/32 (46%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKK 161
           P     E  EK K K K  KK KK KK  KK 
Sbjct: 89  PDDHPPEPTEKPKKKKKKSKKTKKPKKSSKKD 120


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region covers
           both the Brf homology II and III regions. This region is
           involved in binding TATA binding protein.
          Length = 95

 Score = 34.2 bits (79), Expect = 0.009
 Identities = 14/40 (35%), Positives = 21/40 (52%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
           ++ + K  K+K  +     KKKKK+K KKK  +  P S  
Sbjct: 30  EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTA 69



 Score = 31.5 bits (72), Expect = 0.078
 Identities = 13/32 (40%), Positives = 20/32 (62%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
           E+KE K+  ++     KKKKK+K KKK+ +  
Sbjct: 33  EEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64



 Score = 31.1 bits (71), Expect = 0.089
 Identities = 11/38 (28%), Positives = 19/38 (50%)

Query: 144 KNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
           + + K+ K+K  +     KKKKK+K K K+       +
Sbjct: 31  EQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPAST 68



 Score = 29.5 bits (67), Expect = 0.31
 Identities = 14/36 (38%), Positives = 22/36 (61%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
           E++E+K+ K K  +     KKKKK+K KKK+ +   
Sbjct: 30  EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGP 65



 Score = 28.0 bits (63), Expect = 1.2
 Identities = 13/36 (36%), Positives = 21/36 (58%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
           LE +++++ K K        KKKKK+K KKK+ +  
Sbjct: 29  LEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64



 Score = 26.4 bits (59), Expect = 4.1
 Identities = 10/31 (32%), Positives = 20/31 (64%)

Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           NK+  +++++K+ K+K  +     KKKK +K
Sbjct: 25  NKDYLEEQEEKELKQKADEGNNSGKKKKKRK 55


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 36.3 bits (83), Expect = 0.009
 Identities = 13/42 (30%), Positives = 23/42 (54%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            A E  K E +   ++ +  ++K +  +KKK++ KKK    KK
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385



 Score = 35.9 bits (82), Expect = 0.011
 Identities = 7/38 (18%), Positives = 18/38 (47%)

Query: 136  EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            +K E+ K   +  K + +    + +  ++K +  + KK
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373



 Score = 34.7 bits (79), Expect = 0.028
 Identities = 13/55 (23%), Positives = 26/55 (47%)

Query: 137  KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
            KK+ ++ K K    KKK ++ KK  +  K + +        ++   +A  K +E+
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE 1375



 Score = 34.3 bits (78), Expect = 0.038
 Identities = 10/40 (25%), Positives = 21/40 (52%)

Query: 134  EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            E  KK  +  K + +    + +  ++K +  +KKK++ KK
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKK 1378



 Score = 34.0 bits (77), Expect = 0.049
 Identities = 9/42 (21%), Positives = 19/42 (45%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            A E +K  +           + +  ++K +  +KKK++  KK
Sbjct: 1338 AEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379



 Score = 33.6 bits (76), Expect = 0.054
 Identities = 8/42 (19%), Positives = 18/42 (42%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            A   +KK ++  K     K + +    + +  ++K +   KK
Sbjct: 1331 ADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKK 1372



 Score = 33.6 bits (76), Expect = 0.065
 Identities = 8/40 (20%), Positives = 18/40 (45%)

Query: 134  EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            E  KK+    K K ++ KK  +  K + +    + +  ++
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEE 1364



 Score = 33.2 bits (75), Expect = 0.071
 Identities = 8/42 (19%), Positives = 20/42 (47%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            A + + +E KK     K + +    + +  ++K +  +K K+
Sbjct: 1333 AAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKE 1374



 Score = 33.2 bits (75), Expect = 0.087
 Identities = 7/38 (18%), Positives = 16/38 (42%)

Query: 136  EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            +K +  K K +  KK  +  K + +    + +  + K 
Sbjct: 1329 KKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKA 1366



 Score = 32.8 bits (74), Expect = 0.098
 Identities = 7/39 (17%), Positives = 19/39 (48%)

Query: 134  EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
            + E+ +K     K + +    + +  ++K +  +KKK +
Sbjct: 1337 KAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE 1375



 Score = 32.4 bits (73), Expect = 0.16
 Identities = 13/40 (32%), Positives = 20/40 (50%)

Query: 134  EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            E E    +    + K +  +KKK++ KKK    KKK  +K
Sbjct: 1351 EAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEK 1390



 Score = 32.0 bits (72), Expect = 0.18
 Identities = 14/40 (35%), Positives = 22/40 (55%)

Query: 134  EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            E    E +  + K +  +KKK++ KKK    KKK ++ KK
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392



 Score = 32.0 bits (72), Expect = 0.18
 Identities = 9/38 (23%), Positives = 18/38 (47%)

Query: 136  EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            + + +         ++K +  +KKK++ KKK     KK
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386



 Score = 32.0 bits (72), Expect = 0.20
 Identities = 11/43 (25%), Positives = 21/43 (48%)

Query: 131  VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             A    +    + +   +K +  +KKK++ KKK    KKK ++
Sbjct: 1347 AAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEE 1389



 Score = 31.3 bits (70), Expect = 0.35
 Identities = 15/42 (35%), Positives = 25/42 (59%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            A E +KK ++  K +  KKK ++ KK  + KKK ++ KK  +
Sbjct: 1446 ADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADE 1487



 Score = 31.3 bits (70), Expect = 0.37
 Identities = 18/53 (33%), Positives = 31/53 (58%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
            A E +KK ++K K    KKK ++ KK  + KKK ++ KK ++    ++ + KA
Sbjct: 1420 ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKA 1472



 Score = 30.9 bits (69), Expect = 0.41
 Identities = 18/53 (33%), Positives = 28/53 (52%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
            A E +K    K K    KKK ++KKK  + KKK ++ KK  +    ++ + KA
Sbjct: 1407 ADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKA 1459



 Score = 30.9 bits (69), Expect = 0.47
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 134  EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            E E  E+K    + KK++ KKK    KKK ++KKK    K
Sbjct: 1358 EAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAK 1397



 Score = 30.9 bits (69), Expect = 0.53
 Identities = 8/39 (20%), Positives = 18/39 (46%)

Query: 134  EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
            + E+ +KK +  K K ++ KK  +  K + +    +   
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEA 1361



 Score = 30.5 bits (68), Expect = 0.54
 Identities = 20/53 (37%), Positives = 30/53 (56%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
            A E +K+E KK  +  KKK ++KKK  + KKK ++ KKK  +    +    KA
Sbjct: 1368 AAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKA 1420



 Score = 30.5 bits (68), Expect = 0.58
 Identities = 19/53 (35%), Positives = 30/53 (56%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
            A E +K E+ K K +  KK  + KKK ++ KK  + KKK ++    +D + KA
Sbjct: 1453 AEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKA 1505



 Score = 30.5 bits (68), Expect = 0.59
 Identities = 19/53 (35%), Positives = 31/53 (58%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
            A E +K E+KK  ++ KKK ++ KK  + KKK ++ KKK       ++ + KA
Sbjct: 1292 ADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344



 Score = 30.1 bits (67), Expect = 0.82
 Identities = 17/53 (32%), Positives = 30/53 (56%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
            A E +KK ++  K    KKK ++ KK ++ KKK ++ KK  +    ++ + KA
Sbjct: 1433 ADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKA 1485



 Score = 29.7 bits (66), Expect = 0.98
 Identities = 13/41 (31%), Positives = 23/41 (56%)

Query: 133  LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             E  +++ +  + K ++ KKK    KKK ++KKK  +  KK
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399



 Score = 29.7 bits (66), Expect = 1.1
 Identities = 17/53 (32%), Positives = 27/53 (50%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
            A E +KK ++  K    KKK ++ KKK  + KK  + KK       ++ + KA
Sbjct: 1472 ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKA 1524



 Score = 29.7 bits (66), Expect = 1.1
 Identities = 16/44 (36%), Positives = 24/44 (54%), Gaps = 4/44 (9%)

Query: 134  EGEKKEKKKNKNKNKKKKKKKKKKK----KKKKKKKKKKKKTKK 173
            E +KK ++  K    KKK ++ KKK    KKK ++ KK  +  K
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAK 1349



 Score = 29.7 bits (66), Expect = 1.2
 Identities = 16/49 (32%), Positives = 28/49 (57%)

Query: 136  EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
            +K E+ K  ++ KKK ++ KK  + KKK ++ KKK  +    ++   KA
Sbjct: 1464 KKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKA 1512



 Score = 29.7 bits (66), Expect = 1.2
 Identities = 19/53 (35%), Positives = 29/53 (54%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
            A E + +  +K K + KKK    KKK ++KKK  + KKK ++    +D   KA
Sbjct: 1361 AAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKA 1413



 Score = 29.3 bits (65), Expect = 1.6
 Identities = 17/49 (34%), Positives = 27/49 (55%)

Query: 136  EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
            +K E+KK  ++ KKK ++ KKK  + KK    KKK  +    ++   KA
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKA 1433



 Score = 29.3 bits (65), Expect = 1.6
 Identities = 17/49 (34%), Positives = 26/49 (53%)

Query: 136  EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
            +K E+ K K    KK    KKK  + KKK ++KKK  +    ++ + KA
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKA 1446



 Score = 29.0 bits (64), Expect = 1.8
 Identities = 18/53 (33%), Positives = 29/53 (54%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
            A E +K ++ K K +  KK  + KKK ++ KKK  + KK  +    +D + KA
Sbjct: 1466 AEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKA 1518



 Score = 29.0 bits (64), Expect = 2.0
 Identities = 15/40 (37%), Positives = 24/40 (60%)

Query: 134  EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            + E+ +KK ++ K     KKK  + KKK ++KKK  + KK
Sbjct: 1399 KAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKK 1438



 Score = 28.6 bits (63), Expect = 2.9
 Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 4/46 (8%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKK----KKKKKKKKKKKKTKK 173
            A   +KK ++K K    KKK ++ KKK    KK    KKK  + KK
Sbjct: 1380 ADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKK 1425



 Score = 28.2 bits (62), Expect = 3.4
 Identities = 15/63 (23%), Positives = 34/63 (53%)

Query: 129  RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
              +  E  KKE +++K K ++ KK +++KKK    KK+++KK ++     +   +  +  
Sbjct: 1728 NKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDE 1787

Query: 189  REQ 191
             ++
Sbjct: 1788 EDE 1790



 Score = 28.2 bits (62), Expect = 3.4
 Identities = 12/41 (29%), Positives = 27/41 (65%)

Query: 133  LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             +  ++++KK     KK+ ++ KK ++ KKK+ ++KKK ++
Sbjct: 1680 AKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEE 1720



 Score = 28.2 bits (62), Expect = 3.9
 Identities = 17/59 (28%), Positives = 30/59 (50%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
            A E +K ++ K K +  KKK  + KK    KKK  + KK  +    +D + K   ++++
Sbjct: 1387 AEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKK 1445



 Score = 27.8 bits (61), Expect = 4.1
 Identities = 16/42 (38%), Positives = 26/42 (61%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            A E +K E+ K  ++ KK ++ KK  + KK ++KKK  + KK
Sbjct: 1512 ADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK 1553



 Score = 27.8 bits (61), Expect = 4.5
 Identities = 12/43 (27%), Positives = 26/43 (60%)

Query: 131  VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            +    E K+ +++K K ++ KK ++ +KK  +  KK+ ++ KK
Sbjct: 1661 IKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703



 Score = 27.8 bits (61), Expect = 5.3
 Identities = 17/60 (28%), Positives = 29/60 (48%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
            A E +K ++ K K +  KK  + KKK ++ KKK    KK  +    +  + KA  ++   
Sbjct: 1298 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357



 Score = 27.4 bits (60), Expect = 5.3
 Identities = 12/40 (30%), Positives = 23/40 (57%)

Query: 134  EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
              E K+K +   K ++ +KK  +  KK+ ++ KK ++ KK
Sbjct: 1670 AEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709



 Score = 27.0 bits (59), Expect = 7.2
 Identities = 15/40 (37%), Positives = 26/40 (65%)

Query: 134  EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            E  KK +++NK K  ++ KK ++ KKK ++ KK ++  KK
Sbjct: 1650 EELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689



 Score = 27.0 bits (59), Expect = 7.4
 Identities = 13/40 (32%), Positives = 26/40 (65%)

Query: 134  EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            E ++K+  +   K  ++ KK ++ KKK+ ++KKK ++ KK
Sbjct: 1684 EEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKK 1723



 Score = 27.0 bits (59), Expect = 8.0
 Identities = 15/36 (41%), Positives = 22/36 (61%)

Query: 138  KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            +EKKK     K ++KKK  + KKK ++ KK  + KK
Sbjct: 1287 EEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKK 1322



 Score = 27.0 bits (59), Expect = 8.2
 Identities = 14/42 (33%), Positives = 24/42 (57%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            A E +KK ++  K  ++ KK  + KKK  + KK ++ KK  +
Sbjct: 1485 ADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE 1526



 Score = 27.0 bits (59), Expect = 8.7
 Identities = 17/42 (40%), Positives = 26/42 (61%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            A E +K E+KK  ++ KK ++KKK  + KKK ++ KK    K
Sbjct: 1280 ADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAK 1321



 Score = 27.0 bits (59), Expect = 9.1
 Identities = 17/50 (34%), Positives = 28/50 (56%)

Query: 134  EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
            E +KK  +  K + KKK  + KKK ++ KK  + KKK ++    +D + K
Sbjct: 1287 EEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKK 1336



 Score = 27.0 bits (59), Expect = 9.1
 Identities = 13/38 (34%), Positives = 22/38 (57%)

Query: 136  EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            +  E KK  ++ KK ++ KK  + KK ++ KK  + KK
Sbjct: 1504 KAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541



 Score = 27.0 bits (59), Expect = 9.2
 Identities = 14/41 (34%), Positives = 25/41 (60%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
            A E +K E+ + K     KK+ ++ KK ++ KKK+ ++K K
Sbjct: 1677 AEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKK 1717


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 35.5 bits (82), Expect = 0.011
 Identities = 13/43 (30%), Positives = 17/43 (39%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            A EG  +  KK    +       +K  K K   KK  KK +K
Sbjct: 172 KAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214



 Score = 28.2 bits (63), Expect = 2.9
 Identities = 13/47 (27%), Positives = 26/47 (55%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSG 182
           E K+ KK  +   ++K K+   K+K++KK ++   + +  P S P+ 
Sbjct: 116 ESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSSTPNS 162


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 35.6 bits (82), Expect = 0.011
 Identities = 21/57 (36%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
               KE++K K + K++KKKKK+K K++ K +K K++ K+  PP     K   K  E
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP-KEKEKEKEKKVE 155



 Score = 35.2 bits (81), Expect = 0.016
 Identities = 15/57 (26%), Positives = 32/57 (56%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
           E + +  K+ + + ++ K++KKKKK+K K++ K +K  ++      P  K   K ++
Sbjct: 97  EPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153



 Score = 34.9 bits (80), Expect = 0.018
 Identities = 19/63 (30%), Positives = 32/63 (50%), Gaps = 3/63 (4%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
             P A     KE K   N++ K+++K+K++ K++KKKKK+K K +          K    
Sbjct: 86  KGPAAKTKPAKEPK---NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRP 142

Query: 188 SRE 190
            +E
Sbjct: 143 PKE 145



 Score = 34.5 bits (79), Expect = 0.023
 Identities = 18/46 (39%), Positives = 30/46 (65%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
           + EKK+KK+   +  K +K K++ K+K+  K+K+K+K KK   P D
Sbjct: 114 KEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRD 159



 Score = 33.7 bits (77), Expect = 0.054
 Identities = 13/56 (23%), Positives = 30/56 (53%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           EK+  K+ + + +KK ++ + ++++KK+++ + K   K  P   P  K      E+
Sbjct: 139 EKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEE 194



 Score = 33.3 bits (76), Expect = 0.061
 Identities = 15/58 (25%), Positives = 29/58 (50%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           +G   + K  K    +  K+++K+K++ K++KKKKK+  K  P      +   + R  
Sbjct: 86  KGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP 143



 Score = 32.9 bits (75), Expect = 0.086
 Identities = 17/64 (26%), Positives = 33/64 (51%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
            RP   + ++KEKK  + ++++++KK+++ + K + KK  KKK            K    
Sbjct: 140 KRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQA 199

Query: 188 SREQ 191
           +RE 
Sbjct: 200 AREA 203



 Score = 32.9 bits (75), Expect = 0.088
 Identities = 14/57 (24%), Positives = 29/57 (50%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           G  K         K+ K +  K+++K+K++ K++KK KK  P  +P  +   +  ++
Sbjct: 83  GGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKE 139



 Score = 31.8 bits (72), Expect = 0.20
 Identities = 14/46 (30%), Positives = 28/46 (60%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
           E  K+E K  K K + K+K+  K+K+K+K+KK ++ + ++     +
Sbjct: 122 EKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRE 167



 Score = 31.0 bits (70), Expect = 0.32
 Identities = 14/45 (31%), Positives = 26/45 (57%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           + V   G K    K K   + K +  K+++K+K++ K++KKK K+
Sbjct: 78  KRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKE 122



 Score = 29.1 bits (65), Expect = 1.6
 Identities = 14/54 (25%), Positives = 26/54 (48%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
           E    +    K K  K+ K +  K+++K+K++ K++K KK   P +       K
Sbjct: 81  EKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPK 134



 Score = 28.7 bits (64), Expect = 1.9
 Identities = 13/45 (28%), Positives = 26/45 (57%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
           K+E+ + K++ KK  KKK   KKK+  +++K+++  +      P 
Sbjct: 165 KRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPE 209



 Score = 28.7 bits (64), Expect = 2.1
 Identities = 11/55 (20%), Positives = 25/55 (45%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSG 182
           PR    E +++  +      K  KKK   KKK+  +++K+++  ++ +       
Sbjct: 157 PRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEP 211



 Score = 28.3 bits (63), Expect = 2.7
 Identities = 14/56 (25%), Positives = 27/56 (48%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
           E   +E++K + + + K + KK  KKK   KKK+  + +K    +  + K   +  
Sbjct: 156 EPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEP 211



 Score = 27.5 bits (61), Expect = 5.1
 Identities = 15/47 (31%), Positives = 23/47 (48%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           V  P   E EKK ++       KK  KKK   KKK+  +++K++   
Sbjct: 154 VEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAA 200



 Score = 27.2 bits (60), Expect = 6.9
 Identities = 11/56 (19%), Positives = 23/56 (41%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
            K   KK   K    KKK+  +++K+++  ++  K K   P  +   +      + 
Sbjct: 171 AKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKD 226



 Score = 26.8 bits (59), Expect = 8.2
 Identities = 13/54 (24%), Positives = 24/54 (44%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
            R    + E+   K    K  KKK   KKK+  +++K+++  +      P +P 
Sbjct: 159 DREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPD 212


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 35.5 bits (82), Expect = 0.012
 Identities = 19/68 (27%), Positives = 27/68 (39%), Gaps = 13/68 (19%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKK----------KKKKKKKKKKKKKKKKTKKFLPPSDPS 181
           A E  KK     K   +++  KK           +KK K K  KK  +K KK   PS  +
Sbjct: 738 AREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKK---PSAKT 794

Query: 182 GKAGVKSR 189
            K    ++
Sbjct: 795 QKIAAATK 802



 Score = 35.1 bits (81), Expect = 0.015
 Identities = 17/42 (40%), Positives = 21/42 (50%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           A  GEKK K K   K+ +K KK   K +K     K K+  KK
Sbjct: 768 AFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKK 809



 Score = 33.2 bits (76), Expect = 0.084
 Identities = 13/40 (32%), Positives = 17/40 (42%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E + K K   K+  K KK   K +K     K K+  K K 
Sbjct: 772 EKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKV 811



 Score = 32.4 bits (74), Expect = 0.13
 Identities = 10/33 (30%), Positives = 14/33 (42%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
            +K  +K  K   K +K     K K+  KKK  
Sbjct: 780 AKKDARKAKKPSAKTQKIAAATKAKRAAKKKVA 812



 Score = 32.0 bits (73), Expect = 0.21
 Identities = 14/44 (31%), Positives = 20/44 (45%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           + V  E +   + + K K K  KK  +K KK   K +K    TK
Sbjct: 759 KKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATK 802



 Score = 30.5 bits (69), Expect = 0.62
 Identities = 11/35 (31%), Positives = 17/35 (48%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
           +  KK+ +K K  + K +K     K K+  KKK  
Sbjct: 778 KAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVA 812



 Score = 30.1 bits (68), Expect = 0.87
 Identities = 11/36 (30%), Positives = 17/36 (47%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKK 165
           P A + + ++ KK   K +K     K K+  KKK  
Sbjct: 777 PKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVA 812


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 35.0 bits (81), Expect = 0.015
 Identities = 16/41 (39%), Positives = 26/41 (63%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            E     K+K K+K ++  +K   + K+KKK+KKK++K KK
Sbjct: 303 FEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKK 343



 Score = 33.1 bits (76), Expect = 0.069
 Identities = 16/40 (40%), Positives = 29/40 (72%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           + ++K K K +  N+K   + K+KKK+KKK++KKKK+ ++
Sbjct: 308 DLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIER 347



 Score = 29.6 bits (67), Expect = 0.90
 Identities = 14/37 (37%), Positives = 25/37 (67%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           L+ E KEKKK K K +KKKK+ ++ +++ +K + +  
Sbjct: 324 LDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQAT 360



 Score = 29.6 bits (67), Expect = 1.0
 Identities = 13/37 (35%), Positives = 27/37 (72%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K + +++  K   + K+KKK+KKK++KKKK+ ++ ++
Sbjct: 314 KSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEE 350



 Score = 29.6 bits (67), Expect = 1.1
 Identities = 15/64 (23%), Positives = 28/64 (43%), Gaps = 4/64 (6%)

Query: 132 ALEGEKKEKKKN----KNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
           AL+ + K  KK     +  +  K+K K K ++  +K   + K+ KK     +   K   +
Sbjct: 288 ALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIER 347

Query: 188 SREQ 191
             E+
Sbjct: 348 LEER 351



 Score = 28.1 bits (63), Expect = 2.9
 Identities = 13/37 (35%), Positives = 29/37 (78%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           +++ +K   + K+KKK+KKK++KKKK+ ++ +++ +K
Sbjct: 318 ERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEK 354


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 35.1 bits (81), Expect = 0.018
 Identities = 17/44 (38%), Positives = 26/44 (59%)

Query: 147 NKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
           ++  K KK KKKKKKKKK++K     +  P ++  G +   SR+
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRK 313



 Score = 34.3 bits (79), Expect = 0.034
 Identities = 15/41 (36%), Positives = 22/41 (53%)

Query: 145 NKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
           ++  K KK KKKKKKKKK++K   +   +       S  +G
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSG 310



 Score = 34.0 bits (78), Expect = 0.038
 Identities = 18/59 (30%), Positives = 28/59 (47%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
           V+L+ +  E   +   +  +  K KK KKKKKKKKK++K   +     +  G     S 
Sbjct: 252 VSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSG 310



 Score = 32.4 bits (74), Expect = 0.14
 Identities = 12/42 (28%), Positives = 19/42 (45%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            +  +    +   +      +  K KK KKKKKKKKK++   
Sbjct: 251 NVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDL 292



 Score = 31.7 bits (72), Expect = 0.25
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
           K KK KKKKKKKKK++K   + + + +     S  SG   
Sbjct: 274 KFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRK 313



 Score = 30.1 bits (68), Expect = 0.75
 Identities = 9/42 (21%), Positives = 25/42 (59%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           A   E  +K++ + + K+K+++ ++K  K ++K+++  K   
Sbjct: 36  AAAYENWKKRQEEAEAKRKREELREKIAKAREKRERNSKLGG 77



 Score = 27.4 bits (61), Expect = 5.2
 Identities = 12/45 (26%), Positives = 26/45 (57%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
            A E  KK +++ + K K+++ ++K  K ++K+++  K    K L
Sbjct: 37  AAYENWKKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTL 81



 Score = 27.0 bits (60), Expect = 6.8
 Identities = 10/27 (37%), Positives = 19/27 (70%)

Query: 145 NKNKKKKKKKKKKKKKKKKKKKKKKKT 171
            K+KK++KKK+ ++KK     +K+K+ 
Sbjct: 96  KKSKKRQKKKEAERKKALLLDEKEKER 122



 Score = 27.0 bits (60), Expect = 7.5
 Identities = 8/41 (19%), Positives = 23/41 (56%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           +    E  K + +  + K+K+++ ++K  K ++K+++  K 
Sbjct: 35  DAAAYENWKKRQEEAEAKRKREELREKIAKAREKRERNSKL 75



 Score = 26.6 bits (59), Expect = 8.5
 Identities = 11/40 (27%), Positives = 17/40 (42%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
            KK+K  N            + +KK+++ KKK K     L
Sbjct: 215 GKKKKSDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSL 254



 Score = 26.6 bits (59), Expect = 9.7
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           K  KK++KKK+ ++KK     +K+K +
Sbjct: 96  KKSKKRQKKKEAERKKALLLDEKEKER 122


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 35.0 bits (81), Expect = 0.018
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            + +EK +      +KK+++ +K K K   K  KK  KK
Sbjct: 105 KKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKK 143



 Score = 33.8 bits (78), Expect = 0.049
 Identities = 11/42 (26%), Positives = 25/42 (59%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           E E + ++  K   K ++K ++ + + +KK+++ +K   KFL
Sbjct: 92  EIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFL 133



 Score = 32.3 bits (74), Expect = 0.13
 Identities = 11/42 (26%), Positives = 19/42 (45%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           LE + ++ +    K +++ +K K K   K  KK  KK     
Sbjct: 107 LEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNL 148



 Score = 32.3 bits (74), Expect = 0.15
 Identities = 9/37 (24%), Positives = 21/37 (56%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           E K++ K   +  ++ + + +KK+++ +K K K   K
Sbjct: 99  ELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDK 135



 Score = 31.9 bits (73), Expect = 0.21
 Identities = 10/38 (26%), Positives = 23/38 (60%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E+ +K+  K + K ++ + + +KK+++ +K K K   K
Sbjct: 98  EELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDK 135



 Score = 31.1 bits (71), Expect = 0.31
 Identities = 11/46 (23%), Positives = 26/46 (56%), Gaps = 1/46 (2%)

Query: 129 RPVALEGEK-KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           + V   GE+  E +    + KK+ KK ++K ++ + + +KK++  +
Sbjct: 81  KGVFTLGEENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELE 126



 Score = 30.7 bits (70), Expect = 0.45
 Identities = 10/42 (23%), Positives = 22/42 (52%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           L+ E K+ ++   + + + +KK+++ +K K K   K   K  
Sbjct: 100 LKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLA 141



 Score = 29.6 bits (67), Expect = 1.2
 Identities = 10/40 (25%), Positives = 17/40 (42%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           + + + + K +  +K K K   K  KK  KK      + L
Sbjct: 113 QLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEAL 152



 Score = 28.8 bits (65), Expect = 1.9
 Identities = 9/39 (23%), Positives = 22/39 (56%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           ++ KK+ K   +K ++ + + +KK+++ +K K K     
Sbjct: 98  EELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKA 136


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 34.6 bits (80), Expect = 0.018
 Identities = 19/52 (36%), Positives = 26/52 (50%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
           E E  + +  + + KKKKKKKKKK KK        + T     P++ S  AG
Sbjct: 168 EVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAG 219



 Score = 34.2 bits (79), Expect = 0.026
 Identities = 16/54 (29%), Positives = 26/54 (48%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
            E+ E  K + + ++ KKKKKKKKKK KK        +  +  + P+  +    
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAG 219



 Score = 32.3 bits (74), Expect = 0.098
 Identities = 13/56 (23%), Positives = 16/56 (28%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           E++ KKK K K KK KK        +                     K   K R  
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSI 233



 Score = 31.2 bits (71), Expect = 0.24
 Identities = 16/57 (28%), Positives = 25/57 (43%), Gaps = 2/57 (3%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP--PSDPSGKAGVKSREQ 191
            K + + +   KKKKKKKKK KK        +      +P   S  +G+ G   + +
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLK 228



 Score = 27.3 bits (61), Expect = 4.8
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           N  + + +  K + ++++ KKKKKKKKK  K
Sbjct: 163 NPTEEEVELLKARLEEERAKKKKKKKKKKTK 193


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 34.0 bits (78), Expect = 0.020
 Identities = 16/40 (40%), Positives = 27/40 (67%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E  +KE+K+   + ++K+  K KK+KK+KK+KK  +K  K
Sbjct: 93  EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAK 132



 Score = 32.0 bits (73), Expect = 0.095
 Identities = 22/60 (36%), Positives = 30/60 (50%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
            + V+ E E+KE  K K + K+KK+KK  +K  KKK  K  K T K       + K   K
Sbjct: 99  QKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEEGK 158



 Score = 32.0 bits (73), Expect = 0.10
 Identities = 14/38 (36%), Positives = 27/38 (71%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            ++ +K+ K  +K+ ++K+  K KK+KK+KK+KK  +K
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEK 129



 Score = 31.7 bits (72), Expect = 0.12
 Identities = 18/40 (45%), Positives = 28/40 (70%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E EK++K+ +K   +K+  K KK+KK+KK+KK  +K  KK
Sbjct: 94  ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKK 133



 Score = 31.3 bits (71), Expect = 0.15
 Identities = 21/62 (33%), Positives = 32/62 (51%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
           P   E E+KE  K   + +  K KK+KK+KK+KK  +K  K K      + + KA  K+ 
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151

Query: 190 EQ 191
            +
Sbjct: 152 TK 153


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 34.7 bits (80), Expect = 0.021
 Identities = 15/40 (37%), Positives = 29/40 (72%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E E+KE+KK + +    K+++  ++++K++KKKK KK K+
Sbjct: 40  EEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKE 79



 Score = 33.2 bits (76), Expect = 0.066
 Identities = 14/38 (36%), Positives = 30/38 (78%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E++E+K+ K + ++K   K+++  ++++K++KKKKTKK
Sbjct: 39  EEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKK 76



 Score = 32.4 bits (74), Expect = 0.12
 Identities = 15/40 (37%), Positives = 28/40 (70%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E EK+EKK+ + K   K+++  ++++K++KKKK KK  + 
Sbjct: 41  EEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80



 Score = 32.4 bits (74), Expect = 0.13
 Identities = 12/40 (30%), Positives = 28/40 (70%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E E+++++K + + K   K+++  ++++K++KKKK K  K
Sbjct: 39  EEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVK 78



 Score = 32.4 bits (74), Expect = 0.14
 Identities = 14/40 (35%), Positives = 30/40 (75%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E E+K++++ K  +K+++  ++++K++KKKK KK K+T  
Sbjct: 43  EKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTT 82



 Score = 30.5 bits (69), Expect = 0.47
 Identities = 13/47 (27%), Positives = 29/47 (61%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           VP     E ++++K++ +    K+++  ++++K++KKKK KK K   
Sbjct: 35  VPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETT 81



 Score = 27.0 bits (60), Expect = 6.7
 Identities = 13/37 (35%), Positives = 23/37 (62%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
             KE++ ++ + K++KKKK KK K+   + +   KTK
Sbjct: 55  TDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTK 91


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 34.8 bits (81), Expect = 0.024
 Identities = 21/77 (27%), Positives = 37/77 (48%)

Query: 105 FLEKLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
             +K LP L  Q E   +  L +   +    E+ EK + + + + +K  K+  +  KK +
Sbjct: 139 LKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLE 198

Query: 165 KKKKKKTKKFLPPSDPS 181
            +KKK++K F P   P 
Sbjct: 199 AEKKKQSKNFDPKEGPV 215



 Score = 29.4 bits (67), Expect = 1.4
 Identities = 11/47 (23%), Positives = 21/47 (44%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
           E E ++  K   +  KK + +KKK+ K    K+   +  K +   + 
Sbjct: 180 EEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEEI 226


>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein.  The PcfK-like protein family
           includes the Enterococcus faecalis PcfK protein, which
           is functionally uncharacterized. This family of proteins
           is found in bacteria and viruses. Proteins in this
           family are typically between 137 and 257 amino acids in
           length. There are two completely conserved residues (D
           and L) that may be functionally important.
          Length = 136

 Score = 33.1 bits (76), Expect = 0.027
 Identities = 11/42 (26%), Positives = 23/42 (54%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
           V   V L  E+K + + +     ++++ +K +K+ KK KK +
Sbjct: 84  VNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAE 125



 Score = 28.1 bits (63), Expect = 1.7
 Identities = 8/43 (18%), Positives = 20/43 (46%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
              +   + K + +K+  K  ++++ +K +K+ K  K   P  
Sbjct: 86  HTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPVQ 128


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 34.5 bits (79), Expect = 0.027
 Identities = 10/41 (24%), Positives = 20/41 (48%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
            A E  ++   + ++   KK  ++  KK   KK  KK  +++
Sbjct: 1178 AREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESE 1218



 Score = 32.5 bits (74), Expect = 0.12
 Identities = 11/41 (26%), Positives = 19/41 (46%)

Query: 133  LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             + E+  +K  +   + +    KK  ++  KK   KK TKK
Sbjct: 1173 AKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKK 1213



 Score = 31.4 bits (71), Expect = 0.29
 Identities = 11/39 (28%), Positives = 16/39 (41%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
            A E  +K ++          KK  ++  KK   KK  KK
Sbjct: 1175 AEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKK 1213



 Score = 31.4 bits (71), Expect = 0.32
 Identities = 11/42 (26%), Positives = 16/42 (38%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            A   E +EK +      +    KK  ++  KK   KK   K 
Sbjct: 1173 AKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKA 1214



 Score = 27.1 bits (60), Expect = 8.1
 Identities = 8/42 (19%), Positives = 19/42 (45%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             L+ E  + ++ + K ++   + +    KK  ++  KK   K
Sbjct: 1167 KLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPK 1208


>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
           function prediction only].
          Length = 386

 Score = 34.4 bits (79), Expect = 0.028
 Identities = 12/40 (30%), Positives = 17/40 (42%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
                KK +   K+K  +K    K KK  KK + +T  F 
Sbjct: 347 VDAYPKKAEAARKRKGDRKGVSHKAKKGGKKNQAETAGFE 386


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 34.1 bits (78), Expect = 0.028
 Identities = 13/49 (26%), Positives = 16/49 (32%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
           E K+ K   K +  K  K  K KK KKK     +    P          
Sbjct: 143 EAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDW 191



 Score = 32.9 bits (75), Expect = 0.078
 Identities = 18/64 (28%), Positives = 25/64 (39%), Gaps = 5/64 (7%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGV 186
           +        E K+ KK ++    K  K KK KKK     +  K      P  DP  K   
Sbjct: 137 ISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKM-----PGIDPRSKPDW 191

Query: 187 KSRE 190
           KS++
Sbjct: 192 KSQD 195


>gnl|CDD|115185 pfam06513, DUF1103, Repeat of unknown function (DUF1103).  This
           family consists of several repeats of around 30 residues
           in length which are found specifically in
           mature-parasite-infected erythrocyte surface antigen
           proteins from Plasmodium falciparum. This family often
           found in conjunction with pfam00226.
          Length = 215

 Score = 33.7 bits (76), Expect = 0.031
 Identities = 22/55 (40%), Positives = 33/55 (60%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
           EK + KK+K    +K +K+ K+K +K+ K+K KKK KK +  +D  G   VK  E
Sbjct: 13  EKNDGKKDKVIGSEKTQKEIKEKVEKRVKEKCKKKVKKGIKENDTEGNDKVKGPE 67


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 34.1 bits (79), Expect = 0.032
 Identities = 19/44 (43%), Positives = 25/44 (56%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           R   LE E K+ K        ++K+KKK+KKK KKKK K  + K
Sbjct: 377 RVEQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGK 420



 Score = 29.8 bits (68), Expect = 0.93
 Identities = 15/38 (39%), Positives = 24/38 (63%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E+ E++  + K +      ++K+KKK+KKK KKKK K 
Sbjct: 379 EQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKV 416



 Score = 29.5 bits (67), Expect = 1.2
 Identities = 15/37 (40%), Positives = 24/37 (64%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           ++E K+ K +      ++K+KKK+KKK KKKK K  +
Sbjct: 382 EQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPR 418



 Score = 29.5 bits (67), Expect = 1.3
 Identities = 14/38 (36%), Positives = 22/38 (57%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E++ K+          ++K+KKK+KKK KKKK K  + 
Sbjct: 382 EQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRG 419


>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
          Length = 822

 Score = 34.2 bits (79), Expect = 0.034
 Identities = 13/63 (20%), Positives = 20/63 (31%), Gaps = 2/63 (3%)

Query: 126 YVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
               P     EK++K+K      +  K    K  +K K    +  T      +DP   A 
Sbjct: 756 CCNNPKCNYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDL--TAADPDAVAA 813

Query: 186 VKS 188
              
Sbjct: 814 KVD 816



 Score = 27.7 bits (62), Expect = 4.1
 Identities = 8/48 (16%), Positives = 16/48 (33%)

Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
           N  + +KK+K+ K + +  K    K  +  K       +         
Sbjct: 763 NYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLTAADPDA 810


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 34.3 bits (78), Expect = 0.036
 Identities = 19/52 (36%), Positives = 29/52 (55%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
           EKK  K K K+KK+K+K++ K KKK+ +  K     L  S  S  +  ++ E
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADE 247



 Score = 33.1 bits (75), Expect = 0.067
 Identities = 16/37 (43%), Positives = 25/37 (67%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           EK +    + K+KK KKK+KK+K+K++ K KKK+   
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEG 224



 Score = 33.1 bits (75), Expect = 0.078
 Identities = 18/48 (37%), Positives = 27/48 (56%)

Query: 144 KNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           K+K  KKK+KK+K+K++ K KKK+ +  K  L   D S  +     E 
Sbjct: 198 KSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEA 245



 Score = 32.0 bits (72), Expect = 0.19
 Identities = 15/33 (45%), Positives = 24/33 (72%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           + E+ K   K+KKKK++K+K++KKKKKK    +
Sbjct: 270 EAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302



 Score = 31.6 bits (71), Expect = 0.23
 Identities = 17/46 (36%), Positives = 24/46 (52%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
           EKK KK  K + K+K+K++ K KKK+ +  K         P S  S
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAAS 241



 Score = 31.6 bits (71), Expect = 0.26
 Identities = 17/42 (40%), Positives = 26/42 (61%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             E  K  +K +    +KK KK KKK+KK+K+K++ K K K+
Sbjct: 180 NAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKE 221



 Score = 31.2 bits (70), Expect = 0.29
 Identities = 17/46 (36%), Positives = 27/46 (58%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
           E ++ +  + +   K  K KKKK++K+K++KKKKKK        SD
Sbjct: 262 EPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSD 307



 Score = 31.2 bits (70), Expect = 0.29
 Identities = 16/35 (45%), Positives = 22/35 (62%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           EK       KK KK KKK+KK+K+K++ K KK + 
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEV 222



 Score = 30.4 bits (68), Expect = 0.51
 Identities = 9/34 (26%), Positives = 24/34 (70%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
           V LE E++ +++ +   ++KKK++K+K+ +++  
Sbjct: 82  VKLEEERRHRQRLEKDKREKKKREKEKRGRRRHH 115



 Score = 30.4 bits (68), Expect = 0.53
 Identities = 10/40 (25%), Positives = 27/40 (67%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
           K + + +++++ +K K++KKK++K+K+ +++       SD
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESD 122



 Score = 30.4 bits (68), Expect = 0.53
 Identities = 8/31 (25%), Positives = 25/31 (80%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
           +E+++++ + +K K++KKK++K+K+ +++  
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRHH 115



 Score = 30.0 bits (67), Expect = 0.77
 Identities = 15/37 (40%), Positives = 21/37 (56%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
           P+    E  KK  K  K K +K+K++KKKKKK    +
Sbjct: 266 PKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302



 Score = 30.0 bits (67), Expect = 0.84
 Identities = 19/60 (31%), Positives = 26/60 (43%)

Query: 118 EDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
            D   L   V          + K     + KK  K KKKK++K+K++KKKKKK       
Sbjct: 245 ADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCH 304



 Score = 28.9 bits (64), Expect = 1.7
 Identities = 18/71 (25%), Positives = 32/71 (45%)

Query: 103 LVFLEKLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKK 162
           L+      P  +    +    +L           + ++ K+    + KK  K KKKK++K
Sbjct: 228 LLLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRK 287

Query: 163 KKKKKKKKTKK 173
           +K++KKKK K 
Sbjct: 288 EKEEKKKKKKH 298



 Score = 28.5 bits (63), Expect = 2.3
 Identities = 13/35 (37%), Positives = 21/35 (60%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKK 165
           V    +K +K K K K +K+K++ K KKK+ +  K
Sbjct: 192 VPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226



 Score = 27.7 bits (61), Expect = 4.3
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKK 163
           P   +  KK KKK K + +K++ K KKK+ +  K
Sbjct: 193 PAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226



 Score = 27.3 bits (60), Expect = 6.1
 Identities = 15/49 (30%), Positives = 23/49 (46%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
            P   E E+ +K     K K++K+K++KKKKKK    +          P
Sbjct: 265 EPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQP 313


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
           describes the eukaryotic 60S (cytosolic) ribosomal
           protein L7 and paralogs that may or may not also be L7.
           Human, Drosophila, and Arabidopsis all have both a
           typical L7 and an L7-related paralog. This family is
           designated subfamily rather than equivalog to reflect
           these uncharacterized paralogs. Members of this family
           average ~ 250 residues in length, somewhat longer than
           the archaeal L30P/L7E homolog (~ 155 residues) and much
           longer than the related bacterial/organellar form (~ 60
           residues).
          Length = 235

 Score = 33.5 bits (77), Expect = 0.037
 Identities = 13/38 (34%), Positives = 19/38 (50%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E   KK+   +    +  K+ K KKK  KKK+K   K+
Sbjct: 1   ELLLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKR 38



 Score = 26.9 bits (60), Expect = 6.4
 Identities = 9/29 (31%), Positives = 16/29 (55%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           K +K +++   +  K+ K KKK  K K+ 
Sbjct: 5   KKRKARQELAVQVAKQAKAKKKANKKKRK 33


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 32.0 bits (73), Expect = 0.038
 Identities = 16/33 (48%), Positives = 22/33 (66%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K  K   KKKKKKKKKK K K++   +K++ +K
Sbjct: 13  KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEK 45



 Score = 31.6 bits (72), Expect = 0.054
 Identities = 17/40 (42%), Positives = 23/40 (57%)

Query: 149 KKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
           K K KK   KKKKKKKKKK K  ++ +   +   K+  +S
Sbjct: 11  KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAES 50



 Score = 31.6 bits (72), Expect = 0.058
 Identities = 19/34 (55%), Positives = 22/34 (64%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             K K K KK   KKKKKKKKKK K K++  T+K
Sbjct: 7   GGKLKLKGKKIDVKKKKKKKKKKNKSKEEVVTEK 40



 Score = 31.2 bits (71), Expect = 0.080
 Identities = 17/46 (36%), Positives = 21/46 (45%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           K K KK   KKKKKKKKKK K K++        + S         +
Sbjct: 11  KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGE 56



 Score = 30.9 bits (70), Expect = 0.10
 Identities = 14/37 (37%), Positives = 24/37 (64%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K +K   K K KKKKKK K K++   +K++++K + +
Sbjct: 13  KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAE 49



 Score = 29.3 bits (66), Expect = 0.35
 Identities = 15/59 (25%), Positives = 30/59 (50%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           L+G+K + KK K K KKK K K++   +K++++K   ++       D   +   +  + 
Sbjct: 12  LKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDG 70



 Score = 29.3 bits (66), Expect = 0.37
 Identities = 15/32 (46%), Positives = 20/32 (62%)

Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K K K    KKKKKKKKKK K K++   + ++
Sbjct: 11  KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEE 42



 Score = 29.3 bits (66), Expect = 0.40
 Identities = 16/36 (44%), Positives = 23/36 (63%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K K K  +  KKKKKKKKK K K++   +K+++ K 
Sbjct: 11  KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKS 46


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 33.7 bits (77), Expect = 0.042
 Identities = 19/59 (32%), Positives = 22/59 (37%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
           R +A  G    KK  K    K   KK   K K   KK  KKK  K    +  + K   K
Sbjct: 189 RVLAAVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAK 247



 Score = 32.2 bits (73), Expect = 0.14
 Identities = 15/54 (27%), Positives = 20/54 (37%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
               +K   K K+  KK  KKK  K     KK  K   K    +  + K  +K 
Sbjct: 209 KSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKK 262



 Score = 31.0 bits (70), Expect = 0.36
 Identities = 16/59 (27%), Positives = 20/59 (33%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
           +  A   +K +K   K   K  K  KK  KK  K   K  K   K       + K   K
Sbjct: 243 KTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGK 301



 Score = 30.2 bits (68), Expect = 0.51
 Identities = 15/60 (25%), Positives = 19/60 (31%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
            +  A     K   K  +   K   KK  KKK  K     KK  K     +  + K   K
Sbjct: 199 AKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKK 258



 Score = 30.2 bits (68), Expect = 0.60
 Identities = 21/63 (33%), Positives = 23/63 (36%), Gaps = 2/63 (3%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK--KKKTKKFLPPSDPSGKA 184
           V    A +   K  KK K   KK  KK  K  KK  KK  K   K  K     +    KA
Sbjct: 236 VSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKA 295

Query: 185 GVK 187
             K
Sbjct: 296 KKK 298



 Score = 29.9 bits (67), Expect = 0.74
 Identities = 17/42 (40%), Positives = 19/42 (45%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           A+  +K  K   K   K KK  KK  KK  K  KK  KK  K
Sbjct: 235 AVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAK 276



 Score = 29.9 bits (67), Expect = 0.75
 Identities = 14/45 (31%), Positives = 16/45 (35%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           +    +  K      K      K  KK KK  KK  KK  K  KK
Sbjct: 225 KVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKK 269



 Score = 29.5 bits (66), Expect = 0.94
 Identities = 14/47 (29%), Positives = 16/47 (34%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             +  A      +K   K   K     KK  K   K  KK KK  KK
Sbjct: 212 AKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKK 258



 Score = 29.1 bits (65), Expect = 1.5
 Identities = 13/51 (25%), Positives = 17/51 (33%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
           K+  K   K  KK  K   K  K   K  K K   K       +  +  K+
Sbjct: 261 KKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKA 311



 Score = 28.7 bits (64), Expect = 1.9
 Identities = 16/53 (30%), Positives = 19/53 (35%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
           K  KK  K   K  KK  K   K  K   K  K   K    +     AG K++
Sbjct: 258 KALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAK 310



 Score = 28.7 bits (64), Expect = 1.9
 Identities = 20/53 (37%), Positives = 21/53 (39%), Gaps = 2/53 (3%)

Query: 137 KKEKKKNKNKNKKKKK--KKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
           KK KK  K   KK  K  KK  KK  K   K  K   K     +    KAG K
Sbjct: 250 KKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKK 302



 Score = 28.3 bits (63), Expect = 2.3
 Identities = 17/56 (30%), Positives = 23/56 (41%), Gaps = 1/56 (1%)

Query: 137 KKEKKKNKNKNKKKKKK-KKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           K   K  K   K  KK  KK  K  KK  KK  K   K    +  + K   K++++
Sbjct: 243 KTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKK 298



 Score = 28.3 bits (63), Expect = 2.6
 Identities = 16/48 (33%), Positives = 18/48 (37%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
           KK  KK      K  K   K  K K K KKK  K       +  + KA
Sbjct: 268 KKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKA 315



 Score = 27.9 bits (62), Expect = 3.5
 Identities = 15/55 (27%), Positives = 19/55 (34%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
            +     K   K   K  KK KK  KK  KK  K   K     +  + KA   + 
Sbjct: 232 AKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAA 286



 Score = 27.2 bits (60), Expect = 6.2
 Identities = 21/61 (34%), Positives = 25/61 (40%), Gaps = 2/61 (3%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKK--KKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
           P A  G KK   K K+  KK  KKK  K     KK  K   K  KK    +  + K   K
Sbjct: 206 PAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAK 265

Query: 188 S 188
           +
Sbjct: 266 A 266



 Score = 26.4 bits (58), Expect = 9.7
 Identities = 15/53 (28%), Positives = 18/53 (33%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
            K+  K K        KK  K   K  KK KK   K     +    KA  K+ 
Sbjct: 223 AKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAA 275


>gnl|CDD|240309 PTZ00189, PTZ00189, 60S ribosomal protein L21; Provisional.
          Length = 160

 Score = 32.7 bits (75), Expect = 0.049
 Identities = 10/43 (23%), Positives = 18/43 (41%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSG 182
           +K    ++  K+       K + KKK +   TK+      P+G
Sbjct: 97  RKSRCREDFLKRVAANDAIKAEAKKKGELPSTKRQPAQPRPAG 139


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 32.2 bits (74), Expect = 0.054
 Identities = 15/51 (29%), Positives = 32/51 (62%), Gaps = 3/51 (5%)

Query: 127 VPRPVALEGEKKEKKKNKNKNK---KKKKKKKKKKKKKKKKKKKKKKTKKF 174
           V + +  E EKK+K+K   K +    + ++++ K++ +K+K+ +K + KKF
Sbjct: 71  VEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKF 121



 Score = 31.5 bits (72), Expect = 0.11
 Identities = 10/34 (29%), Positives = 26/34 (76%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           +  +++ K + +K+K+ +K ++KK K+++K+K+K
Sbjct: 97  RLRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 30.7 bits (70), Expect = 0.22
 Identities = 10/33 (30%), Positives = 25/33 (75%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
            ++E+ K + + +K+ +K ++KK K+++K+K+K
Sbjct: 98  LRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 29.2 bits (66), Expect = 0.59
 Identities = 8/33 (24%), Positives = 23/33 (69%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
            + +++   +  +K+K+ +K ++KK K+++K+K
Sbjct: 96  LRLRRERTKERAEKEKRTRKNREKKFKRRQKEK 128



 Score = 28.8 bits (65), Expect = 1.0
 Identities = 8/33 (24%), Positives = 23/33 (69%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
              ++ K++ + + + +K ++KK K+++K+K+K
Sbjct: 98  LRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 28.0 bits (63), Expect = 1.6
 Identities = 10/38 (26%), Positives = 24/38 (63%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           KEKKK + +  KK++   + ++++ K++ +K+K  +  
Sbjct: 79  KEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKN 116



 Score = 27.2 bits (61), Expect = 3.2
 Identities = 11/36 (30%), Positives = 25/36 (69%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K +K+   + +KKKK+K+  KK++   + ++++TK+
Sbjct: 70  KVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKE 105



 Score = 26.5 bits (59), Expect = 6.7
 Identities = 11/33 (33%), Positives = 21/33 (63%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKK 163
           +  E  K+  +K K   K ++KK K+++K+K+K
Sbjct: 98  LRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 33.1 bits (76), Expect = 0.059
 Identities = 12/32 (37%), Positives = 17/32 (53%)

Query: 147 NKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
           +   +K+K  KK  KK    KK+K K+  PP 
Sbjct: 221 SSPSRKRKAPKKVAKKVAAAKKRKQKRSPPPP 252



 Score = 29.3 bits (66), Expect = 1.1
 Identities = 10/29 (34%), Positives = 15/29 (51%)

Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
           +    K+K  KK  KK    KK+K+K+  
Sbjct: 221 SSPSRKRKAPKKVAKKVAAAKKRKQKRSP 249



 Score = 29.3 bits (66), Expect = 1.2
 Identities = 10/29 (34%), Positives = 19/29 (65%)

Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           ++ + ++K+K  KK  KK    KK+K+K+
Sbjct: 219 QSSSPSRKRKAPKKVAKKVAAAKKRKQKR 247



 Score = 29.3 bits (66), Expect = 1.2
 Identities = 11/33 (33%), Positives = 17/33 (51%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
           ++    +K+K  KK  KK    KK+ +K  PP 
Sbjct: 219 QSSSPSRKRKAPKKVAKKVAAAKKRKQKRSPPP 251



 Score = 28.5 bits (64), Expect = 1.9
 Identities = 11/35 (31%), Positives = 16/35 (45%)

Query: 126 YVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKK 160
            VP P +    +K K   K   K    KK+K+K+ 
Sbjct: 214 VVPTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRS 248



 Score = 28.1 bits (63), Expect = 2.8
 Identities = 10/28 (35%), Positives = 13/28 (46%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
                K K  KK  KK    KK+K+K+ 
Sbjct: 221 SSPSRKRKAPKKVAKKVAAAKKRKQKRS 248



 Score = 27.0 bits (60), Expect = 6.2
 Identities = 10/27 (37%), Positives = 13/27 (48%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKK 166
               K K  KK  KK    KK+K+K+ 
Sbjct: 222 SPSRKRKAPKKVAKKVAAAKKRKQKRS 248


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 32.6 bits (75), Expect = 0.060
 Identities = 11/23 (47%), Positives = 18/23 (78%)

Query: 147 NKKKKKKKKKKKKKKKKKKKKKK 169
           +KKK KK KK+K+K++K+ +K  
Sbjct: 13  DKKKAKKAKKEKRKQRKQARKGA 35



 Score = 31.8 bits (73), Expect = 0.13
 Identities = 11/23 (47%), Positives = 17/23 (73%)

Query: 149 KKKKKKKKKKKKKKKKKKKKKKT 171
           KKK KK KK+K+K++K+ +K   
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGAD 36



 Score = 31.0 bits (71), Expect = 0.21
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 151 KKKKKKKKKKKKKKKKKKKKTK 172
           KKK KK KK+K+K++K+ +K  
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGA 35



 Score = 30.3 bits (69), Expect = 0.36
 Identities = 13/39 (33%), Positives = 23/39 (58%)

Query: 153 KKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           KKK KK KK+K+K++K+  K      D   +A  +++ +
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAE 52



 Score = 29.5 bits (67), Expect = 0.74
 Identities = 10/23 (43%), Positives = 17/23 (73%)

Query: 145 NKNKKKKKKKKKKKKKKKKKKKK 167
           +K K KK KK+K+K++K+ +K  
Sbjct: 13  DKKKAKKAKKEKRKQRKQARKGA 35



 Score = 29.1 bits (66), Expect = 0.90
 Identities = 10/23 (43%), Positives = 17/23 (73%)

Query: 143 NKNKNKKKKKKKKKKKKKKKKKK 165
           +K K KK KK+K+K++K+ +K  
Sbjct: 13  DKKKAKKAKKEKRKQRKQARKGA 35



 Score = 28.0 bits (63), Expect = 2.6
 Identities = 12/46 (26%), Positives = 25/46 (54%), Gaps = 8/46 (17%)

Query: 136 EKKEKKKNKNKNKKKKKKKKK--------KKKKKKKKKKKKKKTKK 173
           +KK KK  K K K++K+ +K         K+  ++ K +K ++ ++
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRE 59


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 32.2 bits (74), Expect = 0.062
 Identities = 13/40 (32%), Positives = 25/40 (62%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E E+++   +K  +K   K+K K +KK+++KK + +K  K
Sbjct: 1   EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAK 40



 Score = 31.5 bits (72), Expect = 0.11
 Identities = 10/43 (23%), Positives = 23/43 (53%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           A+  EK + +K + + K + +K  K   ++K + + +K  K+ 
Sbjct: 16  AIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKEL 58



 Score = 31.5 bits (72), Expect = 0.11
 Identities = 7/42 (16%), Positives = 23/42 (54%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
                +K++++ K++ +K  K   ++K + + +K +K+  + 
Sbjct: 20  EKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEEL 61



 Score = 31.5 bits (72), Expect = 0.11
 Identities = 7/41 (17%), Positives = 23/41 (56%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            E +K E +K    + ++K + + +K +K+ ++ + +  ++
Sbjct: 28  QEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARR 68



 Score = 29.1 bits (66), Expect = 0.69
 Identities = 12/40 (30%), Positives = 24/40 (60%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E    +K+ +K   K+K K +KK+++KK + +K  K + +
Sbjct: 5   EKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAE 44



 Score = 27.2 bits (61), Expect = 3.0
 Identities = 11/38 (28%), Positives = 20/38 (52%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
            K   K K K +KK+++KK + +K  K   ++K   + 
Sbjct: 14  DKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYEL 51



 Score = 26.1 bits (58), Expect = 7.0
 Identities = 8/40 (20%), Positives = 20/40 (50%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           + E  +    +    +KK+++KK + +K  K   ++K + 
Sbjct: 10  DKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEY 49


>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 725

 Score = 33.3 bits (76), Expect = 0.065
 Identities = 15/73 (20%), Positives = 24/73 (32%), Gaps = 6/73 (8%)

Query: 107 EKLLPLLSNQSEDCL--YLNLYVPRP----VALEGEKKEKKKNKNKNKKKKKKKKKKKKK 160
           E ++ L  N  +     +L      P       +    E K         KK    K  +
Sbjct: 638 ELIIKLNKNNYKVNFQNFLEKIFGGPKHIFAISKKLINEAKIYWKAINNTKKSVVIKPLE 697

Query: 161 KKKKKKKKKKTKK 173
           K KK +K   +K+
Sbjct: 698 KPKKYEKTSVSKE 710


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 33.3 bits (76), Expect = 0.067
 Identities = 24/75 (32%), Positives = 28/75 (37%), Gaps = 18/75 (24%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKK---------------KKKKKKKKKKKTKKFLPPS 178
             E      N +K  KKKK K    K               KK KK +KK + K    PS
Sbjct: 406 SNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPS 465

Query: 179 DP---SGKAGVKSRE 190
           D      K  VKS+E
Sbjct: 466 DSKAGGKKESVKSQE 480



 Score = 28.2 bits (63), Expect = 2.8
 Identities = 13/46 (28%), Positives = 19/46 (41%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
           E   K+ KKN+ K + K  K     K   KK+  K +      P +
Sbjct: 443 EDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKSQEDNNNIPPE 488


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 33.0 bits (76), Expect = 0.069
 Identities = 15/48 (31%), Positives = 31/48 (64%), Gaps = 6/48 (12%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKK------KKKKKKKKKKKTKK 173
           A E E++E+ + K + KKK++++ K  K      +K ++K++KK+ +K
Sbjct: 275 AAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322



 Score = 32.6 bits (75), Expect = 0.087
 Identities = 11/37 (29%), Positives = 28/37 (75%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           E++E+K  K   ++++++ ++KK++KKK++++ K  K
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAK 302



 Score = 29.1 bits (66), Expect = 1.1
 Identities = 9/45 (20%), Positives = 31/45 (68%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
           ++  ++E+++   K  +++++++ ++KK++KKK++++     L P
Sbjct: 261 VDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSP 305


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 33.2 bits (76), Expect = 0.072
 Identities = 13/36 (36%), Positives = 27/36 (75%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           E + KE+K  + +N+KK+K++  + K+K++K ++KK
Sbjct: 399 EAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 32.8 bits (75), Expect = 0.099
 Identities = 11/37 (29%), Positives = 25/37 (67%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
              K +++K K +  +KK+K++  + K+K++K ++KK
Sbjct: 398 AEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 32.1 bits (73), Expect = 0.17
 Identities = 10/33 (30%), Positives = 24/33 (72%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKK 165
            + EK ++++N+ K K++  + K+K++K ++KK
Sbjct: 402 AKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 31.3 bits (71), Expect = 0.24
 Identities = 12/43 (27%), Positives = 27/43 (62%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
             A E  + + K+ K K ++ +KK+K++  + K+K++K ++ K
Sbjct: 392 TDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 31.3 bits (71), Expect = 0.30
 Identities = 9/38 (23%), Positives = 26/38 (68%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            ++ + K K +  K+++ +KK+K++  + K+K++K ++
Sbjct: 395 SEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432



 Score = 30.5 bits (69), Expect = 0.51
 Identities = 11/40 (27%), Positives = 25/40 (62%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             E + K K +   +++ +KK+K++  + K+K++K + KK
Sbjct: 395 SEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 30.1 bits (68), Expect = 0.59
 Identities = 9/38 (23%), Positives = 25/38 (65%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E    ++ + K K++K K+++ +KK+K++  + K+ ++
Sbjct: 391 ETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQ 428



 Score = 30.1 bits (68), Expect = 0.64
 Identities = 13/42 (30%), Positives = 27/42 (64%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           V  E +  E+ + K K +K K+++ +KK+K++  + K+K+ K
Sbjct: 388 VKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQK 429



 Score = 29.8 bits (67), Expect = 0.84
 Identities = 12/40 (30%), Positives = 26/40 (65%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             E+ E K  + K K+++ +KK+K++  + K+K++K  +K
Sbjct: 394 ASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433



 Score = 27.5 bits (61), Expect = 4.7
 Identities = 11/40 (27%), Positives = 23/40 (57%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           L   K E   ++    K K++K K+++ +KK+K++  + K
Sbjct: 385 LGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDK 424


>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
           MW N terminal.  This domain is found in eukaryotes. This
           domain is about 90 amino acids in length. This domain is
           found associated with pfam00076. This domain is part of
           U1 snRNP, which is the pre-mRNA binding protein of the
           penta-snRNP spliceosome complex. It extends over a
           distance of 180 A from its RNA binding domain, wraps
           around the core domain of U1 snRNP consisting of the
           seven Sm proteins and finally contacts U1-C, which is
           crucial for 5'-splice-site recognition.
          Length = 94

 Score = 31.5 bits (72), Expect = 0.073
 Identities = 13/49 (26%), Positives = 29/49 (59%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
             +  K E      +   +K++++K++KK+K +KK +++ K++ P  DP
Sbjct: 46  EFKDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKEWDPNEDP 94


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.  This
           presumed domain is found at the N-terminus of Ribosomal
           L30 proteins and has been termed RL30NT or NUC018.
          Length = 71

 Score = 30.7 bits (70), Expect = 0.079
 Identities = 15/38 (39%), Positives = 24/38 (63%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           KK K+  K + K+ KK+  KK  +KKK+K   K+ +K+
Sbjct: 4   KKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKY 41



 Score = 26.4 bits (59), Expect = 2.6
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
             K + +N+K + K+ KK+  KK  +KKK+K
Sbjct: 2   LLKKRKRNEKLRAKRAKKRAAKKAARKKKRK 32


>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 472

 Score = 32.8 bits (75), Expect = 0.081
 Identities = 14/39 (35%), Positives = 27/39 (69%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           E   ++KN N N ++K+KKK++ K K++K++  +  K+F
Sbjct: 341 ENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRF 379



 Score = 27.4 bits (61), Expect = 5.3
 Identities = 8/28 (28%), Positives = 19/28 (67%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
           ++K K K + K K++K++  + +K+ K+
Sbjct: 354 QQKEKKKEESKAKEEKQEDIEFEKRFKE 381



 Score = 26.7 bits (59), Expect = 8.9
 Identities = 9/36 (25%), Positives = 20/36 (55%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           E       + K K K++ K K++K++  + +K+ K+
Sbjct: 346 EKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKE 381


>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 594

 Score = 32.7 bits (75), Expect = 0.091
 Identities = 9/37 (24%), Positives = 14/37 (37%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K   K     N  ++K   K  +K KK     K+ + 
Sbjct: 458 KPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFET 494



 Score = 30.4 bits (69), Expect = 0.53
 Identities = 9/38 (23%), Positives = 14/38 (36%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            K     N ++ K   K  +K KK     K+ +    K
Sbjct: 461 IKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDALK 498



 Score = 30.4 bits (69), Expect = 0.61
 Identities = 11/36 (30%), Positives = 15/36 (41%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           +K KNK   K        ++K   K  +K KK   F
Sbjct: 453 DKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDF 488



 Score = 30.0 bits (68), Expect = 0.74
 Identities = 10/43 (23%), Positives = 15/43 (34%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
            K K   K        ++K   K  +K KK     K+F   + 
Sbjct: 455 DKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDAL 497



 Score = 26.9 bits (60), Expect = 7.8
 Identities = 7/35 (20%), Positives = 14/35 (40%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
              ++K   K  +K KK     K+ +    K ++ 
Sbjct: 468 NNSQEKAVLKLLQKIKKNNDFIKEFETDALKLQEI 502


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 32.2 bits (74), Expect = 0.096
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKK 162
           E+KEKKK  +K +KK  K++K+K ++ 
Sbjct: 96  EEKEKKKAMSKEEKKAIKEEKEKLEEP 122



 Score = 28.8 bits (65), Expect = 1.4
 Identities = 10/28 (35%), Positives = 21/28 (75%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
           ++EK+K K  +K++KK  K++K+K ++ 
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122



 Score = 28.4 bits (64), Expect = 1.8
 Identities = 10/29 (34%), Positives = 20/29 (68%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           + +K+KKK   K++KK  K++K+K  + +
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEPY 123



 Score = 27.6 bits (62), Expect = 3.5
 Identities = 10/28 (35%), Positives = 19/28 (67%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
           E++K K K   K++KK  K++K+K ++ 
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122


>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
          Length = 977

 Score = 33.0 bits (75), Expect = 0.099
 Identities = 15/48 (31%), Positives = 24/48 (50%)

Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
           KN+ ++ K KK KKKK  KKK+  K +   ++ +     +      SR
Sbjct: 115 KNRVESWKDKKNKKKKSAKKKEAHKAQIPPEQQMEEKPSADAYEPLSR 162


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 31.6 bits (72), Expect = 0.11
 Identities = 15/43 (34%), Positives = 20/43 (46%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
                 E K + K K K +  K K +K+K K + KK K  K K
Sbjct: 110 EQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152



 Score = 31.3 bits (71), Expect = 0.18
 Identities = 12/39 (30%), Positives = 24/39 (61%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           E E  ++ + ++  + K + K+KKK++  K K +K+K K
Sbjct: 102 EEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPK 140



 Score = 30.9 bits (70), Expect = 0.19
 Identities = 14/40 (35%), Positives = 26/40 (65%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E E+++  + K ++K+KKK++  K K +K+K K + K  K
Sbjct: 108 ETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPK 147



 Score = 30.1 bits (68), Expect = 0.37
 Identities = 11/40 (27%), Positives = 23/40 (57%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E E +E+  ++ + +   + K + K+KKK++  K K  K+
Sbjct: 98  EEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKE 137



 Score = 29.3 bits (66), Expect = 0.64
 Identities = 11/39 (28%), Positives = 23/39 (58%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           E E++   + + ++  + K + K+KKK++  K K +K K
Sbjct: 100 ETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEK 138



 Score = 29.3 bits (66), Expect = 0.82
 Identities = 7/40 (17%), Positives = 25/40 (62%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           + E++ ++++ ++ +++   + K + K+KKK++  K   +
Sbjct: 96  DEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTE 135



 Score = 28.2 bits (63), Expect = 1.7
 Identities = 16/42 (38%), Positives = 21/42 (50%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
              P   + E KEKKK +    K +K+K K + KK K  K K
Sbjct: 111 QEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152



 Score = 26.3 bits (58), Expect = 9.3
 Identities = 7/40 (17%), Positives = 24/40 (60%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             +++E+ + ++ ++ +++   + K + K+KKK++    K
Sbjct: 94  SDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPK 133


>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Proteins.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs, all of which are expressed
           during embryonic and early development in the nervous
           system but with different localization and timing.
           srGAPs contain an N-terminal F-BAR domain, a Rho GAP
           domain, and a C-terminal SH3 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 241

 Score = 32.3 bits (74), Expect = 0.11
 Identities = 12/36 (33%), Positives = 25/36 (69%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           E EK+E+K+ ++  KK ++ +  KK +K+ +K++ K
Sbjct: 156 EAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191



 Score = 27.7 bits (62), Expect = 3.5
 Identities = 10/36 (27%), Positives = 23/36 (63%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           + EK++ K +   +KK ++ +  KK +K+ +K++ K
Sbjct: 156 EAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191



 Score = 26.9 bits (60), Expect = 5.8
 Identities = 11/40 (27%), Positives = 24/40 (60%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E + KE +K + K ++  +KK ++ +  KK +K+ +K + 
Sbjct: 151 ERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQA 190


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 32.4 bits (74), Expect = 0.11
 Identities = 10/35 (28%), Positives = 21/35 (60%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
            K   K    ++KK++  + ++KKK+KK K +++ 
Sbjct: 358 PKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRG 392



 Score = 32.4 bits (74), Expect = 0.12
 Identities = 13/48 (27%), Positives = 25/48 (52%), Gaps = 1/48 (2%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
           EK+ +K  +   K   K K ++ KK++  + ++KK +K    S+  G 
Sbjct: 347 EKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKK-AKSERRGL 393



 Score = 31.6 bits (72), Expect = 0.24
 Identities = 12/40 (30%), Positives = 23/40 (57%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E   K   K K +  KK++  + ++KKK+KK K +++  +
Sbjct: 355 EKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394



 Score = 30.8 bits (70), Expect = 0.43
 Identities = 9/38 (23%), Positives = 19/38 (50%), Gaps = 1/38 (2%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
            P   A     K+++  +   +KKK+KK K +++  + 
Sbjct: 359 KPPTKAKPERDKKERPGR-YRRKKKEKKAKSERRGLQN 395


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
           is an essential protein in yeast that is involved in
           pre-rRNA processing and ribosome assembly. It is
           speculated to be required for correct assembly of rpS27
           into the pre-ribosomal particle.
          Length = 131

 Score = 31.4 bits (72), Expect = 0.11
 Identities = 11/35 (31%), Positives = 23/35 (65%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
            +  +K+    ++ K  ++++K K+KKKKKKK+ +
Sbjct: 56  TRGGRKRKAGASRNKAAEERRKLKEKKKKKKKELE 90



 Score = 28.3 bits (64), Expect = 1.2
 Identities = 13/32 (40%), Positives = 24/32 (75%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
           G K++   ++NK  ++++K K+KKKKKKK+ +
Sbjct: 59  GRKRKAGASRNKAAEERRKLKEKKKKKKKELE 90



 Score = 27.6 bits (62), Expect = 2.4
 Identities = 10/40 (25%), Positives = 21/40 (52%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           +G     +  + +     + K  ++++K K+KKKKKK + 
Sbjct: 50  DGWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKKEL 89



 Score = 27.6 bits (62), Expect = 2.7
 Identities = 11/35 (31%), Positives = 21/35 (60%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
             + +     + K  ++++K K+KKKKKKK+ + F
Sbjct: 58  GGRKRKAGASRNKAAEERRKLKEKKKKKKKELENF 92


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 32.3 bits (74), Expect = 0.11
 Identities = 14/46 (30%), Positives = 27/46 (58%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           P  V  + +KK+K  +K    KK   K++K + K KK +++++ +K
Sbjct: 252 PSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQK 297



 Score = 31.6 bits (72), Expect = 0.18
 Identities = 15/44 (34%), Positives = 27/44 (61%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
            P  ++ +KK+KKK  +K  K KK   K++K + K KK +++ +
Sbjct: 251 NPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRER 294



 Score = 31.2 bits (71), Expect = 0.28
 Identities = 18/33 (54%), Positives = 21/33 (63%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           + N +K KKKKKKKKK   KK K KK   K +K
Sbjct: 249 RANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRK 281



 Score = 30.0 bits (68), Expect = 0.57
 Identities = 20/42 (47%), Positives = 22/42 (52%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
           P +LE   K+       N  K KKKKKKKKK   KK K KK 
Sbjct: 234 PDSLEDMYKKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKL 275



 Score = 29.7 bits (67), Expect = 0.81
 Identities = 15/40 (37%), Positives = 20/40 (50%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
            K +KKK K K    KK K KK   K++K + K K  +  
Sbjct: 253 SKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRR 292



 Score = 28.1 bits (63), Expect = 2.7
 Identities = 18/37 (48%), Positives = 20/37 (54%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K       N +K KKKKKKKKK   KK K KK   K+
Sbjct: 243 KAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQ 279



 Score = 27.3 bits (61), Expect = 4.9
 Identities = 16/37 (43%), Positives = 19/37 (51%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           +    K K K KKKKK   KK K KK   K++K   K
Sbjct: 249 RANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVK 285



 Score = 26.6 bits (59), Expect = 9.1
 Identities = 11/33 (33%), Positives = 20/33 (60%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           KK K K     ++K + K KK +++++ +KK K
Sbjct: 268 KKYKTKKLTGKQRKARVKAKKAQRRERLQKKIK 300


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 31.6 bits (72), Expect = 0.11
 Identities = 15/45 (33%), Positives = 25/45 (55%), Gaps = 3/45 (6%)

Query: 126 YVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
              RP   EG KK K+K +    K KK++ +K+K+K+++  K   
Sbjct: 58  ESKRP---EGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALA 99



 Score = 26.6 bits (59), Expect = 6.0
 Identities = 11/29 (37%), Positives = 21/29 (72%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           K  K+K ++ K K KK++ +K+K+K ++F
Sbjct: 66  KKAKEKLRRDKLKAKKEEAEKEKEKEERF 94


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 32.8 bits (75), Expect = 0.12
 Identities = 14/35 (40%), Positives = 25/35 (71%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
            K  + K  +  ++K KKKKKK+KKK+++ K+++K
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEK 764



 Score = 31.6 bits (72), Expect = 0.29
 Identities = 15/36 (41%), Positives = 26/36 (72%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           G+  E K + +  +K KKKKKK+KKK+++ K+++K 
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKA 765



 Score = 30.1 bits (68), Expect = 0.88
 Identities = 14/35 (40%), Positives = 24/35 (68%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
           + E K     + K KKKKKK+KKK+++ K+++K +
Sbjct: 732 DAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766



 Score = 28.9 bits (65), Expect = 1.7
 Identities = 8/31 (25%), Positives = 15/31 (48%)

Query: 136  EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
              +EK   ++  +   KK+K KK+ K   + 
Sbjct: 1551 SNQEKNIEEDYAESDIKKRKNKKQYKSNTEA 1581



 Score = 28.9 bits (65), Expect = 2.2
 Identities = 8/33 (24%), Positives = 18/33 (54%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
             L  ++K  +++  ++  KK+K KK+ K   + 
Sbjct: 1549 VLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEA 1581



 Score = 28.5 bits (64), Expect = 2.3
 Identities = 13/34 (38%), Positives = 23/34 (67%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             + K  +  ++K KKKKKK+KKK+++ K++ K 
Sbjct: 732 DAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKA 765



 Score = 28.1 bits (63), Expect = 3.4
 Identities = 12/36 (33%), Positives = 26/36 (72%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K+ +   + + ++K KKKKKK+KKK+++ K+++  +
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766



 Score = 28.1 bits (63), Expect = 3.6
 Identities = 8/29 (27%), Positives = 16/29 (55%)

Query: 140  KKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
            ++KN  ++  +   KK+K KK+ K   + 
Sbjct: 1553 QEKNIEEDYAESDIKKRKNKKQYKSNTEA 1581



 Score = 28.1 bits (63), Expect = 3.9
 Identities = 15/51 (29%), Positives = 22/51 (43%), Gaps = 10/51 (19%)

Query: 135  GEKKEKKKN-----KNKNKKKKK-----KKKKKKKKKKKKKKKKKKTKKFL 175
               KEKK N      N+ K  ++       KK+K KK+ K   + +   FL
Sbjct: 1537 ESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEAELDFFL 1587



 Score = 27.0 bits (60), Expect = 8.0
 Identities = 13/35 (37%), Positives = 22/35 (62%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            K      +   ++K KKKKKK+KKK+++ K+ +K
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEK 764



 Score = 27.0 bits (60), Expect = 9.3
 Identities = 16/68 (23%), Positives = 28/68 (41%), Gaps = 7/68 (10%)

Query: 121  LYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
            L LNL          E   KK N+ K       +KK  + + + +++K+   +    SD 
Sbjct: 1488 LLLNLN-------GNENVNKKINQKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDK 1540

Query: 181  SGKAGVKS 188
              K  ++S
Sbjct: 1541 EKKGNLES 1548



 Score = 26.6 bits (59), Expect = 9.6
 Identities = 12/48 (25%), Positives = 21/48 (43%), Gaps = 1/48 (2%)

Query: 128  PRPVALEGEKK-EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
               V    EK  E+   ++  KK+K KK+ K   + +     K+  +F
Sbjct: 1546 LESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEAELDFFLKRYLRF 1593


>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.  This
           domain is the C-terminal end of ribosomal L18/L5
           proteins.
          Length = 93

 Score = 30.6 bits (70), Expect = 0.12
 Identities = 10/28 (35%), Positives = 17/28 (60%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            +  +KKK+KK+ K + K+   KK T +
Sbjct: 57  ADPSRKKKEKKEVKAESKRYNAKKLTLE 84



 Score = 28.7 bits (65), Expect = 0.71
 Identities = 9/24 (37%), Positives = 16/24 (66%)

Query: 144 KNKNKKKKKKKKKKKKKKKKKKKK 167
            + ++KKK+KK+ K + K+   KK
Sbjct: 57  ADPSRKKKEKKEVKAESKRYNAKK 80



 Score = 27.9 bits (63), Expect = 1.0
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKK 173
           KKK+KK+ K + K+   KK   + +K
Sbjct: 62  KKKEKKEVKAESKRYNAKKLTLEQRK 87



 Score = 27.1 bits (61), Expect = 2.1
 Identities = 11/32 (34%), Positives = 21/32 (65%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
           +KKEKK+ K ++K+   KK   +++K +  +K
Sbjct: 62  KKKEKKEVKAESKRYNAKKLTLEQRKARVAQK 93



 Score = 26.7 bits (60), Expect = 3.0
 Identities = 9/34 (26%), Positives = 20/34 (58%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
            +K+K+K + K + K+   KK   +++K +  +K
Sbjct: 60  SRKKKEKKEVKAESKRYNAKKLTLEQRKARVAQK 93



 Score = 26.0 bits (58), Expect = 5.4
 Identities = 9/34 (26%), Positives = 17/34 (50%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
            +KK + K  K + K+   KK   +++K +   K
Sbjct: 60  SRKKKEKKEVKAESKRYNAKKLTLEQRKARVAQK 93



 Score = 26.0 bits (58), Expect = 6.4
 Identities = 8/24 (33%), Positives = 15/24 (62%)

Query: 142 KNKNKNKKKKKKKKKKKKKKKKKK 165
            + ++ KK+KK+ K + K+   KK
Sbjct: 57  ADPSRKKKEKKEVKAESKRYNAKK 80


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 32.3 bits (74), Expect = 0.13
 Identities = 18/56 (32%), Positives = 23/56 (41%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           + K+K  +K      K  K   K KKKKKK+K       L   + S K  V   E 
Sbjct: 511 DSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEED 566



 Score = 31.6 bits (72), Expect = 0.27
 Identities = 15/40 (37%), Positives = 20/40 (50%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             ++  KKK+ +K  K   K  K   K KKKKKK+K    
Sbjct: 507 AKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDL 546



 Score = 31.6 bits (72), Expect = 0.28
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           KK+ K + +K++   K +  KK+K+K KK K
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKLK 657



 Score = 31.2 bits (71), Expect = 0.38
 Identities = 19/47 (40%), Positives = 26/47 (55%), Gaps = 5/47 (10%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK-----KKKTKKFL 175
           +G KK KKK K K +   K +  KK+K+K KK K     +K+ KK  
Sbjct: 624 DGIKKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKNVIINEKRNKKAA 670



 Score = 30.8 bits (70), Expect = 0.48
 Identities = 14/35 (40%), Positives = 22/35 (62%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           G+  +K+K K K K++   K +  KK+K+K KK K
Sbjct: 623 GDGIKKRKKKRKRKRRFLTKIEGVKKEKRKDKKLK 657



 Score = 28.9 bits (65), Expect = 2.1
 Identities = 15/38 (39%), Positives = 16/38 (42%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             K  KK  +K K   K  K   K  K   K KKK KK
Sbjct: 503 VGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKK 540



 Score = 28.5 bits (64), Expect = 2.7
 Identities = 12/49 (24%), Positives = 18/49 (36%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
           K   K +K   K KKKKKK+K          ++ + K     +      
Sbjct: 522 KAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDE 570



 Score = 27.3 bits (61), Expect = 6.7
 Identities = 14/43 (32%), Positives = 17/43 (39%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
                 K  KK++  K    K  K   K  K   K KKKK K+
Sbjct: 499 TTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKE 541


>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694).  Family of
           hypothetical bacterial proteins.
          Length = 278

 Score = 32.2 bits (74), Expect = 0.13
 Identities = 16/47 (34%), Positives = 28/47 (59%)

Query: 124 NLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
            ++V     L  E  E  +  ++ ++  K+KKKKKKKKKKK++ ++ 
Sbjct: 59  EIFVQEKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREW 105



 Score = 27.6 bits (62), Expect = 4.1
 Identities = 13/37 (35%), Positives = 25/37 (67%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           +EK+K  +++ +  ++  + ++  K+KKKKKKK KK 
Sbjct: 63  QEKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKK 99



 Score = 27.6 bits (62), Expect = 4.4
 Identities = 14/37 (37%), Positives = 25/37 (67%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           +KEK  +++    ++  + ++  K+KKKKKKKKK K+
Sbjct: 64  EKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKE 100


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 32.2 bits (73), Expect = 0.15
 Identities = 15/47 (31%), Positives = 25/47 (53%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
           E E+K+KK+ K K K+ KK K  +K+ K K + ++      +P    
Sbjct: 17  ELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSE 63



 Score = 32.2 bits (73), Expect = 0.17
 Identities = 19/58 (32%), Positives = 29/58 (50%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           E EKK   + + + KKKK++K K+K+ KK K  +K+   K        G    K  E+
Sbjct: 7   EAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEK 64



 Score = 31.8 bits (72), Expect = 0.25
 Identities = 16/64 (25%), Positives = 33/64 (51%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
                    ++E ++ K K +K K+K+ KK K  +K+ K K + ++    ++   K+  K
Sbjct: 6   SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKK 65

Query: 188 SREQ 191
           SR++
Sbjct: 66  SRKR 69



 Score = 31.0 bits (70), Expect = 0.44
 Identities = 15/41 (36%), Positives = 26/41 (63%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           E E ++K   + + ++KKKK++K K+K+ KK K  +K  K 
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKA 45


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 31.5 bits (72), Expect = 0.15
 Identities = 18/39 (46%), Positives = 26/39 (66%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
            +E +K E++K + +N KK+K  KK KKKKK KKK +  
Sbjct: 168 KVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKKGRIL 206



 Score = 29.6 bits (67), Expect = 0.80
 Identities = 20/40 (50%), Positives = 27/40 (67%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           EK EKKK + + K+++  KK+K  KK KKKKK KK  + L
Sbjct: 167 EKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKKGRIL 206



 Score = 29.6 bits (67), Expect = 0.87
 Identities = 12/53 (22%), Positives = 30/53 (56%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
           +++++ + + K +KK+K+K+ KKK+  +K + +  K          A +++ E
Sbjct: 9   EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAE 61


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
           homolog A (Rrp7A) and similar proteins.  The family
           corresponds to the RRP7 domain of Rrp7A, also termed
           gastric cancer antigen Zg14, and similar proteins which
           are yeast ribosomal RNA-processing protein 7 (Rrp7p)
           homologs mainly found in Metazoans. The cellular
           function of Rrp7A remains unclear currently. Rrp7A
           harbors an N-terminal RNA recognition motif (RRM), also
           termed RBD (RNA binding domain) or RNP
           (ribonucleoprotein domain), and a C-terminal RRP7
           domain.
          Length = 129

 Score = 30.7 bits (70), Expect = 0.18
 Identities = 15/41 (36%), Positives = 22/41 (53%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           +G     KK +     +K+    K  +K+KKKKKKK+ K F
Sbjct: 50  DGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKELKNF 90



 Score = 26.5 bits (59), Expect = 5.6
 Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 144 KNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           K   + K  +K+    K  +K+KKKKK K+  
Sbjct: 57  KKGRRPKTARKESVAAKAAEKEKKKKKKKELK 88


>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
           ribosomal structure and biogenesis / Posttranslational
           modification, protein turnover, chaperones].
          Length = 379

 Score = 31.9 bits (72), Expect = 0.19
 Identities = 14/45 (31%), Positives = 21/45 (46%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
              AL+G+ + K K + + +        KKK K+  KK  K  KK
Sbjct: 258 ALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKMEKK 302


>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3.  The SLD3 DNA
           replication regulator is required for loading and
           maintenance of Cdc45 on chromatin during DNA
           replication.
          Length = 437

 Score = 31.7 bits (72), Expect = 0.20
 Identities = 13/42 (30%), Positives = 24/42 (57%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           R    E E K+K   +++N  + K++K++K+KK KK+     
Sbjct: 122 RFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVL 163



 Score = 30.6 bits (69), Expect = 0.50
 Identities = 15/45 (33%), Positives = 28/45 (62%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
           ALE ++  K + + K K  K+ +   + K++K++K+KK  K+ LP
Sbjct: 117 ALELDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLP 161



 Score = 27.9 bits (62), Expect = 3.9
 Identities = 12/37 (32%), Positives = 23/37 (62%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           EK+ KKK   +++   + K++K++K+KK KK+     
Sbjct: 127 EKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVL 163


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 31.7 bits (72), Expect = 0.20
 Identities = 10/40 (25%), Positives = 15/40 (37%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E +KK   + K K   + K    K   + K   +K    K
Sbjct: 206 EAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAK 245



 Score = 30.5 bits (69), Expect = 0.49
 Identities = 11/38 (28%), Positives = 15/38 (39%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E + KKK   + KKK   + K    K   + K    K 
Sbjct: 204 EAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKA 241



 Score = 30.5 bits (69), Expect = 0.54
 Identities = 10/40 (25%), Positives = 15/40 (37%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E E K+K   + K K   + K    K   + K   +K   
Sbjct: 204 EAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAA 243



 Score = 30.2 bits (68), Expect = 0.64
 Identities = 9/38 (23%), Positives = 14/38 (36%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             + KKK   + K    K   + K   +K    K  +K
Sbjct: 212 AAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEK 249



 Score = 29.4 bits (66), Expect = 1.1
 Identities = 11/43 (25%), Positives = 17/43 (39%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            A E +KK + + K K   + KKK   + K    K   +    
Sbjct: 195 AAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAA 237



 Score = 28.6 bits (64), Expect = 1.9
 Identities = 10/42 (23%), Positives = 15/42 (35%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           A E +KK   + K    K   + K   +K    K  +K    
Sbjct: 212 AAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAA 253



 Score = 28.2 bits (63), Expect = 2.4
 Identities = 8/42 (19%), Positives = 13/42 (30%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           A   +K   +      K   + K   +K    K  +K    K
Sbjct: 213 AEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAK 254



 Score = 27.1 bits (60), Expect = 6.6
 Identities = 8/41 (19%), Positives = 12/41 (29%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           A    KK+          K   + K   +K    K  +K  
Sbjct: 211 AAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAA 251


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 31.3 bits (71), Expect = 0.22
 Identities = 18/62 (29%), Positives = 22/62 (35%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGV 186
                  + EKK KK       K K K K K + K KK   K   K    P+ P+     
Sbjct: 82  PKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKAPAAPNQPARPPSA 141

Query: 187 KS 188
            S
Sbjct: 142 AS 143



 Score = 30.9 bits (70), Expect = 0.27
 Identities = 23/62 (37%), Positives = 26/62 (41%), Gaps = 1/62 (1%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP-PSDPSGKAGV 186
           P+P        E    K K K K +KK KK K K K K K K   K  P P  P  K   
Sbjct: 67  PKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAA 126

Query: 187 KS 188
           K+
Sbjct: 127 KA 128



 Score = 29.0 bits (65), Expect = 1.5
 Identities = 22/62 (35%), Positives = 26/62 (41%), Gaps = 1/62 (1%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGV 186
            P         K K+K K + K KK K K K K K K K K +   K  PPS  + KA  
Sbjct: 72  EPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPK-KPPSKTAAKAPA 130

Query: 187 KS 188
             
Sbjct: 131 AP 132



 Score = 26.3 bits (58), Expect = 9.4
 Identities = 16/62 (25%), Positives = 23/62 (37%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGV 186
             +P   + + K K K K K K + K KK   K   K      +  +    +  SG A  
Sbjct: 91  EKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKAPAAPNQPARPPSAASASGAATG 150

Query: 187 KS 188
            S
Sbjct: 151 PS 152


>gnl|CDD|240380 PTZ00362, PTZ00362, hypothetical protein; Provisional.
          Length = 479

 Score = 31.7 bits (72), Expect = 0.24
 Identities = 13/37 (35%), Positives = 16/37 (43%)

Query: 149 KKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
           KKKKKKKKK  K+     +  K   F+       K  
Sbjct: 9   KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRH 45



 Score = 28.6 bits (64), Expect = 2.3
 Identities = 13/24 (54%), Positives = 15/24 (62%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKT 171
           KKKKKKKKK  K+     +  KKT
Sbjct: 9   KKKKKKKKKASKESSNGSQNSKKT 32



 Score = 27.0 bits (60), Expect = 6.3
 Identities = 11/39 (28%), Positives = 17/39 (43%), Gaps = 5/39 (12%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKK-----KKKKKKKTKK 173
           KKK K K K  K+     +  KK        ++KK+  +
Sbjct: 9   KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRHNE 47


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 31.3 bits (72), Expect = 0.26
 Identities = 13/43 (30%), Positives = 24/43 (55%), Gaps = 2/43 (4%)

Query: 134 EGEKKEKKKNKNKNKKKKKKK--KKKKKKKKKKKKKKKKTKKF 174
           + +K      K     + +K+  K  +KK+KKKKK+K+K ++ 
Sbjct: 595 QLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEEL 637



 Score = 30.6 bits (70), Expect = 0.55
 Identities = 15/33 (45%), Positives = 23/33 (69%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
             E  E +K  NK  +KK+KKKKK+K+K+++ K
Sbjct: 606 AHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638



 Score = 28.3 bits (64), Expect = 3.1
 Identities = 13/23 (56%), Positives = 16/23 (69%)

Query: 149 KKKKKKKKKKKKKKKKKKKKKKT 171
           +K +K KKKKKKK K  K K +T
Sbjct: 680 EKIQKPKKKKKKKPKTVKPKPRT 702



 Score = 27.9 bits (63), Expect = 3.7
 Identities = 16/55 (29%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK---KTKKFLPPSDPSGKAGVKSRE 190
           E+KK K + ++ K  ++ +K+ ++  K+ KK   +  K L      G A VK+ E
Sbjct: 554 EEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHE 608



 Score = 27.9 bits (63), Expect = 4.4
 Identities = 13/33 (39%), Positives = 22/33 (66%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
             +  +  K  NK  +KK+KKKKK+K+K+++ K
Sbjct: 606 AHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 31.2 bits (71), Expect = 0.26
 Identities = 15/55 (27%), Positives = 25/55 (45%), Gaps = 1/55 (1%)

Query: 127 VPRPVALE-GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
              PV +   E K K K K K K  KK +++ K++ K  + +     +   P+ P
Sbjct: 86  KEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTAPARP 140



 Score = 27.0 bits (60), Expect = 5.8
 Identities = 16/54 (29%), Positives = 23/54 (42%), Gaps = 1/54 (1%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
           P P+  E  K+          K K K K K K  KK +++ K+  K + P   S
Sbjct: 78  PEPIP-EPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPAS 130


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
            found at the N terminus of SMC proteins. The SMC
            (structural maintenance of chromosomes) superfamily
            proteins have ATP-binding domains at the N- and
            C-termini, and two extended coiled-coil domains separated
            by a hinge in the middle. The eukaryotic SMC proteins
            form two kind of heterodimers: the SMC1/SMC3 and the
            SMC2/SMC4 types. These heterodimers constitute an
            essential part of higher order complexes, which are
            involved in chromatin and DNA dynamics. This family also
            includes the RecF and RecN proteins that are involved in
            DNA metabolism and recombination.
          Length = 1162

 Score = 31.5 bits (71), Expect = 0.27
 Identities = 14/50 (28%), Positives = 32/50 (64%)

Query: 126  YVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
             V      E E+KE++ NK++ KK++ +++KK+  ++  ++  ++ K+FL
Sbjct: 975  NVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFL 1024



 Score = 27.2 bits (60), Expect = 7.3
 Identities = 8/39 (20%), Positives = 26/39 (66%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
            L  E+++ +K K + +K++K K ++++ +  +++ K++
Sbjct: 776 ELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEE 814



 Score = 26.9 bits (59), Expect = 8.4
 Identities = 9/40 (22%), Positives = 26/40 (65%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
            ++E+K  K K +++K++K K ++++ +  +++ K +  L
Sbjct: 778 AEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAEL 817


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 31.2 bits (71), Expect = 0.27
 Identities = 16/49 (32%), Positives = 26/49 (53%), Gaps = 8/49 (16%)

Query: 135 GEKKEKKKNKNK---NKKKKKKK-----KKKKKKKKKKKKKKKKTKKFL 175
           G +K KK  + K   NK+K +KK       KK+ + K+K+  +  +K L
Sbjct: 258 GSRKGKKDKEGKSTTNKEKARKKNFMMTLHKKRVRGKQKRSLRDKQKVL 306



 Score = 29.6 bits (67), Expect = 0.84
 Identities = 11/43 (25%), Positives = 20/43 (46%), Gaps = 10/43 (23%)

Query: 138 KEKKKNKN----------KNKKKKKKKKKKKKKKKKKKKKKKK 170
           KEK + KN          + K+K+  + K+K  +    ++KK 
Sbjct: 274 KEKARKKNFMMTLHKKRVRGKQKRSLRDKQKVLRAHILRQKKG 316



 Score = 28.9 bits (65), Expect = 1.7
 Identities = 10/45 (22%), Positives = 20/45 (44%), Gaps = 10/45 (22%)

Query: 135 GEKKEKKKN----------KNKNKKKKKKKKKKKKKKKKKKKKKK 169
            ++K +KKN          + K K+  + K+K  +    ++KK  
Sbjct: 273 NKEKARKKNFMMTLHKKRVRGKQKRSLRDKQKVLRAHILRQKKGG 317



 Score = 26.9 bits (60), Expect = 7.3
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           KE ++++ K   +K KK K+ K    K+K +KK
Sbjct: 248 KEGREDREKFGSRKGKKDKEGKSTTNKEKARKK 280


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
           non-specific RNA binding region part 2.  This is a
           region found N terminal to the catalytic domain of
           glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
           but not in Escherichia coli. This region is thought to
           bind RNA in a non-specific manner, enhancing
           interactions between the tRNA and enzyme, but is not
           essential for enzyme function.
          Length = 83

 Score = 29.6 bits (67), Expect = 0.27
 Identities = 14/34 (41%), Positives = 15/34 (44%)

Query: 150 KKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
             KKKKKKKKKK +      K KK        G 
Sbjct: 22  LVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGA 55



 Score = 29.6 bits (67), Expect = 0.28
 Identities = 13/25 (52%), Positives = 14/25 (56%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTK 172
            KKKKKKKKKK +      K KK  
Sbjct: 23  VKKKKKKKKKKAEDTAATAKAKKAT 47



 Score = 29.3 bits (66), Expect = 0.38
 Identities = 12/26 (46%), Positives = 13/26 (50%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKK 173
             KKKKKKKKKK +      K K   
Sbjct: 22  LVKKKKKKKKKKAEDTAATAKAKKAT 47



 Score = 29.3 bits (66), Expect = 0.40
 Identities = 14/30 (46%), Positives = 17/30 (56%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
           KKKKKKKKKK +      K KK T + +  
Sbjct: 24  KKKKKKKKKKAEDTAATAKAKKATAEDVSE 53



 Score = 27.7 bits (62), Expect = 1.4
 Identities = 15/44 (34%), Positives = 18/44 (40%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           K +    KKKKKKKKKK +    T K    +      G    E 
Sbjct: 17  KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEG 60



 Score = 27.3 bits (61), Expect = 1.8
 Identities = 12/40 (30%), Positives = 17/40 (42%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
            KK+KKK K K +      K KK   +   +     + FL
Sbjct: 23  VKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEGFL 62



 Score = 26.2 bits (58), Expect = 3.7
 Identities = 15/36 (41%), Positives = 16/36 (44%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
            L G K E    K K KKKKKK +      K KK  
Sbjct: 12  ELLGPKTEADLVKKKKKKKKKKAEDTAATAKAKKAT 47


>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
           SecD/SecF; Reviewed.
          Length = 755

 Score = 31.4 bits (72), Expect = 0.27
 Identities = 11/23 (47%), Positives = 17/23 (73%)

Query: 149 KKKKKKKKKKKKKKKKKKKKKKT 171
           +K++ KKKKK+KK KK + +K  
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755



 Score = 30.6 bits (70), Expect = 0.46
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 153 KKKKKKKKKKKKKKKKKKTKK 173
           +K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 30.6 bits (70), Expect = 0.50
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 148 KKKKKKKKKKKKKKKKKKKKK 168
           +K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 30.2 bits (69), Expect = 0.71
 Identities = 10/21 (47%), Positives = 16/21 (76%)

Query: 146 KNKKKKKKKKKKKKKKKKKKK 166
           + ++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 30.2 bits (69), Expect = 0.74
 Identities = 11/21 (52%), Positives = 16/21 (76%)

Query: 152 KKKKKKKKKKKKKKKKKKKTK 172
           +K++ KKKKK+KK KK +  K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 29.0 bits (66), Expect = 1.5
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 125 LYVPRPVALEGEKKEKKKNKNKNKKKKKKKKK 156
           +++  P+ L+ EK+  KK K + K KK + +K
Sbjct: 722 IFIAAPLWLDLEKRRLKKKKKRKKVKKWEVEK 753



 Score = 28.7 bits (65), Expect = 2.0
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 144 KNKNKKKKKKKKKKKKKKKKK 164
           + +  KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 27.5 bits (62), Expect = 5.7
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 139 EKKKNKNKNKKKKKKKKKKKK 159
           EK++ K K K+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 31.1 bits (71), Expect = 0.29
 Identities = 11/38 (28%), Positives = 21/38 (55%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           ++E K+ + + +  + +KK  +KK KK  KK K  +  
Sbjct: 75  EEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEAL 112



 Score = 29.9 bits (68), Expect = 0.79
 Identities = 14/40 (35%), Positives = 21/40 (52%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           E KE +  K   + +KK  +KK KK  KK K ++   + L
Sbjct: 77  ELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEEL 116


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 30.4 bits (69), Expect = 0.29
 Identities = 13/64 (20%), Positives = 29/64 (45%), Gaps = 9/64 (14%)

Query: 136 EKKEKKKNKNKN---------KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGV 186
           EKK+ + N +               +++KK   +K K+K ++++ K  L   + +    V
Sbjct: 32  EKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTV 91

Query: 187 KSRE 190
           K+ +
Sbjct: 92  KATK 95


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 31.0 bits (71), Expect = 0.29
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKK 162
            +KEKKK  +K +KK  K++K K ++ 
Sbjct: 97  AEKEKKKAMSKEEKKAIKEEKDKLEEP 123



 Score = 27.9 bits (63), Expect = 3.0
 Identities = 10/28 (35%), Positives = 17/28 (60%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
           E +K K K   K++KK  K++K K ++ 
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123



 Score = 27.9 bits (63), Expect = 3.1
 Identities = 10/28 (35%), Positives = 19/28 (67%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
           + EK+K K  +K++KK  K++K K ++ 
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123



 Score = 26.8 bits (60), Expect = 5.9
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           +K+KKK   K++KK  K++K K  + +
Sbjct: 98  EKEKKKAMSKEEKKAIKEEKDKLEEPY 124


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 30.5 bits (69), Expect = 0.30
 Identities = 16/31 (51%), Positives = 18/31 (58%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
           K  KKKK   +KKK  KK KKKK +K   P 
Sbjct: 49  KTSKKKKTTPRKKKTTKKTKKKKKEKEEVPE 79



 Score = 30.2 bits (68), Expect = 0.51
 Identities = 17/31 (54%), Positives = 20/31 (64%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
           K   KKKK   +KKK  KK KKKKK K+ +P
Sbjct: 48  KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVP 78



 Score = 29.0 bits (65), Expect = 1.1
 Identities = 16/33 (48%), Positives = 21/33 (63%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           KK  KK K   +KKK  KK KKKKK+K++  + 
Sbjct: 48  KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPEL 80



 Score = 29.0 bits (65), Expect = 1.2
 Identities = 15/32 (46%), Positives = 18/32 (56%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
           KK  KKKK   +KKK  KK KKK K+     +
Sbjct: 48  KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPE 79



 Score = 29.0 bits (65), Expect = 1.3
 Identities = 16/45 (35%), Positives = 22/45 (48%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
           +   KK   K K   +KKK  KK KKKKK+K++  +        S
Sbjct: 44  EIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDS 88



 Score = 28.6 bits (64), Expect = 1.7
 Identities = 18/47 (38%), Positives = 23/47 (48%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSG 182
           E   KK +K K    +KKK  KK KKKKK+K++         SD   
Sbjct: 44  EIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEE 90


>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
           peptidyl-prolyl isomerase.  This family is only found at
           the amino terminus of pfam00254. This domain is of
           unknown function.
          Length = 124

 Score = 30.2 bits (69), Expect = 0.31
 Identities = 13/38 (34%), Positives = 23/38 (60%), Gaps = 2/38 (5%)

Query: 150 KKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
           KK + K++ K +KK ++ K   + FL  ++ + K GVK
Sbjct: 79  KKLQAKQQAKAEKKAEENKAAGEAFL--AENAKKEGVK 114


>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755).  This
           family is predominated by ORFs from Circoviridae. The
           function of this family remains to be determined.
          Length = 122

 Score = 30.1 bits (68), Expect = 0.31
 Identities = 12/39 (30%), Positives = 23/39 (58%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            ++KE++K+   +KKK+KK +    +  KK   KK+ + 
Sbjct: 66  RKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRS 104



 Score = 29.7 bits (67), Expect = 0.47
 Identities = 12/37 (32%), Positives = 20/37 (54%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            + K+K + K+ +  KKK+KK +    +  KK   KK
Sbjct: 64  PQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKK 100



 Score = 26.6 bits (59), Expect = 5.4
 Identities = 13/45 (28%), Positives = 22/45 (48%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
           +K K++ K+    KKK+KK +    +  KK   K ++    S  S
Sbjct: 66  RKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSS 110



 Score = 25.8 bits (57), Expect = 8.6
 Identities = 12/49 (24%), Positives = 25/49 (51%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
           L+  ++++K+ +  ++  KKK+KK +    +  KK    K+    S  S
Sbjct: 61  LQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSS 109


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 31.3 bits (71), Expect = 0.32
 Identities = 13/56 (23%), Positives = 21/56 (37%), Gaps = 1/56 (1%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           E+ EK K   K K+  +    +  KK  K  +         P+ P  K+   +R  
Sbjct: 42  EQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPR-KSAESTRSS 96


>gnl|CDD|180505 PRK06278, PRK06278, cobyrinic acid a,c-diamide synthase; Validated.
          Length = 476

 Score = 31.2 bits (71), Expect = 0.32
 Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 3/48 (6%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF---LPPSDPSGK 183
           EK++   KNK  K+K K +       KK+++   K    L     SGK
Sbjct: 205 EKEEIFKKNKILKEKLKSRSGISTNNKKEERNKPKGIILLATGSESGK 252


>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
          Length = 801

 Score = 31.3 bits (71), Expect = 0.32
 Identities = 12/45 (26%), Positives = 17/45 (37%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           R    E  + +K   + K K      KKK+ KK     K K   +
Sbjct: 591 RAARAEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKAAAE 635


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 31.1 bits (71), Expect = 0.33
 Identities = 14/69 (20%), Positives = 29/69 (42%), Gaps = 1/69 (1%)

Query: 105 FLEKLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
           ++E+L    + +++  L  + +      ++    + K  K K K       K +  ++ K
Sbjct: 326 YIERLKKAKTKKTQTRL-DSFFTATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAK 384

Query: 165 KKKKKKTKK 173
              KKK KK
Sbjct: 385 SSGKKKVKK 393



 Score = 30.0 bits (68), Expect = 0.72
 Identities = 16/43 (37%), Positives = 21/43 (48%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
              KK  KK   K K KKKKKK     +P S+ S +A    ++
Sbjct: 347 TATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKK 389



 Score = 26.9 bits (60), Expect = 6.8
 Identities = 13/47 (27%), Positives = 16/47 (34%)

Query: 144 KNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
               K  KK   K K KKKKKK       K     +       K ++
Sbjct: 347 TATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393


>gnl|CDD|236698 PRK10475, PRK10475, 23S rRNA pseudouridine synthase F; Provisional.
          Length = 290

 Score = 30.9 bits (70), Expect = 0.34
 Identities = 13/46 (28%), Positives = 18/46 (39%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
           E K K K K K    K+   K +K  +K  +     K+F  P    
Sbjct: 242 EAKPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKK 287



 Score = 28.5 bits (64), Expect = 2.2
 Identities = 8/40 (20%), Positives = 14/40 (35%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
            +P     ++   K  K   K  +     K+     +KKK
Sbjct: 249 AKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKKK 288


>gnl|CDD|191868 pfam07839, CaM_binding, Plant calmodulin-binding domain.  The
           sequences featured in this family are found repeated in
           a number of plant calmodulin-binding proteins, and are
           thought to constitute the calmodulin-binding domains.
           Binding of the proteins to calmodulin depends on the
           presence of calcium ions. These proteins are thought to
           be involved in various processes, such as plant defence
           responses and stolonisation or tuberization.
          Length = 115

 Score = 29.7 bits (67), Expect = 0.36
 Identities = 15/68 (22%), Positives = 24/68 (35%)

Query: 124 NLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
                R    + + +++K     N KK    K+  K  +K +K   +   FLP       
Sbjct: 4   KKEERRSKKEDPKPEKEKPKSWSNLKKVILLKRFVKSLEKVRKFNPRGPNFLPLEPDPEA 63

Query: 184 AGVKSREQ 191
             V  R Q
Sbjct: 64  EKVDLRHQ 71


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
           (DUF1764).  This is a family of eukaryotic proteins of
           unknown function. This family contains many hypothetical
           proteins.
          Length = 98

 Score = 29.4 bits (66), Expect = 0.38
 Identities = 15/43 (34%), Positives = 18/43 (41%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
              KK K K K+  K  + K  KK +KK KKK      P    
Sbjct: 19  SNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESK 61



 Score = 29.0 bits (65), Expect = 0.52
 Identities = 15/42 (35%), Positives = 21/42 (50%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
           KK+ K K +  K  + K  KK +KK KKK +  +F   S   
Sbjct: 22  KKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRR 63



 Score = 28.6 bits (64), Expect = 0.85
 Identities = 13/39 (33%), Positives = 19/39 (48%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
              K++KK K +  K  + K  KK +KK KKK +     
Sbjct: 19  SNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFP 57



 Score = 28.2 bits (63), Expect = 1.1
 Identities = 12/40 (30%), Positives = 21/40 (52%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
           K++KKK +     + K  KK +KK KKK +  +  ++   
Sbjct: 23  KRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKR 62



 Score = 27.8 bits (62), Expect = 1.4
 Identities = 15/42 (35%), Positives = 20/42 (47%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
           KK KKK K   K  + K  KK +KK KKK +  +  +     
Sbjct: 22  KKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRR 63



 Score = 27.8 bits (62), Expect = 1.6
 Identities = 11/41 (26%), Positives = 22/41 (53%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
              +  + K   K +KK KKK +  +  ++ K+++T+  LP
Sbjct: 30  RTAKTARPKATKKGQKKDKKKDEFPEFPEESKRRRTEDGLP 70



 Score = 27.0 bits (60), Expect = 2.5
 Identities = 17/42 (40%), Positives = 20/42 (47%)

Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
            N  K KKKKK+  K  + K  KK +KK  KK   P  P   
Sbjct: 19  SNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEES 60



 Score = 25.9 bits (57), Expect = 7.5
 Identities = 18/54 (33%), Positives = 25/54 (46%), Gaps = 1/54 (1%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           KE+KKN+  +K+         KK+KKKKK+  KT +         K   K  E 
Sbjct: 1   KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATK-KGQKKDKKKDEF 53


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 30.3 bits (69), Expect = 0.39
 Identities = 10/42 (23%), Positives = 27/42 (64%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           V +E   +E++ ++   K+  K K++K+++ ++K+K+  K +
Sbjct: 109 VEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQ 150



 Score = 29.9 bits (68), Expect = 0.43
 Identities = 9/42 (21%), Positives = 29/42 (69%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           E E+ ++++  ++  +K+  K K++K+++ ++K+K+  K+ +
Sbjct: 110 EVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQM 151



 Score = 28.4 bits (64), Expect = 1.3
 Identities = 10/37 (27%), Positives = 24/37 (64%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           E E+ ++   K   K K++K+++ ++K+K+  K++ K
Sbjct: 116 EEEQIDELLEKELAKLKREKRRENERKQKEILKEQMK 152



 Score = 28.4 bits (64), Expect = 1.5
 Identities = 9/37 (24%), Positives = 24/37 (64%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           E E +E  + +  ++  +K+  K K++K+++ ++K+K
Sbjct: 108 EVEVEELDEEEQIDELLEKELAKLKREKRRENERKQK 144



 Score = 28.4 bits (64), Expect = 1.7
 Identities = 8/39 (20%), Positives = 22/39 (56%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
                 E  E+++     +K+  K K++K+++ ++K+K+
Sbjct: 107 EEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKE 145



 Score = 27.6 bits (62), Expect = 3.1
 Identities = 8/40 (20%), Positives = 25/40 (62%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E E + ++ ++ +   +  +K+  K K++K+++ ++K K+
Sbjct: 106 EEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKE 145



 Score = 26.9 bits (60), Expect = 5.2
 Identities = 8/40 (20%), Positives = 24/40 (60%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E E+ E ++   + +  +  +K+  K K++K+++ ++ +K
Sbjct: 105 EEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQK 144



 Score = 26.5 bits (59), Expect = 6.8
 Identities = 7/33 (21%), Positives = 20/33 (60%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
           E  + +    K++K+++ ++K+K+  K++ K  
Sbjct: 122 ELLEKELAKLKREKRRENERKQKEILKEQMKML 154



 Score = 26.5 bits (59), Expect = 7.2
 Identities = 8/27 (29%), Positives = 19/27 (70%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           K   K K++K+++ ++K+K+  K++ K
Sbjct: 126 KELAKLKREKRRENERKQKEILKEQMK 152


>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
           beta and delta phosphatidylinositol 3-kinases (PI3Ks).
           PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
           regulate cell processes such as cell growth,
           differentiation, proliferation, and motility.  PI3Ks
           work on phosphorylation of phosphatidylinositol,
           phosphatidylinositide (4)P (PtdIns (4)P),2 or
           PtdIns(4,5)P2. Specifically they phosphorylate the D3
           hydroxyl group of phosphoinositol lipids on the inositol
           ring. There are 3 classes of PI3Ks based on structure,
           regulation, and specificity. All classes contain a C2
           domain, a PIK domain, and a kinase catalytic domain.
           The members here are class I, beta and delta isoforms of
           PI3Ks and contain both a Ras-binding domain and a
           p85-binding domain.  Class II PI3Ks contain both of
           these as well as a PX domain, and a C-terminal C2 domain
           containing a nuclear localization signal.  C2 domains
           fold into an 8-standed beta-sandwich that can adopt 2
           structural arrangements: Type I and Type II,
           distinguished by a circular permutation involving their
           N- and C-terminal beta strands. Many C2 domains are
           Ca2+-dependent membrane-targeting modules that bind a
           wide variety of substances including bind phospholipids,
           inositol polyphosphates, and intracellular proteins.
           Most C2 domain proteins are either signal transduction
           enzymes that contain a single C2 domain, such as protein
           kinase C, or membrane trafficking proteins which contain
           at least two C2 domains, such as synaptotagmin 1.
           However, there are a few exceptions to this including
           RIM isoforms and some splice variants of piccolo/aczonin
           and intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions.  Members have a type-I
           topology.
          Length = 173

 Score = 30.4 bits (69), Expect = 0.40
 Identities = 11/25 (44%), Positives = 14/25 (56%)

Query: 153 KKKKKKKKKKKKKKKKKKTKKFLPP 177
             KK K K+ +K + KKK KK   P
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKKDDNP 109



 Score = 28.4 bits (64), Expect = 1.5
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 147 NKKKKKKKKKKKKKKKKKKKK 167
             KK K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 28.0 bits (63), Expect = 2.2
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 148 KKKKKKKKKKKKKKKKKKKKK 168
             KK K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 28.0 bits (63), Expect = 2.2
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 149 KKKKKKKKKKKKKKKKKKKKK 169
             KK K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 28.0 bits (63), Expect = 2.2
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 150 KKKKKKKKKKKKKKKKKKKKK 170
             KK K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 27.3 bits (61), Expect = 3.7
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query: 145 NKNKKKKKKKKKKKKKKKKKK 165
              K K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 26.9 bits (60), Expect = 5.4
 Identities = 11/25 (44%), Positives = 14/25 (56%)

Query: 152 KKKKKKKKKKKKKKKKKKKTKKFLP 176
             KK K K+ +K + KKKK K   P
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKKDDNP 109


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 30.8 bits (70), Expect = 0.41
 Identities = 9/40 (22%), Positives = 23/40 (57%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
              + E      + K+K K ++ K+K++K+ +++ K+ K+
Sbjct: 262 FESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQ 301



 Score = 30.4 bits (69), Expect = 0.51
 Identities = 12/40 (30%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E E   K   + K K K ++ K+K++K+ +++ K++K  K
Sbjct: 265 EYEPINKPV-RPKRKTKAQRNKEKRRKELEREAKEEKQLK 303



 Score = 29.3 bits (66), Expect = 1.2
 Identities = 11/36 (30%), Positives = 24/36 (66%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
               + K+K K +  K+K++K+ +++ K++K+ KKK
Sbjct: 270 NKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKK 305


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 31.0 bits (70), Expect = 0.42
 Identities = 14/22 (63%), Positives = 18/22 (81%)

Query: 152 KKKKKKKKKKKKKKKKKKKTKK 173
           +KKKKKKK K+KKKKK ++  K
Sbjct: 684 RKKKKKKKSKEKKKKKNREASK 705



 Score = 30.6 bits (69), Expect = 0.54
 Identities = 14/23 (60%), Positives = 20/23 (86%)

Query: 151 KKKKKKKKKKKKKKKKKKKKTKK 173
           +KKKKKKK K+KKKKK ++ +K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706



 Score = 29.8 bits (67), Expect = 0.87
 Identities = 14/23 (60%), Positives = 19/23 (82%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKK 170
           +KKKKKKK K+KKKKK ++  K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706



 Score = 28.7 bits (64), Expect = 2.0
 Identities = 13/22 (59%), Positives = 17/22 (77%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKK 167
           K KKKKK K+KKKKK ++  K+
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQ 706



 Score = 27.9 bits (62), Expect = 4.5
 Identities = 12/23 (52%), Positives = 17/23 (73%)

Query: 144 KNKNKKKKKKKKKKKKKKKKKKK 166
           + K KKKK K+KKKKK ++  K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706



 Score = 27.5 bits (61), Expect = 5.2
 Identities = 13/23 (56%), Positives = 18/23 (78%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKK 168
           + KKKKKK K+KKKKK ++  K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706



 Score = 27.1 bits (60), Expect = 6.4
 Identities = 11/23 (47%), Positives = 16/23 (69%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKK 162
           +KK K K  K+KKKKK ++  K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706



 Score = 26.8 bits (59), Expect = 9.6
 Identities = 13/25 (52%), Positives = 19/25 (76%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKK 159
           G +K+KKK K+K KKKKK ++  K+
Sbjct: 682 GSRKKKKKKKSKEKKKKKNREASKQ 706


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
           glycoprotein C-terminal domain.  The trypanosome
           parasite expresses these proteins to evade the immune
           response.
          Length = 98

 Score = 29.3 bits (66), Expect = 0.43
 Identities = 12/47 (25%), Positives = 19/47 (40%), Gaps = 7/47 (14%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKK-------KKKKKKKKKKKKKKKTKK 173
           + ++ + K    + K KK           +   +K K KK KK  KK
Sbjct: 19  DKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKK 65



 Score = 27.4 bits (61), Expect = 2.0
 Identities = 11/37 (29%), Positives = 14/37 (37%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
            K +K           +   +K K KK KK  KK  K
Sbjct: 32  GKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCK 68



 Score = 26.2 bits (58), Expect = 4.6
 Identities = 13/39 (33%), Positives = 17/39 (43%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
             + K K + K K  K  KK+   K K K+ K KK    
Sbjct: 2   NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAP 40



 Score = 26.2 bits (58), Expect = 5.2
 Identities = 13/36 (36%), Positives = 18/36 (50%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            +K K K + K+K  K  KK+   K K K+ K  K 
Sbjct: 2   NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKN 37



 Score = 26.2 bits (58), Expect = 5.5
 Identities = 16/52 (30%), Positives = 22/52 (42%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
            KK K K + K K  K  KK+   K K K+ K KK    +  +  +     K
Sbjct: 2   NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEK 53



 Score = 25.9 bits (57), Expect = 7.3
 Identities = 11/37 (29%), Positives = 13/37 (35%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
              KK           +   +K K KK KK  KK  K
Sbjct: 32  GKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCK 68


>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
          Length = 381

 Score = 30.4 bits (69), Expect = 0.46
 Identities = 12/44 (27%), Positives = 19/44 (43%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           P A +  K      +    K  +K K ++ +KK KK +  K  K
Sbjct: 330 PAAKKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAK 373



 Score = 30.0 bits (68), Expect = 0.70
 Identities = 15/51 (29%), Positives = 21/51 (41%)

Query: 123 LNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           LN    +   L    +EK   K   K K ++ +KK KK +  K  K   K 
Sbjct: 328 LNPAAKKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKALYKS 378



 Score = 28.1 bits (63), Expect = 3.3
 Identities = 10/35 (28%), Positives = 16/35 (45%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
             + K   K  +K K ++ +KK KK +  K  K  
Sbjct: 341 LAQEKATAKGAQKVKNRRARKKAKKARLAKVAKAL 375



 Score = 27.7 bits (62), Expect = 4.3
 Identities = 12/42 (28%), Positives = 17/42 (40%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            L  EK   K  +    ++ +KK KK +  K  K   K  K 
Sbjct: 340 VLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKALYKSIKA 381



 Score = 26.6 bits (59), Expect = 9.3
 Identities = 12/49 (24%), Positives = 21/49 (42%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
           KK  K     ++K   K  +K K ++ +KK  K  L     +    +K+
Sbjct: 333 KKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKALYKSIKA 381


>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
           topoisomerase IA [General function prediction only].
          Length = 298

 Score = 30.4 bits (69), Expect = 0.47
 Identities = 15/38 (39%), Positives = 15/38 (39%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
               KK  K     K KK   KK   KK  K KK  KK
Sbjct: 259 GGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKK 296



 Score = 28.5 bits (64), Expect = 2.2
 Identities = 12/40 (30%), Positives = 14/40 (35%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
           K    KK  KK    K KK   KK   K    +  + K  
Sbjct: 258 KGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKKA 297



 Score = 27.7 bits (62), Expect = 3.2
 Identities = 14/39 (35%), Positives = 16/39 (41%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           +G   +K   K    K KK   KK   KK  K KK   K
Sbjct: 258 KGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKK 296



 Score = 27.7 bits (62), Expect = 3.4
 Identities = 15/41 (36%), Positives = 17/41 (41%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           L   + +    K   KK    K KK   KK   KK  KTKK
Sbjct: 252 LAERRAKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKK 292



 Score = 27.4 bits (61), Expect = 4.5
 Identities = 14/40 (35%), Positives = 15/40 (37%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             +    KK   K    K KK   KK   KK  K KK  K
Sbjct: 256 RAKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAK 295


>gnl|CDD|197876 smart00792, Agouti, Agouti protein.  The agouti protein regulates
           pigmentation in the mouse hair follicle producing a
           black hair with a subapical yellow band. A highly
           homologous protein agouti signal protein (ASIP) is
           present in humans and is expressed at highest levels in
           adipose tissue where it may play a role in energy
           homeostasis and possibly human pigmentation.
          Length = 124

 Score = 29.5 bits (66), Expect = 0.47
 Identities = 16/44 (36%), Positives = 24/44 (54%)

Query: 147 NKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
           NKK KK   ++ +KK  +KK+KK     L P   S +  V+ R+
Sbjct: 51  NKKPKKISAEEAEKKLLQKKEKKALTNVLRPEPRSPRRCVRLRD 94


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 30.7 bits (70), Expect = 0.47
 Identities = 12/45 (26%), Positives = 20/45 (44%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
           KN  KK+ K K+ +++K  K++ K           S K+     E
Sbjct: 10  KNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDE 54



 Score = 30.0 bits (68), Expect = 0.90
 Identities = 8/34 (23%), Positives = 15/34 (44%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           KK  K  + + +K  K++ K        K + +K
Sbjct: 14  KKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRK 47



 Score = 28.4 bits (64), Expect = 2.8
 Identities = 10/40 (25%), Positives = 19/40 (47%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
           KK+ K K  +++K  K++ K        K +++K     D
Sbjct: 14  KKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADD 53



 Score = 26.9 bits (60), Expect = 8.8
 Identities = 9/36 (25%), Positives = 16/36 (44%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
           AL+   K K+  + K  K++ K        K + +K
Sbjct: 12  ALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRK 47


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p), ribosomal RNA-processing protein 7 homolog A
           (Rrp7A), and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
           YCL031C gene from Saccharomyces cerevisiae. It is an
           essential yeast protein involved in pre-rRNA processing
           and ribosome assembly, and is speculated to be required
           for correct assembly of rpS27 into the pre-ribosomal
           particle. Rrp7A, also termed gastric cancer antigen
           Zg14, is the Rrp7p homolog mainly found in Metazoans.
           The cellular function of Rrp7A remains unclear
           currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 118

 Score = 29.5 bits (67), Expect = 0.49
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           +  +K K   ++  + K K+K+KKKKKKK+ +
Sbjct: 58  RGGRKGKTAREEAVEAKAKEKEKKKKKKKELE 89



 Score = 29.5 bits (67), Expect = 0.51
 Identities = 11/34 (32%), Positives = 21/34 (61%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           +  +     +++  + K K+K+KKKKKKK+ + F
Sbjct: 58  RGGRKGKTAREEAVEAKAKEKEKKKKKKKELEDF 91



 Score = 29.1 bits (66), Expect = 0.68
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
              K K   +   + K K+K+KKKKKKK+ +
Sbjct: 59  GGRKGKTAREEAVEAKAKEKEKKKKKKKELE 89



 Score = 28.4 bits (64), Expect = 1.2
 Identities = 12/32 (37%), Positives = 20/32 (62%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
           +   K K  +++  + K K+K+KKKKKKK+  
Sbjct: 58  RGGRKGKTAREEAVEAKAKEKEKKKKKKKELE 89



 Score = 26.4 bits (59), Expect = 4.7
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
           +  +K    + +  + K K+K+KKKKKKK+ +
Sbjct: 58  RGGRKGKTAREEAVEAKAKEKEKKKKKKKELE 89


>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
          Length = 145

 Score = 29.7 bits (67), Expect = 0.55
 Identities = 15/31 (48%), Positives = 17/31 (54%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
           K    K KKK KK  K  KK  K KK+K +T
Sbjct: 1   KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRT 31



 Score = 29.3 bits (66), Expect = 0.82
 Identities = 15/38 (39%), Positives = 19/38 (50%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
           K   K   KK KK  K  KK  K KK+K + + +F  P
Sbjct: 1   KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRTSVRFFRP 38



 Score = 27.0 bits (60), Expect = 3.8
 Identities = 14/31 (45%), Positives = 16/31 (51%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
           K   KK K K KK  K  KK  K KK+K + 
Sbjct: 1   KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRT 31



 Score = 27.0 bits (60), Expect = 5.0
 Identities = 14/38 (36%), Positives = 18/38 (47%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
           +    K K K KK  K  KK  K KK+K +   + F P
Sbjct: 1   KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRTSVRFFRP 38


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
           Williams-Beuren syndrome.  This domain family is found
           in eukaryotes, and is typically between 72 and 83 amino
           acids in length. The family is found in association with
           pfam08241. This family is made up of
           S-adenosylmethionine-dependent methyltransferases. The
           proteins are deleted in Williams-Beuren syndrome (WBS),
           a complex developmental disorder with multisystemic
           manifestations including supravalvular aortic stenosis
           (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 28.8 bits (65), Expect = 0.55
 Identities = 14/36 (38%), Positives = 24/36 (66%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           + +  + ++   +N++KKKKKKK KKK K+   +KK
Sbjct: 26  QIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61



 Score = 28.4 bits (64), Expect = 0.74
 Identities = 15/37 (40%), Positives = 22/37 (59%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           E     K +  ++  ++KKKKKKK KKK K+   +KK
Sbjct: 25  EQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61



 Score = 28.0 bits (63), Expect = 0.86
 Identities = 16/44 (36%), Positives = 28/44 (63%), Gaps = 3/44 (6%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKK---KKKKKKKKKKTKK 173
           ++  K  +   +N+ KKKKKKK KKK K    +KK++ +++ K+
Sbjct: 27  IDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRRGKE 70



 Score = 28.0 bits (63), Expect = 0.96
 Identities = 15/40 (37%), Positives = 25/40 (62%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
             GE+ E+   +    K ++  ++ ++KKKKKKK KKK+K
Sbjct: 15  GLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54



 Score = 27.3 bits (61), Expect = 1.9
 Identities = 16/42 (38%), Positives = 29/42 (69%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           P  +  EGE+ +++ + +K ++  ++ ++KKKKKKK KKK K
Sbjct: 13  PNGLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54



 Score = 26.1 bits (58), Expect = 4.4
 Identities = 15/41 (36%), Positives = 25/41 (60%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           P  L  E +E  +  + +K ++  ++ ++KKKKKKK KKK 
Sbjct: 13  PNGLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKS 53


>gnl|CDD|234874 PRK00955, PRK00955, hypothetical protein; Provisional.
          Length = 620

 Score = 30.3 bits (69), Expect = 0.59
 Identities = 15/34 (44%), Positives = 18/34 (52%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
           L   KK KKK+  KN          KK+KKK+KK
Sbjct: 587 LIPPKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620



 Score = 29.1 bits (66), Expect = 1.4
 Identities = 13/31 (41%), Positives = 16/31 (51%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
            KK K K+ KK           KK+KKK+KK
Sbjct: 590 PKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620



 Score = 29.1 bits (66), Expect = 1.6
 Identities = 11/32 (34%), Positives = 15/32 (46%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
             +K K K+  K           KK+KKK+KK
Sbjct: 589 PPKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620



 Score = 27.6 bits (62), Expect = 4.7
 Identities = 13/33 (39%), Positives = 15/33 (45%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
              KK  K  NKK           KK+KKK+KK
Sbjct: 588 IPPKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 30.3 bits (68), Expect = 0.59
 Identities = 14/40 (35%), Positives = 23/40 (57%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           + E     KN+ + KKK+K++  KK   K KK++K   +K
Sbjct: 551 KTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEK 590



 Score = 28.4 bits (63), Expect = 2.7
 Identities = 13/38 (34%), Positives = 20/38 (52%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           KK++K+   K    K KK++K  K+K     KK +  F
Sbjct: 565 KKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSSAF 602



 Score = 27.3 bits (60), Expect = 5.9
 Identities = 12/36 (33%), Positives = 23/36 (63%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
            K  +K+K +  +  K ++++KKK+K++  KK  TK
Sbjct: 544 MKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTK 579



 Score = 26.9 bits (59), Expect = 7.5
 Identities = 14/55 (25%), Positives = 27/55 (49%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
            +K K +    +K ++++KKK+K++  KK   K K ++ +      G A   S  
Sbjct: 547 IEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSSA 601



 Score = 26.9 bits (59), Expect = 7.8
 Identities = 12/37 (32%), Positives = 22/37 (59%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           KK K   K+K +  +  K ++++KKK+K++  KK   
Sbjct: 542 KKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVT 578



 Score = 26.9 bits (59), Expect = 8.8
 Identities = 15/61 (24%), Positives = 28/61 (45%)

Query: 125 LYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
             V     L  E+  KK    +  K +  +  K ++++KKK+K++  KK +  +    K 
Sbjct: 527 TAVSDAALLAPEEIYKKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKI 586

Query: 185 G 185
           G
Sbjct: 587 G 587


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 29.7 bits (67), Expect = 0.61
 Identities = 14/38 (36%), Positives = 20/38 (52%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E+KE K N  K+K  K KK  KK   + +KK  +   +
Sbjct: 108 EEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAE 145



 Score = 28.2 bits (63), Expect = 2.2
 Identities = 13/37 (35%), Positives = 18/37 (48%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K+EK+   N  K K  K KK  KK   + +KK    +
Sbjct: 107 KEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEAR 143



 Score = 27.0 bits (60), Expect = 4.5
 Identities = 14/39 (35%), Positives = 17/39 (43%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           A    + E K    K +K+ K   KK K  K KK  KK 
Sbjct: 93  AFTEAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKA 131


>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
           (ETRAMP).  These sequences represent a family of
           proteins from the malaria parasite Plasmodium
           falciparum, several of which have been shown to be
           expressed specifically in the ring stage as well as the
           rodent parasite Plasmodium yoelii. A homologue from
           Plasmodium chabaudi was localized to the parasitophorous
           vacuole membrane. Members have an initial hydrophobic,
           Phe/Tyr-rich, stretch long enough to span the membrane,
           a highly charged region rich in Lys, a second putative
           transmembrane region and a second highly charged, low
           complexity sequence region. Some members have up to 100
           residues of additional C-terminal sequence. These genes
           have been shown to be found in the sub-telomeric regions
           of both Plasmodium falciparum and P. yoelii chromosomes.
          Length = 82

 Score = 28.4 bits (64), Expect = 0.64
 Identities = 15/29 (51%), Positives = 16/29 (55%)

Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
            N    KKK KK KK     +KKKK KKK
Sbjct: 25  NNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53



 Score = 28.4 bits (64), Expect = 0.67
 Identities = 14/29 (48%), Positives = 15/29 (51%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
                K KK KK KK     +KKKK KKK
Sbjct: 25  NNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53



 Score = 27.2 bits (61), Expect = 1.6
 Identities = 16/29 (55%), Positives = 17/29 (58%)

Query: 145 NKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           N N   KKKK KK KK     +KKKK KK
Sbjct: 24  NNNVVAKKKKLKKLKKIDDDLEKKKKNKK 52



 Score = 27.2 bits (61), Expect = 1.9
 Identities = 15/29 (51%), Positives = 16/29 (55%)

Query: 145 NKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           N    KKKK KK KK     +KKKK  KK
Sbjct: 25  NNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53



 Score = 26.8 bits (60), Expect = 2.8
 Identities = 16/30 (53%), Positives = 17/30 (56%)

Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           N N   KKKK KK KK     +KKKK K K
Sbjct: 24  NNNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53



 Score = 25.7 bits (57), Expect = 5.9
 Identities = 13/26 (50%), Positives = 14/26 (53%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKK 164
             KK K K  KK     +KKKK KKK
Sbjct: 28  VAKKKKLKKLKKIDDDLEKKKKNKKK 53



 Score = 25.7 bits (57), Expect = 7.2
 Identities = 12/27 (44%), Positives = 17/27 (62%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKK 159
           +  +KK+ KK K  +   +KKKK KKK
Sbjct: 27  VVAKKKKLKKLKKIDDDLEKKKKNKKK 53


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 30.2 bits (68), Expect = 0.65
 Identities = 14/27 (51%), Positives = 17/27 (62%)

Query: 147 NKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             K KK K KKK KK KKK+ K+ + K
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDK 169



 Score = 29.4 bits (66), Expect = 1.1
 Identities = 13/37 (35%), Positives = 22/37 (59%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
           G + + KK K K K KK KKK+ K+   K  +++ ++
Sbjct: 141 GIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESES 177



 Score = 29.4 bits (66), Expect = 1.3
 Identities = 14/29 (48%), Positives = 17/29 (58%)

Query: 145 NKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            + K KK K KKK KK KKK+ K+   K 
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSDKD 170



 Score = 28.3 bits (63), Expect = 2.4
 Identities = 14/44 (31%), Positives = 21/44 (47%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSG 182
           E K  K K KKK KK KKK+ K+   K  +++++          
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEES 186



 Score = 27.5 bits (61), Expect = 5.1
 Identities = 14/38 (36%), Positives = 20/38 (52%)

Query: 154 KKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           + K KK K KKK KK K K+    SD   +   +S ++
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDE 180



 Score = 26.7 bits (59), Expect = 8.2
 Identities = 15/52 (28%), Positives = 22/52 (42%), Gaps = 11/52 (21%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKK-----------KKKKKKKKKTKKF 174
             E    +  +  ++ K+ +KKK KK             KKK K+KK   KF
Sbjct: 365 AAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSWEAKKKAKEKKANAKF 416


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 30.4 bits (68), Expect = 0.68
 Identities = 28/94 (29%), Positives = 46/94 (48%), Gaps = 7/94 (7%)

Query: 87  DIGNRTEALLQLPRGRLVFLE--KLLPLLSNQSEDCLY-LNLYVP----RPVALEGEKKE 139
           D+   T   L L   ++ +LE  +L  L+   SE   Y + L V     + V  E E+  
Sbjct: 164 DMKRGTRITLHLKEDQMEYLEPRRLKELIKKHSEFIGYDIELMVEKTTEKEVTDEDEEDT 223

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           KK +++  + K ++ K+  + KKKK KK K+  K
Sbjct: 224 KKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTK 257


>gnl|CDD|222918 PHA02687, PHA02687, ORF061 late transcription factor VLTF-4;
           Provisional.
          Length = 231

 Score = 30.0 bits (67), Expect = 0.68
 Identities = 15/39 (38%), Positives = 19/39 (48%)

Query: 149 KKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
           K  K++ KKK K  K +K  K  +K  PP D   K   K
Sbjct: 84  KTPKRRTKKKAKADKPEKSPKAVEKLCPPDDRDDKNEEK 122


>gnl|CDD|146145 pfam03357, Snf7, Snf7.  This family of proteins are involved in
           protein sorting and transport from the endosome to the
           vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes
           play an important role in the degradation of both lipids
           and cellular proteins. In order to perform this
           degradative function, vacuoles/lysosomes contain
           numerous hydrolases which have been transported in the
           form of inactive precursors via the biosynthetic pathway
           and are proteolytically activated upon delivery to the
           vacuole/lysosome. The delivery of transmembrane
           proteins, such as activated cell surface receptors to
           the lumen of the vacuole/lysosome, either for
           degradation/downregulation, or in the case of
           hydrolases, for proper localisation, requires the
           formation of multivesicular bodies (MVBs). These late
           endosomal structures are formed by invaginating and
           budding of the limiting membrane into the lumen of the
           compartment. During this process, a subset of the
           endosomal membrane proteins is sorted into the forming
           vesicles. Mature MVBs fuse with the vacuole/lysosome,
           thereby releasing cargo containing vesicles into its
           hydrolytic lumen for degradation. Endosomal proteins
           that are not sorted into the intralumenal MVB vesicles
           are either recycled back to the plasma membrane or Golgi
           complex, or remain in the limiting membrane of the MVB
           and are thereby transported to the limiting membrane of
           the vacuole/lysosome as a consequence of fusion.
           Therefore, the MVB sorting pathway plays a critical role
           in the decision between recycling and degradation of
           membrane proteins. A few archaeal sequences are also
           present within this family.
          Length = 169

 Score = 29.5 bits (67), Expect = 0.69
 Identities = 14/40 (35%), Positives = 20/40 (50%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           + E  EKK  K + + KK  KK  K       K+KK+ +K
Sbjct: 16  KQESLEKKIKKLEAEIKKLAKKGNKDAALILLKQKKRYEK 55



 Score = 26.4 bits (59), Expect = 7.2
 Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 4/40 (10%)

Query: 136 EKKEKK-KNKNKNKKKKKKKKKKK---KKKKKKKKKKKKT 171
           EKK KK + + K   KK  K       K+KK+ +K+  + 
Sbjct: 21  EKKIKKLEAEIKKLAKKGNKDAALILLKQKKRYEKQLDQL 60


>gnl|CDD|236172 PRK08173, PRK08173, DNA topoisomerase III; Validated.
          Length = 862

 Score = 30.4 bits (69), Expect = 0.70
 Identities = 14/46 (30%), Positives = 16/46 (34%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           PR       KK   K       K +K   KK   KK   KK   +K
Sbjct: 815 PREPKAAAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKTAARK 860



 Score = 29.2 bits (66), Expect = 1.3
 Identities = 11/44 (25%), Positives = 16/44 (36%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
           P+  A +    +         +K   KK   KK   KK   +KT
Sbjct: 818 PKAAAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKTAARKT 861


>gnl|CDD|218585 pfam05424, Duffy_binding, Duffy binding domain.  This domain is
           found in Plasmodium Duffy binding proteins. Plasmodium
           vivax and Plasmodium knowlesi merozoites invade human
           erythrocytes that express Duffy blood group surface
           determinants. The Duffy receptor family is localised in
           micronemes, an organelle found in all organisms of the
           phylum Apicomplexa.
          Length = 276

 Score = 30.0 bits (68), Expect = 0.74
 Identities = 16/37 (43%), Positives = 20/37 (54%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
             KKK  +K KKK KK KK+  KK     K K  K++
Sbjct: 240 DNKKKEWDKQKKKYKKYKKRNNKKNYSDIKDKYAKEY 276


>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020).  This
           family of fungal proteins is conserved towards the
           C-terminus of HMG domains. The function is not known.
          Length = 49

 Score = 27.6 bits (61), Expect = 0.83
 Identities = 15/41 (36%), Positives = 22/41 (53%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           E +KK ++ N  +NK    + + KKK KKK  K+  K K  
Sbjct: 3   ERKKKWREANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDA 43


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 29.9 bits (67), Expect = 0.85
 Identities = 14/36 (38%), Positives = 20/36 (55%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           + E+ E++KN+ +    KK KK KK K KK    K 
Sbjct: 319 DSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354



 Score = 29.2 bits (65), Expect = 1.4
 Identities = 14/47 (29%), Positives = 24/47 (51%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           +P    +E ++  ++  + KN+++    KK KK KK K KK    K 
Sbjct: 308 IPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354



 Score = 28.4 bits (63), Expect = 2.4
 Identities = 16/36 (44%), Positives = 22/36 (61%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           E +K  +  ++ K+KKKKKK  K KKK    KK K+
Sbjct: 228 ESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKR 263



 Score = 27.2 bits (60), Expect = 6.7
 Identities = 15/41 (36%), Positives = 19/41 (46%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
           +  + K+K KKKK  K KKK    KK K+          SD
Sbjct: 235 DGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSD 275



 Score = 26.8 bits (59), Expect = 8.1
 Identities = 15/40 (37%), Positives = 21/40 (52%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
            +G++ +K       +K KKKKKK  K KKK    KK  +
Sbjct: 224 DDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKR 263


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 29.6 bits (67), Expect = 0.85
 Identities = 12/34 (35%), Positives = 17/34 (50%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
            ++E  K     KKK  K K  K   K KK++K+
Sbjct: 206 LQRELMKKGKGKKKKIVKDKDGKVVYKWKKERKR 239



 Score = 28.9 bits (65), Expect = 1.5
 Identities = 14/38 (36%), Positives = 20/38 (52%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           ++ E ++   K  K KKKK  K K  K   K KK+ K+
Sbjct: 202 QRLELQRELMKKGKGKKKKIVKDKDGKVVYKWKKERKR 239



 Score = 26.6 bits (59), Expect = 9.3
 Identities = 15/38 (39%), Positives = 19/38 (50%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           +K E++    +   KK K KKKK  K K  K   K KK
Sbjct: 198 KKVEQRLELQRELMKKGKGKKKKIVKDKDGKVVYKWKK 235


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
           Provisional.
          Length = 1164

 Score = 30.0 bits (67), Expect = 0.86
 Identities = 14/38 (36%), Positives = 21/38 (55%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
            +     KNK KNK+K K  KK +   K+ K+  +K+K
Sbjct: 214 IKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSK 251


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 29.3 bits (66), Expect = 0.87
 Identities = 15/43 (34%), Positives = 23/43 (53%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           P  + +   K+ KKK     +K    KK KKK K+K  +++KK
Sbjct: 40  PEDIEIPSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKK 82



 Score = 28.2 bits (63), Expect = 1.9
 Identities = 12/32 (37%), Positives = 17/32 (53%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
           K+ KKK+    +K    KK KKK K+ L   +
Sbjct: 49  KQPKKKRPTTPRKPATTKKSKKKDKEKLTEEE 80



 Score = 26.6 bits (59), Expect = 7.3
 Identities = 12/27 (44%), Positives = 14/27 (51%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           K  KKK+    +K    KK KKK K K
Sbjct: 49  KQPKKKRPTTPRKPATTKKSKKKDKEK 75



 Score = 26.3 bits (58), Expect = 8.5
 Identities = 13/40 (32%), Positives = 18/40 (45%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
            K+ K K     +K    KK KKK K+K  + +K     D
Sbjct: 48  PKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDD 87


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 29.8 bits (67), Expect = 0.90
 Identities = 13/38 (34%), Positives = 22/38 (57%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E + +KK K + KK+ +++ K K   + KKK  +  KK
Sbjct: 137 EAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKK 174



 Score = 28.6 bits (64), Expect = 1.9
 Identities = 12/38 (31%), Positives = 23/38 (60%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           + +EK+K   + K K+  + K K + + +KK K++ KK
Sbjct: 113 QAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKK 150



 Score = 28.6 bits (64), Expect = 2.1
 Identities = 14/53 (26%), Positives = 28/53 (52%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
           A E + K + + + K K++ KK+ +++ K K   + KKK  +    ++   KA
Sbjct: 129 AAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKA 181



 Score = 28.3 bits (63), Expect = 2.9
 Identities = 13/36 (36%), Positives = 18/36 (50%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           E E K K   + K K  + KKK + + K K + K K
Sbjct: 153 EEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAK 188



 Score = 27.9 bits (62), Expect = 3.4
 Identities = 13/38 (34%), Positives = 20/38 (52%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E++ K K   + KKK  + KKK + + K K + K   K
Sbjct: 153 EEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAK 190



 Score = 27.5 bits (61), Expect = 4.1
 Identities = 13/45 (28%), Positives = 23/45 (51%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           +    E +K+ +++ K K   + KKK  + KKK + + K K   K
Sbjct: 142 KKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAK 186



 Score = 27.1 bits (60), Expect = 5.5
 Identities = 11/37 (29%), Positives = 23/37 (62%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           E ++K+ ++ K K   + K K + + +KK K++ KK+
Sbjct: 115 EEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQ 151



 Score = 27.1 bits (60), Expect = 6.2
 Identities = 12/41 (29%), Positives = 25/41 (60%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           A + E+K+K+  + K K+  + K K + + +KK K++ K +
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQ 151



 Score = 27.1 bits (60), Expect = 6.7
 Identities = 12/38 (31%), Positives = 18/38 (47%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
             E + K   + K K  + KKK + + K K + K K K
Sbjct: 153 EEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAK 190



 Score = 27.1 bits (60), Expect = 6.9
 Identities = 12/42 (28%), Positives = 21/42 (50%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             + + +E+ K K   + KKK  + KKK + + K K +   K
Sbjct: 147 EAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAK 188



 Score = 26.7 bits (59), Expect = 7.9
 Identities = 12/37 (32%), Positives = 23/37 (62%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           E K K+  + K K + + +KK K++ KK+ +++ K K
Sbjct: 123 EAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAK 159


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 29.2 bits (66), Expect = 0.90
 Identities = 10/29 (34%), Positives = 19/29 (65%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
            +KE+KK  +K +KK  K +K+K +++  
Sbjct: 96  AQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124



 Score = 26.5 bits (59), Expect = 8.9
 Identities = 10/30 (33%), Positives = 20/30 (66%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           K +K++KK   K++KK  K +K+K  +++ 
Sbjct: 95  KAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 29.4 bits (66), Expect = 0.93
 Identities = 12/39 (30%), Positives = 17/39 (43%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
             ALE EK +  K+K   +   K   + K  +K K   K
Sbjct: 13  VRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIK 51



 Score = 27.8 bits (62), Expect = 2.7
 Identities = 14/40 (35%), Positives = 23/40 (57%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
            +  +K K   K +  ++ KK   ++KK++KKK KKKK  
Sbjct: 40  DKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKVP 79



 Score = 27.5 bits (61), Expect = 4.6
 Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 7/48 (14%)

Query: 134 EGEKKEKKKNKNKNKKKKKKK-------KKKKKKKKKKKKKKKKTKKF 174
           E   K   ++K   K K   K       KK   ++KK++KKK K KK 
Sbjct: 31  EIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKV 78


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 29.3 bits (66), Expect = 0.94
 Identities = 10/35 (28%), Positives = 30/35 (85%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           ++ E KK K + +K++++++K++KK+++++++K+K
Sbjct: 185 QEWELKKLKQQQQKREEERRKQRKKQQEEEERKQK 219



 Score = 27.4 bits (61), Expect = 5.1
 Identities = 9/35 (25%), Positives = 29/35 (82%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           +E +  K K +++K++++++K++KK+++++++K K
Sbjct: 185 QEWELKKLKQQQQKREEERRKQRKKQQEEEERKQK 219


>gnl|CDD|241400 cd13246, PH_Scd1, Shape and Conjugation Deficiency 1 Pleckstrin
           homology (PH) domain.  Fission yeast Scd1 is an exchange
           factor for Cdc42 and an effector of Ras1, the homolog of
           the human H-Ras. Scd2/Bem1 mediates Cdc42 activation by
           binding to Scd1/Cdc24 and to Cdc42. Ras1 regulates
           Scd1/Cdc24/Ral1, which is a putative guanine nucleotide
           exchange factor for Cdc42, a member of the Rho family of
           Ras-like proteins. Cdc42 then activates the Shk1/Orb2
           protein kinase. Scd1 interacts with Klp5 and Klp6
           kinesins to mediate cytokinesis. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 148

 Score = 28.8 bits (65), Expect = 0.94
 Identities = 14/26 (53%), Positives = 15/26 (57%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKK 163
           KE KK K K+  K K     KKKKKK
Sbjct: 61  KEVKKKKKKSSLKSKSSSSSKKKKKK 86



 Score = 26.8 bits (60), Expect = 4.7
 Identities = 14/25 (56%), Positives = 14/25 (56%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTK 172
           KKKKKK   K K     KKKKKK  
Sbjct: 64  KKKKKKSSLKSKSSSSSKKKKKKGP 88


>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein.  The PcfJ-like protein family
           includes the E. faecalis PcfJ protein, which is
           functionally uncharacterized. It is found in bacteria
           and viruses, and is typically between 159 and 170 amino
           acids in length. There is a conserved HCV sequence
           motif.
          Length = 169

 Score = 29.2 bits (66), Expect = 0.98
 Identities = 10/32 (31%), Positives = 18/32 (56%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
            K + KK K K +K +K+  + +K  ++ K K
Sbjct: 36  AKYQAKKEKEKAEKMRKRYPELEKIFRELKSK 67



 Score = 27.3 bits (61), Expect = 3.5
 Identities = 9/45 (20%), Positives = 21/45 (46%)

Query: 126 YVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
             P+ +    ++   K    K K+K +K +K+  + +K  ++ K 
Sbjct: 22  VCPKNLKAAHDRAVAKYQAKKEKEKAEKMRKRYPELEKIFRELKS 66


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 29.8 bits (67), Expect = 0.98
 Identities = 10/41 (24%), Positives = 20/41 (48%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
              G      ++      ++ K++ +KK+K KKK +K+K  
Sbjct: 330 EFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370



 Score = 27.8 bits (62), Expect = 3.8
 Identities = 11/40 (27%), Positives = 23/40 (57%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
            +   G   E  ++++  + K++ +KK+K KKK +K+K  
Sbjct: 331 FLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
           region.  This highly conserved region is found towards
           the C-terminus of the transmembrane domain. The function
           is unclear.
          Length = 151

 Score = 28.8 bits (64), Expect = 0.98
 Identities = 20/49 (40%), Positives = 31/49 (63%), Gaps = 1/49 (2%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
           AL  ++KE  K +++   KKKK+K  +KK K KKK++K   K +P  +P
Sbjct: 9   ALAKQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEKPNGK-IPEHEP 56


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 29.5 bits (66), Expect = 1.0
 Identities = 16/48 (33%), Positives = 18/48 (37%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
            KE  K K   K    K  KK  K KK K  K  TKK     + +   
Sbjct: 26  SKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESV 73



 Score = 28.0 bits (62), Expect = 2.7
 Identities = 14/51 (27%), Positives = 18/51 (35%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
           KK  + K    KK    K+  K KK  K   TK     +       VK+  
Sbjct: 10  KKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTT 60



 Score = 27.2 bits (60), Expect = 5.0
 Identities = 14/38 (36%), Positives = 15/38 (39%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           +K    K   K KK  K    K  KK  K KK K  K 
Sbjct: 21  KKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKT 58



 Score = 26.8 bits (59), Expect = 7.0
 Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 1/48 (2%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
           V ++ EK E  K ++  KK  KK+    +  +   K  K T K LP  
Sbjct: 63  VTVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSK-LPKK 109



 Score = 26.8 bits (59), Expect = 7.9
 Identities = 15/38 (39%), Positives = 17/38 (44%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           EKK   K    +K+  K KK  K    K  KK  K KK
Sbjct: 15  EKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKK 52


>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
           protein 2.  THO and TREX form a eukaryotic complex which
           functions in messenger ribonucleoprotein metabolism and
           plays a role in preventing the transcription-associated
           genetic instability. Tho2, along with four other
           subunits forms THO.
          Length = 296

 Score = 29.6 bits (67), Expect = 1.0
 Identities = 16/50 (32%), Positives = 26/50 (52%), Gaps = 4/50 (8%)

Query: 124 NLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           ++YVP+    E   KE ++ + + K+         KKKK+KK+ K   KK
Sbjct: 21  DIYVPK----ELYDKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKK 66


>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16.  Nop16 is a
           protein involved in ribosome biogenesis.
          Length = 173

 Score = 28.9 bits (65), Expect = 1.1
 Identities = 11/30 (36%), Positives = 18/30 (60%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
            +KK KN++   K  +K+ K+K +KKK   
Sbjct: 1   VRKKKKNRSSNYKVNRKRLKRKDRKKKINI 30



 Score = 27.8 bits (62), Expect = 2.5
 Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
           ++KKKN++ N K  +K+ K+K +KKK   
Sbjct: 2   RKKKKNRSSNYKVNRKRLKRKDRKKKINI 30



 Score = 27.8 bits (62), Expect = 2.9
 Identities = 10/29 (34%), Positives = 15/29 (51%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           K KK +    K  +K+ K+K +KKK    
Sbjct: 3   KKKKNRSSNYKVNRKRLKRKDRKKKINII 31



 Score = 27.4 bits (61), Expect = 3.2
 Identities = 11/33 (33%), Positives = 18/33 (54%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
           +KKKK +    K  +K+ K+K + KK     +P
Sbjct: 2   RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGNP 34



 Score = 27.4 bits (61), Expect = 4.1
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           + K KN+    K  +K+ K+K +KKK   
Sbjct: 2   RKKKKNRSSNYKVNRKRLKRKDRKKKINI 30



 Score = 26.6 bits (59), Expect = 5.8
 Identities = 10/30 (33%), Positives = 17/30 (56%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKK 165
            +K+KK   +  K  +K+ K+K +KKK   
Sbjct: 1   VRKKKKNRSSNYKVNRKRLKRKDRKKKINI 30



 Score = 26.2 bits (58), Expect = 9.7
 Identities = 9/29 (31%), Positives = 15/29 (51%)

Query: 144 KNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           + K K +    K  +K+ K+K +KKK   
Sbjct: 2   RKKKKNRSSNYKVNRKRLKRKDRKKKINI 30


>gnl|CDD|188624 cd09904, H3TH_XPG, H3TH domain of Xeroderma pigmentosum
           complementation group G (XPG) nuclease, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease.  The Xeroderma pigmentosum complementation
           group G (XPG) nuclease plays a central role in
           nucleotide excision repair (NER) in cleaving DNA bubble
           structures or loops. XPG is a member of the
           structure-specific, 5' nuclease family that catalyzes
           hydrolysis of DNA duplex-containing nucleic acid
           structures during DNA replication, repair, and
           recombination.  Members of this subgroup include the
           H3TH (helix-3-turn-helix) domains of XPG and other
           similar eukaryotic 5' nucleases. These nucleases contain
           a PIN (PilT N terminus) domain with a helical arch/clamp
           region/I domain (not included here) and inserted within
           the PIN domain is an atypical helix-hairpin-helix-2
           (HhH2)-like region. This atypical HhH2 region, the H3TH
           domain, has an extended loop with at least three turns
           between the first two helices, and only three of the
           four helices appear to be conserved. Both the H3TH
           domain and the helical arch/clamp region are involved in
           DNA binding.  Studies suggest that a glycine-rich loop
           in the H3TH domain contacts the phosphate backbone of
           the template strand in the downstream DNA duplex. These
           nucleases have a carboxylate rich active site that is
           involved in binding essential divalent metal ion
           cofactors (Mg2+ or Mn2+) required for nuclease activity.
           The first metal binding site is composed entirely of
           Asp/Glu residues from the PIN domain, whereas, the
           second metal binding site is composed generally of two
           Asp residues from the PIN domain and one Asp residue
           from the H3TH domain. Together with the helical arch and
           network of amino acids interacting with metal binding
           ions, the H3TH region defines a positively charged
           active-site DNA-binding groove in structure-specific 5'
           nucleases.
          Length = 97

 Score = 28.0 bits (63), Expect = 1.1
 Identities = 10/37 (27%), Positives = 16/37 (43%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
           + +K +     K++ K+K K       LPP  PS   
Sbjct: 50  QPEKSEDSDNDKQEFKRKHKNYLKNLILPPGFPSPAV 86


>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
          Length = 158

 Score = 28.9 bits (65), Expect = 1.1
 Identities = 10/44 (22%), Positives = 17/44 (38%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
              +     E+  N+ K  ++    +  K+ K   K K KK  K
Sbjct: 113 EDTIPNHLNEEVSNETKLTQESSSDESPKEVKLATKNKTKKHDK 156


>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease.  The Slx4 protein is a
           heteromeric structure-specific endonuclease found in
           fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
           substrates, particularly simple-Y, 5'-flap, or
           replication fork structures by cleaving the strand
           bearing the 5' non-homologous arm at the branch junction
           and thus generating ligatable nicked products from
           5'-flap or replication fork substrates.
          Length = 627

 Score = 29.6 bits (66), Expect = 1.2
 Identities = 15/32 (46%), Positives = 18/32 (56%)

Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
            K   KKKK KK K +K+ K+K KK K    F
Sbjct: 68  GKRVPKKKKIKKPKLRKRTKRKNKKIKSLTAF 99



 Score = 28.4 bits (63), Expect = 2.8
 Identities = 14/30 (46%), Positives = 19/30 (63%)

Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           N +  +  KKKK KK K +K+ K+K KK K
Sbjct: 65  NVSGKRVPKKKKIKKPKLRKRTKRKNKKIK 94



 Score = 26.9 bits (59), Expect = 9.7
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKK 167
           K   K KK KK K +K+ K+K KK K
Sbjct: 69  KRVPKKKKIKKPKLRKRTKRKNKKIK 94


>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807).  This is
           a family of conserved fungal proteins of unknown
           function.
          Length = 169

 Score = 28.9 bits (65), Expect = 1.2
 Identities = 13/55 (23%), Positives = 24/55 (43%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
            P   A+ GE    +K   +    +K+K  + K++K  K  +  T+  +  S  S
Sbjct: 86  SPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKDPRGGTQDVVDKSQAS 140



 Score = 27.8 bits (62), Expect = 2.6
 Identities = 9/48 (18%), Positives = 19/48 (39%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
             ++KE ++     K+K  + K++K  K  +   +    K     D  
Sbjct: 97  SADRKEAEQQGAAQKRKSCRDKERKSAKDPRGGTQDVVDKSQASLDYG 144


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 29.5 bits (66), Expect = 1.2
 Identities = 9/36 (25%), Positives = 19/36 (52%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K+K K K  N K+ K +    + + ++   K++ +K
Sbjct: 798 KQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEK 833


>gnl|CDD|235516 PRK05582, PRK05582, DNA topoisomerase I; Validated.
          Length = 650

 Score = 29.4 bits (67), Expect = 1.2
 Identities = 16/42 (38%), Positives = 23/42 (54%), Gaps = 1/42 (2%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKK-KKKKKKKKTKKFLPP 177
             + KK + KN++  K+   + KKK  K  K KKK +K  PP
Sbjct: 210 GYKGKKIELKNEEDVKEILAELKKKDFKVSKVKKKERKRNPP 251



 Score = 27.5 bits (62), Expect = 4.6
 Identities = 17/43 (39%), Positives = 24/43 (55%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
           G K +K + KN+   K+   + KKK  K  K KKK+ K+  PP
Sbjct: 210 GYKGKKIELKNEEDVKEILAELKKKDFKVSKVKKKERKRNPPP 252


>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
           II/NMEMI.  CRISPR loci appear to be mobile elements with
           a wide host range. This model represents a protein found
           only in CRISPR-containing species, near other
           CRISPR-associated proteins (cas), as part of the NMENI
           subtype of CRISPR/Cas locus. The species range so far
           for this protein is animal pathogens and commensals only
           [Mobile and extrachromosomal element functions, Other].
          Length = 805

 Score = 29.3 bits (66), Expect = 1.2
 Identities = 17/62 (27%), Positives = 30/62 (48%), Gaps = 1/62 (1%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
           P  + +E  ++E+  N  K   K++ KK + K K+      K+  K   P++ S K  +K
Sbjct: 515 PDKIVIEMAREEQGTNFGKRNSKERYKKNEDKIKEFASALGKEILK-EEPTENSSKNILK 573

Query: 188 SR 189
            R
Sbjct: 574 LR 575


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this family
           (Kri1p) is found to be required for 40S ribosome
           biogenesis in the nucleolus.
          Length = 99

 Score = 28.0 bits (63), Expect = 1.2
 Identities = 11/36 (30%), Positives = 26/36 (72%), Gaps = 1/36 (2%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           KE+K+ + K +++++ K+ K  K+++ ++K +K KK
Sbjct: 1   KERKE-EEKAQREEELKRLKNLKREEIEEKLEKIKK 35



 Score = 28.0 bits (63), Expect = 1.3
 Identities = 10/33 (30%), Positives = 21/33 (63%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
            K+E+K  + +  K+ K  K+++ ++K +K KK
Sbjct: 3   RKEEEKAQREEELKRLKNLKREEIEEKLEKIKK 35



 Score = 27.6 bits (62), Expect = 1.8
 Identities = 11/34 (32%), Positives = 24/34 (70%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           E+KE++K + + + K+ K  K+++ ++K +K KK
Sbjct: 2   ERKEEEKAQREEELKRLKNLKREEIEEKLEKIKK 35



 Score = 27.2 bits (61), Expect = 2.4
 Identities = 9/31 (29%), Positives = 20/31 (64%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
           E EK ++++   + K  K+++ ++K +K KK
Sbjct: 5   EEEKAQREEELKRLKNLKREEIEEKLEKIKK 35



 Score = 25.7 bits (57), Expect = 7.6
 Identities = 9/34 (26%), Positives = 23/34 (67%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           K+ K++ K + +++ K+ K  K+++ ++K +K K
Sbjct: 1   KERKEEEKAQREEELKRLKNLKREEIEEKLEKIK 34


>gnl|CDD|218071 pfam04409, DUF530, Protein of unknown function (DUF530).  Family of
           hypothetical archaeal proteins.
          Length = 512

 Score = 29.5 bits (66), Expect = 1.2
 Identities = 22/80 (27%), Positives = 32/80 (40%), Gaps = 5/80 (6%)

Query: 96  LQLPRGRLVFLEKLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKK 155
           L +   RL      L  +S+  E  +   L +        E  +K      ++    KK 
Sbjct: 433 LHIKINRLALGGVALYGISDWDEVEVAEILGID-----IEELLKKMAKIEPSEILLFKKV 487

Query: 156 KKKKKKKKKKKKKKKTKKFL 175
           +KKK +K  K K  K KKFL
Sbjct: 488 EKKKFEKFGKPKSDKAKKFL 507


>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2.  This
           family consists of several Eukaryotic rRNA processing
           protein EBP2 sequences. Ebp2p is required for the
           maturation of 25S rRNA and 60S subunit assembly. Ebp2p
           may be one of the target proteins of Rrs1p for executing
           the signal to regulate ribosome biogenesis. This family
           also plays a role in chromosome segregation.
          Length = 270

 Score = 29.2 bits (66), Expect = 1.3
 Identities = 11/32 (34%), Positives = 15/32 (46%)

Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           +  K   KKK +K    + K   K+K K  KF
Sbjct: 192 DAAKGGSKKKGRKGGAARGKPNAKRKAKDAKF 223



 Score = 28.5 bits (64), Expect = 2.3
 Identities = 11/51 (21%), Positives = 18/51 (35%), Gaps = 8/51 (15%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKK--------KKKKKKKKTKK 173
             ++   K   K K +     + K   K+K K        KK+  K  T +
Sbjct: 188 FLVDDAAKGGSKKKGRKGGAARGKPNAKRKAKDAKFGFGGKKRGSKSNTAE 238


>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 28.8 bits (65), Expect = 1.3
 Identities = 15/38 (39%), Positives = 20/38 (52%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           K E KK   K K  + KK K+  K  +KK+KK   K+ 
Sbjct: 130 KSEIKKLVLKKKFDELKKSKQLDKALEKKRKKNAGKEK 167


>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160.  This
           family consists of several special lobe-specific silk
           protein SSP160 sequences which appear to be specific to
           Chironomus (Midge) species.
          Length = 758

 Score = 29.4 bits (65), Expect = 1.3
 Identities = 18/57 (31%), Positives = 30/57 (52%), Gaps = 5/57 (8%)

Query: 132 ALEGEKKEKKKNK-NKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
           A  GEKK K+K +  K + K ++    + K   KK+ + + KKF      + K+GV+
Sbjct: 378 AKSGEKKVKRKARLEKMRAKCRRAVGNRMKGSMKKRVRSRAKKF----GEAAKSGVR 430


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 29.0 bits (65), Expect = 1.3
 Identities = 16/38 (42%), Positives = 22/38 (57%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKK 165
            R    +  KKE +   + N++ K+KKKKK KKKK  K
Sbjct: 67  ERKKGYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPK 104



 Score = 28.3 bits (63), Expect = 1.9
 Identities = 18/56 (32%), Positives = 25/56 (44%), Gaps = 5/56 (8%)

Query: 132 ALEGEKKEKKKNK-----NKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSG 182
             +G +  KK+ +     N+  K+KKKKK KKKK  K       T +   P D   
Sbjct: 68  RKKGYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPKSLLLNFVTVEESKPDDEVH 123


>gnl|CDD|233294 TIGR01159, DRP1, density-regulated protein DRP1.  This protein
           family shows weak but suggestive similarity to
           translation initiation factor SUI1 and its prokaryotic
           homologs.
          Length = 173

 Score = 28.7 bits (64), Expect = 1.3
 Identities = 17/64 (26%), Positives = 27/64 (42%), Gaps = 1/64 (1%)

Query: 121 LYLNLYVPRPVALEGEK-KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
           LY  LY       E E     +  + K +K     +K ++ ++  KKKKK  +K     +
Sbjct: 37  LYAKLYGTDSPTAEAEAVTIGEAQEEKGEKDLLGIQKAQEAREGGKKKKKLPQKVTIKRE 96

Query: 180 PSGK 183
           P  K
Sbjct: 97  PRTK 100


>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1.  Rec10 / Red1 is involved in meiotic
           recombination and chromosome segregation during
           homologous chromosome formation. This protein localises
           to the synaptonemal complex in S. cerevisiae and the
           analogous structures (linear elements) in S. pombe. This
           family is currently only found in fungi.
          Length = 706

 Score = 29.5 bits (66), Expect = 1.4
 Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 14/58 (24%)

Query: 140 KKKNKNKNKKKKKKKKK--------------KKKKKKKKKKKKKKTKKFLPPSDPSGK 183
           K    +K K  K +KK+              K+ +KK  K K+ K K F P  + S K
Sbjct: 429 KVVKVSKYKGNKSEKKRDINVLDTIFASPVSKELRKKVGKSKQTKLKNFKPVPNKSKK 486


>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
           SseC is a secreted protein that forms a complex together
           with SecB and SecD on the surface of Salmonella. All
           these proteins are secreted by the type III secretion
           system. Many mucosal pathogens use type III secretion
           systems for the injection of effector proteins into
           target cells. SecB, SseC and SecD are inserted into the
           target cell membrane. where they form a small pore or
           translocon. In addition to SseC, this family includes
           the bacterial secreted proteins PopB, PepB, YopB and
           EspD which are thought to be directly involved in pore
           formation, and type III secretion system translocon.
          Length = 303

 Score = 29.0 bits (65), Expect = 1.4
 Identities = 11/49 (22%), Positives = 25/49 (51%), Gaps = 5/49 (10%)

Query: 131 VALEGEKKEKKKNKN-----KNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
             L  ++ + K  +      K +KK ++ +++ KK  +K ++ KK+  F
Sbjct: 10  SKLAEKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKKSGIF 58


>gnl|CDD|221569 pfam12420, DUF3671, Protein of unknown function.  This domain
           family is found in eukaryotes, and is typically between
           96 and 116 amino acids in length.
          Length = 106

 Score = 28.0 bits (63), Expect = 1.4
 Identities = 16/41 (39%), Positives = 18/41 (43%), Gaps = 1/41 (2%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKK-KKKKKKKKTKKFLPPS 178
           EKK     +   K KK     KKK KK   KK    F+ PS
Sbjct: 13  EKKIFDKIDYIDKLKKDPNIDKKKFKKIIFKKYGLIFILPS 53


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 28.9 bits (65), Expect = 1.6
 Identities = 11/37 (29%), Positives = 23/37 (62%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K E +K  + +  + ++ KKK++ + KKKK+ ++ K 
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKA 109



 Score = 28.2 bits (63), Expect = 2.3
 Identities = 10/40 (25%), Positives = 20/40 (50%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           P   +       + +   KK++ + KKKK+ ++ K  +KK
Sbjct: 74  PEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113



 Score = 27.8 bits (62), Expect = 3.2
 Identities = 16/70 (22%), Positives = 29/70 (41%), Gaps = 2/70 (2%)

Query: 124 NLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK--KKKKKKTKKFLPPSDPS 181
                     E EK+    +    + KKK++ + KKKK+ +  K  +KK  +++     S
Sbjct: 65  VKDFLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLS 124

Query: 182 GKAGVKSREQ 191
                K  E+
Sbjct: 125 SSLQTKLTEE 134


>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
           receptor (72 kDa mitochondrial outermembrane protein)
           (mitochondrial import receptor for the ADP/ATP carrier)
           (translocase of outermembrane tom70).  [Transport and
           binding proteins, Amino acids, peptides and amines].
          Length = 615

 Score = 29.2 bits (65), Expect = 1.7
 Identities = 14/53 (26%), Positives = 33/53 (62%), Gaps = 1/53 (1%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
            +G+++ + K K  +KK+++K+K+ +K+ + K ++KK T     P +P+ +  
Sbjct: 57  RKGQQQRESKPK-ISKKERRKRKQAEKETEGKTEEKKSTAPKNAPVEPADELP 108


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 28.9 bits (64), Expect = 1.7
 Identities = 9/42 (21%), Positives = 26/42 (61%)

Query: 132  ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
              + E+KE+ ++  + K+++  KK+K+++++ +K      K+
Sbjct: 1026 IKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKE 1067



 Score = 27.8 bits (61), Expect = 4.2
 Identities = 10/41 (24%), Positives = 24/41 (58%)

Query: 133  LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            +E  ++ K++   K +K+++++ +K      K+  KK+ KK
Sbjct: 1035 MESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKK 1075



 Score = 27.8 bits (61), Expect = 4.3
 Identities = 11/37 (29%), Positives = 20/37 (54%)

Query: 134  EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
              E++  KK K + ++ +K      K+  KK+ KKK+
Sbjct: 1041 AKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 28.7 bits (64), Expect = 1.7
 Identities = 9/35 (25%), Positives = 23/35 (65%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
            ++  K+  K K K++ +  +K K+ ++KK+++K+
Sbjct: 156 DDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKR 190



 Score = 27.5 bits (61), Expect = 4.1
 Identities = 11/44 (25%), Positives = 26/44 (59%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
             R ++ +    E +K   K K+K++ +  +K K+ ++KK+++K
Sbjct: 146 ECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEK 189



 Score = 27.5 bits (61), Expect = 4.2
 Identities = 10/35 (28%), Positives = 26/35 (74%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           G++KEK++ ++  K K+ ++KK+++K++  + K+ 
Sbjct: 164 GKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRP 198



 Score = 26.8 bits (59), Expect = 7.4
 Identities = 12/42 (28%), Positives = 27/42 (64%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           AL  +K + +  K   K+K+K++ +  +K K+ ++KK++ K+
Sbjct: 149 ALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKR 190


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast) and
           GNL3L (human) are putative GTPases which are required
           for growth and play a role in processing of nucleolar
           pre-rRNA. This family contains a potential nuclear
           localisation signal.
          Length = 80

 Score = 27.2 bits (61), Expect = 1.7
 Identities = 11/32 (34%), Positives = 16/32 (50%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
            + K + K     +K +K+ KK    K KKKK
Sbjct: 2   LRYKIEKKVAEHHRKLRKEAKKNPTWKSKKKK 33


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 28.4 bits (64), Expect = 1.8
 Identities = 8/38 (21%), Positives = 24/38 (63%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E++E++K   + +++  +  K + ++ + K+K K+ +K
Sbjct: 123 EREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQK 160



 Score = 27.3 bits (61), Expect = 4.4
 Identities = 9/36 (25%), Positives = 22/36 (61%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
           LE E++EK+  + + +  +  K + ++ + K+K K+
Sbjct: 122 LEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKE 157


>gnl|CDD|221550 pfam12366, Casc1, Cancer susceptibility candidate 1.  This domain
           family is found in eukaryotes, and is typically between
           216 and 263 amino acids in length. Casc1 has many SNPs
           associated with cancer susceptibility.
          Length = 227

 Score = 28.5 bits (64), Expect = 1.8
 Identities = 9/40 (22%), Positives = 14/40 (35%), Gaps = 2/40 (5%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
            K   + +  +       K+ K K  K  +  K  P  DP
Sbjct: 69  SKIAEEAEGARGYVTDIPKEYKAKPVKYLEIPK--PIKDP 106



 Score = 27.7 bits (62), Expect = 3.5
 Identities = 9/39 (23%), Positives = 13/39 (33%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
           E+ +          K+ K K  K  +  K  K    LP 
Sbjct: 73  EEAEGARGYVTDIPKEYKAKPVKYLEIPKPIKDPPELPE 111


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 28.9 bits (65), Expect = 2.0
 Identities = 14/45 (31%), Positives = 23/45 (51%), Gaps = 6/45 (13%)

Query: 136 EKKEKKKNKNKNKK------KKKKKKKKKKKKKKKKKKKKKTKKF 174
           E  +    K + KK      ++  ++   K KKKK+KKK+  +KF
Sbjct: 397 ESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEKF 441



 Score = 27.0 bits (60), Expect = 8.2
 Identities = 11/34 (32%), Positives = 18/34 (52%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKK 161
               A+E  ++E  +      KKKK+KKK+  +K
Sbjct: 407 EGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440


>gnl|CDD|216213 pfam00956, NAP, Nucleosome assembly protein (NAP).  NAP proteins
           are involved in moving histones into the nucleus,
           nucleosome assembly and chromatin fluidity. They affect
           the transcription of many genes.
          Length = 236

 Score = 28.5 bits (64), Expect = 2.0
 Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
           K+ KN   K  KKK++ KK  + +   K
Sbjct: 152 KEGKNLTVKTVKKKQRNKKTGQTRTITK 179


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation of
           60S ribosomal subunits.
          Length = 132

 Score = 27.7 bits (62), Expect = 2.0
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query: 144 KNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           ++K   K KKK K +K +KK K++ +  K+
Sbjct: 26  RSKKLLKAKKKLKSEKLEKKAKRQLRAEKR 55



 Score = 27.3 bits (61), Expect = 3.0
 Identities = 11/33 (33%), Positives = 20/33 (60%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           KK     KK K +K +KK K++ + +K++  +K
Sbjct: 28  KKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEK 60



 Score = 26.9 bits (60), Expect = 3.9
 Identities = 15/51 (29%), Positives = 28/51 (54%)

Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
           P+    +K  K K K K++K +KK K++ + +K++  +K + K  LP    
Sbjct: 22  PILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDLE 72


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 28.8 bits (65), Expect = 2.0
 Identities = 16/62 (25%), Positives = 33/62 (53%), Gaps = 7/62 (11%)

Query: 111 PLLSNQSEDCLYLNLYVPRPVALE-------GEKKEKKKNKNKNKKKKKKKKKKKKKKKK 163
           PL  ++ E  L+ +  +   + ++        +KKE K      +K KK++KK+K K+++
Sbjct: 617 PLFLDRIEKTLFFSFDISELMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREE 676

Query: 164 KK 165
           K+
Sbjct: 677 KE 678


>gnl|CDD|176148 cd08459, PBP2_DntR_NahR_LinR_like, The C-terminal substrate
          binding domain of LysR-type transcriptional regulators
          that are involved in the catabolism of dinitrotoluene,
          naphthalene and gamma-hexachlorohexane; contains the
          type 2 periplasmic binding fold.  This CD includes
          LysR-like bacterial transcriptional regulators, DntR,
          NahR, and LinR, which are involved in the degradation
          of aromatic compounds. The transcription of the genes
          encoding enzymes involved in such degradation is
          regulated and expression of these enzymes is enhanced
          by inducers, which are either an intermediate in the
          metabolic pathway or compounds to be degraded.  DntR
          from Burkholderia species controls genes encoding
          enzymes for oxidative degradation of the nitro-aromatic
          compound 2,4-dinitrotoluene. The active form of DntR is
          homotetrameric, consisting of a dimer of dimers. NahR
          is a salicylate-dependent transcription activator of
          the nah and sal operons for naphthalene degradation.
          Salicylic acid is an intermediate of the oxidative
          degradation of the aromatic ring in soil bacteria.
          LinR positively regulates expression of the genes (linD
          and linE) encoding enzymes for
          gamma-hexachlorocyclohexane (a haloorganic insecticide)
          degradation. Expression of linD and linE are induced by
          their substrates, 2,5-dichlorohydroquinone (2,5-DCHQ)
          and chlorohydroquinone (CHQ). The structural topology
          of this substrate-binding domain is most similar to
          that of the type 2 periplasmic binding proteins (PBP2),
          which are responsible for the uptake of a variety of
          substrates such as phosphate, sulfate, polysaccharides,
          lysine/arginine/ornithine, and histidine. The PBP2 bind
          their ligand in the cleft between these domains in a
          manner resembling a Venus flytrap. After binding their
          specific ligand with high affinity, they can interact
          with a cognate membrane transport complex comprised of
          two integral membrane domains and two cytoplasmically
          located ATPase domains. This interaction triggers the
          ligand translocation across the cytoplasmic membrane
          energized by ATP hydrolysis.
          Length = 201

 Score = 28.3 bits (64), Expect = 2.3
 Identities = 9/43 (20%), Positives = 17/43 (39%), Gaps = 2/43 (4%)

Query: 59 YMPPVTPSTWRAPRFADTYSAVCPQRLPDIGNR--TEALLQLP 99
          Y+P +    ++   F + Y  +  +  P IG+    E  L   
Sbjct: 56 YLPDLGAGFFQQRLFRERYVCLVRKDHPRIGSTLTLEQFLAAR 98


>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
           protein; Provisional.
          Length = 438

 Score = 28.6 bits (64), Expect = 2.3
 Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 6/47 (12%)

Query: 143 NKNKNKKKKKKKKKKKK------KKKKKKKKKKKTKKFLPPSDPSGK 183
           N  K+K KKK+KKKK+        K+KKK+K       LP   P+ K
Sbjct: 9   NNQKDKNKKKEKKKKESPPPPHEIKRKKKRKGPDAASKLPKVTPNTK 55


>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria and eukaryotes.
           Proteins in this family are typically between 137 to 196
           amino acids in length.
          Length = 149

 Score = 27.7 bits (62), Expect = 2.3
 Identities = 11/39 (28%), Positives = 16/39 (41%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
             P   +  +K+  K   K    K K K+KK K K+   
Sbjct: 7   RLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAAS 45



 Score = 26.5 bits (59), Expect = 5.7
 Identities = 11/36 (30%), Positives = 18/36 (50%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           K+ +KK     +K    K K K+KK K K+   ++ 
Sbjct: 13  KRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASESG 48



 Score = 26.5 bits (59), Expect = 6.5
 Identities = 12/48 (25%), Positives = 16/48 (33%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
               +K      K+ +KK  K  +K    K K K KK       S   
Sbjct: 1   MASSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASESG 48



 Score = 26.1 bits (58), Expect = 7.6
 Identities = 12/35 (34%), Positives = 16/35 (45%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           E K  +K   K  +K    K K K+KK K K+   
Sbjct: 11  EPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAAS 45


>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB. Topo IB proteins include the
           monomeric yeast and human topo I and heterodimeric topo
           I from Leishmania donvanni. Topo I enzymes are divided
           into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topos I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I have putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 217

 Score = 28.3 bits (63), Expect = 2.3
 Identities = 14/25 (56%), Positives = 21/25 (84%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKK 160
           E+KEKKKN NK +K+ KKK++ K++
Sbjct: 95  EEKEKKKNLNKEEKEAKKKERAKRE 119



 Score = 26.4 bits (58), Expect = 9.0
 Identities = 12/25 (48%), Positives = 21/25 (84%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKK 162
           +EK+K KN NK++K+ KKK++ K++
Sbjct: 95  EEKEKKKNLNKEEKEAKKKERAKRE 119


>gnl|CDD|219401 pfam07404, TEBP_beta, Telomere-binding protein beta subunit (TEBP
           beta).  This family consists of several telomere-binding
           protein beta subunits which appear to be specific to the
           family Oxytrichidae. Telomeres are specialised
           protein-DNA complexes that compose the ends of
           eukaryotic chromosomes. Telomeres protect chromosome
           termini from degradation and recombination and act
           together with telomerase to ensure complete genome
           replication. TEBP beta forms a complex with TEBP alpha
           and this complex is able to recognise and bind ssDNA to
           form a sequence-specific, telomeric nucleoprotein
           complex that caps the very 3' ends of chromosomes.
          Length = 375

 Score = 28.5 bits (63), Expect = 2.3
 Identities = 17/52 (32%), Positives = 26/52 (50%), Gaps = 1/52 (1%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS-DPSGKAGVKSREQ 191
           K   NK   +K+  +K +  K   KK    +KK +P +  PSGK   K+ +Q
Sbjct: 283 KYTPNKPSSRKETPQKSQAGKSSAKKTTTGSKKAVPANPSPSGKKSTKTTDQ 334


>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain.  This family
           contains a C-terminal presumed domain in Sas10 which
           hash been identified as a regulator of chromatin
           silencing.
          Length = 76

 Score = 26.8 bits (60), Expect = 2.3
 Identities = 11/34 (32%), Positives = 21/34 (61%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           +K KK  +N   KK+KK +K  K+ K +++  ++
Sbjct: 21  RKRKKDRRNPRVKKRKKYEKAVKRLKSQRRVVRE 54



 Score = 26.1 bits (58), Expect = 4.8
 Identities = 10/34 (29%), Positives = 21/34 (61%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K+K   +N + KK+KK +K  K+ K +++   ++
Sbjct: 22  KRKKDRRNPRVKKRKKYEKAVKRLKSQRRVVREE 55


>gnl|CDD|216249 pfam01025, GrpE, GrpE. 
          Length = 165

 Score = 27.9 bits (63), Expect = 2.3
 Identities = 11/53 (20%), Positives = 28/53 (52%), Gaps = 6/53 (11%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT------KKFLPPSD 179
           LE  ++E ++ K++  +   + +  +K+ ++++++ KK       K  LP  D
Sbjct: 20  LEELEEEIEELKDRLLRLLAEFENYRKRTEREREEAKKYAIEKFAKDLLPVLD 72


>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
          Length = 202

 Score = 28.1 bits (63), Expect = 2.4
 Identities = 13/38 (34%), Positives = 19/38 (50%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E+K K++     KKK K +K  K+ +KK   K  K   
Sbjct: 156 EEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIV 193



 Score = 27.7 bits (62), Expect = 3.6
 Identities = 16/45 (35%), Positives = 22/45 (48%), Gaps = 3/45 (6%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKK---KKKKKKKKKTKKF 174
           LE ++KE+     K K K +K  K+ +KK   K  K      KKF
Sbjct: 155 LEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKKF 199


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 28.1 bits (63), Expect = 2.4
 Identities = 7/35 (20%), Positives = 18/35 (51%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           + K  ++   +  +     K+++  + +KK+ KKK
Sbjct: 161 DAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195



 Score = 26.5 bits (59), Expect = 6.7
 Identities = 8/30 (26%), Positives = 16/30 (53%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKK 163
           E   +E +      +++  + +KK+ KKKK
Sbjct: 167 EFAAEEAEAAAAAKEEESAEGEKKESKKKK 196


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 28.5 bits (63), Expect = 2.4
 Identities = 11/43 (25%), Positives = 22/43 (51%), Gaps = 7/43 (16%)

Query: 134 EGEKKEKKKNKNKNKKK-------KKKKKKKKKKKKKKKKKKK 169
             +K+ KK  +   +K        +++K +KK +K++K KK  
Sbjct: 105 ALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMD 147



 Score = 27.7 bits (61), Expect = 5.1
 Identities = 9/58 (15%), Positives = 21/58 (36%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
           A++ E  E      K   +  K+  KK   +  +  ++ ++  +       K   K+ 
Sbjct: 57  AVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTP 114



 Score = 27.3 bits (60), Expect = 6.4
 Identities = 10/40 (25%), Positives = 22/40 (55%), Gaps = 9/40 (22%)

Query: 143 NKNKNKKKKKKKKKK---------KKKKKKKKKKKKKTKK 173
              ++KK  ++ ++K         ++K +KK +K++K KK
Sbjct: 106 LDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKK 145


>gnl|CDD|202558 pfam03159, XRN_N, XRN 5'-3' exonuclease N-terminus.  This family
           aligns residues towards the N-terminus of several
           proteins with multiple functions. The members of this
           family all appear to possess 5'-3' exonuclease activity
           EC:3.1.11.-. Thus, the aligned region may be necessary
           for 5' to 3' exonuclease function. The family also
           contains several Xrn1 and Xrn2 proteins. The 5'-3'
           exoribonucleases Xrn1p and Xrn2p/Rat1p function in the
           degradation and processing of several classes of RNA in
           Saccharomyces cerevisiae. Xrn1p is the main enzyme
           catalyzing cytoplasmic mRNA degradation in multiple
           decay pathways, whereas Xrn2p/Rat1p functions in the
           processing of rRNAs and small nucleolar RNAs (snoRNAs)
           in the nucleus.
          Length = 237

 Score = 28.1 bits (63), Expect = 2.4
 Identities = 8/33 (24%), Positives = 20/33 (60%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
           A + ++KE +  +N+ + + +  K  +K +K+K
Sbjct: 104 AKDAKEKEAEAEENREELETEGIKLPEKVEKEK 136


>gnl|CDD|218078 pfam04419, 4F5, 4F5 protein family.  Members of this family are
           short proteins that are rich in aspartate, glutamate,
           lysine and arginine. Although the function of these
           proteins is unknown, they are found to be ubiquitously
           expressed.
          Length = 38

 Score = 25.6 bits (57), Expect = 2.5
 Identities = 12/34 (35%), Positives = 18/34 (52%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
           G ++E+ + +N  K  KKKKK  K + K   K  
Sbjct: 4   GNQREQARERNAKKAAKKKKKGAKSQLKAAAKAL 37



 Score = 24.1 bits (53), Expect = 8.6
 Identities = 10/30 (33%), Positives = 17/30 (56%)

Query: 145 NKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
             N++++ +++  KK  KKKKK  K   K 
Sbjct: 3   RGNQREQARERNAKKAAKKKKKGAKSQLKA 32


>gnl|CDD|240132 cd04862, PaeLigD_Pol_like, PaeLigD_Pol_like: Polymerase (Pol)
           domain of bacterial LigD proteins similar to Pseudomonas
           aeruginosa (Pae) LigD. The LigD Pol domain belongs to
           the archaeal/eukaryal primase (AEP) superfamily. In
           prokaryotes, LigD along with Ku is required for
           non-homologous end joining (NHEJ)-mediated repair of DNA
           double-strand breaks (DSB). NHEJ-mediated DNA DSB repair
           is error-prone. PaeLigD is monomeric, containing an
           N-terminal phosphoesterase module, a central polymerase
           (Pol) domain, and a C-terminal ATP-dependent ligase
           domain. It has been suggested that LigD Pol contributes
           to NHEJ-mediated DNA DSB repair in vivo, by filling in
           short 5'-overhangs with ribonucleotides; the filled in
           termini would then be sealed by the associated LigD
           ligase domain, resulting in short stretches of RNA
           incorporated into the genomic DNA. The PaeLigD Pol
           domain in vitro, in a manganese-dependent fashion,
           catalyzes templated extensions of 5'-overhang duplex
           DNA, and nontemplated single-nucleotide additions to
           blunt-end duplex DNA; it preferentially adds single
           ribonucleotides at blunt DNA ends. PaeLigD Pol adds a
           correctly paired rNTP to the DNA primer termini more
           rapidly than it does a correctly paired dNTP; it has
           higher infidelity as an RNA polymerase than it does as a
           DNA polymerase, which is in keeping with the mutagenic
           property of NHEJ-mediated DNA DSB repair.
          Length = 227

 Score = 28.0 bits (63), Expect = 2.5
 Identities = 11/29 (37%), Positives = 16/29 (55%)

Query: 74  ADTYSAVCPQRLPDIGNRTEALLQLPRGR 102
           A  Y+AV P  LP +  R  +L++ P G 
Sbjct: 5   ARYYAAVAPWMLPHLAGRPLSLVRCPDGI 33


>gnl|CDD|133326 cd04126, Rab20, Rab GTPase family 20 (Rab20).  Rab20 is one of
           several Rab proteins that appear to be restricted in
           expression to the apical domain of murine polarized
           epithelial cells. It is expressed on the apical side of
           polarized kidney tubule and intestinal epithelial cells,
           and in non-polarized cells. It also localizes to
           vesico-tubular structures below the apical brush border
           of renal proximal tubule cells and in the apical region
           of duodenal epithelial cells. Rab20 has also been shown
           to colocalize with vacuolar H+-ATPases (V-ATPases) in
           mouse kidney cells, suggesting a role in the regulation
           of V-ATPase traffic in specific portions of the nephron.
           It was also shown to be one of several proteins whose
           expression is upregulated in human myelodysplastic
           syndrome (MDS) patients. GTPase activating proteins
           (GAPs) interact with GTP-bound Rab and accelerate the
           hydrolysis of GTP to GDP. Guanine nucleotide exchange
           factors (GEFs) interact with GDP-bound Rabs to promote
           the formation of the GTP-bound state. Rabs are further
           regulated by guanine nucleotide dissociation inhibitors
           (GDIs), which facilitate Rab recycling by masking
           C-terminal lipid binding and promoting cytosolic
           localization. Most Rab GTPases contain a lipid
           modification site at the C-terminus, with sequence
           motifs CC, CXC, or CCX. Lipid binding is essential for
           membrane attachment, a key feature of most Rab proteins.
          Length = 220

 Score = 27.9 bits (62), Expect = 2.6
 Identities = 18/89 (20%), Positives = 33/89 (37%), Gaps = 11/89 (12%)

Query: 107 EKLLPLLSNQSEDCLY------LNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKK 160
           ++ L L    +EDCL+      L+L     +A + EK    +   +++++   +  K   
Sbjct: 88  DRFLGLTDTANEDCLFAVVGNKLDLTEEGALAGQ-EKDAGDRVSPEDQRQVTLEDAKAFY 146

Query: 161 KKKKKKKKKKTKKFLPPSDP----SGKAG 185
           K+  K K                 S K G
Sbjct: 147 KRINKYKMLDEDLSPAAEKMCFETSAKTG 175


>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
           binding.  This is the middle region of a family of TATA
           element modulatory factor 1 proteins conserved in
           eukaryotes that contains at its N-terminal section a
           number of leucine zippers that could potentially form
           coiled coil structures. The whole proteins bind to the
           TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells.
          Length = 74

 Score = 26.5 bits (59), Expect = 2.6
 Identities = 17/40 (42%), Positives = 24/40 (60%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           EGEK  KK+ K+ N  KK + K K+ +K+  + KKK  K 
Sbjct: 20  EGEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKL 59


>gnl|CDD|223624 COG0550, TopA, Topoisomerase IA [DNA replication, recombination,
           and repair].
          Length = 570

 Score = 28.3 bits (64), Expect = 2.6
 Identities = 14/45 (31%), Positives = 24/45 (53%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
            +EG+K+ + K+K++ ++   K K K  K    +KK KK     P
Sbjct: 215 EIEGKKEGRLKDKDEAEEIVNKLKGKPAKVVSVEKKPKKRSPPPP 259


>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
          Length = 132

 Score = 27.3 bits (61), Expect = 2.6
 Identities = 13/33 (39%), Positives = 19/33 (57%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           KK K   K +K+ K ++KK + K+K K    KK
Sbjct: 100 KKKKAGRKARKELKNRRKKVRGKEKTKVSGKKK 132



 Score = 26.9 bits (60), Expect = 3.8
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKK 163
           +K +K+ KN+ KK + K+K K   KKK
Sbjct: 106 RKARKELKNRRKKVRGKEKTKVSGKKK 132



 Score = 26.5 bits (59), Expect = 6.1
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           KKK   K +K+ K ++KK + K+K K   KK
Sbjct: 101 KKKAGRKARKELKNRRKKVRGKEKTKVSGKK 131


>gnl|CDD|222658 pfam14296, O-ag_pol_Wzy, O-antigen polysaccharide polymerase Wzy.
           This family includes O-antigen polysaccharide
           polymerases. These enzymes link O-units via a glycosidic
           linkage to form a long O-antigen. These enzymes vary in
           specificity and sequence.
          Length = 440

 Score = 28.4 bits (64), Expect = 2.7
 Identities = 10/30 (33%), Positives = 18/30 (60%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           + KK+KK+KKK + + ++  K  +K    L
Sbjct: 103 RKKKRKKEKKKNQLEYERLLKNLRKLSLIL 132



 Score = 27.3 bits (61), Expect = 6.5
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKK 160
           G KK++KK K KN+ + ++  K  +K
Sbjct: 102 GRKKKRKKEKKKNQLEYERLLKNLRK 127


>gnl|CDD|163426 TIGR03714, secA2, accessory Sec system translocase SecA2.  Members
           of this protein family are homologous to SecA and part
           of the accessory Sec system. This system, including both
           five core proteins for export and a variable number of
           proteins for glycosylation, operates in certain
           Gram-positive pathogens for the maturation and delivery
           of serine-rich glycoproteins such as the cell surface
           glycoprotein GspB in Streptococcus gordonii [Protein
           fate, Protein and peptide secretion and trafficking].
          Length = 762

 Score = 28.5 bits (64), Expect = 2.7
 Identities = 15/53 (28%), Positives = 19/53 (35%), Gaps = 2/53 (3%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKK--KKKKKTKKFLPPSDPSGKAGVKSREQ 191
           K+      KK  KK   K  K K     K++  K        S   G  +REQ
Sbjct: 542 KRWSPSWLKKYYKKYSVKDSKLKPSALFKRRFRKIVEKAQRASEDKGESAREQ 594


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 28.2 bits (63), Expect = 2.7
 Identities = 9/37 (24%), Positives = 12/37 (32%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
             KK N ++  K    K   K        +    KKF
Sbjct: 42  DSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKF 78


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 27.5 bits (62), Expect = 2.8
 Identities = 12/34 (35%), Positives = 22/34 (64%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           +K+KKK K K  +  ++KKKK    +++K+ +K 
Sbjct: 83  RKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116



 Score = 27.1 bits (61), Expect = 3.5
 Identities = 12/33 (36%), Positives = 21/33 (63%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
           +KK+KKK K     ++KKKK    +++K+ +K 
Sbjct: 84  KKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116



 Score = 27.1 bits (61), Expect = 3.8
 Identities = 12/35 (34%), Positives = 23/35 (65%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
           K++KKK K K + +  ++KKKK    +++K+ +K 
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116



 Score = 27.1 bits (61), Expect = 4.0
 Identities = 14/26 (53%), Positives = 19/26 (73%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K+KKKKKKKKK+ +  ++KKKK    
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVS 107



 Score = 26.0 bits (58), Expect = 8.8
 Identities = 13/34 (38%), Positives = 23/34 (67%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K+K K K KKK+ +  ++KKKK    +++K+ +K
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQK 115


>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain.  This presumed domain is
           found at the C-terminus of a family of FtsJ-like
           methyltransferases. Members of this family are involved
           in 60S ribosomal biogenesis.
          Length = 212

 Score = 28.0 bits (63), Expect = 2.8
 Identities = 23/52 (44%), Positives = 26/52 (50%), Gaps = 5/52 (9%)

Query: 134 EGEK-KEKKKNKNKNKKKKKKKKKK----KKKKKKKKKKKKKTKKFLPPSDP 180
           E EK K+ KK   K KKKKKK+KK     KK  K K  + K  K      DP
Sbjct: 146 EREKAKQIKKLYKKAKKKKKKRKKTYVVAKKGNKGKAGRPKGVKGKYKVVDP 197


>gnl|CDD|220121 pfam09135, Alb1, Alb1.  Alb1 is a nuclear shuttling factor involved
           in ribosome biogenesis.
          Length = 112

 Score = 27.1 bits (60), Expect = 2.9
 Identities = 13/31 (41%), Positives = 23/31 (74%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
             KN+  +KKK++KKK K+  +K++K++KK 
Sbjct: 40  ASKNEGVSKKKQRKKKPKRLTRKQRKRQKKG 70



 Score = 25.9 bits (57), Expect = 6.9
 Identities = 12/32 (37%), Positives = 22/32 (68%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
             +KN+   KKK++KKK K+  +K++K++K  
Sbjct: 39  AASKNEGVSKKKQRKKKPKRLTRKQRKRQKKG 70


>gnl|CDD|177060 CHL00138, rps5, ribosomal protein S5; Validated.
          Length = 143

 Score = 27.6 bits (62), Expect = 2.9
 Identities = 15/60 (25%), Positives = 21/60 (35%), Gaps = 12/60 (20%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKK------------KKKKKKKKKKTKKFLPPSDPSGKAGV 186
                K  NKKK +K   K+ K            K  K  KK   +  +   + +G  GV
Sbjct: 2   LFLLKKMYNKKKNRKSNIKENKWEERVIQIKRVSKVVKGGKKLSFRAIVVIGNENGWVGV 61


>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
           pleckstrin homology (PH) domain.  The BCR gene is one of
           the two genes in the BCR-ABL complex, which is
           associated with the Philadelphia chromosome, a product
           of a reciprocal translocation between chromosomes 22 and
           9. BCR is a GTPase-activating protein (GAP) for RAC1
           (primarily) and CDC42. The Dbl region of BCR has the
           most RhoGEF activity for Cdc42, and less activity
           towards Rac and Rho. Since BCR possesses both GAP and
           GEF activities, it may function to temporally regulate
           the activity of these GTPases. It also displays
           serine/threonine kinase activity. The BCR protein
           contains multiple domains including an N-terminal kinase
           domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
           domain, and a C-terminal RhoGAP domain. ABR, a related
           smaller protein, is structurally similar to BCR, but
           lacks the N-terminal kinase domain and has GAP activity
           for both Rac and Cdc42. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 179

 Score = 27.7 bits (62), Expect = 2.9
 Identities = 8/33 (24%), Positives = 13/33 (39%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
                K K K    ++KK      +  +K +KK
Sbjct: 97  AISALKQKIKRDSAERKKAASSGSRAIEKLRKK 129


>gnl|CDD|237803 PRK14724, PRK14724, DNA topoisomerase III; Provisional.
          Length = 987

 Score = 28.4 bits (63), Expect = 3.0
 Identities = 14/43 (32%), Positives = 15/43 (34%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
           K  K   K   KK   K    K  +K  KKK       L PS 
Sbjct: 878 KAAKPAKKAAAKKVAAKTAAAKTPRKAAKKKAAPPAAGLKPSA 920


>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 158 and 187 amino
           acids in length. This family includes the PcfB protein.
          Length = 137

 Score = 27.2 bits (61), Expect = 3.1
 Identities = 17/38 (44%), Positives = 25/38 (65%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           L+ E+ EK   + K KK K+++  KKK++K K K KKK
Sbjct: 100 LKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 26.1 bits (58), Expect = 3.1
 Identities = 10/41 (24%), Positives = 27/41 (65%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
            L  +   K K K K  K +++K++++++ +K++++K++ K
Sbjct: 13  MLNADLLAKLKAKKKELKAEEEKREEEEEARKREERKEREK 53


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 27.3 bits (61), Expect = 3.2
 Identities = 11/44 (25%), Positives = 28/44 (63%), Gaps = 5/44 (11%)

Query: 136 EKKEKKKNKNKNKKKKKK-----KKKKKKKKKKKKKKKKKTKKF 174
           EK+ +K+ K + ++K+K+      +K++++K+ ++ +K K K  
Sbjct: 20  EKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPL 63



 Score = 27.3 bits (61), Expect = 3.7
 Identities = 9/36 (25%), Positives = 27/36 (75%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           ++K       ++K+++K+++K++K++K++K+K+  K
Sbjct: 5   RDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGK 40


>gnl|CDD|219755 pfam08231, SYF2, SYF2 splicing factor.  Proteins in this family are
           involved in cell cycle progression and pre-mRNA
           splicing.
          Length = 157

 Score = 27.3 bits (61), Expect = 3.3
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
           +G   E+K+  + + ++ +K  KKKKKKK K  
Sbjct: 3   QGIDFERKRALDWSAEESEKWDKKKKKKKNKDD 35


>gnl|CDD|218991 pfam06326, Vesiculo_matrix, Vesiculovirus matrix protein.  This
           family consists of several Vesiculovirus matrix
           proteins. The matrix (M) protein of vesicular stomatitis
           virus (VSV) expressed in the absence of other viral
           components causes many of the cytopathic effects of VSV,
           including an inhibition of host gene expression and the
           induction of cell rounding. It has been shown that M
           protein also induces apoptosis in the absence of other
           viral components. It is thought that the activation of
           apoptotic pathways causes the inhibition of host gene
           expression and cell rounding by M protein.
          Length = 229

 Score = 27.8 bits (62), Expect = 3.3
 Identities = 10/33 (30%), Positives = 12/33 (36%)

Query: 150 KKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSG 182
            K  K K K+   KK K  KK    +    P  
Sbjct: 4   FKLLKLKGKEVASKKSKGSKKLNMSMDWDAPPS 36



 Score = 26.7 bits (59), Expect = 8.7
 Identities = 11/33 (33%), Positives = 13/33 (39%)

Query: 149 KKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
            K  K K K+   KK K  KK        + PS
Sbjct: 4   FKLLKLKGKEVASKKSKGSKKLNMSMDWDAPPS 36


>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal.  The yeast member
           of this family (Kri1p) is found to be required for 40S
           ribosome biogenesis in the nucleolus. This is the
           C-terminal domain of the family.
          Length = 93

 Score = 26.8 bits (60), Expect = 3.3
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTK 172
           K++++K KKK  KK + ++ KKK  
Sbjct: 69  KEERRKDKKKYGKKARLREWKKKVF 93


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 28.0 bits (62), Expect = 3.3
 Identities = 12/43 (27%), Positives = 17/43 (39%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
              + EKK +   + K   +KKK   K K  K     K   +K
Sbjct: 213 AKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERK 255



 Score = 27.2 bits (60), Expect = 5.8
 Identities = 11/38 (28%), Positives = 17/38 (44%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           E + K K + K +   ++K   +KKK   K K  K   
Sbjct: 210 EAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAA 247



 Score = 26.8 bits (59), Expect = 8.5
 Identities = 13/42 (30%), Positives = 18/42 (42%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           A E  K E +      KK +   ++K   +KKK   K K  K
Sbjct: 203 AAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADK 244


>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein.  NinG or Rap is
           involved in recombination. Rap (recombination adept with
           plasmid) increases lambda-by-plasmid recombination
           catalyzed by Escherichia coli's RecBCD pathway.
          Length = 188

 Score = 27.7 bits (62), Expect = 3.4
 Identities = 9/34 (26%), Positives = 21/34 (61%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
             K +   +KK+K + + ++++ K +K+K KT+ 
Sbjct: 33  ALKREKAQEKKRKAEAQAERRELKARKEKLKTRS 66


>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
           structure and biogenesis].
          Length = 107

 Score = 26.8 bits (59), Expect = 3.5
 Identities = 15/36 (41%), Positives = 18/36 (50%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
              K   +K KK +K K    K KKK  KKKKK + 
Sbjct: 1   MAPKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEA 36



 Score = 26.0 bits (57), Expect = 7.1
 Identities = 15/31 (48%), Positives = 18/31 (58%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
             KEKK  K K    K KKK  KKKKK++ +
Sbjct: 7   LSKEKKAEKAKAGTAKDKKKWSKKKKKEEAR 37


>gnl|CDD|217011 pfam02388, FemAB, FemAB family.  The femAB operon codes for two
           nearly identical approximately 50-kDa proteins involved
           in the formation of the Staphylococcal pentaglycine
           interpeptide bridge in peptidoglycan. These proteins are
           also considered as a factor influencing the level of
           methicillin resistance.
          Length = 408

 Score = 28.0 bits (63), Expect = 3.5
 Identities = 12/40 (30%), Positives = 22/40 (55%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
            K E+K  KN N KKK K  + +++    +K+  + K+ +
Sbjct: 259 AKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELI 298


>gnl|CDD|226659 COG4196, COG4196, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 808

 Score = 28.0 bits (62), Expect = 3.7
 Identities = 28/96 (29%), Positives = 40/96 (41%), Gaps = 18/96 (18%)

Query: 53  PLGSLRYMPP-----VTPSTWRAPRFADTYSAVCPQRLPDIGNRTEALLQLPRGRL--VF 105
           PL S+ ++PP       PS  R P  A+ ++A      P    RT AL+  PRG    +F
Sbjct: 260 PLDSISWVPPRASFDADPSAPRGPLPAEPHTAAAVVENPGTAPRT-ALVAEPRGGRLCIF 318

Query: 106 ------LEKLLPLLSNQSEDCLYLNLYVPRPVALEG 135
                 LE  L L++        +      PV +EG
Sbjct: 319 LPPTEALEHFLELVAAVEAAATTIGC----PVVIEG 350


>gnl|CDD|184516 PRK14115, gpmA, phosphoglyceromutase; Provisional.
          Length = 247

 Score = 27.5 bits (62), Expect = 3.8
 Identities = 12/47 (25%), Positives = 17/47 (36%), Gaps = 11/47 (23%)

Query: 49  YATPPLGSLRYMPPVTPSTWRAPRFADTYSAVCPQRLPDIGNRTEAL 95
           Y  PP       P +     R P     Y+ +  + LP     TE+L
Sbjct: 116 YDVPP-------PALEKDDERYPGHDPRYAKLPEEELP----LTESL 151


>gnl|CDD|217754 pfam03836, RasGAP_C, RasGAP C-terminus. 
          Length = 139

 Score = 27.2 bits (61), Expect = 3.8
 Identities = 13/31 (41%), Positives = 16/31 (51%)

Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
              +KKKK  K+K   KK +KK  K  KF  
Sbjct: 86  STLQKKKKFSKQKFSVKKLQKKHGKKPKFGS 116



 Score = 26.4 bits (59), Expect = 6.8
 Identities = 13/29 (44%), Positives = 16/29 (55%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
           K+KK +K K   KK +KK  KK K    K
Sbjct: 90  KKKKFSKQKFSVKKLQKKHGKKPKFGSYK 118


>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein.  This model
           describes nucleoid occlusion protein, a close homolog to
           ParB chromosome partitioning proteins including Spo0J in
           Bacillus subtilis. Its gene often is located near the
           gene for the Spo0J ortholog. This protein bind a
           specific DNA sequence and blocks cytokinesis from
           happening until chromosome segregation is complete.
          Length = 255

 Score = 27.5 bits (62), Expect = 3.8
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           K+ ++  KK  +K +K KKKKK+ K F
Sbjct: 185 KQTEELIKKLLEKPEKPKKKKKRRKGF 211



 Score = 27.1 bits (61), Expect = 5.7
 Identities = 14/36 (38%), Positives = 21/36 (58%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            EK  N  + ++  KK  +K +K KKKKK++K   K
Sbjct: 178 IEKGLNVKQTEELIKKLLEKPEKPKKKKKRRKGFSK 213



 Score = 26.3 bits (59), Expect = 8.9
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           KK  +K +K KKKKK++K   K  +  +
Sbjct: 192 KKLLEKPEKPKKKKKRRKGFSKDVRIAV 219


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 27.7 bits (61), Expect = 4.0
 Identities = 18/39 (46%), Positives = 26/39 (66%), Gaps = 1/39 (2%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKK-KKKKKKKTKK 173
           +KK  KK K  N KK+++ +  KKKKK+  K+KK  +KK
Sbjct: 553 QKKLYKKMKYSNAKKEEQAENLKKKKKQIAKQKKLDSKK 591



 Score = 27.0 bits (59), Expect = 7.7
 Identities = 14/39 (35%), Positives = 24/39 (61%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           E  + +K  NK+KNKK+K  +++++KK K      K+ K
Sbjct: 517 ETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKK 555



 Score = 27.0 bits (59), Expect = 8.9
 Identities = 13/43 (30%), Positives = 23/43 (53%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           LE E +  K ++     K   K K KK+K  +++++KK K  +
Sbjct: 506 LELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIM 548


>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e. 
          Length = 108

 Score = 26.8 bits (60), Expect = 4.0
 Identities = 18/47 (38%), Positives = 28/47 (59%), Gaps = 4/47 (8%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKK----KKKKKKKKKKKTKK 173
           V + G K  +K N    K++KKKKKKK +     +KK+KK+  + +K
Sbjct: 21  VDISGVKVPEKINDEYFKREKKKKKKKSEGEFFAEKKEKKEVSEQRK 67



 Score = 26.0 bits (58), Expect = 6.0
 Identities = 13/35 (37%), Positives = 24/35 (68%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
           K+EKKK K K++ +   +KK+KK+  +++K  +K 
Sbjct: 38  KREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72


>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
          Length = 112

 Score = 26.9 bits (58), Expect = 4.1
 Identities = 16/45 (35%), Positives = 27/45 (60%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
           EKK  K +N+ +KK +K + + +KK +K + + +K    SD S K
Sbjct: 66  EKKPEKPENEDEKKPEKPENEDEKKPEKPENEDEKEFEDSDASDK 110


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 27.7 bits (62), Expect = 4.1
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 152 KKKKKKKKKKKKKKKKKKKT 171
           K+K++  KK+KKKK KK K 
Sbjct: 356 KRKREGDKKQKKKKSKKLKL 375



 Score = 27.7 bits (62), Expect = 4.1
 Identities = 12/28 (42%), Positives = 16/28 (57%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKK 154
             RP+AL  ++K +   K K KK KK K
Sbjct: 347 FERPLALSPKRKREGDKKQKKKKSKKLK 374



 Score = 27.7 bits (62), Expect = 4.2
 Identities = 12/21 (57%), Positives = 16/21 (76%)

Query: 151 KKKKKKKKKKKKKKKKKKKKT 171
           K+K++  KK+KKKK KK K T
Sbjct: 356 KRKREGDKKQKKKKSKKLKLT 376



 Score = 27.3 bits (61), Expect = 4.7
 Identities = 11/32 (34%), Positives = 18/32 (56%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
                ++    + K+K++  KK+KKKK KK K
Sbjct: 343 PPAAFERPLALSPKRKREGDKKQKKKKSKKLK 374



 Score = 27.3 bits (61), Expect = 5.2
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 150 KKKKKKKKKKKKKKKKKKK 168
           K+K++  KK+KKKK KK K
Sbjct: 356 KRKREGDKKQKKKKSKKLK 374



 Score = 26.6 bits (59), Expect = 8.8
 Identities = 10/17 (58%), Positives = 15/17 (88%)

Query: 157 KKKKKKKKKKKKKKTKK 173
           K+K++  KK+KKKK+KK
Sbjct: 356 KRKREGDKKQKKKKSKK 372


>gnl|CDD|240272 PTZ00109, PTZ00109, DNA gyrase subunit b; Provisional.
          Length = 903

 Score = 27.9 bits (62), Expect = 4.1
 Identities = 11/37 (29%), Positives = 18/37 (48%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
             K + KN  K+ KK K    K  K+ K+K+  +  +
Sbjct: 208 SDKSEDKNDTKSSKKGKSSHVKGPKEAKEKESSQMYE 244


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
           unknown].
          Length = 869

 Score = 27.7 bits (61), Expect = 4.2
 Identities = 10/38 (26%), Positives = 20/38 (52%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            K+ KKK   + ++ K+ KKK  +++  ++      KK
Sbjct: 644 IKRIKKKAYRRIRRDKRLKKKMPEEENTQENHLGSEKK 681



 Score = 27.7 bits (61), Expect = 4.4
 Identities = 18/60 (30%), Positives = 24/60 (40%), Gaps = 7/60 (11%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKK----KKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
           +  +KK N  K      K  K+K          +K  KK KKFL    P      ++REQ
Sbjct: 771 KVIKKKSNTVKKVVGGIKYSKRKDFNLSHVIINEKVNKKDKKFLADEVP---FPFETREQ 827


>gnl|CDD|214659 smart00433, TOP2c, TopoisomeraseII.  Eukaryotic DNA topoisomerase
           II, GyrB, ParE.
          Length = 594

 Score = 27.9 bits (63), Expect = 4.2
 Identities = 15/59 (25%), Positives = 22/59 (37%), Gaps = 4/59 (6%)

Query: 136 EKKEKKKNKNK--NKKKKKKKKKKKKKKKKKKKKKKKTKKF-LPPS-DPSGKAGVKSRE 190
             +E     +K   K     K +   KK ++  +KKK     LP     +  AG K  E
Sbjct: 325 FLEENPVEASKIVEKVLLAAKARAAAKKARELTRKKKLSSISLPGKLADASSAGPKKCE 383


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 27.6 bits (62), Expect = 4.2
 Identities = 12/50 (24%), Positives = 25/50 (50%), Gaps = 1/50 (2%)

Query: 135 GEKKEKKKNKNKNKKKK-KKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
           G+    K  K K++ K+ +K + +  +++  ++ K    +F PP    GK
Sbjct: 270 GKAAASKAKKAKSRIKRLEKLEARLAEERPVEEGKPLAFRFPPPGKRLGK 319


>gnl|CDD|216919 pfam02181, FH2, Formin Homology 2 Domain. 
          Length = 372

 Score = 27.6 bits (62), Expect = 4.4
 Identities = 16/30 (53%), Positives = 17/30 (56%)

Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
             K KKKK KK +KK   KKKKK    L P
Sbjct: 51  SAKAKKKKSKKSEKKSSSKKKKKEISVLDP 80


>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
           splicing MRS1.  Members of this family adopt a secondary
           structure consisting of two beta sheets and one alpha
           helix, arranged as a beta-alpha-beta motif. Each beta
           sheet has five strands, arranged in a 32145 order, with
           the second strand being antiparallel to the rest.
           Mitochondrial resolvase Ydc2 is capable of resolving
           Holliday junctions and cleaves DNA after 5'-CT-3' and
           5'-TT-3' sequences. This family also contains the
           mitochondrial RNA-splicing protein MRS1 which is
           involved in the excision of group I introns.
          Length = 254

 Score = 27.3 bits (61), Expect = 4.4
 Identities = 10/27 (37%), Positives = 12/27 (44%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKK 165
            K + K KNKK  K+ K   K     K
Sbjct: 205 PKWEGKTKNKKSLKEYKDLIKLLDSGK 231


>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
           Members of the NSCC2 family have been sequenced from
           various yeast, fungal and animals species including
           Saccharomyces cerevisiae, Drosophila melanogaster and
           Homo sapiens. These proteins are the Sec62 proteins,
           believed to be associated with the Sec61 and Sec63
           constituents of the general protein secretary systems of
           yeast microsomes. They are also the non-selective cation
           (NS) channels of the mammalian cytoplasmic membrane. The
           yeast Sec62 protein has been shown to be essential for
           cell growth. The mammalian NS channel proteins has been
           implicated in platelet derived growth factor(PGDF)
           dependent single channel current in fibroblasts. These
           channels are essentially closed in serum deprived
           tissue-culture cells and are specifically opened by
           exposure to PDGF. These channels are reported to exhibit
           equal selectivity for Na+, K+ and Cs+ with low
           permeability to Ca2+, and no permeability to anions
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 232

 Score = 27.6 bits (61), Expect = 4.4
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query: 142 KNKNKNKKKKKKKKKKKKKKKK 163
           K+K   KKK K KK KKK+ K+
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229



 Score = 26.8 bits (59), Expect = 6.4
 Identities = 12/23 (52%), Positives = 15/23 (65%)

Query: 143 NKNKNKKKKKKKKKKKKKKKKKK 165
            K+K   KKK K KK KKK+ K+
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229



 Score = 26.8 bits (59), Expect = 7.1
 Identities = 12/23 (52%), Positives = 15/23 (65%)

Query: 145 NKNKKKKKKKKKKKKKKKKKKKK 167
            K+K   KKK K KK KKK+ K+
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229



 Score = 26.4 bits (58), Expect = 8.4
 Identities = 12/23 (52%), Positives = 14/23 (60%)

Query: 147 NKKKKKKKKKKKKKKKKKKKKKK 169
            K K   KKK K KK KKK+ K+
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229


>gnl|CDD|221856 pfam12922, Cnd1_N, non-SMC mitotic condensation complex subunit 1,
           N-term.  The three non-SMC (structural maintenance of
           chromosomes) subunits of the mitotic condensation
           complex are Cnd1-3. The whole complex is essential for
           viability and the condensing of chromosomes in mitosis.
           This is the conserved N-terminus of the subunit 1.
          Length = 169

 Score = 27.2 bits (61), Expect = 4.7
 Identities = 13/36 (36%), Positives = 16/36 (44%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
            ALE  +K       +  K KKK+KK KK      K
Sbjct: 56  EALEKAEKSAAAAPARAGKGKKKRKKAKKAVVASWK 91


>gnl|CDD|234298 TIGR03659, IsdE, heme ABC transporter, heme-binding protein isdE.
           This family of ABC substrate-binding proteins is
           observed primarily in close proximity with proteins
           localized to the cell wall and bearing the NEAT (NEAr
           Transporter, pfam05031) heme-binding domain. IsdE has
           been shown to bind heme and is involved in the process
           of scavenging heme for the purpose of obtaining iron.
          Length = 289

 Score = 27.2 bits (61), Expect = 4.8
 Identities = 14/35 (40%), Positives = 21/35 (60%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           GEK  +++   K  K+  +K+ + KKK K KKK K
Sbjct: 135 GEKYGREEQAEKLVKEINEKEAEVKKKVKGKKKPK 169


>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein.  This is a family of
           121-amino acid secretory proteins. Laps functions in the
           regulation of neuronal cell adhesion and/or movement and
           synapse attachment. Laps binds to the ApC/EBP (Aplysia
           CCAAT/enhancer binding protein) promoter and activates
           the transcription of ApC/EBP mRNA.
          Length = 124

 Score = 26.7 bits (59), Expect = 4.9
 Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 8/52 (15%)

Query: 133 LEGEKKEKKKNKNKNKKK--------KKKKKKKKKKKKKKKKKKKKTKKFLP 176
           +E + K   +NK   + +         +++ KK KKK++KKKKKKK  K L 
Sbjct: 73  MEVDNKAAVRNKKTLRDQHGQYPAWMNQRQAKKLKKKREKKKKKKKGAKNLA 124


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 27.0 bits (60), Expect = 4.9
 Identities = 9/28 (32%), Positives = 14/28 (50%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKK 163
             +E+K+      KK    +K + KKKK
Sbjct: 169 AAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 26.2 bits (58), Expect = 9.4
 Identities = 9/27 (33%), Positives = 12/27 (44%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKK 160
             E+KE      K     +K + KKKK
Sbjct: 170 AEERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|233309 TIGR01200, GLPGLI, GLPGLI family protein.  This protein family was
           first noted as a paralogous set in Porphyromonas
           gingivalis, but it is more widely distributed among the
           Bacteroidetes. The protein family is now renamed GLPGLI
           after its best-conserved motif.
          Length = 227

 Score = 27.3 bits (61), Expect = 5.0
 Identities = 11/34 (32%), Positives = 15/34 (44%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
            K +          K KK KK  ++K KK K+K 
Sbjct: 193 IKIELNPTPIIIIPKPKKGKKISREKFKKIKEKY 226



 Score = 26.6 bits (59), Expect = 7.3
 Identities = 11/32 (34%), Positives = 14/32 (43%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           E          K KK KK  ++K KK K+K  
Sbjct: 196 ELNPTPIIIIPKPKKGKKISREKFKKIKEKYY 227



 Score = 26.6 bits (59), Expect = 8.4
 Identities = 12/35 (34%), Positives = 15/35 (42%)

Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
            K   N        K KK KK  ++K KK K K +
Sbjct: 193 IKIELNPTPIIIIPKPKKGKKISREKFKKIKEKYY 227


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 27.4 bits (62), Expect = 5.1
 Identities = 7/38 (18%), Positives = 28/38 (73%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           EK+E++  K + + ++K+++ +KK+++ ++  +++ ++
Sbjct: 106 EKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143



 Score = 26.7 bits (60), Expect = 8.4
 Identities = 8/39 (20%), Positives = 27/39 (69%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           + E  EK++ + + K+K+ ++K+++ +KK+++ ++   +
Sbjct: 101 KLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEE 139


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 27.6 bits (62), Expect = 5.1
 Identities = 10/39 (25%), Positives = 19/39 (48%), Gaps = 5/39 (12%)

Query: 142 KNKNKNKKKKKKKKKK-----KKKKKKKKKKKKKTKKFL 175
           + K K KK  +K+ +       KK+K+ +   +K +  L
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRHVL 122


>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
          Length = 205

 Score = 27.3 bits (61), Expect = 5.2
 Identities = 14/33 (42%), Positives = 20/33 (60%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           KK+K K + +NKK KKK +    K + +KK  K
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155



 Score = 26.5 bits (59), Expect = 7.1
 Identities = 13/34 (38%), Positives = 19/34 (55%)

Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
           G+KK+ KK +   K KKK +    K + +KK  K
Sbjct: 122 GKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 27.0 bits (60), Expect = 5.3
 Identities = 6/40 (15%), Positives = 30/40 (75%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           E+K+ ++ + + ++++++++KK ++K++ ++K+++  +  
Sbjct: 12  EEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEE 51


>gnl|CDD|236108 PRK07824, PRK07824, O-succinylbenzoic acid--CoA ligase;
           Provisional.
          Length = 358

 Score = 27.3 bits (61), Expect = 5.4
 Identities = 11/32 (34%), Positives = 13/32 (40%)

Query: 41  VEAYLGVPYATPPLGSLRYMPPVTPSTWRAPR 72
           V A +G     P L +LR     T     APR
Sbjct: 299 VAAVVGDGGPAPTLEALRAHVARTLDRTAAPR 330


>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
          Length = 1622

 Score = 27.4 bits (61), Expect = 5.4
 Identities = 13/40 (32%), Positives = 16/40 (40%)

Query: 126 YVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKK 165
           YV      E ++   K   N N    KK    KKK K+ K
Sbjct: 846 YVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGK 885


>gnl|CDD|227881 COG5594, COG5594, Uncharacterized integral membrane protein
           [Function unknown].
          Length = 827

 Score = 27.4 bits (61), Expect = 5.5
 Identities = 12/43 (27%), Positives = 19/43 (44%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
           ++  K+++K   K +K   +   K  KK   K  K    L PS
Sbjct: 251 QELYKERDKILKKLEKALNELLNKLLKKSHLKTNKKSGKLTPS 293


>gnl|CDD|188606 TIGR04091, LTA_dltB, D-alanyl-lipoteichoic acid biosynthesis
           protein DltB.  Members of this protein family are DltB,
           part of a four-gene operon for D-alanyl-lipoteichoic
           acid biosynthesis that is present in the vast majority
           of low-GC Gram-positive organisms. This protein may be
           involved in transport of D-alanine across the plasma
           membrane [Cell envelope, Biosynthesis and degradation of
           murein sacculus and peptidoglycan].
          Length = 380

 Score = 27.1 bits (61), Expect = 5.5
 Identities = 10/16 (62%), Positives = 11/16 (68%)

Query: 157 KKKKKKKKKKKKKKTK 172
           +K KK KK KK K TK
Sbjct: 343 RKSKKYKKNKKNKWTK 358


>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
           unknown].
          Length = 622

 Score = 27.4 bits (60), Expect = 5.5
 Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           +E+++N+ + + KK K       K KK+   KK KK
Sbjct: 442 RERQENRKERQLKKTKDDSDVDLKDKKESINKKNKK 477



 Score = 27.1 bits (59), Expect = 7.3
 Identities = 14/48 (29%), Positives = 21/48 (43%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
             E++E +K +   K K       K KK+   KK KK K  +  +  S
Sbjct: 441 LRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHAIERTAAS 488


>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
          Length = 763

 Score = 27.5 bits (61), Expect = 5.8
 Identities = 13/34 (38%), Positives = 16/34 (47%)

Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
             LEG +  +K       +  K  KK KK KKKK
Sbjct: 225 AVLEGGELIEKPQAEAENEAGKSDKKDKKSKKKK 258


>gnl|CDD|222791 PHA00431, PHA00431, internal virion protein C.
          Length = 746

 Score = 27.5 bits (61), Expect = 6.0
 Identities = 7/34 (20%), Positives = 19/34 (55%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           E+  +K  K+ NK+       + +++ ++K+K+ 
Sbjct: 703 EEAREKALKDANKRAPIVAATQAREEVREKRKQT 736



 Score = 27.1 bits (60), Expect = 6.4
 Identities = 8/38 (21%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 139 EKKKNKN-KNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           E+ + K  K+  K+       + +++ ++K+K+T  F+
Sbjct: 703 EEAREKALKDANKRAPIVAATQAREEVREKRKQTSGFM 740


>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein. 
          Length = 105

 Score = 26.2 bits (58), Expect = 6.0
 Identities = 13/32 (40%), Positives = 14/32 (43%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           KK + K  K        K KKKK  K K K K
Sbjct: 4   KKQQKKAAKAAAASAGGKAKKKKWSKGKVKDK 35


>gnl|CDD|215343 PLN02638, PLN02638, cellulose synthase A (UDP-forming), catalytic
           subunit.
          Length = 1079

 Score = 27.2 bits (60), Expect = 6.1
 Identities = 15/43 (34%), Positives = 19/43 (44%), Gaps = 10/43 (23%)

Query: 148 KKKKKK----------KKKKKKKKKKKKKKKKKTKKFLPPSDP 180
           K K KK           +KK  K  KK   KKK+ K + P+ P
Sbjct: 646 KPKHKKPGFLSSLCGGSRKKSSKSSKKGSDKKKSGKHVDPTVP 688


>gnl|CDD|184210 PRK13651, PRK13651, cobalt transporter ATP-binding subunit;
           Provisional.
          Length = 305

 Score = 27.0 bits (60), Expect = 6.1
 Identities = 14/31 (45%), Positives = 21/31 (67%), Gaps = 3/31 (9%)

Query: 146 KNKKKKKKKKKKKKKKKK---KKKKKKKTKK 173
           K++K KKK K+K+K  +K   +K + KK KK
Sbjct: 69  KDEKNKKKTKEKEKVLEKLVIQKTRFKKIKK 99



 Score = 26.6 bits (59), Expect = 9.7
 Identities = 15/37 (40%), Positives = 22/37 (59%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           K EK K K K K+K  +K   +K + KK KK K+ ++
Sbjct: 69  KDEKNKKKTKEKEKVLEKLVIQKTRFKKIKKIKEIRR 105


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 27.1 bits (60), Expect = 6.1
 Identities = 14/48 (29%), Positives = 22/48 (45%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
           +    K+N  K         KKK K+KK+KK+  ++       + SGK
Sbjct: 203 KAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGK 250



 Score = 26.7 bits (59), Expect = 7.7
 Identities = 13/41 (31%), Positives = 23/41 (56%)

Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
            KK N   +  KKK K+KK+KK+  +   K+++++     D
Sbjct: 212 PKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRD 252


>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
           PREFRAN.  This family is the long protein of a novel
           CRISPR subtype, PREFRAN, which is most common in
           Prevotella and Francisella, although widely distributed.
           The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
           helicase Cas3 and endonuclease Cas3-HD.
          Length = 1287

 Score = 27.1 bits (60), Expect = 6.4
 Identities = 12/34 (35%), Positives = 20/34 (58%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
           E+  KK+ + +  K KK+K ++K KK  KK+   
Sbjct: 378 EEYIKKQIEPRKGKSKKEKYEEKVKKVVKKQDYL 411



 Score = 27.1 bits (60), Expect = 6.9
 Identities = 13/35 (37%), Positives = 20/35 (57%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           KK     K K KK+K ++K KK  KK+   + +F+
Sbjct: 382 KKQIEPRKGKSKKEKYEEKVKKVVKKQDYLSIQFI 416


>gnl|CDD|225087 COG2176, PolC, DNA polymerase III, alpha subunit (gram-positive
           type) [DNA replication, recombination, and repair].
          Length = 1444

 Score = 27.3 bits (61), Expect = 6.5
 Identities = 14/70 (20%), Positives = 30/70 (42%), Gaps = 1/70 (1%)

Query: 108 KLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
            +   +++ SE+  +          +E   +E  + + K K +  K +K K     +K +
Sbjct: 162 LIEFEVNDISEEQEFEKFEEAINEEVEKAAQEALEAEKKLKAESPKVEKPKPLFDGQKGR 221

Query: 168 K-KKTKKFLP 176
           K K T++  P
Sbjct: 222 KIKSTEEIKP 231


>gnl|CDD|217935 pfam04158, Sof1, Sof1-like domain.  Sof1 is essential for cell
           growth and is a component of the nucleolar rRNA
           processing machinery.
          Length = 88

 Score = 25.7 bits (57), Expect = 6.7
 Identities = 11/45 (24%), Positives = 21/45 (46%)

Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
           VP+ +    + K + K   K K++ ++K  K      K ++KK  
Sbjct: 43  VPKAIKKAQKIKREMKEAKKRKEENRRKHSKPGSVPPKPERKKHV 87


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 26.4 bits (59), Expect = 6.7
 Identities = 12/45 (26%), Positives = 28/45 (62%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
           A +  ++EKK  K K ++ K+ K + ++ K + +K ++K +++ P
Sbjct: 65  AEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEKLEEKLEEYQP 109


>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2).  Surfeit locus
           protein 2 is part of a group of at least six sequence
           unrelated genes (Surf-1 to Surf-6). The six Surfeit
           genes have been classified as housekeeping genes, being
           expressed in all tissue types tested and not containing
           a TATA box in their promoter region. The exact function
           of SURF2 is unknown.
          Length = 244

 Score = 26.8 bits (59), Expect = 6.8
 Identities = 11/40 (27%), Positives = 20/40 (50%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           + ++ E +  + + K+ KK+     KK KK  KKK   + 
Sbjct: 201 DEDEMEVESPELQQKRSKKQSGSLTKKFKKNHKKKGPFRF 240


>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
          Length = 212

 Score = 26.6 bits (59), Expect = 7.0
 Identities = 13/36 (36%), Positives = 21/36 (58%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           K+E+  N  +  ++ K   K   KK+KKK+ K+K K
Sbjct: 177 KEERAFNAYRTLRQAKLNAKFVGKKEKKKQAKEKKK 212


>gnl|CDD|182933 PRK11057, PRK11057, ATP-dependent DNA helicase RecQ; Provisional.
          Length = 607

 Score = 27.0 bits (60), Expect = 7.1
 Identities = 17/53 (32%), Positives = 25/53 (47%), Gaps = 4/53 (7%)

Query: 107 EKLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKK 159
           E   P+L  +      L L VPR VAL+    +K    N ++K   K +K +K
Sbjct: 496 EAARPVLRGEVS----LQLAVPRIVALKPRAMQKSFGGNYDRKLFAKLRKLRK 544


>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
           modification].
          Length = 429

 Score = 26.9 bits (59), Expect = 7.3
 Identities = 7/34 (20%), Positives = 14/34 (41%)

Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
             + KN +    K +       ++ ++KKK   T
Sbjct: 12  NNQNKNASVVNNKAEIAAMIDARRLEQKKKGGVT 45


>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
           N-terminus.  This is an N-terminal domain of DNA
           polymerase III polC subunit A that is found only in
           Firmicutes. DNA polymerase polC-type III enzyme
           functions as the 'replicase' in low G + C Gram-positive
           bacteria. Purine asymmetry is a characteristic of
           organisms with a heterodimeric DNA polymerase III
           alpha-subunit constituted by polC which probably plays a
           direct role in the maintenance of strand-biased gene
           distribution; since, among prokaryotic genomes, the
           distribution of genes on the leading and lagging strands
           of the replication fork is known to be biased. The
           domain is associated with DNA_pol3_alpha pfam07733.
          Length = 180

 Score = 26.5 bits (59), Expect = 7.3
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKK 162
           E++E K  +   +  KKK+ +KKKK+K
Sbjct: 153 EEEEAKLAEEALEALKKKEAEKKKKEK 179


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 27.0 bits (60), Expect = 7.3
 Identities = 10/39 (25%), Positives = 18/39 (46%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
            +   +   K  K+K  ++K K   K K++ + K K F 
Sbjct: 919 PENGTELVEKLAKEKSLREKNKDDWKAKEEVEAKLKAFF 957


>gnl|CDD|143303 cd05895, Ig_Pro_neuregulin-1, Immunoglobulin (Ig)-like domain found
           in neuregulin (NRG)-1.  Ig_Pro_neuregulin-1:
           immunoglobulin (Ig)-like domain found in neuregulin
           (NRG)-1. There are many NRG-1 isoforms which arise from
           the alternative splicing of mRNA. NRG-1 belongs to the
           neuregulin gene family, which is comprised of four
           genes. This group represents NRG-1. NRGs are signaling
           molecules, which participate in cell-cell interactions
           in the nervous system, breast, and heart, and other
           organ systems, and are implicated in the pathology of
           diseases including schizophrenia, multiple sclerosis,
           and breast cancer. The NRG-1 protein binds to and
           activates the tyrosine kinases receptors ErbB3 and
           ErbB4, initiating signaling cascades. NRG-1 has multiple
           functions; for example, in the brain it regulates
           various processes such as radial glia formation and
           neuronal migration, dendritic development, and
           expression of neurotransmitters receptors; in the
           peripheral nervous system NRG-1 regulates processes such
           as target cell differentiation, and Schwann cell
           survival.
          Length = 76

 Score = 25.3 bits (55), Expect = 7.4
 Identities = 11/25 (44%), Positives = 14/25 (56%)

Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKK 157
            +  K+   KNK  NK K +KKKK 
Sbjct: 19  FKNGKEIGAKNKPDNKIKIRKKKKS 43


>gnl|CDD|130109 TIGR01037, pyrD_sub1_fam, dihydroorotate dehydrogenase (subfamily
          1) family protein.  This family includes subfamily 1
          dihydroorotate dehydrogenases while excluding the
          closely related subfamily 2 (TIGR01036). This family
          also includes a number of uncharacterized proteins and
          a domain of dihydropyrimidine dehydrogenase. The
          uncharacterized proteins might all be dihydroorotate
          dehydrogenase.
          Length = 300

 Score = 26.6 bits (59), Expect = 7.5
 Identities = 11/24 (45%), Positives = 14/24 (58%)

Query: 22 RVARTKYGPLRGILIQNPPVEAYL 45
           +  T  G L  I +QNP VEA+L
Sbjct: 56 TIVETPCGMLNAIGLQNPGVEAFL 79


>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
          Length = 64

 Score = 25.2 bits (56), Expect = 7.5
 Identities = 12/42 (28%), Positives = 24/42 (57%)

Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           R V   G  +E ++ +   K  +K+K+KK   +K+++KK  +
Sbjct: 21  RKVEKAGILRELRRREFYEKPSEKRKRKKAAARKRRRKKLAR 62


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 27.0 bits (60), Expect = 7.7
 Identities = 10/42 (23%), Positives = 26/42 (61%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
            LE E KE ++ K + ++ +K+ +  +  K+K ++K ++ ++
Sbjct: 225 KLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEE 266



 Score = 26.6 bits (59), Expect = 9.4
 Identities = 14/45 (31%), Positives = 28/45 (62%)

Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
            L  E +E +K + + KK  +K K++ ++++K KK+ +K +K L 
Sbjct: 677 GLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALE 721


>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1.  TT virus (TTV), isolated
           initially from a Japanese patient with hepatitis of
           unknown aetiology, has since been found to infect both
           healthy and diseased individuals and numerous prevalence
           studies have raised questions about its role in
           unexplained hepatitis. ORF1 is a large 750 residue
           protein. The N-terminal half of this protein corresponds
           to the capsid protein.
          Length = 525

 Score = 26.8 bits (60), Expect = 7.8
 Identities = 9/27 (33%), Positives = 10/27 (37%)

Query: 152 KKKKKKKKKKKKKKKKKKKTKKFLPPS 178
           K K      K + K KK    K  PP 
Sbjct: 189 KHKILIPSLKTRPKGKKYVKIKIKPPK 215


>gnl|CDD|216016 pfam00602, Flu_PB1, Influenza RNA-dependent RNA polymerase subunit
           PB1.  Two GTP binding sites exist in this protein.
          Length = 740

 Score = 27.1 bits (60), Expect = 7.9
 Identities = 13/37 (35%), Positives = 22/37 (59%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
           +K E K  K+KN K++   K+  K  K+K+  K+K +
Sbjct: 176 DKPEVKFTKHKNNKRRYDDKETGKLLKRKQPSKEKDR 212


>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
          Length = 278

 Score = 26.8 bits (59), Expect = 8.0
 Identities = 16/42 (38%), Positives = 24/42 (57%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           E  + E KK + +  K +  KKKKK KK+K   + K  KK++
Sbjct: 48  EKREFELKKQQFEQLKAELSKKKKKFKKEKVDVRVKVVKKWI 89


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 26.5 bits (58), Expect = 8.1
 Identities = 11/31 (35%), Positives = 15/31 (48%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
           E E  E +   NK K +K+K    KK +K  
Sbjct: 253 ESESDEDEDEDNKGKIRKRKTDDAKKSRKPH 283


>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
          Length = 194

 Score = 26.3 bits (58), Expect = 8.1
 Identities = 7/34 (20%), Positives = 19/34 (55%)

Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
           K+E    K+  +++   +  KK+ K+  K++ ++
Sbjct: 3   KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQE 36


>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A.  This protein is encoded
           by an open reading frame in plasmid borne DNA repeats of
           Borrelia species. This protein is known as ORF-A. The
           function of this putative protein is unknown.
          Length = 285

 Score = 26.6 bits (59), Expect = 8.4
 Identities = 15/39 (38%), Positives = 19/39 (48%)

Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           +++ KKKN    +K K KK  KK   K   KK  K K  
Sbjct: 144 KEESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIKLL 182



 Score = 26.6 bits (59), Expect = 9.9
 Identities = 16/34 (47%), Positives = 19/34 (55%)

Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
            KN   NKK K++ KKK  K  +K K KK  KK 
Sbjct: 134 NKNNVINKKIKEESKKKNSKNIEKLKIKKYFKKC 167


>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
          Length = 405

 Score = 26.7 bits (59), Expect = 8.7
 Identities = 18/56 (32%), Positives = 25/56 (44%), Gaps = 7/56 (12%)

Query: 121 LYLNLY--VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           L LN Y    R +AL  E    K      KK+K  KK+K+  K++    K   K +
Sbjct: 331 LKLNPYAKTARRMALLAEAARVKA-----KKEKLAKKRKQVSKEEAAAIKAAGKAW 381


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 26.9 bits (59), Expect = 8.8
 Identities = 16/64 (25%), Positives = 27/64 (42%), Gaps = 7/64 (10%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
           P+P       KE K+     K++ +K + + KK  ++  K K  K F        K   K
Sbjct: 261 PKPADTSS-PKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAF------DLKQESK 313

Query: 188 SREQ 191
           + E+
Sbjct: 314 ASEK 317


>gnl|CDD|223363 COG0286, HsdM, Type I restriction-modification system
           methyltransferase subunit [Defense mechanisms].
          Length = 489

 Score = 26.6 bits (59), Expect = 8.8
 Identities = 9/46 (19%), Positives = 19/46 (41%)

Query: 123 LNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
            NL   R VA E E+        +   +   +  + +K+ ++  K+
Sbjct: 438 YNLNPGRYVAEEEEEDIDLDELLQELSEIDAELAELEKELEEILKE 483


>gnl|CDD|236078 PRK07726, PRK07726, DNA topoisomerase III; Provisional.
          Length = 658

 Score = 26.7 bits (60), Expect = 8.9
 Identities = 11/59 (18%), Positives = 15/59 (25%), Gaps = 13/59 (22%)

Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKK-------------KKKKKKKKKKKKKKKKTKK 173
                 +     K  +  K   K K K             K+K   KK   +     KK
Sbjct: 600 SEKYKFDNLTGPKCPDCGKPMLKVKGKNGKMLVCQDRECGKRKNVSKKTNARCPNCKKK 658


>gnl|CDD|235501 PRK05559, PRK05559, DNA topoisomerase IV subunit B; Reviewed.
          Length = 631

 Score = 26.6 bits (60), Expect = 8.9
 Identities = 12/41 (29%), Positives = 19/41 (46%), Gaps = 6/41 (14%)

Query: 143 NKNKN------KKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
           N+N        +K  K  + + +  KK K+KKK +   LP 
Sbjct: 363 NQNPELAEKLAEKAIKAAQARLRAAKKVKRKKKTSGPALPG 403


>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
           TopJ.  This model describes TopJ (MG_200, CbpA), a DnaJ
           homolog and probable assembly protein of the Mycoplasma
           terminal organelle. The terminal organelle is involved
           in both cytadherence and gliding motility [Cellular
           processes, Chemotaxis and motility].
          Length = 871

 Score = 26.7 bits (58), Expect = 8.9
 Identities = 12/25 (48%), Positives = 15/25 (60%)

Query: 145 NKNKKKKKKKKKKKKKKKKKKKKKK 169
           +   KKKKKKKKK     K K++ K
Sbjct: 104 DTPSKKKKKKKKKGWFWAKSKQESK 128


>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 605

 Score = 26.7 bits (59), Expect = 9.0
 Identities = 12/40 (30%), Positives = 22/40 (55%)

Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
           + +    KK  N+N K++  KK++  +K+  KK +K  K 
Sbjct: 413 KTKIFYHKKKINQNNKEQDIKKEELLEKEFVKKSEKIPKN 452


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 26.6 bits (59), Expect = 9.0
 Identities = 24/97 (24%), Positives = 37/97 (38%), Gaps = 22/97 (22%)

Query: 93  EALLQLPRGRLVFLEKLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKK 152
           E LL+  +  L  L+KL+ L      D L  +  +      +        N NKN K+++
Sbjct: 189 EGLLER-KNELPHLKKLIIL------DTLIKSKEININKEEKNNGSNVNNNGNKNNKEEQ 241

Query: 153 KKKK---------------KKKKKKKKKKKKKKTKKF 174
           K                   K+K +K K  K+K KK 
Sbjct: 242 KGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKL 278


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 26.5 bits (59), Expect = 9.1
 Identities = 7/25 (28%), Positives = 15/25 (60%)

Query: 147 NKKKKKKKKKKKKKKKKKKKKKKKT 171
                  +KKK++KK+++K+ K+  
Sbjct: 175 PGHSSDSEKKKQRKKQRRKRSKELR 199


>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
           outer membrane].
          Length = 170

 Score = 26.2 bits (58), Expect = 9.6
 Identities = 11/55 (20%), Positives = 24/55 (43%)

Query: 121 LYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
           ++      + V+ + E + KK+ K   K +K+ K K+ K +   K +    +   
Sbjct: 35  IFQESPQAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKA 89


>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 753

 Score = 26.5 bits (59), Expect = 9.7
 Identities = 14/50 (28%), Positives = 17/50 (34%), Gaps = 5/50 (10%)

Query: 130 PVALEGEKKE-----KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
           PV  E  K E     K   K K        +   K KK K   + K +K 
Sbjct: 564 PVPAEEAKVEAVDEVKFLKKKKGILDAGAFESTLKDKKNKVLPEAKKRKL 613


>gnl|CDD|214368 CHL00117, rpoC2, RNA polymerase beta'' subunit; Reviewed.
          Length = 1364

 Score = 26.8 bits (60), Expect = 9.8
 Identities = 8/36 (22%), Positives = 15/36 (41%), Gaps = 6/36 (16%)

Query: 149 KKKKKKKKKKKK------KKKKKKKKKKTKKFLPPS 178
           +KKKK+ + K          +  K  + +   +PP 
Sbjct: 707 EKKKKRIELKIFSGDIHFPGETDKISRHSGILIPPG 742


>gnl|CDD|153294 cd07610, FCH_F-BAR, The Extended FES-CIP4 Homology (FCH) or F-BAR
           (FCH and Bin/Amphiphysin/Rvs) domain, a dimerization
           module that binds and bends membranes.  F-BAR domains
           are dimerization modules that bind and bend membranes
           and are found in proteins involved in membrane dynamics
           and actin reorganization. F-BAR domain containing
           proteins, also known as Pombe Cdc15 homology (PCH)
           family proteins, include Fes and Fer tyrosine kinases,
           PACSINs/Syndapins, FCHO, PSTPIP, CIP4-like proteins and
           srGAPs. Many members also contain an SH3 domain and play
           roles in endocytosis. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
           These tubules have diameters larger than those observed
           with N-BARs. The F-BAR domains of some members such as
           NOSTRIN and Rgd1 are important for the subcellular
           localization of the protein.
          Length = 191

 Score = 26.1 bits (58), Expect = 9.8
 Identities = 11/43 (25%), Positives = 25/43 (58%), Gaps = 2/43 (4%)

Query: 136 EKKEKKKNKNKNKKKKKKKKK--KKKKKKKKKKKKKKTKKFLP 176
           +++ +KK   + +K KKK ++   K  KK  ++ +++ +K  P
Sbjct: 95  KEQARKKELAEGEKLKKKLQELWAKLAKKADEEYREQVEKLNP 137


>gnl|CDD|215059 PLN00111, PLN00111, accumulation of photosystem one; Provisional.
          Length = 399

 Score = 26.5 bits (59), Expect = 9.9
 Identities = 9/40 (22%), Positives = 20/40 (50%), Gaps = 6/40 (15%)

Query: 146 KNKKKKKKKK------KKKKKKKKKKKKKKKTKKFLPPSD 179
              K++KK         +++ ++KKK +K + ++ L P  
Sbjct: 46  PYSKREKKPYPRPMKLLRREAREKKKLRKLQPERPLDPPK 85


>gnl|CDD|227818 COG5531, COG5531, SWIB-domain-containing proteins implicated in
           chromatin remodeling [Chromatin structure and dynamics].
          Length = 237

 Score = 26.3 bits (58), Expect = 9.9
 Identities = 10/44 (22%), Positives = 16/44 (36%), Gaps = 1/44 (2%)

Query: 134 EGEKKEKKKNKN-KNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
           + E      +K  +N   + K+  KKKK     K     +K   
Sbjct: 84  KEEDSSLPSSKETENGDTEGKETDKKKKSSTISKNSPSGEKVKL 127


>gnl|CDD|216850 pfam02020, W2, eIF4-gamma/eIF5/eIF2-epsilon.  This domain of
           unknown function is found at the C-terminus of several
           translation initiation factors.
          Length = 75

 Score = 24.9 bits (55), Expect = 10.0
 Identities = 8/23 (34%), Positives = 13/23 (56%)

Query: 153 KKKKKKKKKKKKKKKKKKTKKFL 175
            +K KK   +  KK +K+ K F+
Sbjct: 47  YEKSKKVDGEGSKKVRKQAKPFV 69


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.134    0.399 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,965,771
Number of extensions: 958378
Number of successful extensions: 19548
Number of sequences better than 10.0: 1
Number of HSP's gapped: 11567
Number of HSP's successfully gapped: 2205
Length of query: 191
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 100
Effective length of database: 6,901,388
Effective search space: 690138800
Effective search space used: 690138800
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 56 (25.5 bits)