RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy680
         (122 letters)



>gnl|CDD|214500 smart00065, GAF, Domain present in phytochromes and cGMP-specific
          phosphodiesterases.  Mutations within these domains in
          PDE6B result in autosomal recessive inheritance of
          retinitis pigmentosa.
          Length = 149

 Score = 43.5 bits (103), Expect = 2e-06
 Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 8/79 (10%)

Query: 12 DLKSLSKKTVDNVSVLLDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEV 71
          DL+ L +  ++ +  LL AD   ++ V+    G   LV    D  +   L       +  
Sbjct: 1  DLEELLQTILEELRQLLGADRVLIYLVDENDRG--ELVLVAADGLTLPTL------GIRF 52

Query: 72 PWGVGILGHVAVTGETVNL 90
          P   G+ G VA TG  +N+
Sbjct: 53 PLDEGLAGRVAETGRPLNI 71


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 40.3 bits (95), Expect = 8e-05
 Identities = 13/32 (40%), Positives = 22/32 (68%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           L K+ EKK+K+K+K K KK+ +  K   +R++
Sbjct: 396 LAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRK 427



 Score = 37.6 bits (88), Expect = 6e-04
 Identities = 13/43 (30%), Positives = 21/43 (48%)

Query: 80  HVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
             A + +      KK   K+ +KK+K+K+K K KK   + K  
Sbjct: 380 TKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNI 422



 Score = 37.6 bits (88), Expect = 7e-04
 Identities = 11/31 (35%), Positives = 18/31 (58%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
              K  +KK+K+K+K K KK+ +  +   KR
Sbjct: 395 VLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425



 Score = 34.5 bits (80), Expect = 0.008
 Identities = 8/26 (30%), Positives = 14/26 (53%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAE 116
           +K+K K KK+ +  K   K++K    
Sbjct: 406 EKEKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 32.2 bits (74), Expect = 0.047
 Identities = 7/27 (25%), Positives = 13/27 (48%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           +K + KK+ +  K   K++K      E
Sbjct: 408 EKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 31.8 bits (73), Expect = 0.061
 Identities = 8/26 (30%), Positives = 14/26 (53%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAE 116
           +K K KK+ +  K   K++K    +E
Sbjct: 408 EKPKVKKRHRDTKNIGKRRKPSGTSE 433



 Score = 30.3 bits (69), Expect = 0.19
 Identities = 6/26 (23%), Positives = 12/26 (46%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAE 116
           + K +K+ +  K   K++K      E
Sbjct: 409 KPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 28.0 bits (63), Expect = 1.3
 Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 3/33 (9%)

Query: 92  KKKEKKKKKKKKKKKK---KKKKKKKAEIERKR 121
           K K   +KK  K  KK   K+ +KK+ E E+ +
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPK 411



 Score = 25.7 bits (57), Expect = 7.7
 Identities = 9/32 (28%), Positives = 17/32 (53%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           L+ K +   +KK  K  KK   K+  + E+++
Sbjct: 376 LRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEK 407


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 38.4 bits (90), Expect = 3e-04
 Identities = 16/30 (53%), Positives = 25/30 (83%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           KK+E+KK +K+KKKKK+KKK KK + + ++
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 38.0 bits (89), Expect = 5e-04
 Identities = 16/30 (53%), Positives = 23/30 (76%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           KKK ++KK +K+KKKKK+KKK K   ++ R
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 35.7 bits (83), Expect = 0.003
 Identities = 14/30 (46%), Positives = 24/30 (80%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            KK++++KK +K+KKKKK+KKK  + ++K 
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRKKKG 412



 Score = 35.7 bits (83), Expect = 0.003
 Identities = 16/24 (66%), Positives = 22/24 (91%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKK 114
           ++KK +K+KKKKK+KKK KK+KKK
Sbjct: 388 EEKKPQKRKKKKKRKKKGKKRKKK 411



 Score = 34.9 bits (81), Expect = 0.006
 Identities = 15/24 (62%), Positives = 20/24 (83%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKK 114
           + +K KKKKK+KKK KK+KKK +K
Sbjct: 391 KPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 34.9 bits (81), Expect = 0.006
 Identities = 16/24 (66%), Positives = 21/24 (87%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKK 114
           +KK +K+KKKKK+KKK KK+KKK 
Sbjct: 389 EKKPQKRKKKKKRKKKGKKRKKKG 412



 Score = 34.2 bits (79), Expect = 0.010
 Identities = 14/24 (58%), Positives = 21/24 (87%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKK 114
           +K +++KKKKK+KKK KK+KKK +
Sbjct: 390 KKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 33.0 bits (76), Expect = 0.027
 Identities = 14/24 (58%), Positives = 22/24 (91%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKK 114
           +++K+ +K+KKKKK+KKK KK+KK
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKK 410



 Score = 32.6 bits (75), Expect = 0.034
 Identities = 14/31 (45%), Positives = 25/31 (80%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           +  K+K+++KK +K+KKKKK+KKK +  +K+
Sbjct: 381 KPPKKKREEKKPQKRKKKKKRKKKGKKRKKK 411



 Score = 28.8 bits (65), Expect = 0.72
 Identities = 12/23 (52%), Positives = 18/23 (78%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKK 114
           K+K  K  KKK+++KK +K+KKK
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKK 398



 Score = 28.8 bits (65), Expect = 0.76
 Identities = 12/30 (40%), Positives = 21/30 (70%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           +K  K  KKK+++KK +K+KKKK   ++ +
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKKKKRKKKGK 406



 Score = 28.4 bits (64), Expect = 0.94
 Identities = 13/30 (43%), Positives = 22/30 (73%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           K  +  KKK+++KK +K+KKKKK + + K+
Sbjct: 378 KYPKPPKKKREEKKPQKRKKKKKRKKKGKK 407


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 37.9 bits (89), Expect = 3e-04
 Identities = 14/31 (45%), Positives = 18/31 (58%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            K  EK +KK +K K K +KKK K E  + R
Sbjct: 68  LKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 33.3 bits (77), Expect = 0.011
 Identities = 12/30 (40%), Positives = 18/30 (60%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           + K  +K +KK +K K K +KKK  + E K
Sbjct: 67  ELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96



 Score = 28.3 bits (64), Expect = 0.69
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 94  KEKKKKKKKKKKKKKKKKKKKAEIERK 120
           K++ K  +K +KK +K K K  + + K
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAK 91



 Score = 27.5 bits (62), Expect = 1.5
 Identities = 11/35 (31%), Positives = 20/35 (57%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           E++      +K+ K  +K +KK +K K KAE ++ 
Sbjct: 56  ESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKA 90



 Score = 27.1 bits (61), Expect = 1.8
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKK 114
            +K ++K +K K K +KKK KK++ 
Sbjct: 71  WEKAEKKAEKAKAKAEKKKAKKEEP 95


>gnl|CDD|164595 MTH00022, CYTB, cytochrome b; Validated.
          Length = 379

 Score = 36.2 bits (84), Expect = 0.002
 Identities = 23/76 (30%), Positives = 32/76 (42%), Gaps = 26/76 (34%)

Query: 3   IVRDIARDLDLKSLSKKTVDNVSVLLDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLV 62
           I+RD+     L+ L             A+GASLFF+        CL      +H G GL 
Sbjct: 68  IMRDVNYGFLLRYLH------------ANGASLFFL--------CLY-----IHIGRGLY 102

Query: 63  YSTADEVEVPWGVGIL 78
           Y    +  V W VG++
Sbjct: 103 YGGYLKFHV-WNVGVV 117


>gnl|CDD|177162 MTH00100, CYTB, cytochrome b; Provisional.
          Length = 379

 Score = 35.0 bits (81), Expect = 0.005
 Identities = 19/51 (37%), Positives = 26/51 (50%), Gaps = 14/51 (27%)

Query: 28  LDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
           L A+GAS+FF+        CL      LH G GL Y +   +E  W +GI+
Sbjct: 82  LHANGASMFFI--------CLF-----LHVGRGLYYGSYLFLET-WNIGII 118


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 34.8 bits (81), Expect = 0.006
 Identities = 12/32 (37%), Positives = 17/32 (53%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           L  KKE K   K +KKK+K++ K      + R
Sbjct: 60  LLDKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 33.3 bits (77), Expect = 0.021
 Identities = 12/33 (36%), Positives = 21/33 (63%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
           L KK+ K   K +KKK+K++ K  KA+ + + +
Sbjct: 61  LDKKELKAWHKAQKKKEKQEAKAAKAKSKPRLF 93



 Score = 29.8 bits (68), Expect = 0.27
 Identities = 10/36 (27%), Positives = 19/36 (52%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           E +      +K+ K   K +KKK+K++ KA   + +
Sbjct: 54  EELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89


>gnl|CDD|177109 MTH00034, CYTB, cytochrome b; Validated.
          Length = 379

 Score = 34.7 bits (80), Expect = 0.006
 Identities = 17/49 (34%), Positives = 24/49 (48%), Gaps = 14/49 (28%)

Query: 30  ADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
           A+GASLFF+        CL       H G GL Y +   +E  W +G++
Sbjct: 84  ANGASLFFI--------CLY-----FHIGRGLYYGSYVNIET-WNIGVI 118


>gnl|CDD|177239 MTH00191, CYTB, cytochrome b; Provisional.
          Length = 365

 Score = 34.1 bits (79), Expect = 0.009
 Identities = 17/49 (34%), Positives = 23/49 (46%), Gaps = 14/49 (28%)

Query: 30  ADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
           A+GAS FF+        C+      LH G GL Y +    E  W VG++
Sbjct: 81  ANGASFFFI--------CIY-----LHIGRGLYYGSYLNKET-WNVGVI 115


>gnl|CDD|177102 MTH00016, CYTB, cytochrome b; Validated.
          Length = 378

 Score = 33.9 bits (78), Expect = 0.013
 Identities = 17/51 (33%), Positives = 24/51 (47%), Gaps = 14/51 (27%)

Query: 27  LLDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGI 77
            L A+GAS FF+        CL      LH G G+ Y +   +E  W +G+
Sbjct: 82  NLHANGASFFFI--------CLY-----LHIGRGIYYGSYFLMET-WNIGV 118


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 32.9 bits (75), Expect = 0.014
 Identities = 15/21 (71%), Positives = 15/21 (71%)

Query: 94  KEKKKKKKKKKKKKKKKKKKK 114
           K KKKKKK KK KK KK  KK
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKK 119



 Score = 32.5 bits (74), Expect = 0.019
 Identities = 14/23 (60%), Positives = 16/23 (69%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKK 113
           +K K+KKKK KK KK KK  KK 
Sbjct: 98  EKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 31.4 bits (71), Expect = 0.046
 Identities = 14/22 (63%), Positives = 15/22 (68%)

Query: 95  EKKKKKKKKKKKKKKKKKKKAE 116
           EK KKKKKK KK KK KK   +
Sbjct: 98  EKPKKKKKKSKKTKKPKKSSKK 119



 Score = 31.4 bits (71), Expect = 0.047
 Identities = 14/22 (63%), Positives = 15/22 (68%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKK 114
           K +KKKKK KK KK KK  KK 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 30.6 bits (69), Expect = 0.081
 Identities = 14/28 (50%), Positives = 15/28 (53%)

Query: 84  TGETVNLQKKKEKKKKKKKKKKKKKKKK 111
             E     KKK+KK KK KK KK  KK 
Sbjct: 93  PPEPTEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 30.2 bits (68), Expect = 0.11
 Identities = 13/20 (65%), Positives = 14/20 (70%)

Query: 97  KKKKKKKKKKKKKKKKKKAE 116
           KKKKKK KK KK KK  K +
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120



 Score = 29.5 bits (66), Expect = 0.26
 Identities = 13/22 (59%), Positives = 16/22 (72%)

Query: 97  KKKKKKKKKKKKKKKKKKAEIE 118
           K KKKKKK KK KK KK ++ +
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 27.9 bits (62), Expect = 0.87
 Identities = 13/37 (35%), Positives = 17/37 (45%)

Query: 84  TGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
              +       E  +K KKKKKK KK KK K   ++ 
Sbjct: 84  GEPSHPDDHPPEPTEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 25.6 bits (56), Expect = 6.1
 Identities = 12/21 (57%), Positives = 14/21 (66%)

Query: 101 KKKKKKKKKKKKKKAEIERKR 121
           K KKKKKK KK KK +   K+
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKK 119


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 33.5 bits (77), Expect = 0.014
 Identities = 13/30 (43%), Positives = 16/30 (53%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           K KE+ KKKK K+ K  K  KK      K+
Sbjct: 253 KSKEEIKKKKPKESKGVKALKKVVAKGMKK 282



 Score = 30.4 bits (69), Expect = 0.16
 Identities = 11/29 (37%), Positives = 14/29 (48%)

Query: 84  TGETVNLQKKKEKKKKKKKKKKKKKKKKK 112
             E     K++ KKKK K+ K  K  KK 
Sbjct: 247 RAEKKRKSKEEIKKKKPKESKGVKALKKV 275



 Score = 29.6 bits (67), Expect = 0.31
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
                ++K +++ KKKK K+ K  K  KK
Sbjct: 246 SRAEKKRKSKEEIKKKKPKESKGVKALKK 274



 Score = 29.2 bits (66), Expect = 0.43
 Identities = 11/26 (42%), Positives = 14/26 (53%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKK 114
            ++KKK K+ K  K  KK   K  KK
Sbjct: 257 EIKKKKPKESKGVKALKKVVAKGMKK 282



 Score = 28.8 bits (65), Expect = 0.57
 Identities = 10/25 (40%), Positives = 12/25 (48%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKA 115
            KKK+ K+ K  K  KK   K  K 
Sbjct: 258 IKKKKPKESKGVKALKKVVAKGMKK 282



 Score = 28.1 bits (63), Expect = 1.1
 Identities = 10/24 (41%), Positives = 15/24 (62%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKA 115
           KK++ K++ KKKK K+ K  K   
Sbjct: 250 KKRKSKEEIKKKKPKESKGVKALK 273



 Score = 27.3 bits (61), Expect = 2.2
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAEI 117
           +K++K K++ KKKK K+ K  KA  
Sbjct: 249 EKKRKSKEEIKKKKPKESKGVKALK 273



 Score = 25.4 bits (56), Expect = 9.2
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAE 116
            + + +KK+K K++ KKKK K+ K  
Sbjct: 244 AESRAEKKRKSKEEIKKKKPKESKGV 269


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 32.9 bits (75), Expect = 0.020
 Identities = 15/25 (60%), Positives = 21/25 (84%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKA 115
           +KK E  K++KKKKK+KKKKKK+ +
Sbjct: 146 KKKHEDDKERKKKKKEKKKKKKRHS 170



 Score = 32.9 bits (75), Expect = 0.023
 Identities = 14/25 (56%), Positives = 20/25 (80%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAE 116
           KK +KKK +  K++KKKKK+KKK +
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKK 166



 Score = 32.5 bits (74), Expect = 0.028
 Identities = 14/28 (50%), Positives = 21/28 (75%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           +K ++ K++KKKKK+KKKKKK+   E  
Sbjct: 147 KKHEDDKERKKKKKEKKKKKKRHSPEHP 174



 Score = 32.5 bits (74), Expect = 0.031
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           E +   +KK KKKK +  K++KKKKK+KK + +R  
Sbjct: 135 EGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHS 170



 Score = 32.1 bits (73), Expect = 0.033
 Identities = 14/31 (45%), Positives = 21/31 (67%)

Query: 84  TGETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
             +    + KK+K +  K++KKKKK+KKKKK
Sbjct: 136 GLKGHEKKHKKKKHEDDKERKKKKKEKKKKK 166



 Score = 31.3 bits (71), Expect = 0.060
 Identities = 16/30 (53%), Positives = 22/30 (73%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           +KKK +  K++KKKKK+KKKKKK+   E  
Sbjct: 145 KKKKHEDDKERKKKKKEKKKKKKRHSPEHP 174


>gnl|CDD|164624 MTH00053, CYTB, cytochrome b; Provisional.
          Length = 381

 Score = 32.8 bits (75), Expect = 0.026
 Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 14/51 (27%)

Query: 28  LDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
           L A+GAS+FF+        C    V+  H G G+ Y +  ++ V W VG+L
Sbjct: 83  LHANGASMFFL--------C----VY-FHIGRGIYYGSYTKIIV-WNVGVL 119


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 32.6 bits (75), Expect = 0.032
 Identities = 15/39 (38%), Positives = 24/39 (61%), Gaps = 2/39 (5%)

Query: 85  GETVNLQKKKEKKKKKKKKKKKKKKKK--KKKAEIERKR 121
           GE  NL   + KK +KK++K +KK +K   +KA  ++K 
Sbjct: 401 GENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKA 439



 Score = 32.6 bits (75), Expect = 0.039
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
             Q+K EKK +K++ +K   KKK + A  + K 
Sbjct: 416 KKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKG 448



 Score = 31.4 bits (72), Expect = 0.072
 Identities = 10/31 (32%), Positives = 22/31 (70%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           ++KK +KK++K +KK +K++ +K A  ++  
Sbjct: 410 ERKKLRKKQRKAEKKAEKEEAEKAAAKKKAE 440



 Score = 29.5 bits (67), Expect = 0.42
 Identities = 13/27 (48%), Positives = 20/27 (74%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
           L+KK+ K +KK +K++ +K   KKKAE
Sbjct: 414 LRKKQRKAEKKAEKEEAEKAAAKKKAE 440



 Score = 29.1 bits (66), Expect = 0.47
 Identities = 9/32 (28%), Positives = 19/32 (59%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            +K+++ +KK +K++ +K   KKK     +K 
Sbjct: 415 RKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446



 Score = 27.6 bits (62), Expect = 1.8
 Identities = 11/29 (37%), Positives = 16/29 (55%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           +K+E +K   KKK +   KK K  + E K
Sbjct: 426 EKEEAEKAAAKKKAEAAAKKAKGPDGETK 454



 Score = 27.2 bits (61), Expect = 2.2
 Identities = 9/25 (36%), Positives = 13/25 (52%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAE 116
            +KE+ +K   KKK +   KK K  
Sbjct: 425 AEKEEAEKAAAKKKAEAAAKKAKGP 449



 Score = 26.4 bits (59), Expect = 4.3
 Identities = 10/40 (25%), Positives = 20/40 (50%)

Query: 82  AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
              GE    +       ++KK +KK++K +KK  + E ++
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEK 432



 Score = 26.4 bits (59), Expect = 4.4
 Identities = 9/31 (29%), Positives = 15/31 (48%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
            +K++ +K   KKK +   KK K      +K
Sbjct: 425 AEKEEAEKAAAKKKAEAAAKKAKGPDGETKK 455


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 32.7 bits (75), Expect = 0.036
 Identities = 12/33 (36%), Positives = 22/33 (66%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
             +K+  K++K+ +K KK+++KKKK+ E   K 
Sbjct: 557 KEEKEALKEQKRLRKLKKQEEKKKKELEKLEKA 589



 Score = 32.3 bits (74), Expect = 0.045
 Identities = 10/33 (30%), Positives = 24/33 (72%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
            ++K+EK+  K++K+ +K KK+++K + E ++ 
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKL 586



 Score = 31.5 bits (72), Expect = 0.073
 Identities = 10/32 (31%), Positives = 23/32 (71%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
           ++K+  K++K+ +K KK+++KKKK   + ++ 
Sbjct: 558 EEKEALKEQKRLRKLKKQEEKKKKELEKLEKA 589



 Score = 31.5 bits (72), Expect = 0.076
 Identities = 10/30 (33%), Positives = 23/30 (76%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           + ++EK++K+  K++K+ +K KK+ E ++K
Sbjct: 552 ELQREKEEKEALKEQKRLRKLKKQEEKKKK 581



 Score = 29.6 bits (67), Expect = 0.40
 Identities = 12/34 (35%), Positives = 24/34 (70%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
             ++K+ +K KK+++KKKK+ +K +KA+I    +
Sbjct: 563 LKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEF 596



 Score = 27.3 bits (61), Expect = 2.6
 Identities = 11/37 (29%), Positives = 24/37 (64%), Gaps = 2/37 (5%)

Query: 87  TVNLQKKKEKKKKKKKKKKKKKKK--KKKKAEIERKR 121
              L  K+E +++K++K+  K++K  +K K + E+K+
Sbjct: 544 VWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKK 580


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 32.0 bits (72), Expect = 0.051
 Identities = 16/35 (45%), Positives = 26/35 (74%)

Query: 85  GETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIER 119
           G+   ++KK +K KKK+KK+K+K++ K KK E+E 
Sbjct: 190 GDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEG 224



 Score = 29.3 bits (65), Expect = 0.52
 Identities = 10/34 (29%), Positives = 23/34 (67%)

Query: 88  VNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           V L++++  +++ +K K++KKK++K+K    R  
Sbjct: 82  VKLEEERRHRQRLEKDKREKKKREKEKRGRRRHH 115



 Score = 28.9 bits (64), Expect = 0.63
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           KK +KK+KK+K+K++ K KKK+    +
Sbjct: 200 KKPKKKEKKEKEKERDKDKKKEVEGFK 226



 Score = 28.5 bits (63), Expect = 0.96
 Identities = 12/25 (48%), Positives = 21/25 (84%)

Query: 97  KKKKKKKKKKKKKKKKKKAEIERKR 121
           +KK KK KKK+KK+K+K+ + ++K+
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKK 220



 Score = 28.5 bits (63), Expect = 1.0
 Identities = 12/30 (40%), Positives = 22/30 (73%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
           +K+ KK KKK+KK+K+K++ K  + E + +
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGF 225



 Score = 27.0 bits (59), Expect = 2.8
 Identities = 10/26 (38%), Positives = 20/26 (76%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAE 116
           + KK++KK+K+K++ K KKK+ +  +
Sbjct: 201 KPKKKEKKEKEKERDKDKKKEVEGFK 226



 Score = 27.0 bits (59), Expect = 3.3
 Identities = 16/28 (57%), Positives = 21/28 (75%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIER 119
           KK  K KKKK++K+K++KKKKKK    R
Sbjct: 275 KKSPKHKKKKQRKEKEEKKKKKKHHHHR 302



 Score = 26.6 bits (58), Expect = 4.0
 Identities = 9/29 (31%), Positives = 19/29 (65%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           L+K K +KKK++K+K+ +++      E +
Sbjct: 94  LEKDKREKKKREKEKRGRRRHHSLGTESD 122



 Score = 26.6 bits (58), Expect = 4.1
 Identities = 9/30 (30%), Positives = 24/30 (80%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
           ++E++ +++ +K K++KKK++K +  R+R+
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRH 114



 Score = 26.6 bits (58), Expect = 4.4
 Identities = 9/36 (25%), Positives = 25/36 (69%)

Query: 80  HVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
           +V +  E  + Q+ ++ K++KKK++K+K+ +++  +
Sbjct: 81  YVKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHS 116



 Score = 26.6 bits (58), Expect = 4.5
 Identities = 14/29 (48%), Positives = 20/29 (68%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
           ET    K K+KK++K+K++KKKKKK    
Sbjct: 273 ETKKSPKHKKKKQRKEKEEKKKKKKHHHH 301



 Score = 26.6 bits (58), Expect = 4.5
 Identities = 11/38 (28%), Positives = 26/38 (68%)

Query: 83  VTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           V G  ++ Q  K +++++ +++ +K K++KKK E E++
Sbjct: 72  VPGLPMSDQYVKLEEERRHRQRLEKDKREKKKREKEKR 109



 Score = 25.8 bits (56), Expect = 6.6
 Identities = 20/76 (26%), Positives = 32/76 (42%), Gaps = 4/76 (5%)

Query: 50  SKVFDLHSGAGLVYSTADEVEVPWGVGILGHVAVTGETVNLQKKKEKKKK----KKKKKK 105
           S +  L        S A+  E      + G    +        + E+ KK    KKKK++
Sbjct: 227 SLLLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQR 286

Query: 106 KKKKKKKKKAEIERKR 121
           K+K++KKKK +    R
Sbjct: 287 KEKEEKKKKKKHHHHR 302



 Score = 25.4 bits (55), Expect = 9.6
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIER 119
           +KKK++K+K++KKKKKK    +       
Sbjct: 281 KKKKQRKEKEEKKKKKKHHHHRCHHSDGG 309


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 32.2 bits (74), Expect = 0.052
 Identities = 12/35 (34%), Positives = 23/35 (65%)

Query: 87  TVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           T  ++K  EK +KK++++KK+KKKK    + + + 
Sbjct: 409 TKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEE 443



 Score = 31.8 bits (73), Expect = 0.054
 Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 5/56 (8%)

Query: 65  TADEVEVPWGVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           T +E+E      + G    T +   + +K EKK++++KK+KKKK    KK E E +
Sbjct: 395 TEEEIEF-----LTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEE 445



 Score = 30.3 bits (69), Expect = 0.22
 Identities = 12/30 (40%), Positives = 23/30 (76%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           +KKEKKKK    KKK+++++++K + E ++
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455



 Score = 29.9 bits (68), Expect = 0.28
 Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           +KK +++KK+KKKK    KKK++ E E K 
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKE 449



 Score = 29.9 bits (68), Expect = 0.31
 Identities = 13/30 (43%), Positives = 23/30 (76%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           +K++E+KK+KKKK    KKK++++ E + K
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEK 450



 Score = 29.9 bits (68), Expect = 0.33
 Identities = 11/30 (36%), Positives = 20/30 (66%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           KK  KK KK  +K +KK++++KK + ++  
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAF 435



 Score = 29.5 bits (67), Expect = 0.36
 Identities = 12/31 (38%), Positives = 21/31 (67%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
            KK  +K +KK++++KK+KKKK    ++K  
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEE 442



 Score = 29.5 bits (67), Expect = 0.44
 Identities = 10/32 (31%), Positives = 21/32 (65%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           L   K+  KK KK  +K +KK++++ + ++K+
Sbjct: 402 LTGSKKATKKIKKIVEKAEKKREEEKKEKKKK 433



 Score = 28.0 bits (63), Expect = 1.3
 Identities = 11/29 (37%), Positives = 23/29 (79%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           K+K+KK    KKK+++++++K+K E E++
Sbjct: 428 KEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456



 Score = 27.6 bits (62), Expect = 2.0
 Identities = 8/29 (27%), Positives = 22/29 (75%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
            +++++K+KKKK    KKK++++++ + +
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEK 450



 Score = 27.2 bits (61), Expect = 2.7
 Identities = 9/32 (28%), Positives = 23/32 (71%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            +++K++KKKK    KKK+++++++ E + + 
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454



 Score = 26.8 bits (60), Expect = 3.7
 Identities = 11/35 (31%), Positives = 24/35 (68%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           +    +KK++KKK    KKK+++++++K+ + E K
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455



 Score = 26.4 bits (59), Expect = 5.0
 Identities = 11/35 (31%), Positives = 24/35 (68%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           E    +KK    KKK+++++++K+KK+++ E E +
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE 460



 Score = 26.0 bits (58), Expect = 6.3
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 94  KEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
             KK  KK KK  +K +KK++ E + K+ 
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKK 432


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 31.7 bits (72), Expect = 0.057
 Identities = 15/30 (50%), Positives = 19/30 (63%)

Query: 87  TVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
           T  ++KKK+ K KK K K KK + KKK  E
Sbjct: 86  TTAVEKKKKGKSKKNKLKGKKDEDKKKARE 115



 Score = 29.8 bits (67), Expect = 0.26
 Identities = 13/25 (52%), Positives = 15/25 (60%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAE 116
           KKK K KK K K KK + KKK +  
Sbjct: 92  KKKGKSKKNKLKGKKDEDKKKAREG 116


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 32.1 bits (74), Expect = 0.058
 Identities = 8/29 (27%), Positives = 13/29 (44%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
            K    KK   K  +K K++   +E + K
Sbjct: 829 AKDPAGKKAAVKFSRKTKQQYVASEKDGK 857



 Score = 30.5 bits (70), Expect = 0.17
 Identities = 8/24 (33%), Positives = 12/24 (50%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKA 115
           KK   K  +K K++    +K  KA
Sbjct: 835 KKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 30.5 bits (70), Expect = 0.20
 Identities = 6/25 (24%), Positives = 9/25 (36%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKA 115
            K    KK   K  +K K++     
Sbjct: 829 AKDPAGKKAAVKFSRKTKQQYVASE 853



 Score = 30.1 bits (69), Expect = 0.26
 Identities = 6/27 (22%), Positives = 10/27 (37%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
           L+   +   K    KK   K  +K  +
Sbjct: 821 LRYLADAPAKDPAGKKAAVKFSRKTKQ 847



 Score = 29.4 bits (67), Expect = 0.39
 Identities = 7/26 (26%), Positives = 10/26 (38%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKA 115
           L     K    KK   K  +K K++ 
Sbjct: 824 LADAPAKDPAGKKAAVKFSRKTKQQY 849



 Score = 29.0 bits (66), Expect = 0.58
 Identities = 5/26 (19%), Positives = 11/26 (42%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAE 116
           +    KK   K  +K K++    + +
Sbjct: 830 KDPAGKKAAVKFSRKTKQQYVASEKD 855



 Score = 29.0 bits (66), Expect = 0.65
 Identities = 6/24 (25%), Positives = 10/24 (41%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKA 115
           K   K  +K K++    +K  K  
Sbjct: 836 KAAVKFSRKTKQQYVASEKDGKAT 859



 Score = 28.6 bits (65), Expect = 0.84
 Identities = 6/29 (20%), Positives = 11/29 (37%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
              +    KK   K  +K K++    E+ 
Sbjct: 827 APAKDPAGKKAAVKFSRKTKQQYVASEKD 855



 Score = 28.6 bits (65), Expect = 0.87
 Identities = 9/31 (29%), Positives = 12/31 (38%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           L +K EK +       K    KK   +  RK
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVKFSRK 844



 Score = 26.7 bits (60), Expect = 4.3
 Identities = 8/30 (26%), Positives = 12/30 (40%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
           E + L  +K +K +       K    KK A
Sbjct: 809 EALELLAEKPEKLRYLADAPAKDPAGKKAA 838



 Score = 25.5 bits (57), Expect = 8.2
 Identities = 4/29 (13%), Positives = 8/29 (27%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
            +       K    KK   K  +   ++ 
Sbjct: 821 LRYLADAPAKDPAGKKAAVKFSRKTKQQY 849


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 31.8 bits (72), Expect = 0.061
 Identities = 14/30 (46%), Positives = 23/30 (76%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           +++ E+KKKK++K K+K+ KK K A+ E K
Sbjct: 15  EEELERKKKKEEKAKEKELKKLKAAQKEAK 44



 Score = 31.4 bits (71), Expect = 0.081
 Identities = 15/37 (40%), Positives = 26/37 (70%)

Query: 84  TGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           T E +  +KKKE+K K+K+ KK K  +K+ KA+++ +
Sbjct: 14  TEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQ 50



 Score = 27.2 bits (60), Expect = 3.1
 Identities = 13/30 (43%), Positives = 22/30 (73%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
           E   L +++ ++KKKK++K K+K+ KK KA
Sbjct: 9   EKKILTEEELERKKKKEEKAKEKELKKLKA 38


>gnl|CDD|214425 MTH00119, CYTB, cytochrome b; Provisional.
          Length = 380

 Score = 31.8 bits (73), Expect = 0.067
 Identities = 17/51 (33%), Positives = 24/51 (47%), Gaps = 14/51 (27%)

Query: 28  LDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
           L A+GAS+FF+        C+      LH G GL Y +    E  W  G++
Sbjct: 83  LHANGASMFFI--------CIY-----LHIGRGLYYGSYLYKET-WNTGVI 119


>gnl|CDD|214440 MTH00156, CYTB, cytochrome b; Provisional.
          Length = 356

 Score = 31.7 bits (73), Expect = 0.072
 Identities = 15/51 (29%), Positives = 22/51 (43%), Gaps = 14/51 (27%)

Query: 28  LDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
           L A+GAS FF+        C+      LH G G+ Y +       W  G++
Sbjct: 72  LHANGASFFFI--------CIY-----LHIGRGIYYGSYKLKHT-WMSGVI 108


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 31.6 bits (72), Expect = 0.073
 Identities = 15/33 (45%), Positives = 26/33 (78%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           +  +  ++K KKKKKK+KKK+++ K+++KA IE
Sbjct: 736 KISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768



 Score = 31.6 bits (72), Expect = 0.082
 Identities = 14/39 (35%), Positives = 22/39 (56%), Gaps = 5/39 (12%)

Query: 89   NLQKKKEK---KKKKKKKKKKKKKKKKKKAEIE--RKRY 122
            N +K  E+   +   KK+K KK+ K   +AE++   KRY
Sbjct: 1552 NQEKNIEEDYAESDIKKRKNKKQYKSNTEAELDFFLKRY 1590



 Score = 31.2 bits (71), Expect = 0.13
 Identities = 13/27 (48%), Positives = 23/27 (85%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEI 117
           +K K+KKKK+KKK+++ K+++K + EI
Sbjct: 743 EKTKKKKKKEKKKEEEYKREEKARIEI 769



 Score = 28.9 bits (65), Expect = 0.70
 Identities = 13/30 (43%), Positives = 21/30 (70%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           K  +  ++K KKKKKK+KKK+++ + E K 
Sbjct: 736 KISDSVEEKTKKKKKKEKKKEEEYKREEKA 765



 Score = 28.5 bits (64), Expect = 0.86
 Identities = 14/25 (56%), Positives = 20/25 (80%)

Query: 95  EKKKKKKKKKKKKKKKKKKKAEIER 119
           E+K KKKKKK+KKK+++ K+ E  R
Sbjct: 742 EEKTKKKKKKEKKKEEEYKREEKAR 766



 Score = 28.1 bits (63), Expect = 1.3
 Identities = 11/31 (35%), Positives = 23/31 (74%)

Query: 83  VTGETVNLQKKKEKKKKKKKKKKKKKKKKKK 113
              ++V  + KK+KKK+KKK+++ K+++K +
Sbjct: 736 KISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 30.9 bits (70), Expect = 0.084
 Identities = 9/31 (29%), Positives = 17/31 (54%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           Q+   + K + K+KKK++  K K  + + K 
Sbjct: 111 QEDPPETKTESKEKKKREVPKPKTEKEKPKT 141



 Score = 26.6 bits (59), Expect = 3.1
 Identities = 8/31 (25%), Positives = 20/31 (64%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           + ++E   + K + K+KKK++  K + E+++
Sbjct: 108 ETEQEDPPETKTESKEKKKREVPKPKTEKEK 138



 Score = 26.3 bits (58), Expect = 4.3
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            + K + K+KKK++  K K +K+K + E K+
Sbjct: 115 PETKTESKEKKKREVPKPKTEKEKPKTEPKK 145



 Score = 25.5 bits (56), Expect = 6.4
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKA 115
           KK+E  K K +K+K K + KK K 
Sbjct: 125 KKREVPKPKTEKEKPKTEPKKPKP 148


>gnl|CDD|177203 MTH00145, CYTB, cytochrome b; Provisional.
          Length = 379

 Score = 31.4 bits (72), Expect = 0.094
 Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 14/51 (27%)

Query: 28  LDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
           L A+GAS FF+        C+      LH G GL Y +       W +G+ 
Sbjct: 83  LHANGASFFFI--------CIY-----LHIGRGLYYGSYLMQHT-WNIGVT 119


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 30.4 bits (69), Expect = 0.11
 Identities = 19/31 (61%), Positives = 26/31 (83%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           L++K ++KK+KKKKKKKKKKK  KKA  ++K
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 29.7 bits (67), Expect = 0.19
 Identities = 18/27 (66%), Positives = 20/27 (74%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKA 115
             QKK++KKKKKKKKKK  KK  KKK 
Sbjct: 109 AKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 29.3 bits (66), Expect = 0.26
 Identities = 19/30 (63%), Positives = 23/30 (76%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           Q  KEK K+KK+KKKKKKKKKKK ++   K
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTSKKAAK 132



 Score = 28.9 bits (65), Expect = 0.48
 Identities = 17/35 (48%), Positives = 24/35 (68%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           E   + K+  K+K K+KK+KKKKKKKKKK   ++ 
Sbjct: 96  EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKA 130



 Score = 27.7 bits (62), Expect = 1.1
 Identities = 17/27 (62%), Positives = 18/27 (66%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKK 112
                QKKK+KKKKKKK  KK  KKKK
Sbjct: 109 AKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 27.0 bits (60), Expect = 2.2
 Identities = 17/34 (50%), Positives = 24/34 (70%)

Query: 88  VNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           V    K++ K+KK+KKKKKKKKKKK   +  +K+
Sbjct: 101 VKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 30.8 bits (70), Expect = 0.11
 Identities = 19/38 (50%), Positives = 26/38 (68%)

Query: 83  VTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
                V  ++KKEKKKKK+ KK+KK+KK KK+  +E K
Sbjct: 144 EKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPK 181



 Score = 27.0 bits (60), Expect = 2.3
 Identities = 14/25 (56%), Positives = 19/25 (76%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKK 114
            ++KK+KK+K  + K  KKKKKKKK
Sbjct: 167 KKEKKDKKEKMVEPKGSKKKKKKKK 191



 Score = 27.0 bits (60), Expect = 2.7
 Identities = 14/23 (60%), Positives = 17/23 (73%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKK 114
           K K++K  + K  KKKKKKKKKK
Sbjct: 171 KDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 26.6 bits (59), Expect = 2.8
 Identities = 14/33 (42%), Positives = 22/33 (66%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
             + KKEKK+KK KK+K  + K  KK + ++K+
Sbjct: 160 KKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 30.9 bits (70), Expect = 0.12
 Identities = 10/27 (37%), Positives = 26/27 (96%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
            Q+K+E++++K++KK+++++++K+KAE
Sbjct: 195 QQQKREEERRKQRKKQQEEEERKQKAE 221



 Score = 30.1 bits (68), Expect = 0.21
 Identities = 11/29 (37%), Positives = 25/29 (86%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
            KK K++++K++++++K++KK++ E ERK
Sbjct: 189 LKKLKQQQQKREEERRKQRKKQQEEEERK 217



 Score = 27.0 bits (60), Expect = 2.2
 Identities = 8/29 (27%), Positives = 24/29 (82%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            K+ K++++K++++++K++KK+ E E ++
Sbjct: 189 LKKLKQQQQKREEERRKQRKKQQEEEERK 217



 Score = 25.8 bits (57), Expect = 6.1
 Identities = 9/27 (33%), Positives = 24/27 (88%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEI 117
           QK++E+++K++KK+++++++K+K  E 
Sbjct: 197 QKREEERRKQRKKQQEEEERKQKAEEA 223


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 30.8 bits (70), Expect = 0.13
 Identities = 17/37 (45%), Positives = 22/37 (59%)

Query: 84  TGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           T E V L K + ++++ KKKKKKKKKK KK       
Sbjct: 165 TEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSS 201


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 30.8 bits (70), Expect = 0.13
 Identities = 15/38 (39%), Positives = 24/38 (63%)

Query: 84  TGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
             E  N + KK+ +KKKKKKK+KK+ K + + ++  K 
Sbjct: 58  KKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKT 95



 Score = 27.0 bits (60), Expect = 2.4
 Identities = 14/32 (43%), Positives = 20/32 (62%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
             +KK EKKKKKKK+KK+ K + + K   +  
Sbjct: 65  ESKKKSEKKKKKKKEKKEPKSEGETKLGFKTP 96



 Score = 27.0 bits (60), Expect = 2.7
 Identities = 16/34 (47%), Positives = 22/34 (64%)

Query: 83  VTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
           V   +    KK++K  + KKK +KKKKKKK+K E
Sbjct: 49  VATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKE 82



 Score = 25.4 bits (56), Expect = 9.3
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           KKK+KKK+KK+ K + + K   K   + K
Sbjct: 72  KKKKKKKEKKEPKSEGETKLGFKTPKKSK 100


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 30.7 bits (70), Expect = 0.14
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
            +++KEKKK   K++KK  K++K+K E
Sbjct: 94  FEEEKEKKKAMSKEEKKAIKEEKEKLE 120



 Score = 29.5 bits (67), Expect = 0.36
 Identities = 11/28 (39%), Positives = 20/28 (71%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           ++EK+KKK   K++KK  K++K ++E  
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122



 Score = 27.2 bits (61), Expect = 1.8
 Identities = 11/29 (37%), Positives = 19/29 (65%)

Query: 94  KEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
           +E+K+KKK   K++KK  K++ E   + Y
Sbjct: 95  EEEKEKKKAMSKEEKKAIKEEKEKLEEPY 123


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 30.7 bits (70), Expect = 0.15
 Identities = 14/35 (40%), Positives = 25/35 (71%), Gaps = 3/35 (8%)

Query: 90  LQKKKEKKKKKKKKKKKKK---KKKKKKAEIERKR 121
            Q+KKE+KKK++++ K  K   ++++K  E ERK+
Sbjct: 284 AQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKK 318



 Score = 27.6 bits (62), Expect = 1.7
 Identities = 8/32 (25%), Positives = 22/32 (68%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
            + K  E++++++ ++KK++KKK+++     K
Sbjct: 271 KILKAAEEERQEEAQEKKEEKKKEEREAKLAK 302



 Score = 25.7 bits (57), Expect = 7.8
 Identities = 11/40 (27%), Positives = 23/40 (57%), Gaps = 5/40 (12%)

Query: 87  TVNLQKKKEKKKKKKKKKKK-----KKKKKKKKAEIERKR 121
               ++++E+K  K  ++++     +KK++KKK E E K 
Sbjct: 261 VDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKL 300


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 30.0 bits (68), Expect = 0.15
 Identities = 16/28 (57%), Positives = 22/28 (78%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIER 119
           KK+ K++KKK+KKKKKKK KK   + E+
Sbjct: 85  KKRAKRQKKKQKKKKKKKAKKGNKKEEK 112



 Score = 29.6 bits (67), Expect = 0.21
 Identities = 15/30 (50%), Positives = 23/30 (76%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           ++K  KK+ K++KKK+KKKKKKK  +  +K
Sbjct: 80  EEKTAKKRAKRQKKKQKKKKKKKAKKGNKK 109



 Score = 27.7 bits (62), Expect = 1.1
 Identities = 14/30 (46%), Positives = 23/30 (76%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           ++K E+K  KK+ K++KKK+KKKK +  +K
Sbjct: 76  KRKDEEKTAKKRAKRQKKKQKKKKKKKAKK 105



 Score = 27.3 bits (61), Expect = 1.4
 Identities = 17/28 (60%), Positives = 21/28 (75%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           KK  K++KKK+KKKKKKK KK  + E K
Sbjct: 85  KKRAKRQKKKQKKKKKKKAKKGNKKEEK 112



 Score = 26.2 bits (58), Expect = 3.6
 Identities = 16/22 (72%), Positives = 19/22 (86%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKK 112
           QKKK+KKKKKKK KK  KK++K
Sbjct: 91  QKKKQKKKKKKKAKKGNKKEEK 112



 Score = 25.8 bits (57), Expect = 5.6
 Identities = 14/28 (50%), Positives = 21/28 (75%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
             +K +EK  KK+ K++KKK+KKKKK +
Sbjct: 75  KKRKDEEKTAKKRAKRQKKKQKKKKKKK 102


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 30.2 bits (69), Expect = 0.18
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
            + +KEKKK   K++KK  K++K K E
Sbjct: 95  FEAEKEKKKAMSKEEKKAIKEEKDKLE 121



 Score = 27.5 bits (62), Expect = 1.3
 Identities = 11/28 (39%), Positives = 19/28 (67%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           + EK+KKK   K++KK  K++K ++E  
Sbjct: 96  EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 30.6 bits (69), Expect = 0.19
 Identities = 14/20 (70%), Positives = 18/20 (90%)

Query: 96  KKKKKKKKKKKKKKKKKKKA 115
           +KKKKKKK K+KKKKK ++A
Sbjct: 684 RKKKKKKKSKEKKKKKNREA 703



 Score = 29.5 bits (66), Expect = 0.47
 Identities = 14/25 (56%), Positives = 19/25 (76%)

Query: 98  KKKKKKKKKKKKKKKKKAEIERKRY 122
           +KKKKKKK K+KKKKK  E  ++ +
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIH 708



 Score = 29.5 bits (66), Expect = 0.47
 Identities = 13/23 (56%), Positives = 19/23 (82%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKK 114
           +KK+KKKK K+KKKKK ++  K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706



 Score = 28.7 bits (64), Expect = 0.86
 Identities = 14/29 (48%), Positives = 21/29 (72%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
            +KKK+KKK K+KKKKK ++  K+   +E
Sbjct: 683 SRKKKKKKKSKEKKKKKNREASKQIHALE 711


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 30.4 bits (69), Expect = 0.19
 Identities = 12/38 (31%), Positives = 22/38 (57%)

Query: 84  TGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
                   K K ++ KK++  + ++KKK+KKA+ ER+ 
Sbjct: 355 EKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRG 392



 Score = 29.3 bits (66), Expect = 0.40
 Identities = 10/30 (33%), Positives = 17/30 (56%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            K E+ KK++  + ++KKK+KK     R  
Sbjct: 364 AKPERDKKERPGRYRRKKKEKKAKSERRGL 393



 Score = 27.0 bits (60), Expect = 2.5
 Identities = 9/29 (31%), Positives = 18/29 (62%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
            + KKE+  + ++KKK+KK K +++    
Sbjct: 367 ERDKKERPGRYRRKKKEKKAKSERRGLQN 395


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 30.5 bits (69), Expect = 0.20
 Identities = 12/31 (38%), Positives = 15/31 (48%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            K K  KK  +K KK   K +K  A  + KR
Sbjct: 775 AKPKAAKKDARKAKKPSAKTQKIAAATKAKR 805



 Score = 29.3 bits (66), Expect = 0.51
 Identities = 11/29 (37%), Positives = 16/29 (55%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           + EKK K K  KK  +K KK  A+ ++  
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQKIA 798



 Score = 27.4 bits (61), Expect = 2.5
 Identities = 11/29 (37%), Positives = 14/29 (48%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           +KK K K  KK  +K KK   K  +I   
Sbjct: 772 EKKAKPKAAKKDARKAKKPSAKTQKIAAA 800



 Score = 26.2 bits (58), Expect = 6.3
 Identities = 10/31 (32%), Positives = 15/31 (48%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
             KK+ +K KK   K +K     KA+   K+
Sbjct: 779 AAKKDARKAKKPSAKTQKIAAATKAKRAAKK 809


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 30.2 bits (68), Expect = 0.22
 Identities = 12/26 (46%), Positives = 13/26 (50%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAE 116
           + K EK KKK    K   KKKKK   
Sbjct: 119 KPKTEKLKKKITVNKSTNKKKKKVLS 144



 Score = 27.9 bits (62), Expect = 1.5
 Identities = 11/30 (36%), Positives = 14/30 (46%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            K + +K KKK    K   KKKK  +  K 
Sbjct: 118 PKPKTEKLKKKITVNKSTNKKKKKVLSSKD 147



 Score = 27.1 bits (60), Expect = 2.4
 Identities = 11/30 (36%), Positives = 14/30 (46%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
             +K K+K    K   KKKKK    K E+ 
Sbjct: 121 KTEKLKKKITVNKSTNKKKKKVLSSKDELI 150



 Score = 25.9 bits (57), Expect = 6.7
 Identities = 10/31 (32%), Positives = 11/31 (35%)

Query: 88  VNLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
               KKK    K   KKKKK    K +    
Sbjct: 122 TEKLKKKITVNKSTNKKKKKVLSSKDELIKY 152


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region covers
           both the Brf homology II and III regions. This region is
           involved in binding TATA binding protein.
          Length = 95

 Score = 29.1 bits (66), Expect = 0.24
 Identities = 13/32 (40%), Positives = 20/32 (62%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
           Q++KE K+K  +     KKKKK+KA+ +R   
Sbjct: 32  QEEKELKQKADEGNNSGKKKKKRKAKKKRDEA 63


>gnl|CDD|177145 MTH00074, CYTB, cytochrome b; Provisional.
          Length = 380

 Score = 30.0 bits (68), Expect = 0.27
 Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 14/51 (27%)

Query: 28  LDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
           + A+GAS FF+        C+      LH G GL Y +    E  W +G++
Sbjct: 83  IHANGASFFFI--------CIY-----LHIGRGLYYGSYMYKET-WNIGVI 119


>gnl|CDD|133672 MTH00033, CYTB, cytochrome b; Provisional.
          Length = 383

 Score = 29.9 bits (67), Expect = 0.29
 Identities = 18/49 (36%), Positives = 23/49 (46%), Gaps = 14/49 (28%)

Query: 30  ADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
           A+GASLFF+        C+       H G GL Y     V + W VG+L
Sbjct: 81  ANGASLFFI--------CVYC-----HIGRGLYYGGYSRV-LTWIVGVL 115


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 30.1 bits (68), Expect = 0.29
 Identities = 15/28 (53%), Positives = 18/28 (64%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
             +  K KK KKKKKKKKK++K   E E
Sbjct: 269 VSEMVKFKKPKKKKKKKKKRRKDLDEDE 296



 Score = 29.0 bits (65), Expect = 0.66
 Identities = 15/26 (57%), Positives = 19/26 (73%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAEIE 118
            K KK KKKKKKKKK++K   + E+E
Sbjct: 273 VKFKKPKKKKKKKKKRRKDLDEDELE 298



 Score = 28.6 bits (64), Expect = 0.84
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
               +  K KK KKKKKKKKK++K  +
Sbjct: 267 YDVSEMVKFKKPKKKKKKKKKRRKDLD 293



 Score = 25.9 bits (57), Expect = 7.1
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 87  TVNLQKKKEKKKKKKKKKKKKKKKK 111
            V+   K +K KKKKKKKKK++K  
Sbjct: 268 DVSEMVKFKKPKKKKKKKKKRRKDL 292


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 29.6 bits (67), Expect = 0.31
 Identities = 16/29 (55%), Positives = 19/29 (65%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
             E++KKKKKKK  KKKK KK A +  K 
Sbjct: 177 NAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205



 Score = 28.4 bits (64), Expect = 0.79
 Identities = 14/35 (40%), Positives = 16/35 (45%)

Query: 81  VAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
                     +KKK+KKK  KKKK KK      KA
Sbjct: 171 DKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205



 Score = 26.5 bits (59), Expect = 3.3
 Identities = 15/25 (60%), Positives = 17/25 (68%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAE 116
             +E+KKKKKKK  KKKK KK  A 
Sbjct: 177 NAEEEKKKKKKKSAKKKKLKKVAAV 201



 Score = 25.3 bits (56), Expect = 8.4
 Identities = 13/32 (40%), Positives = 19/32 (59%)

Query: 82  AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKK 113
               ++ N +++K+KKKKK  KKKK KK    
Sbjct: 170 RDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAV 201


>gnl|CDD|177189 MTH00131, CYTB, cytochrome b; Provisional.
          Length = 380

 Score = 29.6 bits (67), Expect = 0.31
 Identities = 17/51 (33%), Positives = 24/51 (47%), Gaps = 14/51 (27%)

Query: 28  LDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGIL 78
           L A+GAS FF+        C+      LH G GL Y +    E  W +G++
Sbjct: 82  LHANGASFFFI--------CIY-----LHIGRGLYYGSYLYKET-WNIGVV 118


>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
           TopJ.  This model describes TopJ (MG_200, CbpA), a DnaJ
           homolog and probable assembly protein of the Mycoplasma
           terminal organelle. The terminal organelle is involved
           in both cytadherence and gliding motility [Cellular
           processes, Chemotaxis and motility].
          Length = 871

 Score = 29.8 bits (66), Expect = 0.35
 Identities = 16/45 (35%), Positives = 22/45 (48%), Gaps = 3/45 (6%)

Query: 81  VAVTGETVNLQKKKEKKKKKKKKKKKK---KKKKKKKAEIERKRY 122
           +  +GE  N+       KKKKKKKKK     K K++   IE +  
Sbjct: 91  INKSGEFDNITDDDTPSKKKKKKKKKGWFWAKSKQESKTIETEEI 135


>gnl|CDD|216187 pfam00913, Trypan_glycop, Trypanosome variant surface glycoprotein
           (A-type).  The trypanosome parasite expresses these
           proteins to evade the immune response. This family
           includes a variety of surface proteins such as
           Trypanosoma brucei VSGs such as expression site
           associated gene (ESAG) 6 and 7.
          Length = 370

 Score = 29.7 bits (67), Expect = 0.35
 Identities = 18/79 (22%), Positives = 23/79 (29%), Gaps = 4/79 (5%)

Query: 43  GGKKCLVSKVFDLHSGAGLVYSTADEVEVPWGVGILGHVAVTGETVNLQKKKEKKKKKKK 102
           G KKC   ++    +G GL         V W  G L  V  +G T+ L            
Sbjct: 198 GSKKC---RLTQGGTGNGLGKGGNLTANVKWAGGYLT-VTGSGATITLADLTALATAATG 253

Query: 103 KKKKKKKKKKKKAEIERKR 121
             K  KK            
Sbjct: 254 AAKAWKKAAHAAKAKPTAT 272


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 29.7 bits (66), Expect = 0.36
 Identities = 8/26 (30%), Positives = 25/26 (96%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAE 116
           QK +E+++++K+K+K+++++++++AE
Sbjct: 594 QKAREEREREKEKEKEREREREREAE 619


>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
          Length = 977

 Score = 29.6 bits (66), Expect = 0.37
 Identities = 13/21 (61%), Positives = 15/21 (71%)

Query: 97  KKKKKKKKKKKKKKKKKKAEI 117
           K KK KKKK  KKK+  KA+I
Sbjct: 122 KDKKNKKKKSAKKKEAHKAQI 142



 Score = 26.9 bits (59), Expect = 3.0
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           K + +  K KK KKKK  KKK+  + +
Sbjct: 115 KNRVESWKDKKNKKKKSAKKKEAHKAQ 141


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
           non-specific RNA binding region part 2.  This is a
           region found N terminal to the catalytic domain of
           glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
           but not in Escherichia coli. This region is thought to
           bind RNA in a non-specific manner, enhancing
           interactions between the tRNA and enzyme, but is not
           essential for enzyme function.
          Length = 83

 Score = 28.5 bits (64), Expect = 0.37
 Identities = 14/27 (51%), Positives = 16/27 (59%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIE 118
            KK+KKKKKKK +      K KKA  E
Sbjct: 23  VKKKKKKKKKKAEDTAATAKAKKATAE 49



 Score = 26.2 bits (58), Expect = 2.1
 Identities = 12/25 (48%), Positives = 14/25 (56%)

Query: 94  KEKKKKKKKKKKKKKKKKKKKAEIE 118
             KKKKKKKKKK +      KA+  
Sbjct: 22  LVKKKKKKKKKKAEDTAATAKAKKA 46



 Score = 26.2 bits (58), Expect = 2.5
 Identities = 13/27 (48%), Positives = 15/27 (55%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
           L KKK+KKKKKK +      K KK   
Sbjct: 22  LVKKKKKKKKKKAEDTAATAKAKKATA 48



 Score = 26.2 bits (58), Expect = 2.5
 Identities = 13/30 (43%), Positives = 16/30 (53%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           KKK+KKKKK +      K KK  AE   + 
Sbjct: 25  KKKKKKKKKAEDTAATAKAKKATAEDVSEG 54



 Score = 25.4 bits (56), Expect = 4.3
 Identities = 13/24 (54%), Positives = 14/24 (58%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAE 116
           K E    KKKKKKKKKK +   A 
Sbjct: 17  KTEADLVKKKKKKKKKKAEDTAAT 40



 Score = 25.0 bits (55), Expect = 6.2
 Identities = 10/26 (38%), Positives = 13/26 (50%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAEIE 118
             +KKKKKKKKK +      K  +  
Sbjct: 22  LVKKKKKKKKKKAEDTAATAKAKKAT 47


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 29.7 bits (66), Expect = 0.37
 Identities = 8/32 (25%), Positives = 16/32 (50%)

Query: 90   LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
             +KK E+ KK  +  K + +    +AE   ++
Sbjct: 1334 AKKKAEEAKKAAEAAKAEAEAAADEAEAAEEK 1365



 Score = 29.0 bits (64), Expect = 0.64
 Identities = 13/32 (40%), Positives = 18/32 (56%)

Query: 90   LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
             +KK E+ KKK    KKK ++ KK AE  +  
Sbjct: 1320 AKKKAEEAKKKADAAKKKAEEAKKAAEAAKAE 1351



 Score = 29.0 bits (64), Expect = 0.78
 Identities = 7/31 (22%), Positives = 15/31 (48%)

Query: 91   QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            +K  E  K + +    + +  ++KAE   K+
Sbjct: 1342 KKAAEAAKAEAEAAADEAEAAEEKAEAAEKK 1372



 Score = 28.2 bits (62), Expect = 1.3
 Identities = 6/31 (19%), Positives = 16/31 (51%)

Query: 91   QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            + + E    + +  ++K +  +KK E  +K+
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379



 Score = 27.8 bits (61), Expect = 1.7
 Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 91   QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
            ++K E  +KKK++ KKK    KKKAE ++K
Sbjct: 1363 EEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392



 Score = 27.8 bits (61), Expect = 1.9
 Identities = 10/31 (32%), Positives = 20/31 (64%)

Query: 91   QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
              + E  ++K +  +KKK++ KKKA+  +K+
Sbjct: 1356 ADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386



 Score = 27.4 bits (60), Expect = 2.2
 Identities = 5/31 (16%), Positives = 15/31 (48%)

Query: 91   QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
             K + +    + +  ++K +  +K + E K+
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKK 1378



 Score = 26.3 bits (57), Expect = 5.0
 Identities = 15/30 (50%), Positives = 20/30 (66%)

Query: 91   QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
             KKK  + KKK ++KKK  + KKKAE  +K
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKAEEAKK 1445



 Score = 26.3 bits (57), Expect = 5.7
 Identities = 13/31 (41%), Positives = 21/31 (67%)

Query: 90   LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
             +KK E+KKK  + KKK ++ KKK  E+++ 
Sbjct: 1383 AKKKAEEKKKADEAKKKAEEDKKKADELKKA 1413



 Score = 26.3 bits (57), Expect = 5.7
 Identities = 13/31 (41%), Positives = 18/31 (58%)

Query: 90   LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
             +KK E+ KKK  + KK    KKK  E ++K
Sbjct: 1396 AKKKAEEDKKKADELKKAAAAKKKADEAKKK 1426



 Score = 26.3 bits (57), Expect = 5.8
 Identities = 15/31 (48%), Positives = 21/31 (67%)

Query: 91   QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
             KKK    KKK ++KKK  + KKKAE ++K+
Sbjct: 1376 AKKKADAAKKKAEEKKKADEAKKKAEEDKKK 1406



 Score = 26.3 bits (57), Expect = 6.4
 Identities = 13/30 (43%), Positives = 20/30 (66%)

Query: 92   KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            KKK ++ KK  + KKK ++ KKKA+  +K 
Sbjct: 1476 KKKAEEAKKADEAKKKAEEAKKKADEAKKA 1505



 Score = 26.3 bits (57), Expect = 6.5
 Identities = 14/29 (48%), Positives = 18/29 (62%)

Query: 92   KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
            KKK  + KK    KKK  + KKKAE ++K
Sbjct: 1404 KKKADELKKAAAAKKKADEAKKKAEEKKK 1432



 Score = 25.9 bits (56), Expect = 7.1
 Identities = 13/30 (43%), Positives = 21/30 (70%)

Query: 92   KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            KKK ++ KK  + KKK ++ KKKA+  +K+
Sbjct: 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKK 1337



 Score = 25.9 bits (56), Expect = 7.1
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 92   KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
            KKK ++ KKK  + KK  + KKKA+  +K
Sbjct: 1489 KKKAEEAKKKADEAKKAAEAKKKADEAKK 1517



 Score = 25.9 bits (56), Expect = 7.8
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 92   KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
            KK    KKK  + KKK ++KKK  E ++K
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKKK 1439



 Score = 25.9 bits (56), Expect = 7.9
 Identities = 14/30 (46%), Positives = 20/30 (66%)

Query: 91   QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
            +KKK  + KKK ++ KK  + KKKAE  +K
Sbjct: 1429 EKKKADEAKKKAEEAKKADEAKKKAEEAKK 1458



 Score = 25.9 bits (56), Expect = 8.2
 Identities = 7/31 (22%), Positives = 13/31 (41%)

Query: 91   QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            +K +E KK  +  K + +    +    E K 
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEKA 1366



 Score = 25.9 bits (56), Expect = 8.4
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 90   LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
             +KK E+ KK  + KKK ++ KKK    ++K
Sbjct: 1307 AKKKAEEAKKADEAKKKAEEAKKKADAAKKK 1337



 Score = 25.9 bits (56), Expect = 8.8
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 92   KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            KK E+ +KK  +  KK+ ++ KKAE  +K+
Sbjct: 1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKK 1710



 Score = 25.9 bits (56), Expect = 8.9
 Identities = 10/31 (32%), Positives = 16/31 (51%)

Query: 91   QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            +KK +  KKK ++ KK  +  K +AE     
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358



 Score = 25.5 bits (55), Expect = 9.2
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 90   LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
             +KK E+ KK  + KKK ++ KKK  E ++ 
Sbjct: 1475 AKKKAEEAKKADEAKKKAEEAKKKADEAKKA 1505



 Score = 25.5 bits (55), Expect = 9.6
 Identities = 13/30 (43%), Positives = 19/30 (63%)

Query: 92   KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            KKK ++ KKK  + KK    KKKA+  +K+
Sbjct: 1397 KKKAEEDKKKADELKKAAAAKKKADEAKKK 1426



 Score = 25.5 bits (55), Expect = 9.7
 Identities = 14/31 (45%), Positives = 19/31 (61%)

Query: 91   QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
             KK ++ KKK ++ KKK    KKKAE  +K 
Sbjct: 1314 AKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 29.3 bits (66), Expect = 0.41
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 90  LQKKKEKKKKKKKKKKKKKK 109
            ++K+E  KK+KKKK KK K
Sbjct: 355 PKRKREGDKKQKKKKSKKLK 374



 Score = 27.7 bits (62), Expect = 1.5
 Identities = 11/21 (52%), Positives = 15/21 (71%)

Query: 95  EKKKKKKKKKKKKKKKKKKKA 115
            K+K++  KK+KKKK KK K 
Sbjct: 355 PKRKREGDKKQKKKKSKKLKL 375



 Score = 25.8 bits (57), Expect = 7.3
 Identities = 11/24 (45%), Positives = 17/24 (70%)

Query: 88  VNLQKKKEKKKKKKKKKKKKKKKK 111
           + L  K++++  KK+KKKK KK K
Sbjct: 351 LALSPKRKREGDKKQKKKKSKKLK 374



 Score = 25.4 bits (56), Expect = 8.8
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 94  KEKKKKKKKKKKKKKKKKK 112
           K K++  KK+KKKK KK K
Sbjct: 356 KRKREGDKKQKKKKSKKLK 374


>gnl|CDD|220636 pfam10222, DUF2152, Uncharacterized conserved protein (DUF2152).
           This is a family of proteins conserved from worms to
           humans. Its function is unknown.
          Length = 603

 Score = 29.5 bits (66), Expect = 0.43
 Identities = 12/56 (21%), Positives = 18/56 (32%), Gaps = 9/56 (16%)

Query: 6   DIARDLDLKSL--SKKTVDNVSVLLDADGASLFFVERRRGGKK-------CLVSKV 52
            + R    + +        NV++LLD D      V      +K       CL   V
Sbjct: 444 VLHRSYAFRGINYGNMDHINVAILLDTDNKPYLHVSVDSSDQKLYACDAGCLDEPV 499


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
           Williams-Beuren syndrome.  This domain family is found
           in eukaryotes, and is typically between 72 and 83 amino
           acids in length. The family is found in association with
           pfam08241. This family is made up of
           S-adenosylmethionine-dependent methyltransferases. The
           proteins are deleted in Williams-Beuren syndrome (WBS),
           a complex developmental disorder with multisystemic
           manifestations including supravalvular aortic stenosis
           (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 28.0 bits (63), Expect = 0.47
 Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           + ++  ++ ++KKKKKKK KKK K  I RK
Sbjct: 31  KVRRISQRNRRKKKKKKKLKKKSKEWILRK 60



 Score = 24.6 bits (54), Expect = 9.2
 Identities = 17/50 (34%), Positives = 24/50 (48%), Gaps = 2/50 (4%)

Query: 74  GVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKK--KKKKAEIERKR 121
             G      +    V    ++ ++KKKKKKK KKK K    +K E  R+R
Sbjct: 18  EEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRR 67


>gnl|CDD|222174 pfam13492, GAF_3, GAF domain. 
          Length = 129

 Score = 28.5 bits (64), Expect = 0.49
 Identities = 14/78 (17%), Positives = 32/78 (41%), Gaps = 10/78 (12%)

Query: 12 DLKSLSKKTVDNVSVLLDADGASLFFVERRRGGKKCLVSKVFDLHSGAGLVYSTADEVEV 71
          D   L ++ ++ ++ +L AD A+L+ ++               L +G+G     ++   +
Sbjct: 1  DPDELLERLLELLAEILGADRAALYLLDEDGL--------ELRLVAGSGGEPRLSE--SL 50

Query: 72 PWGVGILGHVAVTGETVN 89
          P    +       GE V+
Sbjct: 51 PEDSPLAQWALEKGEPVS 68


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 28.9 bits (65), Expect = 0.49
 Identities = 11/31 (35%), Positives = 19/31 (61%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            K K KK  K K ++K+K K++K+ +  R+ 
Sbjct: 54  AKAKAKKALKAKIEEKEKAKREKEEKGLREL 84



 Score = 28.9 bits (65), Expect = 0.53
 Identities = 12/27 (44%), Positives = 14/27 (51%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKA 115
            L  +K K +K  K  KKKK K K K 
Sbjct: 189 VLINEKLKAEKAAKGGKKKKGKAKAKL 215



 Score = 27.3 bits (61), Expect = 1.7
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           +K K   K K KK  K K ++K+KA+ E++ 
Sbjct: 48  EKAKVAAKAKAKKALKAKIEEKEKAKREKEE 78



 Score = 27.3 bits (61), Expect = 1.8
 Identities = 10/31 (32%), Positives = 19/31 (61%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           +++K K   K K KK  K K ++K + +R++
Sbjct: 46  EEEKAKVAAKAKAKKALKAKIEEKEKAKREK 76



 Score = 27.3 bits (61), Expect = 2.1
 Identities = 9/29 (31%), Positives = 15/29 (51%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           K   K K KK  K K ++K+K    + ++
Sbjct: 51  KVAAKAKAKKALKAKIEEKEKAKREKEEK 79



 Score = 27.3 bits (61), Expect = 2.1
 Identities = 13/32 (40%), Positives = 16/32 (50%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
            K   K K KK  K K ++K+K K E E K  
Sbjct: 50  AKVAAKAKAKKALKAKIEEKEKAKREKEEKGL 81



 Score = 26.5 bits (59), Expect = 3.6
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           + +EK+++K K   K K KK  KA+IE K
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALKAKIEEK 69



 Score = 25.8 bits (57), Expect = 5.7
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           +K++EK K   K K KK  K K + + + KR
Sbjct: 44  EKEEEKAKVAAKAKAKKALKAKIEEKEKAKR 74



 Score = 25.8 bits (57), Expect = 6.8
 Identities = 13/32 (40%), Positives = 17/32 (53%)

Query: 81  VAVTGETVNLQKKKEKKKKKKKKKKKKKKKKK 112
           V+ T   +  +K K +K  K  KKKK K K K
Sbjct: 183 VSSTLNVLINEKLKAEKAAKGGKKKKGKAKAK 214



 Score = 25.4 bits (56), Expect = 10.0
 Identities = 12/29 (41%), Positives = 17/29 (58%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           ++KE++K K   K K KK  K K E + K
Sbjct: 43  EEKEEEKAKVAAKAKAKKALKAKIEEKEK 71


>gnl|CDD|197876 smart00792, Agouti, Agouti protein.  The agouti protein regulates
           pigmentation in the mouse hair follicle producing a
           black hair with a subapical yellow band. A highly
           homologous protein agouti signal protein (ASIP) is
           present in humans and is expressed at highest levels in
           adipose tissue where it may play a role in energy
           homeostasis and possibly human pigmentation.
          Length = 124

 Score = 28.7 bits (64), Expect = 0.49
 Identities = 12/37 (32%), Positives = 19/37 (51%), Gaps = 1/37 (2%)

Query: 86  ETVNLQKKKEK-KKKKKKKKKKKKKKKKKKAEIERKR 121
             V L KK +K   ++ +KK  +KK+KK    + R  
Sbjct: 46  SIVGLNKKPKKISAEEAEKKLLQKKEKKALTNVLRPE 82


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 28.9 bits (65), Expect = 0.50
 Identities = 10/30 (33%), Positives = 16/30 (53%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIER 119
             + K K K K K K  KK +++ K E++ 
Sbjct: 94  KPEPKPKPKPKPKPKPVKKVEEQPKREVKP 123



 Score = 27.3 bits (61), Expect = 2.2
 Identities = 8/26 (30%), Positives = 15/26 (57%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAE 116
           + K + K K  KK +++ K++ K  E
Sbjct: 100 KPKPKPKPKPVKKVEEQPKREVKPVE 125



 Score = 27.0 bits (60), Expect = 2.7
 Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIER 119
           K K K K K  KK +++ K++ K    R
Sbjct: 100 KPKPKPKPKPVKKVEEQPKREVKPVEPR 127



 Score = 26.6 bits (59), Expect = 3.4
 Identities = 12/36 (33%), Positives = 18/36 (50%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
             V + K + K K K K K K  KK +++ + E K 
Sbjct: 88  APVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKP 123


>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
           SecD/SecF; Reviewed.
          Length = 755

 Score = 29.0 bits (66), Expect = 0.51
 Identities = 11/23 (47%), Positives = 18/23 (78%)

Query: 94  KEKKKKKKKKKKKKKKKKKKKAE 116
           ++++ KKKKK+KK KK + +K E
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755



 Score = 29.0 bits (66), Expect = 0.54
 Identities = 12/24 (50%), Positives = 20/24 (83%)

Query: 88  VNLQKKKEKKKKKKKKKKKKKKKK 111
           ++L+K++ KKKKK+KK KK + +K
Sbjct: 730 LDLEKRRLKKKKKRKKVKKWEVEK 753



 Score = 28.7 bits (65), Expect = 0.90
 Identities = 12/23 (52%), Positives = 18/23 (78%)

Query: 96  KKKKKKKKKKKKKKKKKKKAEIE 118
           +K++ KKKKK+KK KK +  +IE
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755



 Score = 27.9 bits (63), Expect = 1.6
 Identities = 11/21 (52%), Positives = 16/21 (76%)

Query: 92  KKKEKKKKKKKKKKKKKKKKK 112
           +K+  KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 26.3 bits (59), Expect = 5.4
 Identities = 11/21 (52%), Positives = 16/21 (76%)

Query: 93  KKEKKKKKKKKKKKKKKKKKK 113
           +K + KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 26.0 bits (58), Expect = 6.2
 Identities = 12/21 (57%), Positives = 18/21 (85%)

Query: 99  KKKKKKKKKKKKKKKKAEIER 119
           +K++ KKKKK+KK KK E+E+
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 29.0 bits (65), Expect = 0.54
 Identities = 14/31 (45%), Positives = 17/31 (54%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           + K EKK KK K K K K K K K + + K 
Sbjct: 87  KPKPEKKPKKPKPKPKPKPKPKPKVKPQPKP 117



 Score = 27.8 bits (62), Expect = 1.5
 Identities = 13/35 (37%), Positives = 15/35 (42%)

Query: 81  VAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
                 T    K+K K +KK KK K K K K K  
Sbjct: 74  ETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPK 108



 Score = 26.7 bits (59), Expect = 2.9
 Identities = 13/27 (48%), Positives = 14/27 (51%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEI 117
            K KEK K +KK KK K K K K    
Sbjct: 82  PKPKEKPKPEKKPKKPKPKPKPKPKPK 108



 Score = 25.9 bits (57), Expect = 5.3
 Identities = 11/31 (35%), Positives = 13/31 (41%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
           +     K K K K K K K K + K KK   
Sbjct: 92  KKPKKPKPKPKPKPKPKPKVKPQPKPKKPPS 122



 Score = 25.9 bits (57), Expect = 5.3
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           ++K + +KK KK K K K K K K +++
Sbjct: 85  KEKPKPEKKPKKPKPKPKPKPKPKPKVK 112



 Score = 25.9 bits (57), Expect = 6.3
 Identities = 12/33 (36%), Positives = 17/33 (51%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           E    +KK +K K K K K K K K K + + +
Sbjct: 86  EKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPK 118


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 29.2 bits (66), Expect = 0.56
 Identities = 8/25 (32%), Positives = 17/25 (68%)

Query: 97  KKKKKKKKKKKKKKKKKKAEIERKR 121
           ++KKK KK  +K+ +  +  +++KR
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKKR 108



 Score = 27.6 bits (62), Expect = 2.0
 Identities = 12/45 (26%), Positives = 24/45 (53%), Gaps = 5/45 (11%)

Query: 82  AVTGETVNLQKKKEKKKKKKKKKKK-----KKKKKKKKAEIERKR 121
            +   T   ++KK+ KK  +K+ +       KK+K+ +A  E++R
Sbjct: 75  IIFSTTTYFREKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQR 119


>gnl|CDD|146077 pfam03263, Cucumo_2B, Cucumovirus protein 2B.  This protein may be
           a viral movement protein.
          Length = 103

 Score = 28.0 bits (62), Expect = 0.60
 Identities = 12/41 (29%), Positives = 25/41 (60%)

Query: 81  VAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            AVT   + L +  EKK+++++  +K ++++  K+  ER R
Sbjct: 2   GAVTNVELQLARLVEKKRQRRRSHRKNRRERGHKSPSERAR 42


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 28.7 bits (65), Expect = 0.64
 Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 7/37 (18%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKK-------KAEIERKR 121
           KKK KK KK+K+K++K+ +K         K   E  +
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAK 50


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 28.8 bits (65), Expect = 0.66
 Identities = 13/46 (28%), Positives = 25/46 (54%), Gaps = 6/46 (13%)

Query: 74  GVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIER 119
           G G  G V V G         + +KKK++KK+++K+ K+ + + + 
Sbjct: 165 GGGTWGLVGVPG------HSSDSEKKKQRKKQRRKRSKELREDDDD 204


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 28.9 bits (65), Expect = 0.72
 Identities = 15/35 (42%), Positives = 20/35 (57%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           E    + K+  KKKKKKKKKKKKK   +  ++   
Sbjct: 46  ENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYD 80



 Score = 28.5 bits (64), Expect = 0.84
 Identities = 16/35 (45%), Positives = 21/35 (60%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           E ++  ++ E K+  KKKKKKKKKKKKK       
Sbjct: 42  ELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYD 76



 Score = 28.1 bits (63), Expect = 1.0
 Identities = 15/32 (46%), Positives = 20/32 (62%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            + ++ + K+  KKKKKKKKKKKKK   E   
Sbjct: 45  SENQEAENKQNNKKKKKKKKKKKKKNLGEAYD 76



 Score = 28.1 bits (63), Expect = 1.2
 Identities = 17/32 (53%), Positives = 20/32 (62%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
           Q+ + K+  KKKKKKKKKKKKK   E     Y
Sbjct: 48  QEAENKQNNKKKKKKKKKKKKKNLGEAYDLAY 79



 Score = 27.0 bits (60), Expect = 2.8
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 84  TGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
             E    + K++K++ K+K  KKK KK KK  +
Sbjct: 2   MNENGENEVKQQKQQNKQKGTKKKNKKSKKDVD 34



 Score = 26.2 bits (58), Expect = 5.9
 Identities = 16/32 (50%), Positives = 18/32 (56%)

Query: 84  TGETVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
             E  N Q  K+KKKKKKKKKKK   +    A
Sbjct: 47  NQEAENKQNNKKKKKKKKKKKKKNLGEAYDLA 78


>gnl|CDD|221550 pfam12366, Casc1, Cancer susceptibility candidate 1.  This domain
           family is found in eukaryotes, and is typically between
           216 and 263 amino acids in length. Casc1 has many SNPs
           associated with cancer susceptibility.
          Length = 227

 Score = 28.5 bits (64), Expect = 0.73
 Identities = 6/30 (20%), Positives = 12/30 (40%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIER 119
            ++ +  +       K+ K K  K  EI +
Sbjct: 72  AEEAEGARGYVTDIPKEYKAKPVKYLEIPK 101


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 28.6 bits (64), Expect = 0.74
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           +KK + KKK    K K  K  KKK++ + K
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSK 777



 Score = 28.2 bits (63), Expect = 1.0
 Identities = 13/30 (43%), Positives = 15/30 (50%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           KKK    K K  K  KKK K K  +  +KR
Sbjct: 754 KKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783



 Score = 27.9 bits (62), Expect = 1.4
 Identities = 13/24 (54%), Positives = 14/24 (58%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKA 115
           K K  K  KKK K K KK  KK+A
Sbjct: 761 KGKAAKTVKKKSKAKSKKTTKKRA 784



 Score = 27.9 bits (62), Expect = 1.4
 Identities = 10/30 (33%), Positives = 13/30 (43%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           K K+K    K K  K  KKK K    +  +
Sbjct: 752 KAKKKSASTKGKAAKTVKKKSKAKSKKTTK 781



 Score = 27.5 bits (61), Expect = 2.0
 Identities = 12/30 (40%), Positives = 14/30 (46%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           +K    K K  K  KKK K K KK   +R 
Sbjct: 755 KKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784



 Score = 27.1 bits (60), Expect = 2.4
 Identities = 11/30 (36%), Positives = 15/30 (50%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           +KK    K K  K  KKK K K K   +++
Sbjct: 754 KKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783



 Score = 26.3 bits (58), Expect = 5.4
 Identities = 12/33 (36%), Positives = 14/33 (42%)

Query: 82  AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
           AV  +     K   +KK K KKK    K K  K
Sbjct: 734 AVITKYAGTPKTPYEKKTKAKKKSASTKGKAAK 766



 Score = 25.6 bits (56), Expect = 9.0
 Identities = 10/24 (41%), Positives = 12/24 (50%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKA 115
           K  +  KKK K K KK  KK+   
Sbjct: 763 KAAKTVKKKSKAKSKKTTKKRAGP 786


>gnl|CDD|218675 pfam05648, PEX11, Peroxisomal biogenesis factor 11 (PEX11).  This
           family consists of several peroxisomal biogenesis factor
           11 (PEX11) proteins from several eukaryotic species. The
           PEX11 peroxisomal membrane proteins promote peroxisome
           division in multiple eukaryotes.
          Length = 225

 Score = 28.4 bits (64), Expect = 0.78
 Identities = 15/51 (29%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 73  WGVGILGHVAVT-GETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
           W  G+L  +     E   LQ+K++K KK+K  + +   ++K   +I +KR 
Sbjct: 129 WLFGLLLSLVRDLRELRQLQEKEKKLKKEKDSEGESTSERKLLKKILKKRP 179


>gnl|CDD|216599 pfam01603, B56, Protein phosphatase 2A regulatory B subunit (B56
           family).  Protein phosphatase 2A (PP2A) is a major
           intracellular protein phosphatase that regulates
           multiple aspects of cell growth and metabolism. The
           ability of this widely distributed heterotrimeric enzyme
           to act on a diverse array of substrates is largely
           controlled by the nature of its regulatory B subunit.
           There are multiple families of B subunits (See also
           pfam01240), this family is called the B56 family.
          Length = 405

 Score = 28.4 bits (64), Expect = 0.85
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 94  KEKKKKKKKKKKKKKKKKKKKAEIERK 120
            + K+K+ K+KKK+KK+KK  A I   
Sbjct: 377 NKYKEKEAKEKKKEKKRKKTWARIAEA 403



 Score = 28.0 bits (63), Expect = 1.3
 Identities = 14/28 (50%), Positives = 19/28 (67%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEI 117
             K KEK+ K+KKK+KK+KK   + AE 
Sbjct: 376 GNKYKEKEAKEKKKEKKRKKTWARIAEA 403


>gnl|CDD|235627 PRK05854, PRK05854, short chain dehydrogenase; Provisional.
          Length = 313

 Score = 28.5 bits (64), Expect = 0.85
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 64  STADEVEVPWGVGILGHVAVTG 85
           +TAD  E+ +G   LGH A+T 
Sbjct: 111 TTADGFELQFGTNHLGHFALTA 132


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 28.1 bits (63), Expect = 0.88
 Identities = 16/26 (61%), Positives = 17/26 (65%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAE 116
           QK K KKKK KKKK K K KK  K +
Sbjct: 82  QKWKWKKKKSKKKKDKDKDKKDDKKD 107



 Score = 28.1 bits (63), Expect = 1.1
 Identities = 11/32 (34%), Positives = 18/32 (56%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
            K K+KK  KK  K +KK +K+ + ++E    
Sbjct: 96  DKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTK 127



 Score = 27.7 bits (62), Expect = 1.3
 Identities = 15/30 (50%), Positives = 18/30 (60%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           + K +KKK KKKK K K KK  KK +   K
Sbjct: 83  KWKWKKKKSKKKKDKDKDKKDDKKDDKSEK 112



 Score = 27.3 bits (61), Expect = 1.5
 Identities = 16/31 (51%), Positives = 19/31 (61%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
           K K+KK KKKK K K KK  KK  + E+K  
Sbjct: 85  KWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115



 Score = 27.3 bits (61), Expect = 1.6
 Identities = 11/30 (36%), Positives = 16/30 (53%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           KK + K KK  KK  K +KK +K   ++  
Sbjct: 94  KKDKDKDKKDDKKDDKSEKKDEKEAEDKLE 123



 Score = 27.3 bits (61), Expect = 2.0
 Identities = 15/31 (48%), Positives = 19/31 (61%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
             KKK+ KKKK K K KK  KK  K+E + +
Sbjct: 85  KWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115



 Score = 27.0 bits (60), Expect = 2.2
 Identities = 16/30 (53%), Positives = 20/30 (66%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           +K+K K KKKK KKKK K K KK  + + K
Sbjct: 80  EKQKWKWKKKKSKKKKDKDKDKKDDKKDDK 109



 Score = 27.0 bits (60), Expect = 2.3
 Identities = 14/28 (50%), Positives = 19/28 (67%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
             ++K++ K KKKK KKKK K K KK +
Sbjct: 77  EYEEKQKWKWKKKKSKKKKDKDKDKKDD 104



 Score = 27.0 bits (60), Expect = 2.3
 Identities = 15/29 (51%), Positives = 17/29 (58%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           KKK KKKK K K KK  KK  K  + + K
Sbjct: 88  KKKSKKKKDKDKDKKDDKKDDKSEKKDEK 116



 Score = 27.0 bits (60), Expect = 2.3
 Identities = 14/29 (48%), Positives = 17/29 (58%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
            KK+K K K KK  KK  K +KK E E +
Sbjct: 91  SKKKKDKDKDKKDDKKDDKSEKKDEKEAE 119



 Score = 27.0 bits (60), Expect = 2.6
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAE 116
           +K K+K KK  KK  K +KK +K+AE
Sbjct: 94  KKDKDKDKKDDKKDDKSEKKDEKEAE 119



 Score = 26.2 bits (58), Expect = 4.9
 Identities = 14/27 (51%), Positives = 20/27 (74%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
           ++K+ E+K+K K KKKK KKKK K  +
Sbjct: 74  VKKEYEEKQKWKWKKKKSKKKKDKDKD 100



 Score = 25.8 bits (57), Expect = 5.3
 Identities = 13/30 (43%), Positives = 18/30 (60%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           KK +KKK K K KK  KK  K + + E++ 
Sbjct: 89  KKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118



 Score = 25.0 bits (55), Expect = 9.9
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
             +K K+KK K K KK  KK  K +K + +
Sbjct: 87  KKKKSKKKKDKDKDKKDDKKDDKSEKKDEK 116


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 28.5 bits (64), Expect = 0.92
 Identities = 15/30 (50%), Positives = 19/30 (63%)

Query: 91   QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
             K K  K +K K KKK+KKKKK  A+  +K
Sbjct: 1162 TKGKASKLRKPKLKKKEKKKKKSSADKSKK 1191



 Score = 28.1 bits (63), Expect = 1.1
 Identities = 16/33 (48%), Positives = 18/33 (54%)

Query: 89   NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
             L+K K KKK+KKKKK    K KK       KR
Sbjct: 1168 KLRKPKLKKKEKKKKKSSADKSKKASVVGNSKR 1200



 Score = 26.9 bits (60), Expect = 2.9
 Identities = 12/31 (38%), Positives = 14/31 (45%)

Query: 91   QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
              K  K K KKK+KKKKK    K  +     
Sbjct: 1166 ASKLRKPKLKKKEKKKKKSSADKSKKASVVG 1196



 Score = 26.9 bits (60), Expect = 2.9
 Identities = 14/32 (43%), Positives = 20/32 (62%)

Query: 91   QKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
             K K K  K +K K KKK+KKKKK+  ++ + 
Sbjct: 1160 SKTKGKASKLRKPKLKKKEKKKKKSSADKSKK 1191



 Score = 26.5 bits (59), Expect = 4.6
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 92   KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            KKK KK+ +      KKKKK +K    +K+
Sbjct: 1307 KKKVKKRLEGSLAALKKKKKSEKKTARKKK 1336



 Score = 25.8 bits (57), Expect = 7.5
 Identities = 11/31 (35%), Positives = 19/31 (61%)

Query: 90   LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
             ++++ K K K K  K +K K KKK + ++K
Sbjct: 1153 AKEQRLKSKTKGKASKLRKPKLKKKEKKKKK 1183



 Score = 25.4 bits (56), Expect = 9.4
 Identities = 10/34 (29%), Positives = 18/34 (52%)

Query: 88   VNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
                 K+++ K K K K  K +K K K + ++K+
Sbjct: 1149 EKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKK 1182


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 28.3 bits (63), Expect = 0.92
 Identities = 12/37 (32%), Positives = 17/37 (45%)

Query: 84  TGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
             E   ++K K+ +  K  K  K KK KKK +   R 
Sbjct: 141 PPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRS 177



 Score = 28.3 bits (63), Expect = 0.93
 Identities = 11/30 (36%), Positives = 15/30 (50%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           KK +K +  K  K  K KK KKK  +  + 
Sbjct: 148 KKPKKGQSPKVPKAPKPKKPKKKGSVSNRS 177



 Score = 27.5 bits (61), Expect = 1.6
 Identities = 12/33 (36%), Positives = 14/33 (42%)

Query: 82  AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
             T      + KK KK +  K  K  K KK KK
Sbjct: 137 ISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKK 169



 Score = 26.8 bits (59), Expect = 2.9
 Identities = 10/30 (33%), Positives = 12/30 (40%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           + KK +  K  K  K KK KKK        
Sbjct: 149 KPKKGQSPKVPKAPKPKKPKKKGSVSNRSV 178



 Score = 26.8 bits (59), Expect = 3.7
 Identities = 9/31 (29%), Positives = 11/31 (35%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            K  +  K KK KKK     +  K      R
Sbjct: 156 PKVPKAPKPKKPKKKGSVSNRSVKMPGIDPR 186



 Score = 26.4 bits (58), Expect = 4.1
 Identities = 9/31 (29%), Positives = 11/31 (35%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
             K  K KK KKK     +  K      R +
Sbjct: 158 VPKAPKPKKPKKKGSVSNRSVKMPGIDPRSK 188



 Score = 26.0 bits (57), Expect = 5.1
 Identities = 9/31 (29%), Positives = 14/31 (45%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
                ++ K+ KK +  K  K  K KK K +
Sbjct: 140 APPEAKEVKKPKKGQSPKVPKAPKPKKPKKK 170



 Score = 25.6 bits (56), Expect = 7.6
 Identities = 8/28 (28%), Positives = 9/28 (32%)

Query: 88  VNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
                K  K  K KK KKK     +   
Sbjct: 152 KGQSPKVPKAPKPKKPKKKGSVSNRSVK 179



 Score = 25.6 bits (56), Expect = 8.6
 Identities = 11/33 (33%), Positives = 13/33 (39%)

Query: 82  AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
               + V   KK +  K  K  K KK KKK   
Sbjct: 141 PPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSV 173


>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.  This
           domain is the C-terminal end of ribosomal L18/L5
           proteins.
          Length = 93

 Score = 27.5 bits (62), Expect = 0.93
 Identities = 13/32 (40%), Positives = 21/32 (65%), Gaps = 3/32 (9%)

Query: 92  KKKEKKKKKKKKKKKKKKK---KKKKAEIERK 120
           KKKEKK+ K + K+   KK   +++KA + +K
Sbjct: 62  KKKEKKEVKAESKRYNAKKLTLEQRKARVAQK 93



 Score = 25.6 bits (57), Expect = 3.9
 Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 96  KKKKKKKKKKKKKKKKKKKAEIERKR 121
           KKK+KK+ K + K+   KK  +E+++
Sbjct: 62  KKKEKKEVKAESKRYNAKKLTLEQRK 87


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 28.1 bits (63), Expect = 1.0
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
            + +KE+KK   K++KK  K +K+K E E  
Sbjct: 94  FKAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124


>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
           structure and dynamics].
          Length = 395

 Score = 28.2 bits (63), Expect = 1.0
 Identities = 12/34 (35%), Positives = 21/34 (61%)

Query: 83  VTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
           +T + +NL       K+KK+ +K KK+KK K ++
Sbjct: 52  ITADLINLGAAISIPKRKKQTEKGKKEKKPKVSD 85


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 28.3 bits (63), Expect = 1.1
 Identities = 13/31 (41%), Positives = 23/31 (74%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           +  KE++K+K++ K++KKKKK+K  E  + R
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKDR 131



 Score = 27.5 bits (61), Expect = 1.9
 Identities = 16/39 (41%), Positives = 24/39 (61%)

Query: 83  VTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
              E    ++ KE+KKKKK+K K++ K +K K E + KR
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKR 141



 Score = 26.8 bits (59), Expect = 3.1
 Identities = 15/36 (41%), Positives = 26/36 (72%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           E  N   K+E+K+K++ K++KKKKK+K K E + ++
Sbjct: 97  EPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132



 Score = 26.4 bits (58), Expect = 5.2
 Identities = 9/31 (29%), Positives = 22/31 (70%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           ++ K +  K+++K+K++ K++KKK + + K 
Sbjct: 96  KEPKNESGKEEEKEKEQVKEEKKKKKEKPKE 126



 Score = 25.6 bits (56), Expect = 8.1
 Identities = 14/31 (45%), Positives = 22/31 (70%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           Q K+EKKKKK+K K++ K +K K+   E++ 
Sbjct: 112 QVKEEKKKKKEKPKEEPKDRKPKEEAKEKRP 142



 Score = 25.6 bits (56), Expect = 8.7
 Identities = 18/67 (26%), Positives = 33/67 (49%)

Query: 55  LHSGAGLVYSTADEVEVPWGVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
           L   A    S+ + V+     G  G  A T      + +  K+++K+K++ K++KKKKK+
Sbjct: 63  LAKCAESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKE 122

Query: 115 AEIERKR 121
              E  +
Sbjct: 123 KPKEEPK 129



 Score = 25.6 bits (56), Expect = 9.8
 Identities = 11/31 (35%), Positives = 23/31 (74%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           + K+++  K+K+K+K+KK ++ +  E E+KR
Sbjct: 136 EAKEKRPPKEKEKEKEKKVEEPRDREEEKKR 166


>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
           Members of the NSCC2 family have been sequenced from
           various yeast, fungal and animals species including
           Saccharomyces cerevisiae, Drosophila melanogaster and
           Homo sapiens. These proteins are the Sec62 proteins,
           believed to be associated with the Sec61 and Sec63
           constituents of the general protein secretary systems of
           yeast microsomes. They are also the non-selective cation
           (NS) channels of the mammalian cytoplasmic membrane. The
           yeast Sec62 protein has been shown to be essential for
           cell growth. The mammalian NS channel proteins has been
           implicated in platelet derived growth factor(PGDF)
           dependent single channel current in fibroblasts. These
           channels are essentially closed in serum deprived
           tissue-culture cells and are specifically opened by
           exposure to PDGF. These channels are reported to exhibit
           equal selectivity for Na+, K+ and Cs+ with low
           permeability to Ca2+, and no permeability to anions
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 232

 Score = 28.0 bits (62), Expect = 1.1
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKA 115
           +K K   KKK K KK KKK+ K++A
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKREA 231



 Score = 26.8 bits (59), Expect = 2.6
 Identities = 13/23 (56%), Positives = 15/23 (65%)

Query: 94  KEKKKKKKKKKKKKKKKKKKKAE 116
           K+K   KKK K KK KKK+ K E
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKRE 230



 Score = 26.0 bits (57), Expect = 5.7
 Identities = 14/24 (58%), Positives = 16/24 (66%)

Query: 95  EKKKKKKKKKKKKKKKKKKKAEIE 118
           EK K   KKK K KK KKK+A+ E
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKRE 230



 Score = 26.0 bits (57), Expect = 5.9
 Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKK 111
              K   KKK K KK KKK+ K+
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 27.9 bits (62), Expect = 1.1
 Identities = 12/19 (63%), Positives = 14/19 (73%)

Query: 93  KKEKKKKKKKKKKKKKKKK 111
            +E K+KKKKK KKKK  K
Sbjct: 86  NQENKQKKKKKDKKKKSPK 104



 Score = 27.1 bits (60), Expect = 2.0
 Identities = 12/18 (66%), Positives = 14/18 (77%)

Query: 91  QKKKEKKKKKKKKKKKKK 108
           Q+ K+KKKKK KKKK  K
Sbjct: 87  QENKQKKKKKDKKKKSPK 104



 Score = 26.3 bits (58), Expect = 3.7
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 92  KKKEKKKKKKKKKKKKKKK 110
            ++ K+KKKKK KKKK  K
Sbjct: 86  NQENKQKKKKKDKKKKSPK 104



 Score = 26.3 bits (58), Expect = 3.7
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKK 112
           ET +     ++ K+KKKKK KKKK  K
Sbjct: 78  ETEDWFSPNQENKQKKKKKDKKKKSPK 104



 Score = 26.3 bits (58), Expect = 4.7
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 96  KKKKKKKKKKKKKKKKKKK 114
            ++ K+KKKKK KKKK  K
Sbjct: 86  NQENKQKKKKKDKKKKSPK 104


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 27.6 bits (62), Expect = 1.3
 Identities = 7/33 (21%), Positives = 21/33 (63%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            + +  EK+  K K++K+++ ++K+K  ++ + 
Sbjct: 119 QIDELLEKELAKLKREKRRENERKQKEILKEQM 151



 Score = 26.1 bits (58), Expect = 4.7
 Identities = 8/28 (28%), Positives = 19/28 (67%)

Query: 94  KEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           KE  K K++K+++ ++K+K+  + + K 
Sbjct: 126 KELAKLKREKRRENERKQKEILKEQMKM 153


>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
           MW N terminal.  This domain is found in eukaryotes. This
           domain is about 90 amino acids in length. This domain is
           found associated with pfam00076. This domain is part of
           U1 snRNP, which is the pre-mRNA binding protein of the
           penta-snRNP spliceosome complex. It extends over a
           distance of 180 A from its RNA binding domain, wraps
           around the core domain of U1 snRNP consisting of the
           seven Sm proteins and finally contacts U1-C, which is
           crucial for 5'-splice-site recognition.
          Length = 94

 Score = 27.2 bits (61), Expect = 1.3
 Identities = 10/29 (34%), Positives = 21/29 (72%)

Query: 87  TVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
           T    +K+E++K++KK+K +KK +++ K 
Sbjct: 59  TETWLEKREREKREKKEKLEKKLEEELKE 87


>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
           beta and delta phosphatidylinositol 3-kinases (PI3Ks).
           PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
           regulate cell processes such as cell growth,
           differentiation, proliferation, and motility.  PI3Ks
           work on phosphorylation of phosphatidylinositol,
           phosphatidylinositide (4)P (PtdIns (4)P),2 or
           PtdIns(4,5)P2. Specifically they phosphorylate the D3
           hydroxyl group of phosphoinositol lipids on the inositol
           ring. There are 3 classes of PI3Ks based on structure,
           regulation, and specificity. All classes contain a C2
           domain, a PIK domain, and a kinase catalytic domain.
           The members here are class I, beta and delta isoforms of
           PI3Ks and contain both a Ras-binding domain and a
           p85-binding domain.  Class II PI3Ks contain both of
           these as well as a PX domain, and a C-terminal C2 domain
           containing a nuclear localization signal.  C2 domains
           fold into an 8-standed beta-sandwich that can adopt 2
           structural arrangements: Type I and Type II,
           distinguished by a circular permutation involving their
           N- and C-terminal beta strands. Many C2 domains are
           Ca2+-dependent membrane-targeting modules that bind a
           wide variety of substances including bind phospholipids,
           inositol polyphosphates, and intracellular proteins.
           Most C2 domain proteins are either signal transduction
           enzymes that contain a single C2 domain, such as protein
           kinase C, or membrane trafficking proteins which contain
           at least two C2 domains, such as synaptotagmin 1.
           However, there are a few exceptions to this including
           RIM isoforms and some splice variants of piccolo/aczonin
           and intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions.  Members have a type-I
           topology.
          Length = 173

 Score = 27.7 bits (62), Expect = 1.3
 Identities = 11/20 (55%), Positives = 14/20 (70%)

Query: 96  KKKKKKKKKKKKKKKKKKKA 115
           KK K K+ +K + KKKKKK 
Sbjct: 87  KKAKGKRSRKNQTKKKKKKD 106



 Score = 26.5 bits (59), Expect = 2.8
 Identities = 10/21 (47%), Positives = 14/21 (66%)

Query: 93  KKEKKKKKKKKKKKKKKKKKK 113
             +K K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 26.5 bits (59), Expect = 3.0
 Identities = 10/20 (50%), Positives = 13/20 (65%)

Query: 95  EKKKKKKKKKKKKKKKKKKK 114
           E  KK K K+ +K + KKKK
Sbjct: 84  EVSKKAKGKRSRKNQTKKKK 103



 Score = 25.7 bits (57), Expect = 5.5
 Identities = 10/19 (52%), Positives = 13/19 (68%)

Query: 96  KKKKKKKKKKKKKKKKKKK 114
            KK K K+ +K + KKKKK
Sbjct: 86  SKKAKGKRSRKNQTKKKKK 104



 Score = 25.7 bits (57), Expect = 5.6
 Identities = 11/20 (55%), Positives = 14/20 (70%)

Query: 92  KKKEKKKKKKKKKKKKKKKK 111
            KK K K+ +K + KKKKKK
Sbjct: 86  SKKAKGKRSRKNQTKKKKKK 105



 Score = 25.4 bits (56), Expect = 9.2
 Identities = 10/21 (47%), Positives = 14/21 (66%)

Query: 92  KKKEKKKKKKKKKKKKKKKKK 112
             K+ K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105


>gnl|CDD|165586 PHA03342, PHA03342, US22 family homolog; Provisional.
          Length = 511

 Score = 28.0 bits (62), Expect = 1.4
 Identities = 18/68 (26%), Positives = 32/68 (47%), Gaps = 3/68 (4%)

Query: 55  LHSGAGLVYSTADEVEVPWGVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
           L+SGAGL    AD V   + +G+L  +  +G        KE++K+K +   +   +   K
Sbjct: 293 LYSGAGLTRRVADNVAEFFQIGMLKRLFSSGFEF---HAKERRKEKLEGAPRCPHRIADK 349

Query: 115 AEIERKRY 122
            E+  +  
Sbjct: 350 FELAEEHC 357


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 27.9 bits (63), Expect = 1.5
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
            +      ++K+KKK+KKK KKKK ++ R +
Sbjct: 390 AQPVGVAPEQKEKKKEKKKNKKKKYKVPRGK 420



 Score = 26.8 bits (60), Expect = 3.3
 Identities = 13/37 (35%), Positives = 22/37 (59%)

Query: 81  VAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEI 117
           +      V  ++K++KK+KKK KKKK K  + K  ++
Sbjct: 388 LKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKV 424


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 27.6 bits (62), Expect = 1.6
 Identities = 8/31 (25%), Positives = 18/31 (58%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           +   EK+K  ++++K K K  +KK +   ++
Sbjct: 123 EAAGEKEKAPRRERKPKPKAPRKKRKPRAQK 153



 Score = 26.4 bits (59), Expect = 3.5
 Identities = 9/30 (30%), Positives = 18/30 (60%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
             +KEK  ++++K K K  +KK+K   ++ 
Sbjct: 125 AGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154



 Score = 26.0 bits (58), Expect = 5.0
 Identities = 8/30 (26%), Positives = 18/30 (60%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           K + ++ +++ KK++   +K+K    ERK 
Sbjct: 109 KVQAQRAEQQAKKREAAGEKEKAPRRERKP 138



 Score = 26.0 bits (58), Expect = 5.2
 Identities = 9/36 (25%), Positives = 20/36 (55%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           E    ++K  ++++K K K  +KK+K +  + E + 
Sbjct: 123 EAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQH 158



 Score = 25.7 bits (57), Expect = 7.5
 Identities = 7/31 (22%), Positives = 21/31 (67%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           +++ +K++   +K+K  ++++K K +  RK+
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPKAPRKK 146



 Score = 25.7 bits (57), Expect = 7.7
 Identities = 5/25 (20%), Positives = 17/25 (68%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKA 115
           Q+ +++ KK++   +K+K  ++++ 
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERK 137


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 27.6 bits (61), Expect = 1.7
 Identities = 11/40 (27%), Positives = 19/40 (47%)

Query: 82  AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           AV   T+    + ++K K+K    K+ K +    E E +R
Sbjct: 785 AVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVETEGER 824


>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
          Length = 801

 Score = 27.8 bits (62), Expect = 1.7
 Identities = 13/30 (43%), Positives = 15/30 (50%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIER 119
           L+KK      KKK+ KK     K KA  ER
Sbjct: 607 LKKKALSDGGKKKQGKKAGGGGKSKAAAER 636


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 27.1 bits (61), Expect = 1.7
 Identities = 15/29 (51%), Positives = 21/29 (72%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           +KK+KKKKKK+ +  ++KKKK     ERK
Sbjct: 83  RKKKKKKKKKEGEFFEEKKKKYVVSEERK 111


>gnl|CDD|185727 cd08986, GH43_7, Glycosyl hydrolase family 43.  This glycosyl
           hydrolase family 43 (GH43) includes enzymes with
           beta-1,4-xylosidase (xylan 1,4-beta-xylosidase; EC
           3.2.1.37), beta-1,3-xylosidase (EC 3.2.1.-),
           alpha-L-arabinofuranosidase (EC 3.2.1.55), arabinanase
           (EC 3.2.1.99), xylanase (EC 3.2.1.8),
           endo-alpha-L-arabinanase and galactan
           1,3-beta-galactosidase (EC 3.2.1.145) activities. These
           are inverting enzymes (i.e. they invert the
           stereochemistry of the anomeric carbon atom of the
           substrate) that have an aspartate as the catalytic
           general base, a glutamate as the catalytic general acid
           and another aspartate that is responsible for pKa
           modulation and orienting the catalytic acid. Many of the
           enzymes in this family display both
           alpha-L-arabinofuranosidase and beta-D-xylosidase
           activity using aryl-glycosides as substrates. A common
           structural feature of GH43 enzymes is a 5-bladed
           beta-propeller domain that contains the catalytic acid
           and catalytic base. A long V-shaped groove, partially
           enclosed at one end, forms a single extended
           substrate-binding surface across the face of the
           propeller.
          Length = 269

 Score = 27.4 bits (61), Expect = 1.8
 Identities = 17/60 (28%), Positives = 25/60 (41%), Gaps = 13/60 (21%)

Query: 31  DGASLFFVERRRGGKKCLVSKVFDLHSGA----GLVYSTADEVEVPWG-----VGILGHV 81
           +GA +F    + GGK  L    +    G      L Y+ AD++  P+G        LGH 
Sbjct: 184 EGAFVF----KYGGKYYLFGTAWSTDKGRKGSYDLYYAVADKITGPYGARYFAGRFLGHN 239


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 27.3 bits (61), Expect = 2.0
 Identities = 11/37 (29%), Positives = 21/37 (56%)

Query: 81  VAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEI 117
           V  T ++ +  +K++  KK  KK    KK+K+K++  
Sbjct: 214 VVPTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRSPP 250


>gnl|CDD|221659 pfam12597, DUF3767, Protein of unknown function (DUF3767).  This
           family of proteins is found in eukaryotes. Proteins in
           this family are typically between 112 and 199 amino
           acids in length.
          Length = 119

 Score = 26.9 bits (60), Expect = 2.1
 Identities = 13/45 (28%), Positives = 21/45 (46%), Gaps = 1/45 (2%)

Query: 73  WGVGILGHVAVTG-ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
           W VG     ++ G E    +++K  +  KK  +   +KK KK  E
Sbjct: 72  WAVGTFALGSIAGWEQCRYRRRKSFQGMKKAVEAMAEKKAKKLRE 116


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 27.1 bits (60), Expect = 2.2
 Identities = 10/32 (31%), Positives = 19/32 (59%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           N + KK +  +K + K +KK+  +K+   ER+
Sbjct: 329 NAEIKKTRTAEKNEAKARKKEIAQKRRAAERE 360



 Score = 26.0 bits (57), Expect = 7.4
 Identities = 9/30 (30%), Positives = 18/30 (60%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           +  EK + K +KK+  +K++  + EI R+ 
Sbjct: 336 RTAEKNEAKARKKEIAQKRRAAEREINREA 365


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 27.2 bits (61), Expect = 2.3
 Identities = 12/27 (44%), Positives = 16/27 (59%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAEIER 119
           + EKK  +KK KK  KK K ++A  E 
Sbjct: 89  EAEKKGLEKKAKKAIKKGKDEEALAEE 115


>gnl|CDD|188606 TIGR04091, LTA_dltB, D-alanyl-lipoteichoic acid biosynthesis
           protein DltB.  Members of this protein family are DltB,
           part of a four-gene operon for D-alanyl-lipoteichoic
           acid biosynthesis that is present in the vast majority
           of low-GC Gram-positive organisms. This protein may be
           involved in transport of D-alanine across the plasma
           membrane [Cell envelope, Biosynthesis and degradation of
           murein sacculus and peptidoglycan].
          Length = 380

 Score = 27.1 bits (61), Expect = 2.3
 Identities = 8/14 (57%), Positives = 10/14 (71%)

Query: 95  EKKKKKKKKKKKKK 108
           ++K KK KK KK K
Sbjct: 342 QRKSKKYKKNKKNK 355



 Score = 26.3 bits (59), Expect = 4.2
 Identities = 8/14 (57%), Positives = 9/14 (64%)

Query: 102 KKKKKKKKKKKKKA 115
           +K KK KK KK K 
Sbjct: 343 RKSKKYKKNKKNKW 356



 Score = 26.3 bits (59), Expect = 4.8
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 97  KKKKKKKKKKKKK 109
           +K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355



 Score = 26.3 bits (59), Expect = 4.8
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 98  KKKKKKKKKKKKK 110
           +K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355



 Score = 26.3 bits (59), Expect = 4.8
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 99  KKKKKKKKKKKKK 111
           +K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355



 Score = 26.3 bits (59), Expect = 4.8
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 100 KKKKKKKKKKKKK 112
           +K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355



 Score = 26.3 bits (59), Expect = 4.8
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 101 KKKKKKKKKKKKK 113
           +K KK KK KK K
Sbjct: 343 RKSKKYKKNKKNK 355


>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV. 
          Length = 336

 Score = 27.1 bits (60), Expect = 2.4
 Identities = 10/26 (38%), Positives = 20/26 (76%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAEIE 118
            K++ K +K K+ KK+KK+++K E++
Sbjct: 24  VKDEAKPRKIKRVKKRKKREEKDELD 49



 Score = 25.2 bits (55), Expect = 10.0
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query: 97  KKKKKKKKKKKKKKKKKKAEIER 119
            K + K +K K+ KK+KK E + 
Sbjct: 24  VKDEAKPRKIKRVKKRKKREEKD 46


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 27.1 bits (61), Expect = 2.4
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIE 118
             KE+ +    K+ ++K+KK +K EI 
Sbjct: 518 YIKEQMEGSAPKEPEEKEKKPEKPEIT 544



 Score = 26.7 bits (60), Expect = 4.2
 Identities = 8/33 (24%), Positives = 19/33 (57%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           E +   K++ +    K+ ++K+KK +K +  I+
Sbjct: 514 EEIAYIKEQMEGSAPKEPEEKEKKPEKPEITID 546


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 26.8 bits (60), Expect = 2.4
 Identities = 11/34 (32%), Positives = 19/34 (55%), Gaps = 1/34 (2%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKA-EIERKR 121
            LQ K  + KK+K + + + K+ + KA E E + 
Sbjct: 71  ELQAKIARYKKEKARYRSEAKELEAKAKEAEAES 104


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 27.0 bits (60), Expect = 2.4
 Identities = 11/30 (36%), Positives = 18/30 (60%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           +++ +KKK+ K K  K+  KKKKK +    
Sbjct: 81  EERLKKKKRVKTKAYKEPTKKKKKKDPTAA 110



 Score = 25.8 bits (57), Expect = 6.0
 Identities = 13/30 (43%), Positives = 17/30 (56%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           + K+KK+ K K  K+  KKKKKK     K 
Sbjct: 83  RLKKKKRVKTKAYKEPTKKKKKKDPTAAKS 112


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 27.1 bits (60), Expect = 2.6
 Identities = 9/24 (37%), Positives = 19/24 (79%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAE 116
           K+E+ +KK+K++  + K+K++K E
Sbjct: 408 KQEENEKKQKEQADEDKEKRQKDE 431



 Score = 26.7 bits (59), Expect = 3.3
 Identities = 7/24 (29%), Positives = 19/24 (79%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKK 114
           + +K++K++  + K+K++K ++KK
Sbjct: 411 ENEKKQKEQADEDKEKRQKDERKK 434



 Score = 26.7 bits (59), Expect = 3.3
 Identities = 7/29 (24%), Positives = 21/29 (72%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
           E +  ++ ++K+K++  + K+K++K ++K
Sbjct: 405 EKLKQEENEKKQKEQADEDKEKRQKDERK 433



 Score = 25.5 bits (56), Expect = 9.2
 Identities = 9/23 (39%), Positives = 18/23 (78%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKK 111
           N +K+KE+  + K+K++K ++KK
Sbjct: 412 NEKKQKEQADEDKEKRQKDERKK 434


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 26.9 bits (60), Expect = 2.7
 Identities = 9/30 (30%), Positives = 14/30 (46%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
             K   KKK K +    KK +KK  +++  
Sbjct: 87  AAKAPAKKKLKDELDSSKKAEKKNALDKDD 116



 Score = 26.9 bits (60), Expect = 3.4
 Identities = 10/29 (34%), Positives = 12/29 (41%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
            K    K   KKK K +    KKAE +  
Sbjct: 83  AKAAAAKAPAKKKLKDELDSSKKAEKKNA 111



 Score = 26.5 bits (59), Expect = 4.6
 Identities = 13/52 (25%), Positives = 14/52 (26%)

Query: 64  STADEVEVPWGVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
           S     E    V I     V       +    KKK K   K    K   KK 
Sbjct: 44  SKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKK 95



 Score = 26.1 bits (58), Expect = 6.3
 Identities = 11/45 (24%), Positives = 15/45 (33%)

Query: 76  GILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
            + G V  T +       K+K K   K    K   KKK  +    
Sbjct: 58  FLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDS 102



 Score = 26.1 bits (58), Expect = 6.4
 Identities = 11/52 (21%), Positives = 17/52 (32%)

Query: 64  STADEVEVPWGVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKA 115
           +     +V   +  +          ++ KKK K   K    K   KKK K  
Sbjct: 48  TPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDE 99



 Score = 25.3 bits (56), Expect = 9.1
 Identities = 9/29 (31%), Positives = 13/29 (44%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           KK +   K    K   KKK K + +  +K
Sbjct: 77  KKTKTAAKAAAAKAPAKKKLKDELDSSKK 105



 Score = 25.3 bits (56), Expect = 9.3
 Identities = 9/29 (31%), Positives = 13/29 (44%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           K   KKK K +    KK +KK   + +  
Sbjct: 89  KAPAKKKLKDELDSSKKAEKKNALDKDDD 117


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 27.3 bits (61), Expect = 2.7
 Identities = 11/33 (33%), Positives = 22/33 (66%)

Query: 85  GETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEI 117
            +   +    E+K KK++KK+K K+++K++ EI
Sbjct: 650 KKEFKILDYTEEKTKKEEKKEKNKREEKERIEI 682



 Score = 26.1 bits (58), Expect = 5.2
 Identities = 15/31 (48%), Positives = 20/31 (64%), Gaps = 3/31 (9%)

Query: 92  KKKEKKKKK---KKKKKKKKKKKKKKAEIER 119
           KKKE K      +K KK++KK+K K+ E ER
Sbjct: 649 KKKEFKILDYTEEKTKKEEKKEKNKREEKER 679


>gnl|CDD|240094 cd04743, NPD_PKS, 2-Nitropropane dioxygenase (NPD)-like domain,
          associated with polyketide synthases (PKS). NPD is part
          of the nitroalkaneoxidizing enzyme family, that
          catalyzes oxidative denitrification of nitroalkanes to
          their corresponding carbonyl compounds and nitrites.
          NDPs are members of the NAD(P)H-dependent flavin
          oxidoreductase family that reduce a range of
          alternative  electron acceptors. Most use FAD/FMN as a
          cofactor and NAD(P)H as electron donor. Some contain
          4Fe-4S cluster to transfer electron from FAD to FMN.
          Length = 320

 Score = 27.1 bits (60), Expect = 2.8
 Identities = 12/24 (50%), Positives = 16/24 (66%), Gaps = 1/24 (4%)

Query: 59 AGLVYSTADEV-EVPWGVGILGHV 81
            L+  TA+ + + PWGVGILG V
Sbjct: 43 KALLEETAELLGDKPWGVGILGFV 66


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 26.4 bits (59), Expect = 2.8
 Identities = 11/31 (35%), Positives = 25/31 (80%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           + K+EKKK+ K+KK+++K++K++  + ++K 
Sbjct: 97  RNKQEKKKRSKEKKEEEKERKRQLKQQKKKA 127



 Score = 24.9 bits (55), Expect = 9.2
 Identities = 10/23 (43%), Positives = 20/23 (86%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKK 114
           +KKE++K++K++ K++KKK K +
Sbjct: 108 EKKEEEKERKRQLKQQKKKAKHR 130


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 26.9 bits (60), Expect = 2.8
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKA 115
           N  KK+ K K+ +++K  K++ K   A
Sbjct: 11  NALKKRLKAKQAEEEKAAKEEAKAAAA 37



 Score = 25.7 bits (57), Expect = 7.1
 Identities = 5/25 (20%), Positives = 11/25 (44%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAE 116
           +K  K++ K        K + +K+ 
Sbjct: 25  EKAAKEEAKAAAAAAAAKGRSRKSA 49


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 26.5 bits (59), Expect = 2.8
 Identities = 8/27 (29%), Positives = 20/27 (74%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
            ++ KE+ +K+K+ +K ++KK K++ +
Sbjct: 100 RERTKERAEKEKRTRKNREKKFKRRQK 126



 Score = 24.9 bits (55), Expect = 9.8
 Identities = 9/23 (39%), Positives = 19/23 (82%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKK 113
           + +KEK+ +K ++KK K+++K+K
Sbjct: 106 RAEKEKRTRKNREKKFKRRQKEK 128


>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
          Length = 567

 Score = 27.1 bits (60), Expect = 2.9
 Identities = 8/29 (27%), Positives = 22/29 (75%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           ++E++++K++ K K ++++K K E  ++R
Sbjct: 13  EREQRERKQRAKLKLERERKAKEEAAKQR 41


>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
          Length = 381

 Score = 27.0 bits (60), Expect = 2.9
 Identities = 10/28 (35%), Positives = 17/28 (60%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEI 117
           L ++K   K  +K K ++ +KK KKA +
Sbjct: 341 LAQEKATAKGAQKVKNRRARKKAKKARL 368



 Score = 26.6 bits (59), Expect = 3.5
 Identities = 9/32 (28%), Positives = 15/32 (46%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
           +K   K  +K K ++ +KK KK +     K  
Sbjct: 344 EKATAKGAQKVKNRRARKKAKKARLAKVAKAL 375


>gnl|CDD|240380 PTZ00362, PTZ00362, hypothetical protein; Provisional.
          Length = 479

 Score = 27.0 bits (60), Expect = 3.0
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKAEI 117
           N+ KKK+KKKKK  K+     +  KK   
Sbjct: 6   NIFKKKKKKKKKASKESSNGSQNSKKTIF 34



 Score = 26.6 bits (59), Expect = 4.3
 Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 99  KKKKKKKKKKKKKKKKAEIERKR 121
           KKKKKKKKK  K+        K+
Sbjct: 9   KKKKKKKKKASKESSNGSQNSKK 31


>gnl|CDD|235083 PRK02889, tolB, translocation protein TolB; Provisional.
          Length = 427

 Score = 26.9 bits (60), Expect = 3.2
 Identities = 16/48 (33%), Positives = 21/48 (43%), Gaps = 5/48 (10%)

Query: 23  NVSVLLDADGASLFFVERRRGGKKCLVSKVFDLHSGA--GLVYSTADE 68
           N S  +  DG  L ++ R  G  K     V DL +G    L  +T DE
Sbjct: 330 NTSPRISPDGKLLAYISRVGGAFKL---YVQDLATGQVTALTDTTRDE 374


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 26.7 bits (59), Expect = 3.2
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           +   E KKK  + KKK + + K KAE + K
Sbjct: 159 KAAAEAKKKAAEAKKKAEAEAKAKAEAKAK 188



 Score = 26.3 bits (58), Expect = 4.0
 Identities = 14/30 (46%), Positives = 18/30 (60%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           Q ++E K K   + KKK  + KKKAE E K
Sbjct: 151 QAEEEAKAKAAAEAKKKAAEAKKKAEAEAK 180



 Score = 26.0 bits (57), Expect = 6.9
 Identities = 12/30 (40%), Positives = 19/30 (63%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           Q  + K K + + +KK K++ KK+AE E K
Sbjct: 128 QAAEAKAKAEAEAEKKAKEEAKKQAEEEAK 157



 Score = 25.6 bits (56), Expect = 7.6
 Identities = 9/35 (25%), Positives = 19/35 (54%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           E    Q+  E+ ++K+ +++   +K  K+AE   K
Sbjct: 78  EEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK 112


>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694).  Family of
           hypothetical bacterial proteins.
          Length = 278

 Score = 26.8 bits (60), Expect = 3.2
 Identities = 16/33 (48%), Positives = 21/33 (63%)

Query: 85  GETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEI 117
           GE      + E+  K+KKKKKKKKKKK++  E 
Sbjct: 73  GELYERADRAEELLKEKKKKKKKKKKKEELREW 105


>gnl|CDD|173227 PRK14766, PRK14766, lipoprotein signal peptidase; Provisional.
          Length = 201

 Score = 26.8 bits (59), Expect = 3.2
 Identities = 17/57 (29%), Positives = 27/57 (47%), Gaps = 4/57 (7%)

Query: 53  FDLHSGAGLVYSTAD-EVEVPWGVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKK 108
           F L +    V++ AD  V +  G+ IL  +    E V   K K K KK+K + ++  
Sbjct: 147 FSLLNSRDYVFNLADLYVNIAIGLTILFTII---ELVLYIKSKIKTKKEKIENEQNN 200


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 26.9 bits (60), Expect = 3.3
 Identities = 13/34 (38%), Positives = 16/34 (47%)

Query: 87  TVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           T    KK   K K KKKKKK       K+E  ++
Sbjct: 349 TKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQE 382



 Score = 25.4 bits (56), Expect = 9.6
 Identities = 12/29 (41%), Positives = 15/29 (51%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           KK  KK   K K KKKKKK   + + +  
Sbjct: 350 KKPIKKSNSKAKLKKKKKKAGASAVPKSE 378


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 26.7 bits (59), Expect = 3.3
 Identities = 9/32 (28%), Positives = 12/32 (37%)

Query: 85   GETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
                  +    KK  ++  KK   KK  KKA 
Sbjct: 1184 RAAARGESGAAKKVSRQAPKKPAPKKTTKKAS 1215



 Score = 25.6 bits (56), Expect = 9.5
 Identities = 10/30 (33%), Positives = 15/30 (50%)

Query: 93   KKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
            KK  ++  KK   KK  KK  ++E   + Y
Sbjct: 1195 KKVSRQAPKKPAPKKTTKKASESETTEETY 1224


>gnl|CDD|235040 PRK02463, PRK02463, OxaA-like protein precursor; Provisional.
          Length = 307

 Score = 26.6 bits (59), Expect = 3.3
 Identities = 9/28 (32%), Positives = 13/28 (46%)

Query: 83  VTGETVNLQKKKEKKKKKKKKKKKKKKK 110
           VT           KKKKK +   K++K+
Sbjct: 280 VTPSQNQAIASLPKKKKKNRNAGKQRKR 307


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 26.6 bits (59), Expect = 3.4
 Identities = 12/35 (34%), Positives = 17/35 (48%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
                +K+    +K    KK KKK K+K  E E+K
Sbjct: 48  PKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKK 82


>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
           splicing MRS1.  Members of this family adopt a secondary
           structure consisting of two beta sheets and one alpha
           helix, arranged as a beta-alpha-beta motif. Each beta
           sheet has five strands, arranged in a 32145 order, with
           the second strand being antiparallel to the rest.
           Mitochondrial resolvase Ydc2 is capable of resolving
           Holliday junctions and cleaves DNA after 5'-CT-3' and
           5'-TT-3' sequences. This family also contains the
           mitochondrial RNA-splicing protein MRS1 which is
           involved in the excision of group I introns.
          Length = 254

 Score = 26.6 bits (59), Expect = 3.5
 Identities = 9/30 (30%), Positives = 15/30 (50%)

Query: 88  VNLQKKKEKKKKKKKKKKKKKKKKKKKAEI 117
               +   +KKK + KKK  K  KK + ++
Sbjct: 146 CERTEILAEKKKPRSKKKSSKNSKKLRIDL 175



 Score = 26.2 bits (58), Expect = 4.6
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 97  KKKKKKKKKKKKKKKKKKAEIER 119
            +KKK + KKK  K  KK  I+ 
Sbjct: 153 AEKKKPRSKKKSSKNSKKLRIDL 175


>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB. Topo IB proteins include the
           monomeric yeast and human topo I and heterodimeric topo
           I from Leishmania donvanni. Topo I enzymes are divided
           into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topos I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I have putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 217

 Score = 26.4 bits (58), Expect = 3.5
 Identities = 13/27 (48%), Positives = 21/27 (77%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAE 116
           L+++KEKKK   K++K+ KKK++ K E
Sbjct: 93  LEEEKEKKKNLNKEEKEAKKKERAKRE 119



 Score = 26.0 bits (57), Expect = 5.0
 Identities = 12/25 (48%), Positives = 20/25 (80%)

Query: 94  KEKKKKKKKKKKKKKKKKKKKAEIE 118
           +EK+KKK   K++K+ KKK++A+ E
Sbjct: 95  EEKEKKKNLNKEEKEAKKKERAKRE 119


>gnl|CDD|220633 pfam10214, Rrn6, RNA polymerase I-specific transcription-initiation
           factor.  RNA polymerase I-specific
           transcription-initiation factor Rrn6 and Rrn7 represent
           components of a multisubunit transcription factor
           essential for the initiation of rDNA transcription by
           Pol I. These proteins are found in fungi.
          Length = 753

 Score = 26.6 bits (59), Expect = 3.6
 Identities = 8/14 (57%), Positives = 10/14 (71%)

Query: 93  KKEKKKKKKKKKKK 106
            +   KKKKKKK+K
Sbjct: 737 GRSASKKKKKKKRK 750



 Score = 25.9 bits (57), Expect = 7.8
 Identities = 8/15 (53%), Positives = 10/15 (66%)

Query: 101 KKKKKKKKKKKKKKA 115
            +   KKKKKKK+K 
Sbjct: 737 GRSASKKKKKKKRKG 751


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 26.6 bits (59), Expect = 3.6
 Identities = 9/33 (27%), Positives = 19/33 (57%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           +   L     + ++ KKK++ + KKKK+  E++
Sbjct: 76  DEKELSASSLEAEQAKKKEEAEAKKKKEMEELK 108


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 26.6 bits (59), Expect = 3.7
 Identities = 9/36 (25%), Positives = 25/36 (69%)

Query: 85  GETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
            E +N   + ++K K ++ K+K++K+ +++A+ E++
Sbjct: 266 YEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQ 301


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 26.7 bits (59), Expect = 3.7
 Identities = 10/24 (41%), Positives = 18/24 (75%)

Query: 95  EKKKKKKKKKKKKKKKKKKKAEIE 118
           +++++K++KKKK KK K+   E E
Sbjct: 62  DEEEEKEEKKKKTKKVKETTTEWE 85


>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
           receptor (72 kDa mitochondrial outermembrane protein)
           (mitochondrial import receptor for the ADP/ATP carrier)
           (translocase of outermembrane tom70).  [Transport and
           binding proteins, Amino acids, peptides and amines].
          Length = 615

 Score = 26.5 bits (58), Expect = 4.0
 Identities = 14/38 (36%), Positives = 25/38 (65%)

Query: 81  VAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           VA     V  Q+K +++++ K K  KK+++K+K+AE E
Sbjct: 46  VATGAGVVYYQRKGQQQRESKPKISKKERRKRKQAEKE 83



 Score = 26.1 bits (57), Expect = 6.0
 Identities = 16/60 (26%), Positives = 31/60 (51%), Gaps = 3/60 (5%)

Query: 60  GLVYSTADEVEVPWGVGILGHV---AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
             VY+ A    V  G G++ +        E+     KKE++K+K+ +K+ + K ++KK+ 
Sbjct: 35  TAVYTIAGVTVVATGAGVVYYQRKGQQQRESKPKISKKERRKRKQAEKETEGKTEEKKST 94


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 26.3 bits (59), Expect = 4.4
 Identities = 15/34 (44%), Positives = 19/34 (55%), Gaps = 2/34 (5%)

Query: 87  TVNLQ--KKKEKKKKKKKKKKKKKKKKKKKAEIE 118
            V L   +K +K KKKKKKK K  K K +   +E
Sbjct: 673 KVPLSDLEKIQKPKKKKKKKPKTVKPKPRTVSLE 706



 Score = 26.3 bits (59), Expect = 4.5
 Identities = 12/28 (42%), Positives = 21/28 (75%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           + +K   K  +KK+KKKKK+K+K+ E++
Sbjct: 611 EARKRLNKANEKKEKKKKKQKEKQEELK 638



 Score = 26.3 bits (59), Expect = 5.4
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 88  VNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           V   +  E +K+  K  +KK+KKKKK+ E + +
Sbjct: 604 VKAHELIEARKRLNKANEKKEKKKKKQKEKQEE 636


>gnl|CDD|205286 pfam13105, DUF3959, Protein of unknown function (DUF3959).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 260 amino acids in length.
          Length = 242

 Score = 26.4 bits (58), Expect = 4.5
 Identities = 14/38 (36%), Positives = 21/38 (55%)

Query: 77  ILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
           ++G +   G   NL  +KEK   K+     K+KKK+KK
Sbjct: 136 LIGGLIFGGLLANLIHRKEKDDNKENINLFKQKKKRKK 173


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 26.3 bits (58), Expect = 4.6
 Identities = 10/25 (40%), Positives = 18/25 (72%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKA 115
           +  +E K++ +KK+K KKK +K+K 
Sbjct: 345 EDDEESKEEVEKKQKVKKKPRKRKV 369


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 26.5 bits (59), Expect = 4.7
 Identities = 8/36 (22%), Positives = 22/36 (61%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           E + ++ + E+ KK+ KK ++K ++ + + E + + 
Sbjct: 89  ENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEE 124



 Score = 25.7 bits (57), Expect = 8.1
 Identities = 7/32 (21%), Positives = 19/32 (59%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           L+K +EK ++ + + +KK+++ +K       +
Sbjct: 104 LKKLEEKIEQLEAEIEKKEEELEKAKNKFLDK 135



 Score = 25.4 bits (56), Expect = 9.9
 Identities = 10/28 (35%), Positives = 20/28 (71%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           + KKE KK ++K ++ + + +KK+ E+E
Sbjct: 99  ELKKELKKLEEKIEQLEAEIEKKEEELE 126


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 26.5 bits (58), Expect = 4.7
 Identities = 6/20 (30%), Positives = 12/20 (60%)

Query: 99  KKKKKKKKKKKKKKKKAEIE 118
           KK+   +K  K+ +++A  E
Sbjct: 33  KKENAIRKLGKEAEEEAMEE 52


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 26.3 bits (58), Expect = 4.7
 Identities = 10/26 (38%), Positives = 21/26 (80%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEI 117
           ++KEK++KKK K+ K+++K++K+   
Sbjct: 185 QQKEKEEKKKVKEAKRREKEEKRMAA 210



 Score = 25.5 bits (56), Expect = 7.6
 Identities = 11/21 (52%), Positives = 19/21 (90%)

Query: 91  QKKKEKKKKKKKKKKKKKKKK 111
           QK+KE+KKK K+ K+++K++K
Sbjct: 186 QKEKEEKKKVKEAKRREKEEK 206


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 26.2 bits (58), Expect = 4.8
 Identities = 15/38 (39%), Positives = 26/38 (68%), Gaps = 3/38 (7%)

Query: 85  GETVNLQ---KKKEKKKKKKKKKKKKKKKKKKKAEIER 119
           G  V+LQ    K E  +KK+K+KK++ K +KK++ +E+
Sbjct: 146 GGDVHLQANPTKLELLRKKEKEKKEQLKIQKKQSLLEK 183


>gnl|CDD|235222 PRK04125, PRK04125, murein hydrolase regulator LrgA; Provisional.
          Length = 141

 Score = 25.8 bits (57), Expect = 5.2
 Identities = 12/35 (34%), Positives = 19/35 (54%)

Query: 82  AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
           A TG        K +K+K+ KK+KK K K ++  +
Sbjct: 107 ACTGLFSQFILGKTEKEKEDKKEKKVKIKGRQHGK 141


>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein.  PACS-1 is a
           cytosolic sorting protein that directs the localisation
           of membrane proteins in the trans-Golgi network
           (TGN)/endosomal system. PACS-1 connects the clathrin
           adaptor AP-1 to acidic cluster sorting motifs contained
           in the cytoplasmic domain of cargo proteins such as
           furin, the cation-independent mannose-6-phosphate
           receptor and in viral proteins such as human
           immunodeficiency virus type 1 Nef.
          Length = 413

 Score = 26.3 bits (58), Expect = 5.3
 Identities = 10/25 (40%), Positives = 13/25 (52%)

Query: 96  KKKKKKKKKKKKKKKKKKKAEIERK 120
            K+K KK       KK K+ E+E K
Sbjct: 331 TKEKNKKVPVMFLGKKPKEKEVESK 355


>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated.  This domain
           appears in eukaryotes as well as bacteria and tends to
           be found near the C-terminus of the metalloprotease M16C
           (pfam05193).
          Length = 248

 Score = 26.0 bits (58), Expect = 5.4
 Identities = 8/32 (25%), Positives = 20/32 (62%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
            L++K  +++++K +KKK    ++ K +I  +
Sbjct: 6   TLEEKLNEEEREKLEKKKSSLSEEDKEKIIER 37


>gnl|CDD|215059 PLN00111, PLN00111, accumulation of photosystem one; Provisional.
          Length = 399

 Score = 26.1 bits (58), Expect = 5.6
 Identities = 11/32 (34%), Positives = 19/32 (59%), Gaps = 4/32 (12%)

Query: 92  KKKEKKKK----KKKKKKKKKKKKKKKAEIER 119
            K+EKK      K  +++ ++KKK +K + ER
Sbjct: 48  SKREKKPYPRPMKLLRREAREKKKLRKLQPER 79


>gnl|CDD|215877 pfam00360, PHY, Phytochrome region.  Phytochromes are red/far-red
          photochromic biliprotein photoreceptors which regulate
          plant development. They are widely represented in both
          photosynthetic and non-photosynthetic bacteria and are
          known in a variety of fungi. Although sequence
          similarities are low, this domain is structurally
          related to pfam01590, which is generally located
          immediately N-terminal to this domain. Compared with
          pfam01590, this domain carries an additional
          tongue-like hairpin loop between the fifth beta-sheet
          and the sixth alpha-helix which functions to seal the
          chromophore pocket and stabilise the photoactivated
          far-red-absorbing state (Pfr). The tongue carries a
          conserved PRxSF motif, from which an arginine finger
          points into the chromophore pocket close to ring D
          forming a salt bridge with a conserved aspartate
          residue.
          Length = 183

 Score = 26.0 bits (58), Expect = 5.6
 Identities = 10/35 (28%), Positives = 19/35 (54%)

Query: 3  IVRDIARDLDLKSLSKKTVDNVSVLLDADGASLFF 37
          +   ++R+ DL         N+  L+ ADGA+L++
Sbjct: 8  LCDMLSREADLPEGLVTQSPNLLDLVKADGAALYY 42


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 25.9 bits (57), Expect = 5.8
 Identities = 10/30 (33%), Positives = 22/30 (73%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           +K++++  K+ ++K+  K KK+KK + E+K
Sbjct: 96  EKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125


>gnl|CDD|220644 pfam10238, Eapp_C, E2F-associated phosphoprotein.  This entry
           represents the conserved C-terminal portion of an E2F
           binding protein. E2F transcription factors play an
           essential role in cell proliferation and apoptosis and
           their activity is frequently deregulated in human
           cancers. E2F activity is regulated by a variety of
           mechanisms, frequently mediated by proteins binding to
           individual members or a subgroup of the family. EAPP
           interacts with a subset of E2F factors and influences
           E2F-dependent promoter activity. EAPP is present
           throughout the cell cycle but disappears during mitosis.
          Length = 128

 Score = 25.4 bits (56), Expect = 5.8
 Identities = 10/40 (25%), Positives = 16/40 (40%), Gaps = 1/40 (2%)

Query: 81  VAVTGETVNLQKKKEKKKKKKKKKK-KKKKKKKKKAEIER 119
             V  E V   K   +KK+++  KK +        +EI  
Sbjct: 61  CKVDEEKVLYSKPINRKKRRRNSKKNELNPNNTATSEIYH 100


>gnl|CDD|237262 PRK12997, PRK12997, PTS system ascorbate-specific transporter
           subunit IIC; Reviewed.
          Length = 466

 Score = 26.0 bits (58), Expect = 5.8
 Identities = 9/57 (15%), Positives = 21/57 (36%), Gaps = 4/57 (7%)

Query: 52  VFDLHSGAGLVYSTADEVEVPWGV--GILGHVAVTGETVNLQKKKEKKKKKKKKKKK 106
           +  L    G  +    + +  W     +L ++ + G  +    + +  K K K+K  
Sbjct: 412 LLGLAKYGG--WHGNTDWDTDWPAFGLLLKYIGLGGYLLLAIPQLQFAKAKDKEKYY 466


>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease.  The Slx4 protein is a
           heteromeric structure-specific endonuclease found in
           fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
           substrates, particularly simple-Y, 5'-flap, or
           replication fork structures by cleaving the strand
           bearing the 5' non-homologous arm at the branch junction
           and thus generating ligatable nicked products from
           5'-flap or replication fork substrates.
          Length = 627

 Score = 26.1 bits (57), Expect = 6.0
 Identities = 12/27 (44%), Positives = 19/27 (70%)

Query: 88  VNLQKKKEKKKKKKKKKKKKKKKKKKK 114
           V+ ++  +KKK KK K +K+ K+K KK
Sbjct: 66  VSGKRVPKKKKIKKPKLRKRTKRKNKK 92


>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
          Length = 472

 Score = 26.1 bits (58), Expect = 6.1
 Identities = 10/28 (35%), Positives = 20/28 (71%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           +++KEK+KK KK+ K++++   +   IE
Sbjct: 393 REEKEKEKKAKKEGKEERRIHFQNKSIE 420


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 25.6 bits (57), Expect = 6.3
 Identities = 7/29 (24%), Positives = 19/29 (65%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKKAEIER 119
           Q+ K+ +++K++ +K+  + + K   IE+
Sbjct: 127 QEIKKLEEEKEELEKRVAELEAKLEAIEK 155


>gnl|CDD|173607 PTZ00417, PTZ00417, lysine-tRNA ligase; Provisional.
          Length = 585

 Score = 26.1 bits (57), Expect = 6.5
 Identities = 17/75 (22%), Positives = 36/75 (48%), Gaps = 8/75 (10%)

Query: 50  SKVFDLHSGAGLVYSTADEVEVPWGVGILGHVAVTGETVNLQKKK--EKKKKKKKKKKKK 107
           S  +  HS + ++ +    +  P       H      T++ +K+   E +KK +  +  K
Sbjct: 17  SHFYKQHSFSKIILNKNKNIICPV------HCKQCFVTMSEKKEHVMEGEKKVRSVQASK 70

Query: 108 KKKKKKKAEIERKRY 122
            KKK+++AE++ + Y
Sbjct: 71  DKKKEEEAEVDPRLY 85


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 25.9 bits (57), Expect = 6.7
 Identities = 8/27 (29%), Positives = 9/27 (33%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKA 115
            L        +K  K K   KK  KK 
Sbjct: 186 KLSSTASSGDEKSPKSKAAPKKAGKKM 212



 Score = 25.5 bits (56), Expect = 9.4
 Identities = 9/22 (40%), Positives = 10/22 (45%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKK 113
              EK  K K   KK  KK +K
Sbjct: 193 SGDEKSPKSKAAPKKAGKKMRK 214


>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
           pleckstrin homology (PH) domain.  The BCR gene is one of
           the two genes in the BCR-ABL complex, which is
           associated with the Philadelphia chromosome, a product
           of a reciprocal translocation between chromosomes 22 and
           9. BCR is a GTPase-activating protein (GAP) for RAC1
           (primarily) and CDC42. The Dbl region of BCR has the
           most RhoGEF activity for Cdc42, and less activity
           towards Rac and Rho. Since BCR possesses both GAP and
           GEF activities, it may function to temporally regulate
           the activity of these GTPases. It also displays
           serine/threonine kinase activity. The BCR protein
           contains multiple domains including an N-terminal kinase
           domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
           domain, and a C-terminal RhoGAP domain. ABR, a related
           smaller protein, is structurally similar to BCR, but
           lacks the N-terminal kinase domain and has GAP activity
           for both Rac and Cdc42. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 179

 Score = 25.8 bits (57), Expect = 6.7
 Identities = 9/30 (30%), Positives = 14/30 (46%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
             +   E+KK      +  +K +KK AE E
Sbjct: 105 IKRDSAERKKAASSGSRAIEKLRKKLAEQE 134


>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1.  TT virus (TTV), isolated
           initially from a Japanese patient with hepatitis of
           unknown aetiology, has since been found to infect both
           healthy and diseased individuals and numerous prevalence
           studies have raised questions about its role in
           unexplained hepatitis. ORF1 is a large 750 residue
           protein. The N-terminal half of this protein corresponds
           to the capsid protein.
          Length = 525

 Score = 26.1 bits (58), Expect = 6.8
 Identities = 9/24 (37%), Positives = 10/24 (41%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKK 113
           L K K      K + K KK  K K
Sbjct: 187 LAKHKILIPSLKTRPKGKKYVKIK 210


>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
           function prediction only].
          Length = 386

 Score = 26.0 bits (57), Expect = 6.8
 Identities = 10/25 (40%), Positives = 13/25 (52%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAE 116
            +K K  +K    K KK  KK +AE
Sbjct: 357 ARKRKGDRKGVSHKAKKGGKKNQAE 381


>gnl|CDD|235737 PRK06197, PRK06197, short chain dehydrogenase; Provisional.
          Length = 306

 Score = 25.8 bits (57), Expect = 6.9
 Identities = 14/31 (45%), Positives = 20/31 (64%), Gaps = 4/31 (12%)

Query: 59  AGLVY----STADEVEVPWGVGILGHVAVTG 85
           AG++Y    +TAD  E+ +G   LGH A+TG
Sbjct: 103 AGVMYTPKQTTADGFELQFGTNHLGHFALTG 133


>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2.  This family consists
           of several Barren protein homologues from several
           eukaryotic organisms. In Drosophila Barren (barr) is
           required for sister-chromatid segregation in mitosis.
           barr encodes a novel protein that is present in
           proliferating cells and has homologues in yeast and
           human. Mitotic defects in barr embryos become apparent
           during cycle 16, resulting in a loss of PNS and CNS
           neurons. Centromeres move apart at the
           metaphase-anaphase transition and Cyclin B is degraded,
           but sister chromatids remain connected, resulting in
           chromatin bridging. Barren protein localises to
           chromatin throughout mitosis. Colocalisation and
           biochemical experiments indicate that Barren associates
           with Topoisomerase II throughout mitosis and alters the
           activity of Topoisomerase II. It has been suggested that
           this association is required for proper chromosomal
           segregation by facilitating the decatenation of
           chromatids at anaphase. This family forms one of the
           three non-structural maintenance of chromosomes (SMC)
           subunits of the mitotic condensation complex along with
           Cnd1 and Cnd3.
          Length = 719

 Score = 25.9 bits (57), Expect = 7.0
 Identities = 8/27 (29%), Positives = 13/27 (48%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           K+ E +     K   +K+KK+ K   E
Sbjct: 429 KRPEPRAAADTKSAAEKQKKRAKEPFE 455


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 25.8 bits (57), Expect = 7.2
 Identities = 8/29 (27%), Positives = 21/29 (72%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKK 114
                +++KE++KK+ K++K+++K+  +K
Sbjct: 181 AERRRREEKERRKKQDKERKQREKETAEK 209


>gnl|CDD|216213 pfam00956, NAP, Nucleosome assembly protein (NAP).  NAP proteins
           are involved in moving histones into the nucleus,
           nucleosome assembly and chromatin fluidity. They affect
           the transcription of many genes.
          Length = 236

 Score = 25.4 bits (56), Expect = 7.6
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 89  NLQKKKEKKKKKKKKKKKKKKKKK 112
           NL  K  KKK++ KK  + +   K
Sbjct: 156 NLTVKTVKKKQRNKKTGQTRTITK 179


>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein.  This model
           describes nucleoid occlusion protein, a close homolog to
           ParB chromosome partitioning proteins including Spo0J in
           Bacillus subtilis. Its gene often is located near the
           gene for the Spo0J ortholog. This protein bind a
           specific DNA sequence and blocks cytokinesis from
           happening until chromosome segregation is complete.
          Length = 255

 Score = 25.5 bits (57), Expect = 7.8
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 91  QKKKEKKKKKKKKKKKKKK 109
           +K  EK +K KKKKK++K 
Sbjct: 192 KKLLEKPEKPKKKKKRRKG 210


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 25.3 bits (56), Expect = 8.0
 Identities = 9/30 (30%), Positives = 28/30 (93%)

Query: 84  TGETVNLQKKKEKKKKKKKKKKKKKKKKKK 113
           +G  +++++K+E+K+++K++K++K++K+K+
Sbjct: 8   SGRIIDIEEKREEKEREKEEKERKEEKEKE 37


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
           (DUF1764).  This is a family of eukaryotic proteins of
           unknown function. This family contains many hypothetical
           proteins.
          Length = 98

 Score = 24.7 bits (54), Expect = 8.0
 Identities = 11/31 (35%), Positives = 19/31 (61%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERKRY 122
           KK++KKKK+  K  + K  KK + + ++K  
Sbjct: 22  KKRKKKKKRTAKTARPKATKKGQKKDKKKDE 52


>gnl|CDD|151885 pfam11446, DUF2897, Protein of unknown function (DUF2897).  This is
           a bacterial family of uncharacterized proteins.
          Length = 55

 Score = 24.1 bits (53), Expect = 8.7
 Identities = 12/40 (30%), Positives = 24/40 (60%)

Query: 77  ILGHVAVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAE 116
           I+G++AV   +  ++  +  K+KKKK K+  ++  K  A+
Sbjct: 16  IVGNLAVLKYSAKMKFPQFDKQKKKKAKRNAEQDAKNSAD 55


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 25.4 bits (56), Expect = 8.7
 Identities = 13/29 (44%), Positives = 21/29 (72%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           K   + K+KKK+KKK++KKKK+   +E +
Sbjct: 323 KLDAEVKEKKKEKKKEEKKKKQIERLEER 351


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 25.5 bits (56), Expect = 8.8
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 93  KKEKKKKKKKKKKKKKKKKKKK 114
           K +K   KK KK KKKKKKK+ 
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKRS 217


>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein.  NinG or Rap is
           involved in recombination. Rap (recombination adept with
           plasmid) increases lambda-by-plasmid recombination
           catalyzed by Escherichia coli's RecBCD pathway.
          Length = 188

 Score = 25.4 bits (56), Expect = 9.4
 Identities = 7/27 (25%), Positives = 17/27 (62%)

Query: 94  KEKKKKKKKKKKKKKKKKKKKAEIERK 120
             K++K ++KK+K + + +++    RK
Sbjct: 33  ALKREKAQEKKRKAEAQAERRELKARK 59


>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
           (ETRAMP).  These sequences represent a family of
           proteins from the malaria parasite Plasmodium
           falciparum, several of which have been shown to be
           expressed specifically in the ring stage as well as the
           rodent parasite Plasmodium yoelii. A homologue from
           Plasmodium chabaudi was localized to the parasitophorous
           vacuole membrane. Members have an initial hydrophobic,
           Phe/Tyr-rich, stretch long enough to span the membrane,
           a highly charged region rich in Lys, a second putative
           transmembrane region and a second highly charged, low
           complexity sequence region. Some members have up to 100
           residues of additional C-terminal sequence. These genes
           have been shown to be found in the sub-telomeric regions
           of both Plasmodium falciparum and P. yoelii chromosomes.
          Length = 82

 Score = 24.5 bits (54), Expect = 9.5
 Identities = 15/26 (57%), Positives = 16/26 (61%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEI 117
           KKK KK KK     +KKKK KKK  I
Sbjct: 31  KKKLKKLKKIDDDLEKKKKNKKKIII 56


>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein.  The PcfJ-like protein family
           includes the E. faecalis PcfJ protein, which is
           functionally uncharacterized. It is found in bacteria
           and viruses, and is typically between 159 and 170 amino
           acids in length. There is a conserved HCV sequence
           motif.
          Length = 169

 Score = 25.0 bits (55), Expect = 9.5
 Identities = 9/35 (25%), Positives = 20/35 (57%)

Query: 86  ETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
             V   + K++K+K +K +K+  + +K   E++ K
Sbjct: 33  RAVAKYQAKKEKEKAEKMRKRYPELEKIFRELKSK 67


>gnl|CDD|130283 TIGR01216, ATP_synt_epsi, ATP synthase, F1 epsilon subunit (delta
           in mitochondria).  This model describes one of the five
           types of subunits in the F1 part of F1/F0 ATP synthases.
           Members of this family are designated epsilon in
           bacterial and chloroplast systems but designated delta
           in mitochondria, where the counterpart of the bacterial
           delta subunit is designated OSCP. In a few cases
           (Propionigenium modestum, Acetobacterium woodii) scoring
           above the trusted cutoff and designated here as
           exceptions, Na+ replaces H+ for translocation [Energy
           metabolism, ATP-proton motive force interconversion].
          Length = 130

 Score = 24.9 bits (55), Expect = 9.5
 Identities = 10/50 (20%), Positives = 22/50 (44%), Gaps = 3/50 (6%)

Query: 75  VGILGHVAVTGETVN---LQKKKEKKKKKKKKKKKKKKKKKKKAEIERKR 121
           V IL   AV  + ++    +K  E  +K  +  +  K   +   ++++ R
Sbjct: 74  VTILADGAVFADDIDEAEAEKALEAAEKLLESAEDDKDLAEALLKLKKAR 123


>gnl|CDD|240709 cd12263, RRM_ABT1_like, RNA recognition motif found in activator of
           basal transcription 1 (ABT1) and similar proteins.  This
           subfamily corresponds to the RRM of novel nuclear
           proteins termed ABT1 and its homologous counterpart,
           pre-rRNA-processing protein ESF2 (eighteen S factor 2),
           from yeast. ABT1 associates with the TATA-binding
           protein (TBP) and enhances basal transcription activity
           of class II promoters. Meanwhile, ABT1 could be a
           transcription cofactor that can bind to DNA in a
           sequence-independent manner. The yeast ABT1 homolog,
           ESF2, is a component of 90S preribosomes and 5'
           ETS-based RNPs. It is previously identified as a
           putative partner of the TATA-element binding protein.
           However, it is primarily localized to the nucleolus and
           physically associates with pre-rRNA processing factors.
           ESF2 may play a role in ribosome biogenesis. It is
           required for normal pre-rRNA processing, as well as for
           SSU processome assembly and function. Both ABT1 and ESF2
           contain an RNA recognition motif (RRM), also termed RBD
           (RNA binding domain) or RNP (ribonucleoprotein domain).
           .
          Length = 98

 Score = 24.5 bits (54), Expect = 9.6
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 90  LQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           LQ +   K+K++KKK   KKKK  +  +E
Sbjct: 32  LQPEDPAKRKRRKKKGGNKKKKFTEGWVE 60


>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755).  This
           family is predominated by ORFs from Circoviridae. The
           function of this family remains to be determined.
          Length = 122

 Score = 24.7 bits (54), Expect = 9.8
 Identities = 9/24 (37%), Positives = 12/24 (50%)

Query: 91  QKKKEKKKKKKKKKKKKKKKKKKK 114
            KKK KK +    +  KK   KK+
Sbjct: 78  HKKKRKKSRHVSSRSAKKISAKKR 101


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 25.0 bits (55), Expect = 9.9
 Identities = 16/53 (30%), Positives = 23/53 (43%), Gaps = 11/53 (20%)

Query: 59  AGLVYST---ADEVEVPWGVGILGHVAVTGETVNLQKKKEKKKKKKKKKKKKK 108
           AGL Y+    A  +E           A   E     K++E  + +KK+ KKKK
Sbjct: 152 AGLWYAEWKDAKLLEE--------FAAEEAEAAAAAKEEESAEGEKKESKKKK 196


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 25.2 bits (55), Expect = 9.9
 Identities = 14/29 (48%), Positives = 18/29 (62%)

Query: 92  KKKEKKKKKKKKKKKKKKKKKKKAEIERK 120
           KK  KKKK   +KKK  KK KKK + + +
Sbjct: 48  KKTSKKKKTTPRKKKTTKKTKKKKKEKEE 76


>gnl|CDD|232985 TIGR00463, gltX_arch, glutamyl-tRNA synthetase, archaeal and
           eukaryotic family.  The glutamyl-tRNA synthetases of the
           eukaryotic cytosol and of the Archaea are more similar
           to glutaminyl-tRNA synthetases than to bacterial
           glutamyl-tRNA synthetases. This model models just the
           eukaryotic cytosolic and archaeal forms of the enzyme.
           In some eukaryotes, the glutamyl-tRNA synthetase is part
           of a longer, multifunctional aminoacyl-tRNA ligase. In
           many species, the charging of tRNA(gln) proceeds first
           through misacylation with Glu and then transamidation.
           For this reason, glutamyl-tRNA synthetases, including
           all known archaeal enzymes (as of 2010) may act on both
           tRNA(gln) and tRNA(glu) [Protein synthesis, tRNA
           aminoacylation].
          Length = 556

 Score = 25.6 bits (56), Expect = 10.0
 Identities = 12/37 (32%), Positives = 20/37 (54%)

Query: 82  AVTGETVNLQKKKEKKKKKKKKKKKKKKKKKKKAEIE 118
           A   E  +L  +++K+  K+     KKK+KK+K   E
Sbjct: 49  AAVEEVNSLSPEEQKELMKRLGLDIKKKEKKRKGLRE 85


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.313    0.132    0.357 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,268,187
Number of extensions: 581734
Number of successful extensions: 11752
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7534
Number of HSP's successfully gapped: 1931
Length of query: 122
Length of database: 10,937,602
Length adjustment: 84
Effective length of query: 38
Effective length of database: 7,211,866
Effective search space: 274050908
Effective search space used: 274050908
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 53 (24.0 bits)